Amazon Product Reviews Dataset
Dataset Overview
Data Type | Real | Default Task | Classification, Regression |
---|---|---|---|
Attribute Type | Multivariate, Sequential, Temporal | Published Year | 2018 |
Area of Dataset | Includes ratings, review text, and helpfulness votes | Missing Values | Yes |
No. of Instances | 233 million | No. of Attributes | 16+ |
Dataset Description:
This dataset is an updated and expanded version of the Amazon review dataset originally released in 2014. It contains a comprehensive collection of customer reviews, product metadata, and relational links useful for recommendation systems and data analysis.
Key Features:
- Large Scale: Over 233 million reviews spanning from May 1996 to October 2018 (compared to 142.8 million in the 2014 release).
- Review Data: Includes ratings, review text, and helpfulness votes.
- Rich Metadata: Enhanced product information such as color, size, package type, bullet-point descriptions, technical details (attribute-value pairs), and images taken post-purchase.
- Relational Links: Graph data showing “also viewed” and “also bought” product relationships.
- Expanded Categories: Five new product categories added to cover a wider range of items.
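The relational links described above can be turned into a simple adjacency map for recommendation experiments. The following is a minimal sketch, not the dataset's official tooling: the field names `asin`, `also_buy`, and `also_view` are assumptions about how the product metadata records are keyed, and the sample records are invented for illustration.

```python
from collections import defaultdict


def build_link_graph(products, link_field="also_buy"):
    """Map each product ID to the set of product IDs it links to.

    `products` is an iterable of metadata dicts. The keys "asin" and
    `link_field` are assumed names; adjust them to match the files you have.
    """
    graph = defaultdict(set)
    for product in products:
        source = product.get("asin")
        if source is None:
            continue  # skip records without a product ID
        # A missing or null link field is treated as "no links".
        for target in product.get(link_field) or []:
            graph[source].add(target)
    return dict(graph)


# Toy records, invented for illustration only.
sample_products = [
    {"asin": "A1", "also_buy": ["A2", "A3"], "also_view": ["A3"]},
    {"asin": "A2", "also_buy": ["A1"]},
    {"asin": "A3"},
]

also_bought = build_link_graph(sample_products, "also_buy")
also_viewed = build_link_graph(sample_products, "also_view")
```

Products with no outgoing links do not appear as keys in the result, so a lookup such as `also_bought.get("A3", set())` is the safer access pattern.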
Applications:
Ideal for building and evaluating machine learning models in sentiment analysis, recommendation systems, opinion mining, and other NLP and data mining tasks.
Formats Available:
Typically distributed as JSON files (one review per line) and, for ratings-only subsets, CSV files.
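For the JSON files, a streaming reader avoids loading millions of reviews into memory at once. This is a minimal sketch under stated assumptions: the file holds one JSON object per line, it may be gzip-compressed, and the helpfulness count is stored under a `vote` key as a string that may contain thousands separators. The function names `iter_reviews` and `parse_helpful_votes` are hypothetical helpers, not part of any library.

```python
import gzip
import json


def iter_reviews(path):
    """Yield one review dict per non-empty line of a JSON-lines file.

    Files ending in ".gz" are opened with gzip; anything else is read as
    plain text. Both are assumed to be UTF-8 encoded.
    """
    opener = gzip.open if str(path).endswith(".gz") else open
    with opener(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def parse_helpful_votes(review):
    """Return the helpfulness vote count as an int, or 0 when absent.

    Assumes the count lives under the key "vote" and may be formatted
    like "1,234"; this key name is an assumption about the schema.
    """
    raw = review.get("vote")
    if raw is None:
        return 0
    return int(str(raw).replace(",", ""))
```

A typical use would be `for review in iter_reviews("reviews.json.gz"): ...`, where the file name is a placeholder for whichever category file you downloaded.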
Source:
https://snap.stanford.edu/data/web-Amazon-links.html