Amazon Product Reviews Dataset
Dataset Overview
Add to BookmarkData Type | Real | Default Task | Classification,Regression |
---|---|---|---|
Attribute Type | Multivariate, Sequential, Temporal aspect | Published Year | 2018 |
Area of Dataset | Includes ratings, review text, and helpfulness votes | Missing Values | Yes |
No. of Instances | 233 million | No. of Attribute | 16+ |
Dataset Description:
This dataset is an updated and expanded version of the Amazon review dataset originally released in 2014. It contains a comprehensive collection of customer reviews, product metadata, and relational links useful for recommendation systems and data analysis.
Key Features:
- Large Scale: Over 233 million reviews spanning from May 1996 to October 2018 (compared to 142.8 million in the 2014 release).
- Review Data: Includes ratings, review text, and helpfulness votes.
- Rich Metadata: Enhanced product information such as color, size, package type, bullet-point descriptions, technical details (attribute-value pairs), and images taken post-purchase.
- Relational Links: Graph data showing “also viewed” and “also bought” product relationships.
- Expanded Categories: Five new product categories added to cover a wider range of items.
Applications:
Ideal for building and evaluating machine learning models in sentiment analysis, recommendation systems, opinion mining, and other NLP and data mining tasks.
Formats Available:
Typically available in JSON or CSV formats.
Source:
https://snap.stanford.edu/data/web-Amazon-links.html
Prepare for Interview
- JavaScript Interview Questions for 0–1 Year Experience
- JavaScript Interview Questions For Fresher
- SQL Interview Questions for 5+ Years Experience
- SQL Interview Questions for 2–5 Years Experience
- SQL Interview Questions for 1–2 Years Experience
- SQL Interview Questions for 0–1 Year Experience
- SQL Interview Questions for Freshers
- Design Patterns in Python
- Dynamic Programming and Recursion in Python
- Trees and Graphs in Python
- Linked Lists, Stacks, and Queues in Python
Random Blogs
- Mastering SQL in 2025: A Complete Roadmap for Beginners
- Python Challenging Programming Exercises Part 2
- Downlaod Youtube Video in Any Format Using Python Pytube Library
- Understanding HTAP Databases: Bridging Transactions and Analytics
- What to Do When Your MySQL Table Grows Too Wide
- Loan Default Prediction Project Using Machine Learning
- SQL Joins Explained: A Complete Guide with Examples
- How AI is Making Humans Weaker – The Hidden Impact of Artificial Intelligence
- Top 10 Blogs of Digital Marketing you Must Follow
- Understanding OLTP vs OLAP Databases: How SQL Handles Query Optimization