This project involves analyzing sales data from Amazon to derive insights and answer specific queries related to product discounts, reviews, and pricing. The analysis aims to facilitate data-driven decision-making by identifying trends and patterns in the sales data.
Dataset Link : https://drive.google.com/file/d/1hJOXNpOUXOpyjsrBt1O8MyIJHfWgzakS/view The dataset includes information about various products, including their actual price, discounted price, category, and customer reviews.
The analysis involves executing the following SQL queries:
- List all products with a discounted price below ₹500.
- Find products with a discount percentage of 50% or more.
- Retrieve all products where the name contains the word "Cable."
- Display the difference between the average of the actual price and the discounted price for each product.
- Query reviews that mention "fast charging" in their content.
- Identify products with a discount percentage between 20% and 40%.
- Find products that have an actual price above ₹1,000 and are rated 4 stars or above.
- Find products where the discounted price ends with a 9.
- Display review contents that contain words like worst, waste, poor, or not good.
- List all products where the category includes "Accessories."
- For the complete SQL queries, please refer to the attached PDF document in the project repository.
-
Set Up Your Database: Ensure your database includes the relevant tables with the necessary columns.
-
Run Queries: Execute these queries(refer project.sql) in your SQL environment to analyze your data. Adjust table and column names as needed to match your database schema.
-
Review Results: Use the results from these queries to gain insights into product pricing, discounts, and review content.
Feel free to fork this repository and submit pull requests with any improvements or additional queries.