Skip to content

This project demonstrates a complete data analytics workflow that includes data extraction, cleaning, transformation, loading, and analysis. It is ideal for data engineers and analysts looking to learn ETL (Extract, Transform, Load) processes and SQL-driven analytics.

Notifications You must be signed in to change notification settings

theDhanendra/retail-data-analysis-with-Python-and-SQL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

6 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ›’ Retail Orders Data Analysis and Visualization ๐Ÿ“Š

This project focuses on analyzing a retail orders dataset to generate actionable business insights. The dataset was processed, cleaned, and analyzed using Python, SQL, and various visualization tools. The insights derived can help businesses optimize their operations, improve profitability, and enhance customer experience.


โœจ Project Highlights

  • ๐Ÿงน Data Cleaning and Preprocessing using Python.
  • ๐Ÿš€ Feature Engineering: Creation of new columns like discount, sale price, and profit.
  • ๐Ÿ› ๏ธ SQL Queries for in-depth analysis and insights.
  • ๐Ÿ“ˆ Advanced Visualizations: Bar charts, Column Charts, line plots, Pie Chart and Heatmap.
  • ๐Ÿ” Actionable Insights to drive business decisions.

๐Ÿ”‘ Key Features

  1. ๐Ÿงน Data Cleaning:

    • Replaced invalid or missing values.
    • Standardized column names for consistency.
    • Converted data types for better usability.
  2. ๐Ÿงฎ Data Analysis:

    • SQL queries to analyze top products, regions, and profitability.
    • ๐Ÿ“… Month-over-month and year-over-year comparisons.
    • ๐ŸŽฏ Impact of discounts on sales and profits.
  3. ๐Ÿ“Š Visualizations:

    • ๐ŸŸ  Category-wise Performance (Bar Charts).
    • ๐Ÿ—บ๏ธ Top Cities on a Map (Folium).
    • ๐Ÿ“‰ Monthly Trends (Line Charts).
    • ๐ŸŒ Regional Profitability and Product Popularity.

๐Ÿ’ป Technologies Used

  • Python Libraries:

    • ๐Ÿผ Pandas
    • ๐Ÿ“Š Matplotlib
    • ๐Ÿ–Œ๏ธ Seaborn
    • ๐Ÿ› ๏ธ mysql.connector
    • ๐Ÿ“ sqlalchemy for create_engine
  • SQL: Queries for analysis and insights.

    • Top Revenue Generating Products
    • Best-Selling Products by Region
    • Month-over-Month Sales Growth
    • Category-Wise Best Sales Month
    • Sub-Category Profit Growth
    • Discount Effectiveness
    • Shipping Mode Analysis
  • Database: ๐Ÿ—„๏ธ MySQL for data storage and querying.


โš™๏ธ Setup Instructions

  1. Clone this repository:
    git clone https://github.com/theDhanendra/retail-orders-analysis.git
    
  2. Navigate the project directory:
    cd retail-orders-analysis
    
  3. Install the required Python libraries:
    pip install -r requirements.txt
    
  4. Set up MySQL and create the database and table using the provided SQL script (retail_orders.sql).
  5. Run the Jupyter Notebook for analysis.

Thank You ๐Ÿ˜Š

About

This project demonstrates a complete data analytics workflow that includes data extraction, cleaning, transformation, loading, and analysis. It is ideal for data engineers and analysts looking to learn ETL (Extract, Transform, Load) processes and SQL-driven analytics.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published