An End-to-End Data Analytics Project demonstrating data cleaning, SQL analytics, and interactive Power BI visualization to uncover actionable business insights.
In today's competitive retail landscape, understanding who your customers are and how they shop is critical for growth. The business lacked visibility into key purchasing patterns, such as seasonal trends, high-value customer segments, and the most profitable product categories. This project solves that problem by building a comprehensive analytics pipeline that transforms raw transaction data into a clear, interactive dashboard—enabling stakeholders to make data-driven decisions on marketing strategies and inventory management.
This project follows a complete end-to-end data analytics workflow:
- Data Cleaning & Preprocessing (Python/Jupyter): Imported raw transaction data, handled missing values, standardized formats, and prepared a clean dataset for accurate analysis.
- SQL Analytics: Queried the database to extract key metrics (e.g., total revenue, average order value, customer segmentation).
- Power BI Dashboard: Connected the cleaned data to Power BI to design interactive visualizations that allow users to filter and explore insights dynamically.
- Final Report Generation: Summarized the findings, methodologies, and business recommendations into a concise PDF report for leadership.
- Database & Querying: SQL
- Data Preprocessing: Python (Pandas, Jupyter Notebook)
- Data Visualization: Power BI
- Documentation & Reporting: PDF / Markdown
Customer-Behavior-Analysis/
│
├── customer_shopping_behavior.csv # Cleaned dataset used for analysis
├── Customer_Data_Analyst_Portfolio_Project.ipynb # Data cleaning and preprocessing notebook
├── SQL_Customer_Behavior_Analysis.sql # SQL scripts for data exploration & insights
├── Customer_Data_Dashboard.pbix # Interactive Power BI dashboard file
├── Power_BI_Dashboard_Image/ # Folder containing dashboard screenshots
│ ├── Best and Worst items .png
│ └── Customer Analysis Revenue .png
├── Images_used/ # Additional assets and icons
└── Report_Customer-Behaviour-Analysis.pdf # Final insights report
- High-Value Segments: Identified the most profitable demographic groups, providing a clear target for future marketing campaigns.
- Product Performance: Highlighted the top-performing and lowest-performing items, offering actionable guidance for inventory management.
- Revenue Trends: Uncovered purchasing trends across different categories and timeframes, helping to align promotions with customer behavior.
- Payment Methods: Analyzed the distribution of payment methods to understand customer preferences at checkout.
Here are snapshots of the Power BI Dashboard created for this project:
To explore this project locally:
- Clone the repository:
git clone https://github.com/CodingWithRishi/Customer-Behavior-Analysis.git
- View the Code:
- Open
Customer_Data_Analyst_Portfolio_Project.ipynbin Jupyter to see the data cleaning process. - Open
SQL_Customer_Behavior_Analysis.sqlin your preferred SQL client to review the analytical queries.
- Open
- Explore the Dashboard:
- Download and open
Customer_Data_Dashboard.pbixusing Power BI Desktop.
- Download and open
- Read the Report:
- Open
Report_Customer-Behaviour-Analysis.pdffor a high-level executive summary of the findings.
- Open
- Predictive Analytics: Implement machine learning to predict future customer purchases or churn.
- Automated Data Pipeline: Set up an automated ETL process to refresh dashboard data seamlessly when new transactions occur.
- Deeper Segmentation: Add RFM (Recency, Frequency, Monetary) analysis to further categorize customer loyalty.
Author: Rishi Patel

