This project focuses on end-to-end automobile data analysis using Python. The main objective of this project is to clean, process, analyze, and visualize automobile industry data to generate meaningful business insights.
The project includes:
- Data Cleaning & Preprocessing
- Exploratory Data Analysis (EDA)
- Feature Engineering
- Outlier Detection
- Data Visualization
- Business Insights Generation
- Python
- Pandas
- NumPy
- Matplotlib
- Seaborn
- Jupyter Notebook
Automobile-Analytics-System/
│
├── data/
│ ├── Automobile_data.csv
│ └── cleaned_automobile_data.csv
│
├── notebooks/
│ └── automobile_analysis.ipynb
│
├── visuals/
│ ├── correlation_heatmap.png
│ ├── price_distribution.png
│ ├── company_boxplot.png
│ ├── horsepower_vs_price.png
│ └── fuel_type_count.png
│
├── reports/
│ └── business_insights.txt
│
├── requirements.txt
└── README.md- Handled missing values
- Replaced invalid values
- Removed duplicate records
- Converted object data types into numerical format
- Average vehicle price analysis
- Company-wise price comparison
- Fuel type analysis
- Horsepower analysis
- Correlation analysis
- Price category creation
- Price per horsepower calculation
- Mileage analysis
- Performance categorization
- Boxplot analysis
- IQR method for detecting extreme price values
- Histogram
- Correlation Heatmap
- Scatter Plot
- Box Plot
- Count Plot
- Pair Plot
-
BMW and Mercedes-Benz vehicles have the highest average prices in the dataset.
-
Toyota and Honda cars are more budget friendly compared to luxury brands.
-
Horsepower has a strong positive relationship with vehicle price.
-
Diesel vehicles provide better mileage efficiency.
-
Luxury vehicles contain more pricing outliers.
-
Sedan body style is the most common vehicle category in the dataset.
-
Higher engine size generally increases vehicle price and performance.
-
Mid-range vehicles dominate the automobile market segment.
-
Gas fuel type vehicles are more common than diesel vehicles.
-
Correlation analysis showed that horsepower, engine-size, and curb-weight strongly influence car prices.
pip install -r requirements.txtjupyter notebook- Python Programming
- Data Cleaning
- Exploratory Data Analysis
- Data Visualization
- Statistical Analysis
- Feature Engineering
- Business Analytics
- Problem Solving
Shreenidhi B D




