GitHub - jayakumarudayakumar/Databricks_Python_cityofchicago_taxitrips: Optimizing Traffic Fleet Management and Reducing Traffic Congestion

Optimizing Traffic Fleet Management and Reducing Traffic Congestion

Summary

This project optimizes taxi fleet management using Big Data Management tools like Apache Spark. Some of our project objectives are,

Analyze historical and real-time taxi trip data from the City of Chicago
Identify high-demand pickup areas, peak hours, and traffic congestion hotspots
Propose fleet reallocation strategies

Methodology

Utilized Databricks and Apache Spark to perform distributed data processing and analysis
The dataset was preprocessed, cleaned, and aggregated to analyze trip volumes, fare distribution, and pickup/drop-off locations across various timeframes and geographic regions

Dataset

The City of Chicago taxi trip data set was used for this analysis. It contains fare amounts, trip miles, trip durations, pickup and drop-off timestamps, and the geographical location (community area) of pickups and drop-offs. Data Pre-processing, Descriptive Statistics, Demand Analysis, Congestion Analysis, Fleet Reallocation Recommendations

System Design

API Call - Historic Data (Jan 2024 - Sep 2024) and Real-time Data (Oct 2024 - Future)
Data Cleaning and Transformation
Load Data to Parquet files on Cloud Storage
Create Data Warehouse tables on PySpark
Load Data from Parquet files to DW tables
Data Analysis and Visualization

RDBS Schema

Trip (Fact Table)
Company (Dimension Table)
Transaction (Fact Table)

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Big Data Project Code (2).html		Big Data Project Code (2).html
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Optimizing Traffic Fleet Management and Reducing Traffic Congestion

Summary

Methodology

Dataset

System Design

RDBS Schema

About

Uh oh!

Releases

Packages

Languages

jayakumarudayakumar/Databricks_Python_cityofchicago_taxitrips

Folders and files

Latest commit

History

Repository files navigation

Optimizing Traffic Fleet Management and​ Reducing Traffic Congestion

Summary

Methodology

Dataset

System Design

RDBS Schema

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Optimizing Traffic Fleet Management and Reducing Traffic Congestion

Packages