This project explores the Netflix titles dataset to understand the type of content available on the platform and how it has changed over time.
It focuses on simple, clear data analysis — cleaning, visualizing, and drawing insights using Python, Pandas, and Matplotlib.
- Clean and prepare the dataset by handling missing values.
- Analyze the number of Movies vs TV Shows.
- Identify the top content-producing countries.
- Study yearly trends in Netflix releases.
- Find the most common genres.
- Present insights visually and clearly.
The dataset contains information about Netflix titles such as:
type, title, country, date_added, release_year, duration, and listed_in.
Source: Kaggle Netflix Dataset
- Python
- Pandas
- Matplotlib
- Jupyter Notebook / VS Code
netflix_analysis.ipynb– main notebook with code, analysis, and visualsnetflix_cleaned.csv– cleaned dataset after preprocessingNetflix_analysis.pdf– final PDF summaryREADME.md– project documentation
- Movies are more common than TV Shows.
- The United States is the top content producer.
- Netflix’s content library grew rapidly after 2015.
- Drama and Comedy are the most frequent genres.
Mubashir – Student learning AI and Data Science, aiming to build strong freelancing and academic skills.