Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
103 changes: 102 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,2 +1,103 @@
# Netflix_data_sci
Explore Netflix's World 🌍🍿. An in-depth analysis of Netflix's vast content library! Dive into the data behind over 10,000 movies and TV shows. Discover trends, genres, top creators, and more with Python and data visualizations. Uncover the magic of Netflix data!
🎬 Netflix Data Analysis with Python 📊
Dive into the world of Netflix with Python! Analyze over 10,000 titles to uncover trends, insights, and visual stories from your favorite streaming giant.

📌 Overview
Netflix, one of the world’s leading media and video streaming platforms, boasts a massive library of movies 🎥 and TV shows 📺 with over 222 million subscribers globally (as of mid-2021). This project presents a comprehensive exploratory data analysis (EDA) of Netflix’s vast content catalog, using powerful Python libraries to clean, analyze, and visualize the data.

The dataset contains rich metadata like titles, cast, directors, genres, ratings, countries, durations, and more. Using Python’s data science stack, we unlock hidden trends and insights about how Netflix curates and evolves its content for a global audience.

🔍 Project Highlights
🛠️ Libraries Used
Pandas 🐼 – for data manipulation

NumPy 🧮 – for numerical operations

Matplotlib 📊 – for static plotting

Seaborn 📈 – for aesthetic visualizations

WordCloud ☁️ – for generating textual data clouds

📊 Dataset Columns
Column Description
show_id Unique identifier for each title
type Whether it's a Movie or TV Show
title Title of the content
director Director’s name
cast Main cast members
country Country of production
date_added Date added to Netflix
release_year Year of release
rating Maturity rating (e.g., TV-MA, PG)
duration Duration in minutes or number of seasons
listed_in Genre(s)
description Short summary

📈 Key Insights & Visualizations
🌍 Countrywise Rating Distribution

🎭 Genre Trends by Country

🔞 Genre vs. Rating Matrix

📊 Correlation Heatmaps

🎤 Most Active Actors

⏱️ Content Duration Analysis

📅 Time Trends

👶 Age Group Classification

🎬 Top Genres & Directors

🌎 Global Distribution

🍿 Content Type Breakdown

🎯 Goals of This Project
Provide an intuitive and visual summary of Netflix’s content catalog.

Practice real-world EDA skills using Python and open data.

Understand how genres, ratings, and countries influence content strategy.

Explore temporal trends in Netflix content additions.

Identify dominant creators and popular content categories.

🌟 Features
Cleaned and preprocessed dataset.

Interactive and aesthetic plots using Seaborn and Matplotlib.

Word clouds for cast and genres.

Grouped visualizations for type-based insights.

Year-wise and country-wise trend comparisons.

🔮 Future Enhancements
Add interactive dashboards using Plotly or Streamlit.

Include recommendation engine prototype.

Perform sentiment analysis on descriptions.

Integrate with live Netflix APIs for updated content.

Compare with other platforms like Prime, Hulu, or Disney+.

🤝 Contributors
Esha Yalagi

Mudabbir Nargaddi

🧠 Topics & Skills
#Exploratory-Data-Analysis #Data-Visualization #Python #Netflix #Pandas #Seaborn #WordCloud #EDA #DataCleaning

📣 About
Explore Netflix’s World 🌍🍿
An engaging, visual deep-dive into Netflix’s library using Python. Whether you're a data enthusiast, student, or Netflix lover, this project will help you understand the magic of content strategy behind one of the world's biggest streaming platforms.