This project involves using PostgreSQL to explore the Sakila DVD Rental database, which contains data related to a company that rents movie DVDs. The objective is to gain insights into customer behavior, movie rentals, payment earnings, and store performance. SQL queries are used to extract relevant data, and visualizations are created to showcase the findings.
The Sakila DVD Rental database can be downloaded from the following link: Sakila DVD Rental Database
The project consists of SQL queries and data visualizations. Here's a brief overview of the key components:
-
SQL Queries: SQL queries are written to extract and analyze data from the database. Four main queries are used to answer specific questions and generate insights.
- Query 1: Examines movie rental patterns across different film categories.
- Query 2: Divides family movies into quartiles based on rental duration.
- Query 3: Analyzes the count of movies within each category for different rental duration categories.
- Query 4: Provides information on store performance in terms of fulfilling rental orders.
-
Data Visualizations: Visualizations are created to present the results of the SQL queries. The visualizations can include bar charts, line graphs, or any other relevant types to effectively communicate the findings (in
report.pdffile).
To run the SQL queries in this project, follow these steps:
- Database Setup: Ensure you have PostgreSQL installed on your computer.
- Download Dataset: Download the Sakila DVD Rental database from the provided link.
- Connect to Database: Use a SQL client (e.g., pgAdmin, DBeaver, or psql command-line) to connect to the Sakila database.
- Execute Queries: Open and execute the SQL queries provided in the project.
The project's SQL queries provide insights into customer behavior, movie rentals, and store performance. Visualizations are generated to present the findings effectively.
queries.txt: Contains the SQL queries used in the project.report.pdf: File containing visualizations generated from the query results and insights.
- PostgreSQL: The project requires a PostgreSQL database to run the SQL queries.
- This project was completed as part of the Programming for Data Science Nanodegree offered by Udacity. Its already the first project of Introduction to Python Programming course on the whole program.