Welcome to my collection of data science projects completed during my internship at CodSoft. Each project focuses on building machine learning models to solve real-world problems using various datasets.
- Iris Flower Classification
- Sales Prediction
- Fraudulent Credit Card Transactions
- Movie Rating Prediction
- Titanic Survival Prediction
This project utilizes the Iris dataset to classify iris flowers into three species based on sepal and petal measurements. The dataset includes features such as sepal_length, sepal_width, petal_length, petal_width, and species.
- Python
- Pandas
- Scikit-learn
- Matplotlib
- Seaborn
In this project, a machine learning model predicts future sales based on factors like advertising expenditures across different platforms (TV, Radio, and Newspaper). The goal is to optimize advertising strategies.
- Python
- Pandas
- Scikit-learn
- Matplotlib
- Seaborn
This project involves building a model to identify fraudulent credit card transactions. The dataset includes various features such as transaction time and amounts, while handling class imbalance is critical in this task.
- Python
- Pandas
- Scikit-learn
- Matplotlib
- Seaborn
This project aims to predict movie ratings based on features like genre, director, and actors. The goal is to analyze historical movie data to develop an accurate rating estimation model.
- Python
- Pandas
- Scikit-learn
- Matplotlib
- Seaborn
This classic beginner project utilizes the Titanic dataset to build a model that predicts whether a passenger survived the disaster. The dataset contains information about passengers, such as their age, gender, ticket class, fare, cabin, and survival status.
- Python
- Pandas
- Scikit-learn
- Matplotlib
- Seaborn
- Data Exploration: Analyze the dataset to understand the features.
- Data Preprocessing: Handle missing values and convert categorical variables.
- Feature Engineering: Select relevant features for model training.
- Model Training: Implement machine learning algorithms to predict survival.
- Model Evaluation: Assess the model's performance using accuracy, precision, and recall.
These projects demonstrate the application of data science techniques and machine learning algorithms to solve various problems. Each project is a step towards mastering data analysis and predictive modeling.
Feel free to explore each project in detail and reach out if you have any questions!
This project is licensed under the MIT License - see the LICENSE file for details.