This repository contains a collection of basic machine learning projects I worked on as part of a selection process for a college technical club. While I am no longer part of the selection process, I’ve retained these projects here to document my learning and growth in core ML concepts.
These tasks focus on foundational machine learning workflows using structured data. I explored various algorithms and techniques, including:
- Linear Regression
- Logistic Regression
- Decision Trees
- Clustering (Unsupervised Learning)
Each project includes:
- 📄 The dataset used (in
.csvformat) - 📓 A Jupyter Notebook with code and visualisations
-
linear_reg/train.csvtest.csvtask2_0_linearregression.ipynb
-
logistic_reg/Freyja_Pumpkins-test.csvGotem_Pumpkins-train.csvtask2_1_logisticregression.ipynb
-
decision_trees/Threats (1).csvtask3_2_Decision_trees.ipynb
-
clustering/Clustering_Data.csvtask3_0_clustering.ipynb
-
README.md
- Python
- Jupyter Notebooks
- NumPy, Pandas
- Matplotlib, Seaborn
- Scikit-learn
This repo marks my hands-on experience with machine learning. Through these mini-projects, I learned how to:
- Handle real-world CSV data
- Clean and preprocess datasets
- Build and evaluate ML models
- Visualise data insights and model performance