Skip to content

Data analysis and logistic regression project in Python. Focused on exploratory data analysis, data visualization and implementing logistic regression from scratch.

Notifications You must be signed in to change notification settings

beatriangu/DSLR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DSLR

DataScience x Logistic Regression - School-42 project

Goals:

  • Learn how to read a dataset, to visualize it in different ways, to select and clean unnecessary information from your data.
  • Implement one-vs-all logistic regression that will solve classification problem

Look at subject.pdf for more information

Data Analysis

To show the dataframe from a complete csvfile with all types of values ​​and with all subjects

Miniatura

To display dataframe exactly as in the subject . Only numerical values, transposed texts and first four columns of Features

Miniatura

To display a dataframe with only numerical values, transposed axes and first four Subjects.

Miniatura

Data Visualitation

Histogram : show course marks distribution. For example ; The Best Hand concept does not provide us with any significant insights.

Histogram for All Subjects

Scatter: how values for two courses using Cartesian coordinates.

Null or weak correlation example between Arithmancy and Care of Magical Creatures

Miniatura

Pair Plot: This scatter plot matrix visualizes the pairwise relationships between each subject and all other subjects. It allows you to see how each pair of subjects correlates with each other.

Miniatura

Training and Evaluating:

python3 logreg_train.py [-h] [-v] dataset

Thumbnail

python3 logreg_predict.py dataset weights Generate a file with all predictions for a given dataset

Evaluate.py Thumbnail

Screenshot from 2024-09-06 16-37-46

About

Data analysis and logistic regression project in Python. Focused on exploratory data analysis, data visualization and implementing logistic regression from scratch.

Topics

Resources

Stars

Watchers

Forks

Languages