Skip to content

LABOSO123/Adhd_diagnosis_model-2025_WIDs_datathon

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

10 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

ADHD Prediction and Gender Classification Model

πŸ“Œ Overview

This repository contains a machine learning pipeline developed for the WiDS Datathon 2025, focusing on predicting ADHD and classifying gender based on neurophysiological data. The study explores feature engineering, model selection, and explainability techniques to enhance ADHD diagnosis, particularly in females.

🌟 Why This Matters

ADHD diagnosis in females is understudied and often misdiagnosed due to symptom variability. This model leverages machine learning to identify distinct brain activity patterns, enabling personalized interventions and improved early diagnosis.

πŸ“Š Dataset

The dataset used in this project is part of the WiDS Datathon 2025. Register and download it from the competition page.

   - Folders: Train, Test, and a Data Dictionary (available on Kaggle)
    
    -Data Type: Neurophysiological features linked to ADHD diagnosis in connectome matrices
    
    -Features Include: EEG signals, response times (quantitative) & demographic attributes (categorical)
    
    -Preprocessing Steps: Handling missing values, outliers, and feature transformations

πŸ“Œ Methodology & Approach

The model development follows a structured machine learning pipeline:

    1.Exploratory Data Analysis (EDA) – Understanding feature distributions, correlations, and missing patterns.
    
    2.Feature Engineering (FE) – Creating meaningful features and reducing dimensionality.
    
    3.Data Preprocessing – Encoding categorical variables, scaling numerical data, handling imbalanced classes.
    
    4.Model Selection – Training and comparing multiple models:

            Logistic Regression
            
            Random Forest
            
            Gradient Boosting (Best Performing)
    
    5.Hyperparameter Tuning – Optimized models using GridSearchCV / Optuna.
    
    6.Evaluation Metrics – Performance measured using:
    
            Accuracy, Precision, Recall, F1-score (classification)
            
            ROC-AUC Score (for ADHD prediction)
            
            SHAP & LIME for interpretability

πŸ“Œ Results & Insights

Best Performing Model: Gradient Boosting Classifier

Key Findings:

    Feature 22 in the brain matrices had the most impact on ADHD prediction.
    
    Gender classification improved significantly with Feature Y.
    
    Addressing class imbalance significantly enhanced model performance.

πŸ“Œ How to Run the Project

    πŸ”Ή Clone the Repository

    git clone https://github.com/LABOSO123/adhd-prediction-model.git  
    cd adhd-prediction-model  

    πŸ”Ή Install Dependencies

    pip install -r requirements.txt  

    πŸ”Ή Run the Model

    python train_model.py  

    πŸ”Ή Make Predictions

python predict.py --input data/sample_input.csv

πŸ“Œ Repository Structure

    πŸ“‚ adhd-prediction-model  
    β”œβ”€β”€ πŸ“ data/                   # Dataset (or instructions on where to get it)  
    β”œβ”€β”€ πŸ“ notebooks/               # Jupyter Notebooks for EDA and experimentation  
    β”œβ”€β”€ πŸ“ src/                     # Source code for data preprocessing & modeling  
    β”œβ”€β”€ πŸ“ models/                  # Trained model files  
    β”œβ”€β”€ train_model.py              # Script to train the model  
    β”œβ”€β”€ predict.py                  # Script to generate predictions  
    β”œβ”€β”€ requirements.txt            # Dependencies  
    β”œβ”€β”€ README.md                   # Project documentation  

πŸ“Œ Future Work

Incorporate deep learning techniques (CNNs, RNNs) for improved prediction.

Deploy the model using Flask / FastAPI for real-world application.

Implement feature importance analysis for better interpretability.

πŸ“Œ Acknowledgments

WiDS Datathon organizers for the dataset and challenge.

πŸ“Œ Let's Connect!

πŸ’Ό LinkedIn: Faith Chemutai LabosoπŸ™ GitHub: LABOSO123

πŸš€ Feel free to fork this repository, contribute, or reach out with any questions!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors