ALZHEIMER_CLASSIFICATION_MLOPS

Transforming Alzheimer Detection with Seamless Automation


Overview

Alzheimer_classification_mlops is a comprehensive MLOps toolkit designed to streamline the development, training, and deployment of Alzheimer's disease classification models. It uses audio features like MFCC and Log-Mel Spectrograms extracted from speech data to power its models. The project integrates automated pipeline orchestration, environment setup, and experiment tracking to ensure reproducibility and efficiency from start to finish.

Key Features

🎯 Automated pipeline execution that handles dependencies, data versioning, and artifact management.
🚀 Seamless environment setup and deployment, including secret management and repository cloning.
🔬 Reproducible experiments with DVC and MLflow, supporting robust cross-validation and performance tracking.
📊 Consistent audio feature extraction using MFCC and LogMel, ensuring reliable data for model training.
⚙️ Orchestrated training and evaluation workflows, facilitating scalable model development and deployment.
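
To make the two feature types concrete, here is a minimal, dependency-free sketch of how log-mel energies and MFCCs relate for a single audio frame. It is not the project's extraction code, which this README does not show. The function names, the frame length, the number of filters, and the HTK-style mel formula are all assumptions chosen for illustration; a real pipeline would normally rely on an audio library rather than a hand-written DFT.

```python
import cmath
import math


def hz_to_mel(hz):
    # HTK-style mel scale (an assumption; other variants exist).
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    # Naive DFT, adequate for a short illustrative frame; returns bins 0..N/2.
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2 / n)
    return spectrum


def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters whose edges are equally spaced on the mel scale.
    n_bins = n_fft // 2 + 1
    mel_max = hz_to_mel(sample_rate / 2.0)
    edges_hz = [mel_to_hz(mel_max * i / (n_filters + 1)) for i in range(n_filters + 2)]
    bin_hz = [k * sample_rate / n_fft for k in range(n_bins)]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = edges_hz[m - 1], edges_hz[m], edges_hz[m + 1]
        row = []
        for f in bin_hz:
            if lo < f <= mid:
                row.append((f - lo) / (mid - lo))
            elif mid < f < hi:
                row.append((hi - f) / (hi - mid))
            else:
                row.append(0.0)
        bank.append(row)
    return bank


def log_mel(frame, sample_rate, n_filters=8):
    spec = power_spectrum(frame)
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    # A small floor avoids log(0) for silent bands.
    return [math.log(sum(w * p for w, p in zip(row, spec)) + 1e-10) for row in bank]


def mfcc(log_mel_energies, n_coeffs=4):
    # MFCCs are a DCT-II of the log-mel energies (unnormalised here).
    n = len(log_mel_energies)
    return [
        sum(e * math.cos(math.pi * c * (i + 0.5) / n) for i, e in enumerate(log_mel_energies))
        for c in range(n_coeffs)
    ]


# A 440 Hz tone sampled at 8 kHz, 64 samples long (illustrative values).
SAMPLE_RATE = 8000
frame = [math.sin(2 * math.pi * 440 * t / SAMPLE_RATE) for t in range(64)]
log_mel_features = log_mel(frame, SAMPLE_RATE)
mfcc_features = mfcc(log_mel_features)
```

The point of the sketch is the relationship between the two options: log-mel features are the filterbank energies themselves, while MFCCs are a further cosine transform of those energies, keeping only the first few coefficients.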

MLOps Workflow

This project is built on a modern MLOps stack to automate and manage the entire machine learning lifecycle:

🚀 Automated CI Workflow with GitHub Actions & Kaggle: The core of our automation is a Continuous Integration (CI) pipeline powered by GitHub Actions. This workflow is uniquely configured to:
1. Utilize Kaggle Kernels as Runners: Instead of standard GitHub runners, our jobs execute on Kaggle's free GPU environment, which is ideal for training ML models.
2. Run the Full Pipeline: The action automatically checks out the code, installs dependencies, and runs the complete DVC pipeline for training and evaluation.
3. Commit Results Back: Upon successful completion, the workflow automatically commits the new results—such as updated metric files, evaluation plots, and DVC pointers—back to the Git repository. This creates a fully automated loop where new code triggers a run, and its results are immediately versioned and available.
📦 Data & Model Versioning with DVC: We use Data Version Control (DVC) to manage large datasets, intermediate files, and trained models. This ensures that every experiment is fully reproducible by versioning not just the code (with Git), but also the exact data, parameters, and models used in each run.
🔬 Experiment Tracking with MLflow: Every training run is meticulously logged with MLflow. It captures hyperparameters, performance metrics (e.g., accuracy, loss), and model artifacts for each fold in our cross-validation. The MLflow UI provides a clear, centralized dashboard to compare different experiments and identify the best-performing models.
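
The per-fold tracking described above can be pictured with a small, dependency-free sketch. It is only an illustration under stated assumptions: the number of folds, the stratified split, the `ad`/`control` labels, and the majority-class "model" are all hypothetical, and nothing here is taken from the project's source. The sketch collects one record per fold in a plain list; in the real project those values would presumably be sent to MLflow instead.

```python
import random
from collections import defaultdict


def stratified_folds(labels, n_folds, seed=0):
    """Assign each sample index to a fold, keeping class balance roughly equal."""
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    rng = random.Random(seed)
    folds = [[] for _ in range(n_folds)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal indices round-robin so every fold gets a share of each class.
        for position, idx in enumerate(indices):
            folds[position % n_folds].append(idx)
    return folds


def cross_validate(labels, n_folds, train_and_score):
    """Run one train/score cycle per fold and collect a record for each."""
    folds = stratified_folds(labels, n_folds)
    records = []
    for fold_id, val_idx in enumerate(folds):
        train_idx = [i for f, fold in enumerate(folds) if f != fold_id for i in fold]
        accuracy = train_and_score(train_idx, val_idx)
        records.append({"fold": fold_id, "accuracy": accuracy, "n_val": len(val_idx)})
    return records


def majority_baseline(train_idx, val_idx):
    # Toy stand-in for a real model: predict the most common training label.
    train_labels = [LABELS[i] for i in train_idx]
    guess = max(set(train_labels), key=train_labels.count)
    return sum(LABELS[i] == guess for i in val_idx) / len(val_idx)


# Hypothetical, perfectly balanced label set.
LABELS = ["ad"] * 10 + ["control"] * 10
fold_records = cross_validate(LABELS, n_folds=5, train_and_score=majority_baseline)
mean_accuracy = sum(r["accuracy"] for r in fold_records) / len(fold_records)
```

Keeping one record per fold, rather than only the mean, is what makes it possible to compare folds side by side in a tracking dashboard.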


Getting Started

Follow these instructions to get a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

This project requires the following dependencies:

Programming Language: Python 3.8+
Package Manager: Pip

Installation

Build Alzheimer_classification_mlops from the source and install dependencies:

Clone the repository:

```
git clone git@github.com:labyedh/Alzheimer_classification_mlops.git
```

Navigate to the project directory:
```
cd Alzheimer_classification_mlops
```
Install the dependencies:

Using pip:
```
pip install -r requirements.txt
```


Usage

Run the main project pipeline using main.py. You must specify a stage (train, evaluate, or full-pipeline). You can also temporarily override the feature type defined in params.yaml.

1. Run the full pipeline (Training and Evaluation):

```
python main.py full-pipeline
```

2. Run only the training stage:

```
python main.py train
```

3. Run only the evaluation stage:

```
python main.py evaluate
```

4. Override the feature type for a run:

This command runs the full pipeline using mfcc features, temporarily overriding the value in params.yaml.
```
python main.py full-pipeline --feature mfcc
```
This command runs the full pipeline using log_mel features, temporarily overriding the value in params.yaml.
```
python main.py full-pipeline --feature log_mel
```
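
For readers curious how such a command line might be wired up, the following is a minimal sketch using Python's standard `argparse` module. It is not the project's actual `main.py`: the helper names and the `feature_type` key standing in for the contents of params.yaml are assumptions. Only the stage names and the `--feature` values are taken from the commands above.

```python
import argparse

STAGES = ("train", "evaluate", "full-pipeline")
FEATURES = ("mfcc", "log_mel")


def build_parser():
    parser = argparse.ArgumentParser(prog="main.py")
    parser.add_argument("stage", choices=STAGES, help="pipeline stage to run")
    parser.add_argument("--feature", choices=FEATURES, default=None,
                        help="override the feature type from params.yaml for this run")
    return parser


def resolve_feature(args, params):
    # The command-line value wins; otherwise fall back to the configured one.
    # The params mapping is never modified, so the override lasts one run only.
    return args.feature if args.feature is not None else params["feature_type"]


# Stand-in for the parsed contents of params.yaml (the key name is assumed).
params = {"feature_type": "log_mel"}

args = build_parser().parse_args(["full-pipeline", "--feature", "mfcc"])
selected = resolve_feature(args, params)

default_args = build_parser().parse_args(["train"])
fallback = resolve_feature(default_args, params)
```

Because the override is resolved at run time and never written back, the value stored in the configuration stays the default for later runs.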


Contributing

Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement".

1. Fork the Project
2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
3. Commit your Changes (git commit -m 'Add some AmazingFeature')
4. Push to the Branch (git push origin feature/AmazingFeature)
5. Open a Pull Request


License

Distributed under the MIT License. See the LICENSE file for more information.


Acknowledgements
