# MRI Preprocessing Pipeline

A generalized implementation of MRI preprocessing for various ML/AI tasks within the Parra Lab. This project automates the ingestion, analysis, and processing of raw DICOM MRI data into model-ready inputs.
## Table of Contents

- Overview
- Key Features
- Project Structure
- Installation
- Usage
- Preprocessing Workflow
- Testing
- Contributing
- Acknowledgements
## Overview

The MRI Preprocessing Pipeline is a modular system built to handle large datasets of MRI scans. It runs inside a Docker container to ensure a consistent environment and supports both an interactive web-based control system and a scriptable command-line interface.

The core functionality resides in `code/preprocessing/`, where a series of Python scripts handles everything from DICOM extraction to NIfTI conversion and spatial alignment.
## Key Features

- **Automated Scanning**: Recursively scans directories for MRI DICOM files.
- **Metadata Extraction**: Extracts and standardizes DICOM header information into CSV tables.
- **Intelligent Parsing**: Identifies scan types (T1, T2, etc.) and orders sequences by acquisition time.
- **Modular Design**: Each pipeline step is a standalone script, allowing flexible execution and debugging.
- **Containerized Environment**: Fully Dockerized setup for easy deployment on Linux and WSL systems.
- **Web Interface** (in development): A Flask-based dashboard to monitor and control processing status.
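The automated scanning feature can be illustrated with a short, stdlib-only sketch. The actual `01_scanDicom.py` also reads header metadata; this example shows only the recursive file discovery, and it assumes files carry a `.dcm` extension (real exports sometimes omit one):

```python
import os
from collections import defaultdict

def scan_dicom_dirs(root: str) -> dict:
    """Recursively collect files ending in .dcm under `root`, grouped by
    containing directory (a common one-directory-per-series export layout)."""
    series = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            if name.lower().endswith(".dcm"):
                series[dirpath].append(os.path.join(dirpath, name))
    return dict(series)
```

Each key in the returned dict is a directory holding one candidate series, which is a convenient unit for later per-series metadata extraction.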
## Project Structure

```
MRI_preprocessing/
├── code/
│   └── preprocessing/       # Core Python scripts for data processing
│       ├── 01_scanDicom.py  # Scans and extracts DICOM metadata
│       ├── 02_parseDicom.py # Filters and orders scans
│       ├── ...              # Subsequent processing steps
│       ├── DICOM.py         # DICOM handling utilities
│       └── toolbox.py       # General helper functions
├── control_system/          # Docker and web app configuration
│   ├── app/                 # Flask web application
│   └── docker*              # Docker Compose files
├── data/                    # Data storage (mounted volumes)
├── test/                    # Unit and integration tests
├── start_control.sh         # Main entry point script
└── install.py               # Dependency installation script
```
## Installation

### Prerequisites

- Linux or Windows Subsystem for Linux (WSL2)
- Python 3.x
- Docker & Docker Compose (installed automatically via `install.py` if not present)

### Setup

1. Clone the repository:

   ```bash
   git clone https://github.com/TheParraLab/MRI_preprocessing
   cd MRI_preprocessing
   ```

2. Install dependencies and set up Docker:

   ```bash
   python3 install.py
   ```

   Note: This script attempts to install Docker and configure GPU access. If you prefer, you can install Docker manually.
## Usage

The primary way to interact with the pipeline is the `start_control.sh` script:

```bash
bash start_control.sh
```

You will be prompted to:

- Enable the webserver component (y/n).
- Provide the path to your raw DICOM data on the host machine.

The system maps your local data directory to `/FL_system/data/raw/` inside the Docker container.

### Web Interface

If enabled, the web interface is accessible at http://localhost:5000. It provides a dashboard showing the status of the preprocessing steps. (Note: the web interface is currently under active development.)
### Manual Access

For batch processing or direct control, you can access the container's shell.

Option 1: Convenience script

```bash
bash access_preprocessing.sh
```

Option 2: Direct Docker exec

```bash
docker exec -it control bash
cd /FL_system/code/preprocessing/
```

## Preprocessing Workflow

The pipeline consists of numbered scripts in `code/preprocessing/` that should generally be run in order:
- `01_scanDicom.py`: Scans raw data and builds a `Data_table.csv` of all found DICOM files.
  - Documentation: See `code/preprocessing/01_scanDicom.py` for detailed usage and arguments.
- `02_parseDicom.py`: Filters relevant scans (e.g., T1) and orders them by acquisition time.
- `03_saveNifti.py`: Converts selected DICOM series to NIfTI format.
- `04_saveRAS.py`: Reorients NIfTI files to RAS orientation.
- `05_alignScans.py`: Aligns scans to a reference volume.
- `06_genInputs.py`: Generates final model inputs.
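Because the step scripts carry numeric filename prefixes, their execution order can be derived directly from the names. The following is a minimal sketch (not part of the repository) of a driver that discovers and runs the steps in order; it assumes only the two-digit prefix convention shown above:

```python
import re
import subprocess
from pathlib import Path

def ordered_steps(script_dir: str) -> list:
    """Return the numbered step scripts sorted by their numeric prefix."""
    scripts = Path(script_dir).glob("[0-9][0-9]_*.py")
    return sorted(scripts, key=lambda p: int(re.match(r"(\d+)_", p.name).group(1)))

def run_pipeline(script_dir: str) -> None:
    """Execute each step in order; check=True aborts on the first failure."""
    for script in ordered_steps(script_dir):
        subprocess.run(["python", script.name], cwd=script_dir, check=True)
```

Driving the steps through one entry point like this makes it easy to resume from a given step or to skip steps during debugging, which is the point of the modular numbered layout.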
To run a specific step manually inside the container:

```bash
python 01_scanDicom.py --scan_dir /FL_system/data/raw --save_dir /FL_system/data
```

## Testing

Unit and integration tests are located in the `test/` directory.
To run the tests (ensure `pytest` is installed):

```bash
pytest test/
```

## Contributing

- Fork the repository.
- Create a feature branch (`git checkout -b feature/NewFeature`).
- Commit your changes.
- Push to the branch.
- Open a Pull Request.
Please ensure all new code is well-documented and passes existing tests.
## Acknowledgements

- Parra Lab
- Contributors: [Add names here]

For questions or support, please contact nleotta000@citymail.cuny.edu.