PathBench-MIL

Downstream evaluation scripts and benchmark pipelines for PathBench — multiple-instance learning (MIL) based WSI classification and survival prediction.

Overview
Features
Repository Layout
Data Preparation
Quick Start
- WSI Classification
- Survival Prediction
Citation
Contributing
License

Overview

PathBench-MIL contains the downstream evaluation code used by the PathBench benchmark. It provides training and evaluation pipelines for:

WSI classification
Survival prediction

The repository expects features to be extracted from whole-slide images (WSIs) using a separate extraction pipeline (for example, PrePATH). See the Data Preparation section below.

Features

Modular training/evaluation scripts for classification and survival tasks
Example splits and dataset templates
Simple shell wrappers (run.sh) for reproducible experiments

Repository Layout

README.md
classification/
    ├── main.py
    ├── run.sh
    ├── datasets/
    └── models/
    ├── splits/
    └── utils/
survival/
    ├── main_kfold.py
    ├── run.sh
    └── models/
    ├── splits/
    └── utils/

Data Preparation

Recommended workflow:

Split each WSI into patches and extract features with a pretrained model (outside this repo, e.g., PrePATH).
Store features as .pt or .h5 files and organize them under a cohort directory.

Suggested layout:

DATA_ROOT/
└── CohortName/
        ├── pt_files/
        │   ├── resnet50/
        │   │   ├── slide_1.pt
        │   │   ├── slide_2.pt
        │   │   └── ...
        │   └── ...
        └── patches/
                ├── slide_1.h5
                ├── slide_2.h5
                └── ...

Add dataset paths and labels to classification/splits/datasets.xlsx for classification experiments.
For survival experiments, follow the survival/splits/example.xlsx format (time-to-event and event flag columns).

Note: If you change storage format or file layout, please update the corresponding data loader in datasets/.

Quick Start

We recommend using conda with Python 3.10 and installing dependencies from requirements.txt for reproducible environments.

# create and activate a conda environment with Python 3.10
conda create -n pathbench python=3.10 -y
conda activate pathbench

# install from requirements.txt (preferred)
pip install --upgrade pip
pip install -r requirements.txt

WSI Classification

Prepare feature files and update classification/splits/datasets.xlsx.
Edit classification/run.sh to set dataset root, hyperparameters and other options.
Run the training/benchmark:

cd classification
bash run.sh

Survival Prediction

Prepare an Excel with survival info following survival/splits/example.xlsx.
Edit survival/run.sh (paths, hyperparameters).
Run:

cd survival
bash run.sh

Citation

If you use this code or the benchmark in your research, please cite:

@article{ma2025pathbench,
    title={PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology},
    author={Ma, Jiabo and Xu, Yingxue and Zhou, Fengtao and Wang, Yihui and Jin, Cheng and Guo, Zhengrui and Wu, Jianfeng and Tang, On Ki and Zhou, Huajun and Wang, Xi and others},
    journal={arXiv preprint arXiv:2505.20202},
    year={2025}
}

How to evaluate new models

To evaluate a new MIL model using this benchmark, please visit the PrePATH repository.

License

This project is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).

See the full license text in the LICENSE file at the repository root or online:

https://creativecommons.org/licenses/by-nc/4.0/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PathBench-MIL

Table of Contents

Overview

Features

Repository Layout

Data Preparation

Quick Start

WSI Classification

Survival Prediction

Citation

How to evaluate new models

License

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

PathBench-MIL

Table of Contents

Overview

Features

Repository Layout

Data Preparation

Quick Start

WSI Classification

Survival Prediction

Citation

How to evaluate new models

License