- Overview
- Key Functionalities
- Subpackages
- Getting Started with the Package
- Acknowledgement
- References
- Authors
- Statement
- Supported Python Versions
The MED3pa package is specifically designed to address critical challenges in deploying machine learning models, with a particular focus on the robustness and reliability of models under real-world conditions. It provides comprehensive tools for evaluating model stability and performance in the face of prediction uncertainty and disadvantaged data profiles associated with degraded model performance. This work is developed alongside the associated methodological article, published in the Journal of the American Medical Informatics Association (JAMIA): https://doi.org/10.1093/jamia/ocag034. The full code used to generate the results presented in the article is available here: https://github.com/MEDomicsLab/study_3pa.
-
Model Confidence Estimation: Through the MED3pa subpackage, the package measures the predictive confidence at both individual and group (profile) levels. This helps in understanding the reliability of model predictions and in making informed decisions based on model outputs.
-
Identification of disadvantaged Profiles: MED3pa analyzes data profiles for whom the BaseModel consistently leads to poor model performance. This capability allows developers to refine training datasets or retrain models to handle these edge cases effectively.
The package is structured into four distinct subpackages:
- datasets: Stores and manages the dataset.
- models: Handles ML models operations.
- med3pa: Evaluates the model’s performance & extracts disadvantaged profiles.
This modularity allows users to easily integrate and utilize specific functionalities tailored to their needs without dealing with unnecessary complexities.
To get started with MED3pa, follow the installation instructions and usage examples provided in the documentation.
pip install MED3paWe have created a simple example of using the MED3pa package. See the full example here
from MED3pa.datasets import DatasetsManager
from MED3pa.med3pa import Med3paExperiment
from MED3pa.models import BaseModelManager
from MED3pa.visualization.mdr_visualization import visualize_mdr
from MED3pa.visualization.profiles_visualization import visualize_tree
...
# Initialize the DatasetsManager
datasets = DatasetsManager()
datasets.set_from_data(dataset_type="testing",
observations=x_evaluation.to_numpy(),
true_labels=y_evaluation,
column_labels=x_evaluation.columns)
# Initialize the BaseModelManager
base_model_manager = BaseModelManager(model=clf)
# Execute the MED3PA experiment
results = Med3paExperiment.run(
datasets_manager=datasets,
base_model_manager=base_model_manager,
**med3pa_params
)
# Save the results to a specified directory
results.save(file_path='results/oym')
# Visualize results
visualize_mdr(result=results, filename='results/oym/mdr')
visualize_tree(result=results, filename='results/oym/profiles')MED3pa is an open-source package developed at the MEDomicsLab laboratory. We welcome any contribution and feedback.
- Olivier Lefebvre: Student (Ph. D. Computer science) at Université de Sherbrooke
- Lyna Chikouche: Research intern at MEDomicsLab laboratory.
- Ludmila Amriou: Research intern at MEDomicsLab laboratory.
- Martin Vallières: Associate professor, Department of Oncology at McGill University
The MED3pa package is developed and tested with Python 3.12.3.
Additionally, it is compatible with the following Python versions:
- Python 3.11.x
- Python 3.10.x
- Python 3.9.x
While the package may work with other versions of Python, these are the versions we officially support and recommend.