All notable changes to ProteoPy will be documented in this file. The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Preprocessing (
pr.pp):summarize_modifications()for modification summarization - Analysis (
pr.tl): ANOVA support indifferential_abundance() - Visualization (
pr.pl):binary_heatmap(),box(),volcano(),peptides_on_sequence(),peptides_on_prot_sequence();print_statsparameter across multiple plot functions - Datasets (
pr.datasets):williams_2018()andkarayel_2020()download functions - Utilities (
pr.utils): Public API withis_proteodata(),check_proteodata(),is_log_transformed() - Documentation: Sphinx documentation site; proteoform inference and protein-level analysis tutorials
- Reader (
pr.read):diann()now supports version >=1.9.1 with automatic version dispatch - Preprocessing (
pr.pp):impute_downshift()now supportsgroup_by;normalize_median()gainsmethodparameter;remove_contaminants()defaults toinplace=True - Validation:
is_proteodata()now checks for NaN in ID columns, infinite values in.X/layers, and obs/var index sync
volcano_plottype incompatibility and label displayn_cat1_per_cat2_histminimum bin width
Initial release of ProteoPy.
- Data import (
pr.read): Support for DIA-NN and generic long-format tables - Annotation (
pr.ann): Functions to annotate samples (.obs) and variables (.var) - Quality control (
pr.pp): Completeness filtering, CV calculation, contaminant removal - Preprocessing (
pr.pp): Median normalization, downshift imputation - Differential abundance (
pr.tl): t-test, Welch's test, ANOVA with multiple testing correction - Proteoform inference (
pr.tl): COPF algorithm reimplementation for detecting functional proteoform groups - Visualization (
pr.pl): Volcano plots, abundance rank plots, intensity distributions, CV plots, correlation matrices, hierarchical clustering profiles - Datasets (
pr.datasets): Built-in example datasets (Karayel 2020)