@@ -10,20 +10,16 @@ This package contain supplementary code for the
1010### Package
1111
1212The ` mitcom ` package contains all relevant methods for handling protein profiles
13- and single components: from detection over correlation to distance matrix and
14- similarity calculation.
15-
16- ### Notebook
17-
18- The ` MitCOM.ipynb ` notebook makes use of these methods and shows the
19- way from data import to t-SNE output. It requires the ` mitcom ` package to be
20- installed/available.
13+ and single profile components: from component detection and correlation to
14+ similarity distance matrix calculation.
2115
2216### Peak detection script
2317
24- For detecting peaks in protein profile data, the package makes the
25- ` complex_find_peaks ` script available upon installation.
26- It reads two tab-separated input files (tsv) with the following structure:
18+ The ` complex_find_peaks ` executable for detecting peaks (aka components)
19+ in protein profile data is available after package installation.
20+
21+ It expects at least two tab-separated (tsv) input files with the following
22+ structure:
2723
2824** File 1:** Protein abundance values
2925
@@ -53,16 +49,54 @@ number of parallel workers, specify the output file names, a protein whitelist
5349and more.
5450Run ` complex_find_peaks --help ` to see all options.
5551
52+ ### Notebook
53+
54+ The ` MitCOM.ipynb ` notebook exemplifies the way from data import to t-SNE
55+ output. It requires the ` mitcom ` package to be installed.
56+
57+ #### Data files loaded from notebook
58+ - * protein_abundance_file* :
59+ Protein abundances (see above)
60+
61+ - * components_file* :
62+ Component output file from peak detection script
63+
64+ - * protein_whitelist_file* :
65+ Text file containing whitelisted proteins (optional)
66+
67+ - * predefined_complexes_file* :
68+ Two-column text file (tsv). Can be used to define * known*
69+ complexes by mapping component identifiers to arbitrary complex names.
70+ This mapping is only used during t-SNE visualization. Data points belonging
71+ to the same complex will be plotted as one individual trace that can be
72+ shown/hidden by clicking on its associated legend item.
73+ _ Note:_ The file is expected to exist and contain at least the header row.
74+
75+ Example:
76+ ```
77+ component complex
78+ ATPA_P6 ATP#1
79+ ATPB_P6 ATP#1
80+ ATPD_P3 ATP#1
81+ SDH3_P8 SDH
82+ SDHA_P8 SDH
83+ SDHB_P9 SDH
84+ ... ...
85+ ```
86+
87+
5688## Installation
5789
5890### Standard
5991
6092```bash
61- # with pip (consider to run this in a virtual environment)
62- pip install git+https://github.com/phys2/complexomics-mitcom.git
93+ # optional: create and activate virtual environment
94+ python -m venv .venv && .venv/Scripts/activate
95+
96+ # optional: install wheels to speed up installations
97+ pip install wheels
6398
64- # with poetry
65- poetry install git+https://github.com/phys2/complexomics-mitcom.git
99+ pip install git+https://github.com/phys2/complexomics-mitcom.git@main
66100```
67101
68102### With Jupyter notebook support
@@ -72,11 +106,10 @@ for running the Jupyter notebook, namely *jupyter*, *jupyterlab* and *plotly*.
72106To have these included, run these commands instead:
73107
74108``` bash
75- # with pip
76- pip install " git+https://github.com/phys2/complexomics-mitcom.git#egg=complexomics-mitcom[notebook]"
109+ pip install " git+https://github.com/phys2/complexomics-mitcom.git@main#egg=complexomics-mitcom[notebook]"
77110
78- # with poetry
79- poetry install --extras notebook git+ https://github.com/phys2/complexomics-mitcom.git
111+ # download MitCOM notebook
112+ curl -L -O https://github.com/phys2/complexomics-mitcom/raw/main/mitcom/notebooks/MitCOM.ipynb
80113```
81114
82115
0 commit comments