not needed just yet
Can the web folder go in a separate pull request? The high-level description mentions that no one can review it right now.
True! I'll have to make some light changes, but I'll make that separate 👍
ntalluri
left a comment
I did a light review of the PR; I did not look too hard at the code itself yet. I was mostly gathering ideas on what was happening from the READMEs.
Can we change the name of this? DMMM isn't a term that applies to all the algorithms in this config. Maybe something like prize-active?
Not a big fan of either name, though I'm much more worried about the config being universal to the provided datasets rather than to the provided algorithms.
Based on how we are setting up the configs for the paper, we will need to make each config specific to a dataset collection rather than to an algorithm type anyway. That would be a better way to organize these configs, and it also gives people access to the configs we used.
g: 0

datasets:
- label: dmmmegfr_string
The EGFR dataset is also not specific to algorithms that fit into the DMMM category.
Suggested change:
`- label: dmmmegfr_string` → `- label: egfr_string`
  edge_files: ["processed/interactome.tsv"]
  node_files: ["processed/prizes.txt"]
  other_files: []
- label: dmmmegfr_irefindex
Suggested change:
`- label: dmmmegfr_irefindex` → `- label: egfr_irefindex`
Co-authored-by: Neha Talluri <78840540+ntalluri@users.noreply.github.com>
| ), | ||
| ``` | ||

When a file is requested, `cached`, `pinned`, and `unpinned` are all downloaded, and we characterize them as follows:
I still don't understand what these terms actually mean, or what they are associated with before they lead to a failure or an update; the descriptions that follow don't help, since each one relies on several of the other terms.

## Snakemake

We also provide a `Snakefile`, which can be imported in dataset Snakefiles through:
Suggested change:
"We also provide a `Snakefile`, which can be imported in dataset Snakefiles through:" → "We also provide a `Snakefile`, which contains dataset-fetching functions that are imported in dataset-specific Snakefiles through:"
We bundle EGFR along with the rest of the caching infrastructure. Notes:
- `cache/README.md`.
- `pra.yaml` for now, as the only PRAs are the synthetic data and the ResponseNet data, and soon the DepMap data.
- `CONTRIBUTING.md` file is not finalized, and is simply there to not break Changes to CONTRIBUTING guide #57. I may split all contributing material into Changes to CONTRIBUTING guide #57 later.
- `directory.py` contains unnecessary files from other datasets that were deemed universal.
- `web` folder even though I'm aware no one is currently in a position to review it.