Skip to content

Update get_cancer_genes() to fit consensus defenition of cancer gene#113

Open
mathieulemieux wants to merge 4 commits into
developfrom
task/KBDEV-1532-cancer-gene-traverse-children-nodes
Open

Update get_cancer_genes() to fit consensus defenition of cancer gene#113
mathieulemieux wants to merge 4 commits into
developfrom
task/KBDEV-1532-cancer-gene-traverse-children-nodes

Conversation

@mathieulemieux
Copy link
Copy Markdown
Member

@mathieulemieux mathieulemieux commented May 11, 2026

See https://www.bcgsc.ca/jira/browse/KBDEV-1532

get_cancer_genes() will now work with both 'tumourigenesis' (for backward compatibility) and 'cancer gene' as umbrella terms for all cancer-related genes (i.e. oncogenes, tumour suppressive genes and other cancer genes)

@mathieulemieux
Copy link
Copy Markdown
Member Author

@dustinbleile , do you mind we discard PR #111 ?

@elewis2
Copy link
Copy Markdown
Collaborator

elewis2 commented May 12, 2026

@mathieulemieux is it possible to include this filter in the query itself, in _get_tumourigenesis_genes_list? Might reduce the number of records retrieved.

for statement in statements:
    if statement['subject'].get('biotype', '') == 'gene':

Speculating: I'm wondering if there's a way we can avoid querying gkb three times during report prep to get this same info (this step is done in get_gene_information). I think _get_tumourigenesis could probably be updated to include the matched relevance in the result, if the input relevance is a list... but I don't think the info can be processed in a way that leaves get_cancer_genes usable as-is by iprobe since it would need to change the return type to include the separate flags. Maybe something we can address in a later pr.

@elewis2
Copy link
Copy Markdown
Collaborator

elewis2 commented May 12, 2026

Re subject biotype in query - Just tested, it wouldn't actually reduce the number of statements retrieved at all.

@codecov-commenter
Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 83.50%. Comparing base (8777c66) to head (458f88f).
⚠️ Report is 2 commits behind head on develop.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop     #113      +/-   ##
===========================================
+ Coverage    83.46%   83.50%   +0.03%     
===========================================
  Files           18       18              
  Lines         2546     2552       +6     
===========================================
+ Hits          2125     2131       +6     
  Misses         421      421              
Flag Coverage Δ
unittests 83.50% <100.00%> (+0.03%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@mathieulemieux
Copy link
Copy Markdown
Member Author

Re subject biotype in query - Just tested, it wouldn't actually reduce the number of statements retrieved at all.

yeah, all these relevance terms are used in statement with a gene as the subject.

@mathieulemieux
Copy link
Copy Markdown
Member Author

Speculating: I'm wondering if there's a way we can avoid querying gkb three times [...]

ok, let me add a new function for that, so we keep old behavior for iprobe. I'll have to refactor things a bit.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants