#30 backfill: syntrophy/methanogenesis cohort batch 1 (10 communities)#94
Open
realmarcin wants to merge 1 commit into
Open
#30 backfill: syntrophy/methanogenesis cohort batch 1 (10 communities)#94realmarcin wants to merge 1 commit into
realmarcin wants to merge 1 commit into
Conversation
Starts the syntrophy/methanogenesis arm of the #30 related_ingredients backfill. This cohort backfilled richly with specific named compounds (no broad-class fallbacks). Every CHEBI id verified live against the ChEBI sqlite db via OAK; snippets copied verbatim from cached PMID/DOI abstracts. No cross-repo IDs. related_ingredients adoption: 46/265 -> 56/265. | Community | Ingredients (CHEBI-verified) | |---|---| | Syntrophobacter_Methanobacterium_Syntrophy | propionate, formate, dihydrogen | | Syntrophobacter_Methanospirillum_Syntrophy | propionate, dihydrogen, formate | | Syntrophomonas_Methanospirillum_Syntrophy | butyrate, acetate, dihydrogen | | Syntrophomonas_Methanococcus_Butyrate_Coculture | butyrate, dihydrogen, methane | | Pelotomaculum_Methanothermobacter_Syntrophy | propionate | | Pelotomaculum_Methanocella_Propionate_RNASeq | propionate, formate, dihydrogen, methane | | Desulfovibrio_Methanococcus_Syntrophy | lactate, dihydrogen | | Desulfovibrio_Methanosarcina_Lactate_Syntrophy | lactate, acetate, dihydrogen, methane | | Geobacter_Methanosaeta_DIET | ethanol, carbon dioxide, methane | | Geobacter_Methanosarcina_DIET | ethanol, acetate, methane, carbon dioxide | Compounds were dropped (not invented) where the cited+cached abstract had no verbatim mention (e.g. acetate/methane/CO2 in some descriptions). Pre-existing CHEBI bugs flagged (out of scope), in growth_media this time: Syntrophomonas_Methanospirillum (CHEBI:88613 sodium butyrate, CHEBI:87157 sodium sulfide, CHEBI:32447 L-cysteine HCl, CHEBI:48953 resazurin) and both Geobacter DIET files (CHEBI:35235 L-cysteine, CHEBI:85357 sodium sulfide) all resolve to unrelated compounds. Added to the cleanup list. Test plan: just test (136 passed, 9 skipped), all 10 files validate clean. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
There was a problem hiding this comment.
Pull request overview
Backfills the new related_ingredients field on ten syntrophy/methanogenesis community YAML records, completing the first batch of the #30 backfill effort. Each entry references a CHEBI-verified ingredient and cites a verbatim snippet from a cached PMID/DOI abstract.
Changes:
- Add
related_ingredientsblocks (1–4 entries each) to ten community files, naming compounds such as propionate, butyrate, lactate, ethanol, formate, dihydrogen, acetate, carbon dioxide, and methane. - Each ingredient entry includes
chebi_term(id/label),relevance, and at least oneevidenceitem withreference,supports,evidence_source,snippet, andexplanation. - Raises
related_ingredientsadoption from 46/265 to 56/265 communities.
Reviewed changes
Copilot reviewed 10 out of 10 changed files in this pull request and generated 1 comment.
Show a summary per file
| File | Description |
|---|---|
| kb/communities/Syntrophobacter_Methanobacterium_Syntrophy.yaml | Adds propionate, formate, dihydrogen entries |
| kb/communities/Syntrophobacter_Methanospirillum_Syntrophy.yaml | Adds propionate, dihydrogen, formate entries |
| kb/communities/Syntrophomonas_Methanospirillum_Syntrophy.yaml | Adds butyrate, acetate, dihydrogen entries |
| kb/communities/Syntrophomonas_Methanococcus_Butyrate_Growth_Coordination_Coculture.yaml | Adds butyrate, dihydrogen, methane entries |
| kb/communities/Pelotomaculum_Methanothermobacter_Syntrophy.yaml | Adds propionate entry |
| kb/communities/Pelotomaculum_Methanocella_Propionate_RNASeq_Coculture.yaml | Adds propionate, formate, dihydrogen, methane entries |
| kb/communities/Desulfovibrio_Methanococcus_Syntrophy.yaml | Adds lactate, dihydrogen entries |
| kb/communities/Desulfovibrio_Methanosarcina_Lactate_Syntrophy.yaml | Adds lactate, acetate, dihydrogen, methane entries |
| kb/communities/Geobacter_Methanosaeta_DIET.yaml | Adds ethanol, carbon dioxide, methane entries |
| kb/communities/Geobacter_Methanosarcina_DIET.yaml | Adds ethanol, acetate, methane, carbon dioxide entries |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
Comment on lines
+373
to
+385
| - preferred_term: dihydrogen | ||
| chebi_term: | ||
| id: CHEBI:18276 | ||
| label: dihydrogen | ||
| relevance: > | ||
| Hydrogen (H2/CO2) is an interspecies electron-transfer intermediate | ||
| feeding the hydrogenotrophic methanogenic partner. | ||
| evidence: | ||
| - reference: PMID:30038609 | ||
| supports: SUPPORT | ||
| evidence_source: IN_VITRO | ||
| snippet: but not H2/CO2 | ||
| explanation: Anchors H2/CO2 as an interspecies transfer intermediate tested during syntrophy. |
This was referenced Jun 1, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Starts the syntrophy/methanogenesis arm of the #30
related_ingredientsbackfill. This cohort backfilled richly — all entries are specific named compounds (no broad-class fallbacks).Every CHEBI id verified live against the ChEBI sqlite db via OAK; snippets copied verbatim from cached PMID/DOI abstracts.
related_ingredientsadoption: 46/265 → 56/265.Communities (10)
Compounds were dropped (not invented) where the cited+cached abstract had no verbatim mention.
growth_mediathis time)Syntrophomonas_Methanospirillum(CHEBI:88613sodium butyrate → a PI lipid;CHEBI:87157sodium sulfide → a morpholine;CHEBI:32447L-cysteine HCl → L-cysteinyl group;CHEBI:48953resazurin → cyclohexenones) and both Geobacter DIET files (CHEBI:35235L-cysteine → zwitterion;CHEBI:85357sodium sulfide → unrelated). Added to the running cleanup list.Test plan
just test→ 136 passed, 9 skippedlinkml-validate)🤖 Generated with Claude Code