Skip to content

#30 backfill: syntrophy/methanogenesis cohort batch 1 (10 communities)#94

Open
realmarcin wants to merge 1 commit into
mainfrom
backfill-syntrophy-batch1
Open

#30 backfill: syntrophy/methanogenesis cohort batch 1 (10 communities)#94
realmarcin wants to merge 1 commit into
mainfrom
backfill-syntrophy-batch1

Conversation

@realmarcin
Copy link
Copy Markdown
Contributor

Starts the syntrophy/methanogenesis arm of the #30 related_ingredients backfill. This cohort backfilled richly — all entries are specific named compounds (no broad-class fallbacks).

Every CHEBI id verified live against the ChEBI sqlite db via OAK; snippets copied verbatim from cached PMID/DOI abstracts.

related_ingredients adoption: 46/265 → 56/265.

Communities (10)

Community Ingredients (CHEBI-verified)
Syntrophobacter_Methanobacterium_Syntrophy propionate, formate, dihydrogen
Syntrophobacter_Methanospirillum_Syntrophy propionate, dihydrogen, formate
Syntrophomonas_Methanospirillum_Syntrophy butyrate, acetate, dihydrogen
Syntrophomonas_Methanococcus_Butyrate_Coculture butyrate, dihydrogen, methane
Pelotomaculum_Methanothermobacter_Syntrophy propionate
Pelotomaculum_Methanocella_Propionate_RNASeq propionate, formate, dihydrogen, methane
Desulfovibrio_Methanococcus_Syntrophy lactate, dihydrogen
Desulfovibrio_Methanosarcina_Lactate_Syntrophy lactate, acetate, dihydrogen, methane
Geobacter_Methanosaeta_DIET ethanol, carbon dioxide, methane
Geobacter_Methanosarcina_DIET ethanol, acetate, methane, carbon dioxide

Compounds were dropped (not invented) where the cited+cached abstract had no verbatim mention.

⚠️ Pre-existing CHEBI bugs flagged (out of scope, growth_media this time)

Syntrophomonas_Methanospirillum (CHEBI:88613 sodium butyrate → a PI lipid; CHEBI:87157 sodium sulfide → a morpholine; CHEBI:32447 L-cysteine HCl → L-cysteinyl group; CHEBI:48953 resazurin → cyclohexenones) and both Geobacter DIET files (CHEBI:35235 L-cysteine → zwitterion; CHEBI:85357 sodium sulfide → unrelated). Added to the running cleanup list.

Test plan

  • just test → 136 passed, 9 skipped
  • All 10 files validate clean (linkml-validate)

🤖 Generated with Claude Code

Starts the syntrophy/methanogenesis arm of the #30 related_ingredients
backfill. This cohort backfilled richly with specific named compounds
(no broad-class fallbacks). Every CHEBI id verified live against the
ChEBI sqlite db via OAK; snippets copied verbatim from cached PMID/DOI
abstracts. No cross-repo IDs.

related_ingredients adoption: 46/265 -> 56/265.

| Community | Ingredients (CHEBI-verified) |
|---|---|
| Syntrophobacter_Methanobacterium_Syntrophy | propionate, formate, dihydrogen |
| Syntrophobacter_Methanospirillum_Syntrophy | propionate, dihydrogen, formate |
| Syntrophomonas_Methanospirillum_Syntrophy | butyrate, acetate, dihydrogen |
| Syntrophomonas_Methanococcus_Butyrate_Coculture | butyrate, dihydrogen, methane |
| Pelotomaculum_Methanothermobacter_Syntrophy | propionate |
| Pelotomaculum_Methanocella_Propionate_RNASeq | propionate, formate, dihydrogen, methane |
| Desulfovibrio_Methanococcus_Syntrophy | lactate, dihydrogen |
| Desulfovibrio_Methanosarcina_Lactate_Syntrophy | lactate, acetate, dihydrogen, methane |
| Geobacter_Methanosaeta_DIET | ethanol, carbon dioxide, methane |
| Geobacter_Methanosarcina_DIET | ethanol, acetate, methane, carbon dioxide |

Compounds were dropped (not invented) where the cited+cached abstract
had no verbatim mention (e.g. acetate/methane/CO2 in some descriptions).

Pre-existing CHEBI bugs flagged (out of scope), in growth_media this time:
Syntrophomonas_Methanospirillum (CHEBI:88613 sodium butyrate, CHEBI:87157
sodium sulfide, CHEBI:32447 L-cysteine HCl, CHEBI:48953 resazurin) and
both Geobacter DIET files (CHEBI:35235 L-cysteine, CHEBI:85357 sodium
sulfide) all resolve to unrelated compounds. Added to the cleanup list.

Test plan: just test (136 passed, 9 skipped), all 10 files validate clean.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings June 1, 2026 18:32
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Backfills the new related_ingredients field on ten syntrophy/methanogenesis community YAML records, completing the first batch of the #30 backfill effort. Each entry references a CHEBI-verified ingredient and cites a verbatim snippet from a cached PMID/DOI abstract.

Changes:

  • Add related_ingredients blocks (1–4 entries each) to ten community files, naming compounds such as propionate, butyrate, lactate, ethanol, formate, dihydrogen, acetate, carbon dioxide, and methane.
  • Each ingredient entry includes chebi_term (id/label), relevance, and at least one evidence item with reference, supports, evidence_source, snippet, and explanation.
  • Raises related_ingredients adoption from 46/265 to 56/265 communities.

Reviewed changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
kb/communities/Syntrophobacter_Methanobacterium_Syntrophy.yaml Adds propionate, formate, dihydrogen entries
kb/communities/Syntrophobacter_Methanospirillum_Syntrophy.yaml Adds propionate, dihydrogen, formate entries
kb/communities/Syntrophomonas_Methanospirillum_Syntrophy.yaml Adds butyrate, acetate, dihydrogen entries
kb/communities/Syntrophomonas_Methanococcus_Butyrate_Growth_Coordination_Coculture.yaml Adds butyrate, dihydrogen, methane entries
kb/communities/Pelotomaculum_Methanothermobacter_Syntrophy.yaml Adds propionate entry
kb/communities/Pelotomaculum_Methanocella_Propionate_RNASeq_Coculture.yaml Adds propionate, formate, dihydrogen, methane entries
kb/communities/Desulfovibrio_Methanococcus_Syntrophy.yaml Adds lactate, dihydrogen entries
kb/communities/Desulfovibrio_Methanosarcina_Lactate_Syntrophy.yaml Adds lactate, acetate, dihydrogen, methane entries
kb/communities/Geobacter_Methanosaeta_DIET.yaml Adds ethanol, carbon dioxide, methane entries
kb/communities/Geobacter_Methanosarcina_DIET.yaml Adds ethanol, acetate, methane, carbon dioxide entries

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment on lines +373 to +385
- preferred_term: dihydrogen
chebi_term:
id: CHEBI:18276
label: dihydrogen
relevance: >
Hydrogen (H2/CO2) is an interspecies electron-transfer intermediate
feeding the hydrogenotrophic methanogenic partner.
evidence:
- reference: PMID:30038609
supports: SUPPORT
evidence_source: IN_VITRO
snippet: but not H2/CO2
explanation: Anchors H2/CO2 as an interspecies transfer intermediate tested during syntrophy.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants