Skip to content

Backfill related_ingredients in 8 more communities (#30)#145

Merged
realmarcin merged 1 commit into
mainfrom
backfill/related-ingredients-batch8f
Jun 14, 2026
Merged

Backfill related_ingredients in 8 more communities (#30)#145
realmarcin merged 1 commit into
mainfrom
backfill/related-ingredients-batch8f

Conversation

@realmarcin

Copy link
Copy Markdown
Contributor

Continues the #30 related_ingredients backfill — 34 CHEBI-grounded ingredients across 8 communities, strict no-fabrication protocol.

File #
Clostridium_Carboxidivorans_Kluyveri_CO_Chain_Elongation_Coculture 7
Methylocystis_Rhodococcus_Methane_VFA_PHBV_Coculture 7
Lake_Washington_Methane_Oxygen_Methylotroph_Community 8
Thiocyanate_Afipia_Thiobacillus_Bioreactor_Community 4
Acetylene_Fueled_TCE_Dechlorination_Groundwater_Enrichment 3
Variovorax_Cryptococcus_Vitamin_Mutualism_Microcosm 2
Trichoderma_Streptomyces_Filamentous_Cellulose_Coculture 1
Naica_Deep_Subsurface_Thermophilic 2

Protocol (enforced + independently verified)

  • CHEBI ids OAK-verified, canonical labels exact → 34/34 labels canonical (PHBV kept as preferred_term only, no clean CHEBI term).
  • Every snippet is a verbatim contiguous substring of a cited+cached reference (full-text .md preferred) → 35/35 snippets exact.
  • Compounds not named in the cited+cached text were omitted (no fabrication).

Validation

  • linkml-validate all 8 → exit 0
  • 34/34 labels canonical; 35/35 snippets exact
  • additions-only (463 insertions)

Adoption: 173 → 181 / 265.

🤖 Generated with Claude Code

Adds 34 CHEBI-grounded related_ingredients across 8 communities, strict
no-fabrication protocol (OAK-verified canonical CHEBI labels; every snippet a
verbatim substring of a reference already cited + cached in the file).

- Clostridium_Carboxidivorans_Kluyveri_CO_Chain_Elongation_Coculture: 7
- Methylocystis_Rhodococcus_Methane_VFA_PHBV_Coculture: 7 (methane/O2, VFAs,
  PHBV [no clean CHEBI, preferred_term only])
- Lake_Washington_Methane_Oxygen_Methylotroph_Community: 8 (methane, methanol,
  formaldehyde, methylamine, pyruvate, nitrate)
- Thiocyanate_Afipia_Thiobacillus_Bioreactor_Community: 4 (thiocyanate, cyanate,
  ammonia, CO2)
- Acetylene_Fueled_TCE_Dechlorination_Groundwater_Enrichment: 3
- Variovorax_Cryptococcus_Vitamin_Mutualism_Microcosm: 2 (thiamine, pantothenate)
- Trichoderma_Streptomyces_Filamentous_Cellulose_Coculture: 1 (cellulose)
- Naica_Deep_Subsurface_Thermophilic: 2 (calcium sulfate, gypsum)

Verified: 34/34 labels canonical, 35/35 snippets exact substrings, all 8 pass
linkml-validate, additions-only.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
@realmarcin realmarcin merged commit 3b1574b into main Jun 14, 2026
3 checks passed
@realmarcin realmarcin deleted the backfill/related-ingredients-batch8f branch June 14, 2026 07:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant