@SeanClay10 (Collaborator):
Summary:
Implements a minimal LLM-based pipeline for extracting key metrics from preprocessed predator diet survey texts using an Ollama client.

Changes:

  • Added LLM script with structured extraction using Pydantic schemas (a rough sketch follows this list)
  • Extracts: species name, study location, study date, empty/non-empty stomach counts, and sample size
  • Includes post-processing validation to correct LLM arithmetic errors and compute the fraction of feeding predators
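For context, here is a minimal sketch of how the schema, structured extraction, and post-processing fit together, assuming a recent Ollama Python client with structured-output (`format=`) support. The field names, the `DietSurveyMetrics` class, the `llama3.1` model tag, and the helper functions are illustrative assumptions, not the actual identifiers in `src/llm/local_llm.py`:

```python
from ollama import chat
from pydantic import BaseModel


class DietSurveyMetrics(BaseModel):
    # Fields mirror the metrics listed above (names assumed)
    species_name: str
    study_location: str
    study_date: str
    empty_stomachs: int
    non_empty_stomachs: int
    sample_size: int


def extract_metrics(paper_text: str) -> DietSurveyMetrics:
    # Passing the Pydantic JSON schema via `format` constrains
    # the model's reply to valid, parseable JSON
    response = chat(
        model="llama3.1",  # assumed model tag
        messages=[{
            "role": "user",
            "content": f"Extract the diet survey metrics from this paper:\n\n{paper_text}",
        }],
        format=DietSurveyMetrics.model_json_schema(),
    )
    return DietSurveyMetrics.model_validate_json(response.message.content)


def validate_metrics(m: DietSurveyMetrics) -> DietSurveyMetrics:
    # Post-processing guard: the stomach counts should sum to the
    # sample size; trust the counts and recompute the total if the
    # LLM's arithmetic is off
    if m.empty_stomachs + m.non_empty_stomachs != m.sample_size:
        m.sample_size = m.empty_stomachs + m.non_empty_stomachs
    return m


def fraction_feeding(m: DietSurveyMetrics) -> float:
    # Fraction of predators with non-empty stomachs
    return m.non_empty_stomachs / m.sample_size if m.sample_size else 0.0
```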

Usage:
python src/llm/local_llm.py data/processed-text/paper.txt

Next Steps

This provides the foundational structure for LLM-based extraction. Response quality is currently limited by preprocessing difficulties and by the sheer number of tokens sent to the model from long papers, so the prompts and the way the preprocessed text is passed to the LLM both need further work.
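One option for the token problem is to split the preprocessed text into overlapping chunks and query the model per chunk, merging the results afterwards. A rough sketch of the splitting step (chunk size and overlap are arbitrary placeholders, and the merge step is left out):

```python
def chunk_text(text: str, max_chars: int = 8000, overlap: int = 500) -> list[str]:
    """Split a long paper into overlapping character windows."""
    chunks = []
    start = 0
    while start < len(text):
        end = min(start + max_chars, len(text))
        chunks.append(text[start:end])
        if end == len(text):
            break
        # Step back by `overlap` so metrics spanning a boundary
        # appear whole in at least one chunk
        start = end - overlap
    return chunks
```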

@raymondcen (Collaborator) left a comment:
Everything looks good. The only thing I can really pick out is that the prompts could be better, but we can always change that down the road if we find a better model. We could also use AI to generate a better prompt. Besides that, it looks great.

@SeanClay10 merged commit 9ab3b1e into main on Jan 25, 2026
2 checks passed