Commit a96731d

and

committed

feat(tool): add arxiv HTML affiliation fetch for institutional claim verification

Enhance arxiv_search with optional fetch_affiliations config that scrapes the ltx_authors section from arXiv HTML paper pages, providing authoritative author+institution text (e.g. "1 Shanghai AI Laboratory 2 Abaka AI") that the arXiv Atom API and feedparser do not expose. Changes: - arxiv_search.py: add ArxivConfig.fetch_affiliations field (default False), _fetch_html_affiliations() method, and per-result HTML enrichment loop in execute(); fix 429 error message (was generic "Search failed: HTTPError") - agent_article_fact_checker.py: update TOOLS_DESCRIPTION, WORKFLOW_STEPS and PER_CLAIM_VERIFICATION_PROMPT so agents treat affiliations_text as the authoritative source for institutional/attribution claims - Enable fetch_affiliations=True in all three entry points: .claude/skills/dingo-verify/scripts/fact_check.py, skills/dingo-verify/scripts/fact_check.py, examples/agent/agent_article_fact_checking_example.py Result: institutional claims previously UNVERIFIABLE (e.g. "OmniDocBench released by Tsinghua/Alibaba/Shanghai AI Lab") now correctly judged FALSE from paper affiliation data without requiring Tavily web search. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

1 parent 5729c44 commit a96731dCopy full SHA for a96731d

5 files changed

.claude/skills/dingo-verify/scripts
- fact_check.py
dingo/model/llm/agent
- agent_article_fact_checker.py
- tools
  - arxiv_search.py
examples/agent
- agent_article_fact_checking_example.py
skills/dingo-verify/scripts
- fact_check.py

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit a96731d

File tree

0 commit comments