Summary
Carry source, decision, and output provenance through the main workflow so downstream agents can audit and cite it.
This issue was generated from an org-wide EvalOps mining pass on 2026-05-10 07:57 UTC. It combines live GitHub repo signals with a per-repo arXiv search. Treat the research links as grounding for a concrete implementation, not as a request for a literature review.
Repo Evidence
- Repository description: Agent PM: OpenAI Agents-powered product management orchestrator with automated PRDs, tickets, and comms.
- Tree signals: 2 docs files, 1 workflows, 0 proto files, 25 test-like files.
README.md:18 includes latent-spec language: - uv package manager - OpenAI, Slack, Jira, GitHub, and calendar credentials as needed
pyproject.toml:88 includes latent-spec language: disable_error_code = ["import-untyped", "import-not-found"] exclude = ["^evals/"] ignore_errors = true
evals/pm_prd_eval.py:1 includes latent-spec language: """Inspect AI evaluation suite for PRD generation and revisions."""
evals/pm_prd_eval.py:20 includes latent-spec language: "## Goals / Non-Goals", "## Acceptance Criteria", ]
evals/pm_prd_eval.py:40 includes latent-spec language: text = output.get("prd_markdown", "") section = text.split("## Acceptance Criteria")[-1] return float("- " in section)
evals/pm_prd_eval.py:71 includes latent-spec language: input={ "title": "Add in-app eval dashboards", "context": "EvalOps users want PRD->eval->release visibility",
Research Grounding
Repo axes: memory, governance, evaluation, tooling
Search keywords: plugins, plugin, secrets, config, prd, get, optional, post, run, plan, registry, slack
- arXiv:2504.08893v1 Knowledge Graph-extended Retrieval Augmented Generation for Question Answering (Jasper Linders, Jakub M. Tomczak), 2025.
- arXiv:2405.15436v1 Hybrid Context Retrieval Augmented Generation Pipeline: LLM-Augmented Knowledge Graphs and Vector Database for Accreditation Reporting Assistance (Candace Edwards), 2024.
- arXiv:2504.05163v2 Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness (Dongzhuoran Zhou, Yuqicheng Zhu, Xiaxia Wang, Yuan He, Jiaoyan Chen, Steffen Staab), 2025.
- arXiv:2508.09460v1 Towards Self-cognitive Exploration: Metacognitive Knowledge Graph Retrieval Augmented Generation (Xujie Yuan, Shimin Di, Jielong Tang, Libin Zheng, Jian Yin), 2025.
- arXiv:2512.20626v2 MegaRAG: Multimodal Knowledge Graph-Based Retrieval Augmented Generation (Chi-Hsiang Hsiao, Yi-Cheng Wang, Tzung-Sheng Lin, Yi-Ren Yeh, Chu-Song Chen), 2025.
- arXiv:2502.01113v3 GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation (Linhao Luo, Zicheng Zhao, Gholamreza Haffari, Dinh Phung, Chen Gong, Shirui Pan), 2025.
- arXiv:2502.06864v1 Knowledge Graph-Guided Retrieval Augmented Generation (Xiangrong Zhu, Yuexiang Xie, Yi Liu, Yaliang Li, Wei Hu), 2025.
- arXiv:2506.21556v3 VAT-KG: Knowledge-Intensive Multimodal Knowledge Graph Dataset for Retrieval-Augmented Generation (Hyeongcheol Park, Jiyoung Seo, MinHyuk Jang, Hogun Park, Ha Dam Baek, Gyusam Chang), 2025.
- arXiv:2507.16826v1 A Query-Aware Multi-Path Knowledge Graph Fusion Approach for Enhancing Retrieval-Augmented Generation in Large Language Models (Qikai Wei, Huansheng Ning, Chunlong Han, Jianguo Ding), 2025.
- arXiv:2511.11017v1 AI Agent-Driven Framework for Automated Product Knowledge Graph Construction in E-Commerce (Dimitar Peshevski, Riste Stojanov, Dimitar Trajanov), 2025.
What To Build
- Add stable identifiers for source records, derived decisions, and emitted outputs.
- Thread those identifiers through logs/events/API responses without leaking secrets.
- Provide a query or debug surface that reconstructs the chain for one completed workflow.
Acceptance Criteria
Notes
- Generated issue 2/5 for
evalops/agent-pm by evalops_org_miner.py.
- Before implementation, confirm the sampled latent-spec snippets still match
main; this issue intentionally cites exact file paths/lines where the mining pass saw them.
Summary
Carry source, decision, and output provenance through the main workflow so downstream agents can audit and cite it.
This issue was generated from an org-wide EvalOps mining pass on 2026-05-10 07:57 UTC. It combines live GitHub repo signals with a per-repo arXiv search. Treat the research links as grounding for a concrete implementation, not as a request for a literature review.
Repo Evidence
README.md:18includes latent-spec language: - uv package manager - OpenAI, Slack, Jira, GitHub, and calendar credentials as neededpyproject.toml:88includes latent-spec language: disable_error_code = ["import-untyped", "import-not-found"] exclude = ["^evals/"] ignore_errors = trueevals/pm_prd_eval.py:1includes latent-spec language: """Inspect AI evaluation suite for PRD generation and revisions."""evals/pm_prd_eval.py:20includes latent-spec language: "## Goals / Non-Goals", "## Acceptance Criteria", ]evals/pm_prd_eval.py:40includes latent-spec language: text = output.get("prd_markdown", "") section = text.split("## Acceptance Criteria")[-1] return float("- " in section)evals/pm_prd_eval.py:71includes latent-spec language: input={ "title": "Add in-app eval dashboards", "context": "EvalOps users want PRD->eval->release visibility",Research Grounding
Repo axes: memory, governance, evaluation, tooling
Search keywords: plugins, plugin, secrets, config, prd, get, optional, post, run, plan, registry, slack
What To Build
Acceptance Criteria
Notes
evalops/agent-pmbyevalops_org_miner.py.main; this issue intentionally cites exact file paths/lines where the mining pass saw them.