fix(benchmarks): guard against empty choices and message=None in LLM eval calls by qizwiz · Pull Request #219 · EverMind-AI/EverOS

qizwiz · 2026-05-18T07:44:56Z

What

Add guards at three LLM evaluation call sites in the EvoAgentBench domain evaluators before accessing choices[0].message.content.

Why

client.chat.completions.create() can return two empty-response shapes:

choices = [] — on content-policy rejections, rate-limit errors, or provider failures
choices[0].message = None — e.g. Gemini 2.5 Flash (via OpenAI-compatible endpoint) returns HTTP 200 with finish_reason: PROHIBITED_CONTENT and message=None

Both crash with IndexError or AttributeError. The existing try/except blocks catch these as generic "LLM evaluation failed: list index out of range" errors, making benchmark runs hard to diagnose.

Files changed

File	Fix
`benchmarks/EvoAgentBench/src/domains/information_retrieval/judge.py`	Guard before `resp.choices[0].message.content or ""`
`benchmarks/EvoAgentBench/src/domains/knowledge_work/evaluate.py`	Guard before `resp.choices[0].message.content`
`benchmarks/EvoAgentBench/src/domains/reasoning/evaluate.py`	Guard before `response.choices[0].message.content`

# Before
eval_text = resp.choices[0].message.content

# After
if not resp.choices or resp.choices[0].message is None:
    raise ValueError("LLM returned empty or filtered response")
eval_text = resp.choices[0].message.content

Corpus context

Detected by pact (llm_response_unguarded mode), a Z3-verified static analyzer for LLM crash vectors. This pattern was found across 13.8k violations in 800+ repos.

Feat/profilev2 See merge request npc-work/aic/ai/evermemos-opensource!61

Feat/update demo See merge request npc-work/aic/ai/evermemos-opensource!62

Use self-deployed embedding and rerank APIs by default See merge request npc-work/aic/ai/evermemos-opensource!64

…ader info

vLLM Rerank API adopts an instruction-tuned approach See merge request npc-work/aic/ai/evermemos-opensource!65

Update EverMemOS: optimize search perf, improve skill search

chore: add devcontainer configuration and development tooling

Fix/unify port in demo and docs to the default 1995

Update the GitHub asset URLs for all banner images to ensure they point to the correct and current locations. This includes fixing a typo in a section title from "LAI Wearable" to "AI Wearable".

Update the asset IDs for the banner GIFs in the README to point to the correct new assets.

…ind-AI#186) * feat: add game of throne demo and claude code plugin use cases Add two new use cases to the repository: 1. Game of Thrones Story Memory Demo - A full-stack web application demonstrating EverMem's memory capabilities through a side-by-side comparison interface for book Q&A 2. Claude Code Plugin - A memory plugin for Claude Code that automatically stores and retrieves context from past coding sessions The demo includes React frontend, Express backend, Docker configurations, and novel loading scripts. The plugin provides hooks for automatic memory injection and search capabilities. * docs: update readme links to use relative paths for use cases

Update the banner image URL in the README file to point to the new asset location.

…erMind-AI#192) server.ts used 8001 but the EverMemOS server default is 1995. This broke local demo runs unless EVERMEMOS_URL was manually set. Fixes EverMind-AI#28 Co-authored-by: pazyork <pazyorkcc@gmail.com>

Skip tool call/response msg in profile generation

* feat(use-cases): add OpenHer persona engine with EverMemOS integration OpenHer is an AI Being engine that creates personas with emergent personality, emotional thermodynamics, and long-term memory. EverMemOS integration: - 4D relationship vector (depth, valence, trust, foresight) expands neural network perception from 8D to 12D - Async two-stage memory retrieval (fire on Turn N, collect on Turn N+1) with 500ms timeout + graceful fallback - Semi-emergent relationship EMA blending EverMemOS priors with LLM-judged deltas per turn - Fire-and-forget turn storage via asyncio.create_task Includes: - README with architecture diagrams and integration walkthrough - Runnable demo with simulation mode (no EverMemOS needed) - Core integration code: mixin, types, context features - .env.example with placeholder values Repo: https://github.com/kellyvv/OpenHer * docs: rewrite README — storytelling style, emotion-first * docs: English README, storytelling style, no emoji * rename: openher-persona-engine → openher --------- Co-authored-by: kellyzxiaowei <129767595+kellyzxiaowei@users.noreply.github.com>

* chore: rename project from evermemos to EverCore This commit renames the project directory and updates all internal references from "evermemos" to "EverCore". The changes include: - Renaming the main directory from `methods/evermemos` to `methods/EverCore` - Updating all import paths and module references - Maintaining the same code structure and functionality - Adding new configuration files (.vscode/settings.json, .pylintrc, pyrightconfig.json) - Updating Dockerfile and project metadata * docs: update references from evermemos to EverCore Update documentation files to reflect the renaming of the 'evermemos' directory to 'EverCore'. This includes fixing clone commands, directory paths, and documentation links across multiple files to ensure consistency and correct navigation for users. * chore: rename EverMemOS to EverCore across codebase This is a project-wide rebranding from EverMemOS to EverCore. The changes include: - Update project name in source files, documentation, and configuration - Rename API references, environment variables, and service names - Modify demo descriptions and benchmark configurations - Update URLs and citations to reflect new project identity All functionality remains identical; only naming has changed to align with the new project branding. * docs: update README with EverCore focus and restructured TOC - Add line break before Table of Contents for better visual separation - Rewrite project description to highlight EverCore as the central component - Reorder directory tree to prioritize benchmarks and methods over use-cases - Update use-cases list with more examples and clarify they are templates - Improve flow from Quick Start to use-cases to benchmarks * docs: update README with clearer methods description and benchmarks Add benchmark numbers directly in the method summaries for better visibility. Clarify introductory text to emphasize choice and composition of methods. * docs: fix markdown formatting in README table of contents Adjust whitespace and line breaks to ensure proper rendering of the collapsible table of contents section.

…d-AI#204) - Replace specific EverMemBench-Dynamic badge with general EverMind-AI HuggingFace badge - Remove redundant License badge - Change "Methods" section heading to "Architecture Methods" - Update sub-section headings from h4 (####) to h3 (###) for better hierarchy

…rMind-AI#208) * docs: restructure README and add AGENTS.md for better navigation - Reorder sections to emphasize architecture methods and use cases - Move use cases section before quick start for better flow - Rename "Methods" to "Architecture Methods" for clarity - Add AGENTS.md with quick commands and key entry points - Update section headers to improve document hierarchy - Maintain all existing content while improving organization * docs: add community and contribution files * docs: reorder README directory tree for logical grouping * docs: move community files to .github/ and update references * ci: change deploy workflow trigger from feature branch to main

* docs: restructure README and add subdirectory guides Move the directory tree from the main README to new dedicated README files for each top-level folder (use-cases, methods, benchmarks). Add detailed introductions and tables to guide users to the appropriate subprojects. This improves navigation and provides clear entry points for different use cases. * docs: expand showcase section with new projects and links Add six new project entries to the README showcase, each with a banner image, description, and code/plugin link. Also update an existing benchmark entry to include a dataset link. This enhances the repository's demonstration of real-world applications and available resources.

* docs(readme): update project links and formatting * docs(use-cases): enhance README with visual catalogue of demos Expand the use cases section from a simple table to a detailed visual catalogue with project banners, descriptions, and links. This improves user engagement and provides a better showcase of community integrations and demos. * docs: update READMEs and add validation for use-case links

…#215)

…eval calls client.chat.completions.create() can return choices=[] on content-policy rejections or provider errors, and choices[0].message=None on filtered responses (e.g. Gemini PROHIBITED_CONTENT via OpenAI-compatible endpoint). Both crash with IndexError/AttributeError. The existing try/except blocks catch these as generic 'LLM evaluation failed' errors, making them hard to diagnose. Explicit guards surface the root cause clearly.

hui zhang and others added 30 commits December 25, 2025 13:35

Merge branch 'feat/profilev2' into 'dev'

f78b3dd

Feat/profilev2 See merge request npc-work/aic/ai/evermemos-opensource!61

feat:add readable profile to fetch function

76d0156

feat:add readable profile to fetch function, update demo playground

360dc0f

fix:delete 'v2' in commemt

cf9b303

fix:delete readable context in profile

25979f5

Merge branch 'feat/update_demo' into 'dev'

4ee3905

Feat/update demo See merge request npc-work/aic/ai/evermemos-opensource!62

⚡ redis pipeline

98b8243

⚡ monkey patch kafka producer

faa8d36

⚡ monkey patch kafka producer

78032a2

feat: Prioritize self-deployed embedding services

f64eae7

feat: Reranking supports self-deployed rerank services

f552f0f

feat: truncate the embedding length of the vLLM Embedding API to 1024

9d206b4

fix: Fix the bug where the latest embedding configuration cannot be read

e2a7cfd

refactor: Refactor the configurations for embedding and reranking

a5915cb

fix: Remove references to the OpenTelemetry package

5f7a1a0

fix: change uv lock

227cc4e

fix: change uv lock

ff78815

Merge branch 'feature/selfdeploy-embedding' into 'dev'

529fd80

Use self-deployed embedding and rerank APIs by default See merge request npc-work/aic/ai/evermemos-opensource!64

🐛 conversation data bug fix & request log refactor & remove tenant he…

aae4364

…ader info

🐛 fix rerank interface

80dbd5d

test: Add rerank integration tests

791ec26

🐛 fix es connection register

64f9081

🐛 fix es connection register

44fbcfb

fix: vLLM Rerank API adopts an instruction-tuned approach

81b8b61

fix: vLLM Rerank API adopts an instruction-tuned approach

c12612c

⚡ redis connection pool size

64e5d3e

Merge branch 'feature/selfdeploy-embedding' into 'dev'

1de53cf

vLLM Rerank API adopts an instruction-tuned approach See merge request npc-work/aic/ai/evermemos-opensource!65

⚰️ remove longjob manager

ec5e604

feat: metrics client and rerank/vectorize/retrieve metrics

b3958cb

✨ fetch add group_ig

d792644

shallyan and others added 27 commits April 16, 2026 14:11

Update tests

d8af00a

Merge pull request EverMind-AI#178 from EverMind-AI/feat/update_everos

82a40cf

Update EverMemOS: optimize search perf, improve skill search

Merge pull request EverMind-AI#176 from EverMind-AI/feat/0415-2

26cbc6a

chore: add devcontainer configuration and development tooling

Unify port to default 1995

6333331

Update tests

42c3a47

Merge pull request EverMind-AI#181 from EverMind-AI/fix/unify_port

29be775

Fix/unify port in demo and docs to the default 1995

docs: update asset URLs in README for consistency (EverMind-AI#183)

ea06106

Update the GitHub asset URLs for all banner images to ensure they point to the correct and current locations. This includes fixing a typo in a section title from "LAI Wearable" to "AI Wearable".

docs: update README banner image URLs (EverMind-AI#184)

1c8ca2b

Update the asset IDs for the banner GIFs in the README to point to the correct new assets.

Update directory path in setup instructions (EverMind-AI#180)

17d8a66

docs: add guidelines for usecases (EverMind-AI#187)

e0d9e17

Skip tool info in profile extraction

5acbc72

Remove verbose code

825b55e

docs: fix typo and update project structure in README (EverMind-AI#190)

ca8bc55

docs: update README banner image URL (EverMind-AI#194)

b37999f

Update the banner image URL in the README file to point to the new asset location.

fix: unify game-of-throne-demo EVERMEMOS_URL default port to 1995 (Ev…

f06c303

…erMind-AI#192) server.ts used 8001 but the EverMemOS server default is 1995. This broke local demo runs unless EVERMEMOS_URL was manually set. Fixes EverMind-AI#28 Co-authored-by: pazyork <pazyorkcc@gmail.com>

Merge pull request EverMind-AI#188 from EverMind-AI/fix_tool

d830a1d

Skip tool call/response msg in profile generation

docs: comment out broken GIF links in README (EverMind-AI#209)

6dda863

docs: uncomment banner images in use cases section (EverMind-AI#210)

fa8d77e

docs(readme): add two new AI assistant use case sections (EverMind-AI…

29d555c

…#215)

github-actions Bot mentioned this pull request May 18, 2026

[watch] Overnight fork patrol: 2026-05-18 Fearvox/EverOS#34

Open

cyfyifanchen closed this Jun 6, 2026

cyfyifanchen force-pushed the main branch from 773e19b to 518b8ec Compare June 6, 2026 00:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(benchmarks): guard against empty choices and message=None in LLM eval calls#219

fix(benchmarks): guard against empty choices and message=None in LLM eval calls#219
qizwiz wants to merge 652 commits into
EverMind-AI:mainfrom
qizwiz:fix/guard-empty-llm-response

qizwiz commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants

Conversation

qizwiz commented May 18, 2026

What

Why

Files changed

Corpus context

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants