You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The agent skill (skills/structured-context/) needs automated evals to verify it remains accurate as the tool and schemas evolve. Without evals, regressions in skill quality are invisible.
Use cases to cover
Validate a space — agent runs validate on a space and correctly interprets errors
Schema design and authoring — agent writes or modifies a schema file using correct field names and structure
Troubleshoot a validation error — agent diagnoses a common error (e.g. missing field, broken wikilink) from validate output
Context
The agent skill (
skills/structured-context/) needs automated evals to verify it remains accurate as the tool and schemas evolve. Without evals, regressions in skill quality are invisible.Use cases to cover
validateon a space and correctly interprets errorsschemas showand applies them to review content (depends on Add qualitative content assessment to the agent skill #37)Notes
bun run testinfrastructure or a separateevals/directoryCloses once eval suite is running in CI or as a documented manual process.