Update claude and codex skills. Fix corresponding sections in README etc by ScSteffen · Pull Request #104 · AI-ModCon/BaseSIM_APEIRON

ScSteffen · 2026-06-04T18:37:05Z

Summary

Consolidates the agent-skill set into four workflow-oriented skills maintained in parallel for both Claude Code and Codex, and removes the now-dead visualization config (baseline/output) that no bundled renderer consumes. Docs (CLAUDE.md, README.md) and example configs are brought in line with the current src/apeiron/ package layout.

Motivation & Context

The old skills were fine-grained, per-task scaffolds (new-detector,
new-updater, new-config, visualize, lint-check, …) that no longer matched how people actually use the framework, and several referenced a visualization entry point (src/visualize.py) and VisualizationCfg fields that the package no longer ships. This PR replaces them with task-level workflows and prunes the stale config/doc references so the repo describes what actually exists.

Approach

Skills, reworked into 4 workflows (each maintained for both .claude/skills/
and .codex/skills/):
- install-apeiron — add Apeiron as a dependency to another project.
- explore-examples — run a bundled MNIST/CIFAR example.
- custom-experiment — scaffold harness + data utils + TOML for your own
  data/model and run it.
- integrate-apeiron — bolt drift detection / CL onto an existing training loop.

The old granular Claude skills (debug-experiment, explain, lint-check,
new-config, new-detector, new-harness, new-updater, run-experiment,
visualize) are removed.

Config cleanup: VisualizationCfg keeps only input (the CSV path run
metrics are written to); the unused baseline and output fields are dropped.
Example TOMLs (mnist, cifar10_vgg11, cifar10_vit) are updated to match.

Docs: CLAUDE.md and README.md are updated to cover the src/apeiron/ package
paths, the MLflow/WandB logging backends, the EnsembleDetector
NotImplementedError caveat, and a new "Agent Skills" README section.
References to the dropped src/visualize.py entry point are removed.

Screenshots / Logs (optional)

N/A — docs/config/tooling change.

API / CLI Changes

VisualizationCfg — removed fields baseline: float and output: str; only
input: str remains (default "output/cl_only.csv").
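
For orientation, the resulting shape can be sketched roughly as below. This is an illustration only: it assumes a plain dataclass, whereas the real definition lives in src/apeiron/config/configuration.py and may use a different base class or validation. Only the field name and default are taken from this PR.

```python
from dataclasses import dataclass, fields


@dataclass
class VisualizationCfg:
    """Illustrative stand-in, not the real class from configuration.py."""

    # CSV path that run metrics are written to (default as stated in this PR).
    input: str = "output/cl_only.csv"


cfg = VisualizationCfg()
field_names = [f.name for f in fields(cfg)]
print(field_names)  # ['input']
```

With a shape like this, passing baseline=... or output=... to the constructor would raise a TypeError, which is the same class of failure the Breaking Changes section warns about for TOML files.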

Breaking Changes

TOML configs that set [visualization] baseline = … or [visualization] output = …
must drop those keys; only input is still recognized. (The bundled example
configs are already updated.)

Performance (optional)

N/A.

Security & Privacy

No secrets committed
Input validation added where needed (N/A — no new input paths)

Dependencies

None added or removed. poetry.lock content-hash/generator header updated only.

Testing Plan

Unit tests — tests/test_config.py::test_visualization_cfg updated to assert
the remaining input default.
Integration tests
e2e / smoke test
Manual steps: poetry run pytest, poetry run ruff check .,
poetry run mypy .

Documentation

Docstrings updated
User docs / README updated
CHANGELOG entry

Checklist

Code formatted (Ruff) → ruff format --check
Lint passes (Ruff) → ruff check .
Types pass (mypy/pyright) → mypy src
Tests pass (pytest) → pytest -q
Backward compatibility considered (see Breaking Changes)
Adequate comments for tricky parts
CI green

Risk & Rollback Plan

Low risk — docs, skills, and a small config-field removal. Rollback by reverting
the PR.

Notes for Reviewers

Start with src/apeiron/config/configuration.py + tests/test_config.py for
the one behavioral change.
The bulk of the diff is skill markdown moving from many granular skills to four
workflow skills, mirrored across .claude/skills/ and .codex/skills/.
Note the README's reminder to keep the two skill trees in sync.
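
One possible way to spot drift between the two skill trees is sketched below. It only compares which files exist under each root, not their contents, since the Claude and Codex variants may legitimately differ in wording. The function names are hypothetical and nothing like this is shipped in the PR.

```python
from pathlib import Path


def relative_files(root: Path) -> set[str]:
    """Collect file paths under root, relative to root, as POSIX strings."""
    return {p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_file()}


def skill_tree_drift(claude_root: Path, codex_root: Path) -> dict[str, list[str]]:
    """Report files present in one skill tree but missing from the other."""
    claude, codex = relative_files(claude_root), relative_files(codex_root)
    return {
        "only_in_claude": sorted(claude - codex),
        "only_in_codex": sorted(codex - claude),
    }


if __name__ == "__main__":
    # Intended to be run from the repository root.
    print(skill_tree_drift(Path(".claude/skills"), Path(".codex/skills")))
```

Two empty lists indicate that both trees contain the same set of files.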

anagainaru

Looks great, I'll test them in a little bit and come back to approve. This is a good base on which to add the driver, drift detection and CL skills.

anagainaru

All the skills except custom-experiment went well. I had some issues creating a good harness with the custom experiment. I think once we have better documentation for the model harness, this might go better. Let's merge this for now.

ScSteffen added 3 commits June 4, 2026 14:31

updated skills for claude and codex

9e7fede

update config files to remove stale settings

60ecec0

fix readme

260e12a

anagainaru reviewed Jun 4, 2026


anagainaru approved these changes Jun 5, 2026


anagainaru merged commit 4fb7939 into main Jun 6, 2026
3 checks passed

anagainaru deleted the skills_update branch June 6, 2026 02:50

This was referenced Jun 8, 2026

Add documentation for creating model harness #101

Merged

Agent Skill need documentation #81

Closed

Create MCP tools for our Framework to enable interactions with agents. #62

Closed
