Update claude and codex skills. Fix corresponding sections in README etc #104
Merged
anagainaru
reviewed
Jun 4, 2026
anagainaru
left a comment
Collaborator
Looks great, I'll test them in a little bit and come back to approve. This is a good base on which to add the driver, drift detection and CL skills.
anagainaru
approved these changes
Jun 5, 2026
anagainaru
left a comment
Collaborator
All the skills except custom-experiment went well. I had some issues creating a good harness with the custom experiment. I think once we have better documentation for the model harness, this might go better. Let's merge this for now.
This was referenced Jun 8, 2026
Summary
Consolidates the agent-skill set into four workflow-oriented skills maintained in parallel for both Claude Code and Codex, and removes the now-dead visualization config (baseline/output) that no bundled renderer consumes. Docs (CLAUDE.md, README.md) and example configs are brought in line with the current src/apeiron/ package layout.
Motivation & Context
The old skills were fine-grained, per-task scaffolds (new-detector, new-updater, new-config, visualize, lint-check, …) that no longer matched how people actually use the framework, and several referenced a visualization entry point (src/visualize.py) and VisualizationCfg fields that the package no longer ships. This PR replaces them with task-level workflows and prunes the stale config/doc references so the repo describes what actually exists.
Approach
Skills (.claude/skills/ and .codex/skills/):
- install-apeiron: add Apeiron as a dependency to another project.
- explore-examples: run a bundled MNIST/CIFAR example.
- custom-experiment: scaffold harness + data utils + TOML for your own data/model and run it.
- integrate-apeiron: bolt drift detection / CL onto an existing training loop.
- Removed the old per-task skills (debug-experiment, explain, lint-check, new-config, new-detector, new-harness, new-updater, run-experiment, visualize).
VisualizationCfg keeps only input (the CSV path run metrics are written to); the unused baseline and output fields are dropped. Example TOMLs (mnist, cifar10_vgg11, cifar10_vit) are updated to match.
CLAUDE.md and README.md updated to the src/apeiron/ package paths, MLflow/WandB logging backends, the EnsembleDetector NotImplementedError caveat, and a new "Agent Skills" README section. Removed references to the dropped src/visualize.py entry point.
Screenshots / Logs (optional)
N/A — docs/config/tooling change.
API / CLI Changes
VisualizationCfg: removed fields baseline: float and output: str; only input: str remains (default "output/cl_only.csv").
Breaking Changes
Configs that set [visualization] baseline = … or [visualization] output = … must drop those keys; only input is still recognized. (The bundled example configs are already updated.)
Performance (optional)
N/A.
Security & Privacy
Dependencies
poetry.lock content-hash/generator header updated only.
Testing Plan
- tests/test_config.py::test_visualization_cfg updated to assert the remaining input default.
- poetry run pytest, poetry run ruff check ., poetry run mypy .
Documentation
Checklist
- ruff format --check
- ruff check .
- mypy src
- pytest -q
Risk & Rollback Plan
Low risk: docs, skills, and a small config-field removal. Roll back by reverting the PR.
Notes for Reviewers
- src/apeiron/config/configuration.py + tests/test_config.py for the one behavioral change.
- workflow skills, mirrored across .claude/skills/ and .codex/skills/.