Conversation
|
Note Architectural Review — Claude Opus | Blind review (docs + diff only) Summary This PR introduces a citation system ( Alignment Strong alignment. The citation system extends provenance from the build graph into artifact content — claims inside synthesized artifacts now link back to source labels via Observations
Verdict This is a well-structured incremental step that extends Synix's provenance model into artifact content, fixes real usability issues (#54, #55), and ships a compelling new template — all coherent with the project vision. Review parameters
|
|
Note Red Team Review — OpenAI o3 | Adversarial review (docs + diff only) Threat assessment — Medium-high. Adds new public surface area (citation URI scheme, search-db schema, CLI flags) with scant versioning or migration story; several brittle heuristics and silent-failure paths. One-way doors
Findings
Missing
Verdict — Block. API & storage one-way doors added without versioning, plus critical label-parsing bug and data-loss on index recreation. Require schema-versioning plan, regex fix, and explicit documentation before merge. Review parameters
|
…mpat, tracebacks, --plain, --version) Closes #54 - Manifest validation: gracefully handle foreign manifest.json files instead of crashing on non-synix entries - OpenAI API compat: use max_completion_tokens for reasoning models (o1/o3/o4) and gpt-5.x; omit temperature for reasoning models - Build error display: show exception type at default verbosity, file:line at -v, full traceback at -vv - Plain output mode: --plain flag on build/run for CI-safe timestamped line output without Rich Live TUI - Version flag: synix --version now works via click.version_option
Issue 2 simplification: Replace model detection heuristics with config-driven passthrough. LLMConfig gains max_completion_tokens and nullable temperature — pipelines control what params are sent to the API. Golden comparison improvements: - output_masks: per-step regex patterns to exclude non-deterministic lines - cleanup: case.py declares files to remove before/after runs - _normalize_output: BUILD_COUNTS, BUILD_TOTAL, drop LLM stats lines, normalize progressive projection status - Regenerated all demo goldens with improved normalization
2df6713 to
e130c1f
Compare
|
Note Red Team Review — OpenAI GPT-5.2 | Adversarial review (docs + diff only) Threat assessment — Medium risk: mostly “quality of life” changes, but it quietly alters persisted build behavior (manifest parsing) and introduces new CLI surface area. One-way doors
Findings
Missing
VerdictShip with fixes — the intent is fine, but you’re baking in “silent recovery” semantics for a core build artifact and adding a CI flag without documenting/stabilizing its contract. Review parameters
|
|
Note Architectural Review — Claude Opus | Blind review (docs + diff only) Summary — This PR bundles four changes: (1) defensive manifest parsing so foreign Alignment — All changes fit the build-system-for-memory model. The manifest hardening protects artifact store integrity without relaxing immutability — invalid entries are skipped, not silently accepted. The Observations
Verdict — A solid incremental PR that hardens the artifact store, improves CI ergonomics, and adds real model compatibility — all without drifting from the project's core abstractions. The duplicated control flow in Review parameters
|
Customer-filed integration report documenting 5 issues hit during LENS Datagen pipeline integration with Synix 0.10.0.
|
Note Red Team Review — OpenAI GPT-5.2 | Adversarial review (docs + diff only) Threat assessment — Medium risk: mostly additive/defensive changes, but it quietly cements artifact manifest behavior and introduces a “plain” progress mode that can drift out of sync with the runner. One-way doors
Findings
Missing
VerdictShip with fixes — the error verbosity and OpenAI param passthrough are fine, but the manifest “collision” is still a footgun because you didn’t claim ownership of the filename/path; that’s the one-way door you shouldn’t half-solve. Review parameters
|
|
Note Architectural Review — Claude Opus | Blind review (docs + diff only) Summary This PR addresses five issues discovered during a real integration (LENS datagen pipeline), documented in a detailed integration report. The fixes are: (1) graceful handling of foreign Alignment Strong fit. The manifest collision fix directly protects artifact immutability and content-addressed caching — a corrupted manifest was silently breaking the cache layer. The integration report itself embodies the DESIGN.md hypothesis that "memory architecture is a runtime concern" — someone iterated on a real pipeline and hit real edges. The Observations
Verdict This is a solid, well-tested PR that fixes real issues found during real usage, advances CI usability (a documented limitation), and does so without architectural drift — a good incremental step. Review parameters
|
|
Note Red Team Review — OpenAI GPT-5.2 | Adversarial review (docs + diff only) Threat assessment — Medium risk: mostly additive fixes, but it subtly changes caching/manifest behavior and OpenAI request semantics in ways that can brick existing builds if assumptions differ. One-way doors
Findings
Missing
VerdictShip with fixes — the direction is right, but you’re papering over a manifest namespace collision without resolving it, and the new “tolerant manifest” behavior risks silent cache breakage and surprise rebuild cost. Review parameters
|
|
Note Architectural Review — Claude Opus | Blind review (docs + diff only) Summary — This PR addresses five issues discovered during a real integration (LENS datagen pipeline), documented in a detailed integration report. The code changes fix: (1) manifest format collisions when the build directory contains a foreign Alignment — Strong. Every fix strengthens the "workbench" thesis from DESIGN.md. The manifest collision fix protects artifact immutability and content-addressed caching when build directories are shared with other tools — a realistic scenario for pipeline authors. The Observations
Verdict — This is a solid, well-tested PR that fixes real issues found during actual pipeline usage, with changes that are architecturally consistent and well-scoped. The duplication in the build command's plain/rich branches should be cleaned up, and the progress interface should get a shared protocol, but neither blocks merge. Review parameters
|
Closes #54
Summary
Addresses 5 issues from the LENS Datagen integration report:
ArtifactStore._load_manifest()validates JSON structure, skips invalid/foreign entries, warns instead of crashingLLMConfig.max_completion_tokensand nullabletemperaturelet pipelines control exactly what params hit the API_print_error()helper: default showsTypeError: msg,-vaddsat file.py:line,-vvshows full traceback--plainflag —PlainBuildProgresswith timestamped line output for CI/piped environments (partially closes Non-interactive automation mode for CI/scripted runs #54)--versionflag —@click.version_option(package_name="synix")on the CLI groupAlso includes demo golden comparison improvements:
output_masksfor per-step regex exclusions,cleanupfor run artifacts, improved normalization.Test plan
uv run releasepasses (874 tests, 3 demos)synix --versionprints version stringsynix build --plainproduces pipe-safe output-vvand broken pipeline shows full tracebackmanifest.jsondoesn't crash the buildmax_completion_tokensin llm_config passes through to OpenAI API