Release - New agent demo#16
Merged
Merged
Conversation
… eval cases Tools (demos/engineering_agent/tools.py): code_search — find symbols/patterns across a codebase (by keyword, returns file+line+snippet) file_read — read a specific file, optional line range pr_review — fetch PR issues by severity/category (security, perf, style, correctness) dependency_check — outdated versions + CVE lookup per repo/package manager Mock data covers realistic scenarios: authenticate_user defined in src/auth/service.py, used across 3 files SQL injection patterns in src/db/queries.py (raw f-strings) PR-42: critical timing-attack + missing rate limit PR-99: docs-only, clean PR-17: DB refactor, 2 warnings backend/pip: 4 outdated, cryptography CRITICAL CVE + requests HIGH CVE frontend/npm: lodash HIGH + axios MEDIUM CVEs 5 scenarios: find_function, review_pr, dependency_audit, inspect_file, security_investigation 10 eval cases (evals/dataset/engineering_cases.py): eng_001–003: code_search (definition, usage, SQL injection) eng_004–005: file_read (existing file, not-found graceful handling) eng_006–008: pr_review (critical PR, clean PR, warning-only PR) eng_009: dependency_check (must surface critical CVE) eng_010: multi-step code_search + file_read Makefile: make eval-engineering, make eval-all, make demo-eng now active. 134 tests pass.
…-agent feat(phase-5): engineering assistant agent — 4 tools, 5 scenarios, 10 eval cases
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.