Skip to content

eval: rh-ai-engineer-ai-observability#50

Open
GuyZivRH wants to merge 13 commits into
mainfrom
eval/rh-ai-engineer-ai-observability
Open

eval: rh-ai-engineer-ai-observability#50
GuyZivRH wants to merge 13 commits into
mainfrom
eval/rh-ai-engineer-ai-observability

Conversation

@GuyZivRH
Copy link
Copy Markdown
Collaborator

Skill evaluation submission for rh-ai-engineer/ai-observability.

Copies structure from agentic-collections and tests from skillsbench.

GuyZivRH added 2 commits May 11, 2026 11:16
Harbor OpenShift runs as non-root (uid=1001390000). /root is not
writable. /solution/ is the correct writable emptyDir mount.
@GuyZivRH GuyZivRH marked this pull request as ready for review May 12, 2026 15:58
@GuyZivRH GuyZivRH marked this pull request as draft May 14, 2026 08:16
@GuyZivRH GuyZivRH marked this pull request as ready for review May 14, 2026 17:58
@GuyZivRH GuyZivRH marked this pull request as draft May 17, 2026 06:28
Remove baseline and non-differentiating checks both variants pass.
@GuyZivRH GuyZivRH marked this pull request as ready for review May 17, 2026 09:47
@GuyZivRH GuyZivRH marked this pull request as draft May 17, 2026 12:32
…lity

Remove tests both control and treatment pass at 100% in recent runs.
Keep only checks that separate skilled vs unskilled per trial logs.

Co-authored-by: Cursor <cursoragent@cursor.com>
@GuyZivRH GuyZivRH marked this pull request as ready for review May 17, 2026 19:13
@GuyZivRH GuyZivRH marked this pull request as draft May 18, 2026 11:21
…alues

Add analyze_vllm tool, text-gen-legacy model reference, GPU memory
utilization (22GB/24GB), p99 latency (2800ms), and get_gpu_info tool.

Co-authored-by: Cursor <cursoragent@cursor.com>
@GuyZivRH GuyZivRH marked this pull request as ready for review May 18, 2026 21:27
@GuyZivRH GuyZivRH marked this pull request as draft May 24, 2026 18:08
Co-authored-by: Cursor <cursoragent@cursor.com>
@GuyZivRH GuyZivRH marked this pull request as ready for review May 24, 2026 21:11
@GuyZivRH GuyZivRH marked this pull request as draft May 25, 2026 06:07
@GuyZivRH GuyZivRH marked this pull request as ready for review May 25, 2026 10:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant