Add Kepler power monitoring for TNF clusters #52
lucaconsalvi wants to merge 2 commits into openshift-eng:main
Summary
- Kepler power monitoring on TNF clusters
- Claude Code skill (`/tnf-power`) that queries Kepler metrics via Prometheus and generates a power consumption report
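For reviewers unfamiliar with the flow behind `/tnf-power`, the following is a minimal sketch of querying Prometheus and turning the result into a report. It is not the skill's actual implementation. The metric name `kepler_node_cpu_watts`, the `instance` label, the node names, and all function names are assumptions for illustration; only the `/api/v1/query` endpoint and the response shape come from the Prometheus HTTP API. A canned response is used so the sketch runs without a cluster.

```python
import json
from urllib.parse import urlencode

# Assumed metric name: the PR does not list it, and names vary by Kepler version.
NODE_QUERY = "sum by (instance) (kepler_node_cpu_watts)"


def build_query_url(base_url: str, promql: str) -> str:
    """Build a Prometheus instant-query URL (GET /api/v1/query)."""
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': promql})}"


def parse_vector(payload: dict, label: str) -> dict[str, float]:
    """Map one label of each series in an instant-vector response to its value."""
    if payload.get("status") != "success":
        raise ValueError(f"Prometheus query failed: {payload.get('error')}")
    series = payload["data"]["result"]
    # Each sample is [unix_timestamp, "value-as-string"].
    return {s["metric"].get(label, "unknown"): float(s["value"][1]) for s in series}


def format_report(watts: dict[str, float], title: str) -> str:
    """Render a small plain-text table, highest consumer first."""
    lines = [title]
    for name, value in sorted(watts.items(), key=lambda kv: kv[1], reverse=True):
        lines.append(f"  {name:<24} {value:8.2f} W")
    lines.append(f"  {'total':<24} {sum(watts.values()):8.2f} W")
    return "\n".join(lines)


# Canned response in the shape Prometheus returns; node names are hypothetical.
SAMPLE = json.loads("""
{"status": "success",
 "data": {"resultType": "vector",
          "result": [
            {"metric": {"instance": "master-0"}, "value": [1700000000, "41.5"]},
            {"metric": {"instance": "master-1"}, "value": [1700000000, "38.25"]}
          ]}}
""")

if __name__ == "__main__":
    # The base URL is a placeholder, not the cluster's real route.
    print(build_query_url("https://prometheus.example", NODE_QUERY))
    print(format_report(parse_vector(SAMPLE, "instance"), "Node CPU power"))
```

Against a real cluster, the URL would be fetched with a bearer token and the same parsing applied to the live response; a per-container report would follow the same pattern with a different query and label.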
Details
Deployment
- `kepler.yml` playbook with `kepler` Ansible role handling namespace, RBAC, DaemonSet, ServiceMonitor, and user workload monitoring setup
containers)
- `make deploy-kepler` / `make remove-kepler` targets and corresponding shell scripts
- `-e kepler_state=absent`
Claude Code Skill
- `/tnf-power` skill queries Prometheus for node and container CPU power metrics
Documentation
- `docs/kepler/README.md`: Setup and usage guide
- `docs/kepler/KEPLER-ARCHITECTURE.md`: How Kepler works on TNF clusters
- `docs/kepler/KEPLER-PRESENTATION.md`: Demo/presentation walkthrough
Test plan
- `make deploy-kepler`
- `/tnf-power` skill and verify report output
- `make remove-kepler` and verify cleanup