Prometheus metrics setup by pablin-10 · Pull Request #125 · flashbots/flashbots-images

pablin-10 · 2026-03-19T13:31:25Z

Here we add the observability features but in a separate module.

This PR aims to replace these two:

alexhulbert

One small change, otherwise lgtm

Not relevant anymore

MoeMahhouk · 2026-05-19T09:41:41Z

+if [ -n "${METRICS_ENDPOINTS:-}" ]; then
+    for ip in $METRICS_ENDPOINTS; do
+        accept_dst_ip_port $CHAIN_ALWAYS_OUT tcp "$ip" $HTTPS_PORT "Metrics endpoint (Flashbots)"
+    done
+fi


I know this is important but I have concerns about it from different angles:

it introduces dynamic IP allowlisting which deviates from the Flashbox L1 images having everything static and part of the measurements for attestation/verification purposes

should we consider dropping those opened endpoints manually for the searcher's rootless podman container as we do for couple always out endpoints in the init-container.sh? or what is the rational behind leaving that out? what's the impact if the searcher's container could reach those endpoints too beside the guest-os?

I recall you mentioned those IP endpoints might change, what is the process to update those and refresh the firewall rules at runtime? how invasive it is? does it have potential downtime? is it automated or manually triggered?

MoeMahhouk · 2026-05-19T09:47:54Z

out of curiosity, what is this used/needed for here inside the observability module itself?

MoeMahhouk · 2026-05-19T10:43:15Z

+  --web.console.templates=/usr/share/prometheus/consoles \
+  --web.console.libraries=/usr/share/prometheus/console_libraries \
+  --web.listen-address=127.0.0.1:9090
+ExecReload=/bin/kill -HUP $MAINPID


why is this needed? doesnt systemd handle this automatically?

MoeMahhouk · 2026-05-19T15:21:24Z

+if [ -z "${METRICS_FLASHBOTS_URL:-}" ]; then
+    echo "No metrics URL configured, remote_write disabled"
+    exit 0
+fi


wouldnt this always trigger an exit 0 here or where is "METRICS_FLASHBOTS_URL" being populated beforehand?

Ruteri · 2026-05-20T18:22:31Z

+    local key value
+    for key in $keys; do
+        value=$(echo "$secret_data" | jq -rc --arg k "$key" '.[$k] // ""')
+        export "${key}=${value}"


I wonder if this is sanitized enough to later call source on. Can we verify this somehow?

Claude (under some pressure) gave an example payload using the fact that the export is not sanitized (using various gadgets from this PR), I think it would work in practice (nothing fancy, bare bones sh)
exploit.sh

In general, I'd try to avoid doing these stuff manually in bash script.
Could we do templating with minimal render engine like original did in BuilderNet using mustache (examples).

This way, the template would exactly render those values into the corresponding place-holders and avoid potential malicious attacks

…h to aws prom

pablin-10 requested review from MoeMahhouk and alexhulbert March 19, 2026 13:31

pablin-10 changed the title ~~Split observability prototype into a separate module~~ Split observability recipes into a separate module Mar 19, 2026

alexhulbert previously requested changes Mar 23, 2026

View reviewed changes

Comment thread modules/flashbox/observability/mkosi.extra/etc/systemd/system/node-exporter.service

pablin-10 force-pushed the pablo/observability_v2 branch from 924cb6e to 89fa85b Compare April 21, 2026 18:19

pablin-10 force-pushed the pablo/observability_v2 branch 2 times, most recently from bd99350 to c0ce099 Compare April 29, 2026 23:19

pablin-10 force-pushed the pablo/observability_v2 branch from 6b2fc99 to 738717b Compare May 13, 2026 16:44

pablin-10 changed the title ~~Split observability recipes into a separate module~~ Prometheus metrics setup May 13, 2026

MoeMahhouk requested a review from alexhulbert May 19, 2026 09:17

MoeMahhouk reviewed May 19, 2026

View reviewed changes

Ruteri reviewed May 20, 2026

View reviewed changes

pablin-10 added 12 commits May 26, 2026 11:14

Split observability prototype into a separate module

56f45b4

Add obs to L1 also

1c22ca5

Rework observability fetching config

46218b6

Refactor to make it cleaner

d68cfde

Remove uppercasing logic

a3571d2

Add logging to vault fetch script

5942584

Add host label to prometheus metrics

637cd0a

Accept multiple endpoints for metrics in firewall + use sigv4 for aut…

fc79bba

…h to aws prom

Refactor obs firefwall rules loading

bf2897c

Make container up rule show only 1 or 0

bee12eb

Handle hosts to IP translation on boot

0deb378

Specify vault role from metadata

736a61c

pablin-10 force-pushed the pablo/observability_v2 branch from 65c0e7e to 736a61c Compare May 26, 2026 14:14

pablin-10 requested a review from a team as a code owner May 26, 2026 14:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prometheus metrics setup#125

Prometheus metrics setup#125
pablin-10 wants to merge 12 commits into
mainfrom
pablo/observability_v2

pablin-10 commented Mar 19, 2026

Uh oh!

alexhulbert left a comment

Uh oh!

Uh oh!

MoeMahhouk May 19, 2026

Uh oh!

MoeMahhouk May 19, 2026

Uh oh!

MoeMahhouk May 19, 2026

Uh oh!

MoeMahhouk May 19, 2026

Uh oh!

Ruteri May 20, 2026 •

edited

Loading

Uh oh!

MoeMahhouk May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

pablin-10 commented Mar 19, 2026

Uh oh!

alexhulbert left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MoeMahhouk May 19, 2026

Choose a reason for hiding this comment

Uh oh!

MoeMahhouk May 19, 2026

Choose a reason for hiding this comment

Uh oh!

MoeMahhouk May 19, 2026

Choose a reason for hiding this comment

Uh oh!

MoeMahhouk May 19, 2026

Choose a reason for hiding this comment

Uh oh!

Ruteri May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

MoeMahhouk May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Ruteri May 20, 2026 •

edited

Loading