-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Labels
bugSomething isn't workingSomething isn't working
Description
Our current drain_on_shutdown strategy for stopping Nomad agents is:
- On Shutdown the Nomad agent gets ineligible and no new runners are being scheduled.
- In the
drain-on-shutdowndeadlineall running executions have time to finish.- ⚡ We still start new Executions in runners on the draining agent that may not have enough time to finish
- After that the Nomad Agent shuts down
The executions that don't have enough time to finish result in a user-visible error.
We might need to "exclude" some runners for new executions as soon as the respective Nomad agent is about to shut down.
See #651
Unfortunately, we currently don't have any metric to count how often this issue occurs.
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working