Skip to content

[Slurm 25.11] Enable expedited requeue by default#3117

Merged
hehe7318 merged 6 commits intoaws:developfrom
hehe7318:wip/enable-expedited-requeue
Mar 4, 2026
Merged

[Slurm 25.11] Enable expedited requeue by default#3117
hehe7318 merged 6 commits intoaws:developfrom
hehe7318:wip/enable-expedited-requeue

Conversation

@hehe7318
Copy link
Copy Markdown
Contributor

@hehe7318 hehe7318 commented Feb 11, 2026

Description of changes

  • Enable expedited requeue by default

Tests

  • Kitchen tests passed

Checklist

  • Make sure you are pointing to the right branch.
  • If you're creating a patch for a branch other than develop add the branch name as prefix in the PR title (e.g. [release-3.6]).
  • Check all commits' messages are clear, describing what and why vs how.
  • Make sure to have added unit tests or integration tests to cover the new/modified code.
  • Check if documentation is impacted by this change.

Please review the guidelines for contributing and Pull Request Instructions.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

# CLOUD CONFIGS OPTIONS
<% if node['cluster']['use_private_hostname'] == 'true' or node['cluster']['dns_domain'].nil? or node['cluster']['dns_domain'].empty? -%>
SlurmctldParameters=idle_on_node_suspend,power_save_min_interval=30,node_reg_mem_percent=<%= node['cluster']['slurm_node_reg_mem_percent'] %>
SlurmctldParameters=idle_on_node_suspend,power_save_min_interval=30,node_reg_mem_percent=<%= node['cluster']['slurm_node_reg_mem_percent'] %>,enable_expedited_requeue
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we cover this change with test?
Sincew we need this change asap in the build, if adding the test requires too much boilerplate/time, let;s merge this fix and work on the test in a separate PR.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done, testing

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Test passed.

@hehe7318 hehe7318 added the 3.x label Mar 4, 2026
@@ -26,10 +26,10 @@ LaunchParameters=enable_nss_slurm
#
# CLOUD CONFIGS OPTIONS
<% if node['cluster']['use_private_hostname'] == 'true' or node['cluster']['dns_domain'].nil? or node['cluster']['dns_domain'].empty? -%>
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[not blocking - CodeStyle,Maintenance] The if branch sets both SlurmctldParameters and TreeWidth, but we always set the same value to SlurmctldParameters. We can define SlurmctldParameters outside of the if branch.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You are right! Fixed!

gmarciani
gmarciani previously approved these changes Mar 4, 2026
…branch

SlurmctldParameters was duplicated in both if/else branches with the
same value. Move it outside the conditional.
@hehe7318 hehe7318 force-pushed the wip/enable-expedited-requeue branch from 7667b25 to bf6ac16 Compare March 4, 2026 18:45
@hehe7318 hehe7318 enabled auto-merge (squash) March 4, 2026 20:56
@hehe7318 hehe7318 disabled auto-merge March 4, 2026 20:56
@hehe7318 hehe7318 enabled auto-merge (squash) March 4, 2026 20:56
@hehe7318 hehe7318 merged commit 3a0dc8d into aws:develop Mar 4, 2026
28 of 30 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants