Cap PyTorch build parallelism for all GCC docker images #20123

Open
rascani wants to merge 1 commit into pytorch:main from rascani:fix-docker-gcc14-oom

rascani (Contributor) commented Jun 8, 2026
Summary

The gcc14 docker build was intermittently timing out on linux.4xlarge runners because it built PyTorch from source with unlimited parallelism, unlike gcc11, which capped MAX_JOBS=6. This change generalizes the guard to all GCC variants so that gcc14, gcc15, and future additions get the same protection.

Images that set SKIP_PYTORCH (gcc9-nopytorch, cuda-windows) are unaffected because the existing SKIP_PYTORCH guard excludes them.
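
For illustration, the generalized guard described above could look roughly like the sketch below. This is not the actual diff: the helper name `pytorch_max_jobs`, the `IMAGE_NAME` variable, the match on `gcc` in the image name, and the treatment of any non-empty `SKIP_PYTORCH` as "set" are all assumptions made for the example. Only `MAX_JOBS=6` and `SKIP_PYTORCH` come from the description.

```shell
# Hypothetical helper (not from the PR): prints the MAX_JOBS cap for an
# image name, or prints nothing when no cap should apply.
pytorch_max_jobs() {
  local image="$1"
  local skip_pytorch="${2:-}"

  # Assumption: any non-empty SKIP_PYTORCH means PyTorch is not built
  # from source, so there is nothing to cap.
  if [ -n "$skip_pytorch" ]; then
    return 0
  fi

  # Match every GCC variant rather than gcc11 only.
  case "$image" in
    *gcc*) echo 6 ;;
  esac
}

# Example wiring; IMAGE_NAME is an assumed variable name.
cap="$(pytorch_max_jobs "${IMAGE_NAME:-}" "${SKIP_PYTORCH:-}")"
if [ -n "$cap" ]; then
  export MAX_JOBS="$cap"
fi
```

Checking `SKIP_PYTORCH` before the name match is what keeps an image such as gcc9-nopytorch uncapped even though its name contains `gcc`.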

Fixes #19881

Test plan

CI

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

pytorch-bot Bot commented Jun 8, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20123

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 6 Cancelled Jobs

As of commit 85a114c with merge base ac3003e:

NEW FAILURES - The following jobs have failed:

CANCELLED JOBS - The following jobs were cancelled. Please retry:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 8, 2026
@rascani rascani requested a review from mergennachin June 8, 2026 20:56

github-actions Bot commented Jun 8, 2026

This PR needs a release notes: label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Development

Successfully merging this pull request may close these issues.

docker-build (linux.4xlarge, executorch-ubuntu-24.04-gcc14) is failing