Cortex-M backend: Fix conv2d scratch buffer allocation to match CMSIS-NN wrapper dispatch by rascani · Pull Request #17766 · pytorch/executorch

rascani · 2026-02-28T01:38:51Z

Summary

Use arm_convolve_wrapper_s8_get_buffer_size instead of arm_convolve_s8_get_buffer_size so the buffer size matches whichever specialized kernel arm_convolve_wrapper_s8 will actually dispatch to at runtime (1x1 fast, 1xN, or general).

Also remove the Error::NotFound carve-out that silently proceeded with a null scratch buffer — CMSIS-NN returns ARM_CMSIS_NN_ARG_ERROR when ctx->buf is NULL and a buffer is required, so fail immediately on any allocation error, consistent with the other cortex_m conv ops.

Update CMSIS-NN from v7.0.0 to 84303a51fd867c7ddbd23068b7ce930af1b6269d
and remove GIT_SHALLOW (incompatible with SHA-based FetchContent pins).

Fixes #18044

cc @digantdesai @SS-JIA @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell

pytorch-bot · 2026-02-28T01:38:54Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17766

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Pending, 1 Unrelated Failure

As of commit 94cbb57 with merge base 6c02866 ():

NEW FAILURE - The following job has failed:

pull / unittest / macos / macos-job (gh)
export/tests/test_target_recipes.py::TestTargetRecipes::test_linear_model

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-binary-size-linux-gcc / linux-job (gh) (trunk failure)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-02-28T01:39:35Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

rascani · 2026-02-28T01:46:37Z

Merge will be blocked until ARM-software/CMSIS-NN#200 has merged and we can update the CMSIS version.

Use arm_convolve_wrapper_s8_get_buffer_size instead of arm_convolve_s8_get_buffer_size so the buffer size matches whichever specialized kernel arm_convolve_wrapper_s8 will actually dispatch to at runtime (1x1 fast, 1xN, or general). Also remove the Error::NotFound carve-out that silently proceeded with a null scratch buffer — CMSIS-NN returns ARM_CMSIS_NN_ARG_ERROR when ctx->buf is NULL and a buffer is required, so fail immediately on any allocation error, consistent with the other cortex_m conv ops. Update CMSIS-NN from v7.0.0 to 84303a51fd867c7ddbd23068b7ce930af1b6269d and remove GIT_SHALLOW (incompatible with SHA-based FetchContent pins). Co-authored-by: Claude <noreply@anthropic.com>

Co-authored-by: Claude <noreply@anthropic.com>

rascani · 2026-03-14T00:34:12Z

Failures unrelated.

@digantdesai

…-NN wrapper dispatch (pytorch#17766) ### Summary Use arm_convolve_wrapper_s8_get_buffer_size instead of arm_convolve_s8_get_buffer_size so the buffer size matches whichever specialized kernel arm_convolve_wrapper_s8 will actually dispatch to at runtime (1x1 fast, 1xN, or general). Also remove the Error::NotFound carve-out that silently proceeded with a null scratch buffer — CMSIS-NN returns ARM_CMSIS_NN_ARG_ERROR when ctx->buf is NULL and a buffer is required, so fail immediately on any allocation error, consistent with the other cortex_m conv ops. Update CMSIS-NN from v7.0.0 to 84303a51fd867c7ddbd23068b7ce930af1b6269d and remove GIT_SHALLOW (incompatible with SHA-based FetchContent pins). Fixes pytorch#18044 cc @digantdesai @SS-JIA @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell --------- Co-authored-by: Claude <noreply@anthropic.com>

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 28, 2026

rascani mentioned this pull request Feb 28, 2026

Fix dispatch condition mismatch in convolve wrapper buffer size functions ARM-software/CMSIS-NN#200

Merged

zingo added the partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm label Mar 5, 2026

rascani force-pushed the cortex-m-conv-buffer-size branch from 71db0c9 to 4e6bafa Compare March 7, 2026 01:13

rascani requested review from AdrianLundell and psiddh March 7, 2026 01:15

rascani marked this pull request as ready for review March 7, 2026 01:15

rascani requested review from kirklandsign and larryliu0820 as code owners March 7, 2026 01:15

zingo approved these changes Mar 9, 2026

View reviewed changes

mansnils approved these changes Mar 9, 2026

View reviewed changes

zingo changed the title ~~Fix conv2d scratch buffer allocation to match CMSIS-NN wrapper dispatch~~ Cortex-M backend: Fix conv2d scratch buffer allocation to match CMSIS-NN wrapper dispatch Mar 11, 2026

rascani and others added 2 commits March 13, 2026 11:47

Merge branch 'main' into cortex-m-conv-buffer-size

3ff3ab7

Update CMSIS-NN version to 098d54a6

94cbb57

Co-authored-by: Claude <noreply@anthropic.com>

rascani merged commit a391d67 into pytorch:main Mar 14, 2026
156 of 158 checks passed

rascani deleted the cortex-m-conv-buffer-size branch March 14, 2026 00:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cortex-M backend: Fix conv2d scratch buffer allocation to match CMSIS-NN wrapper dispatch#17766

Cortex-M backend: Fix conv2d scratch buffer allocation to match CMSIS-NN wrapper dispatch#17766
rascani merged 3 commits intopytorch:mainfrom
rascani:cortex-m-conv-buffer-size

rascani commented Feb 28, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Feb 28, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 28, 2026

Uh oh!

rascani commented Feb 28, 2026

Uh oh!

rascani commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

rascani commented Feb 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

pytorch-bot bot commented Feb 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17766

❌ 1 New Failure, 1 Pending, 1 Unrelated Failure

Uh oh!

github-actions bot commented Feb 28, 2026

This PR needs a release notes: label

Uh oh!

rascani commented Feb 28, 2026

Uh oh!

rascani commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rascani commented Feb 28, 2026 •

edited

Loading

pytorch-bot bot commented Feb 28, 2026 •

edited

Loading

This PR needs a `release notes:` label