[Feature] Base/Row alignment support for Tensors by zacharyvincze · Pull Request #136 · ROCm/rocCV

zacharyvincze · 2026-03-10T18:48:42Z

PR Description

General

Adds padding (row/base pointer alignment) support for roccv::Tensor.
Adds MemAlign class which is used as a parameter when allocating memory for Tensors. This specifies what the row alignment and base alignment should be for newly created tensors.

Samples

Changes support reading/writing images from host <-> device with tensor padding in mind.
Improves help messages and command line argument parsing across all samples.

Tensors

Stride calculation logic changed to take non-contiguous data (row padding) into account.
Reshape method implementation changed to take into account non-contiguous tensors. Ensures that the shape cannot be changed across the row padding boundary, as this is supposed to be a zero-copy operation which retains the initial memory layout.
Quality of life methods for checking whether a tensor is contiguous, and for returning the amount of non-padded (non-garbage) bytes being used by the tensor.
Adds copyToHost and copyFromHost methods to the tensor, which accepts and produces contiguous host buffers. Since padding makes copying memory from the host to the device non-trivial, this allows the user to perform a copy which is agnostic to the underlying row padding that a tensor may have.

Python bindings (rocpycv)

Adds correctness changes to DLPack conversion. Checks whether strides are provided by DLPack first, if so, uses them directly after converting from element-wise to byte-wise strides, otherwise, assumes a contiguous tensor is being provided by DLPack and recalculates with that assumption.

Tests

Helpers which copy from host to device now use Tensor::copyFromHost, Tensor::copyToHost directly instead of using duplicate code.
Adds Tensor unit tests to ensure padding is calculated properly, and that any additional methods added to roccv::Tensor are covered. Special attention is paid to the reshape method, since this has undergone the most changes.

Copilot

Pull request overview

This PR adds base-pointer and per-row alignment (padding) support to roccv::Tensor, updating stride/reshape semantics and introducing host<->device copy helpers that are padding-aware. It also updates samples, Python bindings, benchmarks, and tests to operate correctly with non-contiguous (row-padded) tensors.

Changes:

Introduce MemAlignment and update Tensor allocation/stride/reshape logic to support row/base alignment and non-contiguous layouts.
Add Tensor::copyFromHost / Tensor::copyToHost and refactor tests/helpers/samples to use padding-aware copies.
Update Python DLPack import to honor provided strides (and convert element-strides to byte-strides), with a contiguous fallback.

Reviewed changes

Copilot reviewed 24 out of 24 changed files in this pull request and generated 7 comments.

Show a summary per file

File	Description
`src/core/tensor.cpp`	Core implementation updates: alignment-aware stride calc, reshape for non-contiguous tensors, host copy helpers.
`include/core/tensor.hpp`	Public API updates for alignment-aware construction + new copy/contiguity helpers.
`include/core/mem_alignment.hpp` / `src/core/mem_alignment.cpp`	New alignment configuration type used during tensor allocation.
`include/core/utils.hpp`	New low-level helpers (power-of-two + align-up) used for alignment/stride logic.
`src/core/tensor_storage.cpp` / `include/core/tensor_storage.hpp`	Expose allocator used by `TensorStorage`.
`src/op_non_max_suppression.cpp`	Update reshape call sites to new `reshape(dtype, shape)` signature.
`python/src/py_tensor.cpp` / `python/include/py_tensor.hpp`	DLPack import correctness fixes + documentation/typing cleanup.
`tests/roccv/cpp/src/tests/core/tensor/test_tensor.cpp`	Add/adjust tensor tests for padding/reshape/copy helpers.
`tests/roccv/cpp/include/test_helpers.hpp`	Refactor host<->device copy helpers to use new tensor copy APIs.
`samples/common/utils.hpp`	Add batch image load/write utilities that respect tensor padding.
`samples/*.cpp` (multiple)	Update CLI parsing + use new image IO helpers (padding-aware).
`benchmarks/src/roccv/roccv_bench_helpers.cpp`	Update benchmark filling logic to account for padded allocation sizes.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

codecov-commenter · 2026-03-23T23:42:27Z

Codecov Report

❌ Patch coverage is 77.87234% with 52 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/core/tensor.cpp	76.47%	33 Missing and 15 partials ⚠️
include/core/utils.hpp	78.57%	3 Missing ⚠️
src/core/tensor_storage.cpp	0.00%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #136      +/-   ##
===========================================
- Coverage    78.13%   77.98%   -0.15%     
===========================================
  Files           79       82       +3     
  Lines         3347     3538     +191     
  Branches       733      771      +38     
===========================================
+ Hits          2615     2759     +144     
- Misses         369      403      +34     
- Partials       363      376      +13

Files with missing lines	Coverage Δ
include/core/mem_alignment.hpp	`100.00% <100.00%> (ø)`
include/core/tensor.hpp	`62.50% <ø> (ø)`
src/core/mem_alignment.cpp	`100.00% <100.00%> (ø)`
src/op_non_max_suppression.cpp	`41.18% <100.00%> (ø)`
src/core/tensor_storage.cpp	`81.48% <0.00%> (-3.13%)`	⬇️
include/core/utils.hpp	`78.57% <78.57%> (ø)`
src/core/tensor.cpp	`72.53% <76.47%> (+1.23%)`	⬆️

... and 1 file with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

…cze/rocCV into zv/feature/add-memalign-class

zacharyvincze added 30 commits December 2, 2025 13:52

Add MemAlignment to Tensor constructors

742920c

Document all constructors for roccv::Tensor

f345ea0

Add MemAlign parameter to Tensor::CalcStrides()

cf99a9c

Added Tensor copy constructor an Tensor::dataSize() implementation

21f0cb8

Remove copy constructor for roccv::Tensor

0a761ce

Implement memory padding and alignment for tensors

fd84334

Fix alignment calculations

c232d1f

Take padding into account during copies in test helpers

c90475b

Add isContiguous member function for roccv::Tensor

18d0890

Remove non-contiguous reshaping from tests

e97207a

Update utils.hpp to use memcpy2D to handle tensor padding

6102764

Fix incorrect imageBytes calculation

12ff5e9

Use HIP streams where possible

e562ebc

Fix CopyMakeBorder and GammaContrast samples

cae2e8e

Fix WarpPerspective sample

6c42212

Fix CustomCrop sample

f7f3839

Fix BilateralFilter sample

1063baa

Fix CenterCrop sample

612cfb6

Fix/cleanup BndBox sample

b1ae692

Fix/clean Composite sample

e85d3a4

Merge branch 'develop' into zv/feature/add-memalign-class

d544ae6

Fix crop and resize example

1d1879b

Add more information to the help message

b7cff20

Merge branch 'develop' into zv/feature/add-memalign-class

ebabcfb

Fix tensor copies for benchmarking suite

370c337

Fix tensor padding calculations

3ee9636

Add documentation to helper function

8c0b2fb

Reuse reshape logic

8073969

Merge branch 'develop' into zv/feature/add-memalign-class

a0fc480

Temp commit

97efadb

zacharyvincze requested review from jeffqjiangNew and paveltc and removed request for paveltc March 23, 2026 22:11

Copilot started reviewing on behalf of zacharyvincze March 23, 2026 22:14 View session

zacharyvincze added the ci:precheckin label Mar 23, 2026

Copilot AI reviewed Mar 23, 2026

View reviewed changes

Address PR comments

ea91e85

zacharyvincze added 2 commits April 7, 2026 14:01

Merge branch 'develop' into zv/feature/add-memalign-class

ae83206

Merge branch 'develop' into zv/feature/add-memalign-class

30ba649

jeffqjiangNew reviewed Apr 8, 2026

View reviewed changes

Comment thread benchmarks/src/roccv/roccv_bench_helpers.cpp Outdated

jeffqjiangNew reviewed Apr 8, 2026

View reviewed changes

Comment thread include/core/mem_alignment.hpp Outdated

jeffqjiangNew reviewed Apr 8, 2026

View reviewed changes

Comment thread include/core/tensor.hpp Outdated

jeffqjiangNew reviewed Apr 8, 2026

View reviewed changes

Comment thread include/core/tensor.hpp Outdated

zacharyvincze added 7 commits April 8, 2026 13:45

Fix warnings

8b5949f

Fix random generation for benchmarking

875da7e

Fix copyright year

6a87f15

Change tensor copy signatures for clarity

770a674

Merge branch 'develop' into zv/feature/add-memalign-class

5c651d6

Add comments to tensor requirement members

d65da50

Merge branch 'zv/feature/add-memalign-class' of github.com:zacharyvin…

4d234f1

…cze/rocCV into zv/feature/add-memalign-class

jeffqjiangNew reviewed Apr 9, 2026

View reviewed changes

Comment thread src/core/tensor.cpp

jeffqjiangNew reviewed Apr 9, 2026

View reviewed changes

Comment thread src/core/tensor.cpp

jeffqjiangNew approved these changes Apr 10, 2026

View reviewed changes

zacharyvincze added 5 commits April 23, 2026 11:28

Merge branch 'develop' into zv/feature/add-memalign-class

bbd0958

Merge branch 'develop' into zv/feature/add-memalign-class

1a0ece4

Merge branch 'develop' into zv/feature/add-memalign-class

2b74997

Merge branch 'develop' into zv/feature/add-memalign-class

bb89044

Merge branch 'develop' into zv/feature/add-memalign-class

4d0979c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Base/Row alignment support for Tensors#136

[Feature] Base/Row alignment support for Tensors#136
zacharyvincze wants to merge 57 commits into
ROCm:developfrom
zacharyvincze:zv/feature/add-memalign-class

zacharyvincze commented Mar 10, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

codecov-commenter commented Mar 23, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

zacharyvincze commented Mar 10, 2026

PR Description

General

Samples

Tensors

Python bindings (rocpycv)

Tests

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

codecov-commenter commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

codecov-commenter commented Mar 23, 2026 •

edited

Loading