Commit 7335c05

bump cuda.core to v0.4.0 (#1097)
1 parent fcd7b99 commit 7335c05


4 files changed: +10 -5 lines changed

cuda_core/cuda/core/_version.py

Lines changed: 1 addition & 1 deletion
@@ -2,4 +2,4 @@
 #
 # SPDX-License-Identifier: Apache-2.0
 
-__version__ = "0.3.3a0"
+__version__ = "0.4.0"

cuda_core/docs/nv-versions.json

Lines changed: 4 additions & 0 deletions
@@ -3,6 +3,10 @@
 "version": "latest",
 "url": "https://nvidia.github.io/cuda-python/cuda-core/latest/"
 },
+{
+"version": "0.4.0",
+"url": "https://nvidia.github.io/cuda-python/cuda-core/0.4.0/"
+},
 {
 "version": "0.3.2",
 "url": "https://nvidia.github.io/cuda-python/cuda-core/0.3.2/"

cuda_core/docs/source/release.rst

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@ Release Notes
 .. toctree::
 :maxdepth: 3
 
-0.X.Y <release/0.X.Y-notes>
+0.4.0 <release/0.4.0-notes>
 0.3.2 <release/0.3.2-notes>
 0.3.1 <release/0.3.1-notes>
 0.3.0 <release/0.3.0-notes>

cuda_core/docs/source/release/0.X.Y-notes.rst renamed to cuda_core/docs/source/release/0.4.0-notes.rst

Lines changed: 4 additions & 3 deletions
@@ -19,9 +19,10 @@ Highlights
 Breaking Changes
 ----------------
 
-- **CUDA 11 support dropped**: CUDA 11 support is no longer tested and it may or may not work with cuda.bindings and CTK 11.x. Users are encouraged to migrate to CUDA 12.x or 13.x.
+- CUDA 11 support dropped: CUDA 11 is no longer tested and it may or may not work with ``cuda.bindings`` and CTK 11.x. Users are encouraged to migrate to CUDA 12.x or 13.x.
 - Support for ``cuda-bindings`` (and ``cuda-python``) < 12.6.2 is dropped. Internally, ``cuda.core`` now always requires the `new binding module layout <https://nvidia.github.io/cuda-python/cuda-bindings/latest/release/12.6.1-notes.html#cuda-namespace-cleanup-with-a-new-module-layout>`_. As per the ``cuda-bindings`` `support policy <https://nvidia.github.io/cuda-python/cuda-bindings/latest/support.html>`_), CUDA 12 users are encouraged to use the latest ``cuda-bindings`` 12.9.x, which is backward-compatible with all CUDA Toolkit 12.y.
-- **LaunchConfig grid parameter interpretation**: When :attr:`LaunchConfig.cluster` is specified, the :attr:`LaunchConfig.grid` parameter now correctly represents the number of clusters instead of blocks. Previously, the grid parameter was incorrectly interpreted as blocks, causing a mismatch with the expected C++ behavior. This change ensures that ``LaunchConfig(grid=4, cluster=2, block=32)`` correctly produces 4 clusters × 2 blocks/cluster = 8 total blocks, matching the C++ equivalent ``cudax::make_hierarchy(cudax::grid_dims(4), cudax::cluster_dims(2), cudax::block_dims(32))``.
+- Change in :class:`LaunchConfig` grid parameter interpretation: When :attr:`LaunchConfig.cluster` is specified, the :attr:`LaunchConfig.grid` parameter now correctly represents the number of clusters instead of blocks. Previously, the grid parameter was incorrectly interpreted as blocks, causing a mismatch with the expected C++ behavior. This change ensures that ``LaunchConfig(grid=4, cluster=2, block=32)`` correctly produces 4 clusters × 2 blocks/cluster = 8 total blocks, matching the C++ equivalent ``cudax::make_hierarchy(cudax::grid_dims(4), cudax::cluster_dims(2), cudax::block_dims(32))``.
+- The :class:`Buffer` objects now deallocate on the stream that was used to allocate it, instead of on the default stream. We encourage users to overwrite the deallocation stream explicitly through the :meth:`~Buffer.close` method if desired. Establishing a proper stream order is the user responsibility.
 
 
 New features
@@ -32,7 +33,7 @@ New features
 - Stream-ordered memory allocation can now be shared on Linux via :class:`DeviceMemoryResource`.
 - Added NVVM IR support to :class:`Program`. NVVM IR is now understood with ``code_type="nvvm"``.
 - Added an :attr:`ObjectCode.code_type` attribute for querying the code type.
-- Added :class:`VirtualMemoryResource` for low-level virtual memory management.
+- Added :class:`VirtualMemoryResource` for low-level virtual memory management on Linux.
 
 
 New examples
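
The grid interpretation described in the ``LaunchConfig`` breaking-change note above can be sanity-checked with plain arithmetic. The sketch below does not call ``cuda.core`` at all; ``total_blocks`` is a hypothetical helper introduced only for illustration, and it assumes one-dimensional integer dimensions, as in the example from the note.

```python
def total_blocks(grid, cluster=None):
    """Return the number of thread blocks launched (hypothetical helper).

    Models the 0.4.0 interpretation from the release note: when ``cluster``
    is given, ``grid`` counts clusters; otherwise ``grid`` counts blocks.
    Only 1-D integer dimensions are handled in this sketch.
    """
    if cluster is None:
        return grid
    return grid * cluster


# LaunchConfig(grid=4, cluster=2, block=32) per the note:
# 4 clusters x 2 blocks per cluster, 32 threads per block.
blocks = total_blocks(grid=4, cluster=2)
threads = blocks * 32
print(blocks, threads)  # 8 256
```

Under this reading, code that previously passed a block count as ``grid`` together with ``cluster`` would likely need to divide that count by the cluster size to keep the same launch shape.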
