@@ -5,64 +5,64 @@ Examples
55========
66
77This page links to the ``cuda.bindings `` examples shipped in the
8- `cuda-python repository <https://github.com/NVIDIA/cuda-python/tree/main /cuda_bindings/examples >`_.
8+ `cuda-python repository <https://github.com/NVIDIA/cuda-python/tree/|cuda_bindings_github_ref| /cuda_bindings/examples >`_.
99Use it as a quick index when you want a runnable sample for a specific API area
1010or CUDA feature.
1111
1212Introduction
1313------------
1414
15- - `clock_nvrtc_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/0_Introduction/clock_nvrtc_test .py >`_
15+ - `clock_nvrtc .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/0_Introduction/clock_nvrtc .py >`_
1616 uses NVRTC-compiled CUDA code and the device clock to time a reduction
1717 kernel.
18- - `simpleCubemapTexture_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/0_Introduction/simpleCubemapTexture_test .py >`_
18+ - `simple_cubemap_texture .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/0_Introduction/simple_cubemap_texture .py >`_
1919 demonstrates cubemap texture sampling and transformation.
20- - `simpleP2P_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/0_Introduction/simpleP2P_test .py >`_
20+ - `simple_p2p .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/0_Introduction/simple_p2p .py >`_
2121 shows peer-to-peer memory access and transfers between multiple GPUs.
22- - `simpleZeroCopy_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/0_Introduction/simpleZeroCopy_test .py >`_
22+ - `simple_zero_copy .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/0_Introduction/simple_zero_copy .py >`_
2323 uses zero-copy mapped host memory for vector addition.
24- - `systemWideAtomics_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/0_Introduction/systemWideAtomics_test .py >`_
24+ - `system_wide_atomics .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/0_Introduction/system_wide_atomics .py >`_
2525 demonstrates system-wide atomic operations on managed memory.
26- - `vectorAddDrv_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/0_Introduction/vectorAddDrv_test .py >`_
26+ - `vector_add_drv .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/0_Introduction/vector_add_drv .py >`_
2727 uses the CUDA Driver API and unified virtual addressing for vector addition.
28- - `vectorAddMMAP_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/0_Introduction/vectorAddMMAP_test .py >`_
28+ - `vector_add_mmap .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/0_Introduction/vector_add_mmap .py >`_
2929 uses virtual memory management APIs such as ``cuMemCreate `` and
3030 ``cuMemMap `` for vector addition.
3131
3232Concepts and techniques
3333-----------------------
3434
35- - `streamOrderedAllocation_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/2_Concepts_and_Techniques/streamOrderedAllocation_test .py >`_
35+ - `stream_ordered_allocation .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/2_Concepts_and_Techniques/stream_ordered_allocation .py >`_
3636 demonstrates ``cudaMallocAsync `` and ``cudaFreeAsync `` together with
3737 memory-pool release thresholds.
3838
3939CUDA features
4040-------------
4141
42- - `globalToShmemAsyncCopy_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/3_CUDA_Features/globalToShmemAsyncCopy_test .py >`_
42+ - `global_to_shmem_async_copy .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/3_CUDA_Features/global_to_shmem_async_copy .py >`_
4343 compares asynchronous global-to-shared-memory copy strategies in matrix
4444 multiplication kernels.
45- - `simpleCudaGraphs_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/3_CUDA_Features/simpleCudaGraphs_test .py >`_
45+ - `simple_cuda_graphs .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/3_CUDA_Features/simple_cuda_graphs .py >`_
4646 shows both manual CUDA graph construction and stream-capture-based replay.
4747
4848Libraries and tools
4949-------------------
5050
51- - `conjugateGradientMultiBlockCG_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/4_CUDA_Libraries/conjugateGradientMultiBlockCG_test .py >`_
51+ - `conjugate_gradient_multi_block_cg .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/4_CUDA_Libraries/conjugate_gradient_multi_block_cg .py >`_
5252 implements a conjugate-gradient solver with cooperative groups and
5353 multi-block synchronization.
54- - `nvidia_smi.py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/4_CUDA_Libraries/nvidia_smi.py >`_
54+ - `nvidia_smi.py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/4_CUDA_Libraries/nvidia_smi.py >`_
5555 uses NVML to implement a Python subset of ``nvidia-smi ``.
5656
5757Advanced and interoperability
5858-----------------------------
5959
60- - `isoFDModelling_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/extra/isoFDModelling_test .py >`_
60+ - `iso_fd_modelling .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/extra/iso_fd_modelling .py >`_
6161 runs isotropic finite-difference wave propagation across multiple GPUs with
6262 peer-to-peer halo exchange.
63- - `jit_program_test .py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/extra/jit_program_test .py >`_
63+ - `jit_program .py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/extra/jit_program .py >`_
6464 JIT-compiles a SAXPY kernel with NVRTC and launches it through the Driver
6565 API.
66- - `numba_emm_plugin.py <https://github.com/NVIDIA/cuda-python/blob/main /cuda_bindings/examples/extra/numba_emm_plugin.py >`_
66+ - `numba_emm_plugin.py <https://github.com/NVIDIA/cuda-python/blob/|cuda_bindings_github_ref| /cuda_bindings/examples/extra/numba_emm_plugin.py >`_
6767 shows how to back Numba's EMM interface with the NVIDIA CUDA Python Driver
6868 API.
0 commit comments