Skip to content

[Fix] Add CUDA guard for multi-device environment#372

Open
AsherJingkongChen wants to merge 1 commit intomit-han-lab:masterfrom
AsherJingkongChen:fix/cuda-guard
Open

[Fix] Add CUDA guard for multi-device environment#372
AsherJingkongChen wants to merge 1 commit intomit-han-lab:masterfrom
AsherJingkongChen:fix/cuda-guard

Conversation

@AsherJingkongChen
Copy link

Issue: CUDA kernels launch on wrong device in multi-GPU environments

Rootcause: torchsparse CUDA kernels use current_device for kernel launch but tensor may be on a different device

Fix: Add c10::cuda::CUDAGuard guard(tensor.device()) at entry of all CUDA kernel wrapper functions (16 files)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant

Comments