Skip to content

Pull requests: NVlabs/cuda-oxide

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add redux.sync.add warp reduction (sm_80+)
#223 opened Jun 14, 2026 by goog00 Contributor Loading…
6 tasks done
fix(cargo-oxide): preserve existing LIBRARY_PATH during backend build
#221 opened Jun 14, 2026 by enochjung Loading…
5 of 6 tasks
feat: add asin/acos intrinsic support for f32 and f64
#216 opened Jun 13, 2026 by goog00 Contributor Loading…
mir-importer: read closure upvars from generic suffix
#212 opened Jun 12, 2026 by niklebedenko Contributor Loading…
mir-importer: index slice data pointers by element
#210 opened Jun 12, 2026 by niklebedenko Contributor Loading…
mir-importer: preserve volatile load and store intrinsics
#208 opened Jun 12, 2026 by niklebedenko Contributor Loading…
mir-importer: support pointer distance intrinsics
#206 opened Jun 12, 2026 by niklebedenko Contributor Loading…
cuda-core: restore DeviceBuffer async methods
#204 opened Jun 12, 2026 by niklebedenko Contributor Loading…
cuda-core: add safe DeviceCopy parity impls
#202 opened Jun 12, 2026 by niklebedenko Contributor Loading…
fix(mir-importer): lower nested index assignments
#199 opened Jun 12, 2026 by niklebedenko Contributor Loading…
fix(mir-importer): lower copy_nonoverlapping to memcpy
#196 opened Jun 12, 2026 by niklebedenko Contributor Loading…
feat(llvm-export): AsmKind enum for inline asm optimization hints
#195 opened Jun 12, 2026 by honeyspoon Contributor Loading…
feat(cuda-core): derive DeviceCopy
#193 opened Jun 12, 2026 by niklebedenko Contributor Loading…
feat(cuda-device): add bf16x2 FMA intrinsic
#191 opened Jun 12, 2026 by niklebedenko Contributor Loading…
fix(pipeline): preserve inline always
#189 opened Jun 12, 2026 by niklebedenko Contributor Loading…
fix(intrinsics): lower fast float ops
#187 opened Jun 12, 2026 by niklebedenko Contributor Loading…
fix(codegen): align export and call names
#185 opened Jun 12, 2026 by niklebedenko Contributor Loading…
fix(mir-importer): lower array constants
#183 opened Jun 12, 2026 by niklebedenko Contributor Loading…
fix(pipeline): create output_dir before emission
#181 opened Jun 12, 2026 by niklebedenko Contributor Loading…
Preserve device contexts when duplicate init fails
#176 opened Jun 12, 2026 by fallintoplace Contributor Loading…
feat(codegen): fma contraction and an opt -O3 pass to match nvcc defaults codegen Device code-generation pipeline (Rust MIR to IR to PTX) enhancement New feature or request perf Performance of generated code or of the compiler itself
#117 opened Jun 6, 2026 by alejandro-soto-franco Contributor Loading…
Support typed NVVM IR for pre-Blackwell libNVVM targets Depends Blocked on another PR/issue or an upstream dependency enhancement New feature or request llvm-export Textual LLVM/NVVM IR exporter (llvm-export crate)
#101 opened May 30, 2026 by mohamedsamirx Contributor Loading…
ProTip! Add no:assignee to see everything that’s not assigned.