This repository contains CUDA implementations demonstrating GPU acceleration.
- Vector Addition using CUDA kernel
- Matrix Addition using 2D threads
- CPU vs GPU performance comparison
- Parallel Reduction using shared memory
- CUDA
- C++
- Parallel Computing
nvcc vector_add.cu -o vector_add ./vector_add