Optimize. Benchmark. Repeat.
A platform for GPU programming challenges. Write efficient GPU kernels and compare your solutions with other developers.
Want to submit from your IDE? Check out our CLI tool.
vector-add.cu
#include <cuda_runtime.h>
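// One thread per element; the bounds check covers a partially filled last block.
__global__ void vectorAdd(const float* A, const float* B, float* C, int N) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < N) C[i] = A[i] + B[i];
}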
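To try a kernel like this outside the platform, it needs host code that allocates device memory, copies the inputs over, launches the kernel, and copies the result back. The following is a minimal sketch under stated assumptions, not the platform's own harness: the names `vectorAddSketch` and `runVectorAdd` are hypothetical, the block size of 256 threads is an arbitrary choice, and the inputs are assumed to live in ordinary host arrays.

```cuda
#include <cuda_runtime.h>

// Hypothetical kernel name (kept distinct from the starter template above).
// Each thread computes one output element and skips indices past N.
__global__ void vectorAddSketch(const float* A, const float* B, float* C, int N) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < N) {
        C[i] = A[i] + B[i];
    }
}

// Hypothetical host helper: copies hA and hB to the device, launches the
// kernel, and copies the sum back into hC. Returns the first CUDA error seen.
cudaError_t runVectorAdd(const float* hA, const float* hB, float* hC, int N) {
    if (N <= 0) return cudaSuccess;  // nothing to do

    size_t bytes = static_cast<size_t>(N) * sizeof(float);
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;

    cudaError_t err = cudaMalloc(&dA, bytes);
    if (err == cudaSuccess) err = cudaMalloc(&dB, bytes);
    if (err == cudaSuccess) err = cudaMalloc(&dC, bytes);
    if (err == cudaSuccess) err = cudaMemcpy(dA, hA, bytes, cudaMemcpyHostToDevice);
    if (err == cudaSuccess) err = cudaMemcpy(dB, hB, bytes, cudaMemcpyHostToDevice);

    if (err == cudaSuccess) {
        int threads = 256;                         // assumed block size
        int blocks = (N + threads - 1) / threads;  // round up to cover all N elements
        vectorAddSketch<<<blocks, threads>>>(dA, dB, dC, N);
        err = cudaGetLastError();                  // catches launch configuration errors
    }

    // This blocking copy also waits for the kernel to finish on the default stream.
    if (err == cudaSuccess) err = cudaMemcpy(hC, dC, bytes, cudaMemcpyDeviceToHost);

    // cudaFree accepts null pointers, so this is safe after a partial failure.
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
    return err;
}
```

Calling `runVectorAdd(a, b, c, n)` from a `main` function with three host arrays of length `n` and comparing `c[i]` against `a[i] + b[i]` gives a quick correctness check before measuring performance.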