Home | Tensara

1 min read Original article ↗

Optimize

Benchmark

Repeat

A platform for GPU programming challenges. Write efficient GPU kernels and compare your solutions with other developers.

Start Solving

GitHub

Discord

Want to submit from your IDE? Check out our CLI tool

vector-add.cu

#include <cuda_runtime.h>

__global__ void vectorAdd(const float* A, const float* B, float* C, int N) {

}