Settings

Theme

CUDA to cuTile transpiler for Nvidia's CUDA 13.1

github.com

1 points by rncode 20 days ago · 1 comment

Reader

rncodeOP 20 days ago

We just dropped a new open source project: CUDA to cuTile transpiler for NVIDIA's CUDA 13.1

We built a transpiler that converts your CUDA kernels to cuTile automatically. It figures out what your kernel does (flash attention, matrix multiplication, RoPE) and writes the cuTile version

Zero AI involved! It's pure pattern matching and code analysis

Currently supports 18 kernel patterns: - Core: GEMM, Reduction, Scan, Stencil, Elementwise, FFT - ML/DL: Convolution (1D/2D/3D), Pooling, Normalization - LLM: Flash Attention, RoPE, KV Cache, Quantization (INT8/FP8) - Specialized: Sparse matrices, Histogram, Sorting

Contributions we need:More kernel pattern templates

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection