Show HN: Nanbeige 4.1-3B running in the browser via WebGPU

huggingface.co

6 points by victormustar 2 months ago · 2 comments

victormustarOP 2 months ago

This is a 3B-parameter model from Nanbeige with surprisingly strong benchmarks. It beats Qwen3-32B on Arena-Hard and LiveCodeBench despite being roughly 10x smaller. (Be warned: it thinks a lot.)

I wrapped it in a simple browser demo using Transformers.js + WebGPU. It downloads the q4 ONNX weights (~1.7GB) and runs fully client-side, with no server required. It falls back to WASM if WebGPU isn't available.
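
For anyone curious how the WebGPU-or-WASM choice can be made, here is a minimal sketch. It is not the demo's actual code. `pickDevice`, `NavigatorLike`, and `GpuLike` are names made up for illustration, and the model ID in the comment is a placeholder. The only assumptions about real APIs are that `navigator.gpu.requestAdapter()` may resolve to `null` when no adapter is available, and that Transformers.js v3 accepts `device` and `dtype` options on `pipeline`.

```typescript
// Hypothetical helper: decide which backend to ask Transformers.js for.
type Device = "webgpu" | "wasm";

// Minimal structural types so the helper can be exercised outside a browser.
interface GpuLike {
  requestAdapter(): Promise<unknown | null>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

async function pickDevice(nav: NavigatorLike | undefined): Promise<Device> {
  // No navigator.gpu at all: the browser does not expose WebGPU.
  if (!nav || !nav.gpu) return "wasm";
  try {
    // requestAdapter() can resolve to null even when navigator.gpu exists.
    const adapter = await nav.gpu.requestAdapter();
    return adapter ? "webgpu" : "wasm";
  } catch {
    // Treat any failure during adapter lookup as "WebGPU unavailable".
    return "wasm";
  }
}

// Browser usage (assumes the @huggingface/transformers package; the model ID
// below is a placeholder, not the real repository name):
//
//   import { pipeline } from "@huggingface/transformers";
//   const device = await pickDevice(navigator);
//   const generator = await pipeline("text-generation", "<onnx-model-repo>", {
//     device,      // "webgpu" or "wasm"
//     dtype: "q4", // 4-bit quantized weights
//   });
```

Checking for an actual adapter rather than just the presence of `navigator.gpu` avoids picking WebGPU on machines where the API is exposed but no usable GPU is found.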
