Settings

Theme

Apriel-H1: Towards Efficient Enterprise Reasoning Models

arxiv.org

1 points by guiriduro a month ago · 1 comment

Reader

guiriduroOP a month ago

Apriel-H1-15b-Thinker-SFT uses incremental distillation from Apriel-Nemotron-15B-Thinker, selectively replacing less critical attention layers with linear Mamba blocks to reduce computational complexity while preserving reasoning quality.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection