Settings

Theme

Nemotron 3 Ultra: Open Moe Hybrid Mamba-Transformer for Agentic Reasoning [pdf]

research.nvidia.com

23 points by victormustar 19 days ago · 2 comments

Reader

throwa356262 19 days ago

Is this the one from Jensens Computex presentation the other day?

It is significantly bigger than Qwen for the same level of intelligence, but I think the key strength was inference speed.

2001zhaozhao 19 days ago

This model seems like a really big deal. Is this the biggest Western open-source AI model in the world (beating out Llama3 405B)?

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection