Settings

Theme

DeepSeek-V3-0324

huggingface.co

5 points by desideratum 9 months ago · 2 comments

Reader

reissbaker 9 months ago

It's "just" a minor version update, but in my own testing it seems much stronger than the original V3 — basically on par with R1 for the usual tricks I throw at LLMs, without needing <think> tokens.

I'm sure they're re-RL-training an R1-[minor bump] on top of this model, or perhaps even an R2; it'll be extremely strong when it comes out. For now I've swapped most of my usage to this new V3, since it's basically on-par for my use cases with R1 and doesn't require waiting for thinking tokens.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection