Settings

Theme

Deepseek R1 Zero learns to reason using reinforcement learning on base model [pdf]

github.com

6 points by virde a year ago · 1 comment

Reader

No comments yet.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection