Settings

Theme

Kat-Dev-32B, Kat-Coder with Scalable Agentic RL

kwaipilot.github.io

1 points by robert-zaremba 3 months ago · 1 comment

Reader

robert-zarembaOP 3 months ago

KAT-Dev-32B and KAT-Coder are optimized via several stages of training, including a mid-training stage, supervised fine-tuning (SFT) & reinforcement fine-tuning (RFT) stage and an large-scale agentic reinforcement learning (RL) stage.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection