Settings

Theme

Serverless RL: Faster, Cheaper and More Flexible RL Training

openpipe.ai

9 points by slewis 3 months ago · 3 comments

Reader

Arctic_fly 3 months ago

Interesting post. Did the difference in wall clock training time take the reduction in cold start time into account? Seems like that could be a significant factor for small jobs and negligible for large ones.

altryne1 3 months ago

Will the rate limits go higher? How about other models? Qwen 2.5 is nice but 3 is nicer

cmatrub 3 months ago

higher abstraction than Tinker, more flexible than OpenAI RFT. i like integration to production inference, so i can switch between training and inference for continuous learning.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection