Settings

Theme

An Intuitive Introduction to PPO and GRPO

mesuvash.github.io

5 points by mesuvash 2 months ago · 3 comments

Reader

thw20 a month ago

This is so amazing. What a masterpiece for intro to reinforcement learning in llm.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection