Settings

Theme

Reinforcement Learning – A Reference

jakubhalmes.substack.com

108 points by jac08h a year ago · 4 comments

Reader

jac08hOP a year ago

While studying for an RL course, I created a reference for several algorithms with a brief description of what limitations they solve. Example:

Problem: SARSA pushes q-values towards the current policy, but ideally we'd want optimal values. Solution: Use the best action in TD-target calculation -> Q-learning

Perhaps someone else will find it helpful!

  • hevomada a year ago

    Very cool write-up! I also took the course this semester. What a coincidence.

    Only wish you publicised it before the exam haha :-)

    492982

    • jac08hOP a year ago

      Haha, cool, thank you! I had some notes ready but didn't get around to finishing it sooner. Besides, I'm sure the course slides were much better material for exam prep ;)

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection