The Beginner's RL Playground

1 min read Original article ↗

Algorithm Settings

0.1

0.1

0.1

0.1

0.1

0.9

0.2

1.0

Interactive Simulation

Tip: Click to cycle state content (💎/☠️/🚧/empty).
Shift+Click to set start state (🏠).

Environment Settings

5

-0.1

10

-10

100

750

Action Values Q(s,a)

0.00

0.00

0.00

0.00

Action Probabilities π(a | s)

0.25

0.25

0.25

0.25

Learning Progress Plot