Settings

Theme

Proximal Policy Optimization with Clojure and PyTorch

clojurecivitas.org

2 points by wedesoft 14 days ago · 1 comment

Reader

wedesoftOP 14 days ago

A Clojure port of XinJingHao’s PPO implementation using libpython-clj2, PyTorch, and Quil. PPO is a reinforcement learning method which has become popular because it addresses the problem of stability. The PPO implementation is tested using the inverted pendulum problem.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection