Proximal Policy Optimization with Clojure and PyTorch

2 points by wedesoft 2 months ago · 1 comment

Reader

A Clojure port of XinJingHao’s PPO implementation using libpython-clj2, PyTorch, and Quil. PPO is a reinforcement learning method which has become popular because it addresses the problem of stability. The PPO implementation is tested using the inverted pendulum problem.

Settings

Proximal Policy Optimization with Clojure and PyTorch

Keyboard Shortcuts