monadicmonad Karma -1 Created 2 years ago Recent Submissions 1. ▲ Experimenting with policy gradient methods in Jax (github.com) 2 points · 7 months ago · 0 comments 2. ▲ Policy Evaluation in Grid World (github.com) 1 point · 1 year ago · 0 comments All submissions on HN · View profile on HN