Show HN: RS-Poker V5 The one with self learning multi-threaded async Agents

3 points by eclark 15 days ago · 2 comments · 1 min read

Reader

rs-poker has been my passion project for a long time. This release is huge because it's the first time that I know of for an open source poker bot/agent implementation to include all the state of the art.

- tokio based async exploration - rust slab allocation based tree structure for regret minimization - perfect hashing for faster hand ranking - a TUI via https://ratatui.rs/

Creating your own poker bot and having them compete in an arena should be less than 100 lines of code: https://docs.rs/rs_poker/latest/rs_poker/arena/index.html

I need more eyes on the implementation, and more attempts to make the algorithms and agents state of the art. I know I can't have found the optimal configurations and algorithms; I'd love for the open source community to prove me wrong.

There's one glaring limitation that I need to fix. Right now the CFR agents can't predict their opponents hands so all regret minimization is either using the exact hand (so pretty conservative) or random (so too wide). I have some ideas here but I need more data and more discussion.

direct github: https://github.com/elliottneilclark/rs-poker

zuzululu 15 days ago

is this pluribus ?

eclarkOP 15 days ago

No this doesn't currently use a model for valuation. It uses recursive exploration(with a stockfish like early terminition) and keeps track of the reward and regret at each node. With brute force on number of cores and compact memory layout I was able to get reasonable convergence with 6 way rings.
I do have some plans to release a follow on with some ml components ala Pluribus.

Settings

Show HN: RS-Poker V5 The one with self learning multi-threaded async Agents

Keyboard Shortcuts