How to run
make ./tic-tac-toe
What is it?
tic-tac-toe.c implements a reinforcement learning training loop that creates two capable machine tic-tac-toe players. It then lets you play against one of them, either as X or as O.
If you want to understand the logic behind the code, pages 6-10 in Richard Sutton's reinforcement learning book is sufficient introduction!