Contents

3. Planning

Planning in bandits: pure exploration, best arm identification. Planning in MDP: Monte Carlo Tree Search.

Practical session - Best arm identification

Forban (bandit library)

Practical session - MCTS on TicTacToe

Last updated on Mar 31, 2021