Contents

Planning

Planning in bandits: pure exploration and best arm identification. Planning in MDPs: Monte Carlo Tree Search.

Practical session - Best arm identification

Practical session - MCTS on TicTacToe

Answers - Best arm identification

Answers - MCTS on TicTacToe

Last updated on Feb 20, 2022