Search

Patrick Saux
Patrick Saux
  • Home
  • Research
  • Teaching
  • Projects
  • Talks
  • Posts
  • Contact
  • Light Dark Automatic
  • Teaching
Reinforcement Learning (CentraleSupelec M2 2020-2021)
  • 1. MDP
  • 2. Bandits
  • 3. Planning
  • 4. Deep Reinforcement Learning
  • Contents
  1. Home
  2. Teaching
  3. Reinforcement Learning (CentraleSupelec M2 2020-2021)
  4. 3. Planning

3. Planning

Planning in bandits: pure exploration, best arm identification. Planning in MDP: Monte Carlo Tree Search.

Practical session - Best arm identification

Forban (bandit library)

Practical session - MCTS on TicTacToe

Previous
2. Bandits
Next
4. Deep Reinforcement Learning

Last updated on Mar 31, 2021

Published with Wowchemy — the free, open source website builder that empowers creators.

Cite
Copy Download