From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits (NeurIPS 2021)

State-of-the-art randomised bandit algorithm with guarantees under weak assumptions.