1. Constrained Markov decision processes, Vol. 7;Altman,1999
2. Finite-time analysis of the multiarmed bandit problem;Auer;Machine Learning,2002
3. A simple model of herd behavior;Banerjee;Quarterly Journal of Economics,1992
4. Quickest change detection approach to optimal control in Markov decision processes with model changes;Banerjee,2017
5. Causality and batch reinforcement learning: complementary approaches to planning in unknown domains;Bannon,2020