1. Aaronson S (2018) Shadow tomography of quantum states. In: Proceedings of the 50th annual ACM SIGACT symposium on theory of computing, pp 325–338
2. Abbasi-Yadkori Y, Pál D, Szepesvári Cs (2011) Improved algorithms for linear stochastic bandits. In: Advances in neural information processing systems, vol 24. Curran Associates, Inc.
3. Abe N, Biermann AW, Long PM (2003) Reinforcement learning with immediate rewards and linear hypotheses. Algorithmica 37(4):263–293
4. Agrawal S, Goyal N (2013) Thompson sampling for contextual bandits with linear payoffs. In: International conference on machine learning, PMLR, pp 127–135
5. Auer P (2003) Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research 3:397–422