1. Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers
2. Ioana Bica , James Jordon , and Mihaela van der Schaar . 2020. Estimating the Effects of Continuous-valued Interventions using Generative Adversarial Networks. CoRR abs/2002.12326 ( 2020 ). arXiv:2002.12326https://arxiv.org/abs/2002.12326 Ioana Bica, James Jordon, and Mihaela van der Schaar. 2020. Estimating the Effects of Continuous-valued Interventions using Generative Adversarial Networks. CoRR abs/2002.12326 (2020). arXiv:2002.12326https://arxiv.org/abs/2002.12326
3. Counterfactual Reasoning and Learning Systems: The Example of Computational Advertising.;Bottou Léon;Journal of Machine Learning Research,2013
4. Moustapha Cisse , Piotr Bojanowski , Edouard Grave , Yann Dauphin , and Nicolas Usunier . 2017 . Parseval networks: Improving robustness to adversarial examples . In International Conference on Machine Learning. PMLR, 854–863 . Moustapha Cisse, Piotr Bojanowski, Edouard Grave, Yann Dauphin, and Nicolas Usunier. 2017. Parseval networks: Improving robustness to adversarial examples. In International Conference on Machine Learning. PMLR, 854–863.
5. Miroslav Dudík John Langford and Lihong Li. 2011. Doubly Robust Policy Evaluation and Learning. In ICML. Miroslav Dudík John Langford and Lihong Li. 2011. Doubly Robust Policy Evaluation and Learning. In ICML.