Top Challenges from the first Practical Online Controlled Experiments Summit

Author:

Gupta Somit1,Kohavi Ronny1,Tang Diane2,Xu Ya3,Andersen Reid4,Bakshy Eytan5,Cardin Niall2,Chandran Sumita6,Chen Nanyu3,Coey Dominic5,Curtis Mike2,Deng Alex1,Duan Weitao3,Forbes Peter7,Frasca Brian1,Guy Tommy1,Imbens Guido W.8,Saint Jacques Guillaume3,Kantawala Pranav2,Katsev Ilya9,Katzwer Moshe10,Konutgan Mikael5,Kunakova Elena9,Lee Minyong4,Lee MJ6,Liu Joseph11,McQueen James12,Najmi Amir2,Smith Brent12,Trehan Vivek10,Vermeer Lukas13,Walker Toby1,Wong Jeffrey7,Yashkov Igor9

Affiliation:

1. Microsoft

2. Google

3. LinkedIn

4. Airbnb

5. Facebook

6. Lyft

7. Netflix

8. Stanford

9. Yandex

10. Uber

11. Twitter

12. Amazon

13. Booking.com

Abstract

Online controlled experiments (OCEs), also known as A/B tests, have become ubiquitous in evaluating the impact of changes made to software products and services. While the concept of online controlled experiments is simple, there are many practical challenges in running OCEs at scale. To understand the top practical challenges in running OCEs at scale and encourage further academic and industrial exploration, representatives with experience in large-scale experimentation from thirteen different organizations (Airbnb, Amazon, Booking.com, Facebook, Google, LinkedIn, Lyft, Microsoft, Netflix, Twitter, Uber, Yandex, and Stanford University) were invited to the first Practical Online Controlled Experiments Summit. All thirteen organizations sent representatives. Together these organizations have tested more than one hundred thousand experiment treatments last year. Thirty-four experts from these organizations participated in the summit in Sunnyvale, CA, USA on December 13-14, 2018. While there are papers from individual organizations on some of the challenges and pitfalls in running OCEs at scale, this is the first paper to provide the top challenges faced across the industry for running OCEs at scale and some common solutions.

Publisher

Association for Computing Machinery (ACM)

Reference77 articles.

1. {Uber Marketplace} Marketplace Experimentation -- Vivek Trehan - YouTube: 2018. https://www.youtube.com/watch?v=IR000RqN7pw. Accessed: 2019-02-05. {Uber Marketplace} Marketplace Experimentation -- Vivek Trehan - YouTube: 2018. https://www.youtube.com/watch?v=IR000RqN7pw. Accessed: 2019-02-05.

2. 25 Cognitive Biases Home Page | 25 Cognitive Biases - "The Psychology of Human Misjudgment" http://25cognitivebiases.com/. Accessed: 2019-02-08. 25 Cognitive Biases Home Page | 25 Cognitive Biases - "The Psychology of Human Misjudgment" http://25cognitivebiases.com/. Accessed: 2019-02-08.

3. A/B Testing at Scale Tutorial Strata 2018: https://expplatform. com/2018StrataABtutorial/. Accessed: 2019-02- 05. A/B Testing at Scale Tutorial Strata 2018: https://expplatform. com/2018StrataABtutorial/. Accessed: 2019-02- 05.

4. AdvancedTopic_BestControlledExperiments.docx - Microsoft Word Online: https://onedrive.live.com/view.aspx?resid=8612090E610 871E4!323967&ithint=file%2Cdocx&app=Word&authk ey=!APyJuF_t0dOFj_M. Accessed: 2019-02-05. AdvancedTopic_BestControlledExperiments.docx - Microsoft Word Online: https://onedrive.live.com/view.aspx?resid=8612090E610 871E4!323967&ithint=file%2Cdocx&app=Word&authk ey=!APyJuF_t0dOFj_M. Accessed: 2019-02-05.

5. AdvancedTopic_OEC.docx - Microsoft Word Online: https://onedrive.live.com/view.aspx?resid=8612090E610 871E4!282179&ithint=file%2Cdocx&app=Word&authk ey=!ANFGOBrhVt91ODk. Accessed: 2019-02-05. AdvancedTopic_OEC.docx - Microsoft Word Online: https://onedrive.live.com/view.aspx?resid=8612090E610 871E4!282179&ithint=file%2Cdocx&app=Word&authk ey=!ANFGOBrhVt91ODk. Accessed: 2019-02-05.

Cited by 79 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Semiparametric estimation of long-term treatment effects;Journal of Econometrics;2023-12

2. The Price is Right: Removing A/B Test Bias in a Marketplace of Expirable Goods;Proceedings of the 32nd ACM International Conference on Information and Knowledge Management;2023-10-21

3. Statistical Challenges in Online Controlled Experiments: A Review of A/B Testing Methodology;The American Statistician;2023-09-08

4. Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring;Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2023-08-04

5. Semi-parametric estimation of treatment effects in randomised experiments;Journal of the Royal Statistical Society Series B: Statistical Methodology;2023-07-19

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3