Ensemble methods for testing a global null

Author:

Liu Yaowu1ORCID,Liu Zhonghua2ORCID,Lin Xihong3ORCID

Affiliation:

1. Southwestern University of Finance and Economics School of Statistics, , Chengdu , China

2. Columbia University Department of Biostatistics, , New York, NY , USA

3. Harvard University Department of Biostatistics and Department of Statistics, , Boston, MA , USA

Abstract

Abstract Testing a global null is a canonical problem in statistics and has a wide range of applications. In view of the fact that no uniformly most powerful test exists, prior and/or domain knowledge are commonly used to focus on a certain class of alternatives to improve the testing power. However, it is generally challenging to develop tests that are particularly powerful against a certain class of alternatives. In this paper, motivated by the success of ensemble learning methods for prediction or classification, we propose an ensemble framework for testing that mimics the spirit of random forests to deal with the challenges. Our ensemble testing framework aggregates a collection of weak base tests to form a final ensemble test that maintains strong and robust power for global nulls. We apply the framework to four problems about global testing in different classes of alternatives arising from whole-genome sequencing (WGS) association studies. Specific ensemble tests are proposed for each of these problems, and their theoretical optimality is established in terms of Bahadur efficiency. Extensive simulations and an analysis of a real WGS dataset are conducted to demonstrate the type I error control and/or power gain of the proposed ensemble tests.

Publisher

Oxford University Press (OUP)

Subject

Statistics, Probability and Uncertainty,Statistics and Probability

Reference48 articles.

1. Global testing under sparse alternatives: ANOVA, multiple comparisons and the higher criticism;Arias-Castro;The Annals of Statistics,2011

2. Stochastic comparison of tests;Bahadur;Annals of Mathematical Statistics,1960

3. The generalized higher criticism for testing SNP-set effects in genetic association studies;Barnett;Journal of the American Statistical Association,2017

4. Analytical p-value calculation for the higher criticism test in finite-d problems;Barnett;Biometrika,2014

5. Goodness-of-fit test statistics that dominate the Kolmogorov statistics;Berk;Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete,1979

全球学者库

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"全球学者库"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前全球学者库共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2023 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3