Similarity-based multimodal regression

Author:

Chen Andrew A1ORCID,Weinstein Sarah M2ORCID,Adebimpe Azeez34,Gur Ruben C45,Gur Raquel E45,Merikangas Kathleen R6,Satterthwaite Theodore D34,Shinohara Russell T78,Shou Haochang78

Affiliation:

1. Department of Public Health Sciences, Medical University of South Carolina , Charleston, SC 29425, USA

2. Department of Epidemiology and Biostatistics, Temple University College of Public Health , Philadelphia, PA 19122, USA

3. Penn Lifespan Informatics & Neuroimaging Center, Department of Psychiatry, University of Pennsylvania , Philadelphia, PA 19104, USA

4. Department of Psychiatry, University of Pennsylvania , Philadelphia, PA 19104, USA

5. Lifespan Brain Institute Penn Medicine and CHOP, University of Pennsylvania , Philadelphia, PA 19104, USA

6. Genetic Epidemiology Research Branch, Intramural Research Program, National Institute of Mental Health , Bethesda, MD 20892, USA

7. Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania , Philadelphia, PA 19104, USA

8. Center for Biomedical Image Computing and Analytics, University of Pennsylvania , Philadelphia, PA 19104, USA

Abstract

Summary To better understand complex human phenotypes, large-scale studies have increasingly collected multiple data modalities across domains such as imaging, mobile health, and physical activity. The properties of each data type often differ substantially and require either separate analyses or extensive processing to obtain comparable features for a combined analysis. Multimodal data fusion enables certain analyses on matrix-valued and vector-valued data, but it generally cannot integrate modalities of different dimensions and data structures. For a single data modality, multivariate distance matrix regression provides a distance-based framework for regression accommodating a wide range of data types. However, no distance-based method exists to handle multiple complementary types of data. We propose a novel distance-based regression model, which we refer to as Similarity-based Multimodal Regression (SiMMR), that enables simultaneous regression of multiple modalities through their distance profiles. We demonstrate through simulation, imaging studies, and longitudinal mobile health analyses that our proposed method can detect associations between clinical variables and multimodal data of differing properties and dimensionalities, even with modest sample sizes. We perform experiments to evaluate several different test statistics and provide recommendations for applying our method across a broad range of scenarios.

Funder

National Institute of Neurological Disorders and Stroke

National Multiple Sclerosis Society

National Institute of Mental Health

University of Pennsylvania Center for Biomedical Image Computing and Analytics

Publisher

Oxford University Press (OUP)

Subject

Statistics, Probability and Uncertainty,General Medicine,Statistics and Probability

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3