Abstract
As terabytes of multi-omics data are being generated, there is an ever-increasing need for methods facilitating the integration and interpretation of such data. Current multi-omics integration methods typically output lists, clusters, or subnetworks of molecules related to an outcome. Even with expert domain knowledge, discerning the biological processes involved is a time-consuming activity. Here we propose PathIntegrate, a method for integrating multi-omics datasets based on pathways, designed to exploit knowledge of biological systems and thus provide interpretable models for such studies. PathIntegrate employs single-sample pathway analysis to transform multi-omics datasets from the molecular to the pathway-level, and applies a predictive single-view or multi-view model to integrate the data. Model outputs include multi-omics pathways ranked by their contribution to the outcome prediction, the contribution of each omics layer, and the importance of each molecule in a pathway. Using semi-synthetic data we demonstrate the benefit of grouping molecules into pathways to detect signals in low signal-to-noise scenarios, as well as the ability of PathIntegrate to precisely identify important pathways at low effect sizes. Finally, using COPD and COVID-19 data we showcase how PathIntegrate enables convenient integration and interpretation of complex high-dimensional multi-omics datasets. PathIntegrate is available as an open-source Python package.
Funder
Wellcome Trust
Biotechnology and Biological Sciences Research Council
Medical Research Council
Foundation for the National Institutes of Health
Agence Nationale de la Recherche
National Heart, Lung, and Blood Institute
NIH
COPD Foundation
Publisher
Public Library of Science (PLoS)
Reference74 articles.
1. State of the Field in Multi-Omics Research: From Computational Needs to Data Mining and Sharing.;M Krassowski;Front Genet.,2020
2. Multi-omics Data Integration, Interpretation, and Its Application.;I Subramanian;Bioinformatics and Biology Insights.,2020
3. Metabolomics and multi-omics integration: A survey of computational methods and resources.;T Eicher;Metabolites.,2020
4. Prospects and challenges of multi-omics data integration in toxicology;S Canzler;Arch Toxicol,2020
5. Methods for the integration of multi-omics data: Mathematical aspects;M Bersanelli;BMC Bioinformatics,2016
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献