DECO: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling-Reference-Cited by-同舟云学术

DECO: decompose heterogeneous population cohorts for patient stratification and discovery of sample biomarkers using omic data profiling

Published:2019-03-01 Issue:19 Volume:35 Page:3651-3662
ISSN:1367-4803
Container-title:Bioinformatics
language:en
Short-container-title:

Author:

Campos-Laborie F J¹,Risueño A²,Ortiz-Estévez M²,Rosón-Burgo B¹,Droste C¹,Fontanillo C²,Loos R²,Sánchez-Santos J M¹,Trotter M W²,De Las Rivas J¹^ORCID

Affiliation:

1. Bioinformatics and Functional Genomics Group, Cancer Research Center (CiC-IMBCC, CSIC/USAL/IBSAL), Consejo Superior de Investigaciones Científicas (CSIC), University of Salamanca (USAL), Campus Miguel de Unamuno s/n, Salamanca, Spain

2. Celgene Institute for Translational Research Europe (CITRE), Parque Científico y Tecnológico Cartuja 93, Sevilla, Spain

Abstract

Abstract Motivation Patient and sample diversity is one of the main challenges when dealing with clinical cohorts in biomedical genomics studies. During last decade, several methods have been developed to identify biomarkers assigned to specific individuals or subtypes of samples. However, current methods still fail to discover markers in complex scenarios where heterogeneity or hidden phenotypical factors are present. Here, we propose a method to analyze and understand heterogeneous data avoiding classical normalization approaches of reducing or removing variation. Results DEcomposing heterogeneous Cohorts using Omic data profiling (DECO) is a method to find significant association among biological features (biomarkers) and samples (individuals) analyzing large-scale omic data. The method identifies and categorizes biomarkers of specific phenotypic conditions based on a recurrent differential analysis integrated with a non-symmetrical correspondence analysis. DECO integrates both omic data dispersion and predictor–response relationship from non-symmetrical correspondence analysis in a unique statistic (called h-statistic), allowing the identification of closely related sample categories within complex cohorts. The performance is demonstrated using simulated data and five experimental transcriptomic datasets, and comparing to seven other methods. We show DECO greatly enhances the discovery and subtle identification of biomarkers, making it especially suited for deep and accurate patient stratification. Availability and implementation DECO is freely available as an R package (including a practical vignette) at Bioconductor repository (http://bioconductor.org/packages/deco/). Supplementary information Supplementary data are available at Bioinformatics online.

Funder

Instituto de Salud Carlos III

Fondo Europeo de Desarrollo Regional

FEDER

Spanish Ministry MINECO

Torres-Quevedo Programme

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Link

http://academic.oup.com/bioinformatics/article-pdf/35/19/3651/30061524/btz148.pdf

Reference72 articles.

1. Intratumoral heterogeneity as a source of discordance in breast cancer biomarker classification;Allott;Breast Cancer Res,2016

2. Towards precision medicine;Ashley;Nat. Rev. Genet,2016

3. Subsample and half-sample methods;Babu;Ann. Inst. Statist. Math,1992

4. Specificity of phosphorylation responses to mitogen activated protein (MAP) kinase pathway inhibitors in melanoma cells;Basken;Mol. Cell Proteomics,2018

5. Stability of gene contributions and identification of outliers in multivariate analysis of microarray data;Baty;BMC Bioinformatics,2008

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Individualized Coexpression Network Strategies Employing Transcriptomic Data to Address Challenges in Stratification;2023-09-26

2. RAS-p110α signalling in macrophages is required for effective inflammatory response and resolution of inflammation;2023-08-18

3. Unraveling patient heterogeneity in complex diseases through individualized co-expression networks: a perspective;Frontiers in Genetics;2023-08-10

4. Heterogeneity-Preserving Discriminative Feature Selection for Subtype Discovery;2023-05-14

5. Genome-wide effect of non-optimal temperatures under anaerobic conditions on gene expression in Saccharomyces cerevisiae;Genomics;2022-07