
INSIDER: Interpretable sparse matrix decomposition for RNA expression data analysis

Published:2024-03-14 Issue:3 Volume:20 Page:e1011189
ISSN:1553-7404
Container-title:PLOS Genetics
language:en
Short-container-title:PLoS Genet

Author:

Zhao Kai, Huang Sen, Lin Cuichan, Sham Pak Chung, So Hon-Cheong, Lin Zhixiang

Abstract

RNA sequencing (RNA-Seq) is widely used to capture transcriptome dynamics across tissues, biological entities, and conditions. Currently, few or no methods can handle multiple biological variables (e.g., tissues/phenotypes) and their interactions simultaneously, while also achieving dimension reduction (DR). We propose INSIDER, a general and flexible statistical framework based on matrix factorization, which is freely available at https://github.com/kai0511/insider. INSIDER decomposes variation from different biological variables and their interactions into a shared low-rank latent space. In particular, it introduces the elastic net penalty to induce sparsity while considering the grouping effects of genes. It can achieve DR of high-dimensional data (of ≥ 3 dimensions), as opposed to conventional methods (e.g., PCA/NMF), which generally handle only 2D data (e.g., sample × expression). In addition, it enables computing 'adjusted' expression profiles for specific biological variables while controlling for variation from other variables. INSIDER is computationally efficient and accommodates missing data. In simulations, INSIDER performed similarly to or outperformed a close competing method, SDA, and it can handle complex missing data in RNA-Seq data. Moreover, unlike SDA, it can be used when the data cannot be structured into a tensor. Lastly, we demonstrate its usefulness via real data analysis, including clustering donors for disease subtyping, revealing neuro-development trajectory using the BrainSpan data, and uncovering biological processes contributing to variables of interest (e.g., disease status and tissue) and their interactions.
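
The idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation (see the repository linked above for that). It assumes a simplified model in which each sample's latent vector is the sum of a donor row and a tissue row, interaction terms are omitted, and the gene loading matrix carries an elastic net penalty. All names here (`D`, `T`, `V`, `lam`, `alpha`, and the three functions) are hypothetical and chosen only for illustration.

```python
import numpy as np

def elastic_net_penalty(V, lam, alpha):
    """Elastic net: lam * (alpha * ||V||_1 + (1 - alpha) / 2 * ||V||_F^2)."""
    return lam * (alpha * np.abs(V).sum() + 0.5 * (1.0 - alpha) * (V ** 2).sum())

def reconstruct(donor_idx, tissue_idx, D, T, V):
    """Predicted expression: (donor latent + tissue latent) @ gene loadings.

    D: (n_donors, K), T: (n_tissues, K), V: (K, n_genes).
    donor_idx / tissue_idx give the donor and tissue of each sample (row).
    """
    return (D[donor_idx] + T[tissue_idx]) @ V

def objective(Y, mask, donor_idx, tissue_idx, D, T, V, lam, alpha):
    """Squared error over observed entries only, plus the penalty on V."""
    resid = np.where(mask, Y - reconstruct(donor_idx, tissue_idx, D, T, V), 0.0)
    return 0.5 * (resid ** 2).sum() + elastic_net_penalty(V, lam, alpha)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_donors, n_tissues, n_genes, K = 4, 3, 6, 2
    D = rng.normal(size=(n_donors, K))
    T = rng.normal(size=(n_tissues, K))
    V = rng.normal(size=(K, n_genes))
    # Samples need not form a complete donor x tissue grid.
    donor_idx = np.array([0, 0, 1, 2, 3, 3])
    tissue_idx = np.array([0, 1, 1, 2, 0, 2])
    Y = reconstruct(donor_idx, tissue_idx, D, T, V)
    Y[1, 3] = np.nan            # a missing measurement
    mask = ~np.isnan(Y)         # True where expression was observed
    print(objective(Y, mask, donor_idx, tissue_idx, D, T, V, lam=0.1, alpha=0.5))
```

Because samples are indexed by donor and tissue rather than arranged on a full grid, this formulation does not require the data to form a tensor, and the mask lets missing entries drop out of the loss. A fitting routine would alternate updates of `D`, `T`, and `V` to minimise this objective; that step is left out here.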

Funder

Chinese University of Hong Kong

Faculty of Science, Chinese University of Hong Kong

Research Grants Council, University Grants Committee

Publisher

Public Library of Science (PLoS)

References: 44 articles.

1. RNA-Seq: a revolutionary tool for transcriptomics;Z Wang;Nature reviews genetics,2009

2. The genotype-tissue expression (GTEx) project;J Lonsdale;Nat Genet,2013

3. Transcriptional landscape of the prenatal human brain;JA Miller;Nature,2014

4. The psychencode project;S Akbarian;Nat Neurosci,2015

5. Transcriptomic analysis of autistic brain reveals convergent molecular pathology;I Voineagu;Nature,2011

Cited by 1 article.

1. scParser: sparse representation learning for scalable single-cell RNA sequencing data analysis;Genome Biology;2024-08-16