scDeepSort: a pre-trained cell-type annotation method for single-cell transcriptomics using deep learning with a weighted graph neural network

Author:

Shao Xin12ORCID,Yang Haihong34,Zhuang Xiang3,Liao Jie12,Yang Penghui1,Cheng Junyun1,Lu Xiaoyan15,Chen Huajun364,Fan Xiaohui1257ORCID

Affiliation:

1. Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China

2. iMedicine Lab, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Zhejiang University, Hangzhou 310058, China

3. College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China

4. Hangzhou Innovation Center, Zhejiang University, Hangzhou 310058, China

5. Innovation Center in Zhejiang University, State Key Laboratory of Component-Based Chinese Medicine, Hangzhou 310058, China

6. The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou 310003, China

7. Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou 310058, China

Abstract

Abstract Advances in single-cell RNA sequencing (scRNA-seq) have furthered the simultaneous classification of thousands of cells in a single assay based on transcriptome profiling. In most analysis protocols, single-cell type annotation relies on marker genes or RNA-seq profiles, resulting in poor extrapolation. Still, the accurate cell-type annotation for single-cell transcriptomic data remains a great challenge. Here, we introduce scDeepSort (https://github.com/ZJUFanLab/scDeepSort), a pre-trained cell-type annotation tool for single-cell transcriptomics that uses a deep learning model with a weighted graph neural network (GNN). Using human and mouse scRNA-seq data resources, we demonstrate the high performance and robustness of scDeepSort in labeling 764 741 cells involving 56 human and 32 mouse tissues. Significantly, scDeepSort outperformed other known methods in annotating 76 external test datasets, reaching an 83.79% accuracy across 265 489 cells in humans and mice. Moreover, we demonstrate the universality of scDeepSort using more challenging datasets and using references from different scRNA-seq technology. Above all, scDeepSort is the first attempt to annotate cell types of scRNA-seq data with a pre-trained GNN model, which can realize the accurate cell-type annotation without additional references, i.e. markers or RNA-seq profiles.

Funder

National Natural Science Foundation of China

Natural Science Foundation of Zhejiang Province

National Youth Top-notch Talent Support Program

Publisher

Oxford University Press (OUP)

Subject

Genetics

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3