MSI-XGNN: an explainable GNN computational framework integrating transcription- and methylation-level biomarkers for microsatellite instability detection

Author:

Cao Yang1, Wang Dan2, Wu Jin3, Yao Zhanxin3, Shen Si4, Niu Chao1, Liu Ying1, Zhang Pengcheng1, Wang Quannian5, Wang Jinhao1, Li Hua6, Wei Xi6, Wang Xinxing1, Dong Qingyang1

Affiliation:

1. Department of Environmental Medicine, Tianjin Institute of Environmental and Operational Medicine, Tianjin 300050, China

2. Department of Bioinformatics, Yicon (Beijing) Biomedical Technology Inc

3. Tianjin Institute of Environmental and Operational Medicine, Tianjin 300050, China

4. School and Hospital of Stomatology, Tianjin Medical University, Tianjin 300050, China

5. School of Basic Medicine, Jiamusi University

6. Department of Diagnostic and Therapeutic Ultrasonography, Tianjin Medical University Cancer Institute and Hospital, Tianjin 300060, China

Abstract

Microsatellite instability (MSI) is a hypermutator phenotype caused by DNA mismatch repair deficiency. MSI has been reported in various human cancers, particularly colorectal, gastric and endometrial cancers. MSI is a promising biomarker for cancer prognosis and immune checkpoint blockade immunotherapy. Several computational methods have been developed to detect MSI from next-generation sequencing data using DNA- or RNA-based approaches. Epigenetic mechanisms, such as DNA methylation, regulate gene expression and play critical roles in the development and progression of cancer. Here, we developed MSI-XGNN, a new computational framework for predicting MSI status using bulk RNA-sequencing and DNA methylation data. MSI-XGNN is an explainable deep learning model that combines a graph neural network (GNN) model, which extracts features from the gene-methylation probe network, with a CatBoost model, which classifies MSI status. MSI-XGNN, which requires tumor-only samples, performed comparably to two well-known methods that require tumor-normal paired sequencing data, MSIsensor and MANTIS, and better than several other tools. MSI-XGNN also showed good generalizability on independent validation datasets. MSI-XGNN identified six MSI markers constituting the optimal feature subset: four methylation probes (EPM2AIP1|MLH1:cg14598950, EPM2AIP1|MLH1:cg27331401, LNP1:cg05428436 and TSC22D2:cg15048832) and two genes (RPL22L1 and MSH4). All six markers were significantly associated with tumor microenvironment characteristics beneficial for immunotherapy, including tumor mutation burden, neoantigens and immune checkpoint molecules such as programmed cell death-1 and cytotoxic T-lymphocyte antigen-4. Overall, our study provides a powerful and explainable deep learning model for predicting MSI status and identifying MSI markers that can potentially be used for clinical MSI evaluation.

Funder

Tianjin Institute of Environmental and Operational Medicine

Tianjin Municipal Natural Science Foundation

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology, Information Systems
