GENESIS: Gene-Specific Machine Learning Models for Variants of Uncertain Significance Found in Catecholaminergic Polymorphic Ventricular Tachycardia and Long QT Syndrome-Associated Genes

Author:

Draelos Rachel L.12ORCID,Ezekian Jordan E.3ORCID,Zhuang Farica1ORCID,Moya-Mendez Mary E.3ORCID,Zhang Zhushan3ORCID,Rosamilia Michael B.3ORCID,Manivannan Perathu K.R.3ORCID,Henao Ricardo45ORCID,Landstrom Andrew P.36ORCID

Affiliation:

1. Department of Computer Science, Trinity College of Arts and Sciences (R.L.D., F.Z.), Duke University.

2. Medical Scientist Training Program (R.L.D.), Duke University School of Medicine, Durham, NC.

3. Department of Pediatrics, Division of Cardiology (J.E.Z., M.E.M.-M., Z.Z., M.B.R., P.K.R.M., A.P.L.), Duke University School of Medicine, Durham, NC.

4. Department of Electrical and Computer Engineering, Pratt School of Engineering (R.H.), Duke University.

5. Department of Biostatistics and Bioinformatics (R.H.), Duke University School of Medicine, Durham, NC.

6. Department of Cell Biology (A.P.L.), Duke University School of Medicine, Durham, NC.

Abstract

Background: Cardiac channelopathies such as catecholaminergic polymorphic tachycardia and long QT syndrome predispose patients to fatal arrhythmias and sudden cardiac death. As genetic testing has become common in clinical practice, variants of uncertain significance (VUS) in genes associated with catecholaminergic polymorphic ventricular tachycardia and long QT syndrome are frequently found. The objective of this study was to predict pathogenicity of catecholaminergic polymorphic ventricular tachycardia-associated RYR2 VUS and long QT syndrome-associated VUS in KCNQ1 , KCNH2 , and SCN5A by developing gene-specific machine learning models and assessing them using cross-validation, cellular electrophysiological data, and clinical correlation. Methods: The GENe-specific EnSemble grId Search framework was developed to identify high-performing machine learning models for RYR2 , KCNQ1 , KCNH2 , and SCN5A using variant- and protein-specific inputs. Final models were applied to datasets of VUS identified from ClinVar and exome sequencing. Whole cell patch clamp and clinical correlation of selected VUS was performed. Results: The GENe-specific EnSemble grId Search models outperformed alternative methods, with area under the receiver operating characteristics up to 0.87, average precisions up to 0.83, and calibration slopes as close to 1.0 (perfect) as 1.04. Blinded voltage-clamp analysis of HEK293T cells expressing 2 predicted pathogenic variants in KCNQ1 each revealed an ≈80% reduction of peak Kv7.1 current compared with WT. Normal Kv7.1 function was observed in KCNQ1-V241I HEK cells as predicted. Though predicted benign, loss of Kv7.1 function was observed for KCNQ1-V106D HEK cells. Clinical correlation of 9/10 variants supported model predictions. Conclusions: Gene-specific machine learning models may have a role in post-genetic testing diagnostic analyses by providing high performance prediction of variant pathogenicity.

Publisher

Ovid Technologies (Wolters Kluwer Health)

Subject

Physiology (medical),Cardiology and Cardiovascular Medicine

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3