Prospection of Peptide Inhibitors of Thrombin from Diverse Origins Using a Machine Learning Pipeline

Author:

Balakrishnan Nivedha1,Katkar Rahul1,Pham Peter V.1,Downey Taylor2,Kashyap Prarthna1,Anastasiu David C.2,Ramasubramanian Anand K.1ORCID

Affiliation:

1. Department of Chemical and Materials Engineering, San José State University, San Jose, CA 95192, USA

2. Department of Computer Science and Engineering, Santa Clara University, Santa Clara, CA 95053, USA

Abstract

Thrombin is a key enzyme involved in the development and progression of many cardiovascular diseases. Direct thrombin inhibitors (DTIs), with their minimum off-target effects and immediacy of action, have greatly improved the treatment of these diseases. However, the risk of bleeding, pharmacokinetic issues, and thrombotic complications remain major concerns. In an effort to increase the effectiveness of the DTI discovery pipeline, we developed a two-stage machine learning pipeline to identify and rank peptide sequences based on their effective thrombin inhibitory potential. The positive dataset for our model consisted of thrombin inhibitor peptides and their binding affinities (KI) curated from published literature, and the negative dataset consisted of peptides with no known thrombin inhibitory or related activity. The first stage of the model identified thrombin inhibitory sequences with Matthew’s Correlation Coefficient (MCC) of 83.6%. The second stage of the model, which covers an eight-order of magnitude range in KI values, predicted the binding affinity of new sequences with a log room mean square error (RMSE) of 1.114. These models also revealed physicochemical and structural characteristics that are hidden but unique to thrombin inhibitor peptides. Using the model, we classified more than 10 million peptides from diverse sources and identified unique short peptide sequences (<15 aa) of interest, based on their predicted KI. Based on the binding energies of the interaction of the peptide with thrombin, we identified a promising set of putative DTI candidates. The prediction pipeline is available on a web server.

Funder

College of Engineering, San José State University

Publisher

MDPI AG

Subject

Bioengineering

Reference103 articles.

1. Defending the priority of “remarkable researches”: The discovery of fibrin ferment;Marcum;Hist. Philos. Life Sci.,1998

2. Mechanisms coupling thrombin to metastasis and tumorigenesis;Remiker;Thromb. Res.,2018

3. Thrombin Inhibition by Argatroban: Potential Therapeutic Benefits in COVID-19;Aliter;Cardiovasc. Drugs Ther.,2021

4. Directing thrombin;Lane;Blood,2005

5. Thrombin formation;Mann;Chest,2003

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3