Efficient prediction of peptide self-assembly through sequential and graphical encoding

Author:

Liu Zihan123ORCID,Wang Jiaqi45,Luo Yun15,Zhao Shuang45,Li Wenbin45,Li Stan Z23

Affiliation:

1. College of Computer Science and Technology, Zhejiang University , Hangzhou 310058 , China

2. AI Lab , Research Center for Industries of the Future, , Hangzhou 310030 , China

3. Westlake University , Research Center for Industries of the Future, , Hangzhou 310030 , China

4. Research Center for the Industries of the Future, Westlake University , Hangzhou 310030 , China

5. School of Engineering, Westlake University , Hangzhou 310030 , China

Abstract

Abstract In recent years, there has been an explosion of research on the application of deep learning to the prediction of various peptide properties, due to the significant development and market potential of peptides. Molecular dynamics has enabled the efficient collection of large peptide datasets, providing reliable training data for deep learning. However, the lack of systematic analysis of the peptide encoding, which is essential for artificial intelligence-assisted peptide-related tasks, makes it an urgent problem to be solved for the improvement of prediction accuracy. To address this issue, we first collect a high-quality, colossal simulation dataset of peptide self-assembly containing over 62 000 samples generated by coarse-grained molecular dynamics. Then, we systematically investigate the effect of peptide encoding of amino acids into sequences and molecular graphs using state-of-the-art sequential (i.e. recurrent neural network, long short-term memory and Transformer) and structural deep learning models (i.e. graph convolutional network, graph attention network and GraphSAGE), on the accuracy of peptide self-assembly prediction, an essential physiochemical process prior to any peptide-related applications. Extensive benchmarking studies have proven Transformer to be the most powerful sequence-encoding-based deep learning model, pushing the limit of peptide self-assembly prediction to decapeptides. In summary, this work provides a comprehensive benchmark analysis of peptide encoding with advanced deep learning models, serving as a guide for a wide range of peptide-related predictions such as isoelectric points, hydration free energy, etc.

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China and Project

Center of Synthetic Biology and Integrated Bioengineering of Westlake University

Research Center for Industries of the Future at Westlake University

Zhejiang Postdoctoral Science Foundation

National Natural Science Foundation of China

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Reference44 articles.

1. Introduction to Peptides and Proteins

2. Amyloid inspired self-assembled peptide nanofibers;Cinar;Biomacromolecules,2012

3. Near infrared fluorescent peptide nanoparticles for enhancing esophageal cancer therapeutic efficacy. Nature;Fan;Communications,2018

4. Self-assembling peptide semiconductors;Tao;Science,2017

5. Molecular self-assembly and applications of designer peptide amphiphiles;Zhao;Chem Soc Rev,2010

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3