PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability-Reference-Cited by-同舟云学术

PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability

Published:2022-10-16 Issue:20 Volume:23 Page:12385
ISSN:1422-0067
Container-title:International Journal of Molecular Sciences
language:en
Short-container-title:IJMS

Author:

Wang Huiqing¹,Wang Juan¹,Feng Zhipeng¹,Li Ying¹,Zhao Hong¹^ORCID

Affiliation:

1. College of Information and Computer, Taiyuan University of Technology, Taiyuan 030024, China

Abstract

Peptide detectability is defined as the probability of identifying a peptide from a mixture of standard samples, which is a key step in protein identification and analysis. Exploring effective methods for predicting peptide detectability is helpful for disease treatment and clinical research. However, most existing computational methods for predicting peptide detectability rely on a single information. With the increasing complexity of feature representation, it is necessary to explore the influence of multivariate information on peptide detectability. Thus, we propose an ensemble deep learning method, PD-BertEDL. Bidirectional encoder representations from transformers (BERT) is introduced to capture the context information of peptides. Context information, sequence information, and physicochemical information of peptides were combined to construct the multivariate feature space of peptides. We use different deep learning methods to capture the high-quality features of different categories of peptides information and use the average fusion strategy to integrate three model prediction results to solve the heterogeneity problem and to enhance the robustness and adaptability of the model. The experimental results show that PD-BertEDL is superior to the existing prediction methods, which can effectively predict peptide detectability and provide strong support for protein identification and quantitative analysis, as well as disease treatment.

Funder

Youth Project of Shanxi Province

Publisher

MDPI AG

Link

https://www.mdpi.com/1422-0067/23/20/12385/pdf

Reference48 articles.

1. A computational approach toward label-free protein quantification using predicted peptide detectability;Tang;Bioinformatics,2006

2. Analysis of intrinsic peptide detectability via integrated label-free and SRM-based absolute quantitative proteomics;Jarnuczak;J. Proteome Res.,2016

3. Computational prediction of proteotypic peptides for quantitative proteomics;Mallick;Nat. Biotechnol.,2007

4. Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation;Lu;Nat. Biotechnol.,2007

5. Definition and characterization of a “trypsinosome” from specific peptide characteristics by nano-HPLC-MS/MS and in silico analysis of complex protein mixtures;Bihan;J. Proteome Res.,2004

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. DeepPD: A Deep Learning Method for Predicting Peptide Detectability Based on Multi-feature Representation and Information Bottleneck;Interdisciplinary Sciences: Computational Life Sciences;2024-12-11

2. An in silico scheme for optimizing the enzymatic acquisition of natural biologically active peptides based on machine learning and virtual digestion;Analytica Chimica Acta;2024-04

3. Knowledge-based Dual External Attention Network for peptide detectability prediction;Knowledge-Based Systems;2024-02