A deep learning-based method for the prediction of DNA interacting residues in a protein-Reference-Cited by-同舟云学术

A deep learning-based method for the prediction of DNA interacting residues in a protein

Published:2022-08-08 Issue:5 Volume:23 Page:
ISSN:1467-5463
Container-title:Briefings in Bioinformatics
language:en
Short-container-title:

Author:

Patiyal Sumeet¹,Dhall Anjali¹,Raghava Gajendra P S¹^ORCID

Affiliation:

1. Department of Computational Biology, Indraprastha Institute of Information Technology , Okhla Phase 3, New Delhi-110020, India

Abstract

Abstract DNA–protein interaction is one of the most crucial interactions in the biological system, which decides the fate of many processes such as transcription, regulation and splicing of genes. In this study, we trained our models on a training dataset of 646 DNA-binding proteins having 15 636 DNA interacting and 298 503 non-interacting residues. Our trained models were evaluated on an independent dataset of 46 DNA-binding proteins having 965 DNA interacting and 9911 non-interacting residues. All proteins in the independent dataset have less than 30% of sequence similarity with proteins in the training dataset. A wide range of traditional machine learning and deep learning (1D-CNN) techniques-based models have been developed using binary, physicochemical properties and Position-Specific Scoring Matrix (PSSM)/evolutionary profiles. In the case of machine learning technique, eXtreme Gradient Boosting-based model achieved a maximum area under the receiver operating characteristics (AUROC) curve of 0.77 on the independent dataset using PSSM profile. Deep learning-based model achieved the highest AUROC of 0.79 on the independent dataset using a combination of all three profiles. We evaluated the performance of existing methods on the independent dataset and observed that our proposed method outperformed all the existing methods. In order to facilitate scientific community, we developed standalone software and web server, which are accessible from https://webs.iiitd.edu.in/raghava/dbpred.

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Link

https://academic.oup.com/bib/article-pdf/23/5/bbac322/45937347/bbac322.pdf

Reference71 articles.

1. DNA-protein interaction: identification, prediction and data analysis;Emamjomeh;Mol Biol Rep,2019

2. An overview of the prediction of protein DNA-binding sites;Si;Int J Mol Sci,2015

3. DNA deformation energy as an indirect recognition mechanism in protein-DNA interactions;Aeling;IEEE/ACM Trans Comput Biol Bioinform,2007

4. A comparison study for DNA motif modeling on protein binding microarray;Wong;IEEE/ACM Trans Comput Biol Bioinform,2016

5. Prediction of RNA-binding amino acids from protein and RNA sequences;Choi;BMC Bioinformatics,2011

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep-HPI-pred: An R-Shiny applet for network-based classification and prediction of Host-Pathogen protein-protein interactions;Computational and Structural Biotechnology Journal;2024-12

2. Deciphering the Language of Protein-DNA Interactions: A Deep Learning Approach Combining Contextual Embeddings and Multi-Scale Sequence Modeling;Journal of Molecular Biology;2024-11

3. Advances in the Application of Protein Language Modeling for Nucleic Acid Protein Binding Site Prediction;Genes;2024-08-18

4. A hybrid approach for predicting transcription factors;Frontiers in Bioinformatics;2024-07-25

5. Prediction of anti-freezing proteins from their evolutionary profile;2024-04-30