Flexible protein–protein docking with a multitrack iterative transformer-Reference-Cited by-同舟云学术

Flexible protein–protein docking with a multitrack iterative transformer

Published:2024-01-23 Issue:2 Volume:33 Page:
ISSN:0961-8368
Container-title:Protein Science
language:en
Short-container-title:Protein Science

Author:

Chu Lee‐Shin¹^ORCID,Ruffolo Jeffrey A.²,Harmalkar Ameya¹,Gray Jeffrey J.¹²^ORCID

Affiliation:

1. Department of Chemical and Biomolecular Engineering Johns Hopkins University Baltimore Maryland USA

2. Program in Molecular Biophysics Johns Hopkins University Baltimore Maryland USA

Abstract

AbstractConventional protein–protein docking algorithms usually rely on heavy candidate sampling and reranking, but these steps are time‐consuming and hinder applications that require high‐throughput complex structure prediction, for example, structure‐based virtual screening. Existing deep learning methods for protein–protein docking, despite being much faster, suffer from low docking success rates. In addition, they simplify the problem to assume no conformational changes within any protein upon binding (rigid docking). This assumption precludes applications when binding‐induced conformational changes play a role, such as allosteric inhibition or docking from uncertain unbound model structures. To address these limitations, we present GeoDock, a multitrack iterative transformer network to predict a docked structure from separate docking partners. Unlike deep learning models for protein structure prediction that input multiple sequence alignments, GeoDock inputs just the sequences and structures of the docking partners, which suits the tasks when the individual structures are given. GeoDock is flexible at the protein residue level, allowing the prediction of conformational changes upon binding. On the Database of Interacting Protein Structures (DIPS) test set, GeoDock achieves a 43% top‐1 success rate, outperforming all other tested methods. However, in the standard DIPS train/test splits, we discovered contamination of close homologs in the training set. After decontaminating the training set, the success rate is 31%. On the DB5.5 test set and a benchmark dataset of antibody–antigen complexes, GeoDock outperforms the deep learning models trained using the same dataset but falls behind most of the conventional methods and AlphaFold‐Multimer. GeoDock attains an average inference speed of under 1 s on a single GPU, enabling its application in large‐scale structure screening. Although binding‐induced conformational changes are still a challenge owing to limited training and evaluation data, our architecture sets up the foundation to capture this backbone flexibility. Code and a demonstration Jupyter notebook are available at https://github.com/Graylab/GeoDock.

Publisher

Wiley

Subject

Molecular Biology,Biochemistry

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/pro.4862

Reference69 articles.

1. ICM?A new method for protein modeling and design: Applications to docking and structure prediction from the distorted native conformation

2. The Rosetta All-Atom Energy Function for Macromolecular Modeling and Design

3. UniProt: the Universal Protein knowledgebase

4. Accurate prediction of protein structures and interactions using a three-track neural network

5. Accounting for loop flexibility during protein-protein docking

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Molecular Modeling Methods in the Development of Affine and Specific Protein-Binding Agents;Biochemistry (Moscow);2024-08

2. Quantum Chemistry-Based Protein–Protein Docking without Empirical Parameters;Journal of Chemical Theory and Computation;2024-06-07

3. Review and Comparative Analysis of Methods and Advancements in Predicting Protein Complex Structure;Interdisciplinary Sciences: Computational Life Sciences;2024-06

4. ABAG-docking benchmark: a non-redundant structure benchmark dataset for antibody–antigen computational docking;Briefings in Bioinformatics;2024-01-22