Riemannian geometry for efficient analysis of protein dynamics data

Author:

Diepeveen Willem1ORCID,Esteve-Yagüe Carlos1ORCID,Lellmann Jan2,Öktem Ozan3ORCID,Schönlieb Carola-Bibiane1

Affiliation:

1. Faculty of Mathematics, University of Cambridge, CB3 0WA Cambridge, United Kingdom

2. Institute of Mathematics and Image Computing, University of Lübeck, 23562 Lübeck, Germany

3. Department of Mathematics, Kungliga Tekniska högskolan (KTH), 114 28 Stockholm, Sweden

Abstract

An increasingly common viewpoint is that protein dynamics datasets reside in a nonlinear subspace of low conformational energy. Ideal data analysis tools should therefore account for such nonlinear geometry. The Riemannian geometry setting can be suitable for a variety of reasons. First, it comes with a rich mathematical structure to account for a wide range of geometries that can be modeled after an energy landscape. Second, many standard data analysis tools developed for data in Euclidean space can be generalized to Riemannian manifolds. In the context of protein dynamics, a conceptual challenge comes from the lack of guidelines for constructing a smooth Riemannian structure based on an energy landscape. In addition, computational feasibility in computing geodesics and related mappings poses a major challenge. This work considers these challenges. The first part of the paper develops a local approximation technique for computing geodesics and related mappings on Riemannian manifolds in a computationally feasible manner. The second part constructs a smooth manifold and a Riemannian structure that is based on an energy landscape for protein conformations. The resulting Riemannian geometry is tested on several data analysis tasks relevant for protein dynamics data. In particular, the geodesics with given start- and end-points approximately recover corresponding molecular dynamics trajectories for proteins that undergo relatively ordered transitions with medium-sized deformations. The Riemannian protein geometry also gives physically realistic summary statistics and retrieves the underlying dimension even for large-sized deformations within seconds on a laptop.

Funder

UKRI | Engineering and Physical Sciences Research Council

Wellcome Innovator Awards

European Union Horizon 2020 research and innovation programme

Swedish Foundation for Strategic Research

Swedish Research Council

Philip Leverhulme Prize

the Royal Society Wolfson Fellowship

the Cantab Capital Institute for the Mathematics of Information

the Alan Turing Institute

Publisher

Proceedings of the National Academy of Sciences

Reference54 articles.

1. C. Kolloff S. Olsson Machine learning in molecular dynamics simulations of biomolecular systems. arXiv [Preprint] (2022). http://arxiv.org/abs/2205.03135 (Accessed 1 August 2023).

2. J. Rydzewski M. Chen O. Valsson Manifold learning in atomistic simulations: A conceptual review. arXiv [Preprint] (2023). http://arxiv.org/abs/2303.08486 (Accessed 1 August 2023).

3. K. Jamali D. Kimanius S. H. Scheres “A graph neural network approach to automated model building in cryo-EM maps” in The Eleventh International Conference on Learning Representations (2023).

4. CryoDRGN: reconstruction of heterogeneous cryo-EM structures using neural networks

5. A survey of coarse-grained methods for modeling protein conformational transitions

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3