Automated assembly of molecular mechanisms at scale from text mining and curated databases-Reference-Cited by-同舟云学术

Automated assembly of molecular mechanisms at scale from text mining and curated databases

Published:2022-08-31 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Bachman John A.^ORCID,Gyori Benjamin M.^ORCID,Sorger Peter K.^ORCID

Abstract

ABSTRACTThe analysis of ‘omic data depends heavily on machine-readable information about protein interactions, modifications, and activities. Key resources include protein interaction networks, databases of post-translational modifications, and curated models of gene and protein function. Software systems that read primary literature can potentially extend and update such resources while reducing the burden on human curators, but machine-reading software systems have a high error rate. Here we describe an approach to precisely assemble molecular mechanisms at scale using natural language processing systems and the Integrated Network and Dynamical Reasoning Assembler (INDRA). INDRA identifies overlaps and redundancies in information extracted from published papers and pathway databases and uses probability models to reduce machine reading errors. INDRA enables the automated creation of high-quality, non-redundant corpora for use in data analysis and causal modeling. We demonstrate the use of INDRA in extending protein-protein interaction databases and explaining co-dependencies in the Cancer Dependency Map.

Publisher

Cold Spring Harbor Laboratory

Reference97 articles.

1. CLARINET: Efficient learning of dynamic network models from literature;Bioinforma. Adv,2021

2. Complex Event Extraction using DRUM;ACL-IJCNLP,2015

3. powerlaw: A Python Package for Analysis of Heavy-Tailed Distributions

4. Event-based text mining for biology and functional genomics

5. Gene Ontology: tool for the unification of biology

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Technologies for whole‐cell modeling: Genome‐wide reconstruction of a cell in silico;Development, Growth & Differentiation;2023-11-08

2. Nociceptor neuroimmune interactomes reveal cell type- and injury-specific inflammatory pain pathways;2023-02-03

3. Prediction and Curation of Missing Biomedical Identifier Mappings with Biomappings;2022-12-02