Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks-Reference-Cited by-同舟云学术

Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks

Published:2019-12 Issue:S16 Volume:20 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Zhang Canlin,Biś Daniel,Liu Xiuwen,He Zhe

Abstract

Abstract Background In recent years, deep learning methods have been applied to many natural language processing tasks to achieve state-of-the-art performance. However, in the biomedical domain, they have not out-performed supervised word sense disambiguation (WSD) methods based on support vector machines or random forests, possibly due to inherent similarities of medical word senses. Results In this paper, we propose two deep-learning-based models for supervised WSD: a model based on bi-directional long short-term memory (BiLSTM) network, and an attention model based on self-attention architecture. Our result shows that the BiLSTM neural network model with a suitable upper layer structure performs even better than the existing state-of-the-art models on the MSH WSD dataset, while our attention model was 3 or 4 times faster than our BiLSTM model with good accuracy. In addition, we trained “universal” models in order to disambiguate all ambiguous words together. That is, we concatenate the embedding of the target ambiguous word to the max-pooled vector in the universal models, acting as a “hint”. The result shows that our universal BiLSTM neural network model yielded about 90 percent accuracy. Conclusion Deep contextual models based on sequential information processing methods are able to capture the relative contextual information from pre-trained input word embeddings, in order to provide state-of-the-art results for supervised biomedical WSD tasks.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

http://link.springer.com/content/pdf/10.1186/s12859-019-3079-8.pdf

Reference26 articles.

1. Savova GK, Coden AR, Sominsky IL, Johnson R, Ogren PV, Groen PCd, Chute CG. Word sense disambiguation across two domains: Biomedical literature and clinical notes. J Biomed Inform. 2008; 41(6):1088–100. https://doi.org/10.1016/j.jbi.2008.02.003.

2. Navigli R. Word sense disambiguation: A survey. ACM Comput Surv (CSUR). 2009; 41(2):10.

3. Liu H, Teller V, Friedman C. Research paper: A multi-aspect comparison study of supervised word sense disambiguation. J Am Med Inform Assoc JAMIA. 2004; 11 4:320–31.

4. Xu H, Markatou M, Dimova R, Liu H, Friedman C. Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues. BMC Bioinformatics. 2006; 7:334.

5. Wang Y, Zheng K, Xu H, Mei Q. Interactive medical word sense disambiguation through informed learning. J Am Med Inform Assoc. 2018; 25(7):800–8.

Cited by 21 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A novel deep learning rainfall–runoff model based on Transformer combined with base flow separation;Hydrology Research;2024-05-01

2. Investigation of Medical Image Technology Based on Big Data Neuroscience in Exercise Rehabilitation;Current Medical Imaging Formerly Current Medical Imaging Reviews;2024-04-02

3. Improving Semantic Information Retrieval Using Multinomial Naive Bayes Classifier and Bayesian Networks;Information;2023-05-03

4. Extraction of Unstructured Electronic Healthcare Records using Natural Language Processing;2023 International Conference on Networking and Communications (ICNWC);2023-04-05

5. Multi-Head Self-Attention Gated-Dilated Convolutional Neural Network for Word Sense Disambiguation;IEEE Access;2023