Affiliation:
1. Superintelligence Creative Research Laboratory, Electronics and Telecommunications Research Institute, Daejeon, Republic of Korea
Abstract
This study introduces CR‐M‐SpanBERT, a coreference resolution (CR) model that uses multiple embedding‐based SpanBERT (span bidirectional encoder representations from transformers) for antecedent recognition in natural language (NL) text. Information extraction studies aim to extract knowledge from NL text autonomously and cost‐effectively; however, the extracted information may not represent knowledge accurately owing to ambiguous entities. We therefore propose a CR model that identifies mentions referring to the same entity in NL text. CR requires understanding both the syntax and semantics of the NL text simultaneously, so multiple embeddings are generated that can encode syntactic and semantic information for each word. We evaluate the effectiveness of CR‐M‐SpanBERT by comparing it with a model that uses SpanBERT as the language model, as in previous CR studies. The results demonstrate that the proposed deep neural network achieves high recognition accuracy for extracting antecedents from NL text and requires fewer epochs than the conventional SpanBERT approach to reach an average F1 score above 75%.
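The abstract does not give implementation details, but the core idea of fusing multiple per-word embeddings (carrying syntactic and semantic information) with a SpanBERT-style contextual encoder can be illustrated with a minimal sketch. The code below assumes PyTorch; the class name MultiEmbeddingSpanEncoder, the use of POS-tag embeddings as the extra syntactic signal, and the start/end span representation are all hypothetical choices, not the authors' actual architecture.

```python
import torch
import torch.nn as nn

class MultiEmbeddingSpanEncoder(nn.Module):
    """Hypothetical sketch: fuse a contextual (e.g., SpanBERT-like) token
    embedding with an additional learned syntactic embedding, then build
    span representations from the fused vectors."""

    def __init__(self, contextual_dim=768, num_pos_tags=50, pos_dim=64):
        super().__init__()
        # Extra embedding table carrying syntactic information
        # (assumption: POS tags stand in for the "multiple embeddings").
        self.pos_embedding = nn.Embedding(num_pos_tags, pos_dim)
        # Project the concatenated vector back to the contextual hidden size.
        self.fuse = nn.Linear(contextual_dim + pos_dim, contextual_dim)

    def forward(self, contextual_states, pos_tag_ids, span_starts, span_ends):
        # contextual_states: (batch, seq_len, contextual_dim) from the language model
        # pos_tag_ids:       (batch, seq_len) integer syntactic tag ids
        pos_states = self.pos_embedding(pos_tag_ids)
        fused = torch.tanh(
            self.fuse(torch.cat([contextual_states, pos_states], dim=-1))
        )
        # Simple span representation: concatenate start- and end-token vectors.
        batch_index = torch.arange(fused.size(0)).unsqueeze(1)
        starts = fused[batch_index, span_starts]
        ends = fused[batch_index, span_ends]
        return torch.cat([starts, ends], dim=-1)

# Usage with random tensors (shapes only; no pretrained weights are loaded).
encoder = MultiEmbeddingSpanEncoder()
hidden = torch.randn(2, 16, 768)          # stand-in for SpanBERT outputs
pos_ids = torch.randint(0, 50, (2, 16))   # stand-in syntactic tags
span_starts = torch.tensor([[0, 3], [5, 7]])
span_ends = torch.tensor([[2, 4], [6, 9]])
span_reprs = encoder(hidden, pos_ids, span_starts, span_ends)
print(span_reprs.shape)  # torch.Size([2, 2, 1536])
```

In a full CR pipeline, span representations like these would be scored in pairs to decide which earlier mention (antecedent), if any, each span refers to; that scoring stage is omitted here.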
Funder
Electronics and Telecommunications Research Institute
Cited by
1 article.