Submitted:
27 November 2023
Posted:
28 November 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
- The unveiling of the Advanced Forest-Based Tagging Framework (AFTF), an avant-garde tagging paradigm employing a forest structure to succinctly encapsulate relation triples within medical texts.
- The formulation of a comprehensive medical RE model that harnesses the power of AFTF to autonomously predict relation triples.
- Empirical validation of AFTF’s efficacy, underscored by its exceptional performance on two dedicated medical datasets and its versatility across three diverse public datasets, thereby evidencing its proficiency in managing overlapping relational structures with ease.
2. Related Work
3. Proposed Framework
3.1. Advanced Forest-Based Tagging Framework (AFTF)
| Algorithm 1 Relation-to-Forest Transformation |
|
3.1.1. Advanced Handling of ELS Sentences
- Part 1 () signifies the word’s position within entity node e, using the “” (Begin, Inside, End, Single) system.
- Part 2 () relates to the edge between e and its parent in B, indicating root or sibling relationships, or showcasing the child’s position and role in the entity pair.
- Part 3 () and Part 4 () denote the relationship between e and its left and right children in B, respectively, or indicate the absence of such children.
3.1.2. Handling ILS Sentences in AFTF
3.2. From Tags to Triples in AFTF
3.3. Joint Relation Extraction Model
3.3.1. Text Embedding
3.3.2. Encoder
3.3.3. Decoder
3.3.4. Loss Function
4. Experiments
4.1. Experimental Setup
4.2. Main Results
4.3. Efficiency of AFTF-based Models
4.4. Ablation Study
5. Conclusions
References
- Gehui Shen, Zhi-Hong Deng, Ting Huang, and Xi Chen. Learning to compose over tree structures via pos tags for sentence representation. Expert Systems with Applications 2020, 141, 112917. ISSN 0957-4174. [CrossRef]
- Richard Socher, Andrej Karpathy, Quoc V. Le, Christopher D. Manning, and Andrew Y. Ng. Grounded compositional semantics for finding and describing images with sentences. Transactions of the Association for Computational Linguistics 2014, 2, 207–208. ISSN 2307-387X. [CrossRef]
- Hao Fei, Meishan Zhang, and Donghong Ji. Cross-lingual semantic role labeling with high-quality translated training corpus. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 7014–7026, 2020a.
- Shengqiong Wu, Hao Fei, Leigang Qu, Wei Ji, and Tat-Seng Chua. Next-gpt: Any-to-any multimodal llm, 2023a.
- Kai Sheng Tai, Richard Socher, and Christopher D. Manning. Improved semantic representations from tree-structured long short-term memory networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, 2015. [CrossRef]
- Hao Fei, Yafeng Ren, and Donghong Ji. Retrofitting structure-aware transformer language model for end tasks. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, pages 2151–2161, 2020b.
- Baohua Wang, Junlian Huang, Haihong Zheng, and Hui Wu. Semi-supervised recursive autoencoders for social review spam detection. In 2016 12th International Conference on Computational Intelligence and Security (CIS). IEEE, December 2016. [CrossRef]
- D. Zeng, K. Liu, S. Lai, G. Zhou, and J. Zhao. Relation classification via convolutional deep neural network. in Proceedings of COLING, pages 2335–2344, 2014.
- M. Miwa and M. Bansal. End-to-end relation extraction using lstms on sequences and tree structures. in Proceedings of ACL, pages 1105–1116, 2016.
- S. Zheng, F. Wang, H. Bao, Y. Hao, P. Zhou, and B. Xu. Joint extraction of entities and relations based on a novel tagging scheme. in Proceedings of ACL, pages 1227–1236, 2017.
- X. Zeng, D. Zeng, S. He, K. Liu, and J. Zhao. Extracting relational facts by an end-to-end neural model with copy mechanism. in Proceedings of ACL, pages 506–514, 2018.
- D. Dai, X. Xiao, Y. Lyu, S. Dou, Q. She, and H. Wang. Joint extraction of entities and overlapping relations using position-attentive sequence labeling. in Proceedings of AAAI, pages 6300–6308, 2019. [CrossRef]
- Hao Fei, Shengqiong Wu, Jingye Li, Bobo Li, Fei Li, Libo Qin, Meishan Zhang, Min Zhang, and Tat-Seng Chua. Lasuie: Unifying information extraction with latent adaptive structure-aware generative language model. In Proceedings of the Advances in Neural Information Processing Systems, NeurIPS 2022, pages 15460–15475, 2022a.
- Y. Wang, B. Yu, Y. Zhang, T. Liu, H. Zhu, and L. Sun. Tplinker: single-stage joint extraction of entities and relations through token pair linking. in Proceedings of COLING, pages 1572–1582, 2020.
- Hao Fei, Fei Li, Bobo Li, and Donghong Ji. Encoder-decoder based unified semantic role labeling with label-aware syntax. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 12794–12802, 2021a. [CrossRef]
- Y. Xu, L. Mou, G. Li, Y. Chen, H. Peng, and Z. Jin. Classifying relations via long short term memory networks along shortest dependency paths. in Proceedings of EMNLP, pages 1785–1794, 2015.
- N. T. Vu, H. Adel, P. Gupta, and H. Schutze. Combining recurrent and convolutional neural networks for relation classification. in Proceedings of ACL, pages 534–539, 2016.
- A. Katiyar and C. Cardie. Investigating lstms for joint extraction of opinion entities and relations. in Proceedings of ACL, pages 919–929, 2016.
- Z. Wei, J. Su, Y. Wang, Y. Tian, and Y. Chang. A novel cascade binary tagging framework for relational triple extraction. in Proceedings of ACL, pages 1476–1488, 2020.
- T. Nayak and H. Tou. Effective modeling of encoder-decoder architecture for joint entity and relation extraction. in Proceedings of AAAI, pages 8528–8535, 2020.
- D. Zeng, H. Zhang, and Q. Liu. Copymtl: copy mechanism for joint extraction of entities and relations with multi-task learning. in Proceedings of AAAI, pages 9507–9514, 2020. [CrossRef]
- F. Li, Y. Zhang, and D. Ji. Joint models for extracting adverse drug events from biomedical text. in Proceedings of IJCAI, pages 2838–2844, 201.
- F. Li, M. Zhang, G. Fu, and D. Ji. A neural joint model for entity and relation extraction from biomedical text. bmc bioinform. 2017, 18, 198:1–198:11. [CrossRef]
- T. Qi, S. Qiu, X. Shen, H. Chen, S. Yang, and H. Wen. et al., "kemre: knowledge-enhanced medical relation extraction for chinese medicine instructions, " j. In Biomed. Informatics, vol. 120,. 103834, 2021. [CrossRef]
- . Devlin, M. W. Chang, K. Lee, and K. Toutanova. Bert: pre-training of deep bidirectional transformers for language understanding. in Proceedings of ACL, pages 4171–4186, 2019.
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. In Advances in neural information processing systems, pages 5998–6008, 2017.
- S. Hochreiter and J. Schmidhuber. Long short-term memory. 1735-1780,, 1997.
- H. Gurulingappa, A. Mateen, J. Fluck Angus, M. Hofmann-Apitius, and L. Toldo. Development of a benchmark corpus to support the automatic extraction of drug-related adverse effects from medical case reports. In J, pages 885–892. Biomed. Informatics, vol. 45, no. 5, 2012. [CrossRef]
- T. Guan, H. Zan, X. Zhou, H. Xu, and K. Zhang. CMeIE: construction and evaluation of Chinese medical information extraction dataset. In proceedings of NLPCC, 2020.
- S. Riedel, L. Yao, and A. McCallum. Modeling relations and their mentions without labeled text. in Proceedings of ECML-PKDD, pages 148–163, 2010.
- C. Gardent, A. Shimorina, S. Narayan, and L. Perez-Beltrachini. Creating training corpora for nlg micro-planners. in Proceedings of ACL, pages 179–188, 2017.
- S. Li, W. He, Y. Shi, W. Jiang, H. Liang, and Y. Jiang. Duie: a large-scale chinese dataset for information extraction. in Proceedings of NLPCC, pages 791–800, 2019.
- Z. Yan, C. Zhang, J. Fu, Q. Zhang, and Z. Wei. A partition filter network for joint entity and relation extraction. in Proceedings of EMNLP, pages 185–197, 2021.
- G. Bekoulis, J. Deleu, T. Demeester, and C. Develder. Joint entity recognition and relation extraction as a multi-head selection problem. Expert Systems with Applications, pages 34–45, 2018a. [CrossRef]
- G. Bekoulis, J. Deleu, T. Demeester, and C. Develder. Adversarial training for multi-context joint entity and relation extraction. in Proceedings of EMNLP, pages 2830–2836, 2018b.
- T. Tran and R. Kavuluru. Neural metric learning for fast end-to-end relation extraction. CoRR, 1905, 2019.
- J. Wang and W. Lu. Two are better than one: Joint entity and relation extraction with table-sequence encoders. in Proceedings of EMNLP, pages 1706–1721, 2020.
- T. J. Fu, P. H. Li, and W. Y. Ma. Graphrel: modeling text as relational graphs for joint entity and relation extraction. in Proceedings of ACL, pages 1409–1418, 2019.
- N. Zhang, M. Chen, Z. Bi, X. Liang, L. Li, and X. Shang. et al., "cblue: a chinese biomedical language understanding evaluation. in Proceedings of ACL, pages 7888–7915, 2022.
- Shengqiong Wu, Hao Fei, Yafeng Ren, Donghong Ji, and Jingye Li. Learn from syntax: Improving pair-wise aspect and opinion terms extraction with rich syntactic knowledge. In Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, pages 3957–3963, 2021.
- Ling Zhuang, Hao Fei, and Po Hu. Knowledge-enhanced event relation extraction via event ontology prompt. Inf. Fusion 2023, 100, 101919. [CrossRef]
- Shengqiong Wu, Hao Fei, Fei Li, Meishan Zhang, Yijiang Liu, Chong Teng, and Donghong Ji. Mastering the explicit opinion-role interaction: Syntax-aided neural transition system for unified opinion role labeling. In Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, pages 11513–11521, 2022. [CrossRef]
- Shengqiong Wu, Hao Fei, Wei Ji, and Tat-Seng Chua. Cross2StrA: Unpaired cross-lingual image captioning with cross-lingual cross-modal structure-pivoted alignment. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2593–2608, 2023b.
- Hao Fei, Yue Zhang, Yafeng Ren, and Donghong Ji. Latent emotion memory for multi-label emotion classification. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 7692–7699, 2020c.
- Hao Fei, Yafeng Ren, Yue Zhang, Donghong Ji, and Xiaohui Liang. Enriching contextualized language model from knowledge graph for biomedical information extraction. Briefings in Bioinformatics, 22(3), 2021b. [CrossRef]
- Junshan Wang, Zhicong Lu, Guojia Song, Yue Fan, Lun Du, and Wei Lin. Tag2vec: Learning tag representations in tag networks. In The World Wide Web Conference, WWW ’19. ACM, May 2019. [CrossRef]
- Shengqiong Wu, Hao Fei, Yixin Cao, Lidong Bing, and Tat-Seng Chua. Information screening whilst exploiting! multimodal relation extraction with feature denoising and multimodal topic modeling. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 14734–14751, 2023c.
- Hao Fei, Yafeng Ren, and Donghong Ji. Boundaries and edges rethinking: An end-to-end neural model for overlapping entity relation extraction. Information Processing & Management 2020, 57, 102311. [CrossRef]
- Jingye Li, Hao Fei, Jiang Liu, Shengqiong Wu, Meishan Zhang, Chong Teng, Donghong Ji, and Fei Li. Unified named entity recognition as word-word relation classification. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 10965–10973, 2022. [CrossRef]
- Jingye Li, Kang Xu, Fei Li, Hao Fei, Yafeng Ren, and Donghong Ji. MRN: A locally and globally mention-based reasoning network for document-level relation extraction. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 1359–1370, 2021.
- Fengqi Wang, Fei Li, Hao Fei, Jingye Li, Shengqiong Wu, Fangfang Su, Wenxuan Shi, Donghong Ji, and Bo Cai. Entity-centered cross-document relation extraction. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 9871–9881, 2022.
- Hu Cao, Jingye Li, Fangfang Su, Fei Li, Hao Fei, Shengqiong Wu, Bobo Li, Liang Zhao, and Donghong Ji. OneEE: A one-stage framework for fast overlapping and nested event extraction. In Proceedings of the 29th International Conference on Computational Linguistics, pages 1953–1964, 2022.
- Wenxuan Shi, Fei Li, Jingye Li, Hao Fei, and Donghong Ji. Effective token graph modeling using a novel labeling strategy for structured sentiment analysis. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4232–4241, 2022.
- Hao Fei, Shengqiong Wu, Yafeng Ren, Fei Li, and Donghong Ji. Better combine them together! integrating syntactic constituency and dependency representations for semantic role labeling. In Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, pages 549–559, 2021c.
- Wenhao Zhu, Tengjun Yao, Wu Zhang, and Baogang Wei. Part-of-speech-based long short-term memory network for learning sentence representations. IEEE Access, 2019; 7, 51810–51816. ISSN 2169-3536. [CrossRef]
- Hao Fei, Shengqiong Wu, Yafeng Ren, and Meishan Zhang. Matching structure for dual learning. In Proceedings of the International Conference on Machine Learning, ICML, pages 6373–6391, 2022b.
| Dataset | EPO | ELS | ILS | Overlap Samples |
ELS Ratio |
|---|---|---|---|---|---|
| ADE [28] | 118 | 1,216 | 159 | 1,391 | 0.874 |
| CMeIE [29] | 381 | 8,805 | 457 | 9,213 | 0.956 |
| NYT [30] | 17,004 | 10,740 | 2,006 | 25,422 | 0.422 |
| WebNLG [31] | 622 | 2,894 | 1,294 | 3,957 | 0.731 |
| DuIE [32] | 15,672 | 94,891 | 11,780 | 109,675 | 0.865 |
| Model | Encoder | Prec | Rec | F1 | ||||
|---|---|---|---|---|---|---|---|---|
| ADE | ||||||||
| Neural Joint [22] | L | 64.0 | 62.9 | 63.4 | ||||
| Multi-head [34] | L | 72.1 | 77.2 | 74.5 | ||||
| Multi-head + AT [35] | L | - | - | 75.5 | ||||
| Rel-Metric [36] | L+C | 77.4 | 77.3 | 77.3 | ||||
| Table-Sequence [37] | ALB | - | - | 80.1 | ||||
| PFN [33] | Bb | - | - | 80.0 | ||||
| AFTF-Med (Ours) | Bb | 83.1 | 81.3 | 82.1 | ||||
| CMeIE | ||||||||
NovelTagging [10]
|
Bb | 51.4 | 17.1 | 25.6 | ||||
GraphRel-1p [38]
|
Bb+G | 31.2 | 26.0 | 28.4 | ||||
GraphRel-2p [38]
|
Bb+G | 28.5 | 23.1 | 25.5 | ||||
CasRel [19]
|
Bb | 53.5 | 28.2 | 37.0 | ||||
| ER+RE [39] | ALB | - | - | 47.6 | ||||
| AFTF-Med (Ours) | Bb | 55.6 | 45.5 | 50.1 |
| Model | Encoder | Prec | Rec | F1 | ||||
| NYT | ||||||||
NovelTagging [10]
|
L | 62.4 | 31.7 | 42.0 | ||||
CopyRE-Mul [11]
|
L | 61.0 | 56.6 | 58.7 | ||||
GraphRel-2p [38]
|
L+G | 63.9 | 60.0 | 61.9 | ||||
| PA [12] | L | 49.4 | 59.1 | 53.8 | ||||
| CopyMTL-Mul [21] | L | 75.7 | 68.7 | 72.0 | ||||
NovelTagging [10]
|
Bb | 89.0 | 55.6 | 69.3 | ||||
CopyRE-Mul [11]
|
Bb | 39.1 | 36.5 | 37.8 | ||||
GraphRel-2p [38]
|
Bb+G | 82.5 | 57.9 | 68.1 | ||||
CasRel [19]
|
Bb | 89.7 | 89.5 | 89.6 | ||||
| AFTF-LSTM (Ours) | L | 66.5 | 76.3 | 71.1 | ||||
| AFTF-BERT (Ours) | Bb | 89.7 | 88.0 | 88.9 | ||||
| WebNLG | ||||||||
NovelTagging [10]
|
L | 52.5 | 19.3 | 28.3 | ||||
CopyRE-Mul [11]
|
L | 37.7 | 36.4 | 37.1 | ||||
GraphRel-2p [38]
|
L+G | 44.7 | 41.1 | 42.9 | ||||
| CopyMTL-Mul [21] | L | 58.0 | 54.9 | 56.4 | ||||
| TPLinker [14] | Bb | 88.9 | 84.5 | 86.7 | ||||
| AFTF-LSTM (Ours) | L | 83.8 | 66.0 | 73.8 | ||||
| AFTF-BERT (Ours) | Bb | 89.1 | 83.0 | 86.2 | ||||
| DuIE | ||||||||
NovelTagging [10]
|
Bb | 75.0 | 38.0 | 50.4 | ||||
GraphRel-1p [38]
|
Bb+G | 52.2 | 23.9 | 32.8 | ||||
GraphRel-2p [38]
|
Bb+G | 41.1 | 25.8 | 31.8 | ||||
CaseRel [19]
|
Bb | 75.7 | 80.0 | 77.8 | ||||
| AFTF-BERT (Ours) | Bb | 75.7 | 80.6 | 78.0 |
| Metrics | w/o Group | w/o Bidirectional | w/o Multi-head | AFTF-BERT | |
| F1-EPO | 74.0 | 90.2 | 90.2 | 91.2 | |
| F1-ELS | 76.2 | 84.5 | 85.5 | 87.3 | |
| F1-ILS | 68.3 | 74.7 | 81.8 | 85.5 | |
| F1-All | 81.9 | 88.2 | 87.7 | 88.9 | |
| Decoder Params | 71.3 | 48.0 | 68.5 | 48.9 | |
| Training Time | - | - | 2125.0 | 1739.0 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

