Submitted:
26 November 2024
Posted:
27 November 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
- This paper applies entity type information to guide relation extraction which can further improve the performance of relationship classification and avoid the ambiguity in the sentence.
- An entity type fusing framework is proposed, which can utilize entity type information and construct a deep representation of word vectors carrying entity type information through self-attention mechanism.
- A new dataset construction format is proposed which can store entity type information in our experiments, and this approach can effectively improve the performance of relationship extraction. We conduct experiments on three public datasets, and the weighted F1 value of our model achieves an absolute improvement by +1.5%, +1.0%, +1.2% on CL, NYT, and SciERC dataset respectively.
2. Related Work
3. Problem Statements
4. Method
4.1. BERT Embedder
4.2. Transformer Fusing
4.3. Relation Classifier
5. Experiments
5.1. Experimental Setup
5.2. Implementation Details
5.3. Experimental Result
5.4. Ablation Study
6. Conclusion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A.
Appendix A.1. Examples of Ambiguous Sentences
| Somewhat chastened by his retreat in the polls , Mr. Blair acknowledged that Britons had turned against him in part over accusations that he led them into a war in <e1>Iraq</e1> on dubious legal grounds and on the false premise that Saddam <e2>Hussein</e2> presented a direct threat because of a supposed arsenal of unconventional weapons that was never found . |
| type: people; type: deceased_person |
| ✓Our Model: place_of_death(,) |
| ✗R-BERT Model: place_of_birth(,) |
| Somewhat chastened by his retreat in the polls , Mr. Blair acknowledged that Britons had turned against him in part over accusations that he led them into a war in <e1>Iraq</e1> on dubious legal grounds and on the false premise that Saddam <e2>Hussein</e2> presented a direct threat because of a supposed arsenal of unconventional weapons that was never found. |
| type: people; type: person |
| ✓Our Model: place_of_birth(,) |
| ✓R-BERT Model: place_of_birth(,) |
| Kerry <e1>Packer</e1>, who became Australia ’s richest man by turning a magazine and television inheritance worth millions into a diverse business worth billions , died yesterday in <e2>Sydney</e2>. |
| type: people; type: deceased_person |
| ✓Our Model: place_of_death(,) |
| ✗R-BERT Model: place_lived(,) |
| Kerry <e1>Packer</e1>, who became Australia ’s richest man by turning a magazine and television inheritance worth millions into a diverse business worth billions , died yesterday in <e2>Sydney</e2>. |
| type: people; type: person |
| ✗Our Model: place_of_birth(,) |
| ✓R-BERT Model: place_lived(,) |
| With an eye to enlivening some furniture in my home office , I have been stocking up on 1960 ’s and 70 ’s fabrics from Retro Age Vintage Fabric , a dealer in <e1>Victoria</e1>, <e2>Australia</e2>, that sells everything from trippy Art Nouveau-style patterns to Lilly Pulitzer-style florals . |
| type: location; type: country |
| ✓Our Model: administrative_divisions(,) |
| ✓R-BERT Model: administrative_divisions(,) |
| With an eye to enlivening some furniture in my home office , I have been stocking up on 1960 ’s and 70 ’s fabrics from Retro Age Vintage Fabric , a dealer in <e1>Victoria</e1>, <e2>Australia</e2>, that sells everything from trippy Art Nouveau-style patterns to Lilly Pulitzer-style florals. |
| type: location; type: location |
| ✓Our Model: contains(,) |
| ✗R-BERT Model: administrative_divisions(,) |
| The husband , Joseph Vione , 43 , who had been living in Garden City , says in papers filed last week in State Supreme Court in <e1><e2>Manhattan</e2></e1> that the Rev . Thomas K. Tewell , 56 , the senior pastor of the Fifth Avenue Presbyterian Church in Midtown Manhattan , used confidential information obtained during marriage counseling to seduce Mr. Vione ’s wife , Rachel , 42 . |
| type: location; type: location |
| ✓Our Model: contains(,) |
| ✗R-BERT Model: neighborhood_of(,) |
| The husband , Joseph Vione , 43 , who had been living in Garden City , says in papers filed last week in State Supreme Court in <e1><e2>Manhattan</e2></e1> that the Rev . Thomas K. Tewell , 56 , the senior pastor of the Fifth Avenue Presbyterian Church in Midtown Manhattan , used confidential information obtained during marriage counseling to seduce Mr. Vione ’s wife , Rachel , 42 . |
| type: location; type: neighborhood |
| ✓Our Model: neighborhood_of(,) |
| ✓R-BERT Model: neighborhood_of(,) |
| New <e1>Zealand</e1> : <e2>Marlborough</e2>, Hawke ’s Bay and Central Otago Despite having vines dating to 1819 , New Zealand wines were not known globally until the mid-1980 ’s , when their vibrant , minerally , tropical-fruit-filled sauvignon blanc set the wine world abuzz . |
| type: location; type: country |
| ✓Our Model: administrative_divisions(,) |
| ✗R-BERT Model: country(,) |
| New <e1>Zealand</e1> : <e2>Marlborough</e2>, Hawke ’s Bay and Central Otago Despite having vines dating to 1819 , New Zealand wines were not known globally until the mid-1980 ’s , when their vibrant , minerally , tropical-fruit-filled sauvignon blanc set the wine world abuzz . |
| type: location; type: location |
| ✓Our Model: contains(,) |
| ✗R-BERT Model: country(,) |
| But from downtown ’s Little Tokyo , home to many izakaya hole-in-the-walls , to West Los <e1><e2> Angeles</e2></e1>, where a new-wave izakaya serves duck breast marinated in sake along with Basque sheep ’s milk cheese , Los Angeles may have the most inventive permutations of izakaya-style restaurants in the United States . |
| type: location; type: location |
| ✓Our Model: contains(,) |
| ✗R-BERT Model: neighborhood_of(,) |
| But from downtown ’s Little Tokyo , home to many izakaya hole-in-the-walls , to West Los <e1><e2> Angeles</e2></e1>, where a new-wave izakaya serves duck breast marinated in sake along with Basque sheep ’s milk cheese , Los Angeles may have the most inventive permutations of izakaya-style restaurants in the United States . |
| type: location; type: neighborhood |
| ✓Our Model: neighborhood_of(,) |
| ✓R-BERT Model: neighborhood_of(,) |
| 1 <e1>Bali</e1> Suspects Elude Capture Three men wanted in connection with bombings in Bali that killed 222 people over three years , narrowly escaped capture in the last two days , two in the Philippines and one in <e2>Indonesia</e2>, according to officials in those countries . |
| type: location; type: location |
| ✓Our Model: contains(,) |
| ✗R-BERT Model: administrative_divisions(,) |
| 1 <e1>Bali</e1> Suspects Elude Capture Three men wanted in connection with bombings in Bali that killed 222 people over three years , narrowly escaped capture in the last two days , two in the Philippines and one in <e2>Indonesia</e2>, according to officials in those countries . |
| type: location; type: country |
| ✓Our Model: administrative_divisions(,) |
| ✓R-BERT Model: administrative_divisions(,) |
| An exhibition of Mr. Parr ’s images opened at Danziger Projects in Chelsea this week , showing parallels between staged photography of models wearing designer collections and candid ones of people he encountered in <e1>Dakar</e1>, <e2>Senegal</e2> and Cuba . |
| type: location; type: location |
| ✓Our Model: contains(,) |
| ✓R-BERT Model: capital(,) |
| An exhibition of Mr. Parr ’s images opened at Danziger Projects in Chelsea this week , showing parallels between staged photography of models wearing designer collections and candid ones of people he encountered in <e1>Dakar</e1>, <e2>Senegal</e2> and Cuba . |
| type: location; type: country |
| ✓Our Model: capital(,) |
| ✗R-BERT Model: capital(,) |
| By contrast , cities in the export-oriented Guangdong <e1>Province</e1> in southeastern <e2>China</e2> raised monthly minimum wages this summer by 18 percent , to 100 a month , after factories reported that they had one million more jobs than workers to fill them . |
| type: location; type: location |
| ✓Our Model: contains(,) |
| ✗R-BERT Model: administrative_divisions(,) |
| By contrast , cities in the export-oriented Guangdong <e1>Province</e1> in southeastern <e2>China</e2> raised monthly minimum wages this summer by 18 percent , to 100 a month , after factories reported that they had one million more jobs than workers to fill them . |
| type: location; type: country |
| ✓Our Model: administrative_divisions(,) |
| ✓R-BERT Model: administrative_divisions(,) |
| Leading the parade is actually the Duke of <e1>Saxony</e1> -LRB- below -RRB- , whose suit and that of his mount were made by one of <e2>Germany</e2>’s leading armorers , Kunz Lochner , in 1548 . |
| type: location; type: location |
| ✓Our Model: contains(,) |
| ✗R-BERT Model: administrative_divisions(,) |
| Leading the parade is actually the Duke of <e1>Saxony</e1> -LRB- below -RRB- , whose suit and that of his mount were made by one of <e2>Germany</e2>’s leading armorers , Kunz Lochner , in 1548 . |
| type: location; type: country |
| ✓Our Model: administrative_divisions(,) |
| ✓R-BERT Model: administrative_divisions(,) |
| <e1>Canada</e1> can thank British <e2>Columbia</e2> for much of its talent . |
| type: location; type: country |
| ✓Our Model: administrative_divisions(,) |
| ✗R-BERT Model: country(,) |
| <e1>Canada</e1> can thank British <e2>Columbia</e2> for much of its talent . |
| type: location; type: location |
| ✓Our Model: contains(,) |
| ✗R-BERT Model: country(,) |
| Even Larry <e1>Page</e1>, the <e2>Google</e2> co-founder who enjoyed rock-star treatment at the World Economic Forum in Davos , Switzerland , seemed to have reverted to regular-guy status in Sun Valley – as regular as you can be with billions of dollars of Google stock . |
| type: business; type: company_shareholder |
| ✓Our Model: major_shareholder_of(,) |
| ✗R-BERT Model: company(,) |
| Even Larry <e1>Page</e1>, the <e2>Google</e2> co-founder who enjoyed rock-star treatment at the World Economic Forum in Davos , Switzerland , seemed to have reverted to regular-guy status in Sun Valley – as regular as you can be with billions of dollars of Google stock . |
| type: business; type: person |
| ✓Our Model: company(,) |
| ✓R-BERT Model: company(,) |
References
- Pujara, J.; Miao, H.; Getoor, L.; Cohen, W. Knowledge Graph Identification. The Semantic Web – ISWC 2013; Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K., Eds.; Springer Berlin Heidelberg: Berlin, Heidelberg, 2013; pp. 542–557. [Google Scholar]
- Dong, L.; Wei, F.; Zhou, M.; Xu, K. Question Answering over Freebase with Multi-Column Convolutional Neural Networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers); Association for Computational Linguistics: Beijing, China, 2015; pp. 260–269. [Google Scholar] [CrossRef]
- Zhang, F.; Yuan, N.J.; Lian, D.; Xie, X.; Ma, W.Y. Collaborative Knowledge Base Embedding for Recommender Systems. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Association for Computing Machinery: New York, NY, USA, 2016; pp. 353–362. [Google Scholar] [CrossRef]
- Zelenko, D.; Aone, C.; Richardella, A. Kernel Methods for Relation Extraction. Journal of Machine Learning Research 2003, 3, 1083–1106. [Google Scholar]
- Chan, Y.S.; Roth, D. Exploiting syntactico-semantic structures for relation extraction. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011, pp. 551–560.
- Mintz, M.; Bills, S.; Snow, R.; Jurafsky, D. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP; Association for Computational Linguistics: Suntec, Singapore, 2009; pp. 1003–1011. [Google Scholar]
- Zeng, D.; Liu, K.; Lai, S.; Zhou, G.; Zhao, J. Relation Classification via Convolutional Deep Neural Network. In Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers; Dublin City University and Association for Computational Linguistics: Dublin, Ireland, 2014; pp. 2335–2344. [Google Scholar]
- Xu, Y.; Mou, L.; Li, G.; Chen, Y.; Peng, H.; Jin, Z. Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Paths. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing; Association for Computational Linguistics: Lisbon, Portugal, 2015; pp. 1785–1794. [Google Scholar] [CrossRef]
- Miwa, M.; Bansal, M. End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers); Association for Computational Linguistics: Berlin, Germany, 2016; pp. 1105–1116. [Google Scholar] [CrossRef]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers); Association for Computational Linguistics: Minneapolis, Minnesota, 2019; pp. 4171–4186. [Google Scholar] [CrossRef]
- Wu, S.; He, Y. Enriching pre-trained language model with entity information for relation classification. Proceedings of the 28th ACM international conference on information and knowledge management, 2019, pp. 2361–2364.
- Hoffmann, R.; Zhang, C.; Ling, X.; Zettlemoyer, L.; Weld, D.S. Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies; Association for Computational Linguistics: Portland, Oregon, USA, 2011; pp. 541–550. [Google Scholar]
- Zheng, H.; Wen, R.; Chen, X.; Yang, Y.; Zhang, Y.; Zhang, Z.; Zhang, N.; Qin, B.; Xu, M.; Zheng, Y. PRGC: Potential relation and global correspondence based joint relational triple extraction. arXiv preprint arXiv:2106.09895 2021. arXiv:2106.09895.
- Fu, T.J.; Ma, W.Y. GraphRel: Modeling Text as Relational Graphs for Joint Entity and Relation Extraction. ACL, 2019.
- Riedel, S.; Yao, L.; Mccallum, A.K. Modeling relations and their mentions without labeled text. Machine Learning and Knowledge Discovery in Databases, European Conference, ECML PKDD 2010, Barcelona, Spain, September 20-24, 2010, Proceedings, Part III, 2010.
- Subakan, C.; Ravanelli, M.; Cornell, S.; Bronzi, M.; Zhong, J. Attention is all you need in speech separation. ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2021, pp. 21–25.
- Bollacker, K. Freebase : A collaboratively created graph database for structuring human knowledge. Proc. SIGMOD’ 08 2008.
- Ratinov, L.; Roth, D. Design challenges and misconceptions in named entity recognition. Proceedings of the thirteenth conference on computational natural language learning (CoNLL-2009), 2009, pp. 147–155.
- Lin, Y.; Shen, S.; Liu, Z.; Luan, H.; Sun, M. Neural Relation Extraction with Selective Attention over Instances. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2016.
- Luo, B.; Feng, Y.; Wang, Z.; Zhu, Z.; Huang, S.; Yan, R.; Zhao, D. Learning with noise: Enhance distantly supervised relation extraction with dynamic transition matrix. arXiv preprint arXiv:1705.03995 2017. arXiv:1705.03995.
- Han, X.; Liu, Z.; Sun, M. Neural knowledge acquisition via mutual attention between knowledge graph and text. Proceedings of the AAAI Conference on Artificial Intelligence, 2018, Vol. 32.
- Wei, Z.; Su, J.; Wang, Y.; Tian, Y.; Chang, Y. A novel cascade binary tagging framework for relational triple extraction. arXiv preprint arXiv:1909.03227 2019. arXiv:1909.03227.
- Xia, S.X.S.; Lehong, D.L.D. Feature-Based Approach to Chinese Term Relation Extraction. International Conference on Signal Processing Systems, 2009.
- Rink, B.; Harabagiu, S. UTD: Classifying semantic relations by combining lexical and semantic resources. Association for Computational Linguistics 2010.
- Amin, S.; Dunfield, K.A.; Vechkaeva, A.; Neumann, G. A Data-driven Approach for Noise Reduction in Distantly Supervised Biomedical Relation Extraction. arXiv e-prints 2020. arXiv:2005.12565.
- Lee, J.; Yoon, W.; Kim, S.; Kim, D.; Kim, S.; So, C.H.; Kang, J. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 2019.
- Lütkebohle, I. Chinese-Literature-NER-RE-Dataset. https://github.com/lancopku/Chinese-Literature-NER-RE-Dataset/tree/master/relation_extraction, 2008. [Online; accessed 19-July-2008].
- Riedel, S.; Yao, L.; McCallum, A. Modeling Relations and Their Mentions without Labeled Text BT - Machine Learning and Knowledge Discovery in Databases. Ecml 2010, pp. 148–163. [CrossRef]
- Luan, Y.; He, L.; Ostendorf, M.; Hajishirzi, H. Multi-task identification of entities, relations, and coreference for scientific knowledge graph construction. arXiv preprint arXiv:1808.09602 2018. arXiv:1808.09602.
- Stenetorp, P.; Pyysalo, S.; Topić, G.; Ohta, T.; Ananiadou, S.; Tsujii, J. brat: a Web-based Tool for NLP-Assisted Text Annotation. In Proceedings of the Demonstrations Session at EACL 2012; Association for Computational Linguistics: Avignon, France, 2012. [Google Scholar]
- Wang, Y.; Yu, B.; Zhang, Y.; Liu, T.; Zhu, H.; Sun, L. TPLinker: Single-stage joint extraction of entities and relations through token pair linking. arXiv preprint arXiv:2010.13415 2020. arXiv:2010.13415.
- Zhong, Z.; Chen, D. A frustratingly easy approach for entity and relation extraction. arXiv preprint arXiv:2010.12812, arXiv:2010.12812 2020.





| Dataset | Relation types | Entity types |
| CL | 10 | 15 |
| NYT | 24 | 14 |
| SciERC | 7 | 6 |
| Dataset | Train | Test | Validation |
| CL | 11553 | 2889 | 2889 |
| NYT | 80000 | 8106 | 8233 |
| SciERC | 3215 | 974 | 812 |
| CL | NYT | SciERC | |||||||
| Method | Prec. | Rec. | F1 | Prec. | Rec. | F1 | Prec. | Rec. | F1 |
| R-BERT [11] | 78.0 | 78.0 | 77.8 | 77.9 | 69.8 | 69.2 | 86.2 | 86.3 | 85.7 |
| CasRel‡ [22] | - | - | - | 89.7 | 89.5 | 89.6 | - | - | - |
| TPLinker‡ [31] | - | - | - | 91.3 | 92.5 | 91.9 | - | - | - |
| PRGC‡ [13] | - | - | - | 93.3 | 91.9 | 92.6 | - | - | - |
| PURE [32] | - | - | - | - | - | - | 68.4 | 77.8 | |
| Ours | 87.2 | ||||||||
| Dataset | Model | Prec. | Rec. | F1 |
| NYT | Ours | 95.0 | 94.1 | 93.6 |
| LSTM-Replace | 88.8 | 86.6 | 86.3 | |
| BERT-Random | 95.1 | 94.0 | 93.7 | |
| Type-Strip | 90.8 | 91.0 | 90.9 | |
| CL | Ours | 79.4 | 79.7 | 79.3 |
| LSTM-Replace | 73.1 | 73.0 | 72.8 | |
| BERT-Random | 79.0 | 79.0 | 78.8 | |
| Type-Strip | 76.7 | 77.1 | 78.3 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
