Submitted:
30 July 2024
Posted:
30 July 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Method Architecture
3.1. Model Overview
3.2. Semantic Encoding
3.3. Pinyin Encoding
3.4. POS Encoding
3.5. Feature Fusion Coding
3.5.1. Pinyin-Semantic Fusion
3.5.2. POS-Semantic Fusion
3.6. Progressive Contrast Constraint
3.7. Loss Function
4. Experiment
4.1. Dataset
4.2. Experimental Environment and Parameter Settings
| Experimental Environment | Configuration |
|---|---|
| Operating system | Windows 10 |
| Programming language | Python 3.8.19 |
| Deep learning frameworks | Pytorch 1.13.0 |
| Graphics | Tesla V100 GPU |
| Memory | 128GB |
| Parameter | Parameter Value |
|---|---|
| Bert dimension | 768 |
| Learning rate | 1e-5 |
| Batch size | 32 |
| Maximum text length | 300 |
| Pinyin embedding dimension | 768 |
| POS embedding dimension | 768 |
| Dropout parameters | 0.1 |
| Transformer layers | 12 |
| Loss balance coefficient | 0.7 |
| Epoch | 100 |
4.3. Evaluation Indicators
4.4. Baseline Model
4.5. Experimental Results Analysis
4.5.1. Comparative Test
4.5.2. Ablation Experiment
| Model | Accuracy | Recall | F1 |
|---|---|---|---|
| base | 0.772 | 0.764 | 0.768 |
| base + a | 0.816 | 0.817 | 0.817 |
| CJE-PCHF | 0.821 | 0.822 | 0.822 |
4.5.3. Loss Balance Coefficient Experiment
5. Conclusion
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Liu, Q.; Li, Y.; Duan, H.; Liu, Y.; Qin, Z. A survey of knowledge mapping construction techniques. J. Comput. Res. Dev. 2016, 53, 582–600. [Google Scholar]
- Sun, C.; Gong, Y.; Wu, Y.; Gong, M.; Jiang, D.; Lan, M.; Sun, S.; Duan, N. Joint type inference on entities and relations via graph convolutional networks. In Proceedings of the Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019; pp. 1361-1370.
- Mao, T.; Li, X.; LIU, J.; Zhang, D.; Yan, W. Chinese entity and relation extraction model based on parallel heterogeneous graph and sequential attention mechanism. J. Comput. Appl. 2023, 0. [Google Scholar] [CrossRef]
- LU, X.; SUN, L.; LING, C.; TONG, Z.; LIU, J.; TANG, Q. Named Entity Recognition of Chinese Electronic Health Records Incorporating Phonetic and Part-of-speech Features. J. Chin. Comput. Syst. 2024, 1–12. [Google Scholar]
- Saunshi, N.; Plevrakis, O.; Arora, S.; Khodak, M.; Khandeparkar, H. A theoretical analysis of contrastive unsupervised representation learning. In Proceedings of the International Conference on Machine Learning, 2019. 5628–5637.
- Shao, Y. Research on Chinese entity relationship extraction based on deep learning. 2020.
- Miwa, M.; Bansal, M. End-to-end relation extraction using lstms on sequences and tree structures. arXiv 2016, arXiv:1601.00770. [Google Scholar]
- Katiyar, A.; Cardie, C. Going out on a limb: Joint extraction of entity mentions and relations without dependency trees. In Proceedings of the Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2017. 917–928.
- Zheng, S.; Wang, F.; Bao, H.; Hao, Y.; Zhou, P.; Xu, B. Joint extraction of entities and relations based on a novel tagging scheme. arXiv 2017, arXiv:1706.05075. [Google Scholar]
- Zeng, X.; Zeng, D.; He, S.; Liu, K.; Zhao, J. Extracting relational facts by an end-to-end neural model with copy mechanism. In Proceedings of the Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2018. 506–514.
- Fu, T.; Li, P.; Ma, W. Graphrel: Modeling text as relational graphs for joint entity and relation extraction. In Proceedings of the Proceedings of the 57th annual meeting of the association for computational linguistics, 2019. 1409–1418.
- Wei, Z.; Su, J.; Wang, Y.; Tian, Y.; Chang, Y. A novel cascade binary tagging framework for relational triple extraction. arXiv 2019, arXiv:1909.03227. [Google Scholar]
- Wang, Y.; Zhang, C.; Bai, F.; Wang, Z.; Ji, C. Review of Chinese named entity recognition research. J. Front. Comput. Sci. Technol. 2023, 17, 324. [Google Scholar] [CrossRef]
- Zhang, Y.; Liu, Y.; Zhu, J.; Zheng, Z.; Liu, X.; Wang, W.; Chen, Z.; Zhai, S. Learning Chinese word embeddings from stroke, structure and pinyin of characters. In Proceedings of the Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019; pp. 1011–1020.
- Wang, H.; Shi, Y.; Liu, G.; Duan, J. Named Entity Recognition of Microblog Texts Based on POS Features. J. North China Univ. Technol. 2019, 31. [Google Scholar]
- Zhu, W.; Jin, X.; Ni, J.; Wei, B.; Lu, Z. Improve word embedding using both writing and pronunciation. PLoS ONE 2018, 13, e0208785. [Google Scholar] [CrossRef] [PubMed]
- Wu, S.; Song, X.; Feng, Z. MECT: Multi-metadata embedding based cross-transformer for Chinese named entity recognition. arXiv 2021, arXiv:2107.05418. [Google Scholar]
- Yu, X.; Li, L.; ZhouJianlun; Ma, H.; Chen, P. AMFRel: A method for joint extraction of enity relations in Chinese electronic medical records. J. Chongqing Univ. Technol. (Nat. Sci.) 2024, 38, 189–197. [CrossRef]
- Li, S.; Chang, Z.; Liu, Y. Joint Extraction of Entities and Relations Based on Multi-feature Fusion. Int. J. Emerg. Technol. Adv. Appl. 2024, 1. [Google Scholar] [CrossRef]
- Ge, J.; Li, S.; Fang, Y. Joint extraction method of Chinese entity relationship based on mixture of characters and words. Appl. Res. Comput. 2021, 38. [Google Scholar] [CrossRef]
- Lei, J.; Lai, K.; Yang, S.; Wu, Y. Joint entity and relation extraction based on contextual semantic enhancement. J. Comput. Appl. 2023, 43, 1438–1444. [Google Scholar] [CrossRef]
- Mikolov, T.; Sutskever, I.; Chen, K.; Corrado, G.S.; Dean, J. Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 2013, 26. [Google Scholar]
- Fang, H.; Wang, S.; Zhou, M.; Ding, J.; Xie, P. CERT: Contrastive Self-supervised Learning for Language Understanding. arXiv 2020, arXiv:2005.12766. [Google Scholar]
- Li, S.; He, W.; Shi, Y.; Jiang, W.; Liang, H.; Jiang, Y.; Zhang, Y.; Lyu, Y.; Zhu, Y. Duie: A large-scale chinese dataset for information extraction. In Proceedings of the Natural Language Processing and Chinese Computing: 8th CCF International Conference, NLPCC 2019, Dunhuang, China, October 9–14, 2019, Proceedings, Part II 8, 2019; pp. 791–800.
- Hoffmann, R.; Zhang, C.; Ling, X.; Zettlemoyer, L.; Weld, D.S. Knowledge-based weak supervision for information extraction of overlapping relations. In Proceedings of the Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, 2011; pp. 541–550.
- Ren, X.; Wu, Z.; He, W.; Qu, M.; Voss, C.R.; Ji, H.; Abdelzaher, T.F.; Han, J. Cotype: Joint extraction of typed entities and relations with knowledge bases. In Proceedings of the Proceedings of the 26th international conference on world wide web, 2017; pp. 1015–1024.
- Wang, Y.; Mu, H.; Zhou, L.; Xing, W. Joint extraction method of entity and relationship based on pointer network. Appl. Res. Comput. 2021, 38. [Google Scholar] [CrossRef]
- Chen, R.; Zheng, X.; Zhu, Y. Joint entity and relation extraction fusing entity type information. Comput. Eng. 2022, 48, 46–53. [Google Scholar] [CrossRef]
- Yu, K. Joint Extraction of Chinese Entity Relationship Based on Bidirectional Semantic Learning Model. 2022.




| Data | Training | Training Triplets | Validation | Validation Triplets |
|---|---|---|---|---|
| DuIE | 173108 | 349266 | 21639 | 43739 |
| Model | Accuracy | Recall | F1 |
|---|---|---|---|
| MultiR | 0.577 | 0.356 | 0.440 |
| CoType | 0.661 | 0.605 | 0.632 |
| Pointer Annotation Model | 0.694 | 0.639 | 0.665 |
| FETI | 0.757 | 0.356 | 0.440 |
| CasRel | 0.772 | 0.764 | 0.768 |
| Word Mixture Model | 0.813 | 0.781 | 0.797 |
| BSCRE | 0.816 | 0.795 | 0.805 |
| CJE-PCHF(ours) | 0.821 | 0.822 | 0.822 |
| λ | 0.0 | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | 1.0 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| F1 | 0.793 | 0.802 | 0.812 | 0.821 | 0.819 | 0.819 | 0.820 | 0.822 | 0.820 | 0.819 | 0.809 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).