Submitted:
02 August 2024
Posted:
08 August 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Neural Machine Translation for Annotation Projection
3. Materials and Methods
3.1. Clinical Corpora: DisTEMIST, DrugTEMIST and MEDDOPROF
3.2. Corpus Translation and Annotation Projection
3.3. Corpus Validation and Correction Process
3.4. Clinical Named Entity Recognition Models
- none: all tokens have the same importance.
- freq: each token has a weight inversely proportional to the frequency of its ground truth class (IOB tag) in the training split.where is the total number of tokens, the number of tokens for class i, and the number of classes.
- freq_sqrt: each token has a weight inversely proportional to the square root of the frequency of its ground truth class.
3.4.1. Model Selection
3.4.2. Model Availability
4. Results
4.1. Cross-Language Model Evaluation
4.2. MT Error Analysis
4.3. Annotation Projection and NER System Error Analysis
5. Discussion
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Al Kuwaiti, A.; Nazer, K.; Al-Reedy, A.; Al-Shehri, S.; Al-Muhanna, A.; Subbarayalu, A.V.; Al Muhanna, D.; Al-Muhanna, F.A. A Review of the Role of Artificial Intelligence in Healthcare. Journal of Personalized Medicine 2023, 13, 951. [Google Scholar] [CrossRef] [PubMed]
- Houssein, E.H.; Mohamed, R.E.; Ali, A.A. Machine Learning Techniques for Biomedical Natural Language Processing: A Comprehensive Review. IEEE Access 2021, 9, 140628–140653. [Google Scholar] [CrossRef]
- Kundeti, S.R.; Vijayananda, J.; Mujjiga, S.; Kalyan, M. Clinical Named Entity Recognition: Challenges and Opportunities. In Proceedings of the 2016 IEEE International Conference on Big Data (Big Data); 2016; pp. 1937–1945. [Google Scholar] [CrossRef]
- Pagad, N.S.; Pradeep, N. Clinical Named Entity Recognition Methods: An Overview. In Proceedings of the International Conference on Innovative Computing and Communications; Khanna, A.; Gupta, D.; Bhattacharyya, S.; Hassanien, A.E.; Anand, S.; Jaiswal, A., Eds., Singapore; 2022; pp. 151–165. [Google Scholar] [CrossRef]
- Wu, Y.; Jiang, M.; Xu, J.; Zhi, D.; Xu, H. Clinical Named Entity Recognition Using Deep Learning Models. AMIA Annual Symposium Proceedings 2018, 2017, 1812–1819. [Google Scholar] [PubMed]
- Uzuner, Ö.; South, B.R.; Shen, S.; DuVall, S.L. 2010 I2b2/VA Challenge on Concepts, Assertions, and Relations in Clinical Text. Journal of the American Medical Informatics Association : JAMIA 2011, 18, 552–556. [Google Scholar] [CrossRef] [PubMed]
- Luo, Y.; Thompson, W.K.; Herr, T.M.; Zeng, Z.; Berendsen, M.A.; Jonnalagadda, S.R.; Carson, M.B.; Starren, J. Natural Language Processing for EHR-Based Pharmacovigilance: A Structured Review. Drug Safety 2017, 40, 1075–1089. [Google Scholar] [CrossRef]
- Hovy, D.; Prabhumoye, S. Five Sources of Bias in Natural Language Processing. Language and Linguistics Compass 2021, 15, e12432. [Google Scholar] [CrossRef] [PubMed]
- Névéol, A.; Dalianis, H.; Velupillai, S.; Savova, G.; Zweigenbaum, P. Clinical natural language processing in languages other than English: opportunities and challenges. Journal of biomedical semantics 2018, 9, 1–13. [Google Scholar] [CrossRef]
- Schneider, E.T.R.; de Souza, J.V.A.; Knafou, J.; e Oliveira, L.E.S.; Copara, J.; Gumiel, Y.B.; de Oliveira, L.F.A.; Paraiso, E.C.; Teodoro, D.; Barra, C.M.C.M. BioBERTpt - A Portuguese Neural Language Model for Clinical Named Entity Recognition. In Proceedings of the Proceedings of the 3rd Clinical Natural Language Processing Workshop; pp. 202065–72. [CrossRef]
- García-Izquierdo, I.; Montalt, V. Cultural Competence and the Role of the Patient’s Mother Tongue: An Exploratory Study of Health Professionals’ Perceptions. Societies 2022, 12, 1735–1780. [Google Scholar] [CrossRef]
- Montalt, V. Ethical Considerations in the Translation of Health Genres in Crisis Communication. In Translating Crises; Bloomsbury Publishing, 2022; pp. 17–36.
- Schäfer, H.; Idrissi-Yaghir, A.; Horn, P.; Friedrich, C. Cross-Language Transfer of High-Quality Annotations: Combining Neural Machine Translation with Cross-Linguistic Span Alignment to Apply NER to Clinical Texts in a Low-Resource Language. In Proceedings of the Proceedings of the 4th Clinical Natural Language Processing Workshop; pp. 202253–62. [CrossRef]
- Xie, S.; Xia, Y.; Wu, L.; Huang, Y.; Fan, Y.; Qin, T. End-to-End Entity-Aware Neural Machine Translation. Machine Learning 2022, 111, 1181–1203. [Google Scholar] [CrossRef]
- Jain, A.; Paranjape, B.; Lipton, Z.C. 2019; arXiv:cs, stat/1909.05356]. [CrossRef]
- Mikolov, T.; Le, Q.V.; Sutskever, I. 2013; arXiv:cs/1309.4168]. [CrossRef]
- Altintas, K.; Cicekli, I. A Machine Translation System Between a Pair of Closely Related Languages. In International Symposium on Computer and Information Sciences; CRC Press, 2002.
- Briva-Iglesias, V. English-Catalan Neural Machine Translation: State-of-the-Art Technology, Quality, and Productivity. Tradumàtica: tecnologies de la traducció. [CrossRef]
- Castilho, S.; Moorkens, J.; Gaspari, F.; Calixto, I.; Tinsley, J.; Way, A. Is Neural Machine Translation the New State of the Art? The Prague Bulletin of Mathematical Linguistics 2017. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser. ; Polosukhin, I. Attention Is All You Need. Advances in neural information processing systems 2017, 30. [Google Scholar]
- Costa-Jussà, M.R.; Farrús, M.; Serrano Pons, J. Machine translation in medicine: A quality analysis of statistical machine translation in the medical domain 2012.
- Soto, X.; Perez-de Viñaspre, O.; Oronoz, M.; Labaka, G. Development of a machine translation system for promoting the use of a low resource language in the clinical domain: The case of Basque. In Natural Language Processing In Healthcare; CRC Press, 2022; pp. 139–158.
- Soares, F.; Krallinger, M. BSC participation in the WMT translation of biomedical abstracts. In Proceedings of the Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2), 2019, pp.; pp. 175–178.
- Neves, M.; Yepes, A.J.; Siu, A.; Roller, R.; Thomas, P.; Navarro, M.V.; Yeganova, L.; Wiemann, D.; Di Nunzio, G.M.; Vezzani, F.; et al. Findings of the WMT 2022 biomedical translation shared task: Monolingual clinical case reports. In Proceedings of the WMT22-Seventh Conference on Machine Translation; 2022; pp. 694–723. [Google Scholar]
- Almagro, M.; Martínez, R.; Montalvo, S.; Fresno, V. A cross-lingual approach to automatic ICD-10 coding of death certificates by exploring machine translation. Journal of biomedical informatics 2019, 94, 103207. [Google Scholar] [CrossRef] [PubMed]
- Pilegaard, M. Translation of medical research articles. Benjamins Translation Library 1997, 26, 159–184. [Google Scholar]
- Frei, J.; Frei-Stuber, L.; Kramer, F. GERNERMED++: Semantic annotation in German medical NLP through transfer-learning, translation and word alignment. Journal of Biomedical Informatics 2023, 147, 104513. [Google Scholar] [CrossRef] [PubMed]
- Schäfer, H.; Idrissi-Yaghir, A.; Horn, P.; Friedrich, C. Cross-Language Transfer of High-Quality Annotations: Combining Neural Machine Translation with Cross-Linguistic Span Alignment to Apply NER to Clinical Texts in a Low-Resource Language. In Proceedings of the Proceedings of the 4th Clinical Natural Language Processing Workshop; pp. 202253–62. [CrossRef]
- Zaghir, J.; Bjelogrlic, M.; Goldman, J.P.; Aananou, S.; Gaudet-Blavignac, C.; Lovis, C. FRASIMED: a Clinical French Annotated Resource Produced through Crosslingual BERT-Based Annotation Projection. arXiv preprint arXiv:2309.10770, arXiv:2309.10770 2023.
- Deléger, L.; Merkel, M.; Zweigenbaum, P. Translating medical terminologies through word alignment in parallel text corpora. Journal of Biomedical Informatics 2009, 42, 692–701. [Google Scholar] [CrossRef] [PubMed]
- Villegas, M.; Intxaurrondo, A.; Gonzalez-Agirre, A.; Marimon, M.; Krallinger, M. The MeSpEN resource for English-Spanish medical machine translation and terminologies: census of parallel corpora, glossaries and term translations. LREC MultilingualBIO: multilingual biomedical text processing.
- Yang, R.; Tan, T.F.; Lu, W.; Thirunavukarasu, A.J.; Ting, D.S.W.; Liu, N. Large language models in health care: Development, applications, and challenges. Health Care Science 2023, 2, 255–263. [Google Scholar] [CrossRef] [PubMed]
- Briva-Iglesias, V.; Camargo, J.L.C.; Dogru, G. Large Language Models "Ad Referendum": How Good Are They at Machine Translation in the Legal Domain? 2024; arXiv:cs.CL/2402.07681]. [Google Scholar]
- ten Hacken, P. The Language of Medicine in the Romance Languages. In Oxford Research Encyclopedia of Linguistics; 2023.
- Miranda-Escalada, A.; Farré-Maduell, E.; Lima-López, S.; Estrada, D.; Gascó, L.; Krallinger, M. Mention detection, normalization & classification of species, pathogens, humans and food in clinical documents: Overview of LivingNER shared task and resources. Procesamiento del Lenguaje Natural 2022. [Google Scholar]
- Lima-López, S.; Farré-Maduell, E.; Brivá-Escalada, V.; Gascó, L.; Krallinger, M. MEDDOPLACE Shared Task overview: recognition, normalization and classification of locations and patient movement in clinical texts. Procesamiento del Lenguaje Natural 2023, 71. [Google Scholar]
- Lima-López, S.; Farré-Maduell, E.; Gasco-Sánchez, L.; Rodríguez-Miret, J.; Krallinger, M. Overview of SympTEMIST at BioCreative VIII: corpus, guidelines and evaluation of systems for the detection and normalization of symptoms, signs and findings from text. In Proceedings of the Proceedings of the BioCreative VIII Challenge and Workshop: Curation and Evaluation in the era of Generative Models; 2023. [Google Scholar]
- Lima-López, S.; Farré-Maduell, E.; Gascó, L.; Nentidis, A.; Krithara, A.; Katsimpras, G.; Paliouras, G.; Krallinger, M. Overview of MedProcNER task on medical procedure detection and entity linking at BioASQ 2023. In Proceedings of the Working Notes of CLEF 2023 - Conference and Labs of the Evaluation Forum; 2023. [Google Scholar]
- Shaitarova, A.; Zaghir, J.; Lavelli, A.; Krauthammer, M.; Rinaldi, F. Exploring the Latest Highlights in Medical Natural Language Processing across Multiple Languages: A Survey. Yearbook of medical informatics 2023, 32, 230–243. [Google Scholar] [CrossRef]
- Lima-López, S.; Farré-Maduell, E.; Rodríguez-Miret, J.; Rodríguez-Ortega, M.; Lilli, L.; Lenkowicz, J.; Ceroni, G.; Kossoff, J.; Shah, A.; Nentidis, A.; et al. Overview of MultiCardioNER task at BioASQ 2024 on Medical Speciality and Language Adaptation of Clinical NER Systems for Spanish, English and Italian. In Proceedings of the CLEF Working Notes; Faggioli, G.; Ferro, N.; Galuščáková, P.; García Seco de Herrera, A., Eds. 2024. [Google Scholar]
- Miranda-Escalada, A.; Gascó, L.; Lima-López, S.; Farré-Maduell, E.; Estrada, D.; Nentidis, A.; Krithara, A.; Katsimpras, G.; Paliouras, G.; Krallinger, M. Overview of DisTEMIST at BioASQ: Automatic detection and normalization of diseases from clinical texts: results, methods, evaluation and multilingual resources 2022.
- Lima-López, S.; Farré-Maduell, E.; Miranda-Escalada, A.; Brivá-Iglesias, V.; Krallinger, M. NLP applied to occupational health: MEDDOPROF shared task at IberLEF 2021 on automatic recognition, classification and normalization of professions and occupations from medical texts. Procesamiento del Lenguaje Natural 2021, 67, 243–256. [Google Scholar]
- Team, N.; Costa-jussà, M.R.; Cross, J.; Çelebi, O.; Elbayad, M.; Heafield, K.; Heffernan, K.; Kalbassi, E.; Lam, J.; Licht, D.; et al. S: Language Left Behind, 2022; arXiv:cs/2207.04672]. [CrossRef]
- Ethnologue. The Catalan Language. https://www.ethnologue.com/language/cat/, 2017.
- Stenetorp, P.; Pyysalo, S.; Topić, G.; Ohta, T.; Ananiadou, S.; Tsujii, J. BRAT: a web-based tool for NLP-assisted text annotation. In Proceedings of the Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, 2012, pp.; pp. 102–107.
- Tian, S.; Erdengasileng, A.; Yang, X.; Guo, Y.; Wu, Y.; Zhang, J.; Bian, J.; He, Z. Transformer-Based Named Entity Recognition for Parsing Clinical Trial Eligibility Criteria. In Proceedings of the Proceedings of the 12th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, New York, NY, USA, 2021; pp. 211–6. [CrossRef]
- Yang, F.; Wang, X.; Ma, H.; Li, J. Transformers-Sklearn: A Toolkit for Medical Language Understanding with Transformer-Based Models. BMC Medical Informatics and Decision Making 2021, 21, 90. [Google Scholar] [CrossRef] [PubMed]
- Ratnaparkhi, A.; Marcus, M.P. Maximum entropy models for natural language ambiguity resolution. PhD thesis, USA, 1998. AAI9840230.
- Lange, L.; Adel, H.; Strötgen, J. Boosting Transformers for Job Expression Extraction and Classification in a Low-Resource Setting, 2021. arXiv:2109. 1: [cs] version, 0859; 1. [Google Scholar] [CrossRef]
- Moscato, V.; Postiglione, M.; Sperlì, G. Biomedical Spanish Language Models for entity recognition and linking at BioASQ DisTEMIST. 09 2022.
- Carrino, C.P.; Llop, J.; Pàmies, M.; Gutiérrez-Fandiño, A.; Armengol-Estapé, J.; Silveira-Ocampo, J.; Valencia, A.; Gonzalez-Agirre, A.; Villegas, M. Pretrained Biomedical Language Models for Clinical NLP in Spanish. In Proceedings of the Proceedings of the 21st Workshop on Biomedical Language Processing. [CrossRef]
- Cañete, J.; Chaperon, G.; Fuentes, R.; Ho, J.H.; Kang, H.; Pérez, J. Spanish Pre-Trained BERT Model and Evaluation Data. In Proceedings of the PML4DC at ICLR 2020, 2020. [Google Scholar]
- Lommel, A. Metrics for Translation Quality Assessment: A Case for Standardising Error Typologies. In Translation Quality Assessment: From Principles to Practice; Moorkens, J., Castilho, S., Gaspari, F., Doherty, S., Eds.; Springer International Publishing: Cham, 2018; pp. 109–127. [Google Scholar] [CrossRef]
- Freitag, M.; Foster, G.; Grangier, D.; Ratnakar, V.; Tan, Q.; Macherey, W. Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation. Transactions of the Association for Computational Linguistics 2021, 9, 1460–1474. [Google Scholar] [CrossRef]
- Moorkens, J.; Castilho, S.; Gaspari, F.; Doherty, S. (Eds.) Translation Quality Assessment: From Principles to Practice; Vol. 1, Machine Translation: Technologies and Applications, Springer International Publishing: Cham, 2018. [Google Scholar] [CrossRef]
- Chinchor, N.; Sundheim, B. MUC-5 Evaluation Metrics. In Proceedings of the Fifth Message Understanding Conference (MUC-5): Proceedings of a Conference Held in Baltimore, Maryland, 1993, 1993., August 25-27.
- Reyna, O.G.P. El Diccionario panhispánico de términos médicos. Revista de la Sociedad Peruana de Medicina Interna 2023, 36, 169–170. [Google Scholar]
- Soto, X.; Perez-de Viñaspre, O.; Labaka, G.; Oronoz, M. Neural machine translation of clinical texts between long distance languages. Journal of the American Medical Informatics Association 2019, 26, 1478–1487. [Google Scholar] [CrossRef]
- Miranda-Escalada, A.; Farré-Maduell, E.; Krallinger, M. Named entity recognition, concept normalization and clinical coding: Overview of the CANTEMIST track for cancer text mining in Spanish, corpus, guidelines, methods and results. In Proceedings of the Proceedings of the Iberian Languages Evaluation Forum (IberLEF CEUR Workshop Proceedings, 2020.
- Gonzalez-Agirre, A.; Marimon, M.; Intxaurrondo, A.; Rabal, O.; Villegas, M.; Krallinger, M. Pharmaconer: Pharmacological substances, compounds and proteins named entity recognition track. In Proceedings of the Proceedings of The 5th Workshop on BioNLP Open Shared Tasks, 2019, pp.; pp. 1–10.


| Dataset | Topic | Documents | Tokens | Annotations |
|---|---|---|---|---|
| DisTEMIST | Diseases | 1,000 | 406,318 | 10,664 |
| DrugTEMIST | Medications | 1,000 | 406,318 | 2,782 |
| MEDDOPROF | Occupations | 1,844 | 1,291,186 | 4,770 |
| Dataset | Documents | GS Ann. | Not Projected | % Not Projected |
|---|---|---|---|---|
| DisTEMIST train+dev | 750 | 8,065 | 98 | 1.21% |
| DisTEMIST test | 250 | 2,599 | 55 | 2.11% |
| DrugTEMIST train+dev | 750 | 2,090 | 34 | 1.62% |
| DrugTEMIST test | 250 | 692 | 18 | 2.60% |
| MEDDOPROF train+dev | 1,500 | 3,658 | 139 | 3.79% |
| MEDDOPROF test | 344 | 1,085 | 54 | 4.97% |
| Drugs | Diseases | Occupations | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Trained on/Eval. on | P | R | F1 | P | R | F1 | P | R | F1 |
| Spa/Spa | 0.917 | 0.909 | 0.913 | 0.754 | 0.759 | 0.757 | 0.785 | 0.776 | 0.780 |
| Spa/Cat corr. | 0.886 | 0.809 | 0.846 | 0.610 | 0.608 | 0.609 | 0.642 | 0.337 | 0.442 |
| Spa/CataCCC | 0.908 | 0.857 | 0.882 | 0.742 | 0.702 | 0.721 | 0.750 | 0.444 | 0.558 |
| Cat corr./Spa | 0.884 | 0.880 | 0.882 | 0.672 | 0.644 | 0.658 | 0.723 | 0.726 | 0.725 |
| Cat corr./Cat corr. | 0.885 | 0.874 | 0.879 | 0.701 | 0.718 | 0.709 | 0.743 | 0.717 | 0.729 |
| Cat corr./CataCCC | 0.921 | 0.904 | 0.913 | 0.775 | 0.818 | 0.796 | 0.838 | 0.794 | 0.815 |
| Drugs | Diseases | Occupations | ||||
|---|---|---|---|---|---|---|
| Trained on/ Eval. on | Strict | Relaxed | Strict | Relaxed | Strict | Relaxed |
| Spa/Spa | 0.913 | 0.954 | 0.757 | 0.892 | 0.782 | 0.872 |
| Cat corr./Cat corr. | 0.879 | 0.937 | 0.709 | 0.866 | 0.729 | 0.840 |
| Cat corr./CataCCC | 0.913 | 0.957 | 0.796 | 0.889 | 0.815 | 0.875 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).