Submitted:
16 April 2025
Posted:
16 April 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- -
- we introduce a flexible, multilingual, and adaptable pipeline for medical NER
- -
- our pipeline enables effective extraction of medical entities (problem, test and treatment) from a given body of text
- -
- our pipeline is effective across diverse languages, with a focus on low-resource languages
2. Materials and Methods
2.1. Overall Methodology
2.2. Data Sources and Corpus Creation


2.3. Preprocessing
2.4. Annotation Process
2.5. Translation and Multilingual Adaptation
- German - Helsinki-NLP/opus-mt-en-de (https://huggingface.co/Helsinki-NLP/opus-mt-en-de)
- Greek - Helsinki-NLP/opus-mt-en-el (https://huggingface.co/Helsinki-NLP/opus-mt-en-el)
- Spanish - Helsinki-NLP/opus-mt-tc-big-en-es (https://huggingface.co/Helsinki-NLP/opus-mt-tc-big-en-es)
- Italian - Helsinki-NLP/opus-mt-tc-big-en-it (https://huggingface.co/Helsinki-NLP/opus-mt-tc-big-en-it)
- Polish - gsarti/opus-mt-tc-en-pl (https://huggingface.co/gsarti/opus-mt-tc-en-pl)
- Portuguese - Helsinki-NLP/opus-mt-tc-big-en-pt (https://huggingface.co/Helsinki-NLP/opus-mt-tc-big-en-pt)
- Slovenian - facebook/mbart-large-50-many-to-many-mmt model (https://huggingface.co/facebook/mbart-large-50-many-to-many-mmt)
2.6. Word Alignment
2.7. Data Augmentation
2.8. Data Splitting
2.9. Model Configuration
2.10. Evaluation
3. Results
3.1. Models’ Comparison with Existing Models
3.2. Case Studies
- The patient complained of severe (B-PROBLEM) headaches (E-PROBLEM) and nausea (S-PROBEM) that had persisted for two days. To alleviate the (B-PROBLEM) symptoms (E-PROBLEM), he was prescribed paracetamol (S-TREATMENT) and advised to rest and drink plenty of fluids.
- The patient exhibited symptoms (S-PROBLEM) of fever (S-PROBLEM), cough (S-PROBLEM), and body (B-PROBLEM) aches (E-PROBLEM). A (B-TEST) chest (I-TEST) X-ray (I-TEST) was taken to rule out pneumonia (S-PROBLEM). He was prescribed an (B-TREATMENT) antibiotic (E-TREATMENT) and advised to rest.
- The patient complained of dizziness (S-PROBLEM), vision (B-PROBLEM) disturbances (E-PROBLEM), and numbness (B-PROBLEM) in (I-PROBLEM) her (I-PROBLEM) hands (E-PROBLEM). An (B-TEST) MRI (I-TEST) of (I-TEST) the (I-TEST) brain (E-TEST) was ordered to rule out a (B-PROBLEM) neurological (I-PROBLEM) cause (E-PROBLEM). A (B-TREATMENT) beta-blocker (I-TREATMENT) was prescribed to stabilize her (B-TEST) blood (I-TEST) pressure (E-TEST).
- The patient complained of severe (B-PROBLEM) headaches (E-PROBLEM) and nausea (S-PROBEM) that had persisted for two days. To alleviate the (B-PROBLEM) symptoms (E-PROBLEM), he was prescribed paracetamol (S-TREATMENT) and advised to rest and drink plenty of fluids.
- The patient exhibited symptoms (S-PROBLEM) of fever (S-PROBLEM), cough (S-PROBLEM), and body (B-PROBLEM) aches (E-PROBLEM). A (B-TEST) chest (I-TEST) X-ray (I-TEST) was taken to rule out pneumonia (S-PROBLEM). He was prescribed an (B-TREATMENT) antibiotic (E-TREATMENT) and advised to rest.
- The patient complained of dizziness (S-PROBLEM), vision (B-PROBLEM) disturbances (E-PROBLEM), and numbness (B-PROBLEM) in (I-PROBLEM) her (I-PROBLEM) hands (E-PROBLEM). An (B-TEST) MRI (I-TEST) of (I-TEST) the (I-TEST) brain (E-TEST) was ordered to rule out a (B-PROBLEM) neurological (I-PROBLEM) cause (E-PROBLEM). A (B-TREATMENT) beta (I-TREATMENT ) – (I-TREATMENT) blocker (E-TREATMENT) was prescribed to stabilize her (B-TEST) blood (I-TEST) pressure (E-TEST).
- El paciente se quejó de fuertes (B-PROBLEM) dolores (E-PROBLEM) de cabeza y náuseas (S-PROBLEM) que habían persistido durante dos días. Para aliviar los síntomas, se le recetó paracetamol (S-TREATMENT) y se le aconsejó descansar y beber muchos líquidos.
- El paciente presentó síntomas (S-PROBLEM) de fiebre (S-PROBLEM), tos y dolores (E-PROBLEM) corporals (E-PROBLEM). Se le realizó una (B-TEST) radiografía (E-TEST) de tórax para descartar una (B-PROBLEM) neumonía (E-PROBLEM). Se le recetó un (B-TREATMENT) antibiótico (E-TREATMENT) y se le aconsejó descansar.
- La paciente se quejó de mareos (S-PROBLEM), alteraciones (E-PROBLEM) de la vision (B-PROBLEM) y entumecimiento (B-PROBLEM) en (I-PROBLEM) las manos (E-PROBLEM). Se ordenó una (B-TEST) resonancia (I-TEST) magnética (E-TEST) del (I-TEST) cerebro (E-TEST) para descartar una causa (E-PROBLEM) neurológica (I-PROBLEM). Se le recetó un (B-TREATMENT) betabloqueante (E-TREATMENT) para estabilizar su (B-TEST) presión (E-TEST) arterial (I-TEST).
- El paciente se quejó de fuertes dolores (B-DISO) de (I-DISO) cabeza (I-PROBLEM) y náuseas (B-DISO) que habían persistido durante dos días. Para aliviar los síntomas (B-PROBLEM), se le recetó paracetamol (B-CHEM) y se le aconsejó descansar (B-PROC) y beber muchos líquidos.
- El paciente presentó síntomas (B-DISO) de (I-DISO) fiebre (I-DISO), tos (I-DISO) y dolores (B-DISO) corporals (I-DISO). Se le realizó una radiografía (B-PROC) de (I-PROC) tórax (I-PROC) para descartar una neumonía (B-DISO). Se le recetó (B-PROC) un antibiótico (B-CHEM) y se le aconsejó descansar (B-PROC).
- La paciente se quejó de mareos (B-DISO), alteraciones (B-DISO) de (I-DISO) la (I-DISO) vision (I-DISO) y entumecimiento (B-DISO) en (I-DISO) las (I-DISO) manos (I-DISO). Se ordenó una resonancia (B-PROC) magnética (I-PROC) del (I-PROC) cerebro (I-PROC) para descartar una causa neurológica. Se le recetó un betabloqueante (B-CHEM) para estabilizar (B-PROC) su presión (B-PROC) arterial (I-PROC).
- The patient complained of severe headaches and nausea that had persisted for two days. To relieve the symptoms, paracetamol was prescribed, and the patient was advised to rest and drink plenty of fluids.
- The patient presented symptoms of fever, cough, and body aches. A chest X-ray was performed to rule out pneumonia. An antibiotic was prescribed, and the patient was advised to rest.
- The patient complained of dizziness, vision disturbances, and numbness in the hands. An MRI of the brain was ordered to rule out a neurological cause. A beta-blocker was prescribed to stabilize her blood pressure.
- Il paziente ha lamentato forti (B-PROBLEM) mal (E-PROBLEM) di testa (E-PROBLEM) e nausea (S-PROBLEM) che persistevano da due giorni. Per alleviare i sintomi (E-PROBLEM), gli è stato prescritto il paracetamolo (S-TREATMENT) e gli è stato consigliato di riposare e bere molti liquidi.
- Il paziente ha manifestato sintomi (S-PROBLEM) di febbre (S-PROBLEM), tosse (S-PROBLEM) e dolori (E-PROBLEM) muscolari. È stata eseguita una (B-TEST) radiografia (E-TEST) del torace (E-TEST) per escludere una (B-PROBLEM) polmonite (E-PROBLEM). Gli è stato prescritto un (B-TREATMENT) antibiotico (E-TREATMENT) e gli è stato consigliato di riposare.
- La paziente ha lamentato vertiginid (S-PROBLEM), disturbi (E-PROBLEM) visivi (B-PROBLEM) e intorpidimento (B-PROBLEM) delle (I-PROBLEM) mani (E-PROBLEM). È stata ordinata una (B-TREATMENT) risonanza (I-TEST) magnetica (I-TEST) del (I-TEST) cervello per escludere una (B-PROBLEM) causa (E-PROBLEM) neurologica (I-PROBLEM). È stato prescritto un (B-TREATMENT) betabloccante (E-TREATMENT) per stabilizzare la pressione (E-TEST) sanguigna.
- Il paziente ha lamentato forti mal di testa e nausea che persistevano da due giorni. Per alleviare i sintomi, gli è stato prescritto il paracetamolo (TRATTAMENTO FARMACOLOGICO (B)) e gli è stato consigliato di riposare e bere molti liquidi.
- Il paziente ha manifestato sintomi di febbre, tosse e dolori muscolari. È stata eseguita una radiografia (TEST (B)) del torace per escludere una polmonite. Gli è stato prescritto un antibiotico e gli è stato consigliato di riposare.
- La paziente ha lamentato vertigini, disturbi visivi e intorpidimento delle mani. È stata ordinata una risonanza (TEST (B)) magnetica (TEST (B)) del (TEST (I)) cervello (TEST (I)) per escludere una causa neurologica. È stato prescritto un betabloccante (TRATTAMENTO FARMACOLOGICO (B)) per stabilizzare la pressione sanguigna.
- The patient complained of severe headaches and nausea that had persisted for two days. To relieve the symptoms, paracetamol was prescribed, and he was advised to rest and drink plenty of fluids.
- The patient presented symptoms of fever, cough, and muscle aches. A chest X-ray was performed to rule out pneumonia. An antibiotic was prescribed, and he was advised to rest.
- The patient complained of dizziness, visual disturbances, and numbness in the hands. An MRI of the brain was ordered to rule out a neurological cause. A beta-blocker was prescribed to stabilize blood pressure.
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| NER | Named Entity Recognition |
| PROM | Patient-Reported Outcome Measure |
| NLP | Natural Language Processing |
| BERT | Bidirectional Encoder Representations from Transformers |
| BLEU | Bilingual Evaluation Understudy |
| MLM | Masked language modeling |
| NSP | Next Sentence Prediction |
References
- S. W. Salmond and M. Echevarria, “Healthcare Transformation and Changing Roles for Nursing,” Orthopaedic Nursing, vol. 36, no. 1, p. 12, Feb. 2017. [CrossRef]
- G. Anderson, J. Horvath, J. R. Knickman, D. C. Colby, S. Schear, and M. Jung, “Chronic Conditions: Making the Case for Ongoing Care”, Accessed: Dec. 02, 2024. [Online]. Available: https://www.policyarchive.org/handle/10207/21756.
- J. Li and D. Porock, “Resident outcomes of person-centered care in long-term care: A narrative review of interventional research,” International Journal of Nursing Studies, vol. 51, no. 10, pp. 1395–1415, Oct. 2014. [CrossRef]
- M. Calvert, D. Kyte, G. Price, J. M. Valderas, and N. H. Hjollund, “Maximising the impact of patient reported outcome assessment for patients and society,” BMJ, vol. 364, p. k5267, Jan. 2019. [CrossRef]
- H. Nguyen, P. Butow, H. Dhillon, and P. Sundaresan, “A review of the barriers to using Patient-Reported Outcomes (PROs) and Patient-Reported Outcome Measures (PROMs) in routine cancer care,” Journal of Medical Radiation Sciences, vol. 68, no. 2, pp. 186–195, 2021. [CrossRef]
- N. Black, “Patient reported outcome measures could help transform healthcare,” BMJ, vol. 346, p. f167, Jan. 2013. [CrossRef]
- J. Greenhalgh, “The applications of PROs in clinical practice: what are they, do they work, and why?,” Qual Life Res, vol. 18, no. 1, pp. 115–123, Feb. 2009. [CrossRef]
- C. F. Snyder, N. K. Aaronson, A. K. Choucair, T. E. Elliott, J. Greenhalgh, M. Y. Halyard, et al., “Implementing patient-reported outcomes assessment in clinical practice: a review of the options and considerations,” Qual Life Res, vol. 21, no. 8, pp. 1305–1314, Oct. 2012. [CrossRef]
- V. Šafran, S. Lin, J. Nateqi, A. G. Martin, U. Smrke, U. Ariöz, et al., “Multilingual Framework for Risk Assessment and Symptom Tracking (MRAST),” Sensors, vol. 24, no. 4, Art. no. 4, Jan. 2024. [CrossRef]
- I. Mlakar, U. Arioz, U. Smrke, N. Plohl, V. Šafran, and M. Rojc, “An End-to-End framework for extracting observable cues of depression from diary recordings,” Expert Systems with Applications, vol. 257, p. 125025, Dec. 2024. [CrossRef]
- B. Bardak and M. Tan, “Improving clinical outcome predictions using convolution over medical entities with multimodal learning,” Artificial Intelligence in Medicine, vol. 117, p. 102112, Jul. 2021. [CrossRef]
- N. Sasikumar and K. S. I. Mantri, “Transfer Learning for Low-Resource Clinical Named Entity Recognition,” in Proceedings of the 5th Clinical Natural Language Processing Workshop, T. Naumann, A. Ben Abacha, S. Bethard, K. Roberts, and A. Rumshisky, Eds., Toronto, Canada: Association for Computational Linguistics, Jul. 2023, pp. 514–518. [CrossRef]
- A. Shaitarova, J. Zaghir, A. Lavelli, M. Krauthammer, and F. Rinaldi, “Exploring the Latest Highlights in Medical Natural Language Processing across Multiple Languages: A Survey,” Yearbook of Medical Informatics, vol. 32, pp. 230–243, Dec. 2023. [CrossRef]
- S. Zhu, Supryadi, S. Xu, H. Sun, L. Pan, M. Cui, et al., “Multilingual Large Language Models: A Systematic Survey,” Nov. 19, 2024, arXiv: arXiv:2411.11072. [CrossRef]
- J. Agerskov, K. Nielsen, and C. F. Pedersen, “Computationally Efficient Labeling of Cancer-Related Forum Posts by Non-clinical Text Information Retrieval,” SN COMPUT. SCI., vol. 4, no. 6, p. 711, Sep. 2023. [CrossRef]
- O. Rohanian, M. Nouriborji, H. Jauncey, S. Kouchaki, I. C. C. Group, L. Clifton, et al., “Lightweight Transformers for Clinical Natural Language Processing,” Feb. 09, 2023, arXiv: arXiv:2302.04725. [CrossRef]
- Y. Zhang, Y. Zhang, P. Qi, C. D. Manning, and C. P. Langlotz, “Biomedical and clinical English model packages for the Stanza Python NLP library,” Journal of the American Medical Informatics Association, vol. 28, no. 9, pp. 1892–1899, Sep. 2021. [CrossRef]
- P. Qi, Y. Zhang, Y. Zhang, J. Bolton, and C. D. Manning, “Stanza: A Python Natural Language Processing Toolkit for Many Human Languages,” Apr. 23, 2020, arXiv: arXiv:2003.07082. Accessed: May 08, 2024. [Online]. Available: http://arxiv.org/abs/2003.07082.
- J. Frei and F. Kramer, “GERNERMED -- An Open German Medical NER Model,” Dec. 10, 2021, arXiv: arXiv:2109.12104. [CrossRef]
- C. Crema, T. M. Buonocore, S. Fostinelli, E. Parimbelli, F. Verde, C. Fundarò, et al., “Advancing Italian biomedical information extraction with transformers-based models: Methodological insights and multicenter practical application,” Journal of Biomedical Informatics, vol. 148, p. 104557, Dec. 2023. [CrossRef]
- L. Campillos-Llanos, A. Valverde-Mateos, A. Capllonch-Carrión, and A. Moreno-Sandoval, “A clinical trials corpus annotated with UMLS entities to enhance the access to evidence-based medicine,” BMC Medical Informatics and Decision Making, vol. 21, no. 1, p. 69, Feb. 2021. [CrossRef]
- E. T. R. Schneider, J. V. A. de Souza, J. Knafou, L. E. S. e Oliveira, J. Copara, Y. B. Gumiel, et al., “BioBERTpt - A Portuguese Neural Language Model for Clinical Named Entity Recognition,” in Proceedings of the 3rd Clinical Natural Language Processing Workshop, A. Rumshisky, K. Roberts, S. Bethard, and T. Naumann, Eds., Online: Association for Computational Linguistics, Nov. 2020, pp. 65–72. [CrossRef]
- Ö. Uzuner, B. R. South, S. Shen, and S. L. DuVall, “2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text,” J Am Med Inform Assoc, vol. 18, no. 5, pp. 552–556, 2011. [CrossRef]
- J. Tiedemann and S. Thottingal, “OPUS-MT – Building open translation services for the World,” in Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, A. Martins, H. Moniz, S. Fumega, B. Martins, F. Batista, L. Coheur, C. Parra, I. Trancoso, M. Turchi, A. Bisazza, J. Moorkens, A. Guerberof, M. Nurminen, L. Marg, and M. L. Forcada, Eds., Lisboa, Portugal: European Association for Machine Translation, Nov. 2020, pp. 479–480. Accessed: Nov. 13, 2024. [Online]. Available: https://aclanthology.org/2020.eamt-1.61.
- Y. Tang, C. Tran, X. Li, P.-J. Chen, N. Goyal, V. Chaudhary, et al., “Multilingual Translation with Extensible Multilingual Pretraining and Finetuning,” Aug. 02, 2020, arXiv: arXiv:2008.00401. [CrossRef]
- Z.-Y. Dou and G. Neubig, “Word Alignment by Fine-tuning Embeddings on Parallel Corpora,” Aug. 12, 2021, arXiv: arXiv:2101.08231. [CrossRef]
- J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,” May 24, 2019, arXiv: arXiv:1810.04805. [CrossRef]
- L. A. Ramshaw and M. P. Marcus, “Text Chunking using Transformation-Based Learning,” May 23, 1995, arXiv: arXiv:cmp-lg/9505040. [CrossRef]
- R. Nayak, “Focal Loss : A better alternative for Cross-Entropy,” Medium. Accessed: Nov. 15, 2024. [Online]. Available: https://towardsdatascience.com/focal-loss-a-better-alternative-for-cross-entropy-1d073d92d075.
- A. Idrissi-Yaghir, A. Dada, H. Schäfer, K. Arzideh, G. Baldini, J. Trienes, et al., “Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding,” in Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), N. Calzolari, M.-Y. Kan, V. Hoste, A. Lenci, S. Sakti, and N. Xue, Eds., Torino, Italia: ELRA and ICCL, May 2024, pp. 3654–3665. Accessed: Dec. 10, 2024. [Online]. Available: https://aclanthology.org/2024.lrec-main.324.

| Model | Dataset | Entity Types | Number of Tokens (train/dev/test) | F1-score |
|---|---|---|---|---|
| i2b2 clinical model | i2b2-2010 corpus | Problem, Test, Treatment | 106,597/44,672/269,954 | 88.13 |
| Facebook MBART (ML-FT N to 1) |
Helsinki NLP | |
|---|---|---|
| De | 41.5 | 47.5 |
| El | / | 56.4 |
| Es | 28.6 | 54.9 |
| It | 43.9 | 53.9 |
| Pl | 32.9 | 47.5 |
| Pt | 49.3 | 50.4 |
| Sl | 33.9 | / |
| BERT-BASE-CASED | |
|---|---|
| max sequence length | 128 |
| batch size | 64 |
| learning rate | 3e-05 |
| warmup steps | 66,540 |
| checkpoint every | 2000 |
| weight decay | 0.01 |
| max number of train epochs | 200 |
| layers | 12 |
| hidden states | 768 |
| attention heads | 12 |
| vocab size | 28,996 |
| train time (hours) | 12-23 |
| loss | focal loss |
| number GPUs | 1 |
| GPU type | NVIDIA A100-PCIE-40GB |
| Metric | Formula |
|---|---|
| Precision (P) | P = TP / (TP + FP) |
| Recall (R) | R = TP / (TP + FN) |
| F1-Score (F1) | F1 = 2 * (P * R) / (P + R) |
| Language | Precision | Recall | F1 score | Loss | BLEU score |
|---|---|---|---|---|---|
| En | 80.85% | 79.30% | 80.07% | 0.24 | N/A |
| De | 78.94% | 78.46% | 78.70% | 0.30 | 47.5 |
| El | 70.69% | 67.58% | 69.10% | 0.41 | 56.4 |
| Es | 77.14% | 78.08% | 77.61% | 0.33 | 54.9 |
| It | 75.91% | 75.28% | 75.60% | 0.34 | 53.9 |
| Pl | 75.52% | 75.60% | 75.56% | 0.40 | 47.5 |
| Pt | 77.25% | 77.16% | 77.21% | 0.34 | 50.4 |
| Sl | 75.78% | 75.66% | 75.72% | 0.37 | 33.9 |
| Language | Existing model | Fine-tuned model |
|---|---|---|
| English | 67.69% | 80.07% |
| Italian | 57.06% | 75.60% |
| Spanish | 62.60% | 77.61% |
| Label in Spanish model | Mapped label |
|---|---|
| DISO | PROBLEM |
| PROC | TREATMENT |
| CHEM | TREATMENT |
| Label in Italian | Mapped label |
|---|---|
| SINTOMI COGNITIVI | PROBLEM |
| TRATTAMENTO FARMACOLOGICO | TREATMENT |
| DIAGNOSI E COMORBIDITA | PROBLEM |
| SINTOMI NEUROPSICHIATRICI | PROBLEM |
| TEST | TEST |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).