Modern Clinical Text Mining: A Guide and Review

Bethany Percha

doi:10.20944/preprints202010.0649.v1

Submitted:

29 October 2020

Posted:

30 October 2020

Read the latest preprint version here

Abstract

Electronic health records (EHRs) are becoming a vital source of data for healthcare quality improvement, research, and operations. However, much of the most valuable information contained in EHRs remains buried in unstructured text. The field of clinical text mining has advanced rapidly in recent years, transitioning from rule-based approaches to machine learning and, more recently, deep learning. With new methods come new challenges, however, especially for those new to the field. This review provides an overview of clinical text mining for those who are encountering it for the first time (e.g. physician researchers, operational analytics teams, machine learning scientists from other domains). While not a comprehensive survey, it describes the state of the art, with a particular focus on new tasks and methods developed over the past few years. It also identifies key barriers between these remarkable technical advances and the practical realities of implementation at health systems and in industry.

Keywords:

text mining

;

natural language processing

;

electronic health records

;

clinical text

;

machine learning

Subject:

Computer Science and Mathematics - Information Systems

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Modern Clinical Text Mining: A Guide and Review

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe