Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

A Survey of Data Processing of EMR (Electronic Medical Record) Based on Data Mining

Version 1 : Received: 11 August 2017 / Approved: 15 August 2017 / Online: 15 August 2017 (05:46:43 CEST)

How to cite: Sun, W.; Liu, F.; Cai, Z.; Fang, S.; Wang, G. A Survey of Data Processing of EMR (Electronic Medical Record) Based on Data Mining. Preprints 2017, 2017080055. https://doi.org/10.20944/preprints201708.0055.v1 Sun, W.; Liu, F.; Cai, Z.; Fang, S.; Wang, G. A Survey of Data Processing of EMR (Electronic Medical Record) Based on Data Mining. Preprints 2017, 2017080055. https://doi.org/10.20944/preprints201708.0055.v1

Abstract

At present, medical institutes generally use EMR to record patient's condition, including diagnostic information, procedures performed and treatment results. EMR has been recognized as a valuable resource for large scale analysis. However, EMR has the characteristics of diversity, incompleteness, redundancy and privacy, which make it difficult to carry out data mining and analysis directly. Therefore, it is necessary to preprocess the source data in order to improve data quality and improve the data mining results. Different types of data require different processing technologies. Most structured data commonly needs classic preprocessing technologies, including data cleansing, data integration, data transformation and data reduction. For semi-structured or unstructured data, such as medical text, containing more health information, it requires more complex and challenging processing methods. The task of information extraction for medical texts mainly includes NER (Named Entity Recognition) and RE (Relation Extraction). In this paper, we introduce the process of EMR processing, including data collection, data preprocessing, data mining, evaluation and knowledge application, analyze the current status of the key technologies, such as data preprocessing and data mining, and provide an overview of the application domains and prospects of EMR mining technologies. Finally, we summarize the existing problems in the research of EMR mining, and review the development trends.

Keywords

EMR; data preprocessing; text mining; information extraction; medical decision support system

Subject

Computer Science and Mathematics, Information Systems

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.