| Literature DB >> 29849998 |
Wencheng Sun1, Zhiping Cai1, Yangyang Li2, Fang Liu3, Shengqun Fang1, Guoyan Wang4.
Abstract
Currently, medical institutes generally use EMR to record patient's condition, including diagnostic information, procedures performed, and treatment results. EMR has been recognized as a valuable resource for large-scale analysis. However, EMR has the characteristics of diversity, incompleteness, redundancy, and privacy, which make it difficult to carry out data mining and analysis directly. Therefore, it is necessary to preprocess the source data in order to improve data quality and improve the data mining results. Different types of data require different processing technologies. Most structured data commonly needs classic preprocessing technologies, including data cleansing, data integration, data transformation, and data reduction. For semistructured or unstructured data, such as medical text, containing more health information, it requires more complex and challenging processing methods. The task of information extraction for medical texts mainly includes NER (named-entity recognition) and RE (relation extraction). This paper focuses on the process of EMR processing and emphatically analyzes the key techniques. In addition, we make an in-depth study on the applications developed based on text mining together with the open challenges and research issues for future work.Entities:
Mesh:
Year: 2018 PMID: 29849998 PMCID: PMC5911323 DOI: 10.1155/2018/4302425
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Figure 1EMR data processing flow.
Figure 2Process of data preprocessing on EMR mining.
Figure 3Progress of text mining.
Result on the test set of the ScienceIE dataset, using the official train/dev/test split.
| Relation | Precision | Recall | F1-score |
|---|---|---|---|
| Synonym of | 0.820 | 0.813 | 0.816 |
| Hyponym of | 0.455 | 0.421 | 0.437 |
| Microaveraged | 0.658 | 0.633 | 0.645 |
Figure 4Modules of proposed system: between-sentence, sectime-event, and within-sentence.