| Literature DB >> 29495428 |
Runtong Zhang1, Fuzhi Chu2, Donghua Chen3, Xiaopu Shang4.
Abstract
Chinese Electronic Medical Records (EMRs) contains a large number of complex medical free text which includes a variety of information, such as temporal information, patients' symptoms and laboratory data. However, as an important knowledge base, these unstructured text data in EMR are hard to process directly by computer to support further medical research. This paper proposes a novel text structuring method to extract knowledge from EMR texts and reorganize them in chronological order according to the temporal information in the text. By implementing some entropy-based algorithms as contrast, experiments evaluate the performance of the proposed method, which indicates the new method can significantly reduce the complexity of EMR text. This work is significant in structuring the EMR free text into temporal-structured data for further medical analysis.Entities:
Keywords: Chinese; electronic medical records; information entropy; temporal information; text structuring method
Mesh:
Year: 2018 PMID: 29495428 PMCID: PMC5876947 DOI: 10.3390/ijerph15030402
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1The architecture of MTRTI.
Figure 2Overview of TDM-PC.
Figure 3Data flow and data format in the MTRTI.
Figure 4The changes of entropy in four phases (three selected EMR text samples).
Figure 5The changes of entropy with the increasing number of temporal nodes in EMR text.
Figure 6The changes of entropy with the increasing of the number of sentences, the number of part-of-speeches, and the number of segments.