| Literature DB >> 30708220 |
Minoo Modaresnezhad1, Ali Vahdati2, Hamid Nemati3, Ali Ardestani4, Fereidoon Sadri5.
Abstract
Utilization of existing clinical data for improving patient outcomes poses a number of challenging and complex problems involving lack of data integration, the absence of standardization across inhomogeneous data sources and computationally-demanding and time-consuming exploration of very large datasets. In this paper, we will present a robust semantic data integration, standardization and dimensionality reduction method to tackle and solve these problems. Our approach enables the integration of clinical data from diverse sources by resolving canonical inconsistencies and semantic heterogeneity as required by the National Library of Medicine's Unified Medical Language System (UMLS) to produce standardized medical data. Through a combined application of rule-based semantic networks and machine learning, our approach enables a large reduction in dimensionality of the data and thus allows for fast and efficient application of data mining techniques to large clinical datasets. An example application of the techniques developed in our study is presented for the prediction of bariatric surgery outcomes.Entities:
Keywords: Data integration; Data standardization; Dimensionality reduction; Machine learning; Medical informatics; Medical information systems; Semantic integration; UMLS
Mesh:
Year: 2019 PMID: 30708220 DOI: 10.1016/j.compbiomed.2019.01.019
Source DB: PubMed Journal: Comput Biol Med ISSN: 0010-4825 Impact factor: 4.589