Literature DB >> 17238389

Data preparation framework for preprocessing clinical data in data mining.

Jau-Huei Lin1, Peter J Haug.   

Abstract

Electronic health records are designed to provide online transactional data recording and reporting services that support the health care process. The characteristics of clinical data as it originates during the process of clinical documentation, including issues of data availability and complex representation models can make data mining applications challenging. Data preprocessing and transformation are required before one can apply data mining to clinical data. In this article, an approach to data preparation that utilizes information from the data, metadata and sources of medical knowledge is described. Heuristic rules and policies are defined for these three types of supporting information. Compared with an entirely manual process for data preparation, this approach can potentially reduce manual work by achieving a degree of automation in the rule creation and execution. A pilot experiment demonstrates that data sets created through this approach lead to a better model learning results than a fully manual process.

Mesh:

Year:  2006        PMID: 17238389      PMCID: PMC1839316     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  4 in total

1.  The HELP hospital information system: update 1998.

Authors:  R M Gardner; T A Pryor; H R Warner
Journal:  Int J Med Inform       Date:  1999-06       Impact factor: 4.046

2.  Automation of a problem list using natural language processing.

Authors:  Stephane Meystre; Peter J Haug
Journal:  BMC Med Inform Decis Mak       Date:  2005-08-31       Impact factor: 2.796

3.  The meaning and use of the area under a receiver operating characteristic (ROC) curve.

Authors:  J A Hanley; B J McNeil
Journal:  Radiology       Date:  1982-04       Impact factor: 11.105

4.  Uniqueness of medical data mining.

Authors:  Krzysztof J Cios; G William Moore
Journal:  Artif Intell Med       Date:  2002 Sep-Oct       Impact factor: 5.326

  4 in total
  4 in total

1.  E-portfolio competency metadata: pilot study for a call to action.

Authors:  Sishir Rao; Andrew Swartz; Leila Obeid; Sevith Rao; Barbara Joyce; Sarah Whitehouse; Mathilda Horst; Jack Butler; Ryan Kinnen; Alexander Shepard; Ilan Rubinfeld
Journal:  J Med Syst       Date:  2010-05-06       Impact factor: 4.460

2.  A survey on statistical methods for health care fraud detection.

Authors:  Jing Li; Kuei-Ying Huang; Jionghua Jin; Jianjun Shi
Journal:  Health Care Manag Sci       Date:  2008-09

3.  Preprocessing structured clinical data for predictive modeling and decision support. A roadmap to tackle the challenges.

Authors:  José Carlos Ferrão; Mónica Duarte Oliveira; Filipe Janela; Henrique M G Martins
Journal:  Appl Clin Inform       Date:  2016-12-07       Impact factor: 2.342

4.  Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies.

Authors:  Krishna Margadhamane Gokhale; Joht Singh Chandan; Konstantinos Toulis; Georgios Gkoutos; Peter Tino; Krishnarajah Nirantharakumar
Journal:  Eur J Epidemiol       Date:  2020-08-27       Impact factor: 8.082

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.