Literature DB >> 30306898

Secondary Use of Healthcare Structured Data: The Challenge of Domain-Knowledge Based Extraction of Features.

Emmanuel Chazard1, Grégoire Ficheur1, Alexandre Caron1, Antoine Lamer2, Julien Labreuche2, Marc Cuggia3, Michaël Genin1, Guillaume Bouzille3, Alain Duhamel1.   

Abstract

Secondary use of clinical structured data takes an important place in healthcare research. It was first described by Fayyad as "knowledge discovery in databases". Feature extraction is an important phase but received little attention. The objectives of this paper are: 1) to propose an updated representation of data reuse in healthcare, 2) to illustrate methods and objectives of feature extraction, and 3) to discuss the place of domain-specific knowledge.
MATERIAL AND METHODS: an updated representation is proposed. Then, a case study consists of automatically identifying acute renal failure and discovering risk factors, by secondary use of structured data. Finally, a literature review published par Meystre et al. is analyzed.
RESULTS: 1) we propose a description of data reuse in 5 phases. Phase 1 is data preprocessing (cleansing, linkage, terminological alignment, unit conversions, deidentification), it enables to construct a data warehouse. Phase 2 is feature extraction. Phase 3 is statistical and graphical mining. Phase 4 consists of expert filtering and reorganization of statistical results. Phase 5 is decision making. 2) The case study illustrates how time-dependent features can be extracted from laboratory results and drug administrations, using domain-specific knowledge. 3) Among the 200 papers cited by Meystre et al., the first and last authors were affiliated to health institutions in 74% (68% for methodological papers, and 79% for applied papers). DISCUSSION: features extraction has a major impact on success of data reuse. Specific knowledge-based reasoning takes an important place in feature extraction, which requires tight collaboration between computer scientists, statisticians, and health professionals.

Entities:  

Keywords:  Data reuse; data transformation; feature extraction

Mesh:

Year:  2018        PMID: 30306898

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  3 in total

1.  Is the survival of patients treated with ipilimumab affected by antibiotics? An analysis of 1585 patients from the French National hospital discharge summary database (PMSI).

Authors:  Pierre-Yves Cren; Nicolas Bertrand; Marie-Cécile Le Deley; Michaël Génin; Laurent Mortier; Pascal Odou; Nicolas Penel; Emmanuel Chazard
Journal:  Oncoimmunology       Date:  2020-11-22       Impact factor: 8.110

2.  Leveraging National Claims and Hospital Big Data: Cohort Study on a Statin-Drug Interaction Use Case.

Authors:  Aurélie Bannay; Mathilde Bories; Pascal Le Corre; Christine Riou; Pierre Lemordant; Pascal Van Hille; Emmanuel Chazard; Xavier Dode; Marc Cuggia; Guillaume Bouzillé
Journal:  JMIR Med Inform       Date:  2021-12-13

3.  Psychiatric Adverse Events Associated With Infliximab: A Cohort Study From the French Nationwide Discharge Abstract Database.

Authors:  Eve-Marie Thillard; Sophie Gautier; Evgeniya Babykina; Louise Carton; Ali Amad; Guillaume Bouzillé; Jean-Baptiste Beuscart; Grégoire Ficheur; Emmanuel Chazard
Journal:  Front Pharmacol       Date:  2020-04-22       Impact factor: 5.810

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.