Literature DB >> 21167313

Data mining methods for classification of Medium-Chain Acyl-CoA dehydrogenase deficiency (MCADD) using non-derivatized tandem MS neonatal screening data.

Tim Van den Bulcke1, Paul Vanden Broucke, Viviane Van Hoof, Kristien Wouters, Seppe Vanden Broucke, Geert Smits, Elke Smits, Sam Proesmans, Toon Van Genechten, François Eyskens.   

Abstract

Newborn screening programs for severe metabolic disorders using tandem mass spectrometry are widely used. Medium-Chain Acyl-CoA dehydrogenase deficiency (MCADD) is the most prevalent mitochondrial fatty acid oxidation defect (1:15,000 newborns) and it has been proven that early detection of this metabolic disease decreases mortality and improves the outcome. In previous studies, data mining methods on derivatized tandem MS datasets have shown high classification accuracies. However, no machine learning methods currently have been applied to datasets based on non-derivatized screening methods. A dataset with 44,159 blood samples was collected using a non-derivatized screening method as part of a systematic newborn screening by the PCMA screening center (Belgium). Twelve MCADD cases were present in this partially MCADD-enriched dataset. We extended three data mining methods, namely C4.5 decision trees, logistic regression and ridge logistic regression, with a parameter and threshold optimization method and evaluated their applicability as a diagnostic support tool. Within a stratified cross-validation setting, a grid search was performed for each model for a wide range of model parameters, included variables and classification thresholds. The best performing model used ridge logistic regression and achieved a sensitivity of 100%, a specificity of 99.987% and a positive predictive value of 32% (recalibrated for a real population), obtained in a stratified cross-validation setting. These results were further validated on an independent test set. Using a method that combines ridge logistic regression with variable selection and threshold optimization, a significantly improved performance was achieved compared to the current state-of-the-art for derivatized data, while retaining more interpretability and requiring less variables. The results indicate the potential value of data mining methods as a diagnostic support tool.
Copyright © 2010 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 21167313     DOI: 10.1016/j.jbi.2010.12.001

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  7 in total

Review 1.  Text Mining for Precision Medicine: Bringing Structure to EHRs and Biomedical Literature to Understand Genes and Health.

Authors:  Michael Simmons; Ayush Singhal; Zhiyong Lu
Journal:  Adv Exp Med Biol       Date:  2016       Impact factor: 2.622

2.  Explainable artificial intelligence models using real-world electronic health record data: a systematic scoping review.

Authors:  Seyedeh Neelufar Payrovnaziri; Zhaoyi Chen; Pablo Rengifo-Moreno; Tim Miller; Jiang Bian; Jonathan H Chen; Xiuwen Liu; Zhe He
Journal:  J Am Med Inform Assoc       Date:  2020-07-01       Impact factor: 4.497

3.  Text Mining of the Electronic Health Record: An Information Extraction Approach for Automated Identification and Subphenotyping of HFpEF Patients for Clinical Trials.

Authors:  Siddhartha R Jonnalagadda; Abhishek K Adupa; Ravi P Garg; Jessica Corona-Cox; Sanjiv J Shah
Journal:  J Cardiovasc Transl Res       Date:  2017-06-05       Impact factor: 4.132

Review 4.  A review of approaches to identifying patient phenotype cohorts using electronic health records.

Authors:  Chaitanya Shivade; Preethi Raghavan; Eric Fosler-Lussier; Peter J Embi; Noemie Elhadad; Stephen B Johnson; Albert M Lai
Journal:  J Am Med Inform Assoc       Date:  2013-11-07       Impact factor: 4.497

5.  Opportunities and challenges in machine learning-based newborn screening-A systematic literature review.

Authors:  Elaine Zaunseder; Saskia Haupt; Ulrike Mütze; Sven F Garbade; Stefan Kölker; Vincent Heuveline
Journal:  JIMD Rep       Date:  2022-03-23

6.  dbRUSP: An Interactive Database to Investigate Inborn Metabolic Differences for Improved Genetic Disease Screening.

Authors:  Gang Peng; Yunxuan Zhang; Hongyu Zhao; Curt Scharfe
Journal:  Int J Neonatal Screen       Date:  2022-08-29

Review 7.  Diagnosis support systems for rare diseases: a scoping review.

Authors:  Carole Faviez; Xiaoyi Chen; Nicolas Garcelon; Antoine Neuraz; Bertrand Knebelmann; Rémi Salomon; Stanislas Lyonnet; Sophie Saunier; Anita Burgun
Journal:  Orphanet J Rare Dis       Date:  2020-04-16       Impact factor: 4.123

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.