Literature DB >> 15126220

Dental data mining: potential pitfalls and practical issues.

S A Gansky1.   

Abstract

Knowledge Discovery and Data Mining (KDD) have become popular buzzwords. But what exactly is data mining? What are its strengths and limitations? Classic regression, artificial neural network (ANN), and classification and regression tree (CART) models are common KDD tools. Some recent reports (e.g., Kattan et al., 1998) show that ANN and CART models can perform better than classic regression models: CART models excel at covariate interactions, while ANN models excel at nonlinear covariates. Model prediction performance is examined with the use of validation procedures and evaluating concordance, sensitivity, specificity, and likelihood ratio. To aid interpretation, various plots of predicted probabilities are utilized, such as lift charts, receiver operating characteristic curves, and cumulative captured-response plots. A dental caries study is used as an illustrative example. This paper compares the performance of logistic regression with KDD methods of CART and ANN in analyzing data from the Rochester caries study. With careful analysis, such as validation with sufficient sample size and the use of proper competitors, problems of naïve KDD analyses (Schwarzer et al., 2000) can be carefully avoided.

Entities:  

Mesh:

Year:  2003        PMID: 15126220     DOI: 10.1177/154407370301700125

Source DB:  PubMed          Journal:  Adv Dent Res        ISSN: 0895-9374


  6 in total

1.  Boosted classification trees result in minor to modest improvement in the accuracy in classifying cardiovascular outcomes compared to conventional classification trees.

Authors:  Peter C Austin; Douglas S Lee
Journal:  Am J Cardiovasc Dis       Date:  2011-04-23

2.  Caries Risk Assessment Item Importance: Risk Designation and Caries Status in Children under Age 6.

Authors:  Benjamin W Chaffee; John D B Featherstone; Stuart A Gansky; Jing Cheng; Ling Zhan
Journal:  JDR Clin Trans Res       Date:  2016-05-05

Review 3.  At the crossroads of oral health inequities and precision public health.

Authors:  Stuart A Gansky; Sarah Shafik
Journal:  J Public Health Dent       Date:  2019-05-07       Impact factor: 1.821

4.  Using methods from the data-mining and machine-learning literature for disease classification and prediction: a case study examining classification of heart failure subtypes.

Authors:  Peter C Austin; Jack V Tu; Jennifer E Ho; Daniel Levy; Douglas S Lee
Journal:  J Clin Epidemiol       Date:  2013-02-04       Impact factor: 6.437

5.  Decision Tree Approach to the Impact of Parents' Oral Health on Dental Caries Experience in Children: A Cross-Sectional Study.

Authors:  Shinechimeg Dima; Kung-Jeng Wang; Kun-Huang Chen; Yung-Kai Huang; Wei-Jen Chang; Sheng-Yang Lee; Nai-Chia Teng
Journal:  Int J Environ Res Public Health       Date:  2018-04-06       Impact factor: 3.390

6.  Prediction of Early Childhood Caries Based on Single Nucleotide Polymorphisms Using Neural Networks.

Authors:  Katarzyna Zaorska; Tomasz Szczapa; Maria Borysewicz-Lewicka; Michał Nowicki; Karolina Gerreth
Journal:  Genes (Basel)       Date:  2021-03-24       Impact factor: 4.096

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.