Sunmoo Yoon, Basirah Taha, Suzanne Bakken.
Abstract
The purposes of this methodological paper are: 1) to describe data mining methods for building a classification model for a chronic disease using a U.S. behavior risk factor data set, and 2) to illustrate the application of these methods using a case study of depressive disorder. Methods described include: 1) six steps of data mining to build a disease model using classification techniques, 2) an innovative approach to analyzing high-dimensionality data, and 3) a visualization strategy to communicate with clinicians who are unfamiliar with advanced statistics. Our application of data mining strategies identified childhood experiences of living with the mentally ill and of sexual abuse, and limited usual activity, as the strongest correlates of depression among hundreds of variables. The methods that we applied may be useful to others wishing to build a classification model from complex, large-volume data sets for other health conditions.
Year: 2014 PMID: 24943527 PMCID: PMC4580372
Source DB: PubMed Journal: Stud Health Technol Inform ISSN: 0926-9630