| Literature DB >> 27583845 |
Azra Ramezankhani1, Ali Kabir, Omid Pournik, Fereidoun Azizi, Farzad Hadaegh.
Abstract
Hypertension is a critical public health concern worldwide. Identification of risk factors using traditional multivariable models has been a field of active research. The present study was undertaken to identify risk patterns associated with hypertension incidence using data mining methods in a cohort of Iranian adult population.Data on 6205 participants (44% men) age > 20 years, free from hypertension at baseline with no history of cardiovascular disease, were used to develop a series of prediction models by 3 types of decision tree (DT) algorithms. The performances of all classifiers were evaluated on the testing data set.The Quick Unbiased Efficient Statistical Tree algorithm among men and women and Classification and Regression Tree among the total population had the best performance. The C-statistic and sensitivity for the prediction models were (0.70 and 71%) in men, (0.79 and 71%) in women, and (0.78 and 72%) in total population, respectively. In DT models, systolic blood pressure (SBP), diastolic blood pressure, age, and waist circumference significantly contributed to the risk of incident hypertension in both genders and total population, wrist circumference and 2-h postchallenge plasma glucose among women and fasting plasma glucose among men. In men, the highest hypertension risk was seen in those with SBP > 115 mm Hg and age > 30 years. In women those with SBP > 114 mm Hg and age > 33 years had the highest risk for hypertension. For the total population, higher risk was observed in those with SBP > 114 mm Hg and age > 38 years.Our study emphasizes the utility of DTs for prediction of hypertension and exploring interaction between predictors. DT models used the easily available variables to identify homogeneous subgroups with different risk pattern for the hypertension.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27583845 PMCID: PMC5008529 DOI: 10.1097/MD.0000000000004143
Source DB: PubMed Journal: Medicine (Baltimore) ISSN: 0025-7974 Impact factor: 1.889
Baseline characteristics of male population: Tehran Lipid and Glucose Study (1999–2012).
Baseline characteristics of female population: Tehran Lipid and Glucose Study (1999–2012).
Performance of the models in the male, female, and total population: Tehran Lipid and Glucose Study (1999–2012).
Figure 1ROC curves and ROCCH for the 3 classifiers on 3 testing datasets, Tehran Lipid and Glucose Study (1999–2012). ROC = receiver operating characteristics, ROCCH = ROC Convex Hull.
Figure 2Decision tree models for prediction of hypertension derived from training dataset: (A) female population and (B) male population. Tehran Lipid and Glucose Study (1999–2012). 2h-PCPG = 2-h postchallenge plasma glucose (mmol/L), DBP = diastolic blood pressure (mm Hg), SBP = systolic blood pressure (mm Hg), WAIST = waist circumference (cm).
Figure 3Decision tree models for prediction of hypertension in entire population derived from training dataset: Tehran Lipid and Glucose Study (1999–2012). DBP = diastolic blood pressure (mm Hg), FPG = fasting plasma glucose (mmol/L), SBP = systolic blood pressure (mm Hg), WAIST = waist circumference (cm).