| Literature DB >> 35390039 |
Mohammad Ziaul Islam Chowdhury1,2,3, Iffat Naeem1, Hude Quan1, Alexander A Leung1,4, Khokan C Sikdar5, Maeve O'Beirne2, Tanvir C Turin1,2.
Abstract
OBJECTIVE: We aimed to identify existing hypertension risk prediction models developed using traditional regression-based or machine learning approaches and compare their predictive performance.Entities:
Mesh:
Year: 2022 PMID: 35390039 PMCID: PMC8989291 DOI: 10.1371/journal.pone.0266334
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1PRISMA diagram for systematic review of studies presenting hypertension prediction models developed in the general population.
Characteristics of included studies that describe traditional regression-based hypertension prediction models.
| Study | Location Model Developed/ Ethnicity | Study Design | Age | Gender | Events (n)/Total Participants (N) | Definition of Outcome Predicted/Hypertension | Duration of Follow-up |
|---|---|---|---|---|---|---|---|
| Pearson et al. [ | USA/Mixed, mainly Whites | Prospective cohort | ≤ 25 years | Male only | 114/1130 | Self-reported use of blood pressure-lowering medications | 30 years |
| Parikh et al. [ | USA/Mainly Whites | Prospective cohort | 20–69 years | Both | 796/1717 | SBP ≥ 140 mmHg or DBP ≥ 90 mmHg or use of BP-lowering medications | Median 3.8 years |
| Paynter et al. [ | USA/ Whites and Blacks | Prospective cohort | 45–64 years | Female only | Derivation cohort: 1935/9427 Validation cohort: 1068/5395 | Self-report or SBP ≥ 140 mmHg or DBP ≥ 90 mmHg | 8 years |
| Kivimäki et al. [ | England/Mainly Whites | Prospective cohort | 35–68 years | Both | 1258/8207 | SBP ≥ 140 mmHg or DBP ≥ 90 mmHg or use of BP-lowering medications | Median 5.6 years |
| Kivimäki et al. [ | England/Mainly Whites | Prospective cohort | 36–68 years | Both | Derivation cohort: 614/4135 Validation cohort: 438/2785 | SBP ≥ 140 mmHg or DBP ≥ 90 mmHg or use of antihypertensive medications | Median 5.8 years |
| Kshirsagar et al. [ | USA/Mixed but mainly Whites | Prospective cohort | 45–64 years | Both | 3795/11,407 (7610 for derivation sample and 3692 for the validation sample) | SBP ≥ 140 mmHg or DBP ≥ 90 mmHg or reported use of BP-lowering medications | Up to 9 years |
| Bozorgmanesh et al., [ | Iran/Asians | Prospective cohort | ≥ 20 years | Both | 805/4656 | SBP ≥ 140 mmHg or DBP ≥ 90 mmHg or reported use of BP-lowering medications | 6 years |
| Chien et al. [ | Taiwan/Chinese | Prospective cohort | ≥ 35 years | Both | 1029/2506 | SBP ≥ 140 mmHg or DBP ≥ 90 mmHg or reported use of BP-lowering medications | Median 6.15 years |
| Fava et al. [ | Sweden/Whites | Prospective cohort | Middle-aged | Both | NR/10,781 | SBP ≥ 140 mmHg or DBP ≥ 90 mmHg or reported use of BP-lowering medications | Over average 23-years |
| Lim et al. [ | Korea/Asians | Prospective cohort | 40–69 years | Both | 819/4747. Derivation cohort: 483/2840 Validation cohort: 336/1907 | SBP ≥ 140 mmHg or DBP ≥ 90 mmHg or reported use of BP lowering medications | 4 years |
| Choi et al. [ | USA/Mexicans | Prospective cohort | NR | Both | NR/443 | SBP >140 mm Hg, DBP >90 mm Hg, or use of antihypertensive medication | NR |
| Lim et al. [ | Korean/Asians | Prospective cohort | 40–69 years | Both | NR/5632 | SBP ≥140 mm Hg or DBP ≥90 mm Hg or use of antihypertensive medication | 4-year |
| Otsuka et al. [ | Japan/Asians | Prospective cohort | 19–63 years | Male only | 1633/15,025 | SBP ≥140 mm Hg or DBP ≥90 mm Hg or use of antihypertensive medication | Median 4 years |
| Asgari et al. [ | Iran/Asians | Prospective cohort | ≥ 20 years | Both | ISH: 235/4574 IDH: 470/4809 | ISH: SBP ≥ 140 mmHg and DBP < 90 mmHg IDH: SBP <140 mmHg and DBP ≥ 90 mmHg | ISH: Median 9.57 years, IDH: Median 9.62 years |
| Sathish et al. [ | India/Asians | Prospective cohort | 15–64 years | Both | 70/297 | SBP ≥140 mm Hg or DBP ≥90 mm Hg or use of antihypertensive medication | Mean 7.1 years |
| Lee et al. [ | Korea/Asians | Prospective cohort | 40–69 years | Both | Men: 384/2128 Women: 374/2326 | SBP ≥140 mm Hg or DBP ≥90 mm Hg or use of antihypertensive medication | 4 years |
| Lee et al. [ | Korea/Asians | Cross-sectional | 21–85 years | Both | NR/12,789 | SBP ≥ 140 mmHg and/or DBP ≥ 90 mmHg or physician-diagnosed hypertension | NR |
| Kanegae et al. [ | Japan/Asians | Prospective cohort | 18–83 years | Both | 7402/63,495 | SBP/DBP ≥ 140/90 mm Hg and/or the initiation of antihypertensive medications with self-reported hypertension | Mean 3.4 years |
| Chen et al. [ | China/Asians | Prospective cohort | Average age 41.73 years (men), 39.49 years (women) | Both | 2021 (men), 764 (women) 7537 (men), 4960 (women) | First occurrence at any follow-up medical check-up of SBP > 140 mm Hg or DBP > 90 mm Hg or of the person taking antihypertensive medication | Median 4.0 years |
| Díaz-Gutiérrez et al. [ | Spain/Spanish | Prospective cohort | Age presented according to the number of healthy lifestyle factors | Both | 1406/14057 | SBP ≥ 130 mmHg, DBP≥ 80 mmHg, or use of any antihypertensive drug | Median 10.2 years |
| Wang et al. [ | China/Asians | Longitudinal | 18–90 years | Both | 882/5265 (derivation) NR/1597 (validation) | Taking antihypertensive drugs or SBP at least 140 mmHg or DBP at least 90 mmHg | Average follow-up of 8.05 ± 5.27 years |
| Niiranen et al. [ | Finland/Whites | Prospective cohort | ≥ 30 years | Both | NR/2045 | BP ≥ 140/90 mm Hg and/or antihypertensive medication | 11 years |
| Yeh et al. [ | Taiwan/Chinese | Prospective cohort | ≥ 20 years | Both | 88/2374 | SBP ≥140 mm Hg or DBP ≥90 mm Hg | Average 3.23 years |
| Syllos et al. [ | Brazil/South Americans | Prospective cohort | 35–74 years | Both | 1088/8027; Derivation: 4825 Validation: 3202 | SBP ≥ 140 mm Hg, DBP ≥ 90 mm Hg or the use of blood pressure-lowering medications | 4 years |
| Wang et al. [ | China/Asians | Prospective cohort | ≥ 18 years | Both | 1658/9034 | SBP ≥ 140 mm Hg, DBP ≥ 90 mm Hg or the use of blood pressure-lowering medications | Median 6 years |
| Xu et al. [ | China/Asians | Prospective cohort | 35–74 years | Both | 1036/4796 (Training) | SBP ≥ 140 mm Hg and/or DBP ≥ 90 mm Hg, and/or a diagnosis of hypertension by a physician and currently receiving anti-hypertension treatment | 6 years |
| Kadomatsu et al. [ | Japan/Asians | Prospective cohort | Mean age 51.3 years | Both | 324/3936 | SBP ≥ 140 mm Hg, DBP ≥ 90 mm Hg, or use of antihypertensive medication | Median 5 years |
| Wang et al. [ | USA/Multi-ethnic | Telephone-based health survey | ≥ 18 years | Both | NR/308,711 | NR | NR |
| Muntner et al. [ | USA/ Multi-ethnic (Whites, Blacks, Hispanics, and Asians) | NR | 45–84 years | Both | 849/3013 | The first study visit, subsequent to baseline, at which SBP ≥ 140 mm Hg and/or DBP ≥ 90 mm Hg and/or the initiation of antihypertensive medication | Median of 1.6 years and 4.8 years |
| Ture et al. [ | Turkey/ Europeans | Retrospective | Average 48.2 years (hypertension) 46.5 (control) | Both | 694 (452 patients with hypertension and 242 controls) | Average of 3 or more DBP measurements on at least 3 subsequent visits is ≥ 90 mmHg or when the average of multiple SBP readings on 3 or more subsequent visits is consistently ≥ 140 mmHg | NR |
| Yamakado et al. [ | Japan/Asians | Prospective cohort | ≥ 20 years | Both | 424/2637 | SBP ≥ 140 mm Hg or DBP ≥ 90 mm Hg or use of antihypertensive medication | 4 years |
| Qi et al. [ | China/Asians | Case-control | Case: 64.48 ± 8.53 years; Control: 64.23 ± 10.13 years | Both | Patients: NR/1009 Controls = NR/756 | SBP ≥ 140 mm Hg or DBP ≥ 90 mm Hg or use of antihypertensive medication | NR |
| Lu et al. [ | China/Asians | Prospective cohort | 35–74 years | Both | 2559/7724 | SBP ≥ 140 mm Hg or DBP ≥ 90 mm Hg or use of antihypertensive medication | Mean 7.9 years |
| Zhang et al. [ | China/Asians | Prospective cohort | 18–88 years | Both | 3793/17,471 | SBP ≥ 140 mm Hg or DBP ≥ 90 mm Hg or use of antihypertensive medication | 5 years |
NR, not reported; SBP, systolic blood pressure; DBP, diastolic blood pressure; BP, blood pressure; ISH, isolated systolic hypertension; IDH, isolated diastolic hypertension
The features of hypertension prediction models developed using a traditional regression-based modeling approach.
| Study | Risk Factors Included | Modeling Method | Discrimination | Calibration | Model Validation: Internal or External |
|---|---|---|---|---|---|
| Pearson et al. [ | Age, SBP at baseline, paternal history of hypertension, and BMI | Cox regression | NR | NR | NR |
| Parikh et al. [ | Age, sex, SBP, DBP, BMI, parental hypertension, and cigarette smoking | Weibull regression | C-statistic = 0.788 [0.733–0.803] | HL Chi-square = 4.35 (p = 0.88) | Internal, apparent |
| Paynter et al. [ | Inclusive Model: Age, ethnicity, BMI, total grain intake, SBP, DBP, apolipoprotein B, lipoprotein (a), and C-reactive protein. Simplified Model with Lipids: Age, BMI, SBP, DBP, ethnicity, and total to HDL- cholesterol ratio Simplified Model: Age, BMI, ethnicity, SBP, and DBP | Logistic regression | Inclusive Model: C-statistic = 0.705; Simplified Model with Lipids: C-statistic = 0.705; Simplified Model: C-statistic = 0.703 | Inclusive Model: HL Chi-square = 24.6 (p = 0.002), Simplified Model with Lipids: HL Chi-square = 20.7 (p = 0.008), Simplified Model: HL Chi-square = 12.3 (p = 0.140) | Internal, split-sample 2:1 |
| Kivimäki et al. [ | Age, sex, SBP, DBP, BMI, parental hypertension, and cigarette smoking | Weibull regression | C-statistic = 0.804 | HL Chi-square = 14.3 (p = 0.88) | Internal, split-sample 6:4 |
| Kivimäki et al. [ | Repeat Measure BP Model: Age, sex, BMI, parental hypertension, repeat measures of BP, and cigarette smoking Average BP Model: Age, sex, BMI, parental hypertension, average BP, and cigarette smoking | Weibull regression | Repeat Measure BP Model: C-statistic = 0.799; Average BP Model: C-statistic = 0.794 | Repeat Measure BP Model: HL Chi-square = 6.5; Average BP Model: NR | Internal, split-sample 6:4 |
| Kshirsagar et al. [ | Age, level of SBP or DBP, smoking, family history of hypertension, diabetes mellitus, BMI, female sex, and lack of exercise | Logistic regression | AUC = 0.742 (3years), 0.750 (6 years), 0.791 (9 years), and 0.775 (ever) | NR | Internal, split-sample 2:1 |
| Bozorgmanesh et al., [ | For Women: age, waist circumference, DBP, SBP, and family history of premature CVD | Weibull regression | C-statistic = 0.731 [0.706–0.755] for women, C-statistic = 0.741 [0.719–0.763] for men | HL Chi-square = 7.8 (p = 0.554) for women; HL Chi-square = 8.8 (p = 0.452) for men | NR |
| Chien et al. [ | Clinical Model: Age, gender, BMI, SBP, and DBP | Weibull regression | Clinical Model: AUC = 0.732 [0.712–0.752] (point based), AUC = 0.737 (coefficient based); Biochemical Model: AUC = 0.735 [0.715–0.755] (point based), AUC = 0.74 (coefficient based) | Clinical Model: HL Chi-square = 8.3, p = 0.40 (point based), 10.9, p = 0.21 (coefficient based); Biochemical Model: HL Chi-square = 13.2, p = 0.11 (point based), 6.4, p = 0.60 (coefficient based) | Internal, fivefold cross- validation |
| Fava et al. [ | Age, sex, sex times age, heart rate, obesity, diabetes, hypertriglyceridemia, prehypertension, family history of hypertension, sedentary in spare time, problematic alcohol behavior, married or living as a couple, high-level non-manual work, smoking | Logistic regression | AUC = 0.662 [0.651–0.672] | NR | NR |
| Lim et al. [ | Age, sex, smoking, SBP, DBP, parental hypertension, BMI | Weibull regression | AROC = 0.791 [0.766–0.817] | HL Chi-square = 4.17 (p = 0.8415) | Internal, split-sample 6:4 |
| Choi et al. [ | Age, gender, smoke, age x gender, Rs10510257 (AA), Rs10510257 (AG), Rs1047115 (GT) | GEE for marginal model and logistic random effect model for conditional model | Marginal model: AUC = 0.839 (with SNPs), 0.826 (without SNPs) Conditional model: AUC = 0.973 (with SNPs), 0.973 (without SNPs) | NR | NR |
| Lim et al. [ | Traditional variables: age, gender, SBP, current smoking status, family history of hypertension, BMI, and one genetic variable (cGRS or wGRS derived from the 4 SNPs): rs995322, rs17249754, rs1378942, rs12945290 | Logistic regression | Derivation cohort: C-statistic = 0.810 [0.796–0.824] (model without wGRS, C-statistic = 0.811 [0.797–0.825] (model with wGRS); Validation cohort: Mean C-statistic = 0.811 [0.809–0.816] | HL Chi-square = 6.916 (model without wGRS), HL Chi-square = 5.711 (model with wGRS) | Internal validation, fivefold cross-validation |
| Otsuka et al. [ | Age, BMI, SBP and DBP, current smoking status, excessive alcohol intake, parental history of hypertension | Cox regression | Validation cohort: C-statistic = 0.861 [0.844–0.877] (model), C-statistic = 0.858 [0.840–0.876] (score) | Validation cohort: HL Chi-square = 15.2 (p = 0.085) (model), HL Chi-square = 9.30 (p = 0.41) (score) | Internal validation, split sample 4:1 |
| Asgari et al. [ | ISH: Age, SBP, BMI, 2 hours post-challenge plasma glucose IDH: Age, DBP, waist circumference, marital status, gender, HDL-C | Cox regression | ISH: C-statistic = 0.91, IDH: C-statistic = 0.76 | NR | NR |
| Sathish et al. [ | Age, sex, years of schooling, daily intake of fruits or vegetables, current smoking, alcohol use, BP, prehypertension, central obesity, history of high blood glucose | Logistic regression | AUC = 0.802 [0.748–0.856] | Hosmer-Lemeshow p = 0.940 | NR |
| Lee et al. [ | BMI, waist circumference, waist-to-hip ratio, waist-to-height ratio | Cox regression | Men: AROC = 0.58 [0.56–0.60] (BMI), 0.62 [0.60–0.64] (WC, WHR, WHtR) Women: AROC = 0.57 [0.55–0.59] (BMI), 0.66 [0.64–0.68] (WC), 0.68 [0.66–0.70] (WHR, WHtR) | NR | NR |
| Lee et al. [ | Women: Height, age, neckC, axillaryC, ribC, waistC, pelvicC, rib_hip, waist_hip, pelvic_hip, rib_pelvic, axillary_rib, chest_rib, axillary_chest, forehead_neck (CFS), height, weight, BMI, age, chestC, forehead_hip, waist_hip, chest_pelvic, waist_pelvic, axillary_waist, forehead_rib, neck_axillary (LR-wrapper) | Logistic regression | Women: AUC = 0.713 (LR-CFS), 0.721 (LR-wrapper) Men: AUC = 0.637 (LR-CFS), 0.652 (LR-wrapper) | NR | Internal, 10-fold cross- validation |
| Kanegae et al. [ | Age, sex, BMI, SBP, DBP, low-density lipoprotein cholesterol, uric acid, proteinuria, current smoking, alcohol intake, eating rate, DBP by age, and BMI by age | Cox regression | C-statistic = 0.885 [0.865–0.903] | Greenwood-Nam-D’Agostino χ2 statistic = 13.6) | External validation |
| Chen et al. [ | Men: Age, BMI, SBP, DBP, gamma-glutamyl transferase, fasting blood glucose, drinking, age x BMI, age x DBP | Cox regression | Derivation: AUC = 0.761 [0.752–0.771] (men), 0.753 [0.741–0.765] (women) Validation: AUC = 0.760 [0.751–0.770] (men), 0.749 [0.737–0.761] (women) | NR | Internal, 10-fold cross-validation |
| Díaz-Gutiérrez et al. [ | No smoking, moderate-to-high physical activity, Mediterranean diet adherence, healthy BMI, moderate alcohol intake, and no binge drinking | Cox regression | NR | NR | NR |
| Wang et al. [ | Age, sex, education, marriage, smoking, drinking, BMI, energy, carbo, fat, protein | Multistate Markov model | NR | NR | Temporal validation |
| Niiranen et al. [ | Model 1: GRS Model 2: Model 1 + age + sex Model 3: Model 2 + smoking, diabetes, education, hyper-cholesterolemia, leisure-time exercise, and BMI | Multiple linear and logistic regression | C-index = 0.731 (Model 1) | NR | NR |
| Yeh et al. [ | Age, DM, and fibrinogen concentration (Men) | Cox regression | NR | NR | NR |
| Syllos et al. [ | Age, sex, educational level, parental history of hypertension, leisure-time physical activity, BMI, neck circumference, smoking, SBP, DBP | Logistic regression | AUC = 0.830 [0.810–0.849] | H-L Chi-square = 8.22, p = 0.41 | Internal, split sample 6:4 ratio |
| Wang et al. [ | Age, parental hypertension, SBP, DBP, BMI, and age by BMI | Logistic regression | C-index = 0.795 [0.7733–0.810] (Training set), C-index = 0.7914 [0.773–0.809] (Testing set) | H–L Chi-square = 7.747, P = 0.459 (Training set) | Internal, Bootstrap validation |
| Xu et al. [ | M1 Model: Age, SBP, DBP, hypertension parental history, WC, interaction item of age with WC, and interaction item of age with DBP W1 Model: Age, SBP, DBP, WC, fruit and vegetable intake, hypertension parental history, interaction item of age with WC, and interaction of age with DBP were included in W1 model | Cox regression | Testing Set Men: AUC = 0.771 [0.750–0.791] (M1) | Testing Set Men: Modified Nam-D’Agostino test Chi-square = 6.305, p = 0.708 (M1) Testing Set women: Modified Nam-D’Agostino test Chi-square = 6.783, p = 0.147(W1); 7.404, p = 0.115 (W2) | Internal, 10-fold cross-validation in training data and external in the testing data |
| Kadomatsu et al. [ | Age, sex, BMI, current smoking habit, ethanol consumption, presence of DM, parental hypertension history, SBP, DBP | Logistic regression | AUC = 0.826 [0.804–0.848] (Entire cohort validation) Median AUC = 0.83 [0.828–0.832] (Cross-validation) | H–L Chi-square = 7.06, p = 0.53, (Entire cohort validation); H–L Chi-square = 12.2 (Cross-validation) | Internal, split-sample cross-validation 6:4 ratio |
| Wang et al. [ | Exercise, diabetes, hyperlipemia, age, marriage, education, income, weight, height, sex, smoke, drink | Logistic regression | Accuracy, sensitivity, specificity, and AUC. AUC = 0.74±0.001 (logistic), Accuracy = 71.96% (logistic) | NR | Internal, split sample 7:3 ratio |
| Muntner et al. [ | SBP-alone model (7 SBP categories) | Repeated-measures Poisson regression model | SBP model: C-statistic = 0.768 [0.751–0.785] (1.6 years follow-up), 0.773 [0.775–0.791] (4.8 years follow-up); Age-specific DBP Model: C-statistic = 0.699 [0.681–0.717] (1.6 years follow-up), 0.691 [0.671–0.711] (4.8 years follow-up) | NR | NR |
| Ture et al. [ | Age, sex, family history of hypertension, smoking habits, lipoprotein (a), triglyceride, uric acid, total cholesterol, and BMI | Logistic regression, Flexible discriminant analysis, multivariate additive regression splines (degree 1), multivariate additive regression splines (degree 2) | Sensitivity, specificity, and predictive rate (PR) | NR | Internal, split sample 3:1 ratio |
| Yamakado et al. [ | PFAA Index 1: Leucine, alanine, tyrosine, asparagine, tryptophan, and glycine; PFAA Index 2: Isoleucine, alanine, tyrosine, phenylalanine, methionine, and histidine | Logistic regression | NR | NR | Internal, LOOCV and validation in a cohort dataset |
| Qi et al. [ | rs17030613, rs16849225, rs1173766, rs11066280, rs35444, rs880315, rs16998073, rs11191548, rs17249754 | Logistic regression | NR | NR | NR |
| Lu et al. [ | Model1: GRS+ (age, sex, and BMI); Model2: GRS +Model1 + smoking, drinking, pulse rate, and education; Model3: GRS+ Model2 + SBP and DBP | Logistic regression and Cox regression | Model1: C-statistic = 0.650 [0.637–0.663] (without GRS), 0.655 [0.642–0.668] (with GRS) Model 2: C-statistic = 0.683 [0.670–0.695] (without GRS), 0.687 [0.675–0.700] (with GRS) Model 3: C-statistic = 0.774 [0.763–0.785] (without GRS), 0.777 [0.766–0.787] (with GRS) | NR | NR |
| Zhang et al. [ | Five latent factors extracted from 11 biomarkers (BMI, SBP, DBP, FBG, TG, HDL-C, Hb, HCT, WBC, LC, NGC): inflammatory factor, blood viscidity factor, insulin resistance factor, blood pressure factor, lipid resistance factor, and age | Cox regression | Derivation cohort: AUC = 0.755 [0.746–0.763] (men), AUC = 0.801 [0.792–0.810] (women) Validation cohort: AUC = 0.755 [0.746–0.763] (men), AUC = 0.800 [0.791–0.810] (women) | NR | Internal, 10-fold cross- validation |
NR, not reported; SBP, systolic blood pressure; DBP, diastolic blood pressure; BP, blood pressure; BMI, body mass index; CVD, cardiovascular disease; HDL, high-density lipoprotein; WC, waist circumference; DM, diabetes mellitus; WHR, waist to hip ratio; WHtR, waist to height ratio; ISH, isolated systolic hypertension; IDH, isolated diastolic hypertension; AUC, area under the curve; AROC, area under the receiver operating characteristic curve; LR, logistic regression; GEE, Generalized estimating equations; LOOCV, leave-one-out cross-validation: HL, Hosmer Lemeshow; GRS, genetic risk score; SNP, single-nucleotide polymorphism; CFS, correlation-based feature subset selection; FBG, fasting blood glucose; TG, triglycerides; HDL-C, high-density lipoprotein cholesterol; Hb, hemoglobin; HCT, hematocrit; WBC, white blood cell count; LC, lymphocyte count; NGC, neutrophil granulocyte count
Fig 2Conventional risk factors considered by traditional regression-based models (A) and by machine learning-based models (B).
Fig 3Forest plot of traditional regression-based models with 95% prediction interval.
Fig 4Graphical summary presenting the percentage of hypertension risk prediction studies rated by level of concern, risk of bias (ROB), and applicability for each domain.
Information about existing hypertension prediction models developed using machine learning algorithms from selected studies.
| Study | Data Location | Sample Size | Risk Factors Included | Outcome Considered | Definition of Outcome Predicted | Modeling Method Used | Performance Measure |
|---|---|---|---|---|---|---|---|
| Falk CT [ | USA | 300 records each for training and validating | Seven input values: sex; age; total cholesterol; fasting glucose; fasting HDL; fasting triglycerides; body mass index (BMI) | High blood pressure | SBP > 140 mm Hg or DBP > 90 mm Hg | Two neural network programs: NNdriver and SNNS | Classification success rate. Training: 91%-98%, (Strategy 1), 70%-87% (Strategy 2); Validation: 59% (Strategy 1), 63% (Strategy 2) |
| Farran et al. [ | Kuwait | 10,632 (6759 hypertensive and 3873 non-hypertensive) | BMI, age, ethnicity, and diagnosis for diabetes | Incident hypertension, type 2 diabetes, and comorbidity | NR | Logistic regression (LR), k-nearest neighbors, support vector machines, and multifactor dimensionality reduction (MDR) | Classification accuracy: 90% (hypertension) |
| Huang et al. [ | China | Training: 2438, Validation: 616 | High educational level, predominantly sedentary work, positive family history of HTN, overweight, dysarteriotony, alcohol intake, salty diet, more vegetable and fruit intake, meat consumption, and regular physical exercise | Hypertension | Average SBP or DBP > 139 mmHg or > 89 mmHg, respectively | Logistic regression model (LRM) and artificial neural network (ANN) model (back-propagated delta rule networks) | AUC: 0.900 ± 0.014 (ANN model) |
| Kwong et al. [ | NR | 498 | Age, BMI, exercise level, alcohol consumption level, smoking status, stress level, and salt intake level | Systolic blood pressure (SBP) | BP readings > 140 mmHg | Two artificial neural networks (ANN): Back-propagation (BP) neural network and radial basis function (RBF) neural network validate the prediction system | Average Accuracy, BP ANN: 94.28% (male), 93.74% (female) |
| Polak et al. [ | USA | 159,989 records | High blood cholesterol, number of cigarettes smoked now, age, weight, height, sex | Hypertension | NR | Artificial neural network (ANN): Around 250 architectures of backpropagation (BP) and fuzzy networks | Classification rate and AUROC, different values for different Nets architecture |
| Priyadarshini et al. [ | USA | NR | SBP, DBP, total cholesterol (TC), high-density lipoprotein (HDL), low-density lipoprotein (LDL), plasma glucose concentration (PGC), and heart rate (HR) | Hypertension attack | DBP or SBP > 90 mm Hg or > 120 mm Hg, respectively, for at least two measuring instances | Deep neural network model | Confusion/performance matrix formed out of four evaluating parameters: accuracy 88%, precision 92%, recall 82%, and F1 score 76% (average value over 20 iterations) |
| Sakr et al. [ | USA | 23,095 | Age, METS, resting systolic blood pressure, peak diastolic blood pressure, resting diastolic blood pressure, HX coronary artery disease, the reason for the test, history of diabetes, percentage HR achieved, race, history of hyperlipidemia, Aspirin use, hypertension response | Hypertension | NR | Six machine learning techniques: LogitBoost (LB), Bayesian network classifier (BN), locally weighted naïve Bayes (LWB), artificial neural network (ANN), support vector machine (SVM), and random tree forest (RTF) | AUC, F-Score, Sensitivity, Specificity, Precision, and RMSE. AUC (0.93), F-Score (86.70%), Sensitivity (69,96%) and Specificity (91.71%) for RTF model in 10-fold cross-validation AUC (0.88), Sensitivity (74.30%), Precision (73.50%), and F-Score (73.90%) for RTF model in holdout method |
| Tayefi et al. [ | Iran | 9078 | Age, gender, BMI, marital status, level of education, occupation status, depression and anxiety status, physical activity level, smoking status, LDL, triglyceride, total cholesterol, fasting blood glucose, uric acid, and hs-CRP in Model 1 | Hypertension | SBP of 140 mm Hg, DBP of 90 mm Hg, and/or current use of antihypertensive drugs | Decision tree | Accuracy, sensitivity, specificity, and area under the ROC curve (AUC): For Model 1, the values are 73%, 63%, 77% and 0.72, respectively, and for Model 2 were 70%, 61%, 74% and 0.68, respectively |
| Wu et al. [ | USA | 75 females and 165 males | Age, gender, serum cholesterol, fasting blood sugar and electrocardiographic signal, heart rate | Systolic blood pressure | SBP and DBP > 140 mm Hg and 90 mm Hg, respectively | Two neural network algorithms: back-propagation neural network and radial basis function network | The absolute difference (error) between the real value and predicted values |
| Wu et al. [ | NR | 498 | Age, BMI, gender, exercise level, alcohol consumption, stress level, salt intake level, smoke status, cholesterol, and blood glucose | Systolic blood pressure | SBP > 140 mm Hg | Two artificial neural networks: back-propagation neural network and radial basis function neural network | The average prediction errors (absolute difference between the predicted value and measured value): 51.9% for men and 52.5% for women (backpropagation neural network) |
| Ye et al. [ | USA | 823,627 (training cohort/retrospective cohort), 680,810 (validation cohort/prospective cohort) | Total 169 features: 2 demographic features, 14 socioeconomic characteristics, 30 diagnostic diseases, 6 laboratory tests, 98 medication prescriptions, and 19 clinical utilization measures | Incident essential hypertension | ICD, 9th Revision, Clinical Modification (ICD-9-CM) diagnosis codes from category 401 | A supervised machine learning and data mining tool, XGBoost | AUC = 0.917 (retrospective cohort), AUC = 0.870 (prospective cohort) |
| Zhang et al. [ | NR | A total of 15,628,501 sets of valid characteristic attributes data | Seven input features: right atrium (AVR), left atrium (AVL), anterior atrium (AVF), photoplethysmography (PPG), oxygen saturation (SPO2), pulse transit time (PTT), heart rate (HR) | Blood pressure | NR | CART (classification and regression tree) model | Four evaluation indexes: accuracy rate, root mean square error (RMSE), deviation rate, and the Theil inequality coefficient (TIC) |
| Völzke et al. [ | Germany | Training set: 803 Validation set: 802 | Age, mean arterial pressure, rs16998073, serum glucose, and urinary albumin | Incident hypertension | SBP ≥ 140 mmHg and DBP ≥ 90 mmHg | Bayesian network | Training set: AUC = 0.78 [0.74–0.82], Validation set: AUC = 0.79 [0.75–0.83], External validation set: AUC = 0.77 [0.74–0.80]; |
| Lee et al. [ | Korea | 12,789 | Women: Height, age, neckC, axillaryC, ribC, waistC, pelvicC, rib_hip, waist_hip, pelvic_hip, rib_pelvic, axillary_rib, chest_rib, axillary_chest, forehead_neck (CFS), height, age, foreheadC, neckC, hipC, axillary_hip, axillary_pelvic, chest_pelvic, chest_rib (NB-wrapper) | Hypertension and hypotension | SBP ≥ 140 mmHg and/or DBP ≥ 90 mmHg or physician-diagnosed hypertension | Naive Bayes algorithm (NB) | Women: AUC = 0.696 (NB-CFS), 0.713 (NB-wrapper) |
| Xu et al. [ | China | 4796 | M1 Model: Age, SBP, DBP, hypertension parental history, WC, interaction item of age with WC, and interaction item of age with DBP | Hypertension | SBP ≥ 140 mm Hg and/or DBP ≥ 90 mm Hg and/or a diagnosis of hypertension by a physician and currently receiving anti-hypertension treatment | Artificial neural network (ANN), naive Bayes classifier (NBC), and classification and regression tree (CART) | Testing Set Men: AUC = 0.773 [0.752–0.793] (ANN), 0.760 [0.738–0.781] (NBC), 0.722 [0.699–0.743] (CART) |
| Wang et al. [ | USA | 308,711 | Exercise, diabetes, hyperlipemia, age, marriage, education, income, weight, height, sex, smoke, drink | Hypertension | NR | Multi-layer perception neural network | Accuracy, sensitivity, specificity, and AUC. Average AUC = 0.77 with h vary from 8 to 11 (neural network); Accuracy = 72% (neural network) |
| Ture et al. [ | Turkey | 694 | Age, sex, family history of hypertension, smoking habits, lipoprotein (a), triglyceride, uric acid, total cholesterol, and BMI | Essential hypertension | The average of 3 or more DBP measurements on at least 3 subsequent visits is ≥ 90 mmHg, or when the average of multiple SBP readings on 3 or more subsequent visits is consistently ≥ 140 mmHg | Three decision trees (Chi-squared automatic interaction detector. Classification and regression tree, quick, unbiased, efficient statistical tree); two neural networks (multi-layer perceptron, radial basis function) | Sensitivity, specificity, and predictive rate (PR). Values not reported. |
| Zhao et al. [ | China/Asians | Total: 4759 (2411 hypertensive and 2,348 age-matched and sex-matched healthy controls) | MDR Model: 4-locus model consisted of the SNP KCNMB1-rs11739136, RGS2-rs34717272, PRKG1-rs1881597, and MYLK-rs36025624; CART Model: RGS2, PRKG1, KCNMB1, and MYLK | Hypertension | Average SBP ≥ 150 mm Hg, an average DBP ≥ 95 mm Hg, or current use of antihypertensive medication | Multifactor-dimensionality reduction (MDR) and classification and regression trees (CART) | MDR Model: Accuracy = 52.98%, cross-validation consistency = 9.7 |
| Wang et al. [ | China/Asians | 1009 hypertensive patients and 756 normotensive controls | Genes | Hypertension | Mean SBP ≥ 140 mmHg and/or DBP ≥ 90 mmHg on two occasions and/or the current usage of antihypertensive drug treatment | Multifactor dimensionality reduction (MDR) model | The best MDR model testing accuracy = 0.6331, cross-validation consistency = 10 |
| Zhao et al. [ | China/Asians | 1009 hypertensive patients and 756 normotensive controls | The best MDR model included rs5804 and BMI | Hypertension | Mean SBP of at least 140 mmHg or a mean DBP of at least 90 mmHg or the current intake of antihypertensive drugs | Multifactor dimensionality reduction (MDR) model | The best MDR model: testing accuracy of 0.7309 and a maximum cross-validation consistency of 10 (P < 0.001) |
ICD, international classification of diseases
Fig 5Forest plot of machine regression-based models with 95% prediction interval.