| Literature DB >> 32606332 |
Zhigang Ren1,2, Benchen Rao1,2, Siqi Xie3, Ang Li1,2, Lijun Wang3, Guangying Cui1,2, Tiantian Li3, Hang Yan3, Zujiang Yu4,5, Suying Ding6.
Abstract
Hypertension is a global public health issue and leading risk for death and disability. It is urgent to search novel methods predicting hypertension. Herein, we chose 73158 samples of physical examiners in central China from June 2008 to June 2018. After strict exclusion processes, 33570 participants with hypertension and 35410 healthy controls were included. We randomly chose 70% samples as the train set and the remaining 30% as the test set. Clinical parameters including age, gender, height, weight, body mass index, triglyceride, total cholesterol, low-density lipoprotein, blood urea nitrogen, uric acid, and creatinine were significantly increased, while high-density lipoprotein was decreased in the hypertension group versus controls. Nine optimal markers were identified by a logistic regression model, and achieved AUC value of 76.52% in the train set and 75.81% in the test set for hypertension. In conclusions, this study is the first to establish predicted models for hypertension using the logistic regression model in Central China, which provide risk factors and novel prediction method to predict and prevent hypertension.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32606332 PMCID: PMC7327010 DOI: 10.1038/s41598-020-64980-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Study design and flow diagram. A total of 73158 participants were randomly enrolled. After removing 4165 participants with nulls values and 13 participants with outliers, the remaining 68980 participants, including 33570 participants and 35410 healthy controls, were used for further statistical analysis. We randomly chose 70% of the samples as the train set (n = 48404) and the remaining 30% as the test set (n = 20576) from both groups.
Clinical characteristics of the enrolled participants.
| Clinical indexes | Hypertension group (n = 33570) | Healthy controls (n = 35410) | P values |
|---|---|---|---|
| Age (year) | 51.98 ± 14.77 | 42.72 ± 12.91 | <2E-16 |
| Gender | <2E-16 | ||
| Female | 9932 (35.71%) | 17878 (64.29%) | |
| Male | 23638 (57.42%) | 17532 (42.58%) | |
| Height (cm) | 168.35 ± 8.67 | 167.46 ± 8.13 | 4.02E-44 |
| Weight (kg) | 73.28 ± 12.90 | 66.36 ± 11.98 | <2E-16 |
| BMI | 25.75 ± 3.39 | 23.56 ± 3.18 | <2E-16 |
| TG (mmol/L) | 1.79 ± 1.43 | 1.35 ± 1.01 | <2E-16 |
| TC (mmol/L) | 4.78 ± 0.95 | 4.57 ± 0.87 | 1.48E-208 |
| LDL (mmol/L) | 3.00 ± 0.83 | 2.80 ± 0.78 | 3.08E-231 |
| HDL (mmol/L) | 1.27 ± 0.35 | 1.38 ± 0.37 | 3.52E-302 |
| BUN (mmol/L) | 5.04 ± 1.32 | 4.67 ± 1.22 | 3.11E-312 |
| UA (umol/L) | 333.54 ± 87.69 | 301.77 ± 83.40 | <2E-16 |
| Cr (mmol/L) | 72.92 ± 18.50 | 68.39 ± 15.08 | 1.84E-268 |
All data were presented as mean ± standard deviation (SD). The T test was used to evaluate the differences between the sets of continuous variables. The chi-square test was used to evaluate the differences between the sets of categorical variables. BMI, body mass index; TG, triglyceride; TC, serum total cholesterol; LDL, low-density lipoprotein; HDL, high-density lipoprotein; BUN, blood urea nitrogen; UA, uric acid; Cr, creatinine.
Figure 2The difference and comparison of clinical parameters between the hypertension group (n = 33570) and healthy controls (n = 35410). The student T test was used to analyze the significant differences of clinical parameters between the hypertension group (case) and healthy controls (control). Clinical parameters included age, gender, height, weight, body mass index (BMI), triglyceride (TG), total cholesterol (TC), low-density lipoprotein (LDL), high-density lipoprotein (HDL), blood urea nitrogen (BUN), uric acid (UA), and creatinine (Cr).
Logistic regression model in the train set.
| Estimate | Std. Error | Z value | P value | |
|---|---|---|---|---|
| (Intercept) | −0.30624 | 0.325476 | −0.941 | 0.347 |
| Gender | 0.681627 | 0.034855 | 19.556 | <2E-16 |
| Age | 0.04933 | 0.00083 | 59.463 | <2E-16 |
| Height | −0.04423 | 0.002062 | −21.445 | <2E-16 |
| Weight | 0.054823 | 0.001293 | 42.4 | <2E-16 |
| TG | 0.15022 | 0.010256 | 14.648 | <2E-16 |
| LDL | 0.106544 | 0.013039 | 8.171 | 3.05E-16 |
| HDL | 0.311486 | 0.034023 | 9.155 | <2E-16 |
| UA | 0.001137 | 0.000157 | 7.225 | 5.00E-13 |
| Cr | −0.00323 | 0.000793 | −4.077 | 4.55E-05 |
The 9 optimal distinguishing markers, including gender, age, height, weight, TG, LDL, HDL, UA and Cr for hypertension were identified through a logistic regression model in the train set. TG, triglyceride; LDL, low-density lipoprotein; HDL, high-density lipoprotein; UA, uric acid; Cr, creatinine.
Figure 3Identification and validation of the predicted model for hypertension by a logistic regression model. (a) The POD index was calculated based on the selected 9 optimal markers for each sample, and the mean POD value was significantly increased in the hypertension group versus healthy controls in the train set (p < 0.001). (b) The AUC value of the POD index achieved 76.52% (95% CI: 76.11–76.94%) between the hypertension group and healthy controls in the train set. (c) The mean POD value was significantly increased in the hypertension group versus healthy controls in the test set (p < 0.001). (d) The AUC value of the POD index achieved 75.81% (95% CI: 75.16–76.46%) between the hypertension group and healthy controls in the test set.