| Literature DB >> 26600199 |
Huiling Chen1,2, Bo Yang3,2, Dayou Liu3,2, Wenbin Liu1, Yanlong Liu4, Xiuhua Zhang5, Lufeng Hu5.
Abstract
The number of the overweight people continues to rise across the world. Studies have shown that being overweight can increase health risks, such as high blood pressure, diabetes mellitus, coronary heart disease, and certain forms of cancer. Therefore, identifying the overweight status in people is critical to prevent and decrease health risks. This study explores a new technique that uses blood and biochemical measurements to recognize the overweight condition. A new machine learning technique, an extreme learning machine, was developed to accurately detect the overweight status from a pool of 225 overweight and 251 healthy subjects. The group included 179 males and 297 females. The detection method was rigorously evaluated against the real-life dataset for accuracy, sensitivity, specificity, and AUC (area under the receiver operating characteristic (ROC) curve) criterion. Additionally, the feature selection was investigated to identify correlating factors for the overweight status. The results demonstrate that there are significant differences in blood and biochemical indexes between healthy and overweight people (p-value < 0.01). According to the feature selection, the most important correlated indexes are creatinine, hemoglobin, hematokrit, uric Acid, red blood cells, high density lipoprotein, alanine transaminase, triglyceride, and γ-glutamyl transpeptidase. These are consistent with the results of Spearman test analysis. The proposed method holds promise as a new, accurate method for identifying the overweight status in subjects.Entities:
Mesh:
Year: 2015 PMID: 26600199 PMCID: PMC4658146 DOI: 10.1371/journal.pone.0143003
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
All of the features used in this study and their brief descriptions.
| Feature | Brief description | Feature | Brief description |
|---|---|---|---|
|
| Age |
| absolute value of monocyte (AVM) |
|
| triglyceride (TG) |
| red blood cell (RBC) |
|
| glucose (GLU) |
| red blood cell in urine (RBCU) |
|
| low density lipoprotein (LDL) |
| hematokrit (HCT) |
|
| high density lipoprotein (HDL) |
| percentage of leukomonocyte (PLC) |
|
| total cholesterol (CHO) |
| absolute value of leukomonocyte (AVLC) |
|
| alanine transaminase (ALT) |
| mean corpuscular volume (MCV) |
|
| aspartate aminotransferase (AST) |
| mean corpuscular hemoglobin (MCH) |
|
| γ-glutamyl transpeptidase (γ-GT) |
| mean corpuscular hemoglobin concentration (MCHC) |
|
| total protein (TP) |
| mean platelet volume (MPL) |
|
| albumin (ALB) |
| absolute value of eosinophils (AVE) |
|
| creatinine (CR) |
| percentage of eosinophils (PE) |
|
| urea nitrogen (BUN) |
| hemoglobin (HB) |
|
| alkaline phosphatase (AKP) |
| blood platelet (PLT) |
|
| total bilirubin (TBIL) |
| thrombocytocrit (THR) |
|
| direct bilirubin (DBIL) |
| percentage of neutrophils (PN) |
|
| uric acid (UA) |
| absolute value of neutrophils (AVN) |
|
| white blood cell in urine (WBCU) |
| red cell volume distribution width (RBCVD) |
|
| percentage of monocyte (PMC) |
Characteristics of blood and biochemical parameters in healthy and overweight people.
| Index | Overweight (n = 225) | Healthy (n = 251) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Mean | SD | Min | Max | Mean | SD | Min | Max | p-value | |
| AGE | 44.73 | 10.41 | 22.00 | 76.00 | 42.90 | 10.41 | 24 | 82 | 0.057 |
| TG | 2.20 | 1.80 | 0.46 | 14.17 | 0.98 | 0.64 | 0.23 | 5.73 | 0.000 |
| GLU | 5.92 | 1.24 | 4.50 | 17.00 | 5.44 | 0.88 | 4.20 | 15.20 | 0.000 |
| LDL | 2.87 | 0.71 | 1.28 | 5.85 | 2.57 | 0.76 | 0.76 | 5.82 | 0.000 |
| HDL | 1.23 | 0.28 | 0.69 | 2.21 | 1.61 | 0.32 | 0.88 | 2.72 | 0.000 |
| CHO | 5.00 | 0.91 | 2.98 | 9.97 | 4.69 | 0.90 | 2.54 | 8.25 | 0.000 |
| ALT | 35.66 | 23.31 | 4.00 | 137.00 | 16.46 | 10.01 | 2.00 | 69.00 | 0.000 |
| AST | 25.13 | 9.84 | 10.00 | 83.00 | 19.37 | 5.91 | 10.00 | 49.00 | 0.000 |
| γ-GT | 56.95 | 65.98 | 10.00 | 564.00 | 15.95 | 10.57 | 6.00 | 121.00 | 0.000 |
| TP | 76.65 | 3.50 | 66.00 | 86.60 | 76.14 | 3.80 | 66.20 | 85.50 | 0.130 |
| ALB | 47.69 | 2.66 | 38.80 | 54.50 | 46.48 | 2.38 | 40.80 | 53.30 | 0.000 |
| CR | 72.46 | 13.41 | 41.00 | 107.00 | 51.92 | 7.01 | 34.00 | 78.00 | 0.000 |
| BUN | 5.28 | 1.22 | 2.80 | 9.10 | 4.73 | 1.17 | 2.30 | 8.50 | 0.000 |
| AKP | 82.71 | 20.65 | 32.00 | 170.00 | 65.35 | 19.91 | 23.00 | 185.00 | 0.000 |
| TBIL | 11.82 | 4.13 | 5.00 | 27.00 | 9.82 | 3.20 | 5.00 | 22.00 | 0.000 |
| DBIL | 3.76 | 1.45 | 2.00 | 11.00 | 3.10 | 1.06 | 2.00 | 7.00 | 0.000 |
| UA | 381.48 | 83.29 | 98.00 | 601.00 | 267.31 | 50.55 | 128.00 | 403.00 | 0.000 |
| WBCU | 11.27 | 25.04 | 0.00 | 203.10 | 31.67 | 56.41 | 0.30 | 474.50 | 0.000 |
| PMC | 0.07 | 0.02 | 0.02 | 0.14 | 0.07 | 0.02 | 0.03 | 0.14 | 0.005 |
| AVM | 0.50 | 0.17 | 0.00 | 1.30 | 0.39 | 0.12 | 0.20 | 0.80 | 0.000 |
| RBC | 5.04 | 0.48 | 3.75 | 6.87 | 4.46 | 0.38 | 3.45 | 6.03 | 0.000 |
| RBCU | 54.01 | 473.50 | 0.00 | 6977.40 | 42.96 | 116.47 | 1.20 | 1500.00 | 0.721 |
| HCT | 0.46 | 0.04 | 0.31 | 0.53 | 0.41 | 0.03 | 0.29 | 0.46 | 0.000 |
| PLC | 0.37 | 0.07 | 0.15 | 0.59 | 0.37 | 0.08 | 0.11 | 0.60 | 0.781 |
| AVLC | 2.47 | 0.70 | 0.00 | 5.40 | 2.11 | 0.57 | 0.90 | 4.60 | 0.000 |
| MCV | 91.04 | 6.23 | 62.20 | 108.20 | 91.22 | 6.45 | 56.50 | 104.30 | 0.757 |
| MCH | 30.17 | 2.53 | 18.90 | 34.80 | 29.40 | 2.48 | 15.40 | 33.10 | 0.001 |
| MCHC | 331.01 | 11.65 | 268.00 | 357.00 | 321.98 | 9.66 | 272.00 | 345.00 | 0.000 |
| MPL | 10.83 | 1.91 | 0.00 | 14.10 | 10.71 | 2.17 | 0.00 | 14.20 | 0.521 |
| AVE | 0.17 | 0.12 | 0.00 | 0.60 | 0.13 | 0.10 | 0.00 | 0.90 | 0.000 |
| PE | 0.02 | 0.02 | 0.00 | 0.10 | 0.02 | 0.01 | 0.00 | 0.11 | 0.011 |
| HB | 151.59 | 15.03 | 83.00 | 179.00 | 130.51 | 9.63 | 78.00 | 149.00 | 0.000 |
| PLT | 223.42 | 49.68 | 102.00 | 437.00 | 226.57 | 51.96 | 85.00 | 451.00 | 0.500 |
| THR | 0.24 | 0.06 | 0.00 | 0.45 | 0.24 | 0.07 | 0.00 | 0.42 | 0.772 |
| PN | 0.53 | 0.08 | 0.34 | 0.78 | 0.54 | 0.08 | 0.30 | 0.84 | 0.403 |
| AVN | 3.67 | 1.25 | 0.00 | 9.60 | 3.16 | 1.04 | 1.30 | 7.30 | 0.000 |
| RBCVD | 12.85 | 0.98 | 11.40 | 19.30 | 12.90 | 1.10 | 11.60 | 20.90 | 0.536 |
Fig 1Overall procedure of the proposed method.
Fig 2The relationship between ELM with different activation functions and the different number of hidden neurons.
Fig 3ACC, AUC, sensitivity, and specificity versus different hidden neuron numbers for ELM via 10-fold CV.
Average classification performance results of 10-fold CV.
| Different number of hidden neurons | Average classification performance | |||
|---|---|---|---|---|
| ACC (%) | AUC (%) | Sensitivity (%) | Specificity (%) | |
| 5 | 76.69 | 76.28 | 68.85 | 83.71 |
| 20 | 88.44 | 88.02 | 80.81 | 95.23 |
| 35 |
|
|
|
|
| 50 | 88.87 | 88.56 | 82.71 | 94.42 |
| 65 | 87.61 | 87.28 | 81.74 | 92.83 |
| 80 | 87.39 | 87.10 | 82.15 | 92.05 |
| 95 | 87.80 | 87.54 | 82.25 | 92.83 |
The best results have been shown in bold.
The detailed results obtained by the ELM model.
| Fold | ACC (%) | AUC (%) | Sensitivity (%) | Specificity (%) |
|---|---|---|---|---|
| 1# | 89.36 | 88.91 | 81.82 | 96.00 |
| 2# | 91.67 | 91.48 | 86.96 | 96.00 |
| 3# | 87.50 | 87.30 | 82.61 | 92.00 |
| 4# | 91.67 | 91.48 | 86.96 | 96.00 |
| 5# | 91.67 | 91.30 | 82.61 | 100.00 |
| 6# | 93.75 | 93.65 | 91.30 | 96.00 |
| 7# | 91.67 | 91.26 | 86.36 | 96.15 |
| 8# | 87.23 | 86.36 | 72.73 | 100.00 |
| 9# | 89.36 | 89.18 | 86.36 | 92.00 |
| 10# | 89.36 | 88.91 | 81.82 | 96.00 |
| Avg. | 90.32 | 89.98 | 83.95 | 96.02 |
| Dev. | 2.09 | 2.23 | 4.97 | 2.67 |
Fig 4Comparison results of classification performance among ELM, SVM, and BPNN.
The confusion matrix obtained by ELM, SVM and BPNN via 10-fold CV.
| ELM | Predicted overweight people | Predicted healthy controls |
|---|---|---|
| Overweight people | 189 | 36 |
| Healthy controls | 10 | 241 |
|
| ||
| Overweight people | 184 | 41 |
| Healthy controls | 7 | 244 |
|
| ||
| Overweight people | 188 | 37 |
| Healthy controls | 40 | 211 |
Fig 5The importance of each index obtained by the Fisher Score feature selection.
Fig 6The incremental feature selection curve: the classification accuracy against the feature subset.
Fig 7Comparison results of classification performance between ELM with and without feature selection.
The confusion matrix obtained by ELM with and without feature selection.
| ELM with the whole feature set | Predicted overweight people | Predicted healthy controls |
|---|---|---|
| Overweight people | 189 | 36 |
| Healthy controls | 10 | 241 |
|
| ||
| Overweight people | 188 | 37 |
| Healthy controls | 8 | 243 |