| Literature DB >> 30379873 |
Bo Chen1, Jie Yu2, Xiu-E Gao3, Qing-Guo Zheng2.
Abstract
RESEARCH: The body composition model is closely related to the physiological characteristics of the human body. At the same time there can be a large number of physiological characteristics, many of which may be redundant or irrelevant. In existing human physiological feature selection algorithms, it is difficult to overcome the impact that redundancy and irrelevancy may have on human body composition modeling. This suggests a role for selection algorithms, where human physiological characteristics are identified using a combination of filtering and improved clustering. To do this, a feature filtering method based on Hilbert-Schmidt dependency criteria is first of all used to eliminate irrelevant features. After this, it is possible to use improved Chameleon clustering to increase the combination of sub-clusters amongst the characteristics, thereby removing any redundant features to obtain a candidate feature set for human body composition modeling. Method We report here on the use of an algorithm to filter the characteristic parameters in INBODY770 (this paper used INBODY 770 as body composition analyzer.) measurement data, which has three commonly-used impedance bands (1 kHZ, 250 kHZ, 500 kHZ). This algorithm is able to filter out parameters that have a low correlation with body composition BFM. The algorithm is also able to draw upon improved clustering techniques to reduce the initial feature set from 29 parameters to 10 parameters for any parameters of the 250 kHZ band that remain after filtering. In addition, we also examined the impact of different sample sizes on feature selection. RESULT: The proposed algorithm is able to remove irrelevant and redundant features and the resulting correlation between the model and the body composition (BFM which is a whole body fat evaluation can better assess the body's overall fat and muscle composition.) is 0.978, thereby providing an improved model for prediction with a relative error of less than 0.12.Entities:
Mesh:
Year: 2018 PMID: 30379873 PMCID: PMC6209155 DOI: 10.1371/journal.pone.0204816
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Outline view of the Chameleon clustering algorithm.
Fig 2Schematic diagram of the improved Chameleon algorithm.
Fig 3Body feature parameter selection process.
Fig 4The correlation between the characteristic parameters calculated by the filtering algorithm within the 1 kHZ band.
Fig 5The correlation between the characteristic parameters calculated by the filtering algorithm within the 250 kHZ band.
Fig 6The correlation between the characteristic parameters calculated by the filtering algorithm within the 500 kHZ band.
Features after running the filter algorithm in different frequency bands.
| Frequency Bands | Category | Primitive Feature Set | Filtered Characteristics | Feature Set |
|---|---|---|---|---|
| 1 kHZ | Body Fat (BFM) | 29 | 7 | |
| 250 kHZ | Body Fat (BFM) | 29 | 14 | |
| 500 kHZ | Body Fat (BFM) | 29 | 6 |
Fig 7Analysis of the number of clustered parameters after using the filter algorithm.
The legend for Fig 7 is” TBW, BFM, FFM”.
Fig 8Distance between the characteristic parameters and the BFM index when the number of samples is split into four categories.
The characteristics of the parameters after using the filtering and improved clustering algorithms.
| Frequency Band | Category | The Characteristics of the Algorithm X | Feature Set |
|---|---|---|---|
| 250 KHZ | Body Fat (BFM) | 10 | A,H,W,R5,R1R2,R2R3,R4R5,1/R5, R22, R52 |
Comparison of optimal feature sets and complexity.
| Number | Algorithm Used | Characteristic Clustering X | Time Consumption (s) | Feature Set |
|---|---|---|---|---|
| 1 | mRMR | 10 | 3.2 | W,S, A, R3, 1/R2, 1/R1, 1/R3, R42, R4R5, R52 |
| 2 | Filter & Wrapper | 9 | 2.9 | 1/R3,W,S, R22, R42, R4R5, R52, 1/R1, R5 |
| 3 | This Paper | 10 | 2.8 | A,H,W,R5,R1R2,R2R3,R4R5,1/R5, R22, R52 |
Model summary.
| Model | R | R Side | Adjusted R Side | Standard Estimation Error |
|---|---|---|---|---|
| W,S, A, R3, 1/R2, 1/R1, 1/R3, R4^2, R4R5, R5^2 | 0.927 | 0.859 | 0.843 | 2.6173 |
| 1/R3,W,S, R22, R4^2, R4R5, R5^2, 1/R1, R5 | 0.906 | 0.821 | 0.803 | 2.9340 |
| A,H,W,R5,R1R2,R2R3,R4R5,1/R5, R2^2, R5^2 | 0.978 | 0.957 | 0.953 | 1.4399 |
a. Predictor: (constant), W,S, A, R3, 1/R2, 1/R1, 1/R3, R4^2, R4R5, R5^2
b. Predictor: (constant), 1/R3,W,S, R2^2, R4^2, R4R5, R5^2, 1/R1, R5
c. Predictor: (constant), A,H,W,R5,R1R2,R2R3,R4R5,1/R5, R2^2, R5^2
Fig 9Comparison of predicted and actual values for the BFM model.
The legend for Fig 9 is “predicted and actual values, model 1, model 2, model 3”.
Fig 10Comparison of the relative error for the predicted values of the BFM model.
The legend for Fig 10 is “model 1, model 2, model 3”.