| Literature DB >> 23573116 |
Bum Ju Lee1, Boncho Ku, Jun-Su Jang, Jong Yeol Kim.
Abstract
Obesity is a serious public health problem because of the risk factors for diseases and psychological problems. The focus of this study is to diagnose the patient BMI (body mass index) status without weight and height measurements for the use in future clinical applications. In this paper, we first propose a method for classifying the normal and the overweight using only speech signals. Also, we perform a statistical analysis of the features from speech signals. Based on 1830 subjects, the accuracy and AUC (area under the ROC curve) of age- and gender-specific classifications ranged from 60.4 to 73.8% and from 0.628 to 0.738, respectively. We identified several features that were significantly different between normal and overweight subjects (P < 0.05). Also, we found compact and discriminatory feature subsets for building models for diagnosing normal or overweight individuals through wrapper-based feature subset selection. Our results showed that predicting BMI status is possible using a combination of speech features, even though significant features are rare and weak in age- and gender-specific groups and that the classification accuracy with feature selection was higher than that without feature selection. Our method has the potential to be used in future clinical applications such as automatic BMI diagnosis in telemedicine or remote healthcare.Entities:
Year: 2013 PMID: 23573116 PMCID: PMC3612486 DOI: 10.1155/2013/150265
Source DB: PubMed Journal: Evid Based Complement Alternat Med ISSN: 1741-427X Impact factor: 2.629
All features used in this study and brief descriptions.
| Feature | Brief description | Feature | Brief description |
|---|---|---|---|
| aF0 | Basic pitch of A | oPPQ | Smoothing value around JITA of O |
| aJITA | Mean ratio of change in pitch period of A | oF60_120_240_480 | (energy of 60~120 Hz)/(energy of 240~480 Hz) of O |
| aJITT | Percentage of JITA value of A | oF240_480_960_1920 | (energy of 240~480 Hz)/(energy of 960~1920 Hz) of O |
| aPPQ | Smoothing value around JITA of A | oF60_120_oF960_1920 | (energy of 60~120 Hz)/(energy of 960~1920 Hz) of O |
| aF60_120_F240_480 | (energy of 60~120 Hz)/(energy of 240~480 Hz) of A | oF1 | Formant of first in 4 frequency periods of O |
| aF240_480_960_1920 | (energy of 240~480 Hz)/(energy of 960~1920 Hz) of A | oF2 | Formant of second in 4 frequency periods of O |
| aF60_120_960_1920 | (energy of 60~120 Hz)/(energy of 960~1920 Hz) of A | oF2_F1 | Difference of frequencies (oF2-F1) |
| aF1 | Formant of first in 4 frequency periods of A | uF0 | Basic pitch of U |
| aF2 | Formant of second in 4 frequency periods of A | uJITA | Mean ratio of change in pitch period of U |
| aF2_F1 | aF2/F1 | uJITT | Percentage of JITA value of U |
| eF0 | Basic pitch of E | uPPQ | Smoothing value around JITA of U |
| eJITA | Mean ratio of change in pitch period of E | uF60_120_240_480 | (energy of 60~120 Hz)/(energy of 240~480 Hz) of U |
| eJITT | Percentage of JITA value of E | uF240_480_960_1920 | (energy of 240~480 Hz)/(energy of 960~1920 Hz) of U |
| ePPQ | Smoothing value around JITA of E | uF60_120_960_1920 | (energy of 60~120 Hz)/(energy of 960~1920 Hz) of U |
| eF60_120_240_480 | (energy of 60~120 Hz)/(energy of 240~480 Hz) of E | uF1 | Formant of first in 4 frequency periods of U |
| eF240_480_960_1920 | (energy of 240~480 Hz)/(energy of 960~1920 Hz) of E | uF2 | Formant of second in 4 frequency periods of U |
| eF60_120_960_1920 | (energy of 60~120 Hz)/(energy of 960~1920 Hz) of E | uF2_F1 | Difference of frequencies (uF2-F1) |
| eF1 | Formant of first in 4 frequency periods of E | iF0_aF0 | Difference of frequencies (iF0-aF0) |
| eF2 | Formant of second in 4 frequency periods of E | uF0_oF0 | Difference of frequencies (uF0-oF0) |
| eF2_F1 | Difference of frequencies (eF2-F1) | aMFCC4 | The terms of Mel frequency cepstral coefficients of A |
| iF0 | Basic pitch of I | eMFCC4 | The terms of Mel frequency cepstral coefficients of E |
| iJITA | Mean ratio of change in pitch period of I | iMFCC4 | The terms of Mel frequency cepstral coefficients of I |
| iJITT | Percentage of JITA value of I | oMFCC4 | The terms of Mel frequency cepstral coefficients of O |
| iPPQ | Smoothing value around JITA of I | uMFCC4 | The terms of Mel frequency cepstral coefficients of U |
| iF60_120_240_480 | (energy of 60~120 Hz)/(energy of 240~480 Hz) of I | CORR | Pearson correlation coefficients between F0 and intensity |
| iF240_480_960_1920 | (energy of 240~480 Hz)/(energy of 960~1920 Hz) of I | P50 | 50th percentile of F0 |
| iF60_120_960_1920 | (energy of 60~120 Hz)/(energy of 960~1920 Hz) of I | I50 | 50th percentile of intensity |
| iF1 | Formant of first in 4 frequency periods of I | SF0 | Mean pitch of sentence |
| iF2 | Formant of second in 4 frequency periods of I | SSTD | Standard deviation of mean pitch of sentence |
| iF2_F1 | Difference of frequencies (iF2-F1) | SITS | Intensity average |
| oF0 | Basic pitch of O | SISTD | Standard deviation of intensity |
| oJITA | Mean ratio of change in pitch period of O | SSPD | Time to read one sentence |
| oJITT | Percentage of JITA value of O |
|
|
Figure 1Sample of speech signal recording of 5 vowels and one sentence ((a): signals of 5 vowels and one sentence and (b): detailed signal of one vowel to demonstrate the difference between noise and signal).
Mean and standard deviation of age and BMI by each group.
| Female: 20–30 | Female: 40–50 | Female: 60 | Male: 20–30 | Male: 40–50 | Male: 60 | |
|---|---|---|---|---|---|---|
| Age | 28.22 ± 6.326 | 48.7 ± 5.555 | 67.14 ± 5.254 | 27.34 ± 5.433 | 49.24 ± 5.257 | 66.75 ± 4.995 |
| BMI | 21.76 ± 2.489 | 23.76 ± 3.048 | 24.96 ± 3.042 | 23.71 ± 2.971 | 24.67 ± 3.090 | 23.59 ± 2.3 |
Specific performance results (with feature selection, N: number of subjects of each class).
| Model (group) | Class |
| Sensitivity | False positive rate (1-specificity) | Precision |
|
|---|---|---|---|---|---|---|
| Female: 20–30 | Normal | 364 | 0.926 | 0.774 | 0.766 | 0.838 |
| Overweight | 133 | 0.226 | 0.074 | 0.526 | 0.316 | |
| Female: 40–50 | Normal | 201 | 0.512 | 0.311 | 0.575 | 0.542 |
| Overweight | 244 | 0.689 | 0.488 | 0.632 | 0.659 | |
| Female: 60 | Normal | 41 | 0.366 | 0.173 | 0.455 | 0.405 |
| Overweight | 104 | 0.827 | 0.634 | 0.768 | 0.796 | |
| Male: 20–30 | Normal | 175 | 0.52 | 0.325 | 0.576 | 0.547 |
| Overweight | 206 | 0.675 | 0.48 | 0.623 | 0.648 | |
| Male: 40–50 | Normal | 77 | 0.377 | 0.14 | 0.537 | 0.443 |
| Overweight | 179 | 0.86 | 0.623 | 0.762 | 0.808 | |
| Male: 60 | Normal | 35 | 0.429 | 0.239 | 0.469 | 0.448 |
| Overweight | 71 | 0.761 | 0.571 | 0.73 | 0.745 |
Selected features by feature selection in each group (N: number of selected features).
| Model (group) |
| Selected features |
|---|---|---|
| Female: 20–30 | 25 | aJITT, aPPQ, aF60_120_F240_480, aF240_480_960_1960, aF60_120_960_1960, aF1, eF0, eJITA, ePPQ, eF240_480_960_1960, eF2, iPPQ, iF60_120_960_1960, oF0, oJITT, oF1, oF2, uF0, uJITT, aMFCC4, eMFCC4, oMFCC4, uMFCC4, SF0, SITS |
| Female: 40–50 | 29 | aF0, aJITA, aJITT, aF240_480_960_1960, aF2, eF0, eJITT, ePPQ, eF2_F1, iJITA, iPPQ, iF60_120_240_480, iF240_480_960_1960, iF60_120_960_1960, oF0, oF240_480_960_1960, oF1, oF2, uF0, uPPQ, uF60_120_960_1960, uF1, uF2, uF2_F1, aMFCC4, uMFCC4, CORR, I50, SISTD |
| Female: 60 | 22 | aJITA, aJITT, aF60_120_F240_480, aF240_480_960_1960, eJITT, ePPQ, eF240_480_960_1960, eF2_F1, iF60_120_240_480, iF240_480_960_1960, iF60_120_960_1960, iF2, oF0, oJITT, oF2, oF2_F1, uF0, uJITA, uF60_120_240_480, uF60_120_960_1960, uMFCC4, SISTD |
| Male: 20–30 | 8 | aJITA, aPPQ, eF2, iF1, oJITT, uPPQ, eMFCC4, uMFCC4 |
| Male: 40–50 | 24 | aF0, eF0, eJITA, eJITT, eF60_120_960_1960, eF1, eF2, eF2_F1, iF0, iJITA, iPPQ, iF60_120_240_480, iF240_480_960_1960, iF2, oJITA, oJITT, oPPQ, oF60_120_oF960_1960, oF1, oF2, uF0, uF60_120_960_1960, eMFCC4, SF0 |
| Male: 60 | 23 | aJITT, aF60_120_F240_480, aF60_120_960_1960, aF1, eF240_480_960_1960, eF1, eF2_F1, iJITA, iPPQ, iF240_480_960_1960, iF60_120_960_1960, iF1, iF2, iF2_F1, oF60_120_240_480, uF2, uF0_oF0, oMFCC4, P50, I50, SSTD, SITS, SSPD |
Figure 2Accuracy comparison of experiment results between full-feature set and FS-feature set in 6 groups.
Figure 3AUC comparison of experiment results between full-feature set and FS-feature set in 6 groups.
Confusion matrix (also called contingency table or error matrix) of 6 models according to age and gender in classification experiments with feature selection.
| Classification modela | |||||
|---|---|---|---|---|---|
| Group | Actual | Predicted | Subjectsb | ||
| Overweight | Normal | Overweight | Normal | ||
| Female: 20–30 | Overweight | 30 | 103 | 133 | 364 |
| Normal | 27 | 337 | |||
| Female: 40–50 | Overweight | 168 | 76 | 244 | 201 |
| Normal | 98 | 103 | |||
| Female: 60 | Overweight | 86 | 18 | 104 | 41 |
| Normal | 26 | 15 | |||
| Male: 20–30 | Overweight | 139 | 67 | 206 | 175 |
| Normal | 84 | 91 | |||
| Male: 40–50 | Overweight | 154 | 25 | 179 | 77 |
| Normal | 48 | 29 | |||
| Male: 60 | Overweight | 54 | 17 | 71 | 35 |
| Normal | 20 | 15 | |||
aResults of confusion matrix by classification model; bnumber of subjects of each class (overweight and normal) in original data.
Statistical analysis results by independent two sample t-test and Benjamin-Hochberg's method.
| Group | Feature | Class | Mean | Std. |
|
| Adj. |
|---|---|---|---|---|---|---|---|
| Female: 20–30 | aF60_120_F240_480 | Normal | 0.834 | 0.390 | 3.474 | <0.001 | 0.005 |
| Overweight | 0.699 | 0.365 | |||||
| aF240_480_960_1960 | Normal | 2.285 | 0.818 | 3.510 | <0.001 | 0.005 | |
| Overweight | 1.996 | 0.806 | |||||
| aF60_120_960_1960 | Normal | 2.135 | 1.416 | 3.618 | <0.001 | 0.005 | |
| Overweight | 1.631 | 1.248 | |||||
| eF240_480_960_1960 | Normal | 3.033 | 0.627 | 3.342 | <0.001 | <0.01 | |
| Overweight | 2.818 | 0.660 | |||||
| eMFCC4 | Normal | 1.277 | 6.836 | 2.581 | <0.05 | <0.05 | |
| Overweight | −0.801 | 8.315 | |||||
| oMFCC4 | Normal | −4.087 | 5.624 | 2.757 | <0.01 | <0.05 | |
| Overweight | −5.989 | 7.191 | |||||
| SITS | Normal | 56.14 | 7.515 | 3.106 | <0.005 | 0.01 | |
| Overweight | 53.73 | 8.074 | |||||
|
| |||||||
| Male: 20–30 | eMFCC4 | Normal | 5.057 | 6.678 | 3.393 | <0.001 | <0.01 |
| Overweight | 2.679 | 6.929 | |||||
P value < 0.05 was considered statistically significant. The P values were adjusted using the Benjamin-Hochberg method to control the false discovery rate. Only statistically significant features among all features selected by wrapper-based feature subset selection in each group are described in this table (Std: standard deviation, Adj: adjusted).