| Literature DB >> 22529874 |
Keun Ho Kim1, Boncho Ku, Namsik Kang, Young-Su Kim, Jun-Su Jang, Jong Yeol Kim.
Abstract
The voice has been used to classify the four constitution types, and to recognize a subject's health condition by extracting meaningful physical quantities, in traditional Korean medicine. In this paper, we propose a method of selecting the reliable variables from various voice features, such as frequency derivative features, frequency band ratios, and intensity, from vowels and a sentence. Further, we suggest a process to extract independent variables by eliminating explanatory variables and reducing their correlation and remove outlying data to enable reliable discriminant analysis. Moreover, the suitable division of data for analysis, according to the gender and age of subjects, is discussed. Finally, the vocal features are applied to a discriminant analysis to classify each constitution type. This method of voice classification can be widely used in the u-Healthcare system of personalized medicine and for improving diagnostic accuracy.Entities:
Year: 2012 PMID: 22529874 PMCID: PMC3313581 DOI: 10.1155/2012/831543
Source DB: PubMed Journal: Evid Based Complement Alternat Med ISSN: 1741-427X Impact factor: 2.629
Figure 1The vocal feature extraction program.
Figure 2Vocal features of 5 vowels.
Figure 3Vocal features of a sentence.
Figure 4Selection of vocal features.
Figure 5Transformation of variable (feature) distribution.
Figure 6The flow of the overall procedure.
Figure 7Analysis of the mean difference between voice features (the pitch and the first formant frequency of “a”) according to gender.
Figure 8Analysis of the mean of aF0 according to age.
92 features selected as having less than 20% CV in each of the six subjects.
| Features | |||
|---|---|---|---|
| aF0 | oJITT | sF10 | uF1_ln |
| aF2/aF1 | oF2/oF1 | sF50 | aF2_ln |
| aF3/aF1 | oF3/oF1 | sF90 | eF2_ln |
| aF4/aF1 | oF4/oF1 | sFHL | iF2_ln |
| aF3/aF2 | oF3/oF2 | sI10 | oF2_ln |
| aF4/aF2 | oF4/oF2 | sI50 | uF2_ln |
| aF4/aF3 | oF4/oF3 | sI90 | aF3_ln |
| eF0 | uT0 | sIHL | eF3_ln |
| eF2/eF1 | uPPQ | sF0 | iF3_ln |
| eF3/eF1 | uF2/uF1 | sFSTD | oF3_ln |
| eF4/eF1 | uF3/uF1 | sI0 | uF3_ln |
| eF3/eF2 | uF4/uF1 | sISTD | aF4_ln |
| eF4/eF2 | uF3/uF2 | iFB240_480/iFB960_1920 | eF4_ln |
| eF4/eF3 | uF4/uF2 | oFB240_480/oFB960_1920 | iF4_ln |
| iF0 | uF4/uF3 | uFB240_480/uFB960_1920 | oF4_ln |
| iT0 | aMFCC13 | sFB60_120/sFB240_480 | uF4_ln |
| iF2/iF1 | eMFCC2 | sFB240_480/sFB960_1920 | eT0_ln |
| iF3/iF1 | eMFCC13 | sFB60_120/sFB960_1920 | sSPD_ln |
| iF4/iF1 | iMFCC2 | uF0_ln | aJITA_sqrt |
| iF3/iF2 | iMFCC13 | aF1_ln | eJITA_sqrt |
| iF4/iF3 | oMFCC6 | eF1_ln | iJITA_sqrt |
| oF0 | oMFCC13 | iF1_ln | oJITA_sqrt |
| oT0 | uMFCC13 | oF1_ln | uJITA_sqrt |
Analytic results from the VIF calculation of vocal features.
| Variable | VIF | Variable | VIF |
|---|---|---|---|
| uF0_ln | 4.94 | oF4_ln | 2.39 |
| aF1_ln | 3.12 | uF4_ln | 2.13 |
| eF1_ln | 2.96 | eT0_ln | 5.68 |
| iF1_ln | 1.87 | eMFCC13 | 3.77 |
| oF1_ln | 3.30 | eMFCC2 | 6.38 |
| uF1_ln | 2.40 | iMFCC2 | 6.26 |
| aF2_ln | 2.52 | oMFCC6 | 1.88 |
| eF2_ln | 2.86 | sI0 | 3.52 |
| iF2_ln | 2.05 | sSPD_ln | 1.08 |
| oF2_ln | 1.96 | aFB60_120/aFB240_480 | 2.52 |
| uF2_ln | 2.44 | iFB240_480/iFB960_1920 | 2.42 |
| aF3_ln | 1.95 | oFB240_480/oFB960_1920 | 4.12 |
| eF3_ln | 4.04 | uFB240_480/uFB960_1920 | 3.38 |
| iF3_ln | 2.29 | sFB240_480/sFB960_1920 | 2.33 |
| oF3_ln | 2.05 | aJITA_sqrt | 2.81 |
| uF3_ln | 2.07 | eJITA_sqrt | 2.95 |
| aF4_ln | 2.55 | iJITA_sqrt | 1.76 |
| eF4_ln | 3.41 | oJITA_sqrt | 1.34 |
| iF4_ln | 2.71 | uJITA_sqrt | 1.23 |
Distribution of datasets after removal of outliers.
| TE | SE | SY | Total | |
|---|---|---|---|---|
| Male | 322 | 181 | 249 | 752 |
|
| ||||
| Female | 409 | 330 | 432 | 1171 |
|
| ||||
| 731 | 511 | 681 | 1923 | |
The coefficients of the linear discriminant function for subjects in their 20 s.
| Male | SC | Female | SC | ||||
| TE | SE | SY | TE | SE | SY | ||
|
| |||||||
| uF1_ln | 43.40 | 44.45 | 42.78 | aF1_ln | 42.20 | 43.02 | 41.15 |
| aF2_ln | 811.02 | 813.80 | 806.72 | aF2_ln | 233.56 | 236.08 | 234.27 |
| iF2_ln | 454.78 | 446.43 | 451.97 | iF2_ln | −119.84 | −121.15 | −120.32 |
| eF3_ln | 1263.15 | 1272.38 | 1270.05 | oF3_ln | 601.51 | 600.09 | 604.50 |
| eF4_ln | 2682.67 | 2691.38 | 2690.17 | uF3_ln | 921.89 | 928.00 | 925.54 |
|
|
|
| iF4_ln | 1700.96 | 1707.70 | 1703.23 | |
|
|
|
| uF4_ln | 2233.97 | 2226.67 | 2230.50 | |
|
|
|
| eT0_ln | 300.49 | 297.49 | 301.75 | |
|
|
|
| oMFCC6 | −3.19 | −3.16 | −3.15 | |
| aFB60_120/aFB240_480 | −40.32 | −39.39 | −40.48 | iFB240_480/iFB960_1920 | 163.16 | 161.13 | 161.99 |
| oFB240_480/oFB960_1920 | 22.25 | 21.12 | 21.06 | sFB240_480/sFB960_1920 | −76.00 | −74.78 | −75.67 |
|
|
|
| oJITA_sqrt | 4.24 | 3.32 | 2.74 | |
| (Constant) | −20662.87 | −20765.44 | −20718.69 | (Constant) | −23380.96 | −23416.62 | −23413.53 |
The discriminant results for subjects in their 20 s.
|
| Male | Female | ||||||||
| FC | Prediction | Total | Accuracy | Prediction | Total | Accuracy | ||||
| TE | SE | SY | TE | SE | SY | |||||
|
| ||||||||||
| TE | 61 | 25 | 31 | 117 | 0.52 | 65 | 31 | 23 | 119 | 0.55 |
| SE | 14 | 38 | 20 | 72 | 0.53 | 19 | 57 | 23 | 99 | 0.58 |
| SY | 25 | 21 | 36 | 82 | 0.44 | 32 | 35 | 64 | 131 | 0.49 |
|
| ||||||||||
| Average |
|
|
|
| 0.5 |
|
|
|
| 0.53 |
The coefficients of the linear discriminant function for subjects in their 30 s/40 s.
| Male | SC | Female | SC | ||||
| TE | SE | SY | TE | SE | SY | ||
|
| |||||||
| uF1_ln | 85.30 | 85.85 | 83.25 | aF1_ln | 1.43 | 2.31 | 3.13 |
| eF2_ln | 894.33 | 901.15 | 892.94 | uF1_ln | 202.00 | 200.27 | 201.95 |
| uF3_ln | 477.98 | 476.34 | 474.32 | aF2_ln | 531.59 | 533.79 | 531.03 |
| eF4_ln | 1528.36 | 1523.34 | 1529.25 | uF4_ln | 4378.71 | 4383.83 | 4380.97 |
| uF4_ln | 208.76 | 209.21 | 213.63 |
|
|
|
|
|
|
|
|
| sI0 | 4.11 | 4.15 | 4.10 |
| oFB240_480/oFB960_1920 | 11.09 | 10.69 | 10.39 | sFB240_480/sFB960_1920 | 69.47 | 69.80 | 70.48 |
| eJITA_sqrt | −79.09 | −77.53 | −77.82 | oJITA_sqrt | −19.49 | −20.20 | −19.70 |
| uJITA_sqrt | −9.09 | −10.98 | −10.24 |
|
|
|
|
| (Constant) | −12602.18 | −12604.26 | −12595.92 | (Constant) | −20883.77 | −20939.87 | −20912.08 |
The discriminant results for subjects in their 30 s/40 s.
|
| Male | Female | ||||||||
| FC | Prediction | Total | Accuracy | Prediction | Total | Accuracy | ||||
| TE | SE | SY | TE | SE | SY | |||||
|
| ||||||||||
| TE | 40 | 27 | 28 | 95 | 0.42 | 63 | 40 | 46 | 149 | 0.42 |
| SE | 14 | 38 | 13 | 65 | 0.58 | 32 | 71 | 38 | 141 | 0.50 |
| SY | 19 | 22 | 45 | 86 | 0.52 | 46 | 56 | 83 | 185 | 0.45 |
|
| ||||||||||
| Average |
|
|
|
| 0.50 |
|
|
|
| 0.46 |
The coefficients of the linear discriminant function in subjects over 50 years old.
| Male | SC | Female | SC | ||||
| TE | SE | SY | TE | SE | SY | ||
|
| |||||||
| oF1_ln | 206.57 | 203.55 | 206.83 | eF1_ln | 214.50 | 214.33 | 216.28 |
| aF2_ln | 726.79 | 729.70 | 729.64 | iF1_ln | 313.68 | 313.04 | 315.48 |
| oF3_ln | 1157.23 | 1155.75 | 1153.21 | iF2_ln | 167.21 | 169.08 | 166.61 |
|
|
|
|
| eF3_ln | 699.49 | 698.85 | 701.71 |
|
|
|
|
| uF3_ln | 1452.33 | 1447.92 | 1450.98 |
| oMFCC6 | −0.69 | −0.74 | −0.68 |
|
|
|
|
|
|
|
|
| sI0 | 0.46 | 0.49 | 0.49 |
| oFB240_480/oFB960_1920 | 25.36 | 24.27 | 24.66 | aFB60_120/aFB240_480 | 100.48 | 100.49 | 101.74 |
| uFB240_480/uFB960_1920 | 284.86 | 285.94 | 283.19 |
|
|
|
|
| sFB240_480/sFB960_1920 | 53.43 | 55.70 | 55.19 |
|
|
|
|
| (Constant) | −8441.34 | −8440.88 | −8428.15 | (Constant) | −10873.65 | −10844.66 | −10900.15 |
The discriminant results in subjects over 50 years old.
| Male | Female | |||||||||
| FC | Prediction | Total | Accuracy | Prediction | Total | Accuracy | ||||
| TE | SE | SY | TE | SE | SY | |||||
|
| ||||||||||
| TE | 59 | 29 | 22 | 110 | 0.54 | 56 | 43 | 42 | 141 | 0.4 |
| SE | 11 | 24 | 9 | 44 | 0.55 | 23 | 45 | 22 | 90 | 0.5 |
| SY | 22 | 22 | 37 | 81 | 0.46 | 25 | 28 | 63 | 116 | 0.54 |
|
| ||||||||||
| Average | 0.51 | 0.47 | ||||||||