| Literature DB >> 19745013 |
Imhoi Koo1, Jong Yeol Kim, Myoung Geun Kim, Keun Ho Kim.
Abstract
Recently, oriental medicine has received attention for providing personalized medicine through consideration of the unique nature and constitution of individual patients. With the eventual goal of globalization, the current trend in oriental medicine research is the standardization by adopting western scientific methods, which could represent a scientific revolution. The purpose of this study is to establish methods for finding statistically significant features in a facial image with respect to distinguishing constitution and to show the meaning of those features. From facial photo images, facial elements are analyzed in terms of the distance, angle and the distance ratios, for which there are 1225, 61 250 and 749 700 features, respectively. Due to the very large number of facial features, it is quite difficult to determine truly meaningful features. We suggest a process for the efficient analysis of facial features including the removal of outliers, control for missing data to guarantee data confidence and calculation of statistical significance by applying ANOVA. We show the statistical properties of selected features according to different constitutions using the nine distances, 10 angles and 10 rates of distance features that are finally established. Additionally, the Sasang constitutional meaning of the selected features is shown here.Entities:
Keywords: ANOVA; correlation; data preprocessing; hierarchical clustering; significant features
Year: 2009 PMID: 19745013 PMCID: PMC2741622 DOI: 10.1093/ecam/nep065
Source DB: PubMed Journal: Evid Based Complement Alternat Med ISSN: 1741-427X Impact factor: 2.629
Figure 1.The flowchart shows the data-mining procedure. Under the SOP, operators take a picture of a subject and fill feature points using a Microsoft Excel file. Let us calculate the quantitative measures. The next step is a data preprocess of controlling outliers and missing values. The next part of the procedure involves determining the significant features considering ANOVA and correlation.
Table of significant distance features associated with the Four Constitution types
| Index | Missing rate (%) | Mean ± SD | |||||
|---|---|---|---|---|---|---|---|
| All | TE | SY | SE | ||||
| 1 | 8.63E-04 | 3.4836 | 46.86 ± 7.74 | 48.78 ± 7.07 | 44.66 ± 7.74 | 45.76 ± 8.14 | |
| 2 | 3.24E-04 | 3.0737 | 56.84 ± 7.86 | 59.07 ± 7.48 | 54.70 ± 9.08 | 54.99 ± 5.52 | |
| 3 | 3.66E-03 | 2.8688 | 35.76 ± 6.63 | 37.37 ± 6.21 | 34.38 ± 6.95 | 34.28 ± 6.38 | |
| 4 | 5.59E-04 | 3.0737 | 58.06 ± 8.21 | 60.40 ± 7.77 | 56.16 ± 9.26 | 55.71 ± 6.28 | |
| 5 | 6.76E-04 | 3.8934 | 69.49 ± 8.83 | 71.76 ± 8.47 | 67.37 ± 10.27 | 67.55 ± 6.14 | |
| 6 | 9.61E-04 | 2.8688 | 71.05 ± 8.82 | 73.31 ± 8.44 | 69.03 ± 10.20 | 69.05 ± 6.31 | |
| 7 | 1.38E-03 | 2.8688 | 62.98 ± 8.33 | 65 ± 7.85 | 61.18 ± 9.96 | 61.17 ± 5.77 | |
| 8 | 3.27E-03 | 2.8688 | 97.11 ± 9.09 | 98.98 ± 9.18 | 95.88 ± 8.01 | 94.92 ± 9.39 | |
| 9 | 5.04E-03 | 3.6885 | 105.02 ± 8.56 | 107.10 ± 8.12 | 103.33 ± 9.26 | 102.97 ± 7.59 | |
The term d(i, j) denotes the distance from the i-th and j-th feature points. The P-value comes from the ANOVA test.
Table of significant angle features associated with the Four Constitution types
The term denotes the angle with the vertex j-th point and the endpoints i-th and k-th feature points. The P-value comes from the ANOVA test.
Table of significant rate of distances features associated with Four Constitution type
| Index | Missing rate (%) | Mean ± SD | |||||
|---|---|---|---|---|---|---|---|
| All | TE | SY | SE | ||||
| 1 | 6.80E-07 | 5.7377 | 1.116 ± 0.142 | 1.082 ± 0.136 | 1.133 ± 0.141 | 1.165 ± 0.137 | |
| 2 | 6.60E-08 | 3.8934 | 2.013 ± 0.344 | 1.923 ± 0.338 | 2.06 ± 0.349 | 2.137 ± 0.301 | |
| 3 | 2.99E-06 | 7.7868 | 0.84 ± 0.103 | 0.815 ± 0.099 | 0.862 ± 0.101 | 0.865 ± 0.103 | |
| 4 | 4.15E-11 | 5.9426 | 0.692 ± 0.086 | 0.665 ± 0.082 | 0.708 ± 0.081 | 0.727 ± 0.082 | |
| 5 | 8.04E-07 | 4.5081 | 0.826 ± 0.113 | 0.798 ± 0.108 | 0.844 ± 0.115 | 0.861 ± 0.108 | |
| 6 | 3.89E-08 | 12.704 | 2.458 ± 0.395 | 2.346 ± 0.368 | 2.523 ± 0.383 | 2.599 ± 0.401 | |
| 7 | 7.84E-07 | 4.3032 | 0.542 ± 0.072 | 0.524 ± 0.069 | 0.553 ± 0.073 | 0.564 ± 0.068 | |
| 8 | 1.94E-08 | 4.0983 | 0.98 ± 0.135 | 0.943 ± 0.131 | 1.005 ± 0.133 | 1.026 ± 0.123 | |
| 9 | 2.84E-09 | 5.7377 | 0.15 ± 0.029 | 0.143 ± 0.026 | 0.15 ± 0.023 | 0.164 ± 0.036 | |
| 10 | 4.60E-07 | 4.0983 | 0.386 ± 0.029 | 0.38 ± 0.029 | 0.388 ± 0.029 | 0.397 ± 0.025 | |
The term r(i, j/k,l) denotes the ratio of distance of line segment i-th and j-the feature points to distance of line segment k-th and l-the feature points. The P-value comes from the ANOVA test.