| Literature DB >> 30534557 |
Zhi-Yu Luo1, Ji Cui1, Xiao-Juan Hu2, Li-Ping Tu1, Hai-Dan Liu1, Wen Jiao1, Ling-Zhi Zeng1, Cong-Cong Jing1, Li-Jie Qiao1, Xu-Xiang Ma1, Yu Wang1, Jue Wang1, Ching-Hsuan Pai1, Zhen Qi1, Zhi-Feng Zhang1, Jia-Tuo Xu1.
Abstract
OBJECTIVE: In this study, machine learning was utilized to classify and predict pulse wave of hypertensive group and healthy group and assess the risk of hypertension by observing the dynamic change of the pulse wave and provide an objective reference for clinical application of pulse diagnosis in traditional Chinese medicine (TCM).Entities:
Mesh:
Year: 2018 PMID: 30534557 PMCID: PMC6252205 DOI: 10.1155/2018/2964816
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Collection instrument. (a) Pulse diagnosis instrument (PDA-1). (b) Interface of pulse diagnosis and analysis system.
TD features.
| No. | Features | Meaning |
|---|---|---|
| 1 | h1 | Main wave amplitude. It reflects the compliance of the aorta and the cardiac ejection function of the left ventricular |
|
| ||
| 2 | h2 | Main isthmus wave amplitude. Same physiological significance as h3. |
|
| ||
| 3 | h3 | Heavy wave front wave amplitude. It reflects the elasticity of arterial vessels and its peripheral resistance. |
|
| ||
| 4 | h4 | Dicrotic notch amplitude. It reflects the peripheral resistance of arterial vessels and the closure of aortic valve. |
|
| ||
| 5 | h5 | Gravity wave amplitude. It reflects the compliance of the aorta and the function of aortic valve. |
|
| ||
| 6 | t1 | Left ventricular rapid ejection period. The time value from the start point to the crest point of the main wave on the pulse graph. |
|
| ||
| 7 | t2 | The duration of the beginning of the tidal wave. |
|
| ||
| 8 | t3 | The duration of the crest of the tidal wave. |
|
| ||
| 9 | t4 | Left ventricular systolic duration. The time value from the start point to the dicrotic notch on the pulse graph. |
|
| ||
| 10 | t5 | Left ventricular diastolic duration. The time value from the dicrotic notch to the end point on the pulse graph. |
|
| ||
| 11 | t | Includes left ventricular systolic and diastolic duration. The time value from the start point to the end point on the pulse graph. |
|
| ||
| 12 | w1 | main wave 1/3 height. The duration of maintaining high intravascular pressure. |
|
| ||
| 13 | w2 | main wave 1/5 height. The duration of maintaining high intravascular pressure. |
|
| ||
| 14 | w1/t | The ratio of the width of the main wave at its 1/3 height to the entire pulse cycle. It reflects the proportion of the duration time of continuous high pressure in the aorta in the entire pulse cycle. |
|
| ||
| 15 | w2/t | The proportion of the duration time of continuous high pressure in the aorta in the entire pulse cycle. |
|
| ||
| 16 | h1/t1 | cardiovascular function |
|
| ||
| 17 | As | Systolic area. The area on the pulse graph is related to cardiac output. |
|
| ||
| 18 | Ad | Diastolic area. |
Figure 2The main measurement parameters of pulse wave cycle.
Figure 3Flow chart.
Figure 4k-means clustering analysis. The x-axis represents the number of clusters while the y-axis represents the variable h1/t1. Red points include noise pulse wave. Blue points include normal pulse wave.
Comparison of the variables between hypertension group and healthy group ().
|
|
|
|
|
|---|---|---|---|
|
| 44.44 ± 9.204 | 44.7 ± 8.706 | 0.37 |
|
| 23.91 ± 2.961 | 25.55 ± 3.306 | 0.0 |
|
| 0.18 ± 0.035 | 0.18 ± 0.033 | 0.55 |
|
| 0.13 ± 0.034 | 0.14 ± 0.034 | 0.05 |
|
| 0.22 ± 0.029 | 0.22 ± 0.028 | 1 |
|
| 0.11 ± 0.035 | 0.1 ± 0.036 | 0.0 |
|
| 116.0 ± 35.992 | 126.11 ± 44.893 | 0.0 |
|
| 84.98 ± 32.215 | 93.18 ± 40.319 | 0.02 |
|
| 78.02 ± 29.356 | 85.73 ± 36.019 | 0.01 |
|
| 44.78 ± 15.047 | 47.55 ± 18.016 | 0.09 |
|
| 12.8 ± 4.455 | 12.16 ± 4.056 | 0.04 |
|
| 0.14 ± 0.021 | 0.14 ± 0.022 | 0.31 |
|
| 0.24 ± 0.037 | 0.24 ± 0.041 | 0.0 |
|
| 0.27 ± 0.033 | 0.27 ± 0.038 | 0.08 |
|
| 0.36 ± 0.03 | 0.36 ± 0.034 | 0.23 |
|
| 0.41 ± 0.023 | 0.41 ± 0.028 | 0.0 |
|
| 0.85 ± 0.119 | 0.83 ± 0.128 | 0.01 |
|
| 838.82 ± 276.686 | 919.03 ± 355.812 | 0.0 |
|
| 0.67 ± 0.128 | 0.68 ± 0.133 | 0.36 |
|
| 0.39 ± 0.082 | 0.38 ± 0.082 | 0.11 |
|
| 0.21 ± 0.036 | 0.22 ± 0.033 | 0.0 |
|
| 0.16 ± 0.036 | 0.16 ± 0.035 | 0.0 |
|
| 77.08 ± 9.613 | 80.4 ± 11.839 | 0.0 |
|
| 85.74 ± 4.868 | 76.38 ± 10.331 | 0.0 |
Compared with healthy group. ∗ P <0.05, ∗∗ P <0.01.
Results on the classification of machine learning model.
| Model | ACC | AUC | SP | ST |
|---|---|---|---|---|
| RF | 0.841 | 0.832 | 0.936 | 0.728 |
| RF | 0.853 | 0.848 | 0.905 | 0.792 |
| Gradient Boosting | 0.852 | 0.843 | 0.941 | 0.746 |
| Gradient Boosting | 0.864 | 0.859 | 0.920 | 0.798 |
| SVM | 0.796 | 0.792 | 0.833 | 0.752 |
| SVM | 0.832 | 0.828 | 0.865 | 0.792 |
| AdaBoost | 0.839 | 0.830 | 0.921 | 0.740 |
| AdaBoost | 0.864 | 0.858 | 0.925 | 0.792 |
| KNeighbors | 0.729 | 0.716 | 0.852 | 0.580 |
| KNeighbors | 0.736 | 0.728 | 0.830 | 0.625 |
Models with an asterisk ∗ mean that K-means is applied before using these models.
Figure 5The ROC curve in different model. (a) Data without noise reduction. (b) Data with noise reduction. The x-axis denotes false positive rate. The y-axis is true positive rate. In Figure 5, the dark blue line represents Random Forest (RF). The green line represents support vector machine (SVM). The red line represents Adaboost. The light blue line represents Gradient Boosting. The purple line represents K-nearest neighbor (KNN).
Figure 6The importance of the classification variables features. The bar charts (a), (b), and (c) represent the results of feature importance of AdaBoost, Gradient Boosting, and Random Forest (RF), respectively. The y-axis represents the value of the importance for variables features. Note that “yes" represents that K-means is used to reduce noise in this model and vice versa.
Selected (top 12) features in model.
| model | Feature variable | Feature importance |
|---|---|---|
| Adaboost | t | 0.126 |
| BMI | 0.109 | |
| HR | 0.083 | |
| h1/t1 | 0.066 | |
| Ad | 0.057 | |
| w2 | 0.051 | |
| t2 | 0.049 | |
| t3 | 0.049 | |
| w1/t | 0.040 | |
| h5 | 0.034 | |
| h3/h1 | 0.034 | |
| As | 0.034 | |
|
| ||
| Gradient Boosting | H20 score | 0.128 |
| BMI | 0.075 | |
| t | 0.065 | |
| h1/t1 | 0.055 | |
| t3 | 0.048 | |
| h5 | 0.042 | |
| t5 | 0.042 | |
| Ad | 0.041 | |
| t2 | 0.041 | |
| t1 | 0.039 | |
| h4 | 0.039 | |
| w2/t | 0.039 | |
Figure 7The analysis of the importance on the features of traditional statistics and machine learning.