| Literature DB >> 35590889 |
Zhiqiang Ni1,2, Fangmin Sun1, Ye Li1.
Abstract
Accurate assessment of physical fatigue is crucial to preventing physical injury caused by excessive exercise, overtraining during daily exercise and professional sports training. However, as a subjective feeling of an individual, physical fatigue is difficult for others to objectively evaluate. Heart rate variability (HRV), which is derived from electrocardiograms (ECG) and controlled by the autonomic nervous system, has been demonstrated to be a promising indicator for physical fatigue estimation. In this paper, we propose a novel method for the automatic and objective classification of physical fatigue based on HRV. First, a total of 24 HRV features were calculated. Then, a feature selection method was proposed to remove useless features that have a low correlation with physical fatigue and redundant features that have a high correlation with the selected features. After feature selection, the best 11 features were selected and were finally used for physical fatigue classifying. Four machine learning algorithms were trained to classify fatigue using the selected features. The experimental results indicate that the model trained using the selected 11 features could classify physical fatigue with high accuracy. More importantly, these selected features could provide important information regarding the identification of physical fatigue.Entities:
Keywords: feature selection; heart rate variability; machine learning; physical fatigue
Mesh:
Year: 2022 PMID: 35590889 PMCID: PMC9100264 DOI: 10.3390/s22093199
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Participants’ statistical characteristics.
| Statistical Characteristic | Value |
|---|---|
| Number of Subjects | 80 (42 Males, 38 Females) |
| Age (years) | 29.1 ± 6.5 |
| Height (cm) | 168.0 ± 8.1 |
| Weight (kg) | 61.7 ± 11.2 |
Protocol of the modified Bruce treadmill test.
| Stage | Duration (min) | Speed (km/h) | Incline (%) |
|---|---|---|---|
| Pre-rest | 5 | 0 | 0 |
| Ex-1 | 5 | 3 | 5 |
| Ex-2 | 5 | 5 | 5 |
| Ex-3 | 5 | 6.4 | 5 |
| Ex-4 | 5 | 7.8 | 5 |
| Ex-5 | 5 | 10.2 | 5 |
| Ex-6 | Until exhausted | 11.6 | 5 |
Figure 1The scenario of the data collection experiment.
The RPE scale and its description.
| Borg Rating | Description |
|---|---|
| 6 | Nothing |
| 7 to 8 | Very, very light |
| 9 to 10 | Very light |
| 11 to 12 | Fairly light |
| 13 to 14 | Somewhat hard |
| 15 to 16 | Hard |
| 17 to 18 | Very hard |
| 19 to 20 | Very, very hard |
Figure 2The distribution of the collected dataset.
All HRV features.
| Measures | Feature | Unit | Description |
|---|---|---|---|
| Time domain | meanNN | ms | Mean of NN interval sequence. |
| meanHR | 1/min | Mean of heart rate sequence. | |
| SDNN | ms | Standard deviation of NN interval sequence. | |
| RMSSD | ms | Root mean square of successive differences in NN interval sequence. | |
| NN50 | count | Number of successive differences in NN interval sequences greater than 50 ms. | |
| pNN50 | % | Percentage of NN50 in total intervals. | |
| SDANN | ms | Standard deviation of the averages of the segmented chunks. | |
| SDNNi | ms | Average of the standard deviations of the segmented chunks. | |
| HRVTi | - | Ratio of total number of all intervals to the height of the histogram. | |
| TINN | ms | Baseline width of the minimum square difference triangular interpolation of the highest peak of the histogram. | |
| Frequency domain | aVLF | ms2 | Absolute powers of VLF band. |
| aLF | ms2 | Absolute powers of LF band. | |
| aHF | ms2 | Absolute powers of HF band. | |
| LF/HF | - | Ratio of aLF/aHF. | |
| peakVLF | Hz | Peak frequency for VLF band. | |
| peakLF | Hz | Peak frequency for LF band. | |
| peakHF | Hz | Peak frequency for HF band. | |
| Nonlinear domain | sampen | - | Negative natural logarithm of the conditional probability that two sequences remain similar at the next point. |
| SD1 | ms | Standard deviations along the major axis of the ellipse. | |
| SD2 | ms | Standard deviations along the minor axis of the ellipse. | |
| SD1/SD2 | - | Ratio of SD1 to SD2. | |
| α | - | Slope of a fitting line of the root mean square fluctuation of an integrated and detrended time series on a log–log scale. | |
| α1 | - | α on first linear region. | |
| α2 | - | α on second linear region. |
Figure 3The scores calculated by Equation (4) of all the 24 original features.
Figure 4Correlations between 14 important features.
Optimal feature set.
| Time Domain | Frequency Domain | Nonlinear Domain |
|---|---|---|
| meanHR | aVLF | sampen |
| NN50 | SD2 | |
| SDANN | SD1/SD2 | |
| HRVTi | α | |
| TINN | α1 |
Performance of the four machine learning models using different features.
| Model | Using All Features | Using Selected Features | ||||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | Precision | Recall | F1 Score | Accuracy | Precision | Recall | F1 Score | |
| DT | 0.728 ± 0.043 | 0.646 ± 0.062 | 0.644 ± 0.053 | 0.642 ± 0.056 | 0.772 ± 0.030 | 0.711 ± 0.036 | 0.706 ± 0.034 | 0.705 ± 0.034 |
| KNN | 0.780 ± 0.045 | 0.712 ± 0.044 | 0.694 ± 0.041 | 0.696 ± 0.044 | 0.805 ± 0.020 | 0.754 ± 0.031 | 0.735 ± 0.038 | 0.736 ± 0.033 |
| SVM | 0.810 ± 0.038 | 0.752 ± 0.048 | 0.748 ± 0.053 | 0.747 ± 0.052 | 0.831 ± 0.037 | 0.780 ± 0.045 | 0.770 ± 0.040 | 0.769 ± 0.043 |
| LightGBM | 0.841 ± 0.030 | 0.811 ± 0.035 | 0.777 ± 0.041 | 0.781 ± 0.038 | 0.855 ± 0.015 | 0.829 ± 0.032 | 0.800 ± 0.031 | 0.801 ± 0.025 |
Figure 5Overall confusion Matrix of LightGBM of the 10-fold cross validation.