| Literature DB >> 28753975 |
Alicia Nieto-Reyes1, Rafael Duque2, José Luis Montaña3, Carmen Lage4.
Abstract
Functional data analysis and artificial neural networks are the building blocks of the proposed methodology that distinguishes the movement patterns among c's patients on different stages of the disease and classifies new patients to their appropriate stage of the disease. The movement patterns are obtained by the accelerometer device of android smartphones that the patients carry while moving freely. The proposed methodology is relevant in that it is flexible on the type of data to which it is applied. To exemplify that, it is analyzed a novel real three-dimensional functional dataset where each datum is observed in a different time domain. Not only is it observed on a difference frequency but also the domain of each datum has different length. The obtained classification success rate of 83 % indicates the potential of the proposed methodology.Entities:
Keywords: Alzheimer; functional data analysis; healthcare; hypothesis testing; pattern recognition; supervised classification; ubiquitous computing
Mesh:
Year: 2017 PMID: 28753975 PMCID: PMC5539862 DOI: 10.3390/s17071679
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Representation of the three axes into which the accelerations are measured; and shown in two different positions, panels (a) and (b).
Figure 2Measured acceleration on the x-axis for the accelerations of the patients in m/s versus the time in s, at different stages of the disease: early (left), middle (central) and late (right).
Figure 3Histograms of 1000 p-values resulting from applying the two functional ANOVA tests based on random projections: based on the Bonferroni correction (top row) and on the FDR (middle row) and the functional ANOVA based on the F-statistic (bottom row).
Mean, standard deviation and proportion of values smaller or equal than 0.05 of 1000 p-values resulting of applying separately for each coordinate axis, y and the hypothesis tests: functional ANOVA tests based on random projections using the Bonferroni correction (first column) and the FDR (second column), functional ANOVA test based on the F-statistic (third column) and functional test of means based on the norm for contrasting the acceleration curves corresponding to patients in the early-stage of the disease against the middle (fourth column), the early against the late (fifth column) and the middle against the late (sixth column).
| Tests | ||||||
|---|---|---|---|---|---|---|
| Bonferroni | FDR | F-Statistic | ||||
| mean | 0.9266 | 0.8957 | 0.0018 | 0.0013 | 0.7356 | 0.0020 |
| stand. dev. | 0.2027 | 0.2060 | 0.0011 | 0.0006 | 0.1256 | 0.0012 |
| proportion | 0.004 | 0.004 | 1 | 1 | 0 | 1 |
| mean | 0.3603 | 0.1616 | 0.0066 | 0.0021 | 0.0492 | 0.2267 |
| stand. dev. | 0.3015 | 0.0931 | 0.0046 | 0.0012 | 0.0168 | 0.0381 |
| proportion | 0.128 | 0.138 | 1 | 1 | 0.588 | 0 |
| mean | 0.2971 | 0.1282 | 0.1465 | 0.1733 | 0.0377 | 0.2702 |
| stand. dev. | 0.2145 | 0.0620 | 0.0762 | 0.0616 | 0.0139 | 0.0853 |
| proportion | 0.065 | 0.087 | 0.081 | 0.005 | 0.833 | 0 |
Figure 4Histograms of 1000 p-values resulting from applying the test of equality of means based on the norm for the null hypothesis: (top row), (middle row) and (bottom row) on the x-axis (left column), y-axis (middle column) and z-axis (right column).
Misclassification and success rate obtained in the functional supervised classification when applied to the data in the x-axis, y-axis and -axis.
| Missclassifications | Success Rate | |||||||
|---|---|---|---|---|---|---|---|---|
| Early | Middle | Late | Total | Early | Middle | Late | Total | |
| 7 | 2 | 9 | 18 | 0% | 89% | 10% | 49% | |
| 7 | 4 | 8 | 19 | 0% | 78% | 20% | 46% | |
| 6 | 2 | 9 | 17 | 14% | 89% | 10% | 51% | |
Stages of the patients selected in the training and test sample for each of the performed splittings.
| Training Sample | Test Sample | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Splitting | Early | Middle | Late | Total | Early | Middle | Late | Total | |||
| studied | 6 | 15 | 8 | 29 | 1 | 3 | 2 | 6 | |||
| 1 | 5 | 15 | 8 | 28 | 2 | 3 | 2 | 7 | |||
| 2 | 5 | 14 | 8 | 27 | 2 | 4 | 2 | 8 | |||
Figure 5Boxplot of the ercentage of misclassified patients. The test data corresponding to the splitting, on training and test sample, under study (left), to Splitting 1 (middle) and to Splitting 2 (right).
Configuration of the neural network selected.
| Neural Network Parameters | Values |
|---|---|
| Package | neuralnet |
| Input neurons | 90 |
| Hidden neurons | 175 |
| Output neurons | 1 |
| Bias | 1 per hidden layer |
| Max iterations | 1000 |
| Activation function | logistic |
| Algorithm | resilient back-propagation with weight backtracking (rprop+) |
Experiments with other methods for supervised classification. NN stands for neural networks, RT for random trees, RF for randon forest, and SVM for support vector machine.
| Missclassification Rate | Success Rate | |||||||
|---|---|---|---|---|---|---|---|---|
| Technique | Early-Stage | Middle-Stage | Late-Stage | Total | Early-Stage | Middle-Stage | Late-Stage | Total |
| NN | 0 | 0 | 1 | 1 | 100% | 100% | 50% | 83% |
| RT | 1 | 0 | 2 | 3 | 0% | 100% | 0% | 50% |
| RF | 1 | 0 | 2 | 3 | 0% | 100% | 0% | 50% |
| SVM | 1 | 0 | 2 | 3 | 0% | 100% | 0% | 50% |