| Literature DB >> 30671562 |
Jörn Lötsch1,2, Thomas Hummel3.
Abstract
BACKGROUND: The functional performance of the human sense of smell can be approached via assessment of the olfactory threshold, the ability to discriminate odors or the ability to identify odors. Contemporary clinical test batteries include all or a selection of these components, with some dissent about the required number and choice.Entities:
Keywords: Anosmia; Data science; Human olfaction; Machine-learning; Olfactory diagnostics; Patients
Year: 2019 PMID: 30671562 PMCID: PMC6330373 DOI: 10.1016/j.ibror.2019.01.002
Source DB: PubMed Journal: IBRO Rep ISSN: 2451-8301
Fig. 1Original individual data shown in “spaghetti plots”, separately for the three olfactory diagnoses for better visibility. The individual values of olfactory threshold, odor discrimination and odor identification are connected by straight lines. The data are slightly jittered to enhance visibility by reducing superimposition of data points. The bold dashed black lines indicate the medians across the whole cohort. The figure has been created using the R software package (version 3.4.2 for Linux; http://CRAN.R-project.org/ (R Development Core Team, 2008)).
Results of the analysis of variance for repeated measurements (rm-ANOVA) and correlation analyses. Specifically, the rm-ANOVA was designed with “subtest”, i.e., olfactory threshold, odor discrimination and odor identification as within-subject factor and “olfactory diagnosis”, i.e., anosmia, hyposmia or normosmia, “gender” as between subject factors and “age as covariate. Degrees of freedom, F-values and p-values are shown for main effects and interactions.
| Effect | Degrees of freedom | F-value | p-value |
|---|---|---|---|
| Subtest (threshold, discrimination, identification) | 2,21412 | 70.742 | 2.38 · 10−31 |
| Olfactory diagnosis (anosmia, hyposmia, normosmia)) | 2,10706 | 27550.541 | < 10−100 |
| Gender (male, female) | 1,10706 | 66.355 | 4.18 · 10−16 |
| Age | 1,10706 | 33.169 | 8.68 · 10−9 |
| Subtest * olfactory diagnosis | 4,21412 | 1035.703 | < 10−100 |
| Subtest * gender | 2,21412 | 1.199 | 0.301 |
| Subtest * age | 2,21412 | 190.135 | 1.41 · 10−82 |
| Subtest * olfactory diagnosis * gender | 4,21412 | 2.385 | 0.049 |
| Olfactory diagnosis * age | 2,10706 | 4.386 | 0.012 |
Fig. 2Explorative analysis of the correlations between age, the single olfactory subjects, and the TDI sum score, separately for the three olfactory diagnoses. At the lower left parts, the correlations are shown as ellipses, with the direction toward positive (upwards) or negative (downwards) correlations, and colored according to the color code of Spearman’s ρ (Spearman, 1904) shown at the bottom of the panels. At the upper right parts, the correlations are provided numerically as values of Spearman’s ρ (colored). The p-values are shown in black numbers below the correlation coefficients; “0″ indicates p < 1 · 10−5. The figure has been created using the R software package (version 3.4.2 for Linux; http://CRAN.R-project.org/ (R Development Core Team, 2008)) and the library “corrplot” (https://cran.r-project.org/package=corrplot (Wei and Simko, 2017)).
Performance measures of classifiers obtained using different machine-learned methods (ordinal logistic regression, naïve Bayes, classification and regression trees (CART), k-nearest neighbors, random forests, support vector machines, multilayer perceptron) on the data set comprising the results of three olfactory subtests (assessment of olfactory threshold, odor discrimination and odor identification) acquired in 10,713 subjects. Results represent the medians of the test performance measures from 100 model runs using random splits of the data set into training data (2/3 of the data set) and test data (1/3 of the data set). The classifiers were trained on the original data set and again on randomly permuted training data.
| Data set | Original data | Permuted data | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Parameter | CART | Random forests | k-nearest neighbors | Support vector machines | Ordinal logistic regression | Multilayer perceptron | Naïve Bayes | CART | Random forests | k-nearest neighbors | Support vector machines | Ordinal logistic regression | Multilayer perceptron | Naïve Bayes |
| Sensitivity, recall [%] | 96.5 | 97.3 | 97.2 | 98 | 97.2 | 97.9 | 93 | 34.7 | 15.1 | 41.2 | 0.8 | 0 | 4.4 | 0 |
| Specificity [%] | 98.5 | 98.8 | 98.7 | 99.1 | 98.8 | 99.1 | 95.6 | 64.9 | 77.9 | 57.7 | 98.4 | 100 | 97.5 | 100 |
| Positive predictive value, precision [%] | 96.5 | 97.3 | 97.1 | 98.1 | 97 | 97.8 | 91 | 34.3 | 39.9 | 34.8 | 40.5 | 40.4 | 40.4 | 40.6 |
| Negative predictive value [%] | 98.6 | 98.8 | 98.8 | 99.1 | 98.7 | 99 | 97.1 | 65.7 | 66.7 | 66.4 | 73.6 | 72.8 | 73.4 | 73.7 |
| Precision | 96.5 | 97.3 | 97.1 | 98.1 | 97 | 97.8 | 91 | 34.3 | 39.9 | 34.8 | 40.5 | 40.4 | 40.4 | 40.6 |
| Balanced accuracy [%] | 97.2 | 97.8 | 97.7 | 98.4 | 95.4 | 98.2 | 94.4 | 50.2 | 50.7 | 50 | 50 | 50 | 50 | 50 |
Balanced accuracy of the olfactory diagnosis of different machine-learned classifiers (ordinal logistic regression, naïve Bayes, classification and regression trees (CART), k-nearest neighbors, random forests, support vector machines, multilayer perceptron) trained with data from the full data set comprising the results of three olfactory subtests (assessment of olfactory threshold, odor discrimination and odor identification) acquired in 10,713 subjects, and with reduced data sets consisting of the results of two or one olfactory subtests (olfactory threshold, T, odor discrimination, D, odor identification, I). In addition, the classification accuracy of age or sex or combinations with the full data set has been assessed. Results represent the medians of the test performance measures from 100 model runs using random splits of the data set into training data (2/3 of the data set) and test data (1/3 of the data set).
| Subtests | Complete | None | Threshold | Discrimination | Identification | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| + Sex | + Sex + age | Age and sex | Alone | + Discrimination | + Identification | Alone | + Identification | Alone | ||
| TDI | TDI + sex | TDI + sex + age | none | T | TD | TI | D | DI | I | |
| Methods | w/o identification | w/o discrimination | w/o threshold | |||||||
| CART | 97.2 | 97.3 | 96.9 | 59 | 83.6 | 91.1 | 90.9 | 68.4 | 85.2 | 84.9 |
| Random forests | 97.8 | 98.3 | 98 | 59.4 | 83.6 | 91 | 91.1 | 68.4 | 85.5 | 84.9 |
| k-nearest neighbors | 97.7 | 97.7 | 96.4 | 55.3 | 82 | 90.1 | 90.4 | 65.9 | 84 | 83.2 |
| Support vector machines | 98.4 | 99.1 | 98.7 | 59.9 | 83.5 | 91.1 | 91.5 | 68.5 | 85.9 | 84.9 |
| Ordinal logistic regression | 95.4 | 95.5 | 95.6 | 55.5 | 84.2 | 89.3 | 91.3 | 67.2 | 83.9 | 85.1 |
| Multilayer perceptron | 98.2 | 99.1 | 99.1 | 59.7 | 83.5 | 91.3 | 91.4 | 67.5 | 85.8 | 85.1 |
| Naïve Bayes | 94.4 | 94.4 | 93.2 | 60.4 | 84.9 | 90 | 90.9 | 67.6 | 86.5 | 85.1 |
Fig. 3Radar plot of the balanced accuracy of different classifiers (ordinal logistic regression, naïve Bayes, classification and regression trees (CART), k-nearest neighbors, random forests, support vector machines, multilayer perceptrons) to establish the clinical olfactory diagnosis (anosmia, hyposmia or normosmia) from olfactory subtest results. The classification performance has been assessed in of 100 model runs using random resampling with splits into 2/3 of the data (training data subset) and 1/3 (test data subset). The plot shows the balanced accuracies in a spider web form. Each category, i.e., machine-learning method, has a separate axis, scaled from 60 to 100% balanced accuracy. The axes are arranged in a circle in 360 degrees evenly, and the values of each series are connected with lines indicating the results obtained with either of the three data sets, i.e., the fully featured set of olfactory subtest results comprising the olfactory threshold, T, odor discrimination, D, odor identification, I, or with reduced-feature data sets from which one or two olfactory subtest results had been omitted. The figure has been created using the R software package (version 3.4.2 for Linux; http://CRAN.R-project.org/ (R Development Core Team, 2008)) with the “radarchart” function provided in the library “fmsb” (M. Nakazawa, https://cran.r-project.org/package=fmsb).