| Literature DB >> 29354684 |
James F Morley1,2, Abigail Cohen3, Laura Silveira-Moriyama4, Andrew J Lees4, David R Williams5, Regina Katzenschlager6, Christopher Hawkes7, Julie P Shtraks1, Daniel Weintraub1,2,8, Richard L Doty9, John E Duda1,2.
Abstract
The 40-item University of Pennsylvania Smell Identification Test (UPSIT) is an effective instrument to detect olfactory dusfunction in Parkinson's disease (PD). It is not clear, however, whether tests of this length are necessary to detect such dysfunction. Several studies have suggested that detection of certain odors is selectively compromised in PD, and that a test comprised of these odors could be shorter and more specific for this purpose. Therefore, we attempted to identify a subset of UPSIT odors that distinguish PD from controls with similar or improved test characteristics compared to the full test. The discriminatory power of each odor was examined using UPSIT data from a discovery cohort of 314 PD patients and 314 matched controls and ranked using multiple methods (including odds ratios, regression coefficients and discriminant analysis). To validate optimally discriminant subsets, we calculated test characteristics using data from two independent cohorts (totaling 306 PD and 343 controls). In the discovery cohort, multiple novel 12-item subsets (and the previously described Brief Smell Identification Test-B) performed similarly or improved upon the UPSIT and were better than 12 random items. However, in validation studies from independent cohorts, multiple subsets retained test characteristics similar to the full UPSIT, but did not outperform 12 random items. Differential discriminatory power of individual items is not conserved across independent cohorts arguing against selective hyposmia in PD. However, multiple 12-item subsets performed as well as the full UPSIT. These subsets could form the basis for shorter olfactory tests in the clinical evaluation of Parkinsonism.Entities:
Year: 2018 PMID: 29354684 PMCID: PMC5768805 DOI: 10.1038/s41531-017-0039-8
Source DB: PubMed Journal: NPJ Parkinsons Dis ISSN: 2373-8057
Examples of currently available olfactory tests used in PD and previously proposed discriminant subsets of odors
| Test/Author | # Odors | Comment | Ref |
|---|---|---|---|
| Scratch and Sniff-based | |||
| UPSIT | 40 | Odor identification. Used in >100 PD studies |
|
| B-SIT | 12 | Designed to be shorter and cross-culturally valid. Not intended to be PD specific |
|
| B-SIT-B | 12 | Based on the BSIT. Modified with the intention to be more specific for PD |
|
| Double | 5 | Gasoline, banana, pineapple, smoke, cinnamon Identified 82% of PD cases correctly |
|
| Pocket Smell Test | 3 | Lemon, lilac, smoke. Not intended to be PD -specific |
|
| Bohnen | 3 | Banana, licorice, dill pickle. 75% accurate in identifying PD. Better correlated with dopamine transporter imaging than total UPSIT score |
|
| Hawkes | 2 | Pizza, wintergreen. 90% sensitivity 86% specificity for PD |
|
| Odor pen-based | |||
| Sniffin’ Sticks | 16 | Odor identification (modules for threshold and discrimination as well). Well-characterized in PD |
|
| Mahlknecht | 8 | Licorice, anise, mint, cinnamon, banana, pineapple, rose, coffee. 84% sensitivity, 88% specificity for PD |
|
| Casjens | 3 | Coffee, peppermint, anise. Similar misclassification for rate for PD compared to using 16 odors |
|
| Hummel | 3 | Cloves, coffee, rose. 96% sensitivity and 66% specificity for olfactory dysfunction in the general population. Not intended to be PD-specific |
|
UPSIT University of Pennsylvania smell identification test, B-SIT brief smell identification test, B-SIT-B brief smell identification test, version B
Different sets of odors distinguish between PD and control subjects
| Items | Control mean (SD) | PD mean (SD) |
| AUC (95% CI) | Sen | Spe | Cut | |
|---|---|---|---|---|---|---|---|---|
| UPSIT | 40 | 28 (8.7) | 19 (7.2) | <0.001 | 0.78 (0.74–0.82) | 0.84 | 0.66 | 27 |
| BSIT | 12 | 8.7 (2.7) | 5.8 (2.6) | <0.001 | 0.78 (0.74–0.82) | 0.85 | 0.62 | 9 |
| BSIT-B | 12 | 8.3 (3.0) | 5.0 (2.3) | <0.001 | 0.80 (0.76–0.83) | 0.86 | 0.67 | 8 |
| Double | 5 | 3.5 (1.3) | 2.2 (1.4) | <0.001 | 0.75 (0.71–0.79) | 0.79 | 0.58 | 4 |
| Bohnen | 3 | 2.0 (1.0) | 1.0 (0.85) | <0.001 | 0.75 (0.71–0.79) | 0.73 | 0.70 | 2 |
| PST | 3 | 2.2 (0.89) | 1.6 (1.0) | <0.001 | 0.69 (0.65–0.74) | 0.78 | 0.54 | 3 |
| Hawkes | 2 | 1.3 (0.75) | 0.85 (0.75) | <0.001 | 0.67 (0.63–0.71) | 0.79 | 0.50 | 2 |
Data are mean(SD) of the number of correctly identified odors, Area under the receiver-operator characteristic curve (AUC), sensitivity (Sen) and specificity (Spe). Cut = cut-off number of correct answers used for point sensitivity and specificity
UPSIT University of Pennsylvania smell identification test, B-SIT brief smell identification test, B-SIT-B brief smell identification test, version B, PST pocket smell test
Putative subsets highly discriminant of UPSIT items
| Rank | Difference | Odds ratio | Discriminant | Regression | Combined | Worst |
|---|---|---|---|---|---|---|
| 1 |
|
| Lime | Grass |
| Rootbeer |
| 2 | Motor oil | Grass | Turpentine | Lime | Grass | Watermelon |
| 3 | Soap | Licorice | Smoke | Banana | Turpentine | Leather |
| 4 | Gasoline | Lemon | Banana | Turpentine |
| Onion |
| 5 | Paint thinner | Motor oil |
|
| Lime | Gingerbread |
| 6 | Peanut | Turpentine | Grape |
|
| Peach |
| 7 | Grass | Dill pickle | Soap | Cherry | Motoroil | Cheddar cheese |
| 8 | Lemon |
| Dill pickle | Grape | Banana | Cinnamon |
| 9 | Wintergreen |
|
|
|
| Chocolate |
| 10 | Grape | Gasoline | Pine | Mint | Grape | Mint |
| 11 |
| Lime | Cedar | Cinnamon | Lemon | Cherry |
| 12 |
| Paint thinner | Gasoline |
| Gasoline | Strawberry |
Summary of subsets of UPSIT items that were identified as highly discriminant for differentiating PD from control subjects in the discovery cohort. Listed are the top 12 most discriminant UPSIT items ranked by five different methods: 1) the absolute difference in percentage of PD and control subjects answering incorrectly (Difference), 2) odds ratio, 3) discriminant function analysis (Discriminant),4) logistic regression (Regression), 5) a weighted average combining the first four methods (Combined). The worst 12 items using the difference method (Worst)are shown for comparison. To highlight odors that were found as highly discriminant for differentiating PD from controls using multiple different ranking methods, items appearing on the all of the “Difference”, “Odds Ratio”, “Discriminant”, “Regression” and “Combined” lists are shown as bold. 12 item lists were used to facilitate comparison with existing, commercially available smell tests such as the B-SIT
Test characteristics of putative subsets of highly discriminant UPSIT items from the discovery cohort
| Items | AUC (95% CI) | Sensitivity | Specificity | Cut-off | |
|---|---|---|---|---|---|
| UPSIT | 40 | 0.78(0.74–0.82) | 0.84 | 0.66 | 27 |
| Difference | 12 | 0.82(0.78–0.85) | 0.77 | 0.74 | 8 |
| Odds Ratio | 12 | 0.81(0.79–0.86) | 0.84 | 0.67 | 8 |
| Regression | 12 | 0.80(0.76–0.83) | 0.82 | 0.68 | 8 |
| Discriminant | 12 | 0.81(0.78–0.75) | 0.83 | 0.69 | 8 |
| Combined | 12 | 0.83(0.80–0.86) | 0.84 | 0.71 | 8 |
| Random | 12 | 0.76(0.72–0.79) | 0.78 | 0.65 | 8 |
| Worst | 12 | 0.65(0.61–0.70) | 0.72 | 0.53 | 9 |
| Items from the “Combined” List | |||||
| 11 | 0.83(0.80–0.86) | 0.80 | 0.72 | 8 | |
| 10 | 0.82(0.79–0.85) | 0.76 | 0.76 | 8 | |
| 9 | 0.81(0.79–0.85) | 0.70 | 0.77 | 6 | |
| 8 | 0.80(0.79–0.85) | 0.75 | 0.74 | 6 | |
| 7 | 0.80(0.78–0.85) | 0.85 | 0.67 | 5 | |
| 6 | 0.79(0.76–0.84) | 0.80 | 0.70 | 4 | |
| 5 | 0.79(0.76–0.84) | 0.67 | 0.74 | 3 | |
| 4 | 0.78(0.75–0.82) | 0.71 | 0.70 | 3 | |
| 3 | 0.77(0.74–0.81) | 0.68 | 0.74 | 2 | |
| 2 | 0.73(0.70–0.77) | 0.84 | 0.57 | 2 | |
Data are area under the receiver-operator characteristic curve (AUC), sensitivity and specificity for differentiating PD from control subjects in the discovery cohort. Cut-off = number of correct answers used to determine the point sensitivity and specificity. The subsets of highly discriminant items were determined by ranking odors using five different methods: 1) the absolute difference in percentage of PD and control subjects answering incorrectly (difference), 2) odds ratio, 3) discriminant analysis (discriminant), 4) logistic regression (regression), 5) a weighted average combining the first four methods (combined). For comparison, test characteristics for 12 random items and the worst 12 items using the difference method (worst) are shown. 12 item lists were used to facilitate comparison with existing, commercially available smell tests such as the B-SIT. In the second half of the Table, test characteristics for subsets containing decreasing numbers of the 12 most highly discriminatory items from the discovery cohort are shown
UPSIT University of Pennsylvania smell identification test
Effect of age and sex on olfactory test characteristics
| Test | Age | AUC (95% CI) | Cut-off | Sensitivity | Specificity | ||||
|---|---|---|---|---|---|---|---|---|---|
| Men | Women | Men | Women | Men | Women | Men | Women | ||
| UPSIT | <63 | 0.86 (0.82–0.90) | 0.81 (0.75–0.87) | 27 | 29 | 0.79 | 0.74 | 0.83 | 0.79 |
| 63–73 | 0.88 (0.84–0.92) | 0.88 (0.83–0.94) | 23 | 26 | 0.83 | 0.83 | 0.79 | 0.86 | |
| >73 | 0.76 (0.70–0.82) | 0.82 (0.71–0.93) | 23 | 24 | 0.78 | 0.80 | 0.68 | 0.83 | |
| BSIT-B | <63 | 0.85 (0.81–0.90) | 0.80 (0.74–0.87) | 8 | 9 | 0.80 | 0.71 | 0.80 | 0.77 |
| 63–73 | 0.87 (0.83–0.91) | 0.88 (0.82–0.94) | 7 | 8 | 0.89 | 0.85 | 0.75 | 0.85 | |
| >73 | 0.77 (0.71–0.83) | 0.82 (0.71–0.93) | 7 | 8 | 0.78 | 0.87 | 0.68 | 0.72 | |
| Combined | <63 | 0.85 (0.81–0.89) | 0.76 (0.70–0.83) | 8 | 9 | 0.74 | 0.67 | 0.81 | 0.74 |
| 63–73 | 0.86 (0.82–0.91) | 0.86 (0.80–0.92) | 7 | 8 | 0.79 | 0.82 | 0.77 | 0.77 | |
| >73 | 0.78 (0.72–0.84) | 0.82 (0.72–0.92) | 7 | 7 | 0.78 | 0.77 | 0.63 | 0.72 | |
Data are area under the receiver-operator characteristic curve (AUC), sensitivity and specificity for differentiating PD from control subjects in the discovery cohort. Cut-off = number of correct answers used to determine the point sensitivity and specificity. Subjects from all three cohorts (N = 1279) were divided by sex and age tertile (<63 years old, 63–73 years old and >73 years old). UPSIT University of Pennsylvania smell identification test, BSIT-B brief smell identification test, version B. Combined: Top 12 items found most highly discriminatory in the discovery cohort
Fig. 1Novel UPSIT subsets do no retain discriminatory power across independent cohorts. Data are area under the receiver-operator characteristic curve (error bars represent the 95% confidence interval) for the 40 UPSIT items (red), or items from the BSIT-B (green), “combined” subset (blue), random 12 items (white) or worst 12 items (from the training cohort) when tested using data from the training cohort (Penn) or two independent cohorts (Barts, UCL)