Optimizing olfactory testing for the diagnosis of Parkinson’s disease: item analysis of the university of Pennsylvania smell identification test

Morley, James F.; Cohen, Abigail; Silveira-Moriyama, Laura; Lees, Andrew J.; Williams, David R.; Katzenschlager, Regina; Hawkes, Christopher; Shtraks, Julie P.; Weintraub, Daniel; Doty, Richard L.; Duda, John E.

doi:10.1038/s41531-017-0039-8

Article
Open access
Published: 15 January 2018

James F. Morley^1,2,
Abigail Cohen³,
Laura Silveira-Moriyama⁶,
Andrew J. Lees⁶,
David R. Williams⁷,
Regina Katzenschlager⁸,
Christopher Hawkes⁹,
Julie P. Shtraks¹,
Daniel Weintraub^1,2,4,
Richard L. Doty⁵ &
John E. Duda^1,2

npj Parkinson's Disease volume 4, Article number: 2 (2018)

Abstract

The 40-item University of Pennsylvania Smell Identification Test (UPSIT) is an effective instrument to detect olfactory dysfunction in Parkinson’s disease (PD). It is not clear, however, whether tests of this length are necessary to detect such dysfunction. Several studies have suggested that detection of certain odors is selectively compromised in PD, and that a test comprised of these odors could be shorter and more specific for this purpose. Therefore, we attempted to identify a subset of UPSIT odors that distinguish PD from controls with similar or improved test characteristics compared to the full test. The discriminatory power of each odor was examined using UPSIT data from a discovery cohort of 314 PD patients and 314 matched controls and ranked using multiple methods (including odds ratios, regression coefficients and discriminant analysis). To validate optimally discriminant subsets, we calculated test characteristics using data from two independent cohorts (totaling 306 PD and 343 controls). In the discovery cohort, multiple novel 12-item subsets (and the previously described Brief Smell Identification Test-B) performed similarly or improved upon the UPSIT and were better than 12 random items. However, in validation studies from independent cohorts, multiple subsets retained test characteristics similar to the full UPSIT, but did not outperform 12 random items. Differential discriminatory power of individual items is not conserved across independent cohorts, arguing against selective hyposmia in PD. However, multiple 12-item subsets performed as well as the full UPSIT. These subsets could form the basis for shorter olfactory tests in the clinical evaluation of Parkinsonism.

Introduction

Olfactory impairment is a common finding in Parkinson’s disease (PD), with estimates of prevalence ranging from 50% to more than 90%.^1,2,3,4,5,6 Neurons of the olfactory system are among the first to display PD-related Lewy pathology and clinical anosmia or hyposmia may be detected years before motor symptoms present, suggesting that olfactory impairment may be one of the earliest manifestations of synucleinopathy.^7,8,9 Whether or not such pathology causes olfactory dysfunction is unknown, as other explanations for the early deficits are possible.¹⁰ The high prevalence, persistence throughout disease, and ease of olfactory testing have fostered interest in the use of olfaction as a biomarker for early diagnostic strategies, differential diagnosis and prediction of clinical outcomes of PD and related diseases.¹¹

Numerous tests have been used to measure olfactory function in PD with odor identification tests being the most common.^12,13,14,15 Among the best-characterized and most robust of such tests is the University of Pennsylvania Smell Identification Test (UPSIT).¹⁶ The UPSIT is comprised of four booklets, each of which contains 10 pages. An odorized “scratch & sniff” label is present on each page of each booklet. The subject scratches the label and then indicates which of four response alternatives best matches the perceived smell. The UPSIT is a robust measure of olfactory dysfunction in PD and has been described in numerous studies.¹⁷ However, use of the UPSIT (and other well-characterized methods such as “Sniffin Sticks”¹⁸) can be limited by the difficulty of incorporating such a test into a routine clinical encounter. Shorter tests would seem to be preferable both from the perspective of the patient and the neurologist, particularly within a busy clinical setting.

Shorter tests have indeed been developed, although, as noted in the discussion, there is a trade-off between test length, sensitivity, and reliability. Among such tests is the 12-item Brief Smell Identification Test (B-SIT),^19,20 whose test items, derived from the UPSIT, were designed to be cross-cultural in familiarity. This test has been used to assess the prevalence of, or conversion to, such neurodegenerative diseases as Alzheimer’s disease (AD)^21,22 and PD²³ and has several parallel forms. These forms include odors and response alternatives potentially more sensitive to specific neurodegenerative diseases (e.g., B-SIT Version A for AD, based upon ref. ²⁴, and B-SIT Version B for PD, based upon ref. ²⁵). Numerous other brief screening tests also have been developed, including ones using as few as two odors, although not all have been administered to neurodegenerative disease populations. These include members of the 3-item and 4-item Pocket Smell Test^TM series (PSTs),^26,27 the 3-item Quick Smell Identification Test^TM (Q-SIT),²⁸ a 3-item version of the Sniffin’ Sticks test,^29,30 and a 2-item version of the Open Essence Smell Identification Test,³¹ a recent modification of the more widely used Japanese Odor Stick Identification Test.³² Similarly, subsets of other well-characterized olfactory tests (Sniffin Sticks) have been proposed as shorter and more convenient assays for clinical screening.^29,30

In addition to the development of briefer tests is the question as to whether a pattern of smell loss can be identified that is more specific to PD relative to aging or other disorders that impact smell function. Double and colleagues identified a set of 5 B-SIT items that correctly differentiated 82% of PD cases (ref. ³³), and an early study by Hawkes suggested that 2 UPSIT items alone could effectively distinguish PD patients from controls.³⁴ Bohnen and colleagues identified three odors that were 75% accurate in differentiating PD from controls and were better correlated with dopamine transporter imaging than total UPSIT score.³⁵ Other studies using Sniffin Sticks have similarly proposed odors that are selectively affected in PD compared to other causes of hyposmia including head trauma or aging.^29,36 However, as summarized in Table 1, the putative “PD-specific” items vary widely across studies, raising questions about their reliability and validity in the wider PD population. Other studies have found no such selectivity (refs. ^6,37). Whether there is a selective pattern of hyposmia in PD that can be observed across different cohorts is an unanswered question that has important implications for the development of shorter, more sensitive and specific assays.

Table 1 Examples of currently available olfactory tests used in PD and previously proposed discriminant subsets of odors

The objective of this study was to determine whether a shorter version of the UPSIT could be developed that retained or improved the sensitivity and specificity in detecting hyposmia in PD. Our approach was to comprehensively analyze the discriminatory power of individual UPSIT items using a variety of statistical methods to identify subsets of odors that robustly distinguish PD patients from controls. We first derived candidate subsets in a large matched discovery cohort and then examined their performance in two independent populations of PD patients and controls.

Results

Many subsets of UPSIT items distinguish PD from controls

We first examined the test characteristics of previously proposed or commercially available subsets of UPSIT odors to distinguish PD patients from control subjects. Group means for the full 40-item UPSIT, the 12 items comprising the B-SIT, B-SIT-B, 5 items identified by Double et al.,³³ odors from the 3-item Pocket Smell Test, 3 items previously identified by Bohnen et al.³⁵ and two items suggested by Hawkes³⁴ were significantly lower in PD patients compared to controls (Table 2). Sensitivity and specificity were similar between the UPSIT and each of the 12-item tests (Table 2). The 5-item scale based on odors from Double et al., 3-item subsets and the 2 items proposed by Hawkes had lower sensitivity and/or specificity compared to the full test (Table 2).

Table 2 Different sets of odors distinguish between PD and control subjects

Development of novel UPSIT subsets for the detection of hyposmia in PD

We attempted to identify novel subsets of UPSIT odors that might outperform the full test using different statistical ranking strategies (see methods for full details). This approach narrowed the 40 UPSIT items to a total of 22 unique items that were in the top 12 of at least 1 of the 4 initial ranking approaches (Table 3). Eleven of the items appeared on at least 3 of the 12-item lists. Four items (smoke, soap, licorice, bubblegum) appeared on all lists. Multiple 12-item subsets had test characteristics similar to the full UPSIT (Table 4). Some, such as the 12-item Combined list, had slightly better test characteristics compared to full UPSIT (Sens/Spec, 0.84/0.77 vs. UPSIT 0.84/0.71, Table 4). Relatively poorer test characteristics were observed for 12-item subsets derived at random (0.78/0.65) or from the worst ranking items (0.72/0.53, Table 4) in the discovery cohort. Further shortening the top-12 items led to steady declines in AUC and/or the optimal combination of sensitivity and specificity as items beyond the top 11 were removed (Table 4).

Table 3 Putative subsets highly discriminant of UPSIT items

Table 4 Test characteristics of putative subsets of highly discriminant UPSIT items from the discovery cohort

“PD-specific” subscales derived in one population do not retain discriminatory power across independent cohorts

We examined the performance of our putative PD-specific subsets with individual item UPSIT data from two independently derived validation cohorts. As in the discovery cohort, test characteristics including sensitivity, specificity and AUC for multiple 12-item subsets were similar to those for the UPSIT indicating that smaller subscales can maintain comparable discriminatory power (Figure). However, when tested in the independent samples, the most highly discriminatory subsets from the discovery cohort did not perform better than a random subset or, in fact, the worst ranking 12 items derived from the discovery cohort. For example, AUCs for the Combined-12 subset, full UPSIT and Worst-12 subset calculated with data from the discovery cohort were 0.83 (95% CI 0.80–0.87), 0.78 (95% CI 0.74–0.82), and 0.66 (95% CI 0.62–0.74) respectively (Tables 2, 3). Using data from the Barts cohort, however, these values were essentially identical to one another (AUCs: Combined-12 = 0.85, UPSIT = 0.87, Worst-12 = 0.86, Figure). Similar results were observed using data from the UCL cohort (AUCs: Combined-12 = 0.82, UPSIT = 0.90, Worst-12 = 0.87, Figure).

Effect of age and gender on olfactory test performance

As age and sex are important determinants of olfactory function, we examined the test characteristics of the UPSIT, B-SIT-B and 12 UPSIT items (“Combined” list) we defined as most highly discriminatory in the discovery cohort (Table 5) as a function of age and sex using data from all three cohorts (N = 1279). We found that although the test AUCs were fairly similar in men and women, higher cut-off values were required for optimal sensitivity/specificity in women. Additionally, the tests effectively distinguished between PD and controls in all age groups but, generally, we observed higher AUCs in subjects less than 74 years old (Table 5). The pattern of age/sex influence was similar across the different tests.

Table 5 Effect of age and sex on olfactory test characteristics

Discussion

Detecting anosmia or hyposmia is of significant interest for early identification and differential diagnosis of PD and related disorders. Although the 40-item UPSIT has been found to be an effective instrument to detect anosmia or hyposmia in PD, it is not clear whether tests employing fewer UPSIT items are equally useful in detecting such olfactory dysfunction. Several studies have suggested that certain odors are selectively compromised in PD, and that a test comprised of these odors could be shorter, easier to administer, and more specific for this purpose. However, little uniformity exists across studies. Some of the candidate subsets identified using “scratch and sniff” tests (UPSIT, B-SIT versions) include gasoline, banana, pineapple, smoke and cinnamon,³³ licorice, banana, dill pickle³⁵ and wintergreen and pizza³⁴ (Table 1). Studies using Sniffin Sticks have similarly proposed subsets of highly discriminant odors including coffee, peppermint and anise,³⁶ cloves, coffee and rose,³⁰ and a recent study identified a set of 8 (of 16) Sniffin Sticks that had excellent diagnostic accuracy for early PD and even correctly identified subjects with idiopathic REM-sleep behavior disorder who went on to develop PD.²⁹ While some odors have been suggested by multiple groups (smoke, coffee, banana, licorice), none have been reported uniformly.

When we used four different methods to assess discriminatory capacity, 4 items appeared among the top 12 on all of the lists (Table 3). By chance alone, one would expect less than one item to appear on all lists, suggesting the possibility that these methods did enrich for more highly discriminatory odors in the discovery cohort. Poorer test characteristics were observed for 12-item subsets derived at random or from the worst ranking items (Table 4) compared to the highest ranking items in accord with the idea that we may have identified subsets with greater discriminatory power. However, when examined in two independent validation cohorts, the putative highly discriminant subsets performed no better than randomly selected items or even the least discriminatory items from the discovery cohort (Figure).
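The chance expectation mentioned above can be checked with a short calculation. The sketch below assumes, purely for illustration, that the four rankings are statistically independent and that every item is equally likely to land in any top-12 list; neither assumption is stated in the text, and the function name is hypothetical.

```python
def expected_items_on_all_lists(n_items: int, list_size: int, n_lists: int) -> float:
    """Expected number of items that appear on every list when each list is
    an independent random draw of `list_size` items out of `n_items`."""
    p_one_list = list_size / n_items      # chance a given item is on one list
    p_all_lists = p_one_list ** n_lists   # chance it is on all lists (independence assumed)
    return n_items * p_all_lists


# 40 UPSIT items, four top-12 lists
expected = expected_items_on_all_lists(n_items=40, list_size=12, n_lists=4)
print(f"Expected items on all four lists by chance: {expected:.3f}")
```

Under these assumptions the expectation is about 0.32 items, in line with the "less than one item" statement. Because all four methods rank the same data, their lists are unlikely to be truly independent, so this figure probably understates the overlap that could arise without any real enrichment.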

Support for our overall findings of a lack of a consistent small subset of odorants that differentiates PD patients from controls comes from an item analysis performed on the UPSIT in 1988.⁶ In this study, the pattern of responses of 81 PD patients (based upon the proportion of persons correctly answering an item) across the 40 items of the UPSIT was similar to that of 81 matched controls (Spearman rank order correlation across odor items = 0.75), suggesting the deficit is a general one and unlikely confined to any subset of UPSIT items.
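To make the rank-order comparison concrete, the following minimal sketch computes a Spearman correlation between two item profiles (proportion of subjects answering each item correctly). The five proportions per group are invented toy values, not data from the 1988 study, and the helper names are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # extend j over a run of tied values
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Toy proportions correct for five hypothetical items (invented values)
pd_correct = [0.45, 0.60, 0.30, 0.70, 0.55]
ctrl_correct = [0.80, 0.90, 0.75, 0.85, 0.95]
rho = spearman(pd_correct, ctrl_correct)
print(f"Spearman rho across items: {rho:.2f}")
```

A high correlation means the items that are hardest for controls are also hardest for PD patients, which is what a general deficit, as opposed to an odor-specific one, would be expected to produce.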

There are several reasons why a uniform set of odorants specific to PD has not been found. First, it is conceivable that such a set is not detected because it is overshadowed by the variability derived from cultural or other differences between populations that have been previously studied.^20,38,39,40 Second, the lack of specificity to PD may reflect the absence of specific damage to different receptor classes or receptor channels, either at the level of the epithelium or at higher levels within the central nervous system, including the olfactory bulb. The human olfactory nerve is comprised of 6–10 million olfactory receptor cells, of which there are nearly 400 types harboring G-protein coupled odor receptors (GPCRs) on their cilia, with a given cell expressing only one type of receptor. In most cases, each receptor responds to a range of odorants, such that even a single chemical can stimulate multiple sets of receptor cells. Even if some subset of receptors were damaged specifically by PD, the gestalt of a given smell, like the perception of visual objects, can likely resist the loss of some segments of the olfactory “object” and still retain identification ability via feature-detection processes.³¹ Third, the search for odorants specific to PD is further complicated by the fact that most if not all of the odorants employed in the extant olfactory tests are comprised of multiple chemicals. Until there is a better understanding of the relative distribution numbers of the ~400 classes of receptor types within the epithelium and the nature and range of ligands that activate each receptor type, finding sets of odorants that might be specifically damaged by PD or any other disease is unlikely. Finally, the quest is further confounded by attempting to compare results across studies using different tests with seemingly the same “odors”. 
Even if the qualitative “odor” from one test appears to be the same qualitative “odor” as that from another, different chemicals and combinations of chemicals can make up the same “odor”. In other words, different odorants or combinations of odorants often are being compared.

While the large number of subjects in the discovery cohort and use of multiple independent validation cohorts are strengths, this study has several limitations that are important to consider. Most patients involved in the study were not autopsy verified so that some of the PD subjects likely had non-Lewy body Parkinsonism and some of the controls may have had pre-motor PD or other conditions associated with olfactory dysfunction.^41,42,43 Similarly, in many cases, these were subjects with well-established PD and it is not clear that our results can be generalized to patients with early or de novo PD. Our analysis of individual items and novel combinations was retrospective using existing UPSIT data and, therefore, cannot account for item ordering or the effect of distractor choices that would be present if the proposed UPSIT subsets had been presented together as independent tests. Smoking history was not available for all subjects, but smoking has a relatively small impact on olfactory function, compared to factors such as age, sex or the presence of underlying neurologic disease, such as PD (ref. ⁴⁴). Indeed, age and sex are significant determinants of olfactory function such that optimal UPSIT cut-off scores can differ between men and women or among different age groups.⁴⁵ Similarly, we found that higher cut-off values were required for optimal sensitivity/specificity in women, reflecting generally better olfactory performance compared to men. The tests effectively distinguished between PD and controls in all age groups but performed best in subjects less than 74 years old. However, the influence of age and sex was similar using the full-length UPSIT or subsets of UPSIT items (Table 5).

Finally, this study examined multiple international cohorts for discovery and validation but only included subjects from the US and UK. Cultural factors influencing recognition of certain odors are known to affect performance on olfactory identification tests in other populations, possibly limiting generalizability of these results to other cultures.^20,38,39,40 Similarly, cultural heterogeneity between the discovery and validation cohorts could explain some of the variable performance of different subsets of UPSIT items between the cohorts (Fig. 1).

While our results, along with those of earlier studies, argue against selective anosmia or hyposmia in PD, they do suggest that shorter versions of the UPSIT or Sniffin’ Sticks retain much of the discriminatory power of the full tests for detecting olfactory dysfunction in PD. The decision to employ a short or long test for a given clinical or research purpose depends on a number of factors, including the setting of the administration, proposed indication, and pre-test probability of PD in the population studied. As discussed in detail, shorter tests may maintain suitable test characteristics for a binary outcome (diagnosis). However, longer tests are more sensitive to subtle alterations in function and allow for distinctions between degrees of dysfunction, which can be critical for counseling patients regarding prognosis, including patients with non-neurodegenerative disorders such as head trauma.⁴⁶ Longer tests also allow for the detection of malingering on the basis of improbable forced-choice responding,⁴⁷ which cannot be discerned from shorter tests, and are clearly more reliable than shorter tests.⁴⁸ We found that decreasing even the most discriminatory set of items to fewer than 11 odors resulted in steadily decreasing test performance. This can have an impact when small samples are being tested or when individual patients or subjects are being assessed. It must be kept in mind, of course, that while olfactory testing can be a very sensitive aid in diagnosing PD, e.g., in differentiating PD from progressive supranuclear palsy and essential tremor, it is not specific to PD.¹⁰
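The point about improbable forced-choice responding can be illustrated with a binomial calculation. The sketch below assumes a subject with no sense of smell guesses uniformly among the four response alternatives on every item; the score thresholds shown are illustrative only and are not the criteria used in ref. ⁴⁷. The function name is hypothetical.

```python
from math import comb


def prob_score_at_most(k: int, n_items: int = 40, n_choices: int = 4) -> float:
    """P(score <= k) when every answer is a uniform random guess among
    `n_choices` alternatives (binomial model, guessing assumed independent)."""
    p = 1.0 / n_choices
    return sum(
        comb(n_items, i) * p**i * (1 - p) ** (n_items - i) for i in range(k + 1)
    )


# Full 40-item test: how unlikely are very low scores under pure guessing?
for k in (0, 5, 10):
    print(f"40 items, P(score <= {k}): {prob_score_at_most(k):.4g}")

# Hypothetical 12-item test: even a score of zero is not very improbable
print(f"12 items, P(score = 0): {prob_score_at_most(0, n_items=12):.4g}")
```

Under this model a zero score on 40 items has a probability of roughly 1 in 100,000, whereas a zero score on 12 items occurs by chance about 3% of the time, which is one way to see why deliberate avoidance of correct answers is much harder to establish with a short test.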

Our finding that 12-item UPSIT subsets performed better than the full 40-item test in a discovery cohort but not in independent replication cohorts has several practical implications for the use of olfactory tests for PD. First, 12-item tests are sufficient and may save time and cost compared to the full UPSIT. Second, attempts to discover new “PD-specific” odor sets may be ill-advised as they can be defined by chance in any cohort but are unlikely to generalize to the broader PD population. Further, we found that AUC, sensitivity and specificity declined as items were removed from the 12-item subsets, suggesting that significantly shorter tests would lack sufficient diagnostic utility. Additionally, any such shorter or “PD-specific” test would lack normative data for categorizing individual patients and would need prospective validation in new cohorts. Overall, the balance of evidence suggests that shorter versions of the UPSIT, particularly the currently available B-SIT-B, should be employed with confidence to allow decreased time of administration and cost of olfactory assessment in a variety of clinical and research applications for the evaluation of Parkinsonism.

Methods

Subjects and olfactory assessment

For the initial (discovery cohort) phase of the UPSIT item analysis, we examined individual UPSIT item results from a convenience sample of PD patients (N = 314) and age-matched controls (N = 314) that had been administered in several protocols at the Michael J. Crescenz VA Medical Center in Philadelphia and the University of Pennsylvania. The mean (SD) age in each group was 67.4 (10.0) years and each group was 83% male and 94% Caucasian. Among PD patients, the median (interquartile range) Hoehn and Yahr stage and mean (SD) UPDRS motor scores were 2 (2–3) and 22 (10.1), respectively. In an attempt to validate the performance of putative PD-specific UPSIT subsets, we used individual item data from two independent validation cohorts of PD patients and control subjects derived at University College, London (UCL Cohort) and Barts & The London School of Medicine and Dentistry (Barts Cohort). The Barts cohort was comprised of 176 PD patients with a mean age of 60 (9.8) years and 177 control subjects with a mean age of 62 (10.7) years (p = 0.15). Subjects in the Penn cohort were only 6% non-white. Race data were not collected for all of the Barts/UCL subjects but they were largely drawn from the Oldchurch/Queens and UCL hospital patients. The vast majority were middle-class Caucasian British. There were 167 PD subjects (mean age = 63 (9.9)) and 130 controls (mean age = 65 (9.5)) in the UCL cohort. Most subjects were screened extensively for nasal disease. However, some subjects, particularly controls that were tested in community settings such as malls or state fairs, did not undergo rigorous screening, though subjects with clear active rhinitis of any etiology were not included. All studies from which UPSIT results were analyzed were approved by Institutional Review Boards (IRB) at Crescenz VA Medical Center, University of Pennsylvania, Barts and The London School of Medicine and Dentistry and University College London. Methods were performed in accordance with relevant regulations and guidelines. Informed consent was obtained from patients before participation in protocols.

Statistical analysis

Individual responses to each of the 40 items were recorded as correct or incorrect. Discriminatory power of individual odors to differentiate between PD patients and control subjects was tested using several statistical approaches. First, individual odors were ranked by the difference between the percentages of PD patients versus controls answering incorrectly (Difference). A complementary approach ordered odors by odds ratio of PD versus controls grouping for each item (Odds Ratio). The third method used discriminant function analysis, a method based on ANOVA that generates models incorporating all items into one or more weighted functions to come up with two sets, one that best discriminated PD versus controls (Discriminant) and one that least discriminated PD versus controls (Worst).⁴⁹ We also used logistic regression to identify items that best explained variation in outcome using diagnosis of PD versus controls as the dependent variable and ranking individual odors by the associated beta-coefficient (Regression). Finally, we generated a fifth list (Combined) using a weighted matrix by taking the top-12 items identified by each of the four methods (see Table 2), assigning 12 points for highest rank, 11 for second, 10 for third, etc. and summing the score for each item, in an attempt to capture items identified in common with the different statistical approaches. A random list of twelve items (Random) was assembled by using a random number generator taking integers 1–40 and using the first 12 corresponding UPSIT items based on their order of presentation during the full test. Twelve odor subsets were chosen to facilitate comparison with several commercially available tests also containing 12 items (Table 1).
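The ranking steps described above can be sketched in a few lines. This is a minimal illustration, not the authors' SPSS procedure: it covers only the Difference, Odds Ratio, Combined and Random lists (the discriminant and regression rankings are omitted), the item counts are invented toy values, the 0.5 continuity correction in the odds ratio and the alphabetical tie-break are assumptions, and all function names are hypothetical.

```python
import random
from collections import defaultdict

# Toy data (invented): subjects out of 100 per group answering each item incorrectly
N_PD = N_CTRL = 100
pd_wrong = {"smoke": 60, "soap": 55, "licorice": 70, "bubblegum": 50, "pizza": 30}
ctrl_wrong = {"smoke": 20, "soap": 25, "licorice": 20, "bubblegum": 30, "pizza": 25}


def rank_by_difference(pd_wrong, ctrl_wrong, n_pd, n_ctrl):
    """Order items by (% PD incorrect) minus (% controls incorrect), largest first."""
    diff = {i: pd_wrong[i] / n_pd - ctrl_wrong[i] / n_ctrl for i in pd_wrong}
    return sorted(diff, key=diff.get, reverse=True)


def rank_by_odds_ratio(pd_wrong, ctrl_wrong, n_pd, n_ctrl):
    """Order items by the odds ratio of an incorrect answer in PD versus controls."""
    def odds_ratio(item):
        # 0.5 is added to every cell to avoid division by zero (an assumption)
        a = pd_wrong[item] + 0.5
        b = n_pd - pd_wrong[item] + 0.5
        c = ctrl_wrong[item] + 0.5
        d = n_ctrl - ctrl_wrong[item] + 0.5
        return (a * d) / (b * c)
    return sorted(pd_wrong, key=odds_ratio, reverse=True)


def combined_list(rankings, top_n=12):
    """Weighted matrix: top_n points for rank 1, top_n - 1 for rank 2, and so on,
    summed over all rankings; the top_n highest totals form the Combined list."""
    scores = defaultdict(int)
    for ranking in rankings:
        for position, item in enumerate(ranking[:top_n]):
            scores[item] += top_n - position
    # ties broken alphabetically (assumption; the text does not specify)
    return sorted(scores, key=lambda item: (-scores[item], item))[:top_n]


def random_item_numbers(n_items=40, k=12, seed=None):
    """Draw k distinct item positions from 1..n_items for the Random list."""
    return random.Random(seed).sample(range(1, n_items + 1), k)


diff_rank = rank_by_difference(pd_wrong, ctrl_wrong, N_PD, N_CTRL)
or_rank = rank_by_odds_ratio(pd_wrong, ctrl_wrong, N_PD, N_CTRL)
combined = combined_list([diff_rank, or_rank], top_n=3)  # top 3 of 5 toy items
print(combined)
```

With the toy counts, both rankings agree, so the Combined list simply reproduces their shared top three. With real data the methods can disagree, and the point system rewards items that rank highly under several methods.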
Test characteristics including sensitivity (number of PD subjects scoring below the cut-off value/total number of PD subjects), specificity (number of PD subjects scoring below the cut-off value/(number of PD + control subjects scoring below the cut-off value)) and area under the receiver-operator characteristic curve (AUC) were calculated for candidate subsets. Cut-offs for point sensitivities and specificities were chosen to maximize the sum of both values.
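A minimal sketch of these test characteristics follows. It is not the authors' SPSS analysis. Sensitivity is computed as defined above; specificity is computed with the conventional definition (controls scoring at or above the cut-off divided by all controls), which is an assumption about the intended quantity. The scores are invented toy values for a hypothetical 12-item subset, and the function names are hypothetical.

```python
def sensitivity_specificity(pd_scores, ctrl_scores, cutoff):
    """A score below `cutoff` counts as a positive (hyposmic) result."""
    sens = sum(s < cutoff for s in pd_scores) / len(pd_scores)
    spec = sum(s >= cutoff for s in ctrl_scores) / len(ctrl_scores)
    return sens, spec


def best_cutoff(pd_scores, ctrl_scores, max_score):
    """Cut-off that maximizes sensitivity + specificity (first one wins on ties)."""
    candidates = range(0, max_score + 2)
    return max(
        candidates,
        key=lambda c: sum(sensitivity_specificity(pd_scores, ctrl_scores, c)),
    )


def auc(pd_scores, ctrl_scores):
    """Area under the ROC curve, computed as the probability that a randomly
    chosen control outscores a randomly chosen PD subject (ties count half)."""
    wins = 0.0
    for p in pd_scores:
        for c in ctrl_scores:
            if c > p:
                wins += 1.0
            elif c == p:
                wins += 0.5
    return wins / (len(pd_scores) * len(ctrl_scores))


# Toy scores out of 12 (invented values)
pd_scores = [3, 4, 5, 6, 6, 10]
ctrl_scores = [7, 9, 10, 11, 11, 12]

cutoff = best_cutoff(pd_scores, ctrl_scores, max_score=12)
sens, spec = sensitivity_specificity(pd_scores, ctrl_scores, cutoff)
print(f"cut-off < {cutoff}: sensitivity = {sens:.2f}, specificity = {spec:.2f}")
print(f"AUC = {auc(pd_scores, ctrl_scores):.3f}")
```

Maximizing the sum of sensitivity and specificity is equivalent to maximizing Youden's index. Because the cut-off is tuned on the same data used to report performance, values obtained this way tend to be optimistic, which is one reason validation in independent cohorts matters.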

Statistical analyses were conducted using SPSS version 17.0 (SPSS Inc.; Chicago, IL). All statistical tests were two-sided and significance was set at the 0.05 level. Data that support the findings of this study are available from the corresponding author upon request.

Disclaimer

The views expressed in this article are those of the authors and do not necessarily reflect the position or policy of the Department of Veterans Affairs or the United States government.

References

1. Haugen, J. et al. Prevalence of impaired odor identification in Parkinson disease with imaging evidence of nigrostriatal denervation. J. Neural Transm. 123, 421–424 (2016).
2. McKinnon, J. et al. Olfaction in the elderly: a cross-sectional analysis comparing Parkinson's disease with controls and other disorders. Int. J. Neurosci. 120, 36–39 (2010).
3. Haehner, A. et al. Prevalence of smell loss in Parkinson's disease--a multicenter study. Park. Relat. Disord. 15, 490–494 (2009).
4. Doty, R. L. Olfactory dysfunction in Parkinson disease. Nat. Rev. Neurol. 8, 329–339 (2012).
5. Ansari, K. A. & Johnson, A. Olfactory function in patients with Parkinson's disease. J. Chronic Dis. 28, 493–497 (1975).
6. Doty, R. L., Deems, D. A. & Stellar, S. Olfactory dysfunction in parkinsonism: a general deficit unrelated to neurologic signs, disease stage, or disease duration. Neurology 38, 1237–1244 (1988).
7. Ponsen, M. M. et al. Idiopathic hyposmia as a preclinical sign of Parkinson's disease. Ann. Neurol. 56, 173–181 (2004).
8. Braak, H. et al. Staging of brain pathology related to sporadic Parkinson's disease. Neurobiol. Aging 24, 197–211 (2003).
9. Ponsen, M. M. et al. Idiopathic hyposmia as a preclinical sign of Parkinson's disease. Ann. Neurol. 56, 173–181 (2004).
10. Doty, R. L. Olfactory dysfunction in neurodegenerative diseases: is there a common pathological substrate? Lancet Neurol. 16, 478–488 (2017).
11. Morley, J. F. & Duda, J. E. Olfaction as a biomarker in Parkinson's disease. Biomark. Med. 4, 661–670 (2010).
12. Doty, R. L. Office procedures for quantitative assessment of olfactory function. Am. J. Rhinol. 21, 460–473 (2007).
13. Boesveldt, S. et al. A comparative study of odor identification and odor discrimination deficits in Parkinson's disease. Mov. Disord. 23, 1984–1990 (2008).
14. Landis, B. N. et al. Retronasal olfactory function in Parkinson's disease. Laryngoscope 119, 2280–2283 (2009).
15. Doty, R. L., Hawkes, C. H., Good, K. P. & Duda, J. E. in Handbook of Olfaction and Gustation (ed Doty, R. L.) Vol. 403 (Wiley, Indianapolis, IN, USA, 2015).
16. Doty, R. L., Shaman, P. & Dann, M. Development of the University of Pennsylvania Smell Identification Test: a standardized microencapsulated test of olfactory function. Physiol. Behav. 32, 489–502 (1984).
17. Doty, R. L. Olfaction in Parkinson's disease and related disorders. Neurobiol. Dis. 46, 527–552 (2012).
18. Hummel, T., Sekinger, B., Wolf, S. R., Pauli, E. & Kobal, G. 'Sniffin' sticks': olfactory performance assessed by the combined testing of odor identification, odor discrimination and olfactory threshold. Chem. Senses 22, 39–52 (1997).
19. Doty, R. L., Marcus, A. & Lee, W. W. Development of the 12-item Cross-Cultural Smell Identification Test (CC-SIT). Laryngoscope 106, 353–356 (1996).
20. Rodriguez-Violante, M. et al. Comparing the accuracy of different smell identification tests in Parkinson's disease: relevance of cultural aspects. Clin. Neurol. Neurosurg. 123, 9–14 (2014).
21. Roberts, R. O. et al. Association between olfactory dysfunction and amnestic mild cognitive impairment and Alzheimer disease dementia. JAMA Neurol. 73, 93–101 (2016).
22. Devanand, D. P. et al. Olfactory deficits predict cognitive decline and Alzheimer dementia in an urban community. Neurology 84, 182–189 (2015).
23. Ross, G. W. et al. Association of olfactory dysfunction with risk for future Parkinson's disease. Ann. Neurol. 63, 167–173 (2008).
24. Tabert, M. H. et al. A 10-item smell identification scale related to risk for Alzheimer's disease. Ann. Neurol. 58, 155–160 (2005).
25. Hawkes, C. H., Shephard, B. C. & Daniel, S. E. Olfactory dysfunction in Parkinson's disease. J. Neurol. Neurosurg. Psychiatry 62, 436–446 (1997).
26. Solomon, G. S., Petrie, W. M., Hart, J. R. & Brackin, H. B. Jr. Olfactory dysfunction discriminates Alzheimer's dementia from major depression. J. Neuropsychiatry Clin. Neurosci. 10, 64–67 (1998).
27. Rawal, S., Hoffman, H. J., Honda, M., Huedo-Medin, T. B. & Duffy, V. B. The taste and smell protocol in the 2011-2014 US National Health and Nutrition Examination Survey (NHANES): test-retest reliability and validity testing. Chemosens. Percept. 8, 138–148 (2015).
28. Jackman, A. H. & Doty, R. L. Utility of a three-item smell identification test in detecting olfactory dysfunction. Laryngoscope 115, 2209–2212 (2005).
29. Mahlknecht, P. et al. Optimizing odor identification testing as quick and accurate diagnostic tool for Parkinson's disease. Mov. Disord. 31, 1408–1413 (2016).
30. Hummel, T., Pfetzing, U. & Lotsch, J. A short olfactory test based on the identification of three odors. J. Neurol. 257, 1316–1321 (2010).
31. Shiga, H. et al. Combinations of two odorants of smell identification test for screening of olfactory impairment. Auris Nasus Larynx 41, 523–527 (2014).
32. Hashimoto, Y. et al. Usefulness of the odor stick identification test for Japanese patients with olfactory dysfunction. Chem. Senses 29, 565–571 (2004).
33. Double, K. L. et al. Identifying the pattern of olfactory deficits in Parkinson disease using the brief smell identification test. Arch. Neurol. 60, 545–549 (2003).
34. Daniel, S. E. & Hawkes, C. H. Preliminary diagnosis of Parkinson's disease by olfactory bulb pathology. Lancet 340, 186 (1992).
35. Bohnen, N. I. et al. Selective hyposmia and nigrostriatal dopaminergic denervation in Parkinson's disease. J. Neurol. 254, 84–90 (2007).
Article CAS PubMed Google Scholar
Casjens, S. et al. Diagnostic value of the impairment of olfaction in Parkinson's disease. PLoS. One 8, e64735 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hahner, A. et al. Selective hyposmia in Parkinson's disease? J. Neurol. 260, 3158–3160 (2013).
Article PubMed Google Scholar
Altundag, A. et al. Cross-culturally modified University of Pennsylvania smell identification test for a Turkish population. Am. J. Rhinol. Allergy 29, e138–41 (2015).
Article PubMed Google Scholar
Fornazieri, M. A. et al. Development of normative data for the Brazilian adaptation of the University of Pennsylvania smell identification test. Chem. Senses 40, 141–149 (2015).
Article CAS PubMed Google Scholar
Ribeiro, J. C. et al. Cultural adaptation of the portuguese version of the "sniffin' sticks" smell test: reliability, validity, and normative data. PLoS. One 11, e0148937 (2016).
Article PubMed PubMed Central Google Scholar
Hughes, A. J., Daniel, S. E., Kilford, L. & Lees, A. J. Accuracy of clinical diagnosis of idiopathic Parkinson's disease: a clinico-pathological study of 100 cases. J. Neurol. Neurosurg. Psychiatry 55, 181–184 (1992).
Article CAS PubMed PubMed Central Google Scholar
Mackay-Sim, A., Johnston, A. N., Owen, C. & Burne, T. H. Olfactory ability in the healthy population: reassessing presbyosmia. Chem. Senses 31, 763–771 (2006).
Article PubMed Google Scholar
Berg, D. et al. MDS research criteria for prodromal Parkinson's disease. Mov. Disord. 30, 1600–1611 (2015).
Article PubMed Google Scholar
Moberg, P. J. et al. Olfactory dysfunction in schizophrenia: a qualitative and quantitative review. Neuropsychopharmacology 21, 325–340 (1999).
Article CAS PubMed Google Scholar
Doty, R. L., Bromley, S. M. & Stern, M. B. Olfactory testing as an aid in the diagnosis of Parkinson's disease: development of optimal discrimination criteria. Neurodegeneration 4, 93–97 (1995).
Article CAS PubMed Google Scholar
London, B. et al. Predictors of prognosis in patients with olfactory disturbance. Ann. Neurol. 63, 159–166 (2008).
Article PubMed Google Scholar
Doty, R. L. & Crastnopol, B. Correlates of chemosensory malingering. Laryngoscope 120, 707–711 (2010).
Article PubMed Google Scholar
Doty, R. L., McKeown, D. A., Lee, W. W. & Shaman, P. A study of the test-retest reliability of ten olfactory tests. Chem. Senses 20, 645–656 (1995).
Article CAS PubMed Google Scholar
Mayer, M., Wilkinson, I., Heikkinen, R., Orntoft, T. & Magid, E. Improved laboratory test selection and enhanced perception of test results as tools for cost-effective medicine. Clin. Chem. Lab. Med. 36, 683–690 (1998).
Article CAS PubMed Google Scholar
Doty, R. L., Shaman, P., Kimmelman, C. P. & Dann, M. S. University of Pennsylvania smell identification test: a rapid quantitative olfactory function test for the clinic. Laryngoscope 94, 176–178 (1984).
Article CAS PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Parkinson’s Disease Research, Education and Clinical Center, Corporal Michael J. Crescenz VA Medical Center, Philadelphia, PA, USA
James F. Morley, Julie P. Shtraks, Daniel Weintraub & John E. Duda
Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
James F. Morley, Daniel Weintraub & John E. Duda
Center for Clinical Epidemiology and Biostatistics (CCEB), Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Abigail Cohen
Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Daniel Weintraub
Smell and Taste Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Richard L. Doty
UCL Institute of Neurology, London, UK
Laura Silveira-Moriyama & Andrew J. Lees
Department of Medicine, Monash University, Melbourne, VIC, Australia
David R. Williams
Karl Landsteiner Institute for Neuroimmunological and Neurodegenerative Disorders, Medical University of Vienna, Vienna, Austria
Regina Katzenschlager
Barts and the London School of Medicine and Dentistry, London, UK
Christopher Hawkes

Authors

James F. Morley
Abigail Cohen
Laura Silveira-Moriyama
Andrew J. Lees
David R. Williams
Regina Katzenschlager
Christopher Hawkes
Julie P. Shtraks
Daniel Weintraub
Richard L. Doty
John E. Duda

Contributions

J.F.M.: conception and design, statistical analysis, drafting of the manuscript and revision, Guarantor. A.C.: design and execution of statistical analysis, revision of the manuscript. L.S.M.: data acquisition, statistical analysis, revision of the manuscript. A.J.L.: data acquisition, revision of the manuscript. D.R.W.: data acquisition, revision of the manuscript. R.K.: data acquisition, revision of the manuscript. J.A.: data acquisition, revision of the manuscript. D.W.: data acquisition, revision of the manuscript. C.H.: data acquisition, revision of the manuscript. R.L.D.: data acquisition, revision of the manuscript. J.E.D.: conception and design, revision of the manuscript.

Corresponding author

Correspondence to James F. Morley.

Ethics declarations

Competing interests

R.L.D. is President and major shareholder in Sensonics International, the manufacturer of smell and taste tests, some of which were assessed in this study. The remaining authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Morley, J.F., Cohen, A., Silveira-Moriyama, L. et al. Optimizing olfactory testing for the diagnosis of Parkinson’s disease: item analysis of the university of Pennsylvania smell identification test. npj Parkinson's Disease 4, 2 (2018). https://doi.org/10.1038/s41531-017-0039-8

Received: 10 August 2017
Revised: 08 December 2017
Accepted: 13 December 2017
Published: 15 January 2018
DOI: https://doi.org/10.1038/s41531-017-0039-8
