Detecting neurodegenerative disorders from web search signals

White, Ryen W.; Doraiswamy, P. Murali; Horvitz, Eric

doi:10.1038/s41746-018-0016-6

Download PDF

Brief Communication
Open access
Published: 23 April 2018

Detecting neurodegenerative disorders from web search signals

Ryen W. White¹,
P. Murali Doraiswamy² &
Eric Horvitz³

npj Digital Medicine volume 1, Article number: 8 (2018) Cite this article

5478 Accesses
35 Citations
101 Altmetric
Metrics details

Subjects

Abstract

Neurodegenerative disorders, such as Parkinson’s disease (PD) and Alzheimer’s disease (AD), are important public health problems warranting early detection. We trained machine-learned classifiers on the longitudinal search logs of 31,321,773 search engine users to automatically detect neurodegenerative disorders. Several digital phenotypes with high discriminatory weights for detecting these disorders are identified. Classifier sensitivities for PD detection are 94.2/83.1/42.0/34.6% at false positive rates (FPRs) of 20/10/1/0.1%, respectively. Preliminary analysis shows similar performance for AD detection. Subject to further refinement of accuracy and reproducibility, these findings show the promise of web search digital phenotypes as adjunctive screening tools for neurodegenerative disorders.

Disease progression strikingly differs in research and real-world Parkinson’s populations

Article Open access 13 March 2024

Crowdsourcing digital health measures to predict Parkinson’s disease severity: the Parkinson’s Disease Digital Biomarker DREAM Challenge

Article Open access 19 March 2021

Fox Insight collects online, longitudinal patient-reported outcomes and genetic data on Parkinson’s disease

Article Open access 24 February 2020

Introduction

Neurodegenerative disorders (NDs) are prevalent and a major source of healthcare expenditure.¹ NDs progress slowly,² and their symptoms may be subtle and mistaken for more common conditions.^{3, 4} Early detection of NDs enables earlier intervention, which can slow their progression. This study examines the use of digital phenotypes⁵ for detecting NDs, operationalized as patterns of search activity gathered during engagement with web search engines. Methods based on these observational data show promise in offering new pathways for the early detection of brain disease.

Prior studies with large-scale logs of the search activity of millions of people have highlighted opportunities for detection of cancer^{6, 7} and for disease surveillance.^{8, 9} This study investigates how analyses of longitudinal log data from search engines might help detect evidence of Parkinson’s disease (PD), a common progressive ND affecting some 7–10 million people worldwide. Dopaminergic deficiency in PD results in symptoms such as tremors and cognitive decline,¹⁰ evidence of which may be apparent in search log signals. PD is challenging to diagnose: the current accuracy of clinical diagnosis of probable PD for patients presenting with motor symptoms in primary care settings is around 80%, with limited improvements in the past 25 years, especially at early disease stages.¹¹ Hence, there is a need for a simple scalable test that can be used for screening in the community or at home. This work also explores whether classifiers using search log signals can help with diagnostic challenges in PD, specifically distinguishing early PD from essential tremor (ET).^{3, 4}

This study uses a total of 18 months of deidentified logs of United States search activity from the Microsoft Bing web search engine, comprising millions of English-speaking searchers from September 2015 to February 2017 inclusive. These data are routinely collected for improving search results and permitted through Bing’s Terms of Service. A range of observational features were computed per searcher over the duration of the logs: (1) Symptom: presence of PD symptom-related query terms (including synonyms) derived from published literature; (2) Motor: motor symptoms such as cursor movements, including speed, direction changes, tremors (defined as horizontal or vertical oscillations in cursor position up to 20 pixels in each direction), and vertical scrolling. Cursor position data were sampled while the cursor was in motion; (3) Repetition: presence of repeat queries, repeat result clicks, and repeat query-result click pairs, and (4) Risk Factors: presence of risk factors derived from previous work (e.g.,^12,13,14). These included age and gender (inferred using proprietary Bing classifiers), and head trauma, toxin exposure, and familial factors based on terminology appearing in query text. For the Motor class, feature values are first computed per query instance and then averaged across all query instances for the searcher. Some features align with criteria used by physicians (e.g., tremors)^{10, 15} while others are more difficult to measure in clinical practice (e.g., memory loss).¹⁶

From the full set of logs, searchers who input queries containing first-person statements about PD diagnosis (e.g., “just diagnosed with parkinsons”) were identified. These experiential diagnostic queries are used as evidence of receiving a PD diagnosis. Cases exhibiting evidence that diagnostic queries were for others (e.g., father, spouse, etc.) were excluded. Multiple additive regression trees (MART) classifiers¹⁷ were trained to detect evidence of PD diagnosis from all PD symptom searchers. Advantages of MART include model interpretability, facility for rapid training and testing, and robustness against noisy labels and missing values. There were 703 positive cases, of searchers who queried for symptoms and issued at least one experiential diagnostic query (30.8% of the experiential diagnostic searchers), and 31,321,070 negative cases, of searchers who only issued queries on PD symptoms. The data were used in classifier training as is. The application of sampling methods to correct for class imbalance is left to future work. Since NDs progress slowly² and the observation window is limited to 18 months, the classification task likely identifies the existence of PD rather than forecasting a future diagnosis.

Ten-fold cross validation was used to train and test the classifier. It predicts the input of an experiential diagnostic query for PD with strong performance (area under the receiver operating characteristic curve [AUROC] = 0.9357) using 18 months of search log data. AUROC drops to 0.8626 with 12 months of data, and 0.8151 with 6 months of data. Since false positives can generate unnecessary alarm and additional healthcare utilization in fielded uses (e.g., at population-scale in search engines), low false positive rates (FPRs) are desirable. Classifier sensitivities at FPR = 20/10/1/0.1% are 94.2/83.1/42.0/34.6%, respectively. The results offer evidence that the existence of NDs in searchers is detectable from streams of data from the use of search engines over time. Table 1 shows the list of observational features with non-zero discriminatory weights in the learned classifier. Features related to tremors—both from search terms (e.g., “hands shaking”) and from mouse cursor movements (e.g., estimated rate of cursor position oscillation), repeat queries, and repeat search-result clicks, and the inferred age and gender of searchers, had highest discriminatory weights.

Table 1 Features used in PD classifier, ranked by discriminative weight and scored with respect to the top-ranked feature: TimeBetweenRepeatQueries. Features are computed over all queries for each searcher. Features from the Motor class are first computed for each query instance and then averaged across all query instances for that searcher

Full size table

Tremors have many explanations, including ET, which shares some symptoms with PD. Distinguishing between ET and early PD is important for tremor sufferers.³ Focusing on those who searched for tremors (n = 4,262,953), a MART classifier was trained to distinguish PD (n = 309) and ET (n = 307). Figure 1 shows the ROC curve illustrating strong classifier performance (AUROC = 0.9205) using all features available to the classifier. Features related to scrolling, cursor direction changes, tremor frequencies, and query repetition were important. This is corroborated by ablation studies, where the largest drop in AUROC (23%, Z = 7.10, p < 0.001¹⁸) occurs when Motor features are excluded. Motor symptoms, including tremor frequencies, are also important in distinguishing ET and PD during clinical examinations.¹⁹

The classifiers learned from search query and motor interaction data show promise for developing new kinds of diagnostic tools for NDs. The periodic application of these methods may support the study of temporal dynamics in NDs for consenting searchers. They can also help discriminate between illnesses with similar symptoms, as shown with a case study of identifying searchers with experiential diagnostic queries for ET versus PD. The classifier leverages evidence unavailable to physicians (e.g., longitudinal query repetition, mouse cursor activity) that could aid in more traditional clinical diagnoses. Application of these classifiers could help screen for patients with higher ND likelihoods. Surfacing their predictions and confidence scores to physicians could offer additional evidence to help physicians discriminate between conditions. Identifying the specific digital phenotypes (e.g., estimated tremor frequencies) related to NDs that carry most weight for each patient may also have diagnostic utility. It is noted that while experiential diagnostic queries provide evidence of ND, definitive ground truth was unavailable in this study. Future work will expand this analysis to other NDs and perform prospective analyses with clinically diagnosed ND patients at different stages of illness to validate the diagnostic and prognostic utility of digital signals. Preliminary analysis shows that the methods in this study may scale to other NDs, such as Alzheimer’s disease (AUROC = 0.9135, classifier sensitivities at FPR = 20/10/1/0.1% are 91.0/81.5/38.8/26.1%, respectively). A recent study of keystroke typing patterns in verified PD patients²⁰ found similar results to those on PD presented herein. The findings of the two studies taken together support the promise of using digital phenotypes for early detection of PD.

Data availability statement

The data that support the findings of this study are available from Microsoft, but restrictions apply to the availability of these data. Data are however available from the authors upon reasonable request and with permission of Microsoft.

References

De Lau, L. M. & Breteler, M. M. Epidemiology of Parkinson’s disease. Lancet Neurol. 5, 525–535 (2006).
Article PubMed Google Scholar
DeKosky, S. T. & Marek, K. Looking backward to move forward: early detection of neurodegenerative disorders. Science 302, 830–834 (2003).
Article CAS PubMed Google Scholar
Meara, J. O., Bhowmick, B. K. & Hobson, P. E. Accuracy of diagnosis in patients with presumed Parkinson’s disease. Age Ageing 28, 99–102 (1999).
Article CAS PubMed Google Scholar
Hughes, A. J., Daniel, S. E., Kilford, L. & Lees, A. J. Accuracy of clinical diagnosis of idiopathic Parkinson’s disease: a clinico-pathological study of 100 cases. J. Neurol. Neurosurg. Psychiatry 55, 181–184 (1992).
Article CAS PubMed PubMed Central Google Scholar
Jain, S. H., Powers, B. W., Hawkins, J. B. & Brownstein, J. S. The digital phenotype. Nat. Biotechnol. 33, 462–463 (2015).
Article CAS PubMed Google Scholar
White, R. W. & Horvitz, E. Evaluation of the feasibility of screening patients for early signs of lung carcinoma in web search logs. JAMA Oncol. 3, 398–401 (2016).
Article Google Scholar
Paparrizos, J., White, R. W. & Horvitz, E. Screening for pancreatic adenocarcinoma using signals from web search logs: feasibility study and results. J. Oncol. Pract. 12, 737–744 (2016).
Article PubMed Google Scholar
Ginsberg, J. et al. Detecting influenza epidemics using search engine query data. Nature 457, 1012–1014 (2009).
Article CAS PubMed Google Scholar
Brownstein, J. S., Freifeld, C. C. & Madoff, L. C. Digital disease detection: harnessing the web for public health surveillance. N. Engl. J. Med. 360, 2153–2157 (2009).
Article CAS PubMed PubMed Central Google Scholar
Jankovic, J. Parkinson’s disease: clinical features and diagnosis. J. Neurol. Neurosurg. Psychiatry 79, 368–376 (2008).
Article CAS PubMed Google Scholar
Rizzo, G. et al. Accuracy of clinical diagnosis of Parkinson disease: a systematic review and meta-analysis. Neurology 86, 566–576 (2016).
Article PubMed Google Scholar
Bishop, N. A., Lu, T. & Yankner, B. A. Neural mechanisms of ageing and cognitive decline. Nature 464, 529 (2010).
Article CAS PubMed PubMed Central Google Scholar
Brown, R. C., Lockwood, A. H. & Sonawane, B. R. Neurodegenerative diseases: an overview of environmental risk factors. Environ. Health Perspect. 113, 1250 (2005).
Article CAS PubMed PubMed Central Google Scholar
Bertram, L. & Tanzi, R. E. The genetic epidemiology of neurodegenerative disease. J. Clin. Invest. 115, 1449 (2005).
Article CAS PubMed PubMed Central Google Scholar
Elble, R. J. Diagnostic criteria for essential tremor and differential diagnosis. Neurology 54, S2–S6 (2000).
Article CAS PubMed Google Scholar
Chaudhuri, K. R., Healy, D. G. & Schapira, A. H. Non-motor symptoms of Parkinson’s disease: diagnosis and management. Lancet Neurol. 5, 235–245 (2006).
Article PubMed Google Scholar
Friedman, J. H. Greedy function approximation: a gradient boosting machine. Ann. Stat. 29, 1189–1232 (2001).
Article Google Scholar
Hanley, J. A. & McNeil, B. J. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143, 29–36 (1982).
Article CAS PubMed Google Scholar
Thenganatt, M. A. & Louis, E. D. Distinguishing essential tremor from Parkinson’s disease: bedside tests and laboratory evaluations. Expert. Rev. Neurother. 12, 687–696 (2012).
Article CAS PubMed PubMed Central Google Scholar
Adams, W. R. High-accuracy detection of early Parkinson’s disease using multiple characteristics of finger movement while typing. PLoS. One 12, e0188226 (2017).
Article PubMed PubMed Central Google Scholar
Rodden, K., Fu, X., Aula, A. & Spiro, I. Eye-mouse coordination patterns on web search results pages. Proc. SIGCHI Ext. Abs. 2997–3002 (ACM, New York, NY, USA, 2008).

Download references

Author information

Authors and Affiliations

Microsoft, Bellevue, WA, 98004, USA
Ryen W. White
Duke University Health System and Duke Institute for Brain Sciences, Durham, NC, 27710, USA
P. Murali Doraiswamy
Microsoft, Redmond, WA, 98052, USA
Eric Horvitz

Authors

Ryen W. White
View author publications
You can also search for this author in PubMed Google Scholar
P. Murali Doraiswamy
View author publications
You can also search for this author in PubMed Google Scholar
Eric Horvitz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors designed the study and co-authored the manuscript. In addition, R.W.W. mined the logs, trained and tested the machine-learned models, and performed the statistical analysis.

Corresponding author

Correspondence to Ryen W. White.

Ethics declarations

Competing interests

P.M.D. has received grants and/or advisory fees from health and technology companies for other projects and owns stock in several companies whose products are not discussed here. R.W.W. and E.H. are employees of Microsoft Corporation and own stock in the company.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

White, R.W., Doraiswamy, P.M. & Horvitz, E. Detecting neurodegenerative disorders from web search signals. npj Digital Med 1, 8 (2018). https://doi.org/10.1038/s41746-018-0016-6

Download citation

Received: 02 October 2017
Revised: 02 January 2018
Accepted: 10 January 2018
Published: 23 April 2018
DOI: https://doi.org/10.1038/s41746-018-0016-6

This article is cited by

A scoping review of neurodegenerative manifestations in explainable digital phenotyping
- Hessa Alfalahi
- Sofia B. Dias
- Leontios J. Hadjileontiadis
npj Parkinson's Disease (2023)
What can a swiped word tell us more? Demographic and behavioral correlates from shape-writing text entry
- Désirée C. A. Lemarquis
- Bereket A. Yilma
- Luis A. Leiva
Neural Computing and Applications (2023)
Disability, fairness, and algorithmic bias in AI recruitment
- Nicholas Tilmes
Ethics and Information Technology (2022)
The motor prodromes of parkinson’s disease: from bedside observation to large-scale application
- C. Simonet
- A. Schrag
- A. J. Noyce
Journal of Neurology (2021)
Population-scale hand tremor analysis via anonymized mouse cursor signals
- Ryen W. White
- Eric Horvitz
npj Digital Medicine (2019)