A solution to the single-question crowd wisdom problem

Prelec, Dražen; Seung, H. Sebastian; McCoy, John

doi:10.1038/nature21054

Letter
Published: 26 January 2017

A solution to the single-question crowd wisdom problem

Dražen Prelec^1,2,3,
H. Sebastian Seung⁴ &
John McCoy³

Nature volume 541, pages 532–535 (2017)Cite this article

28k Accesses
160 Citations
515 Altmetric
Metrics details

Subjects

Abstract

Once considered provocative¹, the notion that the wisdom of the crowd is superior to any individual has become itself a piece of crowd wisdom, leading to speculation that online voting may soon put credentialed experts out of business^2,3. Recent applications include political and economic forecasting^4,5, evaluating nuclear safety⁶, public policy⁷, the quality of chemical probes⁸, and possible responses to a restless volcano⁹. Algorithms for extracting wisdom from the crowd are typically based on a democratic voting procedure. They are simple to apply and preserve the independence of personal judgment¹⁰. However, democratic methods have serious limitations. They are biased for shallow, lowest common denominator information, at the expense of novel or specialized knowledge that is not widely shared^11,12. Adjustments based on measuring confidence do not solve this problem reliably¹³. Here we propose the following alternative to a democratic vote: select the answer that is more popular than people predict. We show that this principle yields the best answer under reasonable assumptions about voter behaviour, while the standard ‘most popular’ or ‘most confident’ principles fail under exactly those same assumptions. Like traditional voting, the principle accepts unique problems, such as panel decisions about scientific or artistic merit, and legal or historical disputes. The potential application domain is thus broader than that covered by machine learning and psychometric methods, which require data across multiple questions^{14,15,16,17,18,19,20}.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Two example questions from Study 1c, described in text.**

**Figure 2: Why ‘surprisingly popular’ answers should be correct, illustrated by simple models of Philadelphia and Columbia questions with Bayesian respondents.**

**Figure 3: Selection of stimuli from Study 4 in which respondents judged the market price of 20th century artworks.**

**Figure 4: Results of aggregation algorithms on studies discussed in the text.**

**Figure 5: Logistic regressions showing the probability that an artwork is judged expensive (above $30,000) as function of actual market price.**

Improving microbial phylogeny with citizen science within a mass-market video game

Article Open access 15 April 2024

Causal machine learning for predicting treatment outcomes

Article 19 April 2024

Worldwide divergence of values

Article Open access 09 April 2024

References

Galton, F. Vox populi. Nature 75, 450–451 (1907)
Article ADS Google Scholar
Sunstein, C. Infotopia: How Many Minds Produce Knowledge (Oxford University Press, USA, 2006)
Surowiecki, J. The Wisdom of Crowds (Anchor, 2005)
Budescu, D. V. & Chen, E. Identifying expertise to extract the wisdom of crowds. Manage. Sci. 61, 267–280 (2014)
Google Scholar
Mellers, B. et al. Psychological strategies for winning a geopolitical forecasting tournament. Psychol. Sci. 25, 1106–1115 (2014)
Article Google Scholar
Cooke, R. M. & Goossens, L. L. TU Delft expert judgment data base. Reliab. Eng. Syst. Saf. 93, 657–674 (2008)
Article Google Scholar
Morgan, M. G. Use (and abuse) of expert elicitation in support of decision making for public policy. Proc. Natl Acad. Sci. USA 111, 7176–7184 (2014)
Article CAS ADS Google Scholar
Oprea, T. I. et al. A crowdsourcing evaluation of the NIH chemical probes. Nat. Chem. Biol. 5, 441–447 (2009)
Article CAS Google Scholar
Aspinall, W. A route to more tractable expert advice. Nature 463, 294–295 (2010)
Article CAS ADS Google Scholar
Lorenz, J., Rauhut, H., Schweitzer, F. & Helbing, D. How social influence can undermine the wisdom of crowd effect. Proc. Natl Acad. Sci. USA 108, 9020–9025 (2011)
Article CAS ADS Google Scholar
Chen, K., Fine, L. & Huberman, B. Eliminating public knowledge biases in information-aggregation mechanisms. Manage. Sci. 50, 983–994 (2004)
Article Google Scholar
Simmons, J. P., Nelson, L. D., Galak, J. & Frederick, S. Intuitive biases in choice versus estimation: implications for the wisdom of crowds. J. Consum. Res. 38, 1–15 (2011)
Article Google Scholar
Hertwig, R. Psychology. Tapping into the wisdom of the crowd–with confidence. Science 336, 303–304 (2012)
Article CAS ADS Google Scholar
Batchelder, W. & Romney, A. Test theory without an answer key. Psychometrika 53, 71–92 (1988)
Article MathSciNet Google Scholar
Lee, M. D., Steyvers, M., de Young, M. & Miller, B. Inferring expertise in knowledge and prediction ranking tasks. Top. Cogn. Sci. 4, 151–163 (2012)
Article Google Scholar
Yi, S. K., Steyvers, M., Lee, M. D. & Dry, M. J. The wisdom of the crowd in combinatorial problems. Cogn. Sci. 36, 452–470 (2012)
Article Google Scholar
Lee, M. D. & Danileiko, I. Using cognitive models to combine probability estimates. Judgm. Decis. Mak. 9, 259–273 (2014)
Google Scholar
Anders, R. & Batchelder, W. H. Cultural consensus theory for multiple consensus truths. J. Math. Psychol. 56, 452–469 (2012)
Article MathSciNet Google Scholar
Oravecz, Z., Anders, R. & Batchelder, W. H. Hierarchical Bayesian modeling for test theory without an answer key. Psychometrika 80, 341–364 (2015)
Article MathSciNet Google Scholar
Freund, Y. & Schapire, R. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55, 119–139 (1997)
Article MathSciNet Google Scholar
Goldstein, D. G. & Gigerenzer, G. Models of ecological rationality: the recognition heuristic. Psychol. Rev. 109, 75–90 (2002)
Article Google Scholar
Cooke, R. Experts in Uncertainty: Opinion and Subjective Probability in Science (Oxford University Press, USA, 1991)
Koriat, A. When are two heads better than one and why? Science 336, 360–362 (2012)
Article CAS ADS Google Scholar
Prelec, D. A Bayesian truth serum for subjective data. Science. 306, 462–466 (2004)
Article CAS ADS Google Scholar
John, L. K., Loewenstein, G. & Prelec, D. Measuring the prevalence of questionable research practices with incentives for truth telling. Psychol. Sci. 23, 524–532 (2012)
Article Google Scholar
Arrow, K. J. et al. Economics. The promise of prediction markets. Science 320, 877–878 (2008)
Article CAS Google Scholar
Lebreton, M., Abitbol, R., Daunizeau, J. & Pessiglione, M. Automatic integration of confidence in the brain valuation signal. Nat. Neurosci. 18, 1159–1167 (2015)
Article CAS Google Scholar

Download references

Acknowledgements

We thank M. Alam, A. Huang and D. Mijovic-Prelec for help with designing and conducting Study 3, and D. Suh with designing and conducting Study 4b. Supported by NSF SES-0519141, Institute for Advanced Study (Prelec), and Intelligence Advanced Research Projects Activity (IARPA) via the Department of Interior National Business Center contract number D11PC20058. The US Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright annotation thereon. The views and conclusions expressed herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of IARPA, DoI/NBC, or the US Government.

Author information

Authors and Affiliations

Sloan School of Management, Massachusetts Institute of Technology, Cambridge, 02139, Massachusetts, USA
Dražen Prelec
Department of Economics, Massachusetts Institute of Technology, Cambridge, 02139, Massachusetts, USA
Dražen Prelec
Department of Brain & Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, 02139, Massachusetts, USA
Dražen Prelec & John McCoy
Princeton Neuroscience Institute and Computer Science Department, Princeton University, Princeton, 08544, New Jersey, USA
H. Sebastian Seung

Authors

Dražen Prelec
View author publications
You can also search for this author in PubMed Google Scholar
H. Sebastian Seung
View author publications
You can also search for this author in PubMed Google Scholar
John McCoy
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed extensively to the work presented in this paper.

Corresponding author

Correspondence to Dražen Prelec.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Reviewer Information Nature thanks A. Baillon, D. Helbing and the other anonymous reviewer(s) for their contribution to the peer review of this work.

Extended data figures and tables

Extended Data Figure 1 Performance of all methods across all studies, shown with respect to the Matthews correlation coefficient.

Error bars are bootstrapped standard errors. Details of studies are given in Fig. 4 of the main text.

Extended Data Figure 2 Performance of all methods across all studies, shown with respect to the macro-averaged F1 score.

Error bars are bootstrapped standard errors. Details of studies are given in Fig. 4 of the main text.

Extended Data Figure 3 Performance of all methods across all studies, shown with respect to percentage of questions correct.

Error bars are bootstrapped standard errors. Details of studies are given in Fig. 4 of the main text.

Extended Data Figure 4 Performance of aggregation methods on simulated datasets of binary questions, under uniform sampling assumptions.

One draws a pair of coin biases (that is, signal distribution parameters), and a prior over worlds, each from independent uniform distributions. Combinations of coin biases and prior that result in recipients of both coin tosses voting for the same answer are discarded. An actual coin is sampled according to the prior, and tossed a finite number of times to produce the votes, confidences, and vote predictions required by different methods (see Supplementary Information for simulation details). As well as showing how sample size affects different aggregation methods the simulations also show that majorities become more reliable as consensus increases. A majority of 90% is correct about 90% of the time, while a majority of 55% is not much better than chance. This is not due to sampling error, but reflects the structure of the model and simulation assumptions. According to the model, an answer with x% endorsements is incorrect if counterfactual endorsements for that answer exceed x% (Theorem 2), and the chance of sampling such a problem diminishes with x.

Supplementary information

Supplementary Information

This file contains Supplementary Text and Data sections 1-3 – see contents page for details. (PDF 207 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

PowerPoint slide for Fig. 4

PowerPoint slide for Fig. 5

Rights and permissions

Reprints and permissions

About this article

Cite this article

Prelec, D., Seung, H. & McCoy, J. A solution to the single-question crowd wisdom problem. Nature 541, 532–535 (2017). https://doi.org/10.1038/nature21054

Download citation

Received: 04 September 2016
Accepted: 09 December 2016
Published: 26 January 2017
Issue Date: 26 January 2017
DOI: https://doi.org/10.1038/nature21054

This article is cited by

Exploiting Meta-cognitive Features for a Machine-Learning-Based One-Shot Group-Decision Aggregation
- Hilla Shinitzky
- Dan Avraham
- Yuval Shahar
Group Decision and Negotiation (2024)
Where’s Waldo, Ohio? Using Cognitive Models to Improve the Aggregation of Spatial Knowledge
- Lauren E. Montgomery
- Charles M. Baldini
- Michael D. Lee
Computational Brain & Behavior (2024)
On an effective and efficient method for exploiting the wisdom of the inner crowd
- Itsuki Fujisaki
- Kunhao Yang
- Kazuhiro Ueda
Scientific Reports (2023)
Machine truth serum: a surprisingly popular approach to improving ensemble methods
- Tianyi Luo
- Yang Liu
Machine Learning (2023)
Experimental Philosophy and the Incentivisation Challenge: a Proposed Application of the Bayesian Truth Serum
- Philipp Schoenegger
Review of Philosophy and Psychology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Access options

Similar content being viewed by others

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Extended data figures and tables

Supplementary information

PowerPoint slides

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links