Accelerating evidence-informed decision-making for the Sustainable Development Goals using machine learning

Porciello, Jaron; Ivanina, Maryia; Islam, Maidul; Einarson, Stefan; Hirsh, Haym

doi:10.1038/s42256-020-00235-5

Perspective
Published: 12 October 2020

Accelerating evidence-informed decision-making for the Sustainable Development Goals using machine learning

Nature Machine Intelligence volume 2, pages 559–565 (2020)Cite this article

7937 Accesses
29 Citations
93 Altmetric
Metrics details

Subjects

Abstract

The United Nations Sustainable Development Goal 2 (SDG 2) is to achieve zero hunger by 2030. We have designed Persephone, a machine learning model, to support a diverse volunteer network of 77 researchers from 23 countries engaged in creating interdisciplinary evidence syntheses in support of SDG 2. Such evidence syntheses, whatever the specific topic, assess original studies to determine the effectiveness of interventions. By gathering and summarizing current evidence and providing objective recommendations they can be valuable aids to decision-makers. However, they are time-consuming; estimates range from 18 months to three years to produce a single review. Persephone analysed 500,000 unstructured text summaries from prominent sources of agricultural research, determining with 90% accuracy the subset of studies that would eventually be selected by expert researchers. We demonstrate that machine learning models can be invaluable in placing evidence into the hands of policymakers.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Evidence synthesis and machine learning analytics.**

**Fig. 2: Agricultural interventions ontology.**

**Fig. 4: MLMs reduce screening time for researchers.**

**Fig. 5: Comparison of human versus machine-selected studies.**

Monitoring global development aid with machine learning

Article 11 April 2022

Assessing the sustainability of the European Green Deal and its interlin kages with the SDGs

Article Open access 13 March 2024

AI for social good: unlocking the opportunity for positive impact

Article Open access 18 May 2020

References

Ioannidis, J. P. A. The mass production of redundant, misleading, and conflicted systematic reviews and meta-analyses: mass production of systematic reviews and meta-analyses. Milbank Q. 94, 485–514 (2016).
Article Google Scholar
Masaki, T., Custer, S., Eskenazi, A., Stern, A. & Latourell, R. Decoding Data Use: How Do Leaders Use Data and Use it to Accelerate Development (AidData, 2017).
Cairney, P. & Oliver, K. How should academics engage in policymaking to achieve impact? Polit. Stud. Rev. 18, 228–244 (2020).
Article Google Scholar
Bornmann, L. & Mutz, R. Growth rates of modern science: a bibliometric analysis based on the number of publications and cited references. J. Assoc. Inf. Sci. Technol. 66, 2215–2222 (2015).
Article Google Scholar
Head, B. W. Reconsidering evidence-based policy: key issues and challenges. Policy Soc. 29, 77–94 (2010).
Article Google Scholar
Littell, J. H. Conceptual and practical classification of research reviews and other evidence synthesis products. Campbell Syst. Rev. 14, 1–21 (2018).
Article Google Scholar
Gurevitch, J., Koricheva, J., Nakagawa, S. & Stewart, G. Meta-analysis and the science of research synthesis. Nature 555, 175–182 (2018).
Article Google Scholar
Parker, T. H. et al. Transparency in ecology and evolution: real problems, real solutions. Trends Ecol. Evol. 31, 711–719 (2016).
Article Google Scholar
Haddaway, N. R. & Westgate, M. J. Predicting the time needed for environmental systematic reviews and systematic maps. Conserv. Biol. 33, 434–443 (2019).
Article Google Scholar
Chalmers, I. et al. How to increase value and reduce waste when research priorities are set. Lancet 383, 156–165 (2014).
Article Google Scholar
Lau, J. Editorial: systematic review automation thematic series. Syst. Rev. 8, 70 (2019).
Article Google Scholar
Çano, E. & Morisio, M. Hybrid recommender systems: a systematic literature review. Intell. Data Anal. 21, 1487–1524 (2017).
Article Google Scholar
Howard, B. E. et al. SWIFT-Review: a text-mining workbench for systematic review. Syst. Rev. 5, 87 (2016).
Article Google Scholar
Marshall, I. J. & Wallace, B. C. Toward systematic review automation: a practical guide to using machine learning tools in research synthesis. Syst. Rev. 8, 163 (2019).
Article Google Scholar
Espey, J. Using evidence & data to drive action on the SDGs. SDSN http://unsdsn.org/news/2018/06/28/using-evidence-data-to-drive-action-on-the-sdgs/ (2018).
Moher, D., Liberati, A., Tetzlaff, J. & Altman, D. G. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. 339, b2535 (2009).
Google Scholar
Caliskan, A., Bryson, J. J. & Narayanan, A. Semantics derived automatically from language corpora contain human-like biases. Science 356, 183–186 (2017).
Article Google Scholar
Rethlefsen, M. L., Farrell, A. M., Osterhaus Trzasko, L. C. & Brigham, T. J. Librarian co-authors correlated with higher quality reported search strategies in general internal medicine systematic reviews. J. Clin. Epidemiol. 68, 617–626 (2015).
Article Google Scholar
Fagan, J. C. An evidence-based review of academic web search engines, 2014–2016: implications for librarians’ practice and research agenda. Inf. Technol. Libr. 36, 7–47 (2017).
Google Scholar
Davidson, B. Storytelling and evidence-based policy: lessons from the grey literature. Palgrave Commun. 3, 17093 (2017).
Article Google Scholar
McAuley, L., Pham, B., Tugwell, P. & Moher, D. Does the inclusion of grey literature influence estimates of intervention effectiveness reported in meta-analyses? Lancet 356, 1228–1231 (2000).
Article Google Scholar
Mikolov, T., Chen, K., Corrado, G. & Dean, J. Efficient estimation of word representations in vector space. https://arxiv.org/abs/1301.3781 (2013).
Yano, T. & Kang, M. Taking advantage of Wikipedia in natural language processing. http://www.cs.cmu.edu/~taey/pub/wiki.pdf (2008).
Sharma, Y., Agrawal, G., Jain, P. & Kumar, T. Vector representation of words for sentiment analysis using GloVe. In 2017 Int. Conf. Intelligent Communication and Computational Techniques 279–284 (ICCT, 2017).
Hearst, M. A. Automatic acquisition of hyponyms from large text corpora. In Proc. 14th Conf. Computational Linguistics Vol. 2 539–545 (Association for Computational Linguistics, 1992).
Pavlidis, P., Wapinski, I. & Noble, W. S. Support vector machine classification on the web. Bioinformatics 20, 586–587 (2004).
Article Google Scholar
Veena, G., Gupta, D., Daniel, A. N. & Roshny, S. A learning method for coreference resolution using semantic role labeling features. In 2017 Int. Conf. Advances in Computing, Communications and Informatics 67–72 (ICACCI, 2017).
Paulavets, M. E., Porciello, J., Kiryllau, Y. I. & Einarson, S. A taxonomy creation for agriculture using classical machine learning algorithms. Big Data Adv. Anal. 5, 45–50 (2019).
Google Scholar
Lewis, D. P., Jebara, T. & Noble, W. S. Support vector machine learning from heterogeneous data: an empirical analysis using protein sequence and structure. Bioinformatics 22, 2753–2760 (2006).
Article Google Scholar
Bojanowski, P., Grave, E., Joulin, A. & Mikolov, T. Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2017).
Article Google Scholar
Joshi, M., Agarwal, R. C. & Kumar, V. Predicting rare classes: can boosting make any weak learner strong? In Proc. Eighth ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining 297–306 (ACM, 2002).
Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: pre-training of deep bidirectional transformers for language understanding. https://arxiv.org/abs/1810.04805 (2018).
Willett, J., Baldwin, T., Martinez, D. & Webb, A. Classification of study region in environmental science abstracts. In Proc. Australasian Language Technology Association Workshop 118–122 (ALTA, 2012).
Kaushik, N. & Chatterjee, N. Automatic relationship extraction from agricultural text for ontology construction. Inf. Process. Agric. 5, 60–73 (2018).
Google Scholar
Beltagy, I., Lo, K. & Cohan, A. SciBERT: a pretrained language model for scientific text. https://arxiv.org/abs/1903.10676 (2019).
Acevedo, M. et al. A scoping review of adoption of climate-resilient crops by small-scale producers in low- and middle-income countries. Nat. Plants https://doi.org/10.1038/s41477-020-00783-z (2020).
Baltenwick, I. et al. A scoping review of feed interventions and livelihoods of small-scale livestock keepers. Nat. Plants https://doi.org/10.1038/s41477-020-00786-w (2020).
Stathers, T. et al. A scoping review of interventions for crop postharvest loss reduction in sub-Saharan Africa and South Asia. Nat. Sustain. https://doi.org/10.1038/s41893-020-00622-1 (2020).
Liverpool-Tasie, L. S. O. et al. A scoping review of market links between value chain actors and small-scale producers in developing regions. Nat. Sustain. https://doi.org/10.1038/s41893-020-00621-2 (2020).
Ricciardi, V. et al. A scoping review of research funding for small-scale farmers in water scarce regions. Nat. Sustain. https://doi.org/10.1038/s41893-020-00623-0 (2020).
Piñeiro, V. et al. A scoping review on incentives for adoption of sustainable agricultural practices and their outcomes. Nat. Sustain. https://doi.org/10.1038/s41893-020-00617-y (2020).
Bizikova, L. et al. A scoping review of the contributions of farmers’ organizations to smallholder agriculture. Nat. Food https://doi.org/10.1038/s43016-020-00164-x (2020).
Maïga, W. H. E. et al. A systematic review of youth skills training programmes in agriculture in low- and middle-income countries. Nat. Food https://doi.org/10.1038/s43016-020-00172-x (2020).
Webb, P. & Kennedy, E. Impacts of agriculture on nutrition: nature of the evidence and research gaps. Food Nutr. Bull. 35, 126–132 (2014).
Article Google Scholar
Yuan, Y. & Hunt, R. H. Systematic reviews: the good, the bad, and the ugly. Am. J. Gastroenterol. 104, 1086–1092 (2009).
Article Google Scholar
Haddaway, N. R. et al. A framework for stakeholder engagement during systematic reviews and maps in environmental management. Environ. Evid. 6, 11 (2017).
Article Google Scholar
Arnott, D. Cognitive biases and decision support systems development: a design science approach. Inf. Syst. J. 16, 55–78 (2006).
Article Google Scholar
Minas, R. K. & Crosby, M. E. In Foundations of Augmented Cognition: Neuroergonomics and Operational Neuroscience (eds Schmorrow, D. D. & Fidopiastis, C. M.) 242–252 (Springer, 2016).
Gil, Y., Greaves, M., Hendler, J. & Hirsh, H. Amplify scientific discovery with artificial intelligence. Science 346, 171–172 (2014).
Article Google Scholar

Download references

Acknowledgements

We gratefully acknowledge the funding of this research was provided by the Federal Ministry of Economic Cooperation (BMZ Germany) and the Bill and Melinda Gates OPP1210352 for the project Ceres2030: Sustainable Solutions to End Hunger. Thank you to Wences Almazan for the artful reproductions of Persephone, and a very heartfelt thank you to all of collaborators and partners who participated in Ceres2030.

Author information

Authors and Affiliations

Department of Global Development, Cornell University, Ithaca, NY, USA
Jaron Porciello & Stefan Einarson
EPAM Systems, Minsk, Belarus
Maryia Ivanina
Belarusian State University, Minsk, Belarus
Maryia Ivanina
Department of Computer and Information Sciences, Cornell University, Ithaca, NY, USA
Maidul Islam & Haym Hirsh

Authors

Jaron Porciello
View author publications
You can also search for this author in PubMed Google Scholar
Maryia Ivanina
View author publications
You can also search for this author in PubMed Google Scholar
Maidul Islam
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Einarson
View author publications
You can also search for this author in PubMed Google Scholar
Haym Hirsh
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.P. designed the approach, managed the project and contributed to some of the programming. M.Ivanina performed most of the programming. M.Islam performed most of the automation and retrieval for grey literature. S.E. assisted with automation and technical infrastructure. H.H. consulted on machine-learning models. J.P. wrote the paper with contributions from M.Ivanina.

Corresponding author

Correspondence to Jaron Porciello.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplemental Figures 1–4 and Supplemental Tables 1–4

Source data

Source Data Fig. 2

Statistical source data

Source Data Fig. 4

Statistical source data

Source Data Fig. 5

Statistical source data

Rights and permissions

Reprints and permissions

About this article

Cite this article

Porciello, J., Ivanina, M., Islam, M. et al. Accelerating evidence-informed decision-making for the Sustainable Development Goals using machine learning. Nat Mach Intell 2, 559–565 (2020). https://doi.org/10.1038/s42256-020-00235-5

Download citation

Received: 09 March 2020
Accepted: 09 September 2020
Published: 12 October 2020
Issue Date: October 2020
DOI: https://doi.org/10.1038/s42256-020-00235-5

This article is cited by

A scoping review on tools and methods for trait prioritization in crop breeding programmes
- M. Occelli
- R. Mukerjee
- H. A. Tufan
Nature Plants (2024)
Discovering new pathways toward integration between health and sustainable development goals with natural language processing and network science
- Thomas Bryan Smith
- Raffaele Vacca
- Ilaria Capua
Globalization and Health (2023)
Digital artifacts reveal development and diffusion of climate research
- Bia Carneiro
- Giuliano Resce
- Tek B Sapkota
Scientific Reports (2022)
How does government expenditure impact sustainable development? Studying the multidimensional link between budgets and development gaps
- Omar A. Guerrero
- Gonzalo Castañeda
Sustainability Science (2022)
Machine-learning-based evidence and attribution mapping of 100,000 climate impact studies
- Max Callaghan
- Carl-Friedrich Schleussner
- Jan C. Minx
Nature Climate Change (2021)