A machine-learning algorithm to target COVID testing of travellers

Obermeyer, Ziad

doi:10.1038/d41586-021-02556-w

NEWS AND VIEWS
22 September 2021

A machine-learning algorithm to target COVID testing of travellers

Optimizing the testing of incoming travellers for COVID-19 involves predicting those who are most likely to test positive. A machine-learning algorithm for targeted testing has been implemented at the Greek border.

Ziad Obermeyer⁰

Ziad Obermeyer
1. Ziad Obermeyer is in the Division of Health Policy and Management, School of Public Health, University of California, Berkeley, Berkeley, California 94720, USA.
View author publications

You can also search for this author in PubMed Google Scholar

You have full access to this article via your institution.

Download PDF

It seems an obvious combination: machine learning and the fight against COVID-19. And yet, despite intense interest and increasing availability of large data sets, success stories of such combinations are few and far between. Writing in Nature, Bastani et al.¹ describe a system that they designed and deployed at points of entry into Greece, starting in August 2020. The algorithm, which is built on a method called reinforcement learning, markedly increased the efficiency of testing for the coronavirus SARS-CoV-2, and contributed to Greece’s ability to keep its borders open safely. The work also provides a clear warning about the shortcomings of the comparatively blunt policy tools that most other countries continue to use.

Read the paper: Efficient and targeted COVID-19 border testing via reinforcement learning

Testing is a problem that machine learning is well suited to solve. Imagine a border-control agent on a Greek island. A flight has just landed, and the agent’s task is to identify and detain anyone who has COVID-19. The agent might want to test all arriving passengers, but the testing capacity on the island is very limited and, more generally, it is never possible to test 100% of any population 100% of the time. The alternative — shutting down the border completely, in an economy highly dependent on tourism — has its own perils. These would include not only a huge financial cost associated with the loss of jobs and income, but also the negative effects of such losses on public health². So the border agent faces a difficult decision: who should be tested?

As has been noted³, the value of a test depends on its eventual outcome. In this scenario, a negative test generates only costs: the cost of testing and a delay for the traveller. By contrast, a positive test generates tremendous benefit: prevention of all the cases of COVID-19 that a traveller infected with SARS-CoV-2 would have caused. So, in deciding who to test, the border agent’s optimal strategy is clear: predict which travellers have the highest likelihood of testing positive, and test them. This strategy maximizes the value of testing, because it detects the most travellers with COVID-19 using the lowest number of tests.

If the border agent could predict which incoming passengers are most likely to test positive, tests could be allocated efficiently (Fig. 1). Conveniently, data about incoming passengers — their country and region of origin, age and sex — are available digitally, on the passenger locator form that all travellers complete 24 hours before arrival in Greece. It seems straightforward enough to use data from past tests of incoming passengers to predict which ‘types’ of passenger might be more likely to test positive in the future. But, as decades of research in statistics and computer science have shown⁴, this strategy runs the risk of getting locked into yesterday’s pandemic: given the rapidly evolving dynamics of COVID-19 spread, an algorithm must quickly adapt its predictions to stay one step ahead and still test the right passengers.

A member of medical staff performs a COVID test on a passenger in Athens airport — **Figure 1 | COVID-19 testing of travellers arriving at Eleftherios Venizelos International Airport in Athens**.Credit: Milos Bicanski/Getty

This is where the value of machine learning becomes clear. Just as an algorithm can be trained to play the game Go⁵ by learning which moves lead to winning the game, Bastani and colleagues trained an algorithm to allocate scarce tests, by learning which passengers are likely to test positive.

Crucially, the algorithm balances two goals. The first, and most natural, goal is to test passenger types who are likely to test positive, by exploiting patterns learnt from previous data about the outcome of tests for SARS-CoV-2 in these different groups. The second — perhaps less intuitive, but equally important — is to explore patterns not reflected in previous data, by testing passenger types about which the algorithm knows little.

Then, at a given port of entry on a given day, the algorithm delivers targeted recommendations to border agents about which passengers to test, while respecting the budget and resource constraints imposed by supply chains, staffing, laboratory capacity and delivery logistics for biological samples. These constraints are real and binding: the authors note that, at the peak of the summer tourism season, there was capacity to test only 18.4% of incoming travellers — even after the Greek National COVID-19 Committee of Experts wisely approved group testing to drive efficiency gains in the lab.

Text-message nudges encourage COVID vaccination

The authors draw on the reinforcement-learning strategies that have powered advances in online commerce and marketing⁶. But using such an algorithm in the real world raises its own technical challenges. For example, the algorithm must learn discontinuously, from large batches of testing results, rather than one-by-one from individual results. And the feedback from batch results is delayed, forcing the algorithm to operate uninformed while waiting for results. Solving these challenges required substantial tweaking of the algorithms that are typically designed for easier, more data-rich online settings.

The thorniest challenges, however, are legal and political ones. To comply with the European Union’s General Data Protection Regulation (GDPR), the authors deliberately limited the data available to the algorithm — and thus its accuracy — in close consultation with lawyers, epidemiologists and policymakers. The potential limit placed on the algorithm’s performance by the GDPR highlights how well-intentioned laws to protect privacy can have both positive and negative consequences. In a pandemic that does not respect individuals’ privacy, such regulations can ultimately hamper the ability of a government to protect the health of its citizens. The authors also adapted the algorithm with a policymaker audience in mind, choosing their optimization methods to showcase clearly the value of both algorithm goals: testing high-risk passengers and testing high-uncertainty passengers.

The results are impressive. The automated system doubled the efficiency of testing — the number of cases detected per test — allowing border agents to test and quarantine the right passengers, many of whom were asymptomatic, while letting others through to their final destination.

The success of the algorithm presented by Bastani and colleagues highlights the inadequacy of the border policies of nearly all other countries. The decisions underlying these policies — for example, whether to deny all travellers entry to the country or to force the testing or quarantine of all travellers from a given country — have two key flaws. First, these decisions are made about entire countries, rather than individuals, disregarding vast differences between people within countries. Second, they are typically made on the basis of country-level epidemiological data that, as the present study shows, have notable shortcomings.

Contact-tracing app curbed the spread of COVID in England and Wales

Had border agents denied entry to all passengers from countries that had concerning metrics, they would have prevented those people with COVID-19 from entering Greece — but at the cost of crushing a key pillar of the economy. Had they simply tested people proportional to a country’s reported COVID-19 metrics rather than algorithmic predictions, however, their testing efficiency would have been much lower. This is because reported COVID-19 metrics can be very different from actual disease prevalence among incoming travellers. Travellers are not randomly drawn from their countries’ populations, and passively collected data on cases of COVID-19 or deaths associated with the disease reflect large reporting biases and systemic barriers to access⁷.

Indeed, by efficiently testing incoming passengers, the authors’ algorithm was able to anticipate spikes in SARS-CoV-2 infection rates among traveller populations almost 9 days earlier than if they had used country-level epidemiological data alone. This indicates the enormous value of intelligent, deliberate data collection — and the dangers of relying on blunt, flawed, country-level data for important decisions.

Bastani and colleagues’ work will be remembered as one of the best examples of using data in the fight against COVID-19. It is a compelling story of how a group of researchers partnered with enlightened policymakers to produce a tool that has enormous social value. It highlights the best parts of both academic research and the civil service, and shows the great promise of artificial intelligence for making good decisions — which in many settings can be the difference between life and death.

Nature 599, 34-36 (2021)

doi: https://doi.org/10.1038/d41586-021-02556-w

References

Bastani, H. et al. Nature 599, 108–113 (2021).
Article Google Scholar
Marmot, M. & Wilkinson, R. (eds) Social Determinants of Health (Oxford Univ. Press, 2005).
Google Scholar
Mullainathan, S. & Obermeyer, Z. Diagnosing Physician Error: A Machine Learning Approach to Low-Value Health Care. National Bureau of Economic Research Working Paper 26168 (2021).
Thompson, W. R. Biometrika 25, 285–294 (1933).
Article Google Scholar
Silver, D. et al. Nature 529, 484–489 (2016).
Article PubMed Google Scholar
Li, L., Chu, W., Langford, J. & Schapire, R. E. in Proc. 19th Int. Conf. World Wide Web 661–670 (2010).
Wu, S. L. et al. Nature Commun. 11, 4507 (2020).
Article PubMed Google Scholar

Download references

Reprints and permissions

Competing Interests

The author declares no competing interests.

Subjects

Latest on:

Lethal AI weapons are here: how can we control them?

News Feature 23 APR 24

Will AI accelerate or delay the race to net-zero emissions?

Comment 22 APR 24

AI’s keen diagnostic eye

Outlook 18 APR 24

Monkeypox virus: dangerous strain gains ability to spread through sex, new data suggest

News 23 APR 24

What toilets can reveal about COVID, cancer and other health threats

News Feature 17 APR 24

Bird flu outbreak in US cows: why scientists are concerned

News Explainer 08 APR 24

WHO redefines airborne transmission: what does that mean for future pandemics?

News 24 APR 24

What toilets can reveal about COVID, cancer and other health threats

News Feature 17 APR 24

US COVID-origins hearing puts scientific journals in the hot seat

News 16 APR 24

Jobs

Director of the Czech Advanced Technology and Research Institute of Palacký University Olomouc

The Rector of Palacký University Olomouc announces a Call for the Position of Director of the Czech Advanced Technology and Research Institute of P...

Czech Republic (CZ)

Palacký University Olomouc
Course lecturer for INFH 5000

The HKUST(GZ) Information Hub is recruiting course lecturer for INFH 5000: Information Science and Technology: Essentials and Trends.

Guangzhou, Guangdong, China

The Hong Kong University of Science and Technology (Guangzhou)
Suzhou Institute of Systems Medicine Seeking High-level Talents

Full Professor, Associate Professor, Assistant Professor

Suzhou, Jiangsu, China

Suzhou Institute of Systems Medicine (ISM)
Postdoctoral Fellowships: Early Diagnosis and Precision Oncology of Gastrointestinal Cancers

We currently have multiple postdoctoral fellowship positions within the multidisciplinary research team headed by Dr. Ajay Goel, professor and foun...

Monrovia, California

Beckman Research Institute, City of Hope, Goel Lab
Postdoctoral Research Fellow Positions, Division of Rheumatology

We seek two postdoctoral fellows to join the Mustelin/Najjar lab in the Rheumatology division, University of Washington, Seattle, USA, to lead and ...

Seattle, Washington State

University of Washington, Department of Medicine, Division of Rheumatology

[1] Bastani, H. et al. Nature 599, 108–113 (2021).
Article Google Scholar

[2] Marmot, M. & Wilkinson, R. (eds) Social Determinants of Health (Oxford Univ. Press, 2005).
Google Scholar

[3] Mullainathan, S. & Obermeyer, Z. Diagnosing Physician Error: A Machine Learning Approach to Low-Value Health Care. National Bureau of Economic Research Working Paper 26168 (2021).

[4] Thompson, W. R. Biometrika 25, 285–294 (1933).
Article Google Scholar

[5] Silver, D. et al. Nature 529, 484–489 (2016).
Article PubMed Google Scholar

[6] Li, L., Chu, W., Langford, J. & Schapire, R. E. in Proc. 19th Int. Conf. World Wide Web 661–670 (2010).

[7] Wu, S. L. et al. Nature Commun. 11, 4507 (2020).
Article PubMed Google Scholar

A machine-learning algorithm to target COVID testing of travellers

References

Competing Interests

Subjects

Latest on:

Jobs

Director of the Czech Advanced Technology and Research Institute of Palacký University Olomouc

Course lecturer for INFH 5000

Suzhou Institute of Systems Medicine Seeking High-level Talents

Postdoctoral Fellowships: Early Diagnosis and Precision Oncology of Gastrointestinal Cancers

Postdoctoral Research Fellow Positions, Division of Rheumatology

Search

Quick links

References

Competing Interests

Related Articles

Subjects

Latest on:

Jobs

Director of the Czech Advanced Technology and Research Institute of Palacký University Olomouc

Course lecturer for INFH 5000

Suzhou Institute of Systems Medicine Seeking High-level Talents

Postdoctoral Fellowships: Early Diagnosis and Precision Oncology of Gastrointestinal Cancers

Postdoctoral Research Fellow Positions, Division of Rheumatology

Search

Quick links