Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Non-selective distribution of infectious disease prevention may outperform risk-based targeting

## Abstract

Epidemic control often requires optimal distribution of available vaccines and prophylactic tools, to protect from infection those susceptible. Well-established theory recommends prioritizing those at the highest risk of exposure. But the risk is hard to estimate, especially for diseases involving stigma and marginalization. We address this conundrum by proving that one should target those at high risk only if the infection-averting efficacy of prevention is above a critical value, which we derive analytically. We apply this to the distribution of pre-exposure prophylaxis (PrEP) of the Human Immunodeficiency Virus (HIV) among men-having-sex-with-men (MSM), a population particularly vulnerable to HIV. PrEP is effective in averting infections, but its global scale-up has been slow, showing the need to revisit distribution strategies, currently risk-based. Using data from MSM communities in 58 countries, we find that non-selective PrEP distribution often outperforms risk-based, showing that a logistically simpler strategy is also more effective. Our theory may help design more feasible and successful prevention.

## Introduction

Pre-exposure prophylaxis (PrEP) of the Human Immunodeficiency Virus (HIV) is the use of antiretroviral medications to prevent HIV acquisition, by uninfected individuals. More than ten years have passed since the first evidence that PrEP could protect people from HIV1, and PrEP is now a component of the HIV prevention cascade2. Its uptake, however, has been restricted to a few countries3, and is currently inadequate in the context of the global effort toward eliminating the HIV/AIDS epidemic2,3. A challenge to PrEP scale-up is its distribution, which requires identifying potential candidates, supplying the medication, and providing the necessary follow-up to ensure consistent use. Most guidelines4 and cost-effectiveness studies5,6,7,8 recommend offering PrEP to those at high risk of acquiring HIV9,10. Risk, however, is difficult to measure, and particularly so among those who would benefit the most from PrEP, as it is the case of men-having-sex-with-men (MSM)4,7,11: Stigma and punitive laws often marginalize communities and make them hard-to-reach12. Proposed metrics for estimating individual risk often exhibit poor accuracy4,11, or maybe operationally too challenging13, when faced with real-world complexity14. Profiling risk may also reinforce stigma7.

Risk-based distribution strategies apply to diseases other than HIV, and types of prevention other than chemoprophylaxis. It is the case with many vaccines15: early vaccination of healthcare workers against COVID-19 is the latest notable example. One underpinning of risk-based distribution is that high infection risk comes, at least partially, from having many contacts with other individuals through which the infection may be acquired. Once infected, however, having many contacts means having a high probability of further spreading the pathogen, i.e., of causing superspreading events. Thus, targeting those at high risk means preventing superspreading events, preventing a large number of infections, and lowering incidence in the population. In the formalism of complex networks, whereby nodes are individuals and links are contacts along which the pathogens can spread16, this means prioritizing the highest-degree nodes (hubs)17,18. Many extensions to this theory have appeared19,20,21,22,23,24, but the main tenet has remained the same: you should protect those who can cause superspreading events, if infected. Prevention strategies that target individuals with specific contact patterns, however, require detailed information on the underlying network structure, which is hard to get25,26, and thus not part of routine surveillance, as the case of PrEP among MSM shows8. As a result, these strategies may perform well in models, but are hard to translate into public health guidelines.

Our study helps to bypass this limitation, by proving that targeting those at high risk of causing superspreading events may not be the best-performing strategy in all settings: Simpler strategies, which are easier to implement, maybe more effective. We do that by studying the role of the individual-level efficacy of prevention: Efficacy measures how well prophylaxis, or vaccination, protects the recipient from infection27. 100% efficacy means that those who use prevention cannot be infected; below this value, efficacy is the probability that prevention averts a transmission event that would otherwise occur. We demonstrate that efficacy determines which distribution strategy works best in reducing community-level disease circulation, and that targeting those at highest risk is optimal only if efficacy is above a threshold, which we derive analytically. We also find that PrEP efficacy is below this threshold in many MSM communities in the world. In these communities, non-selective PrEP distribution likely outperforms targeted distribution, showing that the logistically simplest distribution strategy is also the most effective.

## Results

We start from the observation that, if efficacy is below 100%, higher risk of exposure to the pathogen entails a higher chance that prevention fails, leading to a breakthrough infection28,29,30. This specifically concerns hubs, as the number of contacts determines—at least partially—the risk of exposure. We thus posit the existence of a trade-off. On the one hand, standard theory tells us that protecting those with many contacts brings down population-level transmission, given that they can cause superspreading events, when infected. On the other hand, their chance of experiencing breakthrough infections may be high.

We quantitatively investigate the existence, and phenomenology, of this trade-off, using the heterogeneous mean-field formalism16 on an annealed network with degree distribution p(k). Each node in the network has degree k sampled from p(k), establishing k contacts (links) with other nodes. Heavy-tailed degree distributions are typically used to model heterogeneity in the number of contacts31. We assume here that node degrees along links are not correlated. Real contact networks may however exhibit assortative behavior31: high-degree nodes tend to be in contact with high-degree nodes. In Supplementary Note 1, we cover the case of assortative networks. Annealed networks are particularly suitable when the timescale of pathogen spread is much larger than the timescale at which contacts change16, as is the case of HIV epidemics in MSM communities (Supplementary Note 2). Also, annealed networks can be parametrized from existing surveys5,13,32, unlike more complex network models, which would require high-resolution contact data. To describe disease spread, we use the Susceptible-Infectious-Susceptible compartmental model16, by which a susceptible individual becomes infected at a rate λ, when in contact with an infectious individual. Also, those infected spontaneously transition to the susceptible state at rate μ. This last process may model recovery, or population turnover, as in the case of HIV infection5 (see also Supplementary Note 3). We also assume leaky27 prevention, with efficacy ϵ in decreasing the instantaneous transmission rate: ϵ = 1 corresponds to maximally effective prevention.

The heterogeneous mean-field formalism is a customary approach to write the equations describing the evolution in time of the spread of the disease in terms of the probability, by degree class, that a node is infected16. It can deal with arbitrary degree distributions, while factoring out all dynamical correlations in the status of connected nodes, which would render the theory intractable. In our case, these equations are

$$\left\{\begin{array}{l} \kern-2.3pc {\dot{x}}_{k}=-\mu {x}_{k}+\frac{\lambda }{\langle k\rangle }k(1-{x}_{k})\xi \\ {\dot{y}}_{k}=-\mu {y}_{k}+\frac{\lambda }{\langle k\rangle }(1-\epsilon )k(1-{y}_{k})\xi \\ \kern-1.1pc\xi =\mathop{\sum}\limits_{k}k{p}_{k}\left[(1-{g}_{k}){x}_{k}+{g}_{k}{y}_{k}\right].\end{array}\right.$$
(1)

Here, xk is the probability that an individual in degree class k who does not receive prevention is infected, yk is the probability that an individual in degree class k who receives prevention is infected, λ is the transmission rate, μ is the recovery rate, gk is the probability that an individual in degree class k receives prevention. ξ is an auxiliary variable that encodes the probability of establishing a contact with an infected individual. It is the extension, in the case of a partially immunized population, of the customary coupling term of the heterogeneous mean-field equations16. The form of ξ given in Eq. (1) implies no degree-degree correlations: see Supplementary Note 1 for nonzero assortativity. For convenience, we also define the reduced transmission rate as $$\hat{\lambda }=\lambda /(\langle k\rangle \mu )$$, where 〈k〉 is the average degree.

### Optimal distribution of prevention

Community-level prevalence can be written as a function of the quantities in Eq. (1): $$I[\,g,x,y]={\sum }_{k}{p}_{k}\left[(1-{g}_{k}){x}_{k}+{g}_{k}{y}_{k}\right]$$. We now wish to derive the impact that increasing prevention among a specific degree class has on decreasing community-level prevalence, for different values of efficacy ϵ. Optimizing the distribution strategy is relevant when large-scale distribution and adoption is not possible. We thus start from the configuration of no prevention (g = 0), and study the impact of providing prevention to a small number of individuals, in degree class k, by means of the following linear response function: $$f(k)=-{\left.(1/{p}_{k}){{{{{{{\rm{d}}}}}}}}I/{{{{{{{\rm{d}}}}}}}}{g}_{k}\right|}_{g = 0}$$ (see Methods for a detailed explanation). If protecting hubs is the best-performing strategy, then f will be a monotonously increasing function of k. The existence of the trade-off will instead be marked by the existence of maximum of f, at finite k. At the endemic equilibrium ($${\dot{x}}_{k}={\dot{y}}_{k}=0$$), we derive the expression of f from Eq. (1) (see Methods):

$$f(k) = \overbrace{\left.(x_k - y_k)\right|_{g=0}}^{F_{dir}(k)} + \overbrace{\frac{1}{p_k}\mathop {\sum}\limits_{m} p_{m} \left.\frac{dx_{m}}{dg_k}\right|_{g=0}}^{F_{indir}(k)} .$$
(2)

f has two terms. Fdir(k) quantifies the direct reduction in risk of infection among those receiving prevention. Findir(k) quantifies the indirect effect of the prevention campaign: the reduction in risk of infection among those who did not receive prevention, due to the presence of those who did. We derive the expression of both terms:

$${F}_{dir}(k)=\frac{\epsilon \hat{\lambda }zk}{\big[1+\hat{\lambda }zk\big]\big[1+(1-\epsilon )\hat{\lambda }zk\big]},$$
(3)
$${F}_{indir}(k)=\frac{\psi \hat{\lambda }}{1-\phi }k{F}_{dir}(k),$$
(4)

where $$\hat{\lambda },\psi ,\phi$$ depend on the degree distribution and the epidemic parameters, but do not depend on ϵ, k. Their detailed expressions are provided in the Methods, along with the details of the calculation (see also Supplementary Note 4). Supplementary Note 5 shows the agreement of the analytical derivation with the numerical counterpart. We first examine the direct effect. Fdir has a maximum when $$k={k}_{dir}^{* } \sim 1/\sqrt{1-\epsilon }$$. If protection is perfect (ϵ = 1), then Fdir is monotonously increasing, meaning that highly connected individuals should always be prioritized, as the current theory mandates. If ϵ < 1, $${k}_{dir}^{* }$$ is finite (Fig. 1a), and above it, the high chance of breakthrough infection among high-risk individuals offsets the direct gain in protecting those who are most likely to get infected. We now turn to the indirect effect. Findir increases monotonously for any value of efficacy ϵ (Fig. 1a). This means that providing prevention to highly connected individuals always induces the largest indirect benefit on those who are not using prevention. The combination of Fdir and Findir gives the following optimal degree for degree-prioritized prevention strategies:

$${k}^{* }=\frac{1+\sqrt{\left(\frac{z(1-\phi )}{\psi }-1\right)\left(\frac{z(1-\phi )}{\psi }(1-\epsilon )-1\right)}}{\hat{\lambda }z\left[\frac{z(1-\phi )}{\psi }(1-\epsilon )-(2-\epsilon )\right]}.$$
(5)

k* is always greater than $${k}_{dir}^{* }$$, as it is the result of the effect of direct protection, which is optimal at $${k}_{dir}^{* }$$, and indirect protection, which increases with k (Fig. 1b, c). Furthermore, while $${k}_{dir}^{* }$$ is finite whenever protection is non perfect (ϵ < 1), k* is finite if ϵ < ϵc ≤ 1, with

$${\epsilon }_{c}=\frac{(1-\phi )z-2\psi }{(1-\phi )z-\psi }.$$
(6)

Equation (6) is our main theoretical result: There exists a value of critical efficacy, which is analytically computable, and which discriminates between the two following parameter regions. In the high-efficacy region (ϵ ≥ ϵc) you should prioritize those at risk of causing superspreading events. In the low-efficacy region (ϵ < ϵc), instead, targeting individuals in degree class k = k* <  has the strongest impact on community prevalence.

Notably, the location and shape of the two parameter regions depend on baseline endemic prevalence (i.e., the prevalence in the absence of prevention). Highly prevalent diseases have higher ϵc, and, in the low-efficacy region, lower k* (see Methods for the proof). This means that the same prevention tool (fixed ϵ) may warrant different distribution strategies in different settings (Fig. 2a). In particular, individuals with many contacts should be targeted in low-prevalence communities (ϵc ≤ ϵ). Contrarily, where prevalence is high (ϵc > ϵ), they should not.

This also implies that the invasion phase of an epidemic is always in the high-efficacy region: if the aim of the prevention campaign is to minimize the likelihood of an outbreak of a disease which is not yet circulating, rather than eliminating an endemic disease, then targeting those at risk of causing superspreading events will always be optimal. This can be seen as the zero-prevalence limit of the above derivation, or more rigorously by computing the epidemic threshold16, as we do in the Methods.

The value of the critical efficacy ϵc depends on network topology, too. Specifically, more heterogeneous contact networks have higher ϵc. This is shown in Fig. 2b where, in the case of negative binomial degree distribution, ϵc increases as overdispersion increases. Supplementary Note 4 reports the same result for a power-law degree distribution. Degree-degree correlations also increase the critical efficacy ϵc, and, in the low-efficacy region, decrease k* (see Supplementary Note 1). Intuitively, this happens because assortativity increases risk of exposure among those already at high risk, exacerbating the likelihood of breakthrough infections.

We remark that efficacy (ϵ) measures the level of leakage of prevention, i.e., how well it brings down the chance of transmission upon contact27. It should not be confused with the all-or-none mode of action27, by which some instruments of prevention may completely fail to protect some individuals. This latter mechanism does not change the relative effectiveness of different distribution strategies, and is therefore not a factor in our study.

The low-efficacy region features, by definition, a class of individuals with finite degree k*, that should be prioritized. This is conceptually consequential, but it may have a limited operational impact. We thus investigate whether, when ϵ < ϵc, offering prevention non selectively (random targeting) still outperforms targeting those at risk of causing superspreading events. A new critical value – ϵr – emerges, and splits the low-efficacy region in two. When efficacy is lower than ϵc, but higher than ϵr, targeting individuals with degree k* is still the best-performing strategy, but targeting those at risk of causing superspreading events outperforms non-selective targeting. The opposite is true when, instead, efficacy is lower than ϵr. We call transition zone the part of the low-efficacy region where ϵ > ϵr: there, the choice of the distribution strategy is strongly determined by the practical constraints on being able to identify individuals at given levels of risk (see Fig. 2a). Degree-degree correlations in the contact network have the effect of lowering ϵr (see Supplementary Note 1).

### Pre-exposure prophylaxis of HIV

We now apply this theory to PrEP in MSM communities, which is a prime candidate for exhibiting the emergence of a low-efficacy region, for several reasons. First, it is generally recommended in high-prevalence settings, and MSM has 25 times greater risk of acquiring HIV than heterosexual men2. Second, the efficacy of PrEP varies widely. It depends on the regimen (daily1 v on-demand33), and on the level of adherence to the regimen: generally, efficacy falls in the range 40–90%34. Plus, the protection PrEP provides is leaky34, as resistance to tenofovir/emtricitabine—the most common oral PrEP formulation—is rare35. Finally, the distribution of the number of sexual interactions an individual has (degree distribution) is heterogeneous32,36.

We used estimates of HIV prevalence among MSM, coverage of antiretroviral treatment, prevalence of viral suppression, to compute the effective prevalence, i.e., the fraction of individuals at risk of transmitting HIV. We did it for 58 countries, and 24 cities. We used a negative binomial degree distribution with empirically-informed parameters. Data sources and details on the numerical estimations are available in Supplementary Note 6.

We set PrEP efficacy to 60%, and found that 34 out of 78 communities are in the high-efficacy region, 44 in the low-efficacy region. Among the latter, 4 are in the transition zone. Europe is in the high-efficacy region, in accordance with previous studies recommending risk-based distribution5,37. Notably, many communities in areas of active PrEP roll-out38 are in the low-efficacy region: it is the case of Brazil and of those in southern Africa (excluding Botswana). These results have four implications. First, different communities may warrant different PrEP distribution strategies, as they find themselves in different parameter regions. This provides corroborating evidence to the current international commitment to eliminating the HIV/AIDS epidemic through geographically tailored responses and interventions39. Second, effective interventions require epidemiological and behavioral data at high accuracy and resolution, whose collection is also at the center of international efforts, at least programmatically39. High accuracy ensures that the region (low-efficacy vs high-efficacy) is correctly estimated, high resolution responds to the fact that spatially contiguous communities may be epidemiologically different: Fig. 3b shows several examples of the latter phenomenon. One is Cameroon, which national estimates put in the high-efficacy region, but epidemiological data from two of its cities, Douala and Yaoundé, point to the low-efficacy region. Namibia and Botswana are another example: they are neighboring countries, which share generalized HIV epidemics, but lie in different parameter regions. Third, parameter region assignment is weakly sensitive to PrEP efficacy. We chose the mid-range value of 60%. Lowering it to the value of the IprEx trial1 (44%) would cause only 2 out of 76 communities to change parameter region (Fig. 3c). Increasing it to that of the IPERGAY study33 (86%) would cause 5 out of 76 countries to change region (Fig. 3d). This ensures that our assignment is robust across PrEP efficacy estimates.

We also tested the impact of assortative mixing, which is reported in many MSM communities. Namely, location-based partner selection40, and homophily8, may cause those at high risk to mix preferably with other high-risk individuals. Assortativity had little effect on the efficacy estimates of Fig. 3b: Specifically, with the assortativity estimated in Ref. 41, only 4 out of 34 communities moved from the high-efficacy region to the transition zone, with risk-based distribution still outperforming non-selective distribution (see Supplementary Note 1). We also checked a value of assortativity twice as much as that of Ref. 41 (see Supplementary Note 1): in that case, 7 of 34 communities moved from the high-efficacy region to the transition zone, and 2 out of 4 moved from the transition zone to the low-efficacy region. This shows that assortativity may change recommendations for PrEP distribution only at extremely high values (i.e., those at high risk strongly favoring mixing with others at high risk), and for only 2 out of the 76 communities investigated here.

Generally, populations at low coverage tend to be in the low-efficacy region (see Supplementary Note 7). In these communities, our results corroborate the calls to shift the focus away from risk7. As treatment coverage expands, communities may then transition to the high-efficacy region, showing that the scale-up of treatment and of prevention should be in sync: As treatment expands, prevention should adapt. To exemplify this, we investigated what would happen if UNAIDS’s 95-95-95 targets for testing, treatment, and viral suppression were reached39. Figure 3 shows that 38 out of 44 communities that are now in the low-efficacy region would transition to the high-efficacy region. Notably, however, many communities in Africa would remain in the low-efficacy parameter region even at that extremely high treatment coverage.

Stigma and criminalization of same-sex acts are other factors possibly associated with the low-efficacy region: They are obstacles to PrEP use, as they make it harder to supply the medication, and to provide consistent support and follow-up. This decreases adherence, which in turns decreases efficacy. Decriminalization and societal changes leading to lower stigma may thus signal a transition from the low-efficacy to the high-efficacy region.

Finally, the availability of new PrEP formulations may affect the conditions and timing of the transition to the high-efficacy region. Notably, long-acting injectable cabotegravir (CAB-LA) was recently shown to have higher efficacy than oral PrEP42. This means that communities that are now in the low-efficacy region for oral PrEP, maybe in the high-efficacy region for CAB-LA.

## Discussion

We set up a theoretical formalism to identify the best strategy for population-level distribution of primary prevention. We found that the infection-preventing efficacy of prevention, disease prevalence, and the underlying contact structure determine under which conditions nonselective distribution of prevention outperforms risk-based distribution.

We then applied it to pre-exposure prophylaxis of HIV among men-having-sex-with-men. Non-selective PrEP distribution is effective when HIV prevalence is high and/or treatment coverage is low. Then, as prevalence goes down and treatment increases, focusing on protecting individuals at the highest risk will likely become the best-performing strategy. At the same time, more consistent use of oral PrEP, or new long-acting PrEP formulations may speed up the progression to the high-efficacy region. When this happens, it is possible that many communities will find themselves in the transition zone, at least temporarily. There, risk-based distribution should already be favored over non-selective distribution, as in the high-efficacy region.

Our work has limitations. We focused on optimizing the reduction of community-level disease burden, and did not consider other aspects of primary prevention, such as providing equitable access to prevention, or improving the quality of life of marginalized individuals. Our study did not include factors which can influence risk of acquisition: in the case of HIV, we did not explicitly account for the effect of primary prevention other than PrEP2. Specifically, whereas our framework does account for an arbitrary overall rate of condom use by means of the transmissibility parameter λ, it does not include possible changes in condom use among those on PrEP, due to possible behavioral adaptation43. The compartmental model we used is a coarse-grained representation of the progression of HIV infection, and its transmission. In particular, it does not account for the different transmission probability of receptive and insertive anal sex. This, however, would potentially bias our findings only if PrEP use were consistently correlated with type of act (insertive vs receptive). Summing up, more detailed HIV models, and community-specific estimates of partner selection patterns, could provide better numerical estimates of critical efficacy, and thus be useful in applied studies focusing on specific communities. We also remark that our study applies to the infection-preventing effect of medications. Some of them also reduce morbidity and mortality among the infected, as it is the case of vaccines against COVID-19. As such, the main criterion for their distribution has been the risk of developing severe disease, which is beyond the scope of our study. Also, we show in Supplementary Note 8 that the low-efficacy parameter region of COVID-19 vaccination requires very low vaccine efficacy, or unrealistically high incidence, finding no evidence against risk-based distribution. Finally, our model does not consider the fact that targeting high-risk individuals may be inevitable, if the side effects of the medication or vaccine outweigh its benefits only when the probability to be exposed to the pathogen is high.

HIV prevention is but one public health challenge in which the low-efficacy region may be present: Whenever vaccination campaigns aim at reducing incidence in high-prevalence settings, estimating the value of critical efficacy could help optimize vaccine distribution. It might be the case, for instance, of Plasmodium falciparum malaria, as a new vaccine formulation may soon become available44. When this happens, adapting our theory to malaria – both in terms of vector-borne transmission and mixing network45 – will help inform roll-out, especially where parasite prevalence is high.

## Methods

### The linear response function f

#### Definition

The goal of f(k) is to measure the impact that providing prevention to a few individuals in degree class k has on community-level baseline prevalence. We assume a population of N individuals, and define f as the change in the number of infected individuals in the population (NI), due to a small change in the amount of prevention provided in degree class k (Npkgk):

$$f(k) \sim -{\left.\frac{{{{{{{{\rm{d}}}}}}}}(NI)}{{{{{{{{\rm{d}}}}}}}}(N{p}_{k}{g}_{k})}\right|}_{g = 0},$$
(7)

where the minus sign is due to the fact that prevention will bring prevalence down. Here I is community-level prevalence as defined previously. Then, the population size N correctly cancels out (the final result does not depend on population size), and we get to the final definition of the response function:

$$f(k)=-\frac{1}{{p}_{k}}{\left.\frac{{{{{{{{\rm{d}}}}}}}}I}{{{{{{{{\rm{d}}}}}}}}{g}_{k}}\right|}_{g = 0},$$
(8)

#### Derivation of Fdir

At the endemic equilibrium ($${\dot{x}}_{k}={\dot{y}}_{k}=0$$), one can use Eq. (1) to write yk as a function of xk:

$${y}_{k}=\frac{1-\epsilon }{1-\epsilon {x}_{k}}{x}_{k}.$$
(9)

Also, given that xk has to be evaluated at g = 0, we can use its recursive form, which comes from setting g = 0 in the first line of Eq. (1):

$${x}_{k}=\frac{z\hat{\lambda }k}{1+z\hat{\lambda }k},$$
(10)

where $$z=\left\langle kx\right\rangle$$. Angle brackets denote expectation values on the degree distribution, so in this case this would mean

$$z=\left\langle kx\right\rangle =\mathop{\sum}\limits_{k}{p}_{k}k{x}_{k}.$$
(11)

z has a clear epidemiological interpretation, as it measures the expected number of at-risk contacts that an individual makes. Specifically, $$z=\left\langle k\right\rangle l$$, where l is the probability that a given contact is with an infected individual. This measure is sensitive to the amount of heterogeneity in the network. Indeed, if the network had a homogeneous degree distribution (i.e., all individuals had degree close to $$\left\langle k\right\rangle$$), then $$z\approx \left\langle k\right\rangle I$$ (I is the prevalence as usual). Broad degree distributions give instead $$z \, > \, \left\langle k\right\rangle I$$, meaning that the probability of establishing a contact with an infected individual is higher than the probability of finding an infected individual at random in the population.

Plugging Eqs. (9)–(10) into Eq. (2), one gets Eq. (3).

#### Derivation of Findir

In the following, we implicitly assume that all should be evaluated at g = 0. At equilibrium ($$\dot{x}=0$$), we perform the derivative $$\frac{d}{d{g}_{m}}$$ on both sides of the first line in Eq. (1):

$$-\frac{d{x}_{k}}{d{g}_{m}}+\hat{\lambda }k\left[-\frac{d{x}_{k}}{d{g}_{m}}z+(1-{x}_{k})\frac{d\xi }{d{g}_{m}}\right]=0.$$
(12)

We compute the derivative of ξ from its definition in Eq. (1):

$$\frac{d\xi }{d{g}_{m}}=m{p}_{m}({y}_{m}-{x}_{m})+\mathop{\sum}\limits_{k^{\prime} }k^{\prime} {p}_{k^{\prime} }\frac{d{x}_{k^{\prime} }}{d{g}_{m}},$$
(13)

and insert it into Eq. (12):

$$\mathop{\sum}\limits_{k^{\prime} }\left[\hat{\lambda }k(1-{x}_{k})k^{\prime} {p}_{k^{\prime} }-{\delta }_{kk^{\prime} }(1+z\hat{\lambda }k)\right]\frac{d{x}_{k^{\prime} }}{d{g}_{m}}=-\hat{\lambda }k(1-{x}_{k})m{p}_{m}(\,{y}_{m}-{x}_{m}).$$
(14)

This equation constitutes a linear system for the matrix Jkm = dxk/dgm. Defining the auxiliary variables $${{{{{{{{\rm{u}}}}}}}}}_{k}=\hat{\lambda }k(1-{x}_{k})$$, vk = kpk, wk = kpk(yk − xk) and $${{{{{{{{\rm{D}}}}}}}}}_{kk^{\prime} }=(1+z\hat{\lambda }k){\delta }_{kk^{\prime} }$$, we can rewrite Eq. (14) as

$$({{{{{{{\bf{u}}}}}}}}{{{{{{{{\bf{v}}}}}}}}}^{T}-{{{{{{{\bf{D}}}}}}}}){{{{{{{\bf{J}}}}}}}}=-{{{{{{{\bf{u}}}}}}}}{{{{{{{{\bf{w}}}}}}}}}^{T}.$$
(15)

To get J, we note that the matrix uvT − D is a rank-1 perturbation of a diagonal matrix, and invert it by means of Ref. 46 (Sherman–Morrison formula):

$${{{{{{{\bf{J}}}}}}}}=\frac{1}{1-{{{{{{{{\bf{v}}}}}}}}}^{T}{{{{{{{{\bf{D}}}}}}}}}^{-1}{{{{{{{\bf{u}}}}}}}}}{{{{{{{{\bf{D}}}}}}}}}^{-1}{{{{{{{\bf{u}}}}}}}}{{{{{{{{\bf{w}}}}}}}}}^{T}.$$
(16)

By inserting the definitions of u, v, w and D into Eq. (16), and after some algebra, we get an explicit expression of J, and thus the derivative dxk/dgm.

Now, with dxk/dgm, yk, and xk at hand, and again after some algebra, we get to the final form of Findir [Eq. (4)], and thus f(k):

$$f(k)=\frac{\epsilon \hat{\lambda }zk}{(1+\hat{\lambda }zk)\big[1+(1-\epsilon )\hat{\lambda }zk\big]}\left(1+\frac{\hat{\lambda }\psi }{1-\phi }k\right),$$
(17)

where we defined

$$\phi =\hat{\lambda }\left\langle {\left(\frac{k}{1+z\hat{\lambda }k}\right)}^{2}\right\rangle \,{{{{{{{\rm{and}}}}}}}}\,\psi =\left\langle \frac{k}{{(1+z\hat{\lambda }k)}^{2}}\right\rangle .$$
(18)

Expectation values are computed similarly to Eq. (11) (see also Supplementary Note 9).

#### Critical point of f

The derivative of f(k) in Eq. (17) is proportional to the following:

$$f^{\prime} (k) \sim {k}^{2}{\hat{\lambda }}^{2}z\left[\psi (1-\phi )(2-\epsilon )+z(1-\epsilon )\right]-2\hat{\lambda }\psi (1-\phi )k+1.$$
(19)

In the above expression, we dropped a strictly positive term that multiplies the rhs. Evaluating the derivative at k = 0, we immediately see that $$f^{\prime} (0) \, > \, 0$$. Accordingly, a sufficient condition for f(k) to have a maximum in $${{\mathbb{R}}}_{+}$$ is $$\mathop{\lim }\limits_{k\to \infty }f^{\prime} (k) \, < \, 0$$. For large k, the leading term is the quadratic one in Eq. (19). Therefore, the condition $$\mathop{\lim }\limits_{k\to \infty }f^{\prime} (k) \, < \, 0$$ requires the quadratic term to be positive, i.e.

$$z(1-\phi )-2\psi \, > \, \epsilon \left[z(1-\phi )-\psi \right].$$
(20)

From their definitions, we know that z, ϕ, ψ > 0. Let us now assume that the term on the right hand side (RHS) is negative. In this case, the above condition would read

$$\frac{z(1-\phi )-2\psi }{z(1-\phi )-\psi } \, < \, \epsilon .$$
(21)

The variable ϵ is bounded between [0, 1]. Therefore, the condition in Eq. (21) can only be fulfilled if the left hand side is smaller than one. However, this is impossible since it would require 2ψ < ψ. Thus, for k* to exist, the RHS in Eq. (20) must be positive, which necessarily requires ϕ < 1. Accordingly, Eq. (20) can be written as

$$\epsilon \, < \, {\epsilon }_{c}=\frac{z(1-\phi )-2\psi }{z(1-\phi )-\psi }.$$
(22)

It is straightforward to show that ϵc < 1. Further, ϵc is positive if z(1 − ϕ) > 2ψ. Eventually, solving for $$f^{\prime} (k)=0$$ then gives k* as in Eq. (5).

### Effect of prevalence and network heterogeneity on k*, ϵc

z is sensitive both to prevalence, and to network heterogeneity. In particular, if prevalence increases, then z increases (given that $$z\ge \left\langle k\right\rangle I$$). Also, z increases if network heterogeneity increases, too. This happens because, in more heterogeneous networks, higher-degree nodes will more likely to be infected than low-degree ones. We can prove this rigorously in the case of power-law-distributed degrees. From Eqs. (10)-(11), the following equation for z follows:

$$\hat{\lambda }\mathop{\sum}\limits_{k}{p}_{k}\frac{{k}^{2}}{1+z\hat{\lambda }k}=1.$$
(23)

Assuming pk = (γ − 1)kγ, and approximating sums on k with integrals, this equation becomes

$${}_{2}{F}_{1}\left(1,\gamma -2,\gamma -1,-\frac{1}{z\hat{\lambda }}\right)=z\frac{\gamma -2}{\gamma -1},$$
(24)

where 2F1 is the ordinary hypergeometric function. This has a simple pole at γ = 2, and there, $${}_{2}{F}_{1}(1,\gamma -2,\gamma -1,-\frac{1}{z\hat{\lambda }})\approx \frac{1}{z(\gamma -2)}$$. In the vicinity of γ = 2, Eq. (24) thus becomes

$$\frac{1}{{\left[z(\gamma -2)\right]}^{2}}\approx \frac{1}{\gamma -1}:$$
(25)

The rhs is finite in γ = 2. This implies z ~ 1/(γ − 2) to kill the divergence in the lhs. Hence, z becomes larger as the network becomes more heterogeneous (i.e., γ decreases towards γ = 2).

When instead the network becomes more homogeneous (i.e., γ becomes larger and larger), Eq. (24) tends to

$$z\approx 1-\frac{1}{\hat{\lambda }};$$
(26)

which is exactly its lower bound ($$z=\left\langle k\right\rangle I$$), as previously discussed. This completes the proof that z increases when either prevalence increases, or the network becomes more heterogeneous.

Now, when z is large, the following approximate relations hold:

$$\phi \approx \frac{1}{{z}^{2}\hat{\lambda }};$$
(27)
$$\psi \approx \frac{\left\langle {k}^{-1}\right\rangle }{{z}^{2}{\hat{\lambda }}^{2}}.$$
(28)

This implies that Eqs. (5)–(6), in the z →  limit, become

$${k}^{* }\approx \frac{1}{\hat{\lambda }z}\frac{1+\sqrt{1-\epsilon }}{1-\epsilon }\to 0;.$$
(29)
$${\epsilon }_{c}\approx 1-\frac{\left\langle {k}^{-1}\right\rangle }{{z}^{3}{\hat{\lambda }}^{2}}\to 1;$$
(30)

proving that higher prevalence, and higher network heterogeneity, cause ϵc to increase, and, in the low-efficacy region, cause k* to decrease.

### Invasion stage and epidemic threshold

To calculate the epidemic threshold of the system, we study the stability of the disease-free equilibrium (xk = yk = 0), as customary16.

First, we linearize Eq. (1) around xk = yk = 0:

$$\left\{\begin{array}{l}\kern-2.4pc{\dot{x}}_{k}=-\mu {x}_{k}+\frac{\lambda }{\langle k\rangle }k\xi \\ {\dot{y}}_{k}=-\mu {y}_{k}+\frac{\lambda }{\langle k\rangle }(1-\epsilon )k\xi.\end{array}\right.$$
(31)

From these, we derive an equation for ξ, by multiplying the first line by kpk(1 − gk), the second line by kpkgk, and then sum them together, and sum over k. This gives

$$\dot{\xi }=\left[-\mu +\frac{\lambda }{\left\langle k\right\rangle }\left(\left\langle {k}^{2}\right\rangle -\left\langle g{k}^{2}\right\rangle \right)\right]\xi .$$
(32)

The disease-free equilibrium is no longer stable for transmissibility values giving $$-\mu +\frac{\lambda }{\left\langle k\right\rangle }\left(\left\langle {k}^{2}\right\rangle -\left\langle g{k}^{2}\right\rangle \right) \, > \, 0$$. This gives the following epidemic threshold:

$${\lambda }_{c}={\lambda }_{0}{\left(1-\frac{\epsilon \langle g{k}^{2}\rangle }{\langle {k}^{2}\rangle }\right)}^{-1}.$$
(33)

Here, λ0 is the well-known value of the epidemic threshold in the absence of prevention (g = 0): λ0 = μk〉/〈k247. We now define the response function for the epidemic threshold:

$${f}_{\lambda }(k)=\frac{1}{{p}_{k}}{\left.\frac{{{{{{{{\rm{d}}}}}}}}{\lambda }_{c}}{{{{{{{{\rm{d}}}}}}}}{g}_{k}}\right|}_{g = 0},$$
(34)

Unlike Eq. (8), here there is no minus sign because prevention increases the epidemic threshold. With some algebra, one gets:

$${f}_{\lambda }(k)=\frac{\epsilon {\lambda }_{0}}{\left\langle k\right\rangle }{k}^{2},$$
(35)

which is monotonously increasing in k, proving that the invasion stage of the epidemic is always in the high-efficacy region.

### Transition zone and ϵr

We measure the impact of non-selective distribution (random targeting) as the expected value of f(k) over the degree distribution: $$\left\langle f\right\rangle$$. We compare it with the impact of targeting those at risk of causing superspreading events, as $$f(\infty )=\mathop{\lim }\limits_{k\to \infty }f(k)$$. The former – $$\left\langle f\right\rangle$$ – must be evaluated numerically. The latter is easy to derive from Eqs. (3)-(4):

$$f(\infty )=\frac{\psi }{z(1-\phi )}\frac{\epsilon }{1-\epsilon }.$$
(36)

The critical value ϵr is the efficacy value for which $$\left\langle f\right\rangle =f(\infty )$$.

### Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

## Data availability

Estimates of HIV prevalence and treatment coverage as discussed in Supplementary Note 6 and Supplementary Note 7 are available from the cited references in the Supplementary Information, and from UNAIDS at https://aidsinfo.unaids.org/(accessed February 2022).

## Code availability

The code used in this study is available here: https://github.com/steinegg/non_selective_distribution_prophylaxis48.

## References

1. Grant, R. M. et al. Preexposure chemoprophylaxis for HIV prevention in men who have sex with men. New Engl. J. Med. 363, 2587–2599 (2010).

2. UNAIDS Global AIDS Update—Confronting inequalities—Lessons for pandemic responses from 40 years of AIDS (2021). Accessed: 2021-05-09.

3. Bavinton, B. R. & Grulich, A. E. HIV pre-exposure prophylaxis: scaling up for impact now and in the future. Lancet Public Health 6, e528–e533 (2021).

4. Rutstein, S. E., Smith, D. K., Dalal, S., Baggaley, R. C. & Cohen, M. S. Initiation, discontinuation, and restarting HIV pre-exposure prophylaxis: ongoing implementation strategies. Lancet HIV 7, e721–e730 (2020).

5. Nichols, B. E., Boucher, C. A. B., van der Valk, M., Rijnders, B. J. A. & van de Vijver, D. A. M. C. Cost-effectiveness analysis of pre-exposure prophylaxis for HIV-1 prevention in the Netherlands: a mathematical modelling study. Lancet Infect. Dis. 16, 1423–1429 (2016).

6. Eaton, L. A. et al. Elevated HIV prevalence and correlates of PrEP use among a community sample of black men who have sex with men. J. Acquir. Immune Defic. Syndr. 79, 339–346 (2018).

7. Amico, K. R. & Bekker, L.-G. Global PrEP roll-out: recommendations for programmatic success. Lancet HIV 6, e137–e140 (2019).

8. Weiss, K. M. et al. Egocentric sexual networks of men who have sex with men in the United States: results from the ARTnet study. Epidemics 30, 100386 (2020).

9. Wahome, E. et al. An empiric risk score to guide PrEP targeting among MSM in coastal Kenya. AIDS Behav. 22, 35–44 (2018).

10. Cordioli, M. et al. Estimating the percentage of European MSM eligible for PrEP: insights from a bio-behavioural survey in thirteen cities. Sex. Transm. Infect. (2021).

11. Lancki, N., Almirol, E., Alon, L., McNulty, M. & Schneider, J. A. Preexposure prophylaxis guidelines have low sensitivity for identifying seroconverters in a sample of young Black MSM in Chicago. AIDS 32, 383–392 (2018).

12. Wei, C. & Raymond, H. F. Pre-exposure prophylaxis for men who have sex with men in China: challenges for routine implementation. J. Int. AIDS Soc. 21, e25166 (2018).

13. Chen, S. & Lu, X. An immunization strategy for hidden populations. Sci. Rep. 7, 1–10 (2017).

14. Hillis, A., Germain, J., Hope, V., McVeigh, J. & Van Hout, M. C. Pre-exposure prophylaxis (PrEP) for HIV prevention among men who have sex with men (MSM): a scoping review on PrEP service delivery and programming. AIDS Behav. 24, 3056–3070 (2020).

15. Muhib, F. B., Pecenka, C. J. & Marfin, A. A. Risk-based vaccines and the need for risk-based subnational vaccination strategies for introduction. Clin. Infect. Dis. 71, S165–S171 (2020).

16. Pastor-Satorras, R., Castellano, C., Van Mieghem, P. & Vespignani, A. Epidemic processes in complex networks. Rev. Mod. Phys. 87, 925–979 (2015).

17. Pastor-Satorras, R. & Vespignani, A. Immunization of complex networks. Phys. Rev. E 65, 36104 (2002).

18. Wang, Z. et al. Statistical physics of vaccination. Phys. Rep. 664, 1–113 (2016).

19. Eames, K. T. D., Read, J. M. & Edmunds, W. J. Epidemic prediction and control in weighted networks. Epidemics 1, 70–76 (2009).

20. Vidondo, B., Schwehm, M., Bühlmann, A. & Eichner, M. Finding and removing highly connected individuals using suboptimal vaccines. BMC Infect. Dis. 12, 51 (2012).

21. Holme, P. & Litvak, N. Cost-efficient vaccination protocols for network epidemiology. PLoS Comp. Biol. 13, e1005696 (2017).

22. Osat, S., Faqeeh, A. & Radicchi, F. Optimal percolation on multiplex networks. Nat. Commun. 8, 1540 (2017).

23. Erkol, S., Castellano, C. & Radicchi, F. Systematic comparison between methods for the detection of influential spreaders in complex networks. Sci. Rep. 9, 15095 (2019).

24. Rosenblatt, S. F., Smith, J. A., Gauthier, G. R. & Hébert-Dufresne, L. Immunization strategies in networks with missing data. PLoS Comp. Biol. 16, e1007897 (2020).

25. Rocha, L. E. C., Liljeros, F. & Holme, P. Information dynamics shape the sexual networks of Internet-mediated prostitution. Proc. Natl. Acad. Sci. USA 107, 5706–5711 (2010).

26. Oliver, N. et al. Mobile phone data for informing public health actions across the COVID-19 pandemic life cycle. Sci. Adv. 6, eabc0764 (2021).

27. Halloran, M. E., Longini, I. M., Jr. & Struchiner, C. J. Design and Analysis of Vaccine Studies. Statistics for Biology and Health (Springer-Verlag, New York, 2010).

28. Paunio, M. et al. Explosive school-based measles outbreak: intense exposure may have resulted in high risk, even among revaccinees. Am. J. Epidemiol. 148, 1103–1110 (1998).

29. Edlefsen, P. T. Leaky vaccines protect highly exposed recipients at a lower rate: implications for vaccine efficacy estimation and sieve analysis. Comput. Math. Methods Med. 2014, 813789 (2014).

30. Gomes, M. G. M., Gordon, S. B. & Lalloo, D. G. Clinical trials: The mathematics of falling vaccine efficacy with rising disease incidence. Vaccine 34, 3007–3009 (2016).

31. Newman, M. Networks (Oxford University Press, Oxford, New York, 2018), second edn.

32. Whittles, L. K., White, P. J. & Didelot, X. A dynamic power-law sexual network model of gonorrhoea outbreaks. PLoS Comp. Biol. 15, e1006748 (2019).

33. Molina, J.-M. et al. On-demand preexposure prophylaxis in men at high risk for HIV-1 Infection. N. Engl. J. Med. 373, 2237–2246 (2015).

34. Buchbinder, S. P. Maximizing the benefits of HIV preexposure prophylaxis. Top. Antivir. Med. 25, 138–142 (2018).

35. Ambrosioni, J., Petit, E., Liegeon, G., Laguno, M. & Miró, J. M. Primary HIV-1 infection in users of pre-exposure prophylaxis. Lancet HIV 8, e166–e174 (2021).

36. Aghaizu, A. et al. Sexual behaviours, HIV testing, and the proportion of men at risk of transmitting and acquiring HIV in London, UK, 2000-13: a serial cross-sectional study. Lancet HIV 3, e431–e440 (2016).

37. Jijón, S., Molina, J.-M., Costagliola, D., Supervie, V. & Breban, R. Can HIV epidemics among MSM be eliminated through participation in preexposure prophylaxis rollouts? AIDS 35, 2347–2354 (2021).

38. AVAC Global PrEP Tracker, Q2 2021 available from prepwatch.org.

39. United Nations, General Assembly, Political Declaration on HIV and AIDS: Ending Inequalities and Getting on Track to End AIDS by 2030, A/75/L.95 (7 June 2021) available from undocs.org/en/A/75/L.95.

40. Robineau, O., Velter, A., Barin, F. & Boelle, P.-Y. Hiv transmission and pre-exposure prophylaxis in a high risk msm population: a simulation study of location-based selection of sexual partners. PLoS ONE 12, e0189002 (2017).

41. Hansson, D., Strömdahl, S., Leung, K. Y. & Britton, T. Introducing pre-exposure prophylaxis to prevent hiv acquisition among men who have sex with men in Sweden: insights from a mathematical pair formation model. BMJ Open 10, e033852 (2020).

42. Landovitz, R. J. et al. Cabotegravir for HIV prevention in Cisgender men and transgender women. N. Engl. J. Med. 385, 595–608 (2021).

43. Castro, D. R., Delabre, R. M. & Molina, J.-M. Give PrEP a chance: moving on from the “risk compensation” concept. J. Int. AIDS Soc. 22, e25351 (2019).

44. Datoo, M. S. et al. Efficacy of a low-dose candidate malaria vaccine, R21 in adjuvant Matrix-M, with seasonal administration to children in Burkina Faso: a randomised controlled trial. Lancet 397, 1809–1818 (2021).

45. Ruktanonchai, N. W. et al. Identifying malaria transmission foci for elimination using human mobility data. PLoS Comp. Biol. 12, e1004846 (2016).

46. Sherman, J. & Morrison, W. J. Adjustment of an inverse matrix corresponding to a change in one element of a given matrix. Ann. Math. Stat. 21, 124–127 (1950).

47. Barrat, A., Barthélemy, M. & Vespignani, A. Dynamical Processes on Complex Networks (Cambridge University Press, 2008).

48. Steinegger, B. et al. Non-selective distribution of infectious disease prevention may outperform risk-based targeting (2022). https://doi.org/10.5281/zenodo.6418584.

## Acknowledgements

We acknowledge the Complexity72h workshop, held at IMT School in Lucca, Italy, 17–21 June 2019, where this study was conceived. B.S. acknowledges financial support from the European Unions Horizon 2020 research and innovation program under the Marie Skłodowska-Curie Grant Agreement No. 713679 and from the Universitat Rovira i Virgili (URV). I.I. acknowledges support from the James S. McDonnell Foundation 21st Century Science Initiative Understanding Dynamic and Multi-scale Systems - Postdoctoral Fellowship Award and from the Agence Nationale de la Recherche (ANR) project DATAREDUX (ANR-19-CE46-0008). A.S.T acknowledges support from FCT and the LASIGE and INESC-ID Research Units, refs: UIDB/00408/2020, UIDP/00408/2020, PTDC/EEI-SII/1937/2014, and UIDB/50021/2020. A.A. gratefully acknowledges the financial support of the Spanish Ministry of Science and Innovation under grant n. IJC2019-040967-I. P.C.F acknowledges financial support from the by the Spanish Ministerio de Ciencia, Innovación y Universidades (MICINN) under Projects FIS2016-78313-P and PID2019-109320GB-100/AEI/10.13039/501100011033. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The authors are grateful to Sally Blower and Vittoria Colizza for useful feedback.

## Author information

Authors

### Contributions

B.S. and E.V. conceived of, designed the study, and performed the theoretical calculations. B.S. set up and carried out the numerical calculations. B.S., I.I., A.S.T., A.B., P.C.F., A.A., E.V. discussed the results. I.I. designed the figures. E.V. wrote the manuscript. B.S. wrote the Supplementary Information. B.S., I.I., A.S.T., A.B., P.C.F., A.A., E.V. revised the manuscript.

### Corresponding author

Correspondence to Eugenio Valdano.

## Ethics declarations

### Competing interests

The authors declare that they have no competing interests.

## Peer review

### Peer review information

Nature Communications thanks Mirjam Kretzschmar, Gregory Phillips, and Joan Saldaña for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Steinegger, B., Iacopini, I., Teixeira, A.S. et al. Non-selective distribution of infectious disease prevention may outperform risk-based targeting. Nat Commun 13, 3028 (2022). https://doi.org/10.1038/s41467-022-30639-3

• Accepted:

• Published:

• DOI: https://doi.org/10.1038/s41467-022-30639-3