Introduction

Considerable sex disparity exists in the burden of tuberculosis (TB) across all regions of the world. Global estimates indicate that around 64% of all TB incidence in adults (aged \(\ge 15\) years) is among men1. The ratio of men-to-women for prevalent bacteriologically confirmed pulmonary TB shows wide geographic variations with ratios ranging from 1.2 in Ethiopia to about 5 in Vietnam1. Similarly, national TB prevalence surveys have found male-to-female ratios in prevalent bacteriologically-confirmed TB (M:F ratios) of around 2.2 across low- and middle-income countries2. Even in settings with generalised HIV epidemics, where the prevalence of HIV (a strong risk factor for TB) is higher among women than men, men have higher TB prevalence than women2.

The underlying reasons behind excess male burden in different measures are not fully understood. Some studies based on routinely diagnosed TB cases have suggested that the excess notifications among men is a result of systematic underdiagnosis or under reporting of TB in women due to differences in access to care between women and men3. However, prevalence surveys from different settings have usually shown not only a male dominance among undiagnosed cases, but also higher prevalence-to-notification ratios for men, implying a male disadvantage in accessing TB care2,4. In addition, observations that sex disparities in TB rates do not arise before the age of puberty3,5 have led to speculations around the role of biological and behavioural factors6,7. Several studies have established a link between TB and behaviours such as smoking and alcohol consumption8,9 which are more frequent in men than in women in many high TB burden settings1,6,10, and which are known to enhance the risk of TB progression and transmission11,12.

Sex differences in social roles between men and women can determine the patterns and location of social mixing relevant to transmission of Mycobacterium tuberculosis (M. tb)7. Previous studies have linked substantial TB incidence in adults to their preferential contact with other adults13. It has been estimated that 80–90% of infections are acquired from social contacts occurring outside households14,15,16. Many of these infections likely occur in congregate settings which enhance sex-assortative social mixing such as homeless shelters, prisons, public transports, and bars17,18. Consistent with these reports, a recent systematic review of social contact patterns found widespread sex-assortative social mixing, and also higher rates of workplace contact for men19. Modelling of the implications of social contact data from Zambia and South Africa for M. tb forces-of-infection suggested higher forces-of-infection among men due to sex-assortative mixing and higher male TB prevalence13. Sex-assortative mixing therefore provides a mechanism that could amplify disparities in TB burden between men and women arising from other underlying factors.

In this study, we use a sex-stratified TB transmission model to assess the relative role of social mixing, differential progression, and differential case detection rates in establishing and strengthening sex disparities in TB prevalence. Bayesian calibration of our model to two exemplar settings allows us to characterize parameter regions compatible with observations, and assess whether the residual uncertainties affect predicted intervention impact.

Methods

Conceptual framework, model structure, and priors

We use a conceptual framework similar to that developed for considering other social determinants of TB20, and implement these mechanisms within a standard compartmental transmission model structure for adults over 15 years of age (see Fig. 1). We consider three types of mechanism by which men may come to have a higher TB prevalence than women: (1) risk factors directly affecting the natural history of infection, progression and relapse; (2) risk factors that lead to worse care access and therefore longer duration of TB disease among men; (3) risk factors relating to transmission patterns. Data are lacking to support sex-specific mortality and self-cure from untreated TB21, and so we did not include this possibility.

Figure 1
figure 1

Model structure, duplicated for each sex. Red transitions are sex-dependent to represent different risks in disease progression and care access. Blue transitions represent infection, and are also sex-dependent due to assortative mixing. Not shown: death from all states at rate \(\mu\) (and at an additional rate \(\mu _t\) from active TB disease); birth into Uninfected, at rate to keep population fixed.

A key interest for us is the role of assortative mixing between men and women. We model this using a single parameter that interpolates between random mixing between the sexes (contacts equally shared across each sex), and no mixing between different sexes (only same-sex contacts), without changing the total contact rates (assumed the same for men and women). See Eq. (2) for mathematical specification. Natural history risk aspects affected may include increased risk of infection, increased risk of progression following (re)infection, as well as increased risk of relapse. Causes may include immunological differences between sexes as well as different levels of modifiable proximal TB risk factors such as smoking. We model these as a hazard ratio for progression or relapse, ie. we modelled differential fast progression for men as \(\epsilon _m=\alpha \times \epsilon /(1+\alpha )\) and for women as \(\epsilon _f=\epsilon /(1+\alpha )\), so that the average progression parameter, \(\epsilon\), can be informed by literature that does not stratify by sex. We modelled the sex-specific rates of reactivation (\(\nu\)) and relapse (\(\omega\)) analogously, using the same parameter \(\alpha\). The prior for factor \(\alpha\) is chosen based on genotypic studies that compared reactivation rates between men and women22, but we increased the reported standard deviation to 1 to allow for greater differences in other settings: see Table 1. Worse access to care may reflect differences in recognition and response to illness, barriers in accessing care, and health systems features; these are modelled in the same way as differential progression (with \(\delta\) playing the role of \(\epsilon\) and \(\pi\) the role of \(\alpha\)), i.e. as a hazard ratio in moving from symptomatic disease to treatment (see “Online Appendix”).

Table 1 Priors and sources for model parameters. NB the prior for \(\delta\) is defined by comparison to competing hazard parameters to give an uninformative prior on the probability of a TB case being notified (p).

We modelled a fixed population size with births equal to deaths, and half of the population being male at all times. With \(i=m,f\) for men and women, respectively, the ordinary differential equations for this model are given in Eqs. (1) and (2).

$$\begin{aligned} \frac{\mathrm {d}S_{i}}{\mathrm {d}t}&= \mu _{t}I_{i} + \mu N_{i} + (1-\theta ) \tau T_{i}- (\mu + \lambda _{i})S_{i}&\frac{\mathrm {d}E_{i}}{\mathrm {d}t}&= \lambda _{i}S_{i} + \psi \lambda _{i} L_{i}- (\mu +\kappa +\varepsilon _{i})E_{i} \nonumber \\ \frac{\mathrm {d}L_{i}}{\mathrm {d}t}&= \kappa E_{i} + \gamma I_{i} - (\mu +\upsilon _{i} +\psi \lambda _{i})L_{i}&\frac{\mathrm {d}I_{i}}{\mathrm {d}t}&= \varepsilon _{i}E_{i} + \upsilon _{i} L_{i} + \omega R_{i} -(\mu +\mu _{t}+\gamma + \delta _{i})I_{i} \nonumber \\ \frac{\mathrm {d}T_{i}}{\mathrm {d}t}&= \delta _{i} I_{i} - (\tau + \mu )T_{i}&\frac{\mathrm {d}R_{i}}{\mathrm {d}t}&= \theta \tau T_{i} - (\omega + \mu )R_{i}, \end{aligned}$$
(1)

Here, the states and rates are defined in Fig. 1. The total populations are \(N_i = S_i+E_i+L_i+I_i+T_i+R_i\). A fraction \((1-\theta )\) of those on anti-TB treatment are assumed to die. The forces-of-infection in men and women are given by

$$\begin{aligned} \begin{pmatrix} \lambda _m \\ \lambda _f \end{pmatrix}= M \begin{pmatrix} \beta _md_m \\ \beta _fd_f \end{pmatrix} =\begin{pmatrix} 1-(1-\rho )/2 &{} (1-\rho )/2 \\ (1-\rho )/2 &{} 1-(1-\rho )/2 \end{pmatrix} \begin{pmatrix} \beta _md_m \\ \beta _fd_f \end{pmatrix}, \end{aligned}$$
(2)

where \(d_i=I_i/N_i\). The state I represents bacteriologically-positive TB. We also consider differential infectiousness by sex as \(\beta _m=f_m\times \beta\) and \(\beta _f=f_f\times \beta\) where \(f_m\) and \(f_f\) are the fractions of prevalent TB cases that are sputum smear positive among men and women, respectively. This allows \(\beta\) to be identified with the Styblo ratio in defining a prior. The assortativity parameter \(\rho\) in the mixing matrix M ranges from \(-1\) (mixing only with the opposite sex), through 0 (representing random mixing), to 1 (completely within-sex mixing). The proportion, c, of effective contacts each sex makes with the same sex is given by \(c=(\rho +1)/2\). Below, we will refer to the one-sex model, which is defined by considering one of \(i=m,f\) separately with \(\rho =1\) (so that \(\lambda _i=\beta _id_i\)). Table 1 defines the priors we use for our model parameters, and the sources we base them on.

Reproduction number and invasion dynamics

In order to explore the impact of mixing on the early dynamics of TB within a susceptible population (invasion) for our model, we analytically calculate \(R_0\) and the M:F ratio during the associated exponential growth phase from the disease-free equilibrium. We calculate how these metrics vary with assortativity at different levels of other parameters.

Equilibria and sensitivity analysis

Similarly, we numerically compute the equilibrium TB prevalence and M:F ratio by running the model for 400  years, and explore how these vary with assortativity. To robustly characterise the importance of different parameters given our priors, including their interactions, we calculated the Sobol’ variance-based sensitivity indices28 for the outcomes of total TB prevalence and M:F ratio. We used the SALib Python library29, generating 2000 parameter sets using the priors detailed in Table 1. Code for all analyses is available at https://github.com/Debebe/tbsex.

Approach to calibration

We used Mathematica 12.130 to obtain quadratic equation for the one-sex model equilibrium prevalence by setting the left hand side of Eq. (1) to zero. This was of the form

$$\begin{aligned} Q(d;\beta ,\Psi ) = 0, \end{aligned}$$

where \(d = D/N\), and \(\Psi\) represents the parameters other than \(\beta\). The two-sex model has identical form for men and women (with different parameter values); the dynamics are coupled only through the force of infection in Eq. (2), with \(\lambda _i\), \(i=m,f\), playing the role of \(\beta d\) in the one-sex model. This means the condition for the two-sex model to be at equilibrium can be written

$$\begin{aligned}&Q(d_m;\lambda _m/d_m,\Psi _m) = 0 \\&Q(d_f;\lambda _f/d_f,\Psi _f) = 0. \end{aligned}$$

Approximate inference can then be performed using the log-likelihood

$$\begin{aligned} LL_{\mathrm {data}} + LL_{\mathrm {constraint}}+\log ({\text {prior}}(\beta ,\Psi )), \end{aligned}$$

where

$$\begin{aligned} LL_{\mathrm {constraint}} = - \left( \frac{Q(d_m;\lambda _m/d_m,\Psi _m)}{Q_{\text {tol}}}\right) ^2- \left( \frac{Q(d_f;\lambda _f/d_f,\Psi _f)}{Q_{\text {tol}}}\right) ^2 \end{aligned}$$

ensures that the prevalences are close to that generated at equilibrium under the input parameters (to within a tolerance \(Q_{tol}\)). Smaller values of \(Q_{tol}\) result in sampled prevalences that are more tightly peaked around equilibrium values implied by parameters. We used a value of \(Q_{tol}=10^{-6}\).

The log-prior is the sum over the individual log-priors in Table 1. The data likelihood is given by

$$\begin{aligned} LL_{\mathrm {data}} = -\frac{(d-d_{\text {target}})^2}{2\sigma _{\text {d}}^2} -\frac{(r-r_{\text {target}})^2}{2\sigma _{\text {r}}^2} -\frac{(n_m-n^m_{\text {target}})^2}{2s_{\text {m}}^2} -\frac{(n_f-n^f_{\text {target}})^2}{2s_{\text {f}}^2}, \end{aligned}$$

where d and r are the overall prevalence and M:F ratios, respectively, and \(n_i\) are the sex-specific notification rates. For Uganda, the observed values of d and r are 401 per 100,000 and 4.1 respectively31, while for Ethiopia the corresponding values were 277 per 100,000 and 1.232. We used the \(n_m\) and \(n_f\) values of respectively 161 and 79 per 100,000 for Uganda31, and 208 and 169 per 100,000 for Ethiopia1 (see Table 2). We assumed the standard deviations were 10% of their corresponding targets, i.e. \(\sigma _{\text {d}}=d_{\text {target}}/10\), \(\sigma _{\text {r}}=r_{\text {target}}/10\), \(s_{\text {m}}=n^m_{\text {target}}/10\), and \(s_{\text {f}}=n^f_{\text {target}}/10\).

Table 2 Posterior estimates for exemplar settings: calibration targets compared to data, and parameters determining sex difference for TB.

Calibration was performed using no-U-turn Hamiltonian Markov chain Monte Carlo using Stan33. Three chains were run for 10,000 iterations and the first 5000 runs were excluded as burn-in.

Exemplar settings and data

We chose one setting with high M:F ratio (Uganda) and one with low M:F ratio (Ethiopia). For these exemplar settings, we extracted data on sex-stratified bacteriologically-confirmed TB prevalence from national prevalence surveys of Uganda (2014–2015)34 and Ethiopia (2010–2011)32. In both settings, the proportions prevalent with smear-positive disease (which we used to parametrize relative infectiousness) were higher in men: \(f_m=0.45\) vs \(f_f=0.35\) in Ethiopia32, and \(f_m=0.43\) vs \(f_m=0.38\) in Uganda31. We also compiled notification and case detection rates from WHO reports on the same year of prevalence surveys in those settings1.

Sex-specific prevalence and notification rates for each setting were used as calibration targets (see previous section). All data used in this study represent only individuals older than 15 years. For Ethiopia, sex-stratified notifications for bacteriologically confirmed TB were not available in the year of prevalence survey (2011). We therefore generated male-to-female ratios for new and relapse TB notifications from the closest year (i.e., 2013), and applied these values to generate sex-specific bacteriologically confirmed notification calibration targets for 2011. Models for the two exemplar settings were identical and used the same prior distributions (Table 1), but used setting-specific values for background mortality (from United Nations World Population Prospects estimates) and anti-TB treatment success (from World Health Organization collated treatment outcome data).

Relevance to interventions; contribution of assortativity to prevalence M:F ratios

We simulated an intervention (eliminating the sex gap in access to care) and evaluated the relative impact on TB M:F ratio and prevalence after 10 years. Equalizing male case detection was implemented by setting the detection rate, \(\delta _m\), for men equal to its estimated value for women. We reported posterior mean and 95% quantiles of percentage reductions for each setting. Anticipating non-identifiability and potential posterior correlations between assortativity and other parameters influencing the M:F ratio, we assessed whether different combinations of parameters with similar posterior probability resulted in different dynamics under interventions. We selected three pairs of parameters that generate sex differences for TB: \(\alpha\), determining the relative risk of progression; \(\pi\), determining the relative detection rate; and \(\rho\), determining the assortativity of social mixing. For each parameter pair we drew an elliptical region with major and minor radii of respectively 0.05 and 0.02 on their joint posterior distributions. The ellipses were centered at the 25, 50 and and 75th quantiles for each variable, and were labelled respectively ‘L’ for low, ‘M’ for middle, and ‘U’ for upper. We used all samples with in ellipses for parameter pairs of interest and random samples of all other parameters to simulate the the impact of intervention. The number of parameters within each ellipse varied between 90 and 422 depending on sampled region, parameter combination and the exemplar setting.

To assess the contribution of assortativity to M:F ratios in these settings, we evaluated the mean percentage reduction in equilibrium M:F ratio (and 95% quantiles) from moving to a counterfactual with random mixing, across 200 samples from the full posterior.

Results

Influence of assortativity on invasion dynamics and equilibria

The one-sex model

For a compartmental TB transmission model such as this, we found the mean number of secondary cases generated at disease free equilibrium (\(R_0\)) as \(R_0=\) (infectiousness) \(\times\) (mean duration) \(\times\) (progression probability) \(= \beta \times T \times P\). For TB models the structure of Eq. (1), there are two routes of progression from infection to active disease—fast and slow—so that:

$$\begin{aligned} R_0 = \beta \times T \times (P_{\text {fast}} + P_{\text {slow}}). \end{aligned}$$

In our model,

$$\begin{aligned} P_{\text {fast}}&= \frac{\epsilon }{\epsilon + \kappa +\mu }&P_{\text {slow}}&= \frac{\kappa +\mu }{\epsilon + \kappa +\mu }\cdot \frac{\nu }{\nu +\mu }&T_1&= \frac{1}{\delta + \mu _t + \gamma + \mu }, \end{aligned}$$

where \(T_1\) is the mean duration of a single TB episode. In reality (and our model) multiple episodes of TB are possible due to relapse and self-cure followed by reactivation. The probability of each subsequent episode was therefore the sum over these routes, \(P_+ = P_{\text {self-cure}} + P_{\text {relapse}}\), and the mean time with TB, T, given as a sum over each possible number of episodes:

$$\begin{aligned} P_{\text {self-cure}}&= \gamma T_1 \frac{\nu }{\nu + \mu }&P_{\text {relapse}}&= \theta T_1 \frac{\omega }{\omega + \mu }&T&= T_1(1+P_+ + P_+^2 + \cdots ) = \frac{T_1}{1-P_+}. \end{aligned}$$

These formulae were used to compute \(R_m\) and \(R_f\)—the values of \(R_0\) for a one-sex model with parameters for men and women, respectively.

The two-sex model

For the two-sex model, \(R_0\) was calculated as the largest eigenvalue of the next generation matrix G, given by

$$\begin{aligned} G = \begin{pmatrix} P_m &{} 0 \\ 0 &{} P_f\end{pmatrix} M \begin{pmatrix} \beta _mT_m &{} 0 \\ 0 &{} \beta _fT_f\end{pmatrix}, \end{aligned}$$

where M is the mixing matrix defined in Eq. (2), and \(P_m\), \(\beta _m\), and \(T_m\) are the values of P, \(\beta\), and T for men (and similarly for women, with subscript f). This was solved exactly to give

$$\begin{aligned} R_0 = \frac{1}{2}(1+\rho ){\bar{R}}\left[ 1 + \sqrt{1-16\phi \rho /[(1+\phi )(1+\rho )]^2}\right] , \end{aligned}$$

where \({\bar{R}} = (R_m+R_f)/2\) and \(\phi = R_m/R_f\). Solving for the corresponding eigenvector allowed calculation of the M:F incidence ratio during exponential growth as a function of \(\rho\), which was found to satisfy

$$\begin{aligned} \frac{MF(\rho )}{MF(0)} = 2\left( \frac{1-\rho }{1+\rho }\right) \left[ 1-\phi + (1+\phi )\sqrt{1-16\phi \rho /[(1+\phi )(1+\rho )]^2}\right] ^{-1}, \end{aligned}$$

where \(MF(0)=P_m/P_f\). Thus, for a given \({\bar{R}}\) and MF(0) (respectively), \(R_0\) and the early M:F incidence ratio only depend on assortativity (\(\rho\)) and the ratio of male:female one-sex \(R_0\) values (\(\phi\)). The behaviour of \(R_0\) and \(MF(\rho )\) as assortativity varies is shown for various choices of \(\phi\) in Fig. 2a,b.

Figure 2
figure 2

Influence of assortativity (\(\rho\)) at invasion and equilibria for different \(R_m/R_f\). The effect of \(\rho\) on: (a) \(R_0\); (b) M:F ratio during early exponential growth from the disease-free equilibrium; (c) M:F ratio in TB prevalence at endemic equilibrium; (d) TB prevalence at endemic equilibrium. In (c, d), \(\beta\) and \(\alpha\) were solved to give \(R_0=2\) and fix \(R_m/R_f\), \(\rho\) was varied, and \(\pi =1\). Other parameters were the means of priors in Table 1.

Equilibria

The corresponding changes in equilibrium TB prevalence and equilibrium M:F ratio are shown in Fig. 2c,d. Analytical solution was not possible, and (unlike the invasion dynamics) these results may depend on the particular parameter combinations that were used. Parameter values corresponding to the means of priors in Table 1 were used, except for: \(\rho\), which was varied between 0 and 1; \(\beta\) and \(\alpha\), which were solved to fix \(R_0=2\) and various \(R_m/R_f\); and \(\pi\), which was set to 1 to allow \(R_m/R_f\) to attain a value of 1.

Sensitivity of all parameters’ influence on equilibria

At equilibrium the variance in TB M:F ratio is most influenced by a single model parameter—differential progression \(\alpha\), see Fig. 3. The first-order effect of \(\alpha\) on TB M:F ratio is 0.79. On the other hand, the differential detection parameter, \(\pi\) and the mixing parameter (\(m=(\rho +1)/2\) being the proportion of effective contacts that are same sex) have first order effects of 0.01 and 0.009, respectively. All other model parameters have first-order effects less than 0.009. By interacting with other model parameters: \(\alpha\), m, p, \(\beta\) and \(\pi\) attained total-order effects of 0.96, 0.04, 0.03, 0.03 and 0.02 respectively.

For equilibrium TB prevalence, the main influence comes from reactivation rate (\(\nu\)), detection rate (\(\delta\)), effective contact rate (\(\beta\)), and fast progression rate (\(\epsilon\)). Their total effects were attained by their interaction with other parameters (their sum is \(\ge 1\)).

Figure 3
figure 3

Sobol’ first-order (one-way) and total sensitivity indices (including interactions) for equilibrium prevalence M:F ratio and total TB prevalence (shown on x-axes). Distributions of parameters (y-axis) are as in Table 1 (\(\psi\)—partial protection, \(\nu\)—reactivation, \(\epsilon\)—fast progression, \(\kappa\)—stabilization, \(\omega\)—relapse, \(\gamma\)—self-cure, \(\mu _t\)—TB mortality, p—detection, \(\beta\)—effective contact, \(\rho =2m-1\)—assortativity, \(\pi\)—differential detection, \(\alpha\)—differential progression).

Calibration to exemplar settings

Explicit formulae for the quadratic function \(Q(d,\beta ;\Psi )\) used in fitting are given in the “Online Appendix”, as are example Markov chain plots of samples. The Gelman–Rubin convergence diagnostic statistic exhibited \({\hat{R}}<1.0009\) for all model parameters. For each exemplar setting, the fits to calibration target data and estimates of the parameters modelling sex disparities are shown in Table 2. The smoothed pairs plot for the 8-parameter posterior in Ethiopia is shown in Fig. 4.

Figure 4
figure 4

Correlation between model parameters. The lower left corner plot shows the smoothed posterior samples (\(\psi\)—partial protection, \(\nu\)—reactivation, \(\epsilon\)—fast progression, \(\kappa\)—stabilization, \(\omega\)—relapse, \(\gamma\)—self-cure, \(\mu _t\)—TB mortality, p—detection, \(\beta\)—effective contact, \(\rho\)—assortativity, \(\pi\)—differential detection, \(\alpha\)—differential progression). The upper right plots show the ‘L’, ‘M’, and ‘U’ ellipses used for the sensitivity analysis in Table 3.

Table 3 Sensitivity analysis of percentage reduction in TB prevalence and M:F ratios sampling from restricted regions of the posterior. ‘L’, ‘M’, and ‘U’ correspond to different elliptical regions shown top-right in Fig. 4. (\(\rho\)—assortativity, \(\pi\)—differential detection, \(\alpha\)—differential progression).

Intervention results; contribution of assortativity

Sampling across the whole posterior for each setting, we found the hypothetical intervention reduced TB prevalence after 10 years in comparison to the counterfactual of no intervention by 0.7% (95% CrI \(-0.3\) to 2.7%) for Ethiopia and 8% (95% CrI 4–14%) for Uganda. The M:F ratios reduced over the same time time period by 7% (95% CrI \(-10\) to 21%) for Ethiopia and 20% (95% CrI 11–31%) for Uganda. The 3 elliptical regions selected for each parameter pair (\(\pi ,\alpha\)), (\(\rho ,\alpha\)), (\(\rho ,\pi\)) in order to assess the impact of remaining uncertainty on projected intervention impacts are shown in Fig. 4 (top right three panels). Comparisons of posterior log-likelihoods for these regions are shown in the “Online Appendix”, alongside time series plots of prevalence under the intervention. The percentage reductions in TB prevalence and M:F ratio for each setting, parameter pair, and L, M, and U sampling regions are given in Table 3.

Sampling across the whole posterior for each setting, we found equilibria with random mixing had a prevalence M:F ratio that was 12% (\(-0.2\) to 30%) lower in Uganda. For Ethiopia, equilibria with random mixing has equilibrium prevalence M:F ratio lower by 3% (\(-0.5\) to 10%) compared to equilibrium M:F ratio based on assortativity parameter sampled from the whole region.

Discussion

Using a TB transmission dynamic model and Bayesian inference framework, we determined the extent to which social mixing and other factors contribute to sex disparity in TB prevalence. Our analysis shows that sex disparity in TB burden results from a complex interplay among several factors that determine progression, detection and social mixing, although different factors dominate at different stages of the epidemic. The impact of assortativity is much larger at invasion compared to its role at endemic equilibrium. Our fit to a high M:F prevalence ratio setting (Uganda) suggested 12% (0–30%) of the M:F ratio could be attributed to the effects of assortative mixing.

Tuberculosis invades more rapidly as assortative sex mixing increases because men are more likely to transmit infection to other men due to the more frequent interactions with males19 as well as due to higher burden of infectious TB among men32,35. Men have higher rates of mixing with other men meaning that they can generate more secondary cases preferentially among men. As a result, early growth sex disparity in TB burden between men and women is extremely wide. The importance of assortativity during invasion dynamics has relevance to TB outbreaks, and potentially to invasion and replacement by new strains (eg. drug-resistant strains) in TB-endemic settings.

At invasion, higher assortative mixing increases overall TB prevalence. At endemic equilibrium, increasing assortative mixing leads to a gradual initial rise in an overall equilibrium prevalence followed by a subsequent fall as assortativity approaches 1 (Fig. 2c). The peak represents a balance between the amplifying effects of increasing assortativity, and the folding in of infections among only the half of the population at higher risk (men) as assortativity approaches 1.

Although, we found lower male case detection rates in Uganda (our high M:F ratio setting), our finding for Ethiopia (our low M:F ratio setting) is different. In Ethiopia, case detection rates between men and women are nearly comparable, with empirical prevalence-to-notification ratios of 1.7 and 1.5, respectively—implying no considerable differences between men and women in accessing TB care. These findings are consistent with some studies36, and explain the limited impacts of our hypothetical intervention in this setting.

Interventions targeting only differential case detection are unlikely to eliminate sex disparities in TB burden alone. We found closing disparities in detection in a high M:F ratio setting (Uganda) would reduce the M:F ratio by around 20% over 10 years, but would also reduce TB prevalence by around 8%. Elimination of sex disparities in TB burden will therefore also require interventions targeting modifiable risk factors that are more common among men.

We found the predicted impact of narrowing case detection gaps was not strongly dependent on posterior parameter region in our high M:F ratio setting. This is reassuring, in that while data does not completely identify the exact contribution of different mechanisms to observed sex disparities in TB, our ignorance need not serve as a barrier to projecting the likely impact of sex-specific interventions. For Ethiopia, with only a small sex difference in baseline detection rates, the exact size of this differential did influence projected impact on M:F ratio, but not appreciably the impact TB prevalence.

A strength of this study was our inference approach, which treated all model parameters as uncertain, specified via priors, within a Bayesian framework. This was necessary in order to answer our question as to whether residual uncertainty after model fitting allowed for very different model predictions of intervention effect. Bayesian inference was made possible within Stan by introducing auxiliary prevalence variables and using an algebraic condition to fix these to the equilibrium solutions of the model. While Stan is capable of numerically solving ordinary differential equations, our algebraic log-likelihood allowed far more efficient (and in our experience) more reliable inference for a parameter space of this number of dimensions. Our approach may be applicable more widely in cases where a fit to an assumed equilibrium or steady decline through sparse data is sought.

Fitting our model under an assumption of equilibrium is a weakness of our study. Inevitably, TB epidemiology is not at equilibrium in any setting. However, our choice is defensible given that typical TB dynamics exhibit only single-digit annual percentage declines, and given a single prevalence survey and the substantial technical advantages discussed above bought by this assumption. Our model did not consider other subgroups that tend to mix non-randomly in real societies. Assortative mixing by other risk factors for TB, e.g. socioeconomic level, are likely to exhibit analogous behaviour, with disparities in TB burden amplified by mixing. Notably, we did not consider age; varying contact rates by age and simultaneous patterns of non-random mixing by age and sex, together with non-uniform TB burden by age may nuance our findings. In particular, some patterns of sex-dependent age mixing could weaken the effects of sex assortativity we describe: for example, older men preferentially mixing with younger women, as has been documented for sexual partnerships in some settings37. Such mixing patterns could vary between settings and limit generalizability, but have not yet been documented for effective contacts relevant to respiratory infections. Crowded urban locales or accommodation with more men than women in some settings38, for example where young men have moved seeking work, likely intensify the sex-assortative mixing relevant to TB that we model.

We also did not explicitly model HIV, whose prevalence is often unequal by sex. Such differences only appear in our model as averaged effects on progression in each sex. Indeed, the higher prevalence of HIV among women in many sub-Saharan African countries may reduce the sex disparities observed for TB2,39. In our approach, differential progression rates due to sex differences in HIV prevalence or antiretroviral therapy coverage would manifest in the calibrated values for the relative progression parameter.

Assortative mixing has been shown to introduce potential biases in the magnitude of associations between risk factors and outcomes in infectious disease epidemiology40. Similarly to this, our work shows that ascribing observed sex differences in TB burden to specific pathways without considering differences in exposure due to assortative mixing will result in measures of association that are biased upwards. These biases may apply to data we have used to parametrize priors for differential progression. However, these data were from a very low transmission setting, and the relatively large prior uncertainty in this parameter (\(\alpha\)) allows it to change in fitting to settings with large M:F ratios. The large uncertainty in the prior for \(\alpha\) partially explains the dominance of this parameter in the M:F uncertainty analysis (Fig. 3). Finally, we only considered a simple hypothetical intervention around equalizing case detection. Modelled impact may still depend on residual uncertainty for other types of intervention. Future work should consider inclusion of these features we have simplified, and realistic policy options that target disparities in detection, or potentially modifiable risk factors among men.

In conclusion, our analysis showed that assortative mixing does play a role in sex disparities in TB burden, and that this is particularly important under invasion dynamics. We found that equalizing case detection between the sexes alone is insufficient to eliminate disparities in burden. However, in settings with suboptimal case detection and large sex disparities in TB prevalence, this potentially achievable goal could still achieve meaningful reductions in overall TB burden.