Introduction

Tsetse flies (Diptera: Glossinidae) are distributed in sub-Saharan Africa between 15° north and 26° south latitude1. Glossina spp. are the cyclic vectors2 of unicellular protozoa of the genus Trypanosoma that cause African animal trypanosomosis (AAT) or nagana and human African trypanosomosis (HAT) or sleeping sickness3,4. Nagana in cattle is mainly caused by T. congolense, T. vivax and T. brucei brucei5 and causes annual losses to agriculture estimated at $4.75 billion6. In addition, around 35 million doses of trypanocidal drugs are administered to livestock per year for managing AAT7. Human African trypanosomosis is fatal without treatment8 and is caused by two Trypanosoma subspecies, i.e. T. brucei rhodesiense responsible for the acute form of HAT in East Africa and T. b. gambiense for the chronic form of HAT in western and central Africa9. The lack of effective vaccines and the development of resistance to the available trypanocidal drugs makes the control of AAT in the vertebrate host unsustainable10,11. Consequently, an effective tool to reduce Trypanosoma transmission would be the control of the tsetse vector. One effective method to manage populations of tsetse flies is the sterile insect technique (SIT) when used as part of an area-wide integrated pest management (AW-IPM) approach12,13. The SIT method relies on the mass-production and sterilization of male flies by ionizing radiation. The sterile males are released in the target area for mating with wild females and the absence of offspring will gradually reduce the density of the targeted tsetse populations14.

The biological transmission of the Trypanosoma species requires the parasite to undergo a series of proliferation and differentiation steps in the tsetse alimentary tract and finally mature into an infective form in the mouthparts (T. congolense) or salivary glands (T. brucei spp.)15. However, tsetse flies are refractory to Trypanosoma infection meaning that the probability that Trypanosoma ingested during a blood meal complete their developmental cycle in the fly to result in a mature infection is rather low16,17,18. The endogenous bacterial microbiome seems important in providing tsetse flies the natural ability to mitigate Trypanosoma infections19. Three major endosymbiotic bacteria have been identified in tsetse flies, i.e. Wigglesworthia glossinidia, Sodalis glossinidius (hereafter mentioned as Sodalis) and Wolbachia pipientis20. Some studies suggested that the obligate mutualist Wigglesworthia must be present in the larval stage during the development of a mature tsetse fly to properly develop a well-functioning immune system contributing to a refractory phenotype against Trypanosoma5,19.

Sodalis, the second mutualistic symbiont, can be found in the midgut, hemolymph, muscles, fat body, milk glands, and salivary glands of certain tsetse species and is inherited by the progeny through transovarial transmission21. The biological role/importance of Sodalis for tsetse remain unclear and needs to be clarified22. This symbiont might provide some benefits to the host as flies without Sodalis have a significantly shorter lifespan as compared with flies with it23, however the establishment of a Sodalis free colony was feasible24. Sodalis also presents many ideal characteristics to be used for expressing molecular effectors in paratransgenic tsetse25. In addition, previous work suggested that Sodalis may modulate the ability of Trypanosoma to establish an infection in the tsetse midgut as some studies reported that the elimination of this bacterial endosymbiont results in an increased tsetse fly refractoriness to Trypanosoma infection23,26,27. Moreover, Geiger et al.,28 suggested that specific genotypes of Sodalis presents in G. p. gambiensis from insectary colonies facilitate Trypanosoma infection. Soumana et al.,29 revealed that a variation in the Sodalis population caused by a hosted prophage can influence the trypanosome infections. In contrast, a recent study demonstrated that the absence or presence of S. glossinidius in the tsetse fly does not affect the fly’s susceptibility toward Trypanosoma infection24. In conclusion, from the above-described results, it is clear that our knowledge on the impact of Sodalis on Trypanosoma infection in tsetse remains limited and fragmented and is still under debate23. Moreover, exploring on a large scale the occurrence and possible association between Sodalis and Trypanosoma infection in wild flies is highly required. The above described potential impact of Sodalis to facilitate Trypanosoma infection in tsetse, and the fact that Sodalis is found in all laboratory-reared tsetse colonies and some wild populations21 indicates that mitigating action, such as feeding the flies 2–3 times on blood supplemented with trypanocidal drugs before release, is required in SIT programs to minimize the risk of disease transmission by the large number of released males that harbour Sodalis.

Field studies in two HAT foci in Cameroon used PCR to detect Trypanosoma and Sodalis in G. palpalis palpalis and the results indicate that the presence of Sodalis favours Trypanosoma infections especially by T. brucei s.l.30. Furthermore, in the wildlife-livestock-human interface in the Maasai Mara National Reserve in Kenya, it was shown that G. pallidipes infected with Sodalis was associated with increased Trypanosoma infection rates31. However, other studies have found no strong association between trypanosome and Sodalis in some tsetse species collected in four locations in Kenya32. Channumsin et al.,33 suggested that the association between Trypanosoma infection and the presence of Sodalis will vary depending on tsetse and Trypanosoma species. Similarly, studies carried out in the Fontem focus in Cameroon did not find a relationship between the endosymbiont and the parasite in G. p. palpalis34, and no significant Sodalis-Trypanosoma infection association was found in G. tachinoides in two sites of the Faro and Déo Division in Adamawa region of Cameroon35. Likewise, no association between the presence of the parasite and Sodalis was found in G. brevipalpis, G. m. morsitans and G. pallidipes in the Luambe National Park of Zambia36.

The overall objective of this study was to evaluate the prevalence of Sodalis and Trypanosoma in wild tsetse populations at a continental scale, i.e. Burkina Faso, Democratic Republic of Congo (DRC), Eswatini, Ethiopia, Ghana, Guinea, Kenya, Mali, Mozambique, Senegal, South Africa, Tanzania, Uganda, Zambia, and Zimbabwe and analyse these data in the context of a possible association between the occurrence of Sodalis and Trypanosoma infection in tsetse. Such information might guide the decision maker for SIT programmes to take the appropriate action, if necessary, to minimize any potential risk of increased transmission.

Results

Trypanosoma prevalence

Adult tsetse flies (n = 6860) were screened for infection with T. brucei sspp (Tz) (T. b. brucei, T. b. gambiense, T. b. rhodesiense), Tc (T. congolense savannah; T. congolense kilifi; T. congolense forest); Tsg (T. simiae; T. simiae tsavo; T. godfreyi) and Tv (T. vivax). The results indicate that 1736 (25.3%) adults were infected with one or more Trypanosoma taxa (Tables 1, 2 and 3), The Trypanosoma prevalence varied significantly between tsetse taxa (X2 = 750.18, df = 9, P <<  0.001) and between countries (X2 = 2038.1, df = 14, P  <<  0.001). The Permanova analysis indicated as well significant differences between countries (P = 0.009) and taxa (P = 0.041) (Table 4). As all taxa were not collected from all countries, the interaction between taxa and countries was only analyzed where a taxon was collected from several countries.

Table 1 Global prevalence of Sodalis and Trypanosomes in tsetse samples analyzed per country.
Table 2 Global prevalence of Sodalis and Trypanosomes in tsetse samples analyzed per tsetse species.
Table 3 Global prevalence of Sodalis and trypanosomes in tsetse samples analyzed per country and tsetse species.
Table 4 Permanova analysis for Countries and tsetse species for Sodalis and trypanosome (single and multiple) infection prevalence.

Regardless of tsetse taxon, in west African countries the average Trypanosoma prevalence was 20% (n = 3733), with the highest prevalence recorded in Ghana (61%) and the lowest recorded in Guinea (2.2%). The prevalence in Burkina Faso, Mali and Senegal was 21.9, 6.9 and 14.2% respectively (Fig. 1, and Table 1). In east, central and southern African countries, the Trypanosoma infection prevalence was a bit higher than in west African countries with an averaged infection of 31.5% (n = 3127), with the highest prevalence (53.6%) in Zimbabwe and lowest prevalence (2.9%) in DRC. No Trypanosoma infection was detected in Eswatini (Fig. 1 and Table 1). Regardless of the country, Trypanosoma prevalence varied from one taxon to another, and G. m. morsitans showed the highest Trypanosoma prevalence (41%) followed by G. pallidipes (38.5%) and the lowest prevalence was detected in G. brevipalpis (9.71%) in east, central and southern Africa. In west Africa, G. medicorum showed the highest Trypanosoma prevalence (39.5%) and the lowest prevalence was detected in G. p. palpalis (2.8%) (Table 2).

Figure 1
figure 1

The geographical locations of tsetse samples in Africa. Circles indicate the total prevalence of Sodalis and Trypanosoma per country. Black dots indicate samples collection site(s) per country.

Some tsetse taxa were collected from several countries as presented in Fig. 2 and Table 3. The highest Trypanosoma prevalence was recorded in G. tachinoides in Ghana (61%). This was followed by high prevalence in G. m. morsitans collected from Zimbabwe (53.9%), Tanzania (53%) and Zambia (48.4%). G. pallidipes from Zimbabwe, Kenya, Zambia and Tanzania also showed high Trypanosoma prevalence of 52.7%, 50.9%, 45.2% and 37.3%, respectively. The lowest Trypanosoma prevalence was found in G. p. gambiensis from Guinea (2.2%). Based on the Trypanosoma prevalence presented in Fig. 2 and Table 3, the tested samples can be categorized as: (i) tsetse samples with high prevalence (> 35%) detected in G. tachinoides from Ghana; G. medicorum from Burkina Faso, G. pallidipes from Kenya, Zambia, and Zimbabwe, G. m. morsitans from Tanzania, Zambia, and Zimbabwe; (ii) tsetse samples with medium prevalence (10–35%) detected in G. austeni from South Africa, G. f. fuscipes from Kenya and Uganda, G. m. submorsitans from Burkina Faso, G. p. gambiensis from Burkina Faso and Senegal and G. tachinoides from Burkina Faso; (iii) tsetse samples with low prevalence (< 10%) detected in the rest of the samples listed in Table 3 except the G. austeni collected from Eswatini. Despite the difference in Trypanosoma prevalence for each tsetse species, the differences were significant only in G. p. gambiensis (X2 = 26.71, df = 4, P < 0.001) and G. tachinoides, (X2 = 9.38, df = 1, 2, P = 0.002). In contrast, no significant difference was detected between countries for G. austeni (X2 = 1.47, df = 4, P = 0.688), G. brevipalpis (X2 = 0.34, df = 2, P = 0.559), G. f. fuscipes (X2 = 0.15, df = 2, P = 0.702), G. m. morsitans (X2 = 1.04, df = 3, P = 0.593) and G. pallidipes (X2 = 4.983, df = 1,6, P = 0.418) (Table 3). No Trypanosoma infection was recorded in G. austeni from Eswatini. The best glm model (lowest AICc) selected for the overall Trypanosoma prevalence retained the countries as variables that fitted the data well (AICc = 1521.35) (Supplementary File 1).

Figure 2
figure 2

Prevalence of the Sodalis and Trypanosoma (single and multiple) infections per country (A) and tsetse species (B). Prevalence data were square root transformed and averaged based on country-species and the matrix display was conducted in PRIMER version 7 + software. Tree on the left of the matrix is the similarity dendrogram based on the similarity index of the square root of the prevalence values. The colour index is the square root of the prevalence values ranged 0–9 which is the square root of 0–81% prevalence. Country abbreviations follow the UNDP list of country codes https://web.archive.org/web/20060713221355/http://refgat.undp.org/genericlist.cfm?entid=82&pagenumber=1&requesttimeout=360 as follows: BKF: Burkina Faso; ETH: Ethiopia; GHA: Ghana; GUI: Guinea; KEN: Kenya; MLI: Mali; MOZ: Mozambique; SAF: South Africa; SWA: Eswatini; ZAI: Democratic Republic of the Congo; ZAM: Zambia; ZIM: Zimbabwe. Tsetse, Sodalis and Trypanosoma taxa were abbreviated as following: Ga: Glossina austeni; Gb: G. brevipalpis; Gff: G. fuscipes fuscipes, Gmm: G. morsitans morsitans; Gmsm: G. m. submorsitans; Gpg: G. palpalis gambiensis; Gpp: G. palpalis palpalis. Sod: Sodalis, Tc: Trypanosoma. congolense savannah; T. congolense kilifi; T. congolense forest, Tsg: T. simiae; T. simiae Tsavo; T. godfreyi, Tv: T. vivax, Tz: T. brucei brucei, T. b. gambiense, T. b. rhodesiense.

Prevalence of different Trypanosoma taxa and mixed infections

The above-mentioned prevalence of Trypanosoma infection was comprised of several different Trypanosoma species and sub-species. Based on the size of the amplified fragment by PCR, the Trypanosoma infection was categorized into four groups: (i) the Tc group including the different forms of T. congolense; (ii) Tv group including T. vivax infections; (iii) T. brucei sspp (Tz) group including T. b. brucei, T. b. gambiense, T. b. rhodesiense, and infections; and (iv) Tsg group including the infections with T. simiae, T. simiae tsavo and T. godfreyi. The screening results revealed that tsetse flies could be infected with single or multiple (double or triple) taxa of Trypanosoma, and the proportion of the infections with the different Trypanosoma taxa and the mixed infection varied with country (X2 = 63.56, df = 14, P < 0.001) and species (X2 = 21.86, df = 9, P < 0.001) (Supplementary File 1).

The prevalence of the different Trypanosoma species with respect to the above-mentioned groups, indicate that infections with the Tsg group was the highest regardless of countries or tsetse species with an average of 7.06%. The infection rate was higher (14.13%) in east, central and southern African countries than in west Africa (1.13%). Tv infection averaged at 6.75% but with higher prevalence in west African countries (10.37%) than in east, central and southern Africa (2.43%). The prevalence of Tc infection was lower than Tv and Tsg group with an average of 4.78% with higher prevalence in central and southern Africa (8.38%) than in west Africa (1.77%). The Tz group had the lowest prevalence with an average of 2.29%. Like Tv infection, the Tz prevalence was higher in west Africa (3.16%) than central and southern Africa (1.25%).

The prevalence of infection by a single Trypanosoma group varied significantly from one country to another and from one tsetse species to another. For Tc, Tv, Tz and Tsg the infection prevalence varied significantly with country (X2 = 47.74, df = 14, P < 0.001, X2 = 27.40, df = 14, P = 0.01705, X2 = 106.11, df = 14, P = 0. 001 and, X2 = 44.74, df = 14, P = 0.001 respectively). Regardless of tsetse species, the highest infection rate for Tc, Tv, Tz and Tsg was found in Tanzania (14.20%), Ghana (14.10%), Ghana (19.66%) and Zimbabwe (39.81%), respectively (Supplementary Table 1). Similarly, the prevalence of Tc, Tz and Tsg varied significantly with tsetse species (X2 = 40.364, df = 1.9, P  <<  0.001, X2 = 58.253, df = 1.9, P << 0.001 and X2 = 34.871, df = 1.9, P << 0.001, respectively), however no significant difference was found in Tv prevalence between tsetse species (X2 = 5.475, df = 1.9, P = 0.07868). Regardless of the country, the highest infection rate of Tc, Tv, Tz and Tsg was found in G. pallidipes (10.68%), G. tachinoides (12.92%), G. medicorum (13.64%) and G. m. morsitans (22.76%), respectively (Supplementary Table 2). No Tc infection was found in samples of G. austeni collected from Eswatini and Tanzania, G. brevipalpis from Mozambique, G. p. palpalis from DRC and G. p. gambiensis from Guinea. In addition, no Tv infection was detected in G. austeni collected from Eswatini and Mozambique, G. m. morsitans from Kenya and Zambia, G. pallidipes from Uganda and Zimbabwe. For Tz, G. austeni collected from Eswatini and Mozambique, G. brevipalpis from Mozambique, G. f. fuscipes from Kenya, G. m. morsitans from Kenya and Zambia, G. p. palpalis from DRC and G. p. gambiensis from Guinea did not show any infection (Fig. 2 and Supplementary Table 3).

Mixed infections of Trypanosoma groups (double or triple) are rare events with an average prevalence between 0.09 and 1.71% regardless of country or tsetse species. However, double infections seem to be more frequent in some countries than others (X2 = 35.01, df = 14, P = 0.001) for Tv–Tz and in some tsetse species than others (X2 = 21.20, df = 9, P = 0.012) for Tv–Tz (Supplementary File 1). The highest prevalence of the mixed infections Tv–Tz and Tc–Tz were observed in Ghana with 12.39% and 10.68%, respectively, regardless of tsetse species. Although the average Tc–Tsg prevalence was higher than that of Tv–Tz and Tc–Tz, the highest mixed infection with it was found in Zambia with 9.05%. Regardless of the country, the highest mixed infection of Tc–Tsg detected per tsetse species was ~ 5% in G. m. morsitans and G. pallidipes. The mixed infection of Tsg with either Tv or Tz or both was lower than 2% regardless of the country or tsetse species. Taking into account both the country and tsetse species, the highest mixed infection of Tc–Tsg (12.5%) was detected in G. m. morsitans in Zambia. However, the highest prevalence of Tc–Tz (10.68%) and Tv–Tz (12.39%) was detected in G. tachinoides from Ghana. Although the average prevalence of Tv–Tsg was low (0.54%), a relative high infection rate of 6.17% was found in G. m. morsitans from Tanzania.

A triple infection of Trypanosoma groups (Tc–Tv–Tz) was only detected in G. medicorum from Burkina Faso (1.30%) and G. tachinoides from Ghana (1.71%) (Fig. 2 and Supplementary Table 3, Supplementary File 1).

Prevalence of Sodalis infection

The prevalence of Sodalis infection based on the PCR results varied significantly with country (X2 = 108.02, df = 1, 14, P << 0.001) and tsetse species (X2 = 69.60, df = 9, P < 0.001). The best glm model (lowest AICc) selected for the overall Sodalis prevalence retained the countries, the species and their interaction (where possible) as variables that fitted the data well (AICc = 1296.12). Similar to the prevalence of Trypanosoma, the average Sodalis prevalence in east, central and southern Africa (24.6%) was higher than in west Africa (2.70%). Regardless of tsetse species, the highest prevalence of Sodalis infection was found in Tanzania (67.1%) followed by Uganda (43.3%), Kenya (28.5%) and Ethiopia (20.48%) (Table 1). The highest prevalence of Sodalis infection in west Africa was found in Guinea (28.6%). No Sodalis infection was found in Ghana, Mali, Senegal or Eswatini. Regardless of the country, the highest Sodalis prevalence per tsetse species was detected in G. m. morsitans (42.27%) followed by G. pallidipes (30.74%). No Sodalis infection was detected in G. tachinoides. The prevalence of Sodalis infection changed when both the countries and tsetse species are taken into consideration (Table 4). Based on the Sodalis prevalence the tsetse samples can be categorized into four groups: (i) samples with high prevalence (> 50%) (ii) samples with medium prevalence (between < 10% and > 50%) (iii) samples with low prevalence (between > 0% and 10%) and (iv) samples with no Sodalis infection as shown in Fig. 2 and Table 4. The samples showing high Sodalis prevalence includes G. m. morsitans from Kenya (63.5%) and Tanzania (76.5%) and G. pallidipes from Tanzania (74.6%) and Uganda (75%), however the samples with no Sodalis infection includes G. austeni from Eswatini, G. p. gambiensis from Mali and Senegal and G. tachinoides from Burkina Faso and Ghana indicating that there is 95% confidence that the infection rate is less than or equal to 10%, 0.82%, 0.55%, 1.28% and 0.36%, respectively.

Interactions between Sodalis and Trypanosoma infections

Prevalence of co-infections of Sodalis with Trypanosoma

The screening results indicated that the single infection rate was 9.3% (n = 638) and 21.9% (n = 1503) for Sodalis and Trypanosoma, respectively, over all taxa and countries (Fig. 3A). No Sodalis infection was found in G. tachinoides, and therefore was excluded from the analysis. A Cochran–Mantel–Haenszel test for repeated tests of independence showed that infection with Sodalis and Trypanosoma did deviate from independence across all taxa (χ2MH = 41.73, df = 1, P < 0.001) and individual Chi squared tests for independence for each taxon showed significant deviation from independence at the Bonferroni corrected α = 0.00833 in G. pallidipes (P < 0.001) and G. p. gambiensis (P < 0.001) (Supplementary Table 4). The prevalence of coinfection of Sodalis and Trypanosoma in wild tsetse populations varied with tsetse taxon and location. No coinfection was found in many taxa and many locations. The co-infection was found only in G. f. fuscipes (2.73%), G. m. morsitans (15.72%) and G. pallidipes (9.22%) in east, central and southern Africa (Fig. 3B, Table 5 and Supplementary Table 4).

Figure 3
figure 3

Prevalence of coinfection of Sodalis and Trypanosoma infection in wild tsetse populations. (A) Prevalence of coinfection, (B) prevalence of coinfection by tsetse taxa.

Table 5 Distribution of the association between the presence of Trypanosoma spp and the presence of Sodalis according to the tsetse species and the country.

Impact of co-infection on Trypanosoma and Sodalis density

Attempts were to assess the density of Trypanosoma and Sodalis under single (S/T+) and (S+/T) or double infection (S+/T+) conducted using qPCR with primers mentioned in Supplementary Table 5. The results show that Sodalis infections did not have a significant impact on Trypanosoma density (X2 = 0.648, df = 2, P = 0.723), however the median value of (S+/T+) files were slightly (S/T+) lower than (S+/T) and (S/T+) flies and the number of outlier samples with higher trypanosome density (S/T+) flies (Fig. 4A). Trypanosoma infections significantly reduced the density of Sodalis as indicated by comparing (S+/T+) flies with (S+/T) flies (P = 0.014) although the median values in (S+/T+) files is higher than the other samples indicating that the increased of Sodalis density in (S+/T) might be affected with the outlier flies with high Sodalis density (Fig. 4B). No significant different was found in the Trypanosoma density determined by qPCR in the flies tested negative (S+/T) or positive (S+/T+) and (S/T+) with the standard PCR, however, Sodalis density showed significant difference between flies with different infection type (X2 = 14.54, df = 2, P < 0.001) (Fig. 4B). The results showed no correlation between Sodalis and Trypanosoma density (r = 0.007, t = 0.055, df = 69, P = 0.9561) Supplementary Fig. 2, Supplementary File 1).

Figure 4
figure 4

Impact of co-infection with Trypanosoma and Sodalis on Trypanosoma (A) and Sodalis (B) density in Glossina pallidipes and G. m. morsitans. Bars marked with the same lower-case letter do not differ significantly at the 0.05 level.

Discussion

The implementation of the SIT in the context of an AW-IPM strategy to eradicate tsetse flies relies on the release of sterile males in the targeted area. This was successful in eradicating a population of G. austeni from Unguja Island of Zanzibar37 and significant progress has been made in the eradication programme implemented against G. p. gambiensis in the Niayes area of Senegal38. However, as both male and female tsetse flies are vectors of Trypanosoma species, the release of large numbers of sterile male flies bears a potential risk of temporarily increasing disease transmission during the initial release phase of an SIT programme39. Therefore, mitigating measures are required to reduce or eliminate this potential risk, especially in areas where sleeping sickness (HAT) is endemic. To date, to mitigate such risks, sterile males are offered two or three blood meals mixed with the trypanocidal drug isometamidium chloride, before being released which reduces the risk of Trypanosoma transmission significantly but does not eliminate it40,41. In addition, other approaches were proposed to minimize such risks such as paratransgenesis42,43 and combining paratransgenesis with SIT44.

The vector competence of tsetse flies for different trypanosome species is highly variable and is suggested to be affected by various factors, amongst which bacterial endosymbionts. Here, the interaction of Sodalis glossinidius with tsetse trypanosome infection is still under debate. Several studies reported a potential positive correlation between Sodalis and Trypanosoma infections28,30,32,36,45,46,47,48, leading to the hypothesis that Sodalis might facilitate the establishment of Trypanosoma infections in the tsetse midgut23,26,27. However, other studies indicated the lack of correlation between Sodalis and Trypanosoma infection34,35,36. The presence of Sodalis infections in tsetse rearing colonies has been well studied and previous studies indicated that Sodalis is more frequently present in colonized tsetse flies than in wild tsetse populations36,49 with a prevalence of 80 and 100% in colonized G. m. morsitans and G. p. gambiensis, respectively49,50, which is higher than the symbiont prevalence in wild populations of these tsetse species. This seems to indicate that the rearing process of tsetse flies favours the transmission and spread of Sodalis infections within the colonized population. Recently, colonies of G. pallidipes, G. p. gambiensis, G. f. fuscipes, G. m. morsitans, G. m. centralis and G. m. submorsitans maintained at the FAO/IAEA Insect Pest Control Laboratory were screened for Sodalis infections and showed a 100% prevalence of Sodalis; only the G. brevipalpis colony had a lower prevalence of 95% (data not shown). Taken into consideration that mass-rearing conditions enhances Sodalis infections and that Sodalis infections might facilitate the establishment of a Trypanosoma infection in the midgut, sterile male tsetse flies that are derived from colonies might be effective vectors for different Trypanosoma species and, therefore, might increase the trypanosome transmission after flies being released. It is therefore important that the managers and planners of SIT programmes are aware which tsetse species show a positive correlation between Sodalis and Trypanosoma infections to be able to take the necessary mitigating actions.

Various studies have examined the prevalence of Sodalis and Trypanosoma species in wild tsetse populations30,32,35,45,51, but our study presents for the first time the prevalence of Sodalis and Trypanosoma species on a continent-wide scale. In addition, the DNA extraction and PCR methods we have used were standardized and were all carried out in one laboratory to avoid discrepancies in the results due to different handling of tsetse samples or to different methods to discriminate trypanosome species in tsetse tissues. Our results indicate that Sodalis and Trypanosoma prevalence varied with tsetse species and geographical location (with an overall trypanosome prevalence of 23.5%), which agrees with many previous studies52. A high Trypanosoma prevalence (> 30%) was found in G. m. morsitans and G. pallidipes from central and east Africa. This finding is in agreement with previous reports on G. m. morsitans and G. m. centralis from Zambia36,52 and G. m. morsitans sampled in Malawi53. Moreover, a high prevalence of Trypanosoma infection in G. pallidipes was also previously reported in Tanzania54 and Kenya33. However, another study in northern Tanzania indicated a lower prevalence of Trypanosoma infection (< 10%) both in G. m. morsitans and G. pallidipes55.

Our study showed that the prevalence of different Trypanosoma species and or subspecies can be different in different tsetse taxa. In G. tachinoides in Ghana, the Trypanosoma vivax (Tv) infection was high (> 10%) as well as the infections of the T. brucei sspp (Tz) and the T. simiae/T. godfreyi (Tsg) group and the mixed infections of Tv–Tsg. In contrast, the prevalence of T. congolense was very low. These results are in agreement with the prevalence of T. brucei s.l (11%) and T. congolense forest type (2.6%) reported in the same tsetse species in Cameroon. The same study reported a prevalence of 13.7% of T. congolense savannah type35, which was not observed in our study. Our results of trypanosome infection rates in G. tachinoides also agree with former studies56,57, except for T,c for which a high fly infection rate (31.8%) was previously shown57. The Tc infection rates in our study were high in G. pallidipes and G. m. morsitans; for the latter tsetse fly species, a study in Malawi reported a high prevalence for T. brucei (64.4%) but much lower for all other Trypanosoma infections(< 10%)58. The mixed infection of Trypanosoma species/subspecies is in agreement with previous reports35,52,57,59.

Likewise, the prevalence of Sodalis infection varied significantly with tsetse taxon and location and the highest prevalence was found in G. m. morsitans and G. pallidipes. Our results agree with the high prevalence of Sodalis reported in G. pallidipes (~ 50%) in one location in Kenya regardless of the fly age33; however, the same study reported low Sodalis prevalence in another location. In another study in Kenya, Wamwiri et al.,32 reported moderate Sodalis prevalence in G. pallidipes (16%) and low prevalence in G. austeni (3.7%), which is in agreement with our results. On the other hand, our results are different from the low prevalence (< 8%) found in G. m. morsitans and G. pallidipes in Zambia36. In another study in Zambia, Sodalis prevalence in G. m. centralis, was reported to be 15.9% with no significant difference between inter-site prevalence52. The prevalence of Sodalis in G. brevipalpis in our study was found to be low (< 2.3%) which contradicts with the high prevalence (93.7%) reported in this species in Zambia36. In the DRC, the global prevalence of Sodalis in G. fuscipes quanzensis midgut averaged 15.5%, but in certain locations the prevalence exceeded 40%60. In Nigeria, Sodalis prevalence in G. p. palpalis and G. tachinoides was 35.7%61 which is higher that the prevalence reported in our study for both species.

Our data indicate that the Trypanosoma and Sodalis infections were very low or absent in some tsetse taxa from certain locations such as G. austeni in Eswatini for Trypanosoma and Sodalis infections and several species in west Africa for Sodalis. The lack of Sodalis and/or Trypanosoma infection in these samples might be due to (i) low number of tested samples (ii) the use of the DNA extracted from the whole body of tsetse adults (iii) the possibility of the collected samples being infected with different strains/genotypes that might not be detected with the primers used and (iv) the infection of Sodalis and Trypanosoma are under the detection limit of the used PCR. It is important to note that due to the high number of samples tested in our study, the more sensitive nested PCR to detect low infection level was excluded due to logistic reasons.

Our results indicate significant deviation from independence (correlation) of Sodalis and Trypanosoma infections in G. medicorum, G. p. gambiensis and G. pallidipes. However, the lack of detection of any tsetse adult with co-infection of Sodalis and Trypanosoma in G. medicorum, and G. p. gambiensis might indicate a negative correlation. Such negative trend might be supported by the lower density of Sodalis in the flies with co-infection (S+/T+) compared to these with Sodalis infection only (S+/T). On other hand the lack of impact of Sodalis infection on Trypanosoma density does not support the negative trend and agreed with the results of Trappeniers et al.,24 reported on colonized flies. This results also agreed with previous results reporting the absence of direct correlation between the presence of Sodalis and the acquisition of a Trypanosoma infection63. However, an inverse correlation was reported between Sodalis and the vector competence where the presence of Sodalis in both midgut and proboscis of G. p. gambiensis was associated with its status as a poor vector, whereas it is not found in the proboscis of G. m. morsitans (major vector). It is worth noting that all previous studies of Sodalis infection in G. p. gambiensis and its interaction with Trypanosoma infection was carried out with flies reared under laboratory conditions28,29,64. The correlation between Sodalis and Trypanosoma infection in G. pallidipes is positive, evidenced with the relative high number (n = 170) of tsetse with co-infection. This positive correlation was also found in G. pallidipes from Kenya although with too few flies with co-infection to enable us to draw a definite conclusion32. Although co-infections were found in G. m. morsitans and G. f. fuscipes in some locations, the global correlation was missing. This is in agreement with the positive correlation found between Sodalis and Trypanosoma infection in G. m. centralis in Zambia, in which there was a 6.2 fold increase in the likelihood of a fly being infected with Trypanosoma if Sodalis was present52. More studies are needed to enhance the potential control interventions mediated by endosymbionts to reduce parasitic infections61.

The results of this study clearly indicate that the interaction between Sodalis and Trypanosoma infection is complex, species-specific and requires further investigation. The prevalence results indicate that Sodalis and Trypanosoma infections are not independent in some species, such as G. p. gambiensis and G. medicorum in west Africa and G. pallidipes in central and east Africa, In case of a positive correlation between Sodalis and Trypanosoma infection in these species, additional measures could be suggested when implementing the SIT to reduce the Sodalis density in the sterile males released in the targeted area to maximize the safe implementation of the SIT. These measures might include the mixing of Sodalis phage(s)29,65 with the blood meals to feed the mass-reared flies to reduce the Sodalis density in these flies. In addition, the blood meal offered to the males before release can be supplemented with one or more of the following antimicrobial products to reduce Sodalis density, i.e. streptozotocin23, indolicidin and OaBAC 5 mini66. The use of the Sodalis phage as well as these antimicrobial agents requires further studies to (1) develop methods to isolate the phage, (2) determine the conditions (e.g. suitable concentration) for its use, and (3) determine the impact on Sodalis density, tsetse productivity and survival. For G. m morsitans and G. pallidipes, our results suggest that Sodalis infection does not have an impact on Trypanosoma infection so here no additional measures need to be taken during the implementation of SIT against these species.

Conclusion

Sodalis and Trypanosoma infection varied with tsetse taxon and location. There is a significant positive correlation between Sodalis and Trypanosoma infection in G. medicorum, G. p. gambiensis and G. pallidipes; however, no significant correlation was found in other tsetse taxa and locations. The results of this study will enable the decision makers of SIT projects to better plan and take the necessary measures to fine-tune and optimize SIT efficiency and safety.

Methods

Tsetse collection and DNA extractions

Tsetse flies were collected in 1995 and between 2005 and 2018 from 95 different geographical locations in fifteen countries in east, central, southern, and western Africa (Table 6, Supplementary Table 6). The tsetse flies were collected with species-specific traps which included the biconical trap67, the monoconical trap68, the Vavoua trap69, the Ngu trap70,71, the odour-baited Epsilon trap72, the NZI trap73, and the odour baited H trap74. A total of 6860 tsetse flies, belonging to ten tsetse species, were collected for this study (Table 6). The majority of the samples were collected in Burkina Faso (2274), Kenya (1008), Senegal (547) and South Africa (526). As the distribution of most tsetse species is allopatric (only few species are sympatric), not all tsetse species were collected from each country. Following collection, fly samples were preserved in 95% ethanol or propylene glycol and shipped to the FAO/IAEA Insect Pest Control Laboratory (IPCL) in Seibersdorf, Austria and stored at − 20 °C until analysis. Total DNA was extracted from individual whole fly bodies using the DNeasy tissue kit (QIAGEN Inc., Valencia, CA) following the supplier’s instructions. The DNA quality and concentration were measured by spectrophotometry (Synergy H1 Multi-Mode Reader, BioTek, Instruments, Inc., USA) and subsequently kept at 4 °C until screened for Sodalis and Trypanosoma infections. To verify the quality of the extracted DNA, a set of specific primers amplifying the Glossina spp. microsatellite GpCAG133 sequence (Supplementary Table 5) and only the successful samples were included in the analysis21,75.

Table 6 List of collections of tsetse adults with valid DNA screened for Sodalis and Trypanosomea infection in wild tsetse population in east, central, southern and west Africa.

Trypanosoma prevalence and genotyping

Polymerase chain reaction (PCR), following the method of Njiru et al.76 that used the primers ITS1-CF and ITS1-BR (Supplementary Table 5) previously designed to amplify the internal transcribed spacer (ITS1) of the ribosomal DNA, was used to detect Trypanosoma infection and Trypanosoma species in the fly samples. The PCR was carried out in 25 μl reaction mixtures containing 22.5 µl of 1.1 × Pre-Aliquoted PCR Master Mix (0.625 units Thermoprime Plus DNA Polymerase, 75 mM Tris–HCl (pH 8.8 at 25 °C), 20 mM (NH4)2SO4, 2.0 mM MgCl2, 0.01% (v/v) Tween-20 and 0.2 mM each of the dNTPs (ABgene, UK), 1 µl primers (at 200 nM final concentration of forward and reverse primer) and 1.5 µl of template DNA. PCR cycles were: 94 °C for 15 min, followed by 40 cycles of 94 °C for 30 s, 60 °C for 30 s, 72 °C for 30 s, and final extension 72 °C for 5 min. Interpretation of the results after resolving the amplification products in a 2% agarose gel (Fisher Biotech) stained with SafeGreen or ethidium bromide, was based on the characteristic band size of Trypanosoma taxa: all members of the subgenus T. brucei sspp (T. b. brucei, T. b. gambiense, T. b. rhodesiense: 480 bp); T. congolense savannah (700 bp); T. congolense Kilifi (620 bp); T. congolense forest (710 bp); T. simiae (400 bp); T. simiae Tsavo (370 bp); T. godfreyi (300 bp) and T. vivax (250 bp). The positive control DNA was from T. congolense savannah, T. congolense forest, T. b. brucei, T. b. gambiense, T. b. rhodesiense, T. evansi, and T. vivax. DNA samples validated with GpCAG133 primer amplification were screened for trypanosome infection. A tsetse sample was recorded as positive if one or more of the indicated band sizes was detected. Trypanosoma infection status and species were recorded for each fly.

Prevalence of Sodalis infection

The detection of Sodalis in natural tsetse samples was based on the Sodalis fliC (flagellin) gene which results in an amplicon length of about 508 base pairs with the Sodalis specific primers Sod-fliC-F and Sod-fliC-R (Supplementary Table 5)77. These primers were used in single pairs or in multiplex PCR with GpCAG133 primers. For all PCR reactions, 22.5 µl of 1.1 × Pre-Aliquoted PCR Master Mix (ABgene, UK) was used. In a final volume of 25 µl, 1.5 µl of template DNA plus forward and reverse primers were added to a final concentration of 0.2 mM per primer in a volume of 1 µl. Samples were considered Sodalis-infected if the expected symbiont PCR product amplicon was detected. Data were accepted only if the control gene GpCAG133 sequence was amplified. The PCR cycling conditions were: 95 °C for 5 min followed by 34 cycles of 95 °C for 30 s, 52.5 °C for 30 s, 72 °C for 30 s and lastly at 72 °C for 10 min; PCR products were separated by agarose (2%) gel electrophoresis and SafeGreen or ethidium bromide staining.

Analysis of the Trypanosoma and Sodalis infection in wild tsetse populations

Co-infection of tsetse adults with Sodalis and Trypanosoma infection

The co-infection of Sodalis and Trypanosoma infection was evaluated based on the PCR prevalence. The infection status was divided into four categories Sodalis positive and Trypanosoma positive (S+/T+), Sodalis positive and Trypanosoma negative (S+/T), Sodalis negative and Trypanosoma positive (S/T+) and Sodalis negative and Trypanosoma negative (S/T).

Analysis of the Trypanosoma and Sodalis density

Samples showing Trypanosoma infection (not T. vivax) with Sodalis (S+/T+) and samples not infected with Trypanosoma but infected with Sodalis (S+/T) were evaluated with quantitative PCR (qPCR) to assess the impact of Trypanosoma infection (regardless the Trypanosoma type) on Sodalis density. The qPCR was performed using a CFX96 Real Time PCR Detection System (Bio-Rad). The fliC gene was amplified with the following primers: sodqPCR-FliCF and sodqPCR-FliCR78 (Supplementary Table 5) to assess the density of the symbiont present within Trypanosoma infected and noninfected, additional criteria for the selection of the samples was the presence of the two groups (S+/T+) and (S+/T) in a given population. Based on the preceding criteria 96 individual flies (52 and 44 flies with infection status of (S+/T+) and (S+/T), respectively, were selected from the G. pallidipes and G. m. morsitans collected in Kenya, Tanzania and Zimbabwe. In addition, samples with (S+/T+) and (S/T+) were used to assess the impact of Sodalis infection on Trypanosoma density. Trypanosomatidae18S specific primers (18S_Typ_F and18S_Typ_R) (Supplementary Table 5) were used to assess the Trypanosoma density in the tested samples. The DNA from all selected samples was diluted to a final concentration of 4 ng/μl and 5 μl of the diluted DNA was used for qPCR to determine Sodalis and Trypanosoma DNA density normalized to the housekeeping β-tubulin gene. The amplification mixture contained 5 μl of DNA template, 200 nM of each primer, and 7.5 μl iQ™ SYBER Green Supermix (Bio-Rad). qPCR cycling conditions for Sodalis were as follows: initial denaturation at 95 °C for 2 min; 39 cycles of 95 °C for 5 s, 55 °C for 30 s, one step at 95 °C for 5 s and a melting curve constructed from 65 °C to 95 °C in increments of 0.5 °C for 5 s. The same conditions were used for Trypanosoma except the annealing temperature was at 60 °C. The analysis of the Sodalis, Trypanosoma and Tubulin densities was based only on qPCR data with the expected melting curve at 81.5 °C, 85.5 °C and 86 °C, respectively.

Data analysis

The prevalence data were recorded and analyzed with the general linear model (GLM)79. The prevalence of Sodalis, Trypanosoma species and each Trypanosoma species and co-infection were tested for differences between the tsetse taxa and between countries. For each country, the prevalence was assessed again for differences between the localities where the flies were collected and between the tsetse species present in each country. In the absence of PCR detected Sodalis or Trypanosoma infection, the upper 95% confidence interval for the true rate of infection was calculated following the method of Couey and Chew62. Trypanosoma prevalence between taxa was compared between species by a pairwise comparison of proportions with a Bonferroni correction and Benjamini–Hochberg correction. The analyses were executed in R v 4.0.579 using RStudio V 1.4.110680,81 with the packages ggplot2 v3.3.2.182, lattice v0.20-4183, car84, ggthems85 and MASS v7.3-51.686 except for the Chi squared tests for independence, Spearman correlation coefficient and Cochran–Mantel–Haenszel test for repeated tests of independence, which were performed using Excel 2013 The R Markdown file is available in Supplementary File 1.

To analyse the qPCR data, normalized density of Trypanosoma and Sodalis against the house keeping gene (tubulin) was extracted from the CFX Maestro software. Samples giving a valid density (not N/A) for both Trypanosoma and Sodalis were retained for further statistical analysis in R. Similarities in the structure of Sodalis and Trypanosoma (single and multiple) infection and the role of different factors such as countries and tsetse taxa, were assessed using the matrix display and metric multidimensional scaling (mMDS) plot with bootstrap averages in PRIMER version 7+. The bootstrap averages plots were displayed with a Bray and Curtis matrix based on the square-root transformation of the Sodalis and Trypanosoma (single and multiple) infection abundance data87. The tests were based on the multivariate null hypothesis via the use of the non-parametric statistical method PERMANOVA88. The Permanova test was conducted on the average of the abundance data based on the country-species after excluding the data of Eswatini (low number of tested samples).