Evolutionary dynamics of HIV-1 subtype C in Brazil

Souto, Bernardino; Triunfante, Vera; Santos-Pereira, Ana; Martins, Joana; Araújo, Pedro M. M.; Osório, Nuno S.

doi:10.1038/s41598-021-02428-3

Download PDF

Article
Open access
Published: 29 November 2021

Evolutionary dynamics of HIV-1 subtype C in Brazil

Bernardino Souto^1,2,3^na1,
Vera Triunfante^1,2^na1,
Ana Santos-Pereira^1,2^na1,
Joana Martins^1,2,
Pedro M. M. Araújo^1,2 &
…
Nuno S. Osório^1,2

Scientific Reports volume 11, Article number: 23060 (2021) Cite this article

1794 Accesses
6 Citations
6 Altmetric
Metrics details

Subjects

Abstract

The extensive genetic diversity of HIV-1 is a major challenge for the prevention and treatment of HIV-1 infections. Subtype C accounts for most of the HIV-1 infections in the world but has been mainly localized in Southern Africa, Ethiopia and India. For elusive reasons, South Brazil harbors the largest HIV-1 subtype C epidemic in the American continent that is elsewhere dominated by subtype B. To investigate this topic, we collected clinical data and viral sequences from 2611 treatment-naïve patients diagnosed with HIV-1 in Brazil. Molecular epidemiology analysis supported 35 well-delimited transmission clusters of subtype C highlighting transmission within South Brazil but also from the South to all other Brazilian regions and internationally. Individuals infected with subtype C had lower probability to be deficient in CD4⁺ T cells when compared to subtype B. The HIV-1 epidemics in the South was characterized by high female-to-male infection ratios and women-to-child transmission. Our results suggest that HIV-1 subtype C probably takes advantage of longer asymptomatic periods to maximize transmission and is unlikely to outcompete subtype B in settings where the infection of women is relatively less relevant. This study contributes to elucidate factors possibly underlying the geographical distribution and expansion patterns of the most spread HIV-1 subtypes.

HIV-1 molecular diversity in Brazil unveiled by 10 years of sampling by the national genotyping network

Article Open access 04 August 2021

Tiago Gräf, Gonzalo Bello, … Amilcar Tanuri

Molecular epidemiology and HIV-1 variant evolution in Poland between 2015 and 2019

Article Open access 16 August 2021

Karol Serwin, Anna Urbańska, … Miłosz Parczewski

Transmission dynamics of HIV-1 subtype B strains in Indonesia

Article Open access 27 September 2019

Shuhei Ueda, Adiana Mutamsari Witaningrum, … Masanori Kameoka

Introduction

Retroviruses such as HIV (Human Immunodeficiency Virus) have an extreme capacity to generate genetic diversity¹. HIV genetic diversity spectrum is divided into types I and II, with HIV-1 comprising the groups M, O, N and P. The pandemic group M is increasingly diversifying and comprises at least 10 subtypes, several sub-subtypes and recombinant forms^2,3. Interestingly, these HIV-1 clades might be evolving at different rates, to modulate virulence^4,5. Most accepted theories on virulence evolution postulate that the selection for an optimal virulence level follows a complex trade-off between the factors influencing pathogen induced-host mortality and between-host transmission⁶. In fact, M group subtypes were associated to differences in disease progression^7,8,9,10,11, preferential transmission routes^12,13 and different capacity to evade the immune system^14,15 or therapy^16,17,18. These differences possibly result in subtype-related advantages in different niches contributing for the global subtype spread dynamics^5,11.

Subtype C causes nearly all infections in Southern Africa, Ethiopia and India being responsible for almost half of the HIV-1 infections in the world^19,20,21. Despite the increasing amount of evidence that supports the geographic expansion of C subtype and other non-B subtypes in different continents^22,23,24,25, globally, in the last decades, subtype C has been shown to have a decreasing profile, along with other subtypes, contrasting with subtype B²⁰. In fact, subtype B remains the most geographically spread HIV-1 subtype worldwide. Ex vivo evidence following viral infection of peripheral blood mononuclear cells suggests that C subtype might be less cytopathogenic due to a preference for CCR5 co-receptor expressing cells and less fit when compared to B^26,27,28. Furthermore, it was shown that HIV-1 subtype C is associated with slower rates of CD4⁺ T-cell declines and higher frequencies of long-term non-progression when compared to subtype A or D in women from Uganda and Zimbabwe²⁹. In cohorts from Kenya¹³ or Tanzania¹² it was found that pregnant women infected with subtype C had higher risk of mother-to-child transmission when compared with the ones infected with A or D.

Studies comparing in detail subtype C and B infections in human cohorts are limited by the rarity of informative clinical settings where subtype C and B co-exist in large numbers. In case of Brazil, the HIV-1 epidemics is dominated by B subtype. However, subtype C represents the most prevalent subtype in the South region of the country. The fact that most subtype C sequences from this region branch within a monophyletic clade suggest that this epidemic possibly initiated by the introduction in South Brazil, around 1960–80 s, of a single founder lineage derived from the radiation of an East African regional-specific group^{30,31,32,33,34,35}. Reports show that in the early 2000s, C subtype represented around 30% of the HIV-1 infections in several cities in this region and that, after 2005, it became the most prevalent subtype, representing more than 40% of the cases^30,36. Most strikingly, the spread of C subtype in other regions of Brazil outside of the South were revealed to be slow and modest. Despite intense and regular movement of people between the South and South-East regions, the South-East region of Brazil and other regions bordering South Brazil in Argentina, Paraguay or Uruguay remained with low subtype C prevalence^36,37,38,39. The reasons underlying these regional differences are elusive and gaining insights into the introduction and regional expansion of HIV-1 subtype C in Brazil might give important information about C versus B subtype-related differences in what regards to within-host replication, virulence, transmission, and overall host population infection dynamics. Thus, in the present study, we investigated the phylogeography of HIV-1 lineages and compared clinical and epidemiological information from 2611 Brazilian patients.

Results

Proportion of HIV-1 subtype C infections in Brazil

To investigate the differences in the proportion of cases caused by HIV-1 C and B subtypes in Brazil, we subtyped the sequences from all individuals that were treatment naïve and sampled from 01/2008 to 04/2017 at the National Genotyping Network of Brazil (n = 2611; Table S1). The region with the higher proportion of naïve HIV-1 infected individuals was the South-East (n = 1305; 49.98%) followed by the South (n = 507; 19.42%) and North-East (n = 487; 18.65%) (Fig. 1). HIV-1 subtype B was the most common at the country level with a total of 1675 cases, representing 64.15% of all infections in the studied population. Combining all regions, the proportion of C subtype among our sample was 13.02% (340 cases of a total of 2611). However, in the South, subtype C represented 50.30% of the cases being the most frequent in the region (Fig. 1). The analysis of the number of cases per year highlights that subtype C was consistently the most abundant in the South during the period under analysis (Fig. 1). In the South-East, the region with most HIV-1 infections, the number of cases with subtype C in the studied population never reached more than 12 cases per year, contrasting with the South, in which the number of C infections was superior to 30 cases per year in the period between 2013 and 2016, with a peak of 48 cases in the year of 2014 (Fig. 1). Overall, the South had a high growth in the number of cases caused by subtype C and was the only region where this subtype was more frequent than subtype B (Table S2).

Subtype C associates with lower deficiency in CD4⁺ T cells when compared with B

To address subtype-related differences in infection progression outcomes, we compared the viral loads and CD4⁺ T cell counts between the infections caused by subtypes C or B. We found no statistically significant differences when comparing viral loads between individuals infected with C vs. B subtypes (p = 0.79). To test if individuals with very high viral loads could be confounding the analysis, we separated individuals with viral loads ≤ 100,000 virus/mL (n = 1927) from those with viral loads > 100,000 virus/mL (n = 535). The cut-off point of 100,000 virus/mL was chosen considering previous literature demonstrating its value to predict disease progression or treatment failure^40,41,42. Again, we found no significant differences between C and B subtypes (p = 0.63; Table S3). To investigate the effect of HIV-1 subtype in CD4⁺ T cell counts we compared individuals with or without immunodeficiency and, among the immunodeficient, the ones with moderate or severe levels. Considering the age-related differences of CD4⁺ T cell normality, the classification of each case was done by adjusting the reference values according to the age of the subject (Table S4). The criteria used to define immunodeficiency accounted only for CD4⁺ T cell counts and was based on cut off points based on previous literature and commonly used in clinical practice^43,44. Individuals infected with subtype C had a significant lower probability to be immunodeficient (p = 0.000) when compared with subtype B (Fig. 2; Table S5). This association was maintained when dividing the group by age (Table S5). Among the individuals with immunodeficiency, the ones infected with subtype C had significant lower probability of severe immunodeficiency (p = 0.001; Fig. 2; Table S5). Individuals with less than 18 years infected with subtype C had an even lower probability of severe immunodeficiency (p = 0.008; Table S5). Moreover, we decided to evaluate the proportion of ambiguous sites (PAS), a surrogate of age of infection^45,46,47,48, on all the viral sequences and no significant difference was found between subtypes (Table S6). Overall, these results suggest that C subtype viruses, despite reaching viral loads similar to subtype B, are less able to cause a deficiency in CD4⁺ T cells, which could lead to longer asymptomatic periods and possibly increase the opportunity for transmission in some settings.

Evidence for interregional and international subtype C transmission

To gain insights into the transmission of subtype C in Brazil, we performed maximum likelihood (ML) and Bayesian phylogenetic analysis of the 340 subtype C sequences described in this study and 854 closely related sequences obtained from public databases (total n = 1194). The phylogenetic representation (Fig. 3, Table S7) demonstrated that the vast majority (99.26%; 1076 out of 1084) of the C subtype viruses isolated in Brazil were included in a monophyletic clade (SH-like branch support 0.94) that was nested with sequences from the East African region. This large clade also included sequences obtained from public databases and isolated in Asia, Europe, and other American countries. We then performed the characterization of transmission clusters and found 35 well-delimited transmission clusters (TC1 to TC35, Table 1) involving a total of 174 sequences. The average number of sequences per cluster was 4.97. TC24 was the largest cluster including a total of 24 sequences isolated in the South-East, Central-West or North regions of Brazil. Most of the clusters (18 out of 35, 51.43%) were exclusively formed from sequences isolated in the South of Brazil. From the nine clusters spanning more than one Brazilian region (Table 1, interregional), only clusters TC24 and TC35 did not include sequences isolated in the South. Interestingly, TC35 branched outside the diversification of the major founding event of subtype C in Brazil (Fig. 3) suggesting that rare transmission events of subtype C viruses from different introductions might exist in some parts of the country. Furthermore, we found four clusters that included sequences isolated outside Brazil (Table 1, international). The two largest (TC5, TC16) included sequences from the South, South-East, one other Brazilian region and other countries (USA, Spain, Portugal, or Germany). Most of the sequences in clusters (71.26%, 124 out of 174) lacked information on the reported route of infection. Among those with available data, transmission between heterosexual (28 cases) and MSM (12 cases) were the most reported. The estimates for the time of the MRCA for each cluster ranged from 1992 to 2009. The cluster depth analysis (obtained for each cluster by subtracting the most recent sampling date minus the time of MRCA) supports that six transmission clusters (TCs 2, 5, 16, 22, 24 and 26) were ongoing for more than 20 years (Table 1).

Table 1 Characterization of the 35 transmission clusters of HIV-1 subtype C virus identified in this study.

Full size table

Relevant role of the South-East in the transmission of subtype C

To assess the viral diffusion patterns, a phylogeographic analysis (Fig. 4A) was performed considering the sequences included in the transmission clusters that were sampled in Brazil and have complete information (n = 156). Tip location was defined as Brazilian region of sample collection (North, North-East, Central-West or South-East). Due to larger sample size, sequences from the South region were classified according to the state of origin (Santa Catarina (SC), Rio Grande do Sul (RS), or Paraná (PR)). Analysis with TempEst showed a positive correlation between genetic divergence and sampling time (r = 0.45, R² = 0.20) suggesting that the temporal signal of the dataset was suitable for phylogenetic molecular clock analysis⁴⁹. The root of subtype C epidemic in Brazil was inferred to SC (psp = 0.93, pp = 1.00) in the South. Transmission clusters including C sequences from outside of the South were estimated to have a time of MRCA as early as 1993 (1983.8–2001.9) with the regions of the South-East and North, having the highest probability (psp = 0.85, pp = 1.00 and psp = 0.83, pp = 1.00, respectively) of being the first points of introduction from the South into other regions (Fig. 4B). A total of 5 pairwise rates of diffusion between locations were found to have a strong support value (Bayes factor (BF) > 10). These include the transmission among SC and the South-East (BF = 34,202.74, pp = 1.00), the South-East and Central-West (BF = 1898.13, pp = 1.00), SC and RS (BF = 58.83, pp = 0.96), SC and PR (BF = 37.50, pp = 0.95), or RS and Central-West (BF = 11.58, pp = 0.84). Furthermore, the linkage between South-East and North, or South-East and North-East was supported by a BF above 3. The results suggest that, at a given point in the transmission history, the South-East not only received C viruses from the South but was also involved in transmission to other Brazilian regions. These findings were also supported by a phylogeographic analysis using the transmission cluster sequences grouped by Brazilian state and including sequences sampled outside Brazil (n = 161, Fig. S1). Additionally, this analysis showed well supported diffusion rates (BF > 10) for the international transmission of the Brazilian C subtype clade relating the Southern state of RS with Germany and Spain with the United States of America (Table S8).

Demographic differences in subtype C infections

Having established evidence for intense South to South-East transmission of subtype C, we then explored the demographic and epidemiologic characteristic of the HIV-1 epidemics in these regions to investigate possible reasons for the inferior capacity of C subtype to become dominant outside the South. In total, 60.18% (204 of 339) of the infections by subtype C in Brazil were in women and only 39.82% (135 of 339) in men (OR = 1.64; CI = 1.30–2.08; p = 0.000). In accordance, we found significant differences (p = 0.0160) in the distribution of the sex of the HIV-1 infected individuals in the South when compared with the South-East (Table 2). In the South, HIV-1 affected more females (55.82%) than males (female-to-male ratio = 1.27) while in the South-East most of the infections were in male (50.04%; female-to-male ratio = 0.98). Despite the missing data, within our study population, mother-to-child transmission was significantly more likely (CI = 1.76–6.96; p = 0.0002) to occur in the South than in the South-East. Moreover, the number of infected individuals with less than 18 years of age infected with HIV-1 in the South was also significantly (p = 0.000) higher than in the South-East. Transmission between men that have sex with men (MSM) was significantly more associated with the South-East (OR = 3.72; CI = 1.22–15.13; p = 0.0218). These findings highlight clear demographic and epidemiological differences between these two neighboring Brazilian regions.

Table 2 Epidemiological comparison of HIV-1 epidemics between South and South-East Brazil.

Full size table

Discussion

HIV-1 subtypes C and B can be considered the evolutionarily most successful HIV-1 subtypes. Given the differences in geographic distribution between C and B subtypes it is reasonable to assume that there are particularities in these viruses possibly conferring subtype-specific advantages in different settings. In this study, country level clinical and demographic data, and partial sequences of the HIV-1 genome (pol sequence) originating from routine genotypic testing for resistance to antiretroviral therapy were investigated. The observed proportion of HIV-1 infections by Brazilian region in the study population was in accordance with the official HIV-1 prevalence reports⁵⁰. The pol region, previously shown to be able to accurately reconstruct HIV transmission⁵¹, was used for phylogenetic analysis. Regarding HIV-1 subtype distribution in Brazil, our results update and expand to the country-level previous literature^52,53,54 in showing that Brazil has bordering regions dominated in prevalence by subtype B or C. During the period under analysis, subtype C led in proportion only in the South with the rest of Brazil being dominated by subtype B. Most interestingly, despite intense and regular movement of people between the South and South-East regions, the lowest overall subtype C proportion of cases in the studied population was found in the South-East (3.52%; 44 cases out of 1248).

Subtype C was previously associated with higher CD4⁺ T cell counts in African cohorts when compared with subtypes A and D²⁹. In the comparison with B subtype, our analysis in the Brazilian cohort suggests that subtype C, despite reaching similar viral loads than subtype B, could lead to more moderate rates of destruction of CD4⁺ T cells. In fact, among people infected with subtype C there were significantly less individuals with deficiency in CD4⁺ T cells when compared with the ones infected with subtype B, which was not due to differences on the age of the infected individuals or in the time since infection, as no significant differences were observed on the statistical analysis of the PAS^45,46,47,48. This could lead to longer asymptomatic periods in subtype C infections and possibly increased opportunities for transmission. To investigate the C subtype transmission, we performed a molecular epidemiology and phylogeographical analysis using the 340 C subtype sequences obtained in this study and the closest related sequences from databases. This was performed to enrich the information that could be obtained related to the transmission outside Brazil. Our analysis generated information on the origin and probable place of introduction of C subtype in Brazil. In accordance with the previous studies^30,32, we found strong evidence supporting one major founding event of introduction of subtype C in Brazil originating from Middle East African countries. We found no evidence supporting the introduction from UK to Brazil as suggested in one study³¹. We did find a transmission cluster (TC19) with sequences isolated in the UK that likely originated in Brazil and was transmitted to the UK. We found strong statistical support for international transmission from the Southern Brazil state of Rio Grande do Sul (RS) to Germany. This link is possibly explained by the known migratory fluxes between these two geographic locations.

In our analysis, the state with the highest probability for the place of entrance of subtype C in Brazil was Santa Catarina (SC). The characterization of transmission clusters and phygeographic dynamics suggests that the inferior capacity of C subtype to thrive outside the South was not due to absence of cross-regional transmission. In fact, we found that more than 20% of the C subtype transmission events bridged, in the last decades, the South and at least one other Brazilian region with emphasis on the South-East. We found strong statistical support indicating that the South-East region was not only recipient but also donor in interregional transmission clusters of subtype C viruses. This suggests that, although the South-East has among the lowest overall proportion and annual growth rate of subtype C in the country it played a role in disseminating C subtype virus to other Brazilian regions. Considering our results, it is tempting to speculate that for HIV-1 subtype C to thrive in a population it relies on its high within-host replicative capacity (like that of B subtype) but possibly also takes advantage of longer asymptomatic periods that might increase its opportunities to transmit. The epidemiological comparison between the South and South-East Brazil suggests that C subtype capacity to outcompete B might be facilitated in settings with higher female-to-male infection ratios and women-to-child transmission. However, these conclusions are limited by the presence of missing data on the reported route of infection and to what is possible by means of a cross-sectional study. Notwithstanding, this data finds parallels in previous studies in African cohorts showing that C subtype was more adapted to women-to-child transmission than A or D subtypes^12,13. In a Kenyan cohort, it was found that pregnant women infected with subtype C were significantly more likely to shed HIV-1–infected vaginal cells than were those infected with subtype A or D¹³. Whether C subtype virus are present in higher levels in cells from the vaginal mucosa or even breast milk when compared to B subtype virus has not, to our best knowledge, been studied, being a matter for future investigation. On the other hand, the distribution of HIV infection among men, women and children is also influenced by sociocultural factors such as breast feeding and other gender equality-related factors. It is relevant to point out that the practice of cross-breastfeeding was a culturally established and accepted behavior in Brazil^55,56. It was initially provided by lactating slaves mainly originating from the same African regions that are the most probable point of origin of the HIV-1 subtype C introduced in Brazil. Long after slavery was abolished and at least until the first half of the XX century, it was frequent that lactating Afro-Brazilian women were paid to cross-breastfeed⁵⁶. It is possible that sociocultural heritages from this past influenced the introduction and transmission of subtype C and, consequently, its distribution in the Brazilian territory. The South has the highest prevalence in Brazil of AIDS in pregnant women and children and the higher female-to-male infection ratio⁵⁰. The degree of genetic mixing in the Brazilian population is very high being unlikely that differences in human population ancestry between the South and the South-East could be the explanation for the high rate of subtype C infections. However, subtype C could have found in the Southern region of Brazil, sociocultural and behavioral conditions favorable to its dissemination with similarities to those found in African and Asian regions, where it is also the most prevalent HIV-1 subtype^20,57.

Overall, this study opens lines of research on the differences between the two most prevalent HIV-1 subtypes and, at the same time, it is useful for the management of the health care and public HIV-1 control policies. Regarding the dynamics between B and C subtypes it is possible that C subtype outcompetes B only in settings with sizable infection of women and women-to-child transmission. Thus, it is suggested that, where the prevalence of subtype C is higher, care professionals and public policies define specific strategies for the protection of women and the pregnancy-puerperal cycle against HIV infection. Targeting this group by close surveillance to make the diagnosis and treatment as close as possible to the time of infection is likely to reduce the epidemiological burden of subtype C HIV-1 infections.

Materials and methods

Study population

Data was collected from HIV-1 infected patient records (n = 2611, Table S1) available at the Specialized Assistance Services on Sexually Transmissible Diseases and HIV/Aids including all 26 Brazilian states and Federal District from 01/01/2008 to 04/30/2017 that met the inclusion criteria for this study. The inclusion criteria were availability of partial HIV-1 genome sequence, and absence of previous antiretroviral treatment upon sampling. For all cases matching the criteria the following patient data was collected anonymously from previously available records: self-reported transmission route; sex; birth year; date of the viral sample collection for sequencing; CD4⁺ T-cell count at sampling; viral load at sampling; geographical origin of the sample; and pregnancy. The study was approved and was granted exemption from written informed consent by the Brazilian national ethic committee, "Comissão Nacional de Ética em Pesquisa (Conep)", through the protocol CAAE 53757016.0.0000.5504. All methods were performed in accordance with the Declaration of Helsinki.

Sequencing

DNA sequencing, from the reverse transcriptase PCR amplicons was performed with commercially available HIV-1 genotyping systems based on Sanger sequencing and performed using standardized protocols in the National Genotyping Network of Brazil. The HIV-1 genome sequence portion used in this study corresponds to the pol region. The HIV-1 positions in this study refer to the HXB2 HIV-1 reference genome (GenBank: K03455.1). All multiple sequence alignments were performed using MAFFT v7.309⁵⁸ removing columns containing at least 10% gaps. The HIV-1 subtype was assigned using Rega HIV-1 Subtyping Tool v3.0⁵⁹, Comet HIV-1 v2.3⁶⁰, jpHMM⁶¹, RIP v3.0⁶², SCUEAL⁶³, and SNAPPY⁶⁴. The results of the different tools were compared, and subtype was classified based on the agreement between the used tools and manual inspection of the results from phylogenetic and recombination analysis. The 2611 sequences selected for this study were made available in GenBank (accession numbers pending).

Phylogenetic analysis

To obtain additional sequences from outside the National Genotyping Network of Brazil we queried the HIV reference sequence database (http://www.hiv.lanl.gov/) using BLAST⁶⁵. For each of the 340 subtype C sequences described in this study the 10 most closely related generated outputs were selected. We excluded duplicates or sequences from the same patient and sequences showing evidence of recombination. Applying these criteria 854 database sequences were added to this study for phylogenetic analysis. An alignment of 1194 sequences was used to make a phylogenetic reconstruction using PhyML v3.0⁶⁶. The best fitting substitution model was GTR + G4 + I, determined by PhyML SMS(Smart Model Selection) using AIC (Akaike Information Criterion)⁶⁷.The heuristic trees search was performed using SPR and NNI methods. The branch support was calculated with the approximate likelihood-ratio (aLRT) SH-like test. The tree with the best likelihood value was performed using SPR with 3 random starting trees (Fig. 2). Bayesian evolutionary and phylogeographic analyses were performed using BEAST v1.10.4^68,69, with GTR + G4 + I for two different codon partitions (1 + 2, 3), as nucleotide substitution model, coalescent Skygrid model and uncorrelated relaxed clock. The site model GTR + G4 + I corresponding to the best model selected by jModelTest program⁷⁰. The sampling Brazilian region, state or country outside Brazil were used as discrete traits. A symmetric discrete traits substitution model selecting the option to infer social network with Bayesian Stochastic Search Variable Selection (BSSVS) method was used to estimate transition rates between locations. The temporal signal of the data was tested by TempEst⁴⁹. Two different runs (random seeds) of 320 million generations, converged to similar values. Outputs were analysed with Tracer v1.7.1⁷¹ to ensure all parameters had an effective sampling size (ESS) superior to 200. The two multiple tree output files were combined, using LogCombiner v1.10.4⁶⁸, to build the maximum clade credibility tree with mean heights with TreeAnnotator v1.10.4⁶⁸. The resulting log files were also combined with LogCombiner v1.10.4⁶⁸. The phylogeographic representations were created with SpreaD3⁷³. For database sequences from outside Brazil, the country’s locations were plotted as their geographic centre.

Definition of transmission cluster and tree visualization

The criteria for the definition of a clade as a transmission cluster were likelihood ratio test (aLRT) SH-like branch support ≥ 0.95 (estimated with PhyML v3); branch posterior probability ≥ 0.99 (estimated with BEAST v1.10.4); mean cluster genetic distance < 0.003 substitutions per site; and maximum genetic distance < 0.05 substitutions per site. MEGA X v10.05⁷² was used for genetic distance calculation. Only clusters with more than 2 sequences were included. The phylogenetic tree shown in Fig. S1 was used for the characterization and dating of transmission clusters. FigTree v1.4.4 was used for visualization and manipulation of the trees⁷³.

Statistical analysis

After verifying and optimizing the quality of epidemiological data (transmission route; sex; birth year; date of the viral sample collection for sequencing; CD4⁺ T-cell count at sampling; viral load at sampling; geographical origin of the sample), they were organized into spreadsheets and processed by the software Epi Info, from the Center for Disease Control and Prevention (United States). For statistical analysis, the Mantel–Haenszel chi-square test was used when the minimum sample size in all variables was greater than or equal to 5. When sample size was less than 5 units in at least one of the variables, the Fisher exact test was used for calculating the Odds Ratio and the corrected Mantel–Haenszel chi-square test for calculating the p value. In all cases, the tests were two-tailed, and the level of significance considered was 5%.

References

Peeters, M., D’Arc, M. & Delaporte, E. Origin and diversity of human retroviruses. AIDS Rev 16, 23–34 (2014).
Robertson, D. L. et al. HIV-1 nomenclature proposal. Science 288, 55–65 (2000).
Article CAS Google Scholar
Yamaguchi, J. et al. Complete genome sequence of CG-0018a-01 establishes HIV-1 subtype L. JAIDS J. Acquir. Immune Defic. Syndr. https://doi.org/10.1097/qai.0000000000002246 (2019).
Article Google Scholar
Jefferys, R. J. Evidence for HIV weakening over time. Proc. Natl. Acad. Sci. U.S.A. https://doi.org/10.1073/pnas.1502380112 (2015).
Article PubMed PubMed Central Google Scholar
Ariën, K. K., Vanham, G. & Arts, E. J. Is HIV-1 evolving to a less virulent form in humans?. Nat. Rev. Microbiol. https://doi.org/10.1038/nrmicro1594 (2007).
Article PubMed PubMed Central Google Scholar
Alizon, S., Hurford, A., Mideo, N. & Van Baalen, M. Virulence evolution and the trade-off hypothesis: History, current state of affairs and the future. J. Evol. Biol. https://doi.org/10.1111/j.1420-9101.2008.01658.x (2009).
Article PubMed Google Scholar
Kaleebu, P. et al. Effect of human immunodeficiency virus (HIV) type 1 envelope subtypes A and D on disease progression in a large cohort of HIV-1–positive persons in Uganda. J. Infect. Dis. https://doi.org/10.1086/340130 (2002).
Article PubMed Google Scholar
Easterbrook, P. J. et al. Impact of HIV-1 viral subtype on disease progression and response to antiretroviral therapy. J. Int. AIDS Soc 13. 1–9 (2010).
Kiwanuka, N. et al. Effect of human immunodeficiency virus Type 1 (HIV-1) subtype on disease progression in persons from Rakai, Uganda, with incident HIV-1 infection. J. Infect. Dis. 197, 707–713 (2008).
Article Google Scholar
Baeten, J. M. et al. HIV-1 subtype D infection is associated with faster disease progression than subtype A in spite of similar plasma HIV-1 loads. J. Infect. Dis. https://doi.org/10.1086/512682 (2007).
Article PubMed Google Scholar
Araújo, P. M. M. et al. Characterization of a large cluster of HIV-1 A1 infections detected in Portugal and connected to several Western European countries. Sci. Rep. https://doi.org/10.1038/s41598-019-43420-2 (2019).
Article PubMed PubMed Central Google Scholar
Renjifo, B. et al. Preferential in-utero transmission of HIV-1 subtype C as compared to HIV-1 subtype A or D. AIDS 18, 1629–1636 (2004).
Article CAS Google Scholar
John-Stewart, G. C. et al. Subtype C Is associated with increased vaginal shedding of HIV-1. J. Infect. Dis. 192, 492–496 (2005).
Article Google Scholar
Serwanga, J. et al. Frequencies of Gag-restricted T-cell escape ‘footprints’ differ across HIV-1 clades A1 and D chronically infected Ugandans irrespective of host HLA B alleles. Vaccine https://doi.org/10.1016/j.vaccine.2015.02.037 (2015).
Article PubMed PubMed Central Google Scholar
Kinloch, N. N. et al. Genotypic and mechanistic characterization of subtype-specific HIV adaptation to host cellular immunity. J. Virol. 93, 1502–1520 (2018).
Google Scholar
Brenner, B. et al. A V106M mutation in HIV-1 clade C viruses exposed to efavirenz confers cross-resistance to non-nucleoside reverse transcriptase inhibitors. AIDS 17, F1-5 (2003).
Article CAS Google Scholar
Abecasis, A. B. et al. Investigation of baseline susceptibility to protease inhibitors in HIV-1 subtypes C, F, G and CRF02_AG. Antivir. Ther 11. 581–589 (2006).
Abecasis, A. B. et al. Protease mutation M89I/V is linked to therapy failure in patients infected with the HIV-1 non-B subtypes C, F or G. AIDS 19, 1799–1806 (2005).
Article CAS Google Scholar
Gartner, M. J., Roche, M., Churchill, M. J., Gorry, P. R. & Flynn, J. K. Understanding the mechanisms driving the spread of subtype C HIV-1. EBioMedicine 53, 102682 (2020).
Article Google Scholar
Hemelaar, J. et al. Global and regional molecular epidemiology of HIV-1, 1990–2015: A systematic review, global survey, and trend analysis. Lancet Infect. Dis. 19, 143–155 (2019).
Article Google Scholar
Santos-Pereira, A., Magalhães, C., Araújo, P. M. M. & Osório, N. S. Evolutionary genetics of mycobacterium tuberculosis and HIV-1: “The tortoise and the hare”. Microorganisms https://doi.org/10.3390/microorganisms9010147 (2021).
Article PubMed PubMed Central Google Scholar
Soares, E. A. et al. HIV-1 subtype C dissemination in southern Brazil. AIDS 19(Suppl 4), S81–S86 (2005).
Article Google Scholar
Vidal, N. et al. Distribution of HIV-1 variants in the Democratic Republic of Congo suggests increase of subtype C in Kinshasa between 1997 and 2002. J. Acquir. Immune Defic. Syndr. https://doi.org/10.1097/01.qai.0000159670.18326.94 (2005).
Article PubMed Google Scholar
Li, D. Q., Zheng, X. W. & Zhang, G. Y. Study on the distribution HIV-1 C subtype in Ruili and other counties, Yunnan, China. Zhonghua Liu Xing Bing Xue Za Zhi. 17, 337–339 (1996).
Carvalho, A. et al. Analysis of a local HIV-1 epidemic in portugal highlights established transmission of non-B and non-G subtypes. J. Clin. Microbiol. 53, 1506–1514 (2015).
Article Google Scholar
Ball, S. C. et al. Comparing the ex vivo fitness of CCR5-tropic human immunodeficiency virus type 1 isolates of subtypes B and C. J. Virol. https://doi.org/10.1128/jvi.77.2.1021-1038.2003 (2003).
Article PubMed PubMed Central Google Scholar
Abraha, A. et al. CCR5- and CXCR4-tropic subtype C human immunodeficiency virus type 1 isolates have a lower level of pathogenic fitness than other dominant group M subtypes: Implications for the epidemic. J. Virol. https://doi.org/10.1128/jvi.02051-08 (2009).
Article PubMed PubMed Central Google Scholar
Ndung’u, T. et al. HIV-1 subtype C in vitro growth and coreceptor utilization. Virology https://doi.org/10.1016/j.virol.2005.11.047 (2006).
Article PubMed Google Scholar
Venner, C. M. et al. Infecting HIV-1 subtype predicts disease progression in women of Sub-Saharan Africa. EBioMedicine https://doi.org/10.1016/j.ebiom.2016.10.014 (2016).
Article PubMed PubMed Central Google Scholar
Bello, G. et al. Origin and evolutionary history of HIV-1 subtype C in Brazil. AIDS 22, 1993–2000 (2008).
Article Google Scholar
De Oliveira, T., Pillay, D. & Gifford, R. J. The HIV-1 subtype C epidemic in South America is linked to the United Kingdom. PLoS ONE 5, e9311 (2010).
Article ADS Google Scholar
Véras, N. M. C., Gray, R. R. & Brígido, de L. F. M., Rodrigues, R. & Salemi, M.,. High-resolution phylogenetics and phylogeography of human immunodeficiency virus type 1 subtype C epidemic in South America. J. Gen. Virol. https://doi.org/10.1099/vir.0.028951-0 (2011).
Article PubMed Google Scholar
Delatorre, E. O. & Bello, G. Phylodynamics of HIV-1 subtype C epidemic in East Africa. PLoS One 7, e41904 (2012).
Article ADS CAS Google Scholar
Soares, M. A. et al. A specific subtype C of human immunodeficiency virus type 1 circulates in Brazil. AIDS 17, 11–21 (2003).
Article CAS Google Scholar
Delatorre, E. et al. Tracing the origin and northward dissemination dynamics of HIV-1 subtype C in Brazil. PLoS ONE 8, e74072 (2013).
Article ADS CAS Google Scholar
Raboni, S. M. et al. Molecular epidemiology of HIV-1 clades in Southern Brazil. Mem. Inst. Oswaldo Cruz https://doi.org/10.1590/S0074-02762010000800015 (2010).
Article PubMed Google Scholar
Delgado, E. et al. Identification of CRF89_BF, a new member of an HIV-1 circulating BF intersubtype recombinant form family widely spread in South America. Sci. Rep. 11, 11442 (2021).
Gräf, T. et al. HIV-1 molecular diversity in Brazil unveiled by 10 years of sampling by the national genotyping network. Sci. Rep. 11, 15842 (2021).
Article ADS Google Scholar
Alves, B. M. et al. Estimating HIV-1 genetic diversity in Brazil through next-generation sequencing. Front. Microbiol. 10, 749 (2019).
Article Google Scholar
Wood, E., Hogg, R. S., Yip, B., Harrigan, P. R. & Montaner, J. S. G. Why are baseline HIV RNA levels 100,000 copies/mL or greater associated with mortality after the initiation of antiretroviral therapy?. J. Acquir. Immune Defic. Syndr. 38, 289–295 (2005).
PubMed Google Scholar
Mellors, J. W. et al. Quantitation of HIV-1 RNA in plasma predicts outcome after seroconversion. Ann. Intern. Med. 122, 573–579 (1995).
Article CAS Google Scholar
Egger, M. et al. Prognosis of HIV-1-infected patients starting highly active antiretroviral therapy: A collaborative analysis of prospective studies. Lancet 360, 119–129 (2002).
Article Google Scholar
Vajpayee, M., Kaushik, S., Sreenivas, V., Wig, N. & Seth, P. CDC staging based on absolute CD4 count and CD4 percentage in an HIV-1-infected Indian population: Treatment implications. Clin. Exp. Immunol. https://doi.org/10.1111/j.1365-2249.2005.02857.x (2005).
Article PubMed PubMed Central Google Scholar
Schneider, E. et al. Revised surveillance case definitions for HIV infection among adults, adolescents, and children aged < 18 months and for HIV infection and AIDS among children aged 18 months to < 13 years–United States, 2008. MMWR. Recomm. Rep. 57, 1–12 (2008).
PubMed Google Scholar
Shankarappa, R. et al. Consistent viral evolutionary changes associated with the progression of human immunodeficiency virus type 1 infection. J. Virol. 73, 10489–10502 (1999).
Article CAS Google Scholar
Ragonnet-Cronin, M. et al. Genetic diversity as a marker for timing infection in HIV-infected patients: evaluation of a 6-month window and comparison with BED. J. Infect. Dis. 206, 756–764 (2012).
Article Google Scholar
Kouyos, R. D. et al. Ambiguous nucleotide calls from population-based sequencing of HIV-1 are a marker for viral diversity and the age of infection. Clin. Infect. Dis. 52, 532–539 (2011).
Article CAS Google Scholar
Andersson, E. et al. Evaluation of sequence ambiguities of the HIV-1 pol gene as a method to identify recent HIV-1 infection in transmitted drug resistance surveys. Infect. Genet. Evol. 18, 125–131 (2013).
Article CAS Google Scholar
Rambaut, A., Lam, T. T., Max Carvalho, L. & Pybus, O. G. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2, vew007 (2016).
Article Google Scholar
Ministério da Saúde (BR) & Secretaria de Vigilância em Saúde. Boletim Epidemiológico HIV/Aids. http://www.aids.gov.br/pt-br/pub/2018/boletim-epidemiologico-hivaids-2018 (2018).
Hué, S., Clewley, J. P., Cane, P. A. & Pillay, D. HIV-1 pol gene variation is sufficient for reconstruction of transmissions in the era of antiretroviral therapy. AIDS 18, 719–728 (2004).
Article Google Scholar
Gräf, T. et al. HIV-1 genetic diversity and drug resistance among treatment naïve patients from Southern Brazil: An association of HIV-1 subtypes with exposure categories. J. Clin. Virol. https://doi.org/10.1016/j.jcv.2011.04.011 (2011).
Article PubMed Google Scholar
de Medeiros, R. M. et al. Co-circulation HIV-1 subtypes B, C, and CRF31-BC in a drug-naïve population from Southernmost Brazil: Analysis of primary resistance mutations. J. Med. Virol. https://doi.org/10.1002/jmv.22188 (2011).
Article PubMed Google Scholar
Da Silva, M. M. G., Telles, F. Q., da Cunha, C. A. & Rhame, F. S. HIV subtype, epidemiological and mutational correlations in patients from Paraná, Brazil. Braz. J. Infect. Dis. 14, 495–501 (2010).
Article Google Scholar
Barbieri, C. L. A. & Couto, M. T. As amas de leite e a regulamentação biomédica do aleitamento cruzado: Contribuições da socioantropologia e da história. Cad. História da Ciência Inst. Butantan VIII, 61–76 (2012).
Article Google Scholar
Langland, V. Expressing motherhood: Wet nursing and human milk banking in Brazil. J. Hum. Lact. 35, 354–361 (2019).
Article Google Scholar
Essex, M. Human immunodeficiency viruses in the developing world. Adv. Virus Res. https://doi.org/10.1016/S0065-3527(08)60343-7 (1999).
Article PubMed Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. https://doi.org/10.1093/molbev/mst010 (2013).
Article PubMed PubMed Central Google Scholar
Pineda-Peña, A. C. et al. Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: Performance evaluation of the new REGA version 3 and seven other tools. Infect. Genet. Evol. https://doi.org/10.1016/j.meegid.2013.04.032 (2013).
Article PubMed Google Scholar
Struck, D., Lawyer, G., Ternes, A. M., Schmit, J. C. & Perez Bercoff, D. COMET: Adaptive context-based modeling for ultrafast HIV-1 subtype identification. Nucleic Acids Res. 42, e144 (2014).
Article Google Scholar
Schultz, A. K. et al. jpHMM: Improving the reliability of recombination prediction in HIV-1. Nucleic Acids Res. https://doi.org/10.1093/nar/gkp371 (2009).
Article ADS PubMed PubMed Central Google Scholar
Siepel, A. C., Halpern, A. L., Macken, C. & Korber, B. T. M. A computer program designed to screen rapidly for HIV type 1 intersubtype recombinant sequences. AIDS Res. Hum. Retroviruses https://doi.org/10.1089/aid.1995.11.1413 (1995).
Article PubMed Google Scholar
Pond, S. L. K. et al. An evolutionary model-based algorithm for accurate phylogenetic breakpoint mapping and subtype prediction in HIV-1. PLoS Comput. Biol. https://doi.org/10.1371/journal.pcbi.1000581 (2009).
Article MathSciNet Google Scholar
Araújo, P. M. M., Martins, J. S. & Osório, N. S. SNAPPy: A snakemake pipeline for scalable HIV-1 subtyping by phylogenetic pairing. Virus Evol. 5, vez050 (2019).
Article Google Scholar
Karlin, S. & Altschul, S. F. Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc. Natl. Acad. Sci. U. S. A. https://doi.org/10.1073/pnas.87.6.2264 (1990).
Article PubMed PubMed Central MATH Google Scholar
Guindon, S. S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Article CAS Google Scholar
Lefort, V., Longueville, J. E. & Gascuel, O. SMS: Smart model selection in PhyML. Mol. Biol. Evol. 34, 2422–2424 (2017).
Article CAS Google Scholar
Drummond, A. J., Suchard, M. A., Xie, D. & Rambaut, A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 29, 1969–1973 (2012).
Article CAS Google Scholar
Ferreira, M. A. R. & Suchard, M. A. Bayesian analysis of elapsed times in continuous-time Markov chains. Can. J. Stat. https://doi.org/10.1002/cjs.5550360302 (2008).
Article MathSciNet MATH Google Scholar
Posada, D. jModelTest: Phylogenetic model averaging. Mol. Biol. Evol. https://doi.org/10.1093/molbev/msn083 (2008).
Article PubMed PubMed Central Google Scholar
Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. https://doi.org/10.1093/sysbio/syy032 (2018).
Article PubMed PubMed Central Google Scholar
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. https://doi.org/10.1093/molbev/msy096 (2018).
Article PubMed PubMed Central Google Scholar
Bielejec, F. et al. Sprea D3: Interactive visualization of spatiotemporal history and trait evolutionary processes. Mol. Biol. Evol. https://doi.org/10.1093/molbev/msw082 (2016).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to acknowledge Juliana Monteiro da Cruz, from the Information Technology Governance of the Brazilian Ministry of Health, Brazil; Alexandre Carvalho from Hospital de Braga, ICVS, School of Medicine, University of Minho; and Jorge Pedrosa from ICVS, School of Medicine, University of Minho, Portugal for their critical contributions in setting up this Project.

Funding

This work has been funded by National funds, through the Foundation for Science and Technology (FCT) - project UIDB/50026/2020 and UIDP/50026/2020. and IF/00474/2014; FCT PhD scholarships PDE/BDE/113599/2015, PD/BD/127827/2016.

Author information

These authors contributed equally: Bernardino Souto, Vera Triunfante and Ana Santos-Pereira.

Authors and Affiliations

Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Braga, Portugal
Bernardino Souto, Vera Triunfante, Ana Santos-Pereira, Joana Martins, Pedro M. M. Araújo & Nuno S. Osório
ICVS/3B’s - PT Government Associate Laboratory, Braga, Guimarães, Portugal
Bernardino Souto, Vera Triunfante, Ana Santos-Pereira, Joana Martins, Pedro M. M. Araújo & Nuno S. Osório
Department of Medicine, Federal University of São Carlos, São Carlos, Brazil
Bernardino Souto

Authors

Bernardino Souto
View author publications
You can also search for this author in PubMed Google Scholar
Vera Triunfante
View author publications
You can also search for this author in PubMed Google Scholar
Ana Santos-Pereira
View author publications
You can also search for this author in PubMed Google Scholar
Joana Martins
View author publications
You can also search for this author in PubMed Google Scholar
Pedro M. M. Araújo
View author publications
You can also search for this author in PubMed Google Scholar
Nuno S. Osório
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.S., V.T., A.S.P., J.M., P.M.M.A. and N.S.O. collected and analyzed the data. V.T., B.S. and N.S.O. prepared the figures and tables. N.S.O. designed the study and wrote the main manuscript text. All authors reviewed the manuscript.

Corresponding author

Correspondence to Nuno S. Osório.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Video 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Souto, B., Triunfante, V., Santos-Pereira, A. et al. Evolutionary dynamics of HIV-1 subtype C in Brazil. Sci Rep 11, 23060 (2021). https://doi.org/10.1038/s41598-021-02428-3

Download citation

Received: 11 May 2021
Accepted: 12 November 2021
Published: 29 November 2021
DOI: https://doi.org/10.1038/s41598-021-02428-3

This article is cited by

Genofunc: genome annotation and identification of genome features for automated pipelining analysis of virus whole genome sequences
- Xiaoyu Yu
BMC Bioinformatics (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.