To generate the long-terminal repeats (LTR) that border the integrated viral genome, two-strand transfer steps must occur during reverse transcription. Analysis of the genetic polymorphisms that are present in the LTR of HIV-1 heterozygous virions in single infection cycle studies has revealed which of the two copies of genomic RNAs is used for each transfer event. Thus, the first event of strand transfer has been described to be either intra- or intermolecular, while the second event is generally intramolecular. Here, we repeated these analyses using sequences from HIV databases and extended the study to the regions surrounding the LTR. We observed a striking correlation between the pattern of recombination in the LTR and the phylogenetic origin of the surrounding sequences. This correlation suggests that the second-strand transfer can be either intra- or intermolecular and, interestingly, could reflect an effect of proximity between nucleic acids that would guide this transfer. This factor could be particularly relevant for heterozygous viruses containing highly divergent genomic RNAs, such as those considered in the present study.
The replication of the retroviral genome requires the conversion of the viral single-stranded genomic RNA to double-stranded DNA by the viral reverse transcriptase (RT)1,2. To achieve this goal, the nascent DNA strands must be transferred at least twice from one region of the template onto another, as schematized in Fig. 1A–I. In particular, two events, named first and second strong stop strand transfer (hereafter referred simply as first and second strand transfer, respectively), are required to achieve the synthesis of a complete proviral DNA molecule with the duplicated long terminal repeated sequences (LTR) that constitute the viral promoter. Additional strand transfer events between the two copies of genomic RNA (gRNA) contained in a viral particle can occur during copying of internal regions of the genomic RNA (copy choice)3. When the two RNAs carry distinct genetic polymorphisms, these strand transfer events can lead to genetic recombination.
The mechanism underlying the generation of LTR has been intensively studied in different retroviruses by exploiting the genetic polymorphisms between the two RNA copies in heterozygous virions to determine which gRNA was used for each event of strand transfer4,5,6. For the human immunodeficiency virus (HIV), studies of a single infection cycle in cell culture have concluded that the first strand transfer can be intra- or intermolecular with comparable frequencies. In the first case, minus-strand strong-stop DNA (-sssDNA) is transferred to the 3′ end of the same gRNA used as the template for its synthesis, while in the second event, -sssDNA is transferred onto the 3′ end of the second copy of gRNA present in the virion. By contrast, the second strand transfer is generally intramolecular (see panels F and G in Fig. 1), indicating that the 5′ ends of the annealing (−) strand DNA and (+) strand DNA PBS sequences originate from the same gRNA strand rather than from an inter-strand interaction.
To address the question of the relative use of the two gRNAs for the generation of the LTR using sequences issued from natural infections, one must rely on the sequences of recombinant viruses available in databases. The region where DNA synthesis switches between the two genomic RNAs, generating the recombinant DNA is defined as a breakpoint. Specifically, a breakpoint corresponds to the region encompassed by the two closest informative sites of each of the parental strains involved in the crossing. As such, its size varies as a function of the density of these sites, with one nucleotide being the smallest size and without an actual upper size limit. In HIV databases, while breakpoints have been mapped along the coding sequences, little is known about the breakpoints in the LTR. However, the increasing numbers of available sequences now also enable the characterization of recombinant LTR from patient isolated strains7,8,9,10,11,12,13,14.
A plethora of HIV recombinants have been identified in the last few decades15,16. HIV is subdivided into two types, HIV-1 and HIV-2. No natural recombinants between these two types have been described so far. HIV-1, the type responsible for the AIDS pandemic, is divided into four groups: M, N, O, and P. Inter-group recombinants have been described between groups M and O in vivo 7,8,9,14,17,18. Within groups, recombinants have largely been documented for group M16,19 and to a lesser extent for group O20. Recombinants found in fewer than three patients are called unique recombinant forms (URFs), while those found in at least three unrelated patients are called circulating recombinant forms (CRFs)21. The present work is based on the analysis of all three classes of recombinants identified so far in natural infections: intra-group M CRFs and URFs, and MO inter-group recombinants (HIV-1/MO). We exploited the sequence divergence that exists in vivo in chimeric genomes with recombinant LTR to verify whether the signature of their mechanisms of generation is consistent with that expected based on the commonly accepted models. To this end, we extended the analysis of the pattern of recombination not only to the LTR but also to the surrounding sequences.
Analysis of the available CRFs, URFs and HIV-1/MO sequences
Starting with the sequences available in databases, we exploited the sequence diversity found in vivo in heterozygous virions to investigate which of the two copies of genomic RNAs is used during first and second strand transfers (Fig. 1B,C and F,G, respectively). This approach requires focusing on recombinants, as these viruses are necessarily generated from heterozygous virions. Because, as mentioned above, we wanted to extend the analysis of the recombination pattern not only to the LTR but also to the surrounding sequences, we decided to exclusively consider the recombinant forms for which the full-length sequence, including the LTR, was available (see Materials and Methods). For the sequences of HIV-1/M CRFs and URFs, data on the complete recombination pattern of 13 HIV-1/M CRFs (subtype G22, CRF03_AB, CRF08_BC, CRF12_BF, CRF26_AU, CRF31_BC, CRF32_06/A1, CRF42_BF, CRF45_cpx, CRF60_BC, CRF71_BF, CRF87_cpx, CRF88_BC) and that of 5 URFs (URF_SHI10, URF_CRN11, URF_AUK11, URF_WEI13, URF_GUI12) were available. Concerning CRF08, CRF12, CRF42 and URF_CRN, the precise characterization of the recombination pattern in the region encompassing the LTR and the matrix-coding sequence (LTR-MA) was missing from the databases. We therefore completed the recombination analysis in this region using SimPlot23 (Fig. 2A,B). For MO recombinants, nine fully sequenced recombinants were considered (see Materials and Methods). For seven of these recombinants (RBF2088, YBF274, REC107, BCF212, RBF222, RBF237 and RBF240), the precise characterization of the recombination pattern in nef and the LTR-MA region was also lacking. These recombinants were therefore reanalysed here (Fig. 2C). Altogether, 27 full-length recombinant sequences were considered, including 13 CRFs, 5 URFs and 9 HIV-1 intergroup recombinants from groups M and O (Supplementary Table S1).
Pattern of recombination in fully sequenced CRFs, URFs and HIV-1/MO recombinants
In heterozygous viruses, the synthesis of the LTR results in well-defined patterns of recombination. Intermolecular first strand transfer generates recombinant LTR with a breakpoint at the border between U3 and R (U3/R junction), while intramolecular first strand transfer leads to LTR without breakpoints. For second strand transfer, which occurs in the primer binding sequence (PBS), intermolecular transfer creates a breakpoint between U5 and the 5′ portion of gag (U5/gag junction). Intramolecular second strand transfer, instead, generates LTR without breakpoints. For the analysis of the generation of the LTR we therefore focused on the breakpoints located at the U3/R and U5/gag junctions. To obtain a more comprehensive view of the dynamics underlying the use of the two gRNAs during the generation of the LTR, we also analysed the pattern of recombination in the rest of the genome, focusing on the phylogenetic origins of the regions adjacent to the LTR: the nef and gag genes. These sequences are indicative, respectively, of which RNA was used for the first strand transfer and on which RNA (−) DNA synthesis was achieved before the second strand transfer.
We observed a striking trend when the proviruses were divided into those presenting phylogenetically matching gag and nef (“matching gag-nef” proviruses, 12 CRFs, 3 URFs and 4 HIV-1/MO) and those in which these sequences are phylogenetically discordant (“discordant gag-nef” proviruses, 1 CRF, 2 URFs and 5 HIV-1/MO). Indeed, the pattern of the breakpoints in the LTR markedly differs between the two classes (Table 1 and Fig. 3). Among 19 matching gag-nef recombinants, only one recombinant presented a breakpoint in the LTR, and its location is indicative of generation by copy choice within U3 (Fig. 3). By sharp contrast, 8/8 of the discordant gag-nef proviruses presented breakpoints in the LTR. Specifically, except for one breakpoint the pattern of which suggests its generation by copy choice within the R region, the other 7 breakpoints were located at the U3/R junction (Fig. 3). This distribution is significantly unequal (p < 10−4), suggesting a link between the positions of the breakpoints in the LTR and the phylogenetic origin of their surrounding sequences. As the phylogenetic origin of the different portions of the proviral DNA is indicative of which gRNA was used for reverse transcription in heterozygous virions, we discuss below how this observation can be exploited to determine whether a mechanistic relationship exists between the LTR and the surrounding regions.
The generation of retroviral LTR requires two strand transfer events, called first and second strand transfers. These events can be either intra- or intermolecular between the two copies of gRNA present in the viral particle. Inter and intramolecular transfers generate precise patterns of recombination in heterozygous virions. Proviruses with a recombination breakpoint at the U3/R junction are generally attributed to intermolecular first strand transfer, while intramolecular first strand transfer yields LTR without breakpoints. As intra- and intermolecular first strand transfer occur at comparable rates in cell culture5,6, both types of LTR (with a U3/R breakpoint and without breakpoints) are expected to be identified in databases. Although we consistently detect both types of LTR, their distribution is unexpectedly unequal when the proviruses are grouped according to the phylogenetic origin of the sequences adjacent to the LTR (gag and nef). Indeed, LTR containing a U3/R breakpoint are exclusive to proviruses with phylogenetically discordant gag-nef sequences, while all LTR without breakpoints are observed among proviruses with phylogenetically matching gag-nef. How can this phenomenon be explained if the first strand transfer follows an intra- or intermolecular pathway on a stochastic basis?
One potential explanation is selection occurring on the recombinant LTR. As the present analysis was conducted on sequences retrieved from patients, the recombinants most likely emerged from selection for the most adapted forms in vivo. It is therefore possible that selective pressure erases all signatures of the mechanism of generation of the recombinant LTR. However, this explanation suggests that the presence of a breakpoint at the U3/R junction favours viruses with discordant gag and nef sequences. Intermolecular first strand transfer, generating U3/R breakpoints, would uncouple the TAR element in R from the Tat coding sequence located near the 3′ end of the genome. As the function of Tat is exerted through binding to TAR, the phylogenetic concordance could influence viral fitness; nevertheless, this factor is unrelated to the concordance or discordance of gag and nef. Consistently, the phylogenetic concordance of the Tat/TAR pair was independent of the phylogenetic concordance of the gag-nef pair (Supplementary Table S2). Another possibility, based on selection, could be that the recruitment of Gag to the plasma membrane is increased when Nef and Gag are phylogenetically concordant. Although Nef improves Gag intracellular trafficking, this effect is not exerted through a direct interaction between the two proteins24. Therefore, the phenomenon should not be influenced by the phylogenetic concordance between gag and nef.
Another possible explanation is that the bias is generated through the mechanism of the first strand transfer. This would imply that intermolecular transfer favours the subsequent occurrence of an odd number of internal template switching, while intramolecular transfer leads to either no or an even number of internal template switching (Supplementary Figure S1). However, although intermolecular first strand transfer increases the frequency of internal template switching by negative interference25, no correlation between an odd or an even number of strand transfer events and the nature of the first strand transfer has been reported.
Alternatively, the biased presence of U3/R breakpoints could reflect the fact that these breakpoints are not generated during first strand transfer but are a consequence of the second strand transfer. In this case, the unequal distribution would result from an effect of proximity between the 5′ end of the genomic RNA displaced by the synthesis of + sssDNA and the + sssDNA itself (see panels E-G of Fig. 1). This proximity would guide the second strand transfer and, if gag is copied into (−) DNA using the same RNA on which (-) DNA synthesis was primed (Fig. 4A, step i), the second strand transfer would preferentially be intramolecular (Fig. 4A, steps ii-iv). This generates LTR without breakpoints and matching gag-nef proviruses (Fig. 4A, steps v-vii). If, instead, copying of gag is completed on the other gRNA (Fig. 4B, step i), the effect of proximity would favour intermolecular second strand transfer (Fig. 4B, steps ii-iv), resulting in discordant gag-nef proviruses that, at the next infectious cycle, would generate a breakpoint at the U3/R border (Fig. 4B, steps v-vii). U3/R breakpoints and discordant gag-nef proviruses would thus be associated. Palindromic sequences in the HIV-1 PBS, which have been shown to fold into small hairpins26,27, could favour the proximity of the 5′ end of the genomic RNA displaced by the synthesis of + sssDNA and the + sssDNA itself, as schematically indicated in Fig. 5.
Several factors may be responsible for the discrepancies between the present observations and the expectations from mechanistic studies in cell culture. By attributing the generation of the U3/R breakpoints to the second strand transfer in the analysed proviruses, we indirectly implied that in these cases, the first strand transfer is intramolecular. The genetic divergence between the sequences considered here could be a major reason for this observation. As intermolecular first strand transfer is disfavoured by sequence divergence in R28,29, which is higher between the reference sequences used in the present study (5.7% intra M, 3.9% intra O and 24.9% between M and O) than between the isolates used in cell culture experiments (1%5,6), this could explain the discrepancy between the observations published previously and ours.
The likelihood that the U3/R breakpoints identified here are generated through intermolecular second strand transfer, in contrast to the general model, is also supported by the following consideration. Depending on which RNA (−) DNA synthesis is achieved, intramolecular second strand transfer should generate, or not, a breakpoint at the U5/gag junction (Supplementary Figure S2). Since, due to frequent recombination in internal regions of the genome, (−) DNA synthesis should be stochastically achieved on one gRNA or on the other, breakpoints at the U5/gag junction should also occur on a stochastic basis, if the second strand transfer is always intramolecular. However, no U5/gag breakpoints were identified among the 27 analysed proviruses. Nevertheless, this difference between the observations made in the present study and those made in previous studies in single infection cycle systems could partly reflect a disequilibrium of the RT initiation on both gRNAs. Indeed, for intermolecular second strand transfer to occur, reverse transcription must begin on both types of gRNAs and + sssDNA must be correctly generated on the second gRNA. Yu and colleagues5 showed that the initiation of reverse transcription was three times less frequent on the second gRNA, potentially precluding the frequent generation of ( + sssDNA on this molecule. How frequently + sssDNA is generated on both gRNAs in vivo is unknown, but this factor likely influences the nature of the second strand transfer.
In conclusion, the results of the present study underscore an unexpected correlation between two features of the sequence of recombinant proviral DNA identified in patients, suggesting that the choice of the templates used for the generation of the LTR could also follow alternative pathways with respect to the well-established pathways. These alternative pathways could be relevant when considering heterozygous virions involving considerably divergent phylogenetic strains in vivo.
Materials and Methods
Eleven full-length CRF sequences were retrieved from the Los Alamos HIV sequence database (www.hiv.lanl.gov/). Five full-length URF sequences were searched in the GenBank with the key words “HIV URF” or “HIV recombinant full sequence”, and the 200 first hits were screened. Lots of sequences were full-length in the coding region but not in the LTRs and were therefore discarded. Eight full-length sequence HIV-1/MO recombinants were retrieved from the literature (97CA-MP6457, RBF2088, REC0249, YBF27414) or from the database of the virology unit at Rouen University Hospital (REC107, BCF212, RBF222 and RBF237; sequences in process of publication). Strains that were already described in publications and in LANL were not further analysed. All the strains used in this study are presented in Supplementary Table S1.
To study the phylogenetic relationships and the recombination patterns of the HIV-1/MO intergroup recombinants as well as the HIV-1/M intragroup recombinants (CRFs and URFs), a panel of HIV-1/M reference strains was generated with sequences from HIV-1/M pure subtypes, CRF01_AE and CRF02_AG, as well as HIV-1/O and HIV-1/P sequences, gathered from the LANL sequence database and aligned with MEGA 730. In this study, we considered that subtype G is a recombinant and CRF02 a pure subtype, as demonstrated by Abecasis et al. in 200722. Very few sequences were complete in the LTRs. Namely no sub-subtype A2 and F2, and no subtypes H, J and K were available (Supplementary Table S3).
Study of the mosaic structure of the HIV recombinants
Three CRFs (CRF08, CRF12, CRF42) from the LANL HIV database were not fully described in the LTR. One URF (URF_CRN) was fully described but one breakpoint, located between U5 and the beginning of gag was not precisely defined. We thus retrieved the full sequence of these four viruses for further analysis.
The coding sequence of these four strains was computed with jpHMM software31 (http://jphmm.gobics.de) to generate and verify the mosaic structure in their coding sequence (Fig. 2A). Moreover, for each of the four strains, the sequence going from the 5′ border of the LTR till the end of the MA coding sequence (LTR-MA) was aligned with the panel of selected HIV-1/M reference strains to study the precise recombination pattern in this region. The alignment was gap-stripped and the SimPlot software23 was used to perform similarity plot (SP), informative sites (IS) and BootScan (BS) analyses (Fig. 2B). For each recombinant, the SP analysis was performed using HIV-1/M subtype reference sequences corresponding to the strains whose sequences were present in the coding region of the recombinants (for instance, subtype B and C for CRF08_BC), with parameters set to 80-nt window/10-nt steps and a Kimura 2-parameter method with a transition-transversion (Ts/Tv) ratio of 2.0. For informative sites (IS) and BootScan (BS) analysis, a third HIV-1/M subtype was taken as out-group. The IS analysis was run when a cross-over was suspected using SP and consisted in determining the number of signature polymorphisms for each subtype involved in the cross-over. The distribution of the informative sites flanking the cross-over was then tested for statistical significance sites using the Yates-corrected X2 test, as already described32,33. The BS analysis was then run to confirm the location of the breakpoint.
Concerning HIV-1/MO recombinants, the non-coding sequence of the RBF208 HIV-1/MO recombinant8 was not described and HIV-1/MO recombinants (YBF274, REC107, BCF212, RBF222, RBF240 and RBF237) are not yet published. For these recombinants, the nef coding sequence and the LTR-MA region were analysed. For each recombinant, the part of the nef coding sequence that does not overlap with U3 was concatenated with the LTR-MA sequence into a nef-LTR-MA sequence (see Supplementary sequences). The nef-LTR-MA sequences were aligned with the reference panel with MEGA 730, the alignment was gap-stripped and a recombination analysis was performed with the SimPlot software23 as described for HIV-1/M CRFs and URFs. For each recombinant, the SimPlot analysis was performed using HIV-1/O reference sequences and the HIV-1/M subtype reference sequences corresponding to the strain whose sequence was present in the coding region of the recombinants (for instance, subtype D for RBF208, which is a recombinant between HIV-1/O and HIV-1/M subtype D). For informative sites and BootScan analysis, HIV-1/P was taken as out-group.
Statistical analyses were carried out with Prism 6 software. The Yates-corrected chi-square test was used to determine whether significant evidence of recombination existed using informative site (α = 0.01). The Fisher’s exact test was used to compare the distribution of the LTR breakpoints between the matching gag-nef and discordant gag-nef categories and the association of phylogenetic matching Tat and TAR as function of the concordance of gag and nef (α = 0.01, two-tailed p-value).
Baltimore, D. & RNA-dependent, D. N. A. polymerase in virions of RNA tumour viruses. Nature 226, 1209–1211 (1970).
Temin, H. M. & Mizutani, S. RNA-dependent DNA polymerase in virions of Rous sarcoma virus. Nature 226, 1211–1213 (1970).
Vogt, P. K. Genetically stable reassortment of markers during mixed infection with avian tumor viruses. Virology 46, 947–952 (1971).
Panganiban, A. T. & Fiore, D. Ordered interstrand and intrastrand DNA transfer during reverse transcription. Science 241, 1064–1069 (1988).
Yu, H., Jetzt, A. E., Ron, Y., Preston, B. D. & Dougherty, J. P. The nature of human immunodeficiency virus type 1 strand transfers. Journal of Biological Chemistry 273, 28384–28391 (1998).
van Wamel, J. L. & Berkhout, B. The first strand transfer during HIV-1 reverse transcription can occur either intramolecularly or intermolecularly. Virology 244, 245–251 (1998).
Peeters, M. et al. Characterization of a highly replicative intergroup M/O human immunodeficiency virus type 1 recombinant isolated from a Cameroonian patient. Journal of Virology 73, 7368–7375 (1999).
Vessière, A. et al. First evidence of a HIV-1 M/O recombinant form circulating outside Cameroon. AIDS 24, 1079–1082 (2010).
Ngoupo, P. A. et al. First evidence of transmission of an HIV-1 M/O intergroup recombinant virus. AIDS 1 https://doi.org/10.1097/QAD.0000000000000880 (2016).
Shi, B. et al. Construction of an infectious HIV type 1 molecular clone from an African patient with a subtype D/C Recombinant Virus. AIDS Research and Human Retroviruses 20, 1015–1018 (2004).
Frange, P. et al. New and old complex recombinant HIV-1 strains among patients with primary infection in 1996–2006 in France: the French ANRS CO06 primo cohort study. Retrovirology 5, 69 (2008).
Gui, T. et al. Genetic Characterization of a Unique Recombinant Originating from CRF55_01B, CRF01_AE, and CRF07_BC in Shenzhen, China. AIDS Research and Human Retroviruses 31, 559–563 (2015).
Wei, H. et al. Near full-length genomic characterization of a novel HIV type 1 CRF07_ BC/01_AE recombinant in men who have sex with men from Sichuan, China. AIDS Research and Human Retroviruses 29, 1173–1176 (2013).
De Oliveira, F. et al. Multiple HIV-1/M+HIV-1/O dual infections and new HIV-1/MO inter-group recombinant forms detected in Cameroon. Retrovirology 14, 1 (2017).
Ramirez, B. C., Simon-Loriere, E., Galetto, R. & Negroni, M. Implications of recombination for HIV diversity. Virus Research 134, 64–73 (2008).
Hemelaar, J. Implications of HIV diversity for the HIV-1 pandemic. Journal of Infection 66, 391–400 (2013).
Takehisa, J. et al. Human immunodeficiency virus type 1 intergroup (M/O) recombination in cameroon. Journal of Virology 73, 6810–6820 (1999).
Yamaguchi, J. et al. HIV infections in northwestern Cameroon: identification of HIV type 1 group O and dual HIV type 1 group M and group O infections. AIDS Research and Human Retroviruses 20, 944–957 (2004).
Lau, K. A. & Wong, J. J. L. Current Trends of HIV Recombination Worldwide. Infect Dis Rep 5, e4 (2013).
Roques, P. et al. Phylogenetic analysis of 49 newly derived HIV-1 group O strains: high viral diversity but no group M-like subtype structure. Virology 302, 259–273 (2002).
Robertson, D. L. et al. HIV-1 nomenclature proposal. Science 288, 55–56 (2000).
Abecasis, A. B. et al. Recombination confounds the early evolutionary history of human immunodeficiency virus type 1: subtype G is a circulating recombinant form. Journal of Virology 81, 8543–8551 (2007).
Lole, K. S. et al. Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. Journal of Virology 73, 152–160 (1999).
Malbec, M. et al. HIV-1 Nef promotes the localization of Gag to the cell membrane and facilitates viral cell-to-cell transfer. Retrovirology 10, 80 (2013).
Hu, W. S., Bowman, E. H., Delviks, K. A. & Pathak, V. K. Homologous recombination occurs in a distinct retroviral subpopulation and exhibits high negative interference. Journal of Virology 71, 6028–6036 (1997).
Isel, C., Ehresmann, C., Keith, G., Ehresmann, B. & Marquet, R. Initiation of reverse transcription of HIV-1: secondary structure of the HIV-1 RNA/tRNA(3Lys) (template/primer). Journal of Molecular Biology 247, 236–250 (1995).
Johnson, P. E. et al. A mechanism for plus-strand transfer enhancement by the HIV-1 nucleocapsid protein during reverse transcription. Biochemistry 39, 9084–9091 (2000).
Berkhout, B., Vastenhouw, N. L., Klasens, B. I. & Huthoff, H. Structural features in the HIV-1 repeat region facilitate strand transfer during reverse transcription. RNA 7, 1097–1114 (2001).
Beerens, N. & Kjems, J. Circularization of the HIV-1 genome facilitates strand transfer during reverse transcription. RNA 16, 1226–1235 (2010).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Molecular Biology and Evolution 33, 1870–1874 (2016).
Schultz, A.-K. et al. jpHMM: improving the reliability of recombination prediction in HIV-1. Nucleic Acids Research 37, W647–51 (2009).
Rousseau, C. M. et al. Extensive intrasubtype recombination in South African human immunodeficiency virus type 1 subtype C infections. Journal of Virology 81, 4492–4500 (2007).
Foster, G. M. et al. Novel HIV-1 recombinants spreading across multiple risk groups in the United Kingdom: the identification and phylogeography of Circulating Recombinant Form (CRF) 50_A1D. PLoS ONE 9, e83337 (2014).
Work in MN’s and JCP’s labs was supported by grant from Sidaction. PC was recipient of a fellowship from Sidaction and from the ANRS. We are grateful to Daniela Lener and Roland Marquet for helpful suggestions and critical reading of the manuscript.
The authors declare that they have no competing interests.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
About this article
Cite this article
Cappy, P., Moisan, A., De Oliveira, F. et al. HIV-1 sequences in the epidemic suggest an alternative pathway for the generation of the Long Terminal Repeats. Sci Rep 7, 13715 (2017). https://doi.org/10.1038/s41598-017-14135-z