Abstract
The host genetic landscape surrounding integrated HIV-1 has an impact on the fate of the provirus. Studies analysing HIV-1 integration sites in macrophages are scarce. We studied HIV-1 integration site patterns in monocyte-derived macrophages (MDMs) and activated CD4+ T cells derived from seven antiretroviral therapy (ART)-treated HIV-1-infected individuals whose cells were infected ex vivo with autologous HIV-1 isolated during the acute phase of infection. A total of 1,484 unique HIV-1 integration sites were analysed. Their distribution in the human genome and genetic features, and the effects of HIV-1 integrase polymorphisms on the nucleotide selection specificity at these sites were indistinguishable between the two cell types, and among HIV-1 isolates. However, the repertoires of HIV-1-hosting gene clusters overlapped to a higher extent in MDMs than in CD4+ T cells. The frequencies of HIV-1 integration events in genes encoding HIV-1-interacting proteins were also different between the two cell types. Lastly, HIV-1-hosting genes linked to clonal expansion of latently HIV-1-infected CD4+ T cells were over-represented in gene hotspots identified in CD4+ T cells but not in those identified in MDMs. Taken together, the repertoire of genes targeted by HIV-1 in MDMs is distinct from and more restricted than that of CD4+ T cells.
Similar content being viewed by others
Introduction
The human immunodeficiency virus-1 (HIV-1) mainly infects cells that express CD4 and the co-receptor CCR5 or CXCR4, i.e. CD4+ T cells and macrophages1,2,3,4,5,6. The phenotypes as a result of an HIV-1 infection in macrophages differ considerably from those of CD4+ T cells. For instance, macrophages are more resistant to HIV-1 cytopathic effects7,8. Specialized macrophages are widely distributed in various tissues, including anatomical sanctuaries. Indeed, HIV-1 has been detected in microglial cells of the central nervous system9,10,11. The involvement of HIV-1-infected macrophages, especially in the central nervous system, leading to the pathogenesis of HIV-1-associated diseases has been shown in various studies (reviewed in12).
Although it is conceivable that the different cellular physiology of macrophages in contrast to that of CD4+ T cells may contribute to the observed difference in productivity of the HIV-1 provirus in these two cell types, a multitude of studies, both in vivo and in vitro, have shown that the host genetic landscape surrounding the provirus does have an impact on the subsequent fate of the provirus. Earlier integration site studies have established the host genetic requirements for HIV-1 integration: (i) HIV-1 favours integration into active transcription units in gene-dense regions but disfavours centromeric alphoid repeats and human endogenous retrovirus (HERV) elements13,14,15, (ii) Alu repetitive elements are over-represented at HIV-1 integration sites14,15,16,17, (iii) HIV-1 provirus is flanked by the duplicated weakly conserved palindromic sequence 5′–GT(A/T)AC–3′13,15,17 and (iv) HIV-1 does not have a preference in transcription orientation relative to the gene into which it is integrated14,17.
Studies of HIV-1 integration site patterns in macrophages are scarce. This is partly due to the number of HIV-1-infected macrophages that can be recovered from blood samples of HIV-1-infected individuals is too low for HIV-1 integration site analysis, and obtaining them from other tissues is too invasive. Hence, ex vivo infection of primary monocyte-derived macrophages (MDMs) is a practical solution. The study of Barr et al.18 examined HIV-1 integration site patterns in MDMs and compared them with those in CD4+ T cells obtained from other studies. They observed that HIV-1 integration events are found more often in long intergenic regions and, correspondingly, less often in gene-dense regions in MDMs compared to CD4+ T cells. In the study of Wellensiek et al.19, although they had examined HIV-1 integration site patterns in MDMs and CD4+ T cells, they focussed on the differences between adult and neonatal cells of each cell type rather than between cell types. To date, a systematic comparison of HIV-1 integration site patterns between MDMs and CD4+ T cells derived from the same experimental settings has never been done.
Here, we extended the characterization of HIV-1 integration site patterns in MDMs and CD4+ T cells in an experimental setting that mimics the in vivo situation. We used MDMs and CD4+ T cells derived from ART-treated HIV-1-infected individuals and infected these cells ex vivo with the individuals’ autologous HIV-1 isolated during the acute phase of their HIV-1 infection. We focussed on other aspects of HIV-1 integration site patterns, e.g. structural and functional gene clusters formed by HIV-1-hosting genes and the frequencies of HIV-1 integration events in certain gene subsets, and compared these features between the two cell types. We also examined the effects of two naturally occurring amino acid polymorphisms in the HIV-1 integrase on its nucleotide selection specificity at HIV-1 integration sites in primary cells. We showed that the repertoire of genes targeted by HIV-1 in MDMs was distinct and more restricted than that of CD4+ T cells.
Results
Characteristics of study participants and their autologous primary HIV-1 isolates
At the time of autologous HIV-1 isolation, the study participants were infected with HIV-1 for an estimated 15–153 days (Supplementary Table S1). All HIV-1-infected individuals are enrolled in the Zurich Primary HIV Infection (ZPHI) study20 and six of seven HIV-1-infected individuals started early ART during the acute phase of infection. The replication capacities of these primary HIV-1 isolates in MDMs were most comparable to that of HIV-1JR-FL, a macrophage-tropic strain. The replication capacity of primary HIV-1 isolate 6 in MDMs was consistently the highest among the seven primary HIV-1 isolates (Supplementary Figure S1). All primary HIV-1 isolates were also replication competent in CD4+ T cells (Supplementary Figure S2).
At the time of MDMs and CD4+ T cell isolation from HIV-1-infected individuals, viral suppression ranged from 2.8–7.8 years (mean = 4.7 yr) (Supplementary Table S1). These cells were then infected ex vivo with autologous primary HIV-1 isolates or heterologous HIV-1 isolates. The non-restrictive linear amplification-mediated polymerase chain reaction (nrLAM-PCR)21 was used to amplify the 5′ HIV-1 integration junctions, which were analysed using our new Integration Site Analysis Pipeline (InStAP) (Supplementary Figure S3). Prior to these experiments, CD4+ T cells from three ART-treated HIV-1-infected individuals were activated and monitored for HIV-1 outgrowth. Reactivation of HIV-1 from latently infected cells did not occur within 7–8 days (data not shown). Thus, we are certain that the HIV-1 integration sites amplified from CD4+ T cells after two days of activation and a further two days of ex vivo infection were the result of these ex vivo infections. More importantly, no HIV-1 integration sites were found in mock infected cell samples of HIV-1-infected individuals.
The basic genetic requirements for HIV-1 integration were similar among individual HIV-1 isolates and between MDMs and CD4+ T cells
We obtained a total of 1,484 unique HIV-1 integration sites, derived from the seven selected HIV-1-infected individuals whose MDMs (n = 987) and CD4+ T cells (n = 497) were infected ex vivo with autologous primary HIV-1 isolates and heterologous HIV-1 isolates (Supplementary Table S2). Of these, 783 (79.3%) and 431 (86.7%) integration sites were found within genes in MDMs and CD4+ T cells, respectively. HIV-1 integration sites were also found mainly in gene-dense regions and away from centromeres, Giemsa dark regions, and the p arms of acrocentric chromosomes in both MDMs and CD4+ T cells (Fig. 1). The genic distribution and genetic profiles of HIV-1 integration sites were consistent with previous studies14,18,22, and they were indistinguishable between MDMs and CD4+ T cells, as well as between pooled autologous and heterologous HIV-1 isolates (Supplementary Figure S4). These basic genetic requirements for HIV-1 integration were also similar among individual HIV-1 isolates (Supplementary Figure S5).
Amino acid residue polymorphisms in HIV-1 integrase altered its nucleotide selection specificity in primary cells but not the genetic patterns of HIV-1 integration sites
Recent reports have identified the amino acid residues in the HIV-1 integrase that dictate the nucleotide selection specificity at integration sites, i.e. S119, T122, R231, and K258, in cell lines23,24. Since the full-length genomes of all well characterized primary HIV-1 isolates were sequenced, we could analyse these amino acid residues of our seven primary HIV-1 isolates and two clonal HIV-1 strains. Six HIV-1 isolates had the most common amino acid residues in all four positions. Primary HIV-1 isolates 2 and 6 exhibited a polymorphism at positions 122 (T122I) and 119 (S119P), respectively. These two polymorphisms were present concurrently in primary HIV-1 isolate 5 (Supplementary Figure S6). We investigated the effects of S119P and T122I polymorphisms on the nucleotide selection specificity at HIV-1 integration sites independently by analysing the 20 nucleotide positions immediately upstream of HIV-1 integration junctions (nucleotide position 0) in both MDMs and CD4+ T cells.
We analysed two nucleotide positions that were previously reported to be altered by the S119P HIV-1 integrase variant23. S119P resulted in a significantly increased preference for adenines at positions −8 (p < 0.0001) and −9 (p < 0.01) in both MDMs and CD4+ T cells (Fig. 2a), consistent with previous report23. T122I significantly decreased the preference of HIV-1 for thymines at position −8 in MDMs (p < 0.0001) (Fig. 2b), similar to the previously reported T122K23. Nonetheless, a consistent decrease in preference of HIV-1 for thymines at positions −4 and −8 was observed in both MDMs and CD4+ T cells (Fig. 2b). Primary HIV-1 isolate 5, which exhibited both polymorphisms in its integrase, could not be analysed due to the lack of sufficient HIV-1 integration sites. Both polymorphisms (S119P and T122I) dictated the nucleotide selection specificity at HIV-1 integration sites in primary cells.
The repertoire of HIV-1-hosting genes in MDMs was more restricted and distinct from that of CD4+ T cells
Next, we assessed whether the annotated features of HIV-1-hosting genes, e.g. involvement in cellular and molecular processes, were different between the two cell types using the Database for Annotation, Visualization and Integrated Discovery (DAVID)25,26. The majority of HIV-1-hosting gene clusters over-represented in the human genome had annotations associated with structural motifs and domains of DNA and proteins in both cell types (Supplementary Figure S7). However, these gene clusters were more diverse in CD4+ T cells, which was also the case in most other categories of gene clusters (data not shown). MDMs had more over-represented gene clusters that were associated with protein metabolism, regulation, and transport whereas the proportion of over-represented gene clusters associated with a variety of other cellular processes, e.g. regulation of apoptosis, was higher in CD4+ T cells (Supplementary Figure S7). There were also gene clusters that were observed exclusively in either cell type, e.g. deubiquitination and regulation of axonogenesis in MDMs whereas T cell receptor signalling and cell cycle progression in CD4+ T cells (data not shown), which concordantly reflected the cellular and molecular functions of the corresponding cell types.
As the diversity of structural and functional HIV-1-hosting gene clusters appeared to be higher in CD4+ T cells, we investigated further the extent to which these gene clusters overlap between cell types in order to estimate the relative size of the repertoire of genes accessible to HIV-1 for integration. For this purpose, we analysed the top 40 gene clusters of each data set, including both over-represented and not over-represented gene clusters in the human genome. We scored gene clusters having the same or highly similar annotations between data sets, and observed that the gene cluster overlap was the highest (47.5%) between MDM/HIV-1autologous and MDM/HIV-1heterologous, and decreased by 7.5–15% between MDMs and CD4+ T cells. As expected, between CD4+ T cell/HIV-1autologous and CD4+ T cell/HIV-1heterologous, the extent of overlap was lower (32.5%) (Table 1). This apparent trend of gene cluster overlaps was reproducible when we included HIV-1 integration site data sets from two other studies: the one other publicly available MDM data set (n = 754)18 and another CD4+ T cell data set (n = 534)27 (Table 1). Furthermore, to confirm the quality and comparability among data sets, we used a human T-lymphotropic virus-1 (HTLV-1) integration site data set from the study of Meekings et al.28 as a denominator control (Table 1). All six HIV-1 integration site data sets overlapped to similarly low extents with that of HTLV-1.
Due to the unequal number of HIV-1 integration sites recovered from each HIV-1-infected individual’s sample infected ex vivo, the observed results could have been driven mainly by samples of two individuals with the highest number of recovered HIV-1 integration sites, i.e. individuals 1 and 6 (Supplementary Table S2). To address this issue, we performed two analyses: (i) the proportion of HIV-1 integration sites forming the top 40 gene clusters were compared with the proportion of HIV-1 integration sites recovered from each HIV-1-infected individual’s sample and (ii) gene clusters formed with data from individuals 1 or 6 only, and when removed, were compared with the entire corresponding data sets. A high correlation (R2 = 0.9774) was observed between the proportion of HIV-1 integration sites forming the top 40 gene clusters and that of HIV-1 integration sites recovered from each HIV-1-infected individual’s sample, indicating that each individual’s sample contributed towards the formation of the top 40 gene clusters, and the extent of their contributions were concordant with the proportion of HIV-1 integration sites recovered (Supplementary Figure S8). Additionally, data sets containing only HIV-1 integration sites derived from HIV-1-infected individuals 1 or 6 versus data sets without these HIV-1 integration sites overlapped with the entire corresponding data set to comparable levels (Supplementary Table S3), demonstrating our observations were neither HIV-1-infected individual-specific nor skewed by these two individuals who had the highest number of recovered HIV-1 integration sites.
Using the National Centre for Biotechnology Information (NCBI) database of published protein interactions between HIV-1 and its cellular host29, we investigated the frequency of HIV-1 integration events in a subset of genes encoding HIV-1-interacting proteins. We hypothesized that this frequency would be different between MDMs and CD4+ T cells given the lower HIV-1-hosting gene cluster overlap between them (Table 1). Here, we expanded our analysis and included a total of 11 HIV-1 integration site data sets: two MDM data sets (n = 987) and two CD4+ T cell data sets (n = 497) derived from this study and one other MDM data set (n = 754) and six other CD4+ T cell data sets (n = 7,557) obtained elsewhere14,18,27,30,31,32,33 (Table 2). We observed an increase of HIV-1 integration events in genes encoding HIV-1-interacting proteins in MDMs (1.2×) and a significant increase in CD4+ T cells (1.4×, p < 0.001) compared to three sets of 5,000 random genes generated in silico (Fig. 3). As the CD4+ T cell data sets were derived from both activated and resting states, we further tested if the difference in cellular activation states in CD4+ T cells would have an influence on the frequency of HIV-1 integration into genes encoding HIV-1-interacting proteins; no significant difference was observed (data not shown). Though quantitatively modest, there was a significant difference (p < 0.05) between MDMs and CD4+ T cells (Fig. 3).
Given the consistent trend of gene cluster overlaps, a rigorous assessment to show the robustness of our analysis, and a significantly different (p < 0.05) over-representation of HIV-1 integration events in genes encoding HIV-1-interacting proteins between MDMs and CD4+ T cells, we demonstrated that the repertoires of genes accessible to HIV-1 for integration in MDMs and CD4+ T cells were distinct and that of MDMs’ was more restricted.
Genes linked to clonal expansion of latently HIV-1-infected CD4+ T cells were targeted more frequently in CD4+ T cells but not monocyte-derived macrophages
Expansion of the viral reservoir by means of clonal expansion of latently HIV-1-infected CD4+ T cells harbouring HIV-1 proviruses in genes of certain functional nature has recently been suggested27,30,32. These genes are referred to as genes linked to clonal expansion of latently HIV-1-infected CD4+ T cells here.
We compiled genes linked to clonal expansion of latently HIV-1-infected CD4+ T cells identified in two studies27,30 and collapsed them into a total of 196 unique genes. Next, we divided each HIV-1 integration site data set into two groups; one containing unique protein-coding genes in which HIV-1 was found ≥2 times, i.e. gene hotspots (Fig. 4) and the other, only once (Fig. 5a). We included seven HIV-1 integration site data sets for this analysis: two derived from this study (data sets of the same cell type combined; n = 1,484) and one other MDM data set (n = 754) and four other CD4+ T cell data sets (n = 4,613) obtained elsewhere14,18,31,32,33 (Table 2),
By comparing the frequencies of HIV-1-hosting genes linked to clonal expansion of latently HIV-1-infected CD4+ T cells in gene hotspots and genes with single HIV-1 integration events, we observed a highly significant increase of genes linked to clonal expansion of latently HIV-1-infected CD4+ T cells in gene hotspots in CD4+ T cells (2.5×, p < 0.001), but not in MDMs (Fig. 5b), showing again that the repertoires of genes accessible to HIV-1 for integration in the two cell types were distinct.
Gene hotspots identified for HIV-1 integration
We identified a total of 75 (9.5%) and 33 (7.7%) genes in which HIV-1 was found ≥2 times in MDMs and CD4+ T cells, respectively (Fig. 4). Furthermore, ~70% of these genes were identified in ≥2 HIV-1-infected individuals. As the probability of getting the same gene ≥2 times in multiple data sets was low34, these genes were considered as hotspots for HIV-1 integration here. Three gene hotspots were identified ≥4 times in both cell types, i.e. MECP2, KANSL1, and UBE2G1, and they were also listed in other studies (Table 3). In most cases, HIV-1 proviruses integrated into the same introns (Table 3), and they spanned 36 kb, 139 kb, and 22 kb in MECP2, KANSL1, and UBE2G1, respectively.
Discussion
We compared HIV-1 integration sites patterns between macrophages and CD4+ T cells derived from seven ART-treated HIV-1-infected individuals whose cells were infected ex vivo with autologous, as well as heterologous, primary HIV-1 isolates generated during the acute phase of infection. This unique experimental set-up allowed us to replicate the in vivo situation for the examination of HIV-1 integration site patterns arising from the two major cellular targets of HIV-1 and potentially among different HIV-1 isolates.
The various genetic features at HIV-1 integration sites were similar among different HIV-1 isolates, regardless of whether the virus was autologous or heterologous. However, given that the gene clusters consistently overlapped to a higher extent in MDMs, the repertoire of genes accessible to HIV-1 for integration in MDMs appeared to be more restricted than and distinct from that of CD4+ T cells. This distinction between the two cell types was again reflected in the different frequencies of HIV-1-hosting genes in two gene subsets: genes encoding HIV-1-interacting proteins and those linked to clonal expansion of latently HIV-1-infected CD4+ T cells. Firstly, we observed an over-representation of HIV-1 integration events in the former gene subset in both MDMs and CD4+ T cells, but to different extents. Despite the heterogeneity of the NCBI HIV-1 Human Interaction Database29, its representativeness and specificity were demonstrated by the substantially lower frequency of HIV-1-interacting genes in random and HTLV-1 data sets. Secondly, HIV-1 integration into genes linked to clonal expansion of latently HIV-1-infected CD4+ T cells was over-represented in gene hotspots in CD4+ T cells but not in MDMs.
The gene cluster distribution of our CD4+ T cell data set was consistent with that derived from the study of Wagner et al.27, whereas such consistency was somewhat lacking between our MDM data sets with that derived from the study of Barr et al.18. Comparison of HIV-1 integration sites between MDMs and CD4+ T cells derived from the same experimental settings is important, as evident from the absolute percentages of the top 40 DAVID gene cluster overlaps; MDMs from the study of Barr et al.18 and CD4+ T cells from the study of Wagner et al.27 had the same absolute values when compared to MDMs from this study. Nonetheless, the trend of gene cluster overlaps was consistent across all data sets that were derived from the same experimental settings, i.e. gene cluster overlaps were the highest between MDM data sets, followed by between MDM and CD4+ T cell data sets, and the lowest was observed between CD4+ T cell data sets.
We focussed on gene hotspots that were identified at least four times in both cell types, i.e. MECP2, KANSL1, and UBE2G1. The products of these genes are involved in transcriptional and post-translational regulations of genes35,36,37. Interestingly, downregulation of MECP2 has been reported to enhance replication of HIV-1NL4-3 in Jurkat cells38. Although it was unclear if these gene hotspots, as well as genes encoding HIV-1-interacting proteins were highly accessible prior to or upon HIV-1 infection, it is conceivable that they were transcriptionally active enough to allow integration14,15,18,31 and/or located in the vicinity of the nuclear pore, close to the incoming HIV-1 DNA34. The model of Marini et al.34 describes that specific chromatin organization in the nucleus results in certain genes being closer to the nuclear pore, thus recurrently targeted by HIV-1 for integration. This model could be envisaged to be applicable down to the levels of genes, in which certain intragenic regions might be closer and more exposed to the incoming HIV-1 DNA. This would explain why HIV-1 proviruses that were found in the three gene hotspots preferred integration into the same introns in most cases.
The effect of T122I polymorphism on HIV-1 nucleotide selection specificity has not been reported previously. This natural and the second most abundant amino acid polymorphism at position 122 consistently altered the nucleotide selection specificity of HIV-1, demonstrating its role in integration site targeting. The effect of S119P polymorphism on HIV-1 nucleotide selection at the site of integration in cell lines has been described by Demeulemeester et al.23, and our observation in primary cells was consistent with theirs. Certain amino acid polymorphisms of the HIV-1 integrase, especially at position 119, have been associated with rapid disease progression23. However, association of natural integrase variants with disease progression could not be examined in our study participants because all of them have been successfully treated. Although these HIV-1 integrase polymorphisms altered the nucleotide selection specificity at integration sites, they did not have an effect on various genetic features at HIV-1 integration sites23. It is likely that the retargeting of HIV-1 was restricted by the repertoire of accessible genes in MDMs and CD4+ T cells.
In summary, we demonstrated in various analyses using our own data sets and of others that the repertoires of HIV-1-hosting genes targeted by HIV-1 in MDMs and CD4+ T cells were distinct in terms of their structural and functional classes and relative sizes. These cell type-dependent HIV-1 integration site patterns might account for the differences in the life cycle of HIV-1 in these cell populations.
Methods
HIV-1-infected study participants and HIV-1 negative donors
HIV-1-infected individuals are enrolled in the Zurich Primary HIV Infection (ZPHI) study, which is a non-randomized, observational, open-label, and single-centre study (www.clinicaltrials.gov; ID: NCT00537966; registered 25th September 2007)20. For the current sub-study, the inclusion criteria were: (i) HIV-1-infected individuals had at least 1.5 years of viral suppression of <20 HIV-1 RNA copies/mL plasma with antiretroviral therapy (ART) at the time of cell sample collection, and (ii) their primary HIV-1 isolates were capable of replication in monocyte-derived macrophages (MDMs).
Ethics, consent, and permission
This study was approved by the Ethics Committee of the University Hospital Zurich and written informed consent was obtained from all HIV-1-infected individuals and HIV-1 negative donors. All experiments were performed in accordance with the relevant guidelines and regulations.
Generation and sequencing of primary HIV-1 isolates and clonal HIV-1 strains
The generation of primary HIV-1 isolates has been described elsewhere39. Briefly, CD4+ T cells from HIV-1-infected individuals in the acute phase of infection were co-cultured with activated donors’ peripheral blood mononuclear cells (PBMCs) until the HIV-1 p24 levels of the culture supernatant was >2,000 pg/mL prior to harvesting of the viral supernatant. HIV-1 p24 levels were measured with an enzyme-linked immunosorbent assay (ELISA)40. The 50% tissue culture infectious dose (TCID50) of the primary HIV-1 isolates was determined by co-culturing quadruplicates of 5-fold dilutions of primary HIV-1 isolates and scoring the number of p24-positive wells, and the TCID50 was subsequently calculated using the method of Reed and Muench41.
HIV-1JR-FL42 and HIV-1NL4-343 were generated by transfecting HEK 293T cells with pJR-FL and pNL4-3, respectively, and the viral supernatant was harvested and cleared off of cells with a 0.22 μm filter 48 hours post-transfection. pJR-FL42 was obtained from Irvin SY Chen and Yoshio Koyanagi, and pNL4-343 was obtained from Malcolm Martin through the NIH AIDS Reagent Programme, Division of AIDS, NIAID, NIH.
We had previously developed a sensitive and robust method to amplify and sequence the full-length genomes of primary HIV-1 isolates with minimal artefactual in vitro recombination44. The seven selected primary HIV-1 isolates were all confirmed to be of subtype B as determined by their pol nucleotide sequences using the REGA HIV-1 subtyping tool V345.
Ex vivo HIV-1 infection of monocyte-derived macrophages and CD4+ T cells
Blood samples of ART-treated HIV-1-infected individuals were first depleted of CD8+ T cells using RosetteSep Human CD8 Depletion Cocktail (Stemcell Technologies). CD8− PBMCs were isolated by density gradient centrifugation using Lymphoprep (Stemcell Technologies). CD14+ monocytes were positively isolated by magnetic-activated cell sorting using CD14 microbeads (Miltenyi Biotec) whereas the remaining CD8−CD14− fraction was treated functionally as CD4+ T cells. The median purities of CD14+ monocytes and CD4+ T cells were 95.4% and 98.8%, respectively, as measured by flow cytometry. CD14+ cells were induced to differentiate into MDMs in RPMI-1640 (10% (v/v) fetal bovine serum (FBS), 100 U/mL Penicillin, and 100 μg/mL Streptomycin) supplemented with 5% (v/v) human serum (Sigma) and 20 ng/mL macrophage-colony stimulating factor (M-CSF) (Peprotech) for eight days. CD4+ T cells were activated in RPMI-1640 (10% (v/v) FBS, 100 U/mL Penicillin, and 100 μg/mL Streptomycin) supplemented with 10 U/mL IL-2 (Roche) and cultured in OKT3-coated flasks for two days. MDMs and activated CD4+ T cells were infected ex vivo with primary HIV-1 isolates or clonal strains (HIV-1JR-FL or HIV-1NL4-3) at an MOI of 0.74–1.0 for six days and 0.01 for two days, respectively.
In the screening of macrophage-tropic primary HIV-1 isolates, HIV-1 negative donors’ blood samples were used and treated essentially the same way to obtain MDMs and CD4+ T cells, except that the isolated CD4+ T cells were divided into three fractions, activated individually with 5 U/ml phytohaemagglutinin (PHA), 0.5 U/mL PHA, and OKT3, and pooled two days later prior to HIV-1 infection ex vivo.
Amplification and sequencing of 5′ HIV-1 integration junctions
The 5′ HIV-1 integration junctions were amplified with the non-restrictive linear amplification-mediated PCR (nrLAM-PCR)21 from a total of 0.2–1.0 μg of genomic DNA extracted from cells infected ex vivo with HIV-1 isolates. The procedure was essentially the same as described elsewhere21, except: (i) the 3′ primers used for linear, first exponential, and second exponential amplifications were 5′CCCTGGTGTGTAGTTCTGCC3′, 5′CAATCAGGGAAGTAGCCTTGTG3′, and 5′TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGN0–3 TGTGGTAGACCCACAGATCAAG3′, respectively, (ii) the 5′ primer used for second exponential amplification was 5′GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGN0–3GATCTGAATTCAGTGGCACAG3′, (iii) the cycling conditions for linear amplification were 2× (95 °C for 2 min followed by 50 cycles of 95 °C for 45 s, 57 °C for 45 s, and 72 °C for 8 s) and (iv) The cycling conditions for first and second exponential amplifications were 95 °C for 2 min followed by 40 cycles of 95 °C for 45 s, 57 °C for 45 s, and 72 °C for 8 s, and a final extension at 72 °C for 2 min. The HIV-1 5′ LTR-specific primers were designed based on the most conserved regions of the consensus sequence of HIV-1 subtype B in the Los Alamos HIV Sequence Database. The N0–3 random nucleotides in the primers used for the second exponential amplification were introduced to increase nucleotide diversity to enable DNA cluster generation during DNA sequencing with the Illumina MiSeq platform.
5′ HIV-1 integration junctions were sequenced with the Illumina MiSeq platform. Illumina sequencing adaptors were added to 1 ng of purified PCR products in a PCR using the following primers: 5′AATGATACGGCGACCACCGAGATCTACAC(Index)TCGTCGGCAGCGTC3′ and 5′CAAGCAGAAGACGGCATACGAGAT(Index)GTCTCGTGGGCTCGG3′ at 0.5 μM each. The cycling conditions were 72 °C for 3 min, 95 °C for 30 s, 12 cycles of 95 °C for 10 s, 55 °C for 30 s, and 72 °C for 30 s, and a final extension at 72 °C for 5 min. A total of 12–14 pM of purified PCR products were sequenced with the MiSeq Reagent Kit v2 (300 or 500 cycles) with 8% PhiX.
Post-sequencing processing and mapping to the human genome assembly
Sequencing reads were filtered to remove low quality reads. True 5′ HIV-1 integration junctions were identified using an in-house bioinformatic pipeline, i.e. Integration Site Analysis Pipeline (InStAP), and satisfied the following criteria: (i) presence of both the nrLAM-PCR adaptor and 5′ HIV-1 LTR, (ii) host DNA was within 3 nucleotides from the end of 5′ HIV-1 LTR, and (iii) ≥85% of the lengths of the reads mapped to the human assembly GRCh37.p13 with ≥98% identity. Contaminants were accounted for and removed from all samples in which they were identified except the sample that was sequenced first and/or with the highest number of read count. Manuscript on the details of InStAP is in preparation.
HIV-1 integration site data sets derived from this study and obtained elsewhere for comparison are shown in Table 2. All HIV-1 integration sites derived from this study are listed in Supplementary Table S4.
Bioinformatic databases
Annotated features of HIV-1-hosting genes were assessed using the Database for Annotation, Visualization and Integrated Discovery (DAVID)25,26. DAVID groups genes, which are related by their structural and functional annotations, into clusters and calculates an enrichment score to indicate if the gene clusters were significantly over-represented in a particular genomic background. Clusters of HIV-1-hosting genes that were significantly over-represented in the human genome are defined as having a DAVID enrichment score >1.3 with the stringency level in DAVID set to high. By definition, a gene cluster having a DAVID enrichment score >1.3 was equivalent to an overall geometric mean <0.05 for all its gene members, and a high stringency would consider a gene cluster as biologically significant only when the kappa coefficient, κ ≥ 0.85 for at least three genes with at least three related annotations between any two.
The National Centre for Biotechnology Information (NCBI) catalogues and provides access to a database of published interactions between HIV-1 proteins and its cellular host29. We used the November 2014 release (the latest at time of analysis) of the HIV-1 Human Interaction Database, which documented a total of 3,337 unique RefSeq protein-coding genes whose products interact physically, as well as indirectly, with HIV-1. This number of genes represented ~16.15% of all unique RefSeq protein-coding genes in the human genome (20,664 as of 15th of December 2014).
Statistical analyses
Two-tailed Fisher’s exact test, χ2 test, and Mann-Whitney U test with a 95% confidence interval were used at where indicated.
Graphical plots
Some of the graphical plots were generated using the following online platforms: Plotly (https://plot.ly/), WebLogo46,47, and Circos48.
Additional Information
How to cite this article: Kok, Y. L. et al. Monocyte-derived macrophages exhibit distinct and more restricted HIV-1 integration site repertoire than CD4+ T cells. Sci. Rep. 6, 24157; doi: 10.1038/srep24157 (2016).
References
Dalgleish, A. G. et al. The CD4 (T4) antigen is an essential component of the receptor for the AIDS retrovirus. Nature 312, 763–767 (1984).
Klatzmann, D. et al. T-lymphocyte T4 molecule behaves as the receptor for human retrovirus LAV. Nature 312, 767–768 (1984).
Koenig, S. et al. Detection of AIDS virus in macrophages in brain tissue from AIDS patients with encephalopathy. Science (New York, N.Y.) 233, 1089–1093 (1986).
Maddon, P. J. et al. The T4 gene encodes the AIDS virus receptor and is expressed in the immune system and the brain. Cell 47, 333–348 (1986).
Alkhatib, G. et al. CC CKR5: a RANTES, MIP-1alpha, MIP-1beta receptor as a fusion cofactor for macrophage-tropic HIV-1. Science (New York, N.Y.) 272, 1955–1958 (1996).
Deng, H. et al. Identification of a major co-receptor for primary isolates of HIV-1. Nature 381, 661–666, doi: 10.1038/381661a0 (1996).
Raposo, G. et al. Human macrophages accumulate HIV-1 particles in MHC II compartments. Traffic 3, 718–729 (2002).
Pelchen-Matthews, A., Kramer, B. & Marsh, M. Infectious HIV-1 assembles in late endosomes in primary macrophages. The Journal of cell biology 162, 443–455, doi: 10.1083/jcb.200304008 (2003).
Stoler, M. H., Eskin, T. A., Benn, S., Angerer, R. C. & Angerer, L. M. Human T-cell lymphotropic virus type III infection of the central nervous system. A preliminary in situ analysis. Jama 256, 2360–2364 (1986).
Brinkmann, R. et al. Human immunodeficiency virus infection in microglia: correlation between cells infected in the brain and cells cultured from infectious brain tissue. Annals of neurology 31, 361–365, doi: 10.1002/ana.410310403 (1992).
Bagasra, O. et al. Cellular reservoirs of HIV-1 in the central nervous system of infected individuals: identification by the combination of in situ polymerase chain reaction and immunohistochemistry. AIDS (London, England) 10, 573–585 (1996).
Koppensteiner, H., Brack-Werner, R. & Schindler, M. Macrophages and their relevance in Human Immunodeficiency Virus Type I infection. Retrovirology 9, 82, doi: 10.1186/1742-4690-9-82 (2012).
Carteau, S., Hoffmann, C. & Bushman, F. Chromosome structure and human immunodeficiency virus type 1 cDNA integration: centromeric alphoid repeats are a disfavored target. Journal of virology 72, 4005–4014 (1998).
Schroder, A. R. et al. HIV-1 integration in the human genome favors active genes and local hotspots. Cell 110, 521–529 (2002).
Wang, G. P., Ciuffi, A., Leipzig, J., Berry, C. C. & Bushman, F. D. HIV integration site selection: analysis by massively parallel pyrosequencing reveals association with epigenetic modifications. Genome research 17, 1186–1194, doi: 10.1101/gr.6286907 (2007).
Stevens, S. W. & Griffith, J. D. Human immunodeficiency virus type 1 may preferentially integrate into chromatin occupied by L1Hs repetitive elements. Proceedings of the National Academy of Sciences of the United States of America 91, 5557–5561 (1994).
Stevens, S. W. & Griffith, J. D. Sequence analysis of the human DNA flanking sites of human immunodeficiency virus type 1 integration. Journal of virology 70, 6459–6462 (1996).
Barr, S. D. et al. HIV integration site selection: targeting in macrophages and the effects of different routes of viral entry. Molecular therapy : the journal of the American Society of Gene Therapy 14, 218–225, doi: 10.1016/j.ymthe.2006.03.012 (2006).
Wellensiek, B. P. et al. Differential HIV-1 integration targets more actively transcribed host genes in neonatal than adult blood mononuclear cells. Virology 385, 28–38, doi: 10.1016/j.virol.2008.10.052 (2009).
Rieder, P. et al. Characterization of human immunodeficiency virus type 1 (HIV-1) diversity and tropism in 145 patients with primary HIV-1 infection. Clinical infectious diseases : an official publication of the Infectious Diseases Society of America 53, 1271–1279, doi: 10.1093/cid/cir725 (2011).
Paruzynski, A. et al. Genome-wide high-throughput integrome analyses by nrLAM-PCR and next-generation sequencing. Nature protocols 5, 1379–1395, doi: 10.1038/nprot.2010.87 (2010).
Han, Y. et al. Resting CD4+ T cells from human immunodeficiency virus type 1 (HIV-1)-infected individuals carry integrated HIV-1 genomes within actively transcribed host genes. Journal of virology 78, 6122–6133, doi: 10.1128/jvi.78.12.6122-6133.2004 (2004).
Demeulemeester, J. et al. HIV-1 integrase variants retarget viral integration and are associated with disease progression in a chronic infection cohort. Cell host & microbe 16, 651–662, doi: 10.1016/j.chom.2014.09.016 (2014).
Serrao, E. et al. Integrase residues that determine nucleotide preferences at sites of HIV-1 integration: implications for the mechanism of target DNA binding. Nucleic acids research 42, 5164–5176, doi: 10.1093/nar/gku136 (2014).
Huang da, W., Sherman, B. T. & Lempicki, R. A. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic acids research 37, 1–13, doi: 10.1093/nar/gkn923 (2009).
Huang da, W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature protocols 4, 44–57, doi: 10.1038/nprot.2008.211 (2009).
Wagner, T. A. et al. HIV latency. Proliferation of cells with HIV integrated into cancer genes contributes to persistent infection. Science (New York, N.Y.) 345, 570–573, doi: 10.1126/science.1256304 (2014).
Meekings, K. N., Leipzig, J., Bushman, F. D., Taylor, G. P. & Bangham, C. R. HTLV-1 integration into transcriptionally active genomic regions is associated with proviral expression and with HAM/TSP. PLoS pathogens 4, e1000027, doi: 10.1371/journal.ppat.1000027 (2008).
Ako-Adjei, D. et al. HIV-1, human interaction database: current status and new features. Nucleic acids research 43, D566–570, doi: 10.1093/nar/gku1126 (2015).
Maldarelli, F. et al. HIV latency. Specific HIV integration sites are linked to clonal expansion and persistence of infected cells. Science (New York, N.Y.) 345, 179–183, doi: 10.1126/science.1254194 (2014).
Lewinski, M. K. et al. Genome-wide analysis of chromosomal features repressing human immunodeficiency virus transcription. Journal of virology 79, 6610–6619, doi: 10.1128/jvi.79.11.6610-6619.2005 (2005).
Ikeda, T., Shibata, J., Yoshimura, K., Koito, A. & Matsushita, S. Recurrent HIV-1 integration at the BACH2 locus in resting CD4+ T cell populations during effective highly active antiretroviral therapy. The Journal of infectious diseases 195, 716–725, doi: 10.1086/510915 (2007).
Brady, T. et al. HIV integration site distributions in resting and activated CD4+ T cells infected in culture. AIDS (London, England) 23, 1461–1471, doi: 10.1097/QAD.0b013e32832caf28 (2009).
Marini, B. et al. Nuclear architecture dictates HIV-1 integration site selection. Nature, doi: 10.1038/nature14226 (2015).
Nan, X., Campoy, F. J. & Bird, A. MeCP2 is a transcriptional repressor with abundant binding sites in genomic chromatin. Cell 88, 471–481 (1997).
Li, X., Wu, L., Corsa, C. A., Kunkel, S. & Dou, Y. Two mammalian MOF complexes regulate transcription activation by distinct mechanisms. Molecular cell 36, 290–301, doi: 10.1016/j.molcel.2009.07.031 (2009).
Watanabe, T. K. et al. Molecular cloning of UBE2G, encoding a human skeletal muscle-specific ubiquitin-conjugating enzyme homologous to UBC7 of C. elegans. Cytogenetics and cell genetics 74, 146–148 (1996).
Chiang, K., Liu, H. & Rice, A. P. miR-132 enhances HIV-1 replication. Virology 438, 1–4, doi: 10.1016/j.virol.2012.12.016 (2013).
Trkola, A. et al. Delay of HIV-1 rebound after cessation of antiretroviral therapy through passive transfer of human neutralizing antibodies. Nature medicine 11, 615–622, doi: 10.1038/nm1244 (2005).
Moore, J. P., McKeating, J. A., Weiss, R. A. & Sattentau, Q. J. Dissociation of gp120 from HIV-1 virions induced by soluble CD4. Science (New York, N.Y.) 250, 1139–1142 (1990).
Reed, L. J. & Muench, H. A simple method of estimating fifty per cent endpoints. Am. J. Epidemiol. 27, 493–497 (1938).
Koyanagi, Y. et al. Dual infection of the central nervous system by AIDS viruses with distinct cellular tropisms. Science (New York, N.Y.) 236, 819–822 (1987).
Adachi, A. et al. Production of acquired immunodeficiency syndrome-associated retrovirus in human and nonhuman cells transfected with an infectious molecular clone. Journal of virology 59, 284–291 (1986).
Giallonardo, F. D. et al. Full-length haplotype reconstruction to infer the structure of heterogeneous virus populations. Nucleic acids research 42, e115, doi: 10.1093/nar/gku537 (2014).
Pineda-Pena, A. C. et al. Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: performance evaluation of the new REGA version 3 and seven other tools. Infect Genet Evol 19, 337–348, doi: 10.1016/j.meegid.2013.04.032 (2013).
Schneider, T. D. & Stephens, R. M. Sequence logos: a new way to display consensus sequences. Nucleic acids research 18, 6097–6100 (1990).
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome research 14, 1188–1190, doi: 10.1101/gr.849004 (2004).
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome research 19, 1639–1645, doi: 10.1101/gr.092759.109 (2009).
Acknowledgements
We are grateful to individuals enrolled in the ZPHI study for their commitment and biological samples. We thank Dominique Braun and Christina Grube (USZ, Zurich, Switzerland) for patient care, Fabienne-Desirée Geissberger (UZH, Zurich, Switzerland) for technical assistance, Frederic Bushman and Nirav Malani (UPenn, Pennsylvania, USA), Anne Arens (Qiagen, Schwetzingen, Germany), and Kim D. Pruitt (NCBI, NLM, NIH, Maryland, USA) for their assistance in InSPiRD, HISAP, and NCBI databases, respectively. The following reagents were obtained through the NIH AIDS Reagent Programme, Division of AIDS, NIAID, NIH: pJR-FL from Irvin SY Chen and Yoshio Koyanagi and pNL4-3 from Malcolm Martin. This study has been financed by SNF grant 310030_141067 (to K.J.M. and H.F.G.), University of Zurich’s clinical research priority programme: viral infectious diseases – ZPHI (to H.F.G.), and an unrestricted research grant from the Vontobel Stiftung (to K.J.M. and H.F.G.).
Author information
Authors and Affiliations
Contributions
Y.L.K. and V.V. designed the study, performed the experiments, interpreted the data, and wrote the manuscript. M.S., V.V. and R.K. developed InStAP. FDG provided the HIV-1 integrase sequences of primary HIV-1 isolates. H.K. performed the experiment on the reactivation of in vivo HIV-1 from ART-treated HIV-1-infected individuals’ CD4+ T cells. H.F.G. participated in the design of the study. K.J.M. designed the study and edited the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Kok, Y., Vongrad, V., Shilaih, M. et al. Monocyte-derived macrophages exhibit distinct and more restricted HIV-1 integration site repertoire than CD4+ T cells. Sci Rep 6, 24157 (2016). https://doi.org/10.1038/srep24157
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep24157
This article is cited by
-
Spatially clustered loci with multiple enhancers are frequent targets of HIV-1 integration
Nature Communications (2019)
-
Spontaneous reactivation of latent HIV-1 promoters is linked to the cell cycle as revealed by a genetic-insulators-containing dual-fluorescence HIV-1-based vector
Scientific Reports (2018)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.