A source of the single-stranded DNA substrate for activation-induced deaminase during somatic hypermutation

Wang, Xiaohua; Fan, Manxia; Kalis, Susan; Wei, Lirong; Scharff, Matthew D.

doi:10.1038/ncomms5137

Article
Published: 13 June 2014

A source of the single-stranded DNA substrate for activation-induced deaminase during somatic hypermutation

Xiaohua Wang¹,
Manxia Fan¹,
Susan Kalis¹,
Lirong Wei¹ &
…
Matthew D. Scharff¹

Nature Communications volume 5, Article number: 4137 (2014) Cite this article

2575 Accesses
25 Citations
Metrics details

Subjects

Abstract

During somatic hypermutation (SHM), activation-induced deaminase (AID) mutates deoxycytidine on single-stranded DNA (ssDNA) generated by the transcription machinery, but the detailed mechanism remains unclear. Here we report a higher abundance of RNA polymerase II (Pol II) at the immunoglobulin heavy-chain variable (Igh-V) region compared with the constant region and partially transcribed Igh RNAs, suggesting a slower Pol II progression at Igh-V that could result in some early/premature transcription termination after prolonged pausing/stalling of Pol II. Knocking down RNA–exosome complexes, which could decrease premature transcription termination, leads to decreased SHM. Knocking down Spt5, which can augment premature transcription termination, leads to increase in both, SHM and the abundance of ssDNA substrates. Collectively, our data support the model that, following the reduction of Pol II progression (pausing or stalling) at the Igh-V, additional steps such as premature transcription termination are involved in providing ssDNA substrates for AID during SHM.

You have full access to this article via your institution.

Download PDF

DDX41 coordinates RNA splicing and transcriptional elongation to prevent DNA replication stress in hematopoietic cells

Article Open access 14 October 2022

Satoru Shinriki, Mayumi Hirayama, … Hirotaka Matsui

BRCA1 deficiency specific base substitution mutagenesis is dependent on translesion synthesis and regulated by 53BP1

Article Open access 11 January 2022

Dan Chen, Judit Z. Gervai, … Dávid Szüts

Accelerated DNA replication fork speed due to loss of R-loops in myelodysplastic syndromes with SF3B1 mutation

Article Open access 08 April 2024

David Rombaut, Carine Lefèvre, … Michaela Fontenay

Introduction

While antibodies are responsible for the humoral immune response, those antibodies encoded by germline sequences often do not have sufficient affinity or specificity to provide full protection against diverse pathogens. To deal with this, B cells have developed mechanisms to somatically hypermutate (SHM) the V regions of their germline encoded heavy- and light-chain antibody genes. Most mutations are accumulated in the complementarity-determining regions of antibodies to refine their antigen-contacting surfaces. B cells also diversify antibody function by carrying out class switch recombination (CSR), which allows different antigen binding sites to be expressed with different constant regions. Together, SHM and CSR provide us with antibodies that can interact with all antigens, are distributed throughout the body and mediate effective humoral immune protection¹.

Both SHM and CSR require the endogenous mutagenic enzyme activation-induced deaminase (AID)^2,3. As a result of intensive study over the last 13 years, it is now known that AID: (1) deaminates dC to dU and creates point mutations, abasic sites and G:U mismatches⁴; (2) recruits error-prone DNA repair processes that further contribute to the diversification of mutation spectra by introducing mutations at A/T sites⁵; (3) requires transcription of the targeted genomic region during both SHM and CSR⁶; and (4) utilizes single-stranded DNA (ssDNA) as its biochemical substrate⁷. The fact that AID binds and mutates only ssDNA substrates is puzzling because ssDNA is a transient and short-lived form of DNA that is mostly covered by protein complexes during transcription or replication. It is thus unclear exactly how ssDNA is generated and made accessible for AID during SHM or CSR and whether the two different processes share the same mechanism(s). When carrying out SHM and CSR in B cells, AID also occasionally introduces mutations at genomic loci other than antibody genes and these ‘off-target’ mutations are responsible for many B-cell malignancies⁸.

Since transcription is necessary for SHM and CSR, AID-mediated mutagenesis must be coordinated with transcriptional machinery⁹. In fact, polymerase II (Pol II)¹⁰, Spt5¹¹ and RNA polymerase associated factor (PAF)¹² have all been reported to interact with AID directly and to affect CSR. However, even though transcription is always accompanied by ssDNA around the synthesis centre of the Pol II complex—the structure referred to as a transcription bubble—the conventional transcription cycle of initiation, elongation and termination is unlikely to supply ssDNA substrates for AID for the following reasons. First, recent structural studies suggest that ssDNA created in the transcription bubble is almost fully covered by either the Pol II complex itself or cofactors like the DRB sensitivity-inducing factor (DSIF, composed of Spt4 and Spt5 proteins)¹³. Such coverage is probably present throughout elongation and even during transcription termination triggered by poly-A signals¹⁴. Second, whereas transcription initiation involves TFIIH-mediated dsDNA melting at promoter regions before a complete Pol II complex is assembled¹⁵, AID may be excluded from those regions because a high density of transcription factors normally cover promoters. This is consistent with the fact that AID preferentially targets the immunoglobulin heavy-chain variable (Igh-V) coding region but spares regions that are within ~200 bp immediately after the transcription start site (TSS)^4,16,17.

Nevertheless, several processes accompanying or resulting from transcription have been proposed to help expose ssDNA for AID targeting. These processes include the following: (1) R-loops resulting from special features in the DNA sequence at Igh switch regions but not Igh-V regions^18,19; (2) negative DNA supercoils occurring at the trailing end of transcription bubbles²⁰; and (3) paused Pol II complexes that are frequently present proximal to the TSS regions²¹. It is noteworthy that pausing only represents one of the three possible states of stalled Pol II complexes and the other two scenarios include backtracking Pol II due to transcription errors and early terminating Pol II due to the failure of error correction²². While DNA supercoils generate AID accessible regions on dsDNA plasmids in vitro²³ and paused Pol II complexes play essential roles in CSR^11,24,25, the source(s) of ssDNA substrates for AID during SHM of the V region in vivo remains to be established.

Premature transcription termination refers to the process when Pol II is released in the middle of a transcribing gene independent of a poly-A termination signal. It is a plausible way of exposing ssDNA around the Pol II catalytic centre for two reasons. First, during premature transcriptional termination, the release of Pol II complex from template DNA may not be coordinated with the transcriptional termination complexes, which could leave the melted dsDNA unprotected. Second, partial transcripts resulting from premature transcription termination negatively regulate Pol II progression specifically around the termination site²⁶, which could in turn maintain ssDNA levels at V regions. While detailed mechanisms remain unclear, it is well established that DSIF complex component Spt5 is essential for Pol II processivity since the loss of Spt5 leads to an increase in premature transcription termination in yeast and in mammalian cells^27,28,29. On the other hand, a sustained level of existing premature transcription termination requires the RNA exosome through a newly identified feedback mechanism²⁶. Based on these considerations, we tested the role of premature transcription termination in SHM by manipulating the levels of Spt5 and the RNA exosome in the mutating human B-cell line, Ramos. Our data suggest that, in addition to Pol II pausing, there is a second set of events like the premature transcription termination process, which B cells may indeed ‘hijack’ to supply ssDNA substrates for AID to mutate at immunoglobulin V regions during SHM.

Results

Reduced progression of Pol II at Igh-V region in Ramos cells

Since Pol II abundance at specific regions of a transcribing gene is widely used to assess local progression efficiency, we first examined the distribution of Pol II complexes at the actively transcribing Igh-V locus in Ramos cells. The human Burkitt’s lymphoma-derived Ramos cell line has many characteristics of a germinal centre B cell including the constitutive capacity to undergo SHM and a pattern of AID-induced mutations at G:C sites that is similar to what has been observed in vivo³⁰. However, analysis of Pol II abundance in wild type Ramos cells is challenging because the size of the hypermutating V region (~400 bp) is at the resolution limit of the chromatin immunoprecipitation (ChIP) analysis. To circumvent this problem, we expanded the AID-targeting region in Ramos by introducing an mCherry fluorescent protein coding sequence into the second exon of the Igh-V locus. Using recombinase-mediated cassette exchange³¹, we replaced the endogenous Ramos V region with an in-frame fusion of the mCherry gene and the endogenous V region (Fig. 1a) to provide an ~1.2 kb target for AID as well as a reporter for AID-induced mutations. We confirmed that the entire fusion V exon was able to be targeted by AID-mediated SHM (see below). Using the ChIP assay, we detected on the average ~2–3 times more Pol II at the second exon of Igh-V—the 1.2 kb hypermutating mCherry/Igh-V fusion region—than at other areas of the gene, that is, the promoter, the downstream intronic enhancer Eμ and the constant region (Fig. 1b). The relatively higher abundance of Pol II was quite consistent among the different parts of the mCherry/Igh-V fusion (c–f) compared with the relatively lower abundance within the downstream Eμ-Cμ region (h–k) (Fig. 1c). This significant but moderate increase in Pol II occupancy in the mCherry/Igh-V region suggested a slowing down of Pol II progression. We also conducted ChIP analysis for Pol II phosphorylated at serine 5 of the carboxy-terminal repeat domain (Pol II S5P), a form of Pol II that is enriched during transcription initiation and gradually decreases throughout elongation. We found that Pol II S5P occupancy stayed high from the promoter region until ~2.5 kb downstream of TSS (Fig. 1d). This suggested that Pol II started to transit from initiation phase to elongation phase immediately before Eμ.

**Figure 1: Abundance of Pol II complexes at Igh-V region in Ramos cells.**

Spt5 knockdown affects Pol II occupancy and ssDNA patches

Reduced progression of Pol II complexes suggested the existence of stalled Pol II at Igh-V regions. Stalled Pol II complexes could represent a paused Pol II, a backtracking Pol II or an early terminating Pol II (ref. 22). To address whether the stalled Pol II contributes to SHM and, if so, which of these three processes is responsible, we chose to manipulate the level of the DSIF complex component Spt5. This is because: (1) Spt5 mediates proximal TSS pausing of the RNA polymerase³²; and (2) it positively maintains Pol II processivity during elongation^27,28,29. Therefore, a decrease in Spt5 level is expected to reduce Pol II pausing but facilitate premature Pol II termination.

Of five short hairpin RNA (shRNA) constructs against human Spt5 (SUPT5H) tested in Ramos cells, two (#4 and 6) achieved ~50% knockdown at both messenger RNA (mRNA) and protein levels (Fig. 2a,b, Supplementary Fig. 1). Spt5 knockdown (Spt5KD) cells and cells transduced with a control-shRNA (Ctrl-shRNA) expressed similar amounts of steady-state IgH and AID mRNA (Fig. 2c). We also found that, as expected, Spt5 abundance at the IgH region was consistently decreased in Spt5KD cells and, to a more variable extent, the total Pol II complex abundance decreased accordingly in two independent experiments (Fig. 2d).

**Figure 2: Decrease in the cellular Spt5 level is associated with more ssDNA patches.**

Transient patches of ssDNA in chromatin can be detected by treating cross-linked nuclei with bisulphite reagents under non-denaturing conditions^33,34. In this assay, the bisulphite treatment only converts dCs on exposed ssDNA to dUs but cannot convert dCs that are either hybridized with other nucleic acids (dsDNA or RNA:DNA hybrids) or covered by proteins. Using this bisulphite method, we found significantly more non-protected ssDNA patches at the Igh-V region in Spt5KD cells than in Ctrl-shRNA-transduced cells (Fig. 2e), suggesting that the decrease in cellular Spt5 levels was associated with more potential AID substrates.

Spt5 knockdown is associated with an increase in SHM

We first estimated the change of SHM rate on Spt5 reduction in an IgM⁻ Ramos subclone that expressed endogenous levels of AID and had a nonsense mutation in the native V region. We used the IgM gene reversion assay established previously³⁵ (see Methods) and found that the ~50% reduction in the Spt5 level was accompanied by a significant increase in the SHM rate (5.46 × 10⁻⁵ versus 7.52 × 10⁻⁵ per nucleotide per generation (P<0.05)) (Fig. 3a).

**Figure 3: Decrease in cellular Spt5 leads to an increase in SHM.**

We then used an independent Ramos cell line (‘reporter line’ described in Fig. 1a and the Methods) that carries the mCherry/VH4–34 fusion at Igh-V locus and expresses AID–ER fusion protein to confirm the effect of knocking down Spt5 on SHM. With this reporter line, the mutation rate can then be quantitatively assessed based on the percentage of cells that lose their fluorescence on 4-hydroxy-tamoxifen (4-OHT) induction of the nuclear localization of AID³⁶. Consistent with the reversion assay, ~50–60% more cells lost their fluorescence under the Spt5KD condition than the control cells after 7 days of induction (Fig. 3b). While the SHM level reached a plateau with the induction concentration of ~0.25 μM 4-OHT in both Spt5KD and control cells, Spt5KD cells mutated the Igh-V region more efficiently at a lower concentration of inducer (0.0625 μM) than control cells did at the maximum induction level (Fig. 3c). These data indicated that the increased level of SHM in Spt5KD cells was not due to the excessive level of active AID molecules in nucleus. Because of the significant increase in the experimental efficiency using the reporter line instead of the reversion assay in analysing the efficiency of SHM, we chose to perform all further mutation analyses on the reporter platform.

We next used an shRNA-resistant form (see Methods) of Spt5 to rescue the effect phenotypically. The exogenous Spt5 rescue construct restored Spt5 levels in the knockdown cells close to normal levels (lanes 2 and 5 versus 6 in Fig. 3d, Supplementary Fig. 2). Consistent with the regulatory role of Spt5 on SHM, Spt5KD cells (with either the #4 or #6 shRNA construct) significantly reduced their mutation frequencies (P<0.0001) when cellular Spt5 levels were largely restored (Fig. 3e). These data confirmed that the reduced level of Spt5 was the cause of the observed increase in SHM in the Spt5-specific shRNA-transduced cells.

Since the DSIF complex is composed of Spt5 and Spt4 molecules, we also tested whether reducing the Spt4 level in cells would have an impact on SHM similar to Spt5KD. Indeed, a similar increase in SHM was observed when endogenous Spt4 level was reduced in the reporter Ramos cells (Fig. 3f). Nevertheless, the experiment needs to be interpreted with caution because Spt4 is also a stabilizing factor for Spt5 (ref. 37) and the effect of Spt4KD can at least partially be caused by the reduction in the level of general DSIF complexes.

Impact of Spt5KD on the characteristics of SHM

In the previous section, we used both the reversion of a nonsense mutation in the native Ramos V region and the loss of fluorescence of a reporter in the endogenous Igh-V locus to show that the knockdown of Spt5 resulted in a 50–60% increase in V region mutation. To provide an independent measure of the frequency of mutation per mutated Igh-V gene and to examine the characteristics of the additional mutations, we went on to examine the sequences of the Igh-V fusion genes in mutated Ramos reporter cells by Sanger sequencing. Since in Ramos cells only a small percent of the V regions undergo mutation, we sequenced only those cells that mutated their Igh-V gene sufficiently to cause a loss of fluorescence. Most of the mutations in both the control and the knockdown cells were in AID hot spots. Reduction of Spt5 did not cause a statistically significant change in the distribution of mutations in either the mCherry or the 4–34 part of the fusion V region (Fig. 4a). However, the average number of mutations harboured by each mutated Igh-V gene was consistently (but not statistically significantly) increased by about 25% based on three independent experiments (Fig. 4b). This provides independent evidence that a decrease in Spt5 leads to not only more V regions being targeted for mutation but also more mutations accumulating in individual mutated Igh-V genes. While these are two different ways of quantifying mutation, the overall impact was estimated by multiplying the ~1.6-fold increase in frequency (based on the fluorescence reporter assay) by the 1.25 increase in mutations per V region, which resulted in a approximately twofold overall increase of AID-induced mutations due to the knockdown of Spt5. The magnitude of this increase is similar to what can be achieved by artificially introducing a termination signal at the immunoglobulin light chain V region in chicken DT40 cells³⁸. Spt5 levels did not influence the strand distribution of mutations (mutation preference on the template or the non-template strand) because the incidence of mutations from C or from G was similar (~50%) in both Spt5KD and Ctrl-shRNA-transduced cells (Fig. 4c). Interestingly, the frequency of G:C transversions in Spt5KD cells was consistently and significantly higher than control cells (Fig. 4d), suggesting that Spt5 might suppress base excision repair or the lack of Spt5 might expose docking sites that facilitate the recruitment of base excision repair machinery. Nevertheless, a difference of UNG recruitment is not likely to be the main reason for our observed increase in SHM upon Spt5 reduction because: (1) UNG mediates only one of the two pathways in the second phase of SHM and accounts for both transversion mutations and error-free repair⁴; (2) UNG promotes both SHM and CSR while reduction of Spt5 leads to opposite effects on SHM and CSR (see Discussion); and (3) change in the availability of UNG does not influence overall mutation frequency³⁹. Hence, the increase of the overall SHM rate most likely resulted from the initial increase of the frequency of deamination. Overall, these data suggest that reduction in the Spt5 level promotes SHM in a general manner without any spatial preference.

RNA exosome and premature transcription termination in SHM

The finding that a decrease in Spt5 levels is accompanied by an increase in the rate of SHM suggested that premature transcription termination might be associated with and could contribute to SHM. To explore this possibility further, we knocked down the RNA–exosome complex because: (1) the RNA exosome helps to induce premature termination of transcription²⁶; and (2) the exosome core components facilitate SHM in a biochemical assay⁴⁰ in vitro, although this has not been established in vivo. The RNA–exosome complex is composed of two ribonuclease units and a nine sub-unit core including Rrp40 (EXOSC3)^41,42, a molecule that has a profound impact in AID-mediated mutation activity in vitro⁴⁰. Therefore, we tested the role of Rrp40 in SHM using Ramos cells.

The maximum reduction we achieved from five shRNAs against the exosome core component Rrp40 was ~30% at mRNA level (Fig. 5a). This is probably because an extensive decrease in its level is cytotoxic considering the essential role of RNA exosome for ribosomal RNA processing⁴³. However, based on data from two independent shRNA constructs, this small decrease of Rrp40 level was sufficient to reduce the frequency of SHM by ~30–40% (P<0.05) (Fig. 5b). The Rrp40 knockdown did not significantly change the overall distribution of mutations in either the mCherry or the endogenous V region (Fig. 5c). In addition, there was no increase in the numbers of mutations on the non-template strand compared with the template strand (Fig. 5d), nor did we find any change in mutation pattern resulting from the reduced Rrp40 level (Fig. 5e). This indicated that the RNA–exosome complexes could promote SHM in a general way rather than solely through the suggested nascent RNA degradation mechanism that would help to preferentially expose template ssDNA⁴⁰. Our data, however, cannot rule out the possibility that the latter mechanism may also play an important role in SHM because our limited ability to reduce RNA–exosome levels might have precluded us from observing such an effect.

Premature transcription termination likely occurs at Igh-V

Overall, our experiments confirmed the positive role of RNA exosome core component Rrp40 in optimal SHM. Together with the observed increase in SHM on Spt5 reduction, they provided strong support for a positive role of premature transcription termination in SHM. We therefore tested whether premature transcription termination events happened on the Igh gene. Early transcription termination events should result in a higher abundance of transcripts containing only the 5′ end of the gene compared with transcripts containing both 5′ and 3′ ends. Such an imbalance could serve as a surrogate indicator for premature transcription termination and could be estimated by quantifying the absolute copy number of RNA transcripts containing various portions of the gene in total RNA. To do this, we constructed a plasmid harbouring a single copy of each region to be investigated. Using this plasmid and real time quantitative PCR, we generated individual standard curves of the copy numbers (estimated by the molecular weight of the plasmid) against the Ct (cycle threshold) values for each region of interest. These standard curves were then used to determine the absolute quantity of RNA transcripts containing the designated region in total RNA. This quantification method worked effectively, as exemplified by its ability to distinguish the intron region f because of its low abundance (~2–4%) compared with the mature full-length Igh mRNA (Fig. 6a). In support of our hypothesis, we observed approximately two fold more RNA transcripts containing the Igh variable region (Igh-V fusion) than those containing the constant region (Cμ) in Ramos cells (Fig. 6a) when cDNAs were synthesized using random hexamers. Partial Igh transcripts containing only the 5′ end of the Igh gene were not polyadenylated (Fig. 6b) since this bias was not observed when cDNAs were synthesized using poly-T oligonucleotides (the lower signals detected at the 5′ end of the mRNA compared with the constant region were likely due to the limitation in the enzymatic processivity of the reverse transcriptases). Consistent with the important role of RNA exosome in premature transcription termination, the reduction in Rrp40 levels reduced the bias of higher 5′ only transcripts over the full-length mRNA (Fig. 6c). These findings suggest that premature transcription termination does occur naturally at the Igh-V region with its level positively correlating with SHM in B cells.

Discussion

As Spt5 is essential for optimal CSR^11,44 and interacts with AID¹¹, it is surprising that a reduction in Spt5 levels facilitates SHM. However, this discrepancy can be an indication of the distinct sources of ssDNA substrates for AID during SHM and CSR. Whereas R-loops are the major source of ssDNA during CSR, we propose here (see Fig. 7) that premature transcription termination provides ssDNA substrates for AID during SHM. Since mutation rates exceeding 10⁻³ per base per generation will introduce too many nonsense mutations for efficient affinity maturation⁴⁵, the complex role of Spt5 in SHM may have evolved to achieve optimal antibody diversification. Moreover, it is noteworthy that Spt5 seems to also influence the non-homologous end joining and homologous recombination processes⁴⁴, both of which are important for CSR but largely dispensable for SHM. Thus, dissimilar roles of Spt5 in SHM and CSR probably reflect the distinct molecular mechanisms involved in these two processes.

**Figure 7: Model of the premature transcription termination as a source of ssDNA AID substrates.**

Genome-wide Pol II occupancy studies have revealed a large number of genes with stalled Pol II complexes in the transcribing regions. Although the fate of these stalled Pol II complexes remains under debate, one possibility among several others (like pausing or stalling), is that at least some of them will eventually lose all processivity and terminate transcription prematurely^21,22. Our RNA transcript analysis identified partially transcribed Igh genes in B cells (Fig. 6). Although a bias towards 5′ region containing transcripts could result from preferential 3′ degradation of mRNA, it more likely reflects a role of premature transcription termination in SHM since: (1) Pol II is enriched in Igh-V region at a similar level as that reported around transcription termination sites (Fig. 1)^21,22; (2) a reduction in Spt5 leads to an increase in SHM and in the frequency of ssDNA patches at Igh-V region (Figs 2e and 3); (3) the RNA exosome is required for optimal SHM (Fig. 5); and (4) insertion of a transcription termination signal at the Igh-V region in DT40 cells results in increases of both Pol II accumulation and SHM upstream of the termination signal³⁸. Together these findings support the idea that premature transcription termination contributes to SHM by supplying ssDNA substrates for AID. This premature transcription termination at the Igh-V region could be the consequence of frequent stalling (slow progression) and the stochastic loss of Pol II processivity physiologically, which could expose unwound DNA templates for AID to mediate SHM (Fig. 7). Consistent with this idea, NEDD4, an E3-ubiquitinase that can mediate the degradation of unresolvable stalling Pol II, has been found at AID-targeted regions during CSR⁴⁶. This confirms that Pol II complexes can indeed ‘leave’ their template in the middle of transcriptional process, which may provide AID access to the single-stranded transcription bubble. Interestingly, this model is also consistent with the observation that hypermutating dark zone germinal centre B cells actually reduce their surface Ig level and that could be due to an increase in premature transcription termination at hypermutating V regions⁴⁷.

Two different processes have recently been proposed to provide ssDNA substrates for AID. First, paused Pol II complexes correlated with the generation of R-loop structures at IgH switch regions, which positively contributes to CSR^18,19. If paused Pol II complexes themselves are also sufficient to facilitate SHM, the reduction in DSIF (Spt5/Spt4 complex) level should lead to a decrease in both Pol II pausing and SHM. On the contrary, we observed an increase in SHM when the Spt5 level was reduced in Ramos B cells. Since the reduction of cellular Spt5 is known to decrease Pol II processivity and hence promote premature transcription termination^27,28,29, these observations suggested that premature transcription termination could be playing an important role in SHM. Second, accumulation of DNA supercoils around the transcribing Pol II complex has also been proposed as a source of AID substrates^23,34. AID can mutate supercoiled DNA in vitro and reduction of the DNA supercoil relieving factor topoisomerase I results in an increase in SHM⁴⁸. However, supercoiled DNA should expose both strands of a DNA molecule symmetrically, but this remains to be examined by the ssDNA patch analysis by deeper sequencing for the V region in vivo^33,34.

Our model (Fig. 7) of the premature transcription termination process as a source of ssDNA for AID during SHM suggests that: (1) the hypersensitive mutation region is similar to the size of a transcription bubble (14–18 bp); (2) any factors that reduce Pol II processivity such as a decreased level of elongation factors or increased abnormal DNA structures like supercoils will facilitate SHM⁴⁸; (3) an artificial increase of Pol II complex termination by adding termination signals at V region promotes SHM³⁸; and (4) template and non-template ssDNA are not likely exposed to AID simultaneously. We also (Fig. 7) hypothesize that on premature termination, unwound non-template ssDNA is exposed while template strand may remain hybridized with nascent RNA. Later, non-template ssDNA in the transcription bubble gets protected by RPA while RNA:DNA hybrids in the same transcription bubble will be processed through an RNA removal mechanism during termination⁴⁹ to expose the template strand ssDNA for AID to target. Such sequential exposure of ssDNA for AID targeting is probably important for keeping the frequency of DNA double-strand breaks low (<~10%) at the Igh-V regions during SHM³⁰.

How AID targets Ig loci with high specificity in B cells remains a central question in B-cell biology. Although our data indicate that the frequency of AID substrates strongly influences the mutation frequency in a SHM-competent cell, the occurrence of ssDNA at the Igh-V region is probably not sufficient for eliciting the SHM process. In fact, the premature transcription termination process is likely to be determined by DNA elements independent of AID and SHM. This notion is consistent with recent observations that neither the accumulation of Pol II at the switch regions²⁴ nor the ssDNA frequency³³ at Igh-V region is dependent on AID. Moreover, the level of Pol II accumulation at the Ig-L locus in DT40 cells is independent of either the surrounding cis-elements capable of promoting SHM or the presence of AID⁵⁰. Thus, although we have elucidated a novel mechanism utilized by B cells to provide ssDNA substrate for AID during SHM, the susceptibility of the variable region to SHM and the specificity of AID targeting to those regions in B cells are probably subject to several more levels of control rather than a simple interaction of the enzyme and its substrates. It is however interesting to investigate in the future what source(s) of ssDNA is used in off-target mutation sites of AID and whether they share similar mechanisms with the Igh-V region.

Methods

Cell lines and antibodies

The wild type human Burkitt’s lymphoma cell line Ramos has been described in refs 30, 35. The modified reporter Ramos cell line was established by replacing the endogenous 4–34 Igh-V region with mCherry-4–34 fusion fragment (Fig. 1a) using recombinase-mediated cassette change. Briefly, a LoxP-flanked hygromycin resistance gene was integrated into endogenous Igh-V locus by homologous recombination. The mCherry reporter cassette was then knocked into the Igh-V locus through Cre-mediated recombination³¹. The Ramos clone used to construct this reporter line was preselected to have an undetectable level of AID protein and confirmed not to undergo SHM. Cells containing mCherry–Igh-V fusion were then transfected with AID–ER fusion protein³⁶ and clones were selected based on their capacity for undergoing SHM in a 4-OHT (Sigma-Aldrich)-inducible manner. Cells were considered to have undergone SHM when they reduced their mCherry fluorescence based on flow cytometry analysis. Antibodies used in this study were anti-Pol II C-terminal repeats (Abcam), anti-Pol II C-terminal repeats (serine 5 phosphorylated) (Abcam), anti-Spt5 (Santa Cruz) (1:250 dilution), anti-tubulin (Sigma-Aldrich) (1:2,500 dilution).

Real-time PCR quantification

In ChIP experiments, ChIP DNA precipitated with a specific antibody (1 μg per reaction) was analysed with the ΔΔCt method using SYBR Green PCR Master Mix (Life Technologies or KAPA Biosystems) and was normalized to input DNA. PCR efficiency was estimated using serial dilutions and the signal was adjusted accordingly. In all experiments, rabbit anti-rat polyclonal antibody was used as the negative control and the ChIP signal from it was subtracted from each sample before further analysis. For absolute quantification of RNA transcripts, a single plasmid containing a single copy of each region of interest was made and the copy number was estimated using the molecular weight calculated based on the size of the plasmid. The plasmid was then used as a standard in the standard curve method of quantitative PCR and the copy number was calculated by ViiA 7 real-time PCR software (Life Technologies). Primers used in the study are listed in Supplementary Table 1.

Lentiviral transduction and shRNA

Lentiviral particles containing designated shRNA were prepared by shRNA Core Facility at Albert Einstein College of Medicine. All the shRNA constructs were obtained from the human TRC Library (Thermo Scientific) with sequences listed in Supplementary Table 2. Control-shRNA (Ctrl-shRNA) is the SHC002 construct from Sigma-Aldrich. Ramos cells were transduced with ~3:1 multiplicity of infection and were subjected to Puromycin (Gibco, Life Technology) selection for 7–9 days. Successful knockdown of targeted genes was verified by real-time PCR and in the case of Spt5, western analysis as well. An shRNA-resistant form of human Spt5 was created by replacing 7 of the 21 nucleotides of the shRNA-targeting sequence but keeping amino-acid sequence untouched.

Mutation analysis

To obtain the mutation pattern, cells that had lost their mCherry fluorescence were sorted by flow cytometry to extract their genomic DNA (Qiagen). The mCherry–Igh-V fusion region was amplified using PfuTurbo (Agilent) cloned into the sequencing vector and Sanger sequenced in both directions to cover the whole ~1.3 kb region. Sequencing data were then aligned by ClustalW2 and analysed using SHMTool.

ssDNA detection by Bisulphite treatment

In situ bisulphite treatment of cross-linked cellular nuclei under non-denaturing conditions was conducted to detect ssDNA patches that were natively exposed in chromatin^33,34. The final concentration of bisulphite was reduced to ~2 M to improve DNA recovery and a new KAPA HiFi Uracil+ kit (KAPA Biosystems) was used in the DNA amplification step to improve the accuracy of the analysis. The whole fusion of mCherry and Ramos Igh-V was amplified, cloned into vector and sequenced by Sanger sequencing.

Estimation of mutation rate by IgM reversion assay

Individual clones of IgM-negative Ramos cells expressing the endogenous VH4–34 heavy-chain V region bearing a stop-codon³⁵ were allowed to accumulate mutations for 3 weeks. Any mutations that change the stop-codon into a sense codon will allow its descendants to become IgM positive. The frequency of those reverted cells was examined by flow cytometry. After analysing sufficient numbers of clones, a mutation rate at that specific site could be estimated by maximal likelihood method⁵¹.

Statistical analysis

All statistical analyses were conducted using Prism 6 software. Throughout the data, * represented that the statistical significance was achieved and P-value was illustrated correspondingly. Error bars represent s.d. among independent experiments. In the cases where multiple experiments were compiled together, a paired Student’s t-test was used.

Additional information

How to cite this article: Wang, X. et al. A source of the single-stranded DNA substrate for activation-induced deaminase during somatic hypermutation. Nat. Commun. 5:4137 doi: 10.1038/ncomms5137 (2014).

Disclaimer

The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

References

Victora, G. D. & Nussenzweig, M. C. Germinal centers. Annu. Rev. Immunol. 30, 429–457 (2012).
Article CAS Google Scholar
Muramatsu, M. et al. Class switch recombination and hypermutation require activation-induced cytidine deaminase (AID), a potential RNA editing enzyme. Cell 102, 553–563 (2000).
Article CAS Google Scholar
Revy, P. et al. Activation-induced cytidine deaminase (AID) deficiency causes the autosomal recessive form of the Hyper-IgM syndrome (HIGM2). Cell 102, 565–575 (2000).
Article CAS Google Scholar
Peled, J. U. et al. The biochemistry of somatic hypermutation. Annu. Rev. Immunol. 26, 481–511 (2008).
Article CAS Google Scholar
Rada, C., Di Noia, J. M. & Neuberger, M. S. Mismatch recognition and uracil excision provide complementary paths to both Ig switching and the A/T-focused phase of somatic mutation. Mol. Cell 16, 163–171 (2004).
Article CAS Google Scholar
Stavnezer, J., Guikema, J. E. & Schrader, C. E. Mechanism and regulation of class switch recombination. Annu. Rev. Immunol. 26, 261–292 (2008).
Article CAS Google Scholar
Bransteitter, R., Pham, P., Scharff, M. D. & Goodman, M. F. Activation-induced cytidine deaminase deaminates deoxycytidine on single-stranded DNA but requires the action of RNase. Proc. Natl Acad. Sci. USA 100, 4102–4107 (2003).
Article CAS ADS Google Scholar
Nussenzweig, A. & Nussenzweig, M. C. Origin of chromosomal translocations in lymphoid cancer. Cell 141, 27–38 (2010).
Article CAS Google Scholar
Kenter, A. L. AID targeting is dependent on RNA polymerase II pausing. Semin. Immunol. 24, 281–286 (2012).
Article CAS Google Scholar
Nambu, Y. et al. Transcription-coupled events associating with immunoglobulin switch region chromatin. Science 302, 2137–2140 (2003).
Article CAS ADS Google Scholar
Pavri, R. et al. Activation-induced cytidine deaminase targets DNA at sites of RNA polymerase II stalling by interaction with Spt5. Cell 143, 122–133 (2010).
Article CAS Google Scholar
Willmann, K. L. et al. A role for the RNA pol II-associated PAF complex in AID-induced immune diversification. J. Exp. Med. 209, 2099–2111 (2012).
Article CAS Google Scholar
Martinez-Rucobo, F. W., Sainsbury, S., Cheung, A. C. & Cramer, P. Architecture of the RNA polymerase-Spt4/5 complex and basis of universal transcription processivity. EMBO J. 30, 1302–1310 (2011).
Article CAS Google Scholar
Hsin, J. P. & Manley, J. L. The RNA polymerase II CTD coordinates transcription and RNA processing. Genes Dev. 26, 2119–2137 (2012).
Article CAS Google Scholar
Kouzine, F. et al. Global regulation of promoter melting in naive lymphocytes. Cell 153, 988–999 (2013).
Article CAS Google Scholar
Longerich, S., Tanaka, A., Bozek, G., Nicolae, D. & Storb, U. The very 5' end and the constant region of Ig genes are spared from somatic mutation because AID does not access these regions. J. Exp. Med. 202, 1443–1454 (2005).
Article CAS Google Scholar
Woo, C. J., Martin, A. & Scharff, M. D. Induction of somatic hypermutation is associated with modifications in immunoglobulin variable region chromatin. Immunity 19, 479–489 (2003).
Article CAS Google Scholar
Zarrin, A. A. et al. An evolutionarily conserved target motif for immunoglobulin class-switch recombination. Nat. Immunol. 5, 1275–1281 (2004).
Article CAS Google Scholar
Huang, F. T. et al. Sequence dependence of chromosomal R-loops at the immunoglobulin heavy-chain Smu class switch region. Mol. Cell Biol. 27, 5921–5932 (2007).
Article CAS Google Scholar
Wu, H. Y., Shyy, S. H., Wang, J. C. & Liu, L. F. Transcription generates positively and negatively supercoiled domains in the template. Cell 53, 433–440 (1988).
Article CAS Google Scholar
Core, L. J., Waterfall, J. J. & Lis, J. T. Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science 322, 1845–1848 (2008).
Article CAS ADS Google Scholar
Adelman, K. & Lis, J. T. Promoter-proximal pausing of RNA polymerase II: emerging roles in metazoans. Nat. Rev. Genet. 13, 720–731 (2012).
Article CAS Google Scholar
Shen, H. M. & Storb, U. Activation-induced cytidine deaminase (AID) can target both DNA strands when the DNA is supercoiled. Proc. Natl Acad. Sci. USA 101, 12997–13002 (2004).
Article CAS ADS Google Scholar
Wang, L., Wuerffel, R., Feldman, S., Khamlichi, A. A. & Kenter, A. L. S region sequence, RNA polymerase II, and histone modifications create chromatin accessibility during class switch recombination. J. Exp. Med. 206, 1817–1830 (2009).
Article CAS Google Scholar
Rajagopal, D. et al. Immunoglobulin switch mu sequence causes RNA polymerase II accumulation and reduces dA hypermutation. J. Exp. Med. 206, 1237–1244 (2009).
Article CAS Google Scholar
Wagschal, A. et al. Microprocessor, Setx, Xrn2, and Rrp6 co-operate to induce premature termination of transcription by RNAPII. Cell 150, 1147–1157 (2012).
Article CAS Google Scholar
Mason, P. B. & Struhl, K. Distinction and relationship between elongation rate and processivity of RNA polymerase II in vivo. Mol. Cell 17, 831–840 (2005).
Article CAS Google Scholar
Wu-Baer, F., Lane, W. S. & Gaynor, R. B. Role of the human homolog of the yeast transcription factor SPT5 in HIV-1 Tat-activation. J. Mol. Biol. 277, 179–197 (1998).
Article CAS Google Scholar
Bourgeois, C. F., Kim, Y. K., Churcher, M. J., West, M. J. & Karn, J. Spt5 cooperates with human immunodeficiency virus type 1 Tat by preventing premature RNA release at terminator sequences. Mol. Cell Biol. 22, 1079–1093 (2002).
Article CAS Google Scholar
Sale, J. E. & Neuberger, M. S. TdT-accessible breaks are scattered over the immunoglobulin V domain in a constitutively hypermutating B cell line. Immunity 9, 859–869 (1998).
Article CAS Google Scholar
Baughn, L. B. et al. Recombinase-mediated cassette exchange as a novel method to study somatic hypermutation in Ramos cells. mBio 2, e00186–e00211 (2011).
Article Google Scholar
Yamaguchi, Y., Shibata, H. & Handa, H. Transcription elongation factors DSIF and NELF: promoter-proximal pausing and beyond. Biochim. Biophys. Acta 1829, 98–104 (2013).
Article CAS Google Scholar
Ronai, D. et al. Detection of chromatin-associated single-stranded DNA in regions targeted for somatic hypermutation. J. Exp. Med. 204, 181–190 (2007).
Article CAS Google Scholar
Parsa, J. Y. et al. Negative supercoiling creates single-stranded patches of DNA that are substrates for AID-mediated mutagenesis. PLoS Genet. 8, e1002518 (2012).
Article MathSciNet CAS Google Scholar
Zhang, W. et al. Clonal instability of V region hypermutation in the Ramos Burkitt's lymphoma cell line. Int. Immunol. 13, 1175–1184 (2001).
Article CAS Google Scholar
Doi, T., Kinoshita, K., Ikegawa, M., Muramatsu, M. & Honjo, T. De novo protein synthesis is required for the activation-induced cytidine deaminase function in class-switch recombination. Proc. Natl Acad. Sci. USA 100, 2634–2638 (2003).
Article CAS ADS Google Scholar
Ding, B., LeJeune, D. & Li, S. The C-terminal repeat domain of Spt5 plays an important role in suppression of Rad26-independent transcription coupled repair. J. Biol. Chem. 285, 5317–5326 (2010).
Article CAS Google Scholar
Kodgire, P., Mukkawar, P., Ratnam, S., Martin, T. E. & Storb, U. Changes in RNA polymerase II progression influence somatic hypermutation of Ig-related genes by AID. J. Exp. Med. 210, 1481–1492 (2013).
Article CAS Google Scholar
Liu, M. et al. Two levels of protection for the B cell genome during somatic hypermutation. Nature 451, 841–845 (2008).
Article CAS ADS Google Scholar
Basu, U. et al. The RNA exosome targets the AID cytidine deaminase to both strands of transcribed duplex DNA substrates. Cell 144, 353–363 (2011).
Article CAS Google Scholar
Lykke-Andersen, S., Brodersen, D. E. & Jensen, T. H. Origins and activities of the eukaryotic exosome. J. Cell Sci. 122, (Pt 10): 1487–1494 (2009).
Article CAS Google Scholar
Houseley, J. & Tollervey, D. The nuclear RNA surveillance machinery: the link between ncRNAs and genome structure in budding yeast? Biochim. Biophys. Acta 1779, 239–246 (2008).
Article CAS Google Scholar
Allmang, C., Mitchell, P., Petfalski, E. & Tollervey, D. Degradation of ribosomal RNA precursors by the exosome. Nucleic Acids Res. 28, 1684–1691 (2000).
Article CAS Google Scholar
Stanlie, A., Begum, N. A., Akiyama, H. & Honjo, T. The DSIF subunits Spt4 and Spt5 have distinct roles at various phases of immunoglobulin class switch recombination. PLoS Genet. 8, e1002675 (2012).
Article CAS Google Scholar
Rajewsky, K. Clonal selection and learning in the antibody system. Nature 381, 751–758 (1996).
Article CAS ADS Google Scholar
Sun, J. et al. E3-ubiquitin ligase Nedd4 determines the fate of AID-associated RNA polymerase II in B cells. Genes Dev. 27, 1821–1833 (2013).
Article CAS Google Scholar
MacLennan, I. C. Germinal centers. Annu. Rev. Immunol. 12, 117–139 (1994).
Article CAS Google Scholar
Kobayashi, M. et al. Decrease in topoisomerase I is responsible for activation-induced cytidine deaminase (AID)-dependent somatic hypermutation. Proc. Natl Acad. Sci. USA 108, 19305–19310 (2011).
Article CAS ADS Google Scholar
Kuehner, J. N., Pearson, E. L. & Moore, C. Unravelling the means to an end: RNA polymerase II transcription termination. Nat. Rev. Mol. Cell Biol. 12, 283–294 (2011).
Article CAS Google Scholar
Kohler, K. M. et al. Identification of core DNA elements that target somatic hypermutation. J. Immunol. 189, 5314–5326 (2012).
Article CAS Google Scholar
Hall, B. M., Ma, C. X., Liang, P. & Singh, K. K. Fluctuation analysis CalculatOR: a web tool for the determination of mutation rate using Luria-Delbruck fluctuation analysis. Bioinformatics 25, 1564–1565 (2009).
Article CAS Google Scholar

Download references

Acknowledgements

We thank Drs Barbara Birshtein, Hilda Ye, Jonathan Warner and Richard Chahwan for their critical comments on the project. We thank the core facilities at Albert Einstein College for their technical support on Flow Cytometry and shRNA viral particle preparation. Funding for this work was provided by National Institutes of Health grants R01 CA072649, R01 CA102705 and an NCI/CFAR Pilot Project from P30CA013330 to M.D.S.

Author information

Authors and Affiliations

Department of Cell Biology, Albert Einstein College of Medicine, Bronx, 10461, New York, USA
Xiaohua Wang, Manxia Fan, Susan Kalis, Lirong Wei & Matthew D. Scharff

Authors

Xiaohua Wang
View author publications
You can also search for this author in PubMed Google Scholar
Manxia Fan
View author publications
You can also search for this author in PubMed Google Scholar
Susan Kalis
View author publications
You can also search for this author in PubMed Google Scholar
Lirong Wei
View author publications
You can also search for this author in PubMed Google Scholar
Matthew D. Scharff
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.W. and M.F. performed the experiments. S.K. and L.W. provided essential reagents and technical supports. X.W. and M.D.S designed the study and wrote the manuscript.

Corresponding author

Correspondence to Matthew D. Scharff.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-2 and Supplementary Tables 1-2 (PDF 914 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, X., Fan, M., Kalis, S. et al. A source of the single-stranded DNA substrate for activation-induced deaminase during somatic hypermutation. Nat Commun 5, 4137 (2014). https://doi.org/10.1038/ncomms5137

Download citation

Received: 12 December 2013
Accepted: 16 May 2014
Published: 13 June 2014
DOI: https://doi.org/10.1038/ncomms5137

This article is cited by

Transient AID expression for in situ mutagenesis with improved cellular fitness
- Talal Salem Al-Qaisi
- Yu-Cheng Su
- Steve R. Roffler
Scientific Reports (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.