Activation Induced Cytidine Deaminase is a mutator that is essential for antibody diversification. A new study, recently reported in Cell Research , provides insights into the mechanism that guides the mutator to physiological targets.

The immune system produces diverse antibodies to counter infections. Three mechanisms contribute to antibody diversity (Fig. 1).1 During B cell maturation, V(D)J recombination creates a large repertoire of antigen-binding sites in immunoglobulin (Ig) variable regions. Naïve B cells express the primary antigen-binding sites in the context of B cell receptors. Antigenic challenge triggers B cell activation. In this process, class switch recombination (CSR) changes the constant region of Ig heavy chain (IgH), which defines antibody class. All naïve B cells express IgM and IgD, whereas activated B cells express IgG, IgA or IgE. As different classes of antibody are specialized for distinct immune functions, class switching broadens the scope of immune response. In parallel with CSR, somatic hypermutation (SHM) modifies antibody variable regions to improve antigen-binding affinity.

Fig. 1
figure 1

Illustration of three pathways of antibody diversification. The diagram is for illustration purpose only and is not drawn to scale. The germline IgH locus is shown at the top. In B cell progenitors, V(D)J recombination joins various combinations of VH, D, and JH segments together to form the complete V(D)J exon, which encodes the Ig variable region. In naïve B cells, Cμ and Cδ are expressed in association with the assembled V(D)J exon to form IgM and IgD, respectively. In response to antigenic challenge and B cell activation, class switch recombination occurs between switch region (Sμ) in front of Cμ and one of the downstream switch regions, for instance, Sγ in the diagram. As a result, the downstream constant region (e.g., Cγ in the diagram) replaces Cμ and Cδ, leading to the expression of a new class of antibody with distinct effector functions. Additionally, somatic hypermutation introduces high levels of mutations in the V(D)J exon. The microenvironment of germinal center selects for mutations that improve antigen-binding affinity

CSR takes place between switch regions that precede constant regions at the IgH locus (Fig. 1). SHM introduces primarily point mutations into the variable region exons of both Ig heavy and light chains. Despite their apparent differences, CSR and SHM share similar mechanisms.2,3 At the core of both processes is Activation Induced Cytidine Deaminase (AID). AID initiates CSR by deaminating cytidines in switch regions. The deamination leaves uracil in DNA. Uracil DNA glycosylase excises the uracil to form abasic sites. Apurinic endonuclease cleaves the abasic sites to break switch regions. Non-Homologous End Joining machinery ligates two broken switch regions to form the CSR product. During SHM, AID deaminates cytidines in Ig variable region exons. Subsequent processing of uracil by DNA replication and various repair pathways mutate the variable region.

The mutagenic activity of AID is largely confined to switch regions and Ig variable regions. The restriction is not absolute, and AID acts on certain non-Ig loci at low levels. Despite the leakiness, the targeting mechanism minimizes deleterious mutations in B cells during an immune response. Oncogenic transformation of B lymphocyte is relatively rare in relation to innumerable B cell activation events throughout the lifespan of an individual. Understanding the AID targeting mechanism has been a major goal of the field.

AID preferentially deaminates cytidines in the context of WRCY or the complementary RGYW motif. Such a prevalent motif cannot be sufficient to unequivocally mark AID targets. Although AID is relatively nondiscriminatory with respect to substrates at the primary sequence level, the enzyme acts strictly on single-stranded DNA. As transcription transiently unwinds DNA template, the process is a major source of single-stranded DNA in genome. Consistent with this link, both CSR and SHM require transcription. Since AID does not act on all transcribed regions, transcription at AID targets may be special in some way. Indeed, AID off-targets in non-Ig loci tend to be convergently transcribed.4,5 How convergent transcription attracts AID to off-targets, and whether or not it is involved in on-target activity at Ig loci has not been fully elucidated. The study by Chen and colleagues, which is reported recently in Cell Research,6 shed some light on this question.

Chen and colleagues investigated the AID targeting mechanism by searching for factors that interact with AID and guide AID to genomic targets. The approach is a well-trodden path and, over the years, has yielded a bewilderingly large collection of AID-interacting proteins. However, the role of many of these putative co-factors remains to be rigorously validated by functional assays in vivo. In this crucial respect, the AID-interacting protein identified by Chen and colleagues, ROD1/PTBP3,7 stands out from the crowd. To test the function of ROD1, the authors knocked out its gene in mice. Remarkably, ROD1 deficiency led to severe reductions in both CSR and SHM. The phenotype is not an indirect consequence of defects in related processes. In this regard, ROD1-knockout mice contain normal B cell populations in different developmental stages, including germinal center B cells, where CSR and SHM take place. In response to B cell activation signals in vitro, the ROD1-deficient B cells proliferate, upregulate AID expression, and transcribe various switch regions at comparable levels as wild-type B cells. In light of the direct interaction between AID and ROD1, the most plausible interpretation of the phenotype is that loss of ROD1 directly impairs AID function. ROD1 protein is a family member of RNA-binding proteins that exhibit high affinity for polypyrimidine tracts.8 The protein consists of a tandem array of four RRM-type RNA binding domains. Based on a genome-wide survey, ROD1 preferentially associates with transcripts that originate from bi-directionally transcribed loci. In line with the association between AID and ROD1, AID exhibits similar RNA binding profiles to ROD1, and the interaction is ROD1-dependent. Based on these observations, the authors proposed that ROD1 binds nascent transcripts at bi-directionally transcribed loci and recruits AID to these targets.

The model raises several questions. First, like other polypyrimidine tract-binding proteins, ROD1 preferentially interacts with pyrimidine-rich motifs in RNA. Paradoxically, transcription of one of the most prominent AID targets, switch region, yields predominantly G-rich transcripts. Transcripts from Ig variable regions are not notable for high pyrimidine content either. How could ROD1 recognize transcripts from these bona fide AID targets? Second, to guide AID to DNA, ROD1 must bind to nascent transcript that is still attached to template. Interaction with free transcript is counterproductive, as ROD1 would divert AID away from its genomic target. How could ROD1 differentiate a nascent transcript from the swarm of free transcripts with the same sequence? Bi-directional transcription is unlikely to be a specific mark for ROD1 or AID targets either, as divergent transcription near promoters is a common trait of many loci. Instead, convergent transcription provides a more definitive correlation with AID non-Ig off-targets, although the association remains to be established for the two main AID targets: switch regions and Ig variable regions. Thus, additional work is needed to determine whether and how ROD1 targets convergently transcribed regions and, more importantly, whether the mechanism applies to the physiological AID targets at the Ig loci.