Introduction

Obesity and the metabolic syndrome are growing problems worldwide, contributing significantly to increased mortality, morbidity, and associated health-care costs. Insulin resistance (IR), which precedes type 2 diabetes, is strongly associated with obesity and is also on the rise worldwide.1 The etiologies of IR and type 2 diabetes have been attributed to complex genetic factors as well as environmental exposures, with suspicion falling on soluble factors secreted by adipocytes, as well as metabolic abnormalities in muscle tissue associated with increased intracellular lipid content, among other potential mechanisms.2, 3, 4, 5 Recently, obesity and IR have been shown to manifest deranged profiles in both innate and adaptive immune responses.1 Data from Winer et al.6 indicated that the adaptive immune system may play an important role in the development of IR in the context of diet-induced obesity. In these studies, B lymphocytes were implicated in the pathogenesis of IR in C57BL/6 mice fed a high-fat diet (HFD); HFD led to increased numbers of B cells infiltrating the visceral adipose tissues (VAT), and these B cells influenced VAT-resident macrophage and T-cell effector functions. Most importantly, the development of IR was abrogated by B-cell depletion.6 B-cell-null obese mice that received purified immunoglobulin G (IgG) collected from IR mice showed decreased responses to glucose tolerance tests, suggesting a role for circulating antibody in the phenotype. However, other immunological roles for B cells, such as the presentation of antigens to T cells, or other effector functions, could also be consistent with the results of this study. Notably, T cells and macrophages were also required for the development of IR. The potential importance of similar B-cell mechanisms in human IR was supported by the finding that IR human subjects showed autoantibodies to a distinct set of autoantigens as compared with insulin-sensitive subjects.

Given these links between obesity, IR, and B-cell biology in mice and humans, we hypothesized that a HFD and resultant obesity might have a significant impact on B-cell populations and the antibody gene rearrangements they express, potentially selecting for autoreactive B cells or otherwise altering the B-cell repertoire in affected tissues. To assess the effects of HFD on mouse antibody repertoires, we carried out high-throughput DNA sequencing of 24 million antibody heavy chain gene (IGH) rearrangements from B cells expressing IgM, IgG, and IgA isotypes in the blood, spleen, small intestine, large intestine, liver, peritoneal cavity washing, brown adipose tissue, and VAT of mice fed either regular diet (RD) or HFD for 18 weeks. We report that mice on HFD show systematic alterations of IGH rearrangements in their B-cell repertoires in a tissue- and isotype-specific manner. We find that HFD leads to increased proportions of IgA-expressing B cells with short and hydrophobic complementarity-determining region-3 (CDRH3) IGH sequences, and increased proportions of IGH rearrangements lacking somatic mutation. A surprising finding in the analysis is that there are a number of highly stereotyped or “convergent” antibody gene rearrangements expressed by the B cells in the adipose tissues, peritoneal B cells, and, to a lesser degree, intestinal tissues across numerous individual mice. These gene rearrangements are rare in other tissues of the body. The convergent antibody gene rearrangements in the adipose tissues of HFD mice are notable for having higher somatic hypermutation frequencies, suggesting that they may have undergone affinity maturation in an antigen-driven process. Taken together, these results demonstrate that dietary alterations can cause significant changes in the systemic immunoglobulin repertoires expressed by B cells in mice in a tissue- and isotype-specific manner, and provide a framework for understanding how diet and obesity could lead to pathogenic adaptive immune responses associated with IR.

Results

HFD selects for B cells expressing IgA with short, hydrophobic CDRH3s in multiple tissues

As a first analysis of the IGH gene rearrangements detected in mice on HFD compared with RD, we determined the frequencies of IGHV, IGHD, and IGHJ gene segment usage in the B cells from each tissue isolated. We considered gene segment usage separately for each antibody isotype, as there is selection for or against particular gene segments in the naive B-cell repertoire compared with memory B-cell populations.9, 10 Mouse IGHV, IGHD, and IGHJ genes were aligned with the germline gene segment repertoires at the IMGT repository.11 Overall, IGHV gene segment usage was similar in HFD compared with RD mice. Supplementary Figures S1 and S2 online show the IGHV, IGHD, and IGHJ gene segment usage frequency differences in the spleen, small intestine, and VAT when five HFD mice were compared with five RD mice, plotting the top quartile of most highly rearranged IGHV gene segments, and all of the IGHD and IGHJ gene segments. When corrected for multiple hypothesis testing by the Benjamini–Hochberg procedure, no IGHV, IGHD, or IGHJ gene segment was significantly different in its frequency between the HFD and RD mice.

A key determinant of an antibody’s antigen specificity is the peptide loop encoded by the CDRH3 region that spans the junctions of the joined IGHV, IGHD, and IGHJ gene segments.12, 13 In humans, shorter CDRH3 lengths are observed in antigen-experienced B cells such as memory B cells as compared with naive B cells, suggesting that there is a selection against antibodies with long CDRH3 sequences during antigen-driven proliferation and selection.14, 15 We therefore sought to determine whether there were CDRH3 length differences between HFD and RD mice (Figure 1a). No significant difference in CDRH3 length was seen for IgM in all tissues studied, after correction for multiple hypothesis testing. There was, however, a striking difference in the CDRH3 lengths of IgA in HFD compared with RD mice, with shorter CDRH3s seen in spleen, small intestinal tissue, and VAT (Figure 1a). Interestingly, although the difference was most significant in the somatically mutated IGH rearrangements, a similar trend was seen in the germline IgA reads, suggesting that exposure to a HFD was associated with selection for shorter CDRH3 in both isotype-switched unmutated B cells and those that had undergone isotype switching to IgA and somatic mutation. In contrast, IgG-expressing B cells did not show significant differences in CDRH3 lengths following HFD compared with RD, suggesting that the effect of HFD is not generic for all isotype-switched B cells, but is specific to IgA-expressing B cells.

Figure 1
figure 1

Complementarity-determining region-3 (CDRH3) amino acid characteristics differ between high-fat diet (HFD) and regular diet (RD) mice. CDRH3 length and hydropathy scores were determined and averaged per mouse for both HFD and RD mice. Results are represented in box–whisker plot, with asterisks (*) indicating significance after multiple hypotheses testing (MHT) correction. (a) HFD mice exhibited shorter immunoglobulin A (IgA) CDRH3 amino acid length. When compared with RD mice, HFD mice showed significantly shorter IgA CDRH3 amino acid length in spleen (mutated: 10.0 vs. 11.3, P<0.0004), small intestine (mutated: 9.9 vs. 10.9, P<0.02; germline: 10.0 vs. 10.9, P<0.03), and visceral adipose tissue (mutated: 9.9 vs. 11.3, P<0.009; germline: 9.3 vs. 12.1, P<0.004). (b) HFD mice select for more hydrophobic CDRH3. When compared with RD mice, HFD mice had IgA CDRH3s that were more hydrophobic (higher hydropathy score) in the small intestine (mutated: −6.6 vs. −8.3, P<0.01), visceral adipose tissue (mutated: −5.2 vs. −10.7, P<0.008; germline: −5.3 vs. −10.8, P<0.03), and splenic tissue (mutated: −5.8 vs. −8.2, P<0.02). SI, small intestine; SPLN, spleen; VAT, visceral adipose tissue.

PowerPoint slide

To evaluate for other systematic differences in the antibody repertoires in response to HFD, we also assessed the physicochemical properties of the CDRH3 amino acid sequences in each isotype. HFD was associated with an increase in hydrophobicity (shown as having higher hydropathy scores) in IgA antibodies, but not IgM or IgG antibodies (Figure 1b). The most marked increase in CDRH3 hydrophobicity was seen in VAT specimens. Thus, in the context of HFD, B cells expressing IgA with short CDRH3s, with more hydrophobic amino acid residues, appear to be preferentially selected to clonally expand or persist in the tissues we examined. To investigate whether the changes to CDRH3 hydrophobicity were related to length, we evaluated the mean hydropathy scores as a function of CDRH3 length (Figure 2). We observed a strong negative correlation between CDRH3 length and hydrophobicity. This relationship was seen in both HFD and RD mice, and their respective linear regression lines were not significantly different in slope (Figure 2). Analysis of covariance was performed and showed CDRH3 hydropathy to be affected by CDRH3 length and diet status, and that there was significant interaction between diet status and CDRH3 length. These results indicate mice eating a HFD show increased frequencies of B cells expressing antibodies that are both hydrophobic and short in their CDHR3s, suggesting that selection of these B cells may be acting on the basis of the CDRH3 lengths, or their hydrophobicity, or both.

Figure 2
figure 2

Complementarity-determining region-3 (CDRH3) hydropathy is negatively correlated with sequence length. CDRH3 hydropathy scores for sequences of varying length were calculated and averaged per mouse for high-fat diet (HFD) and regular diet (RD) mice. Results of each diet group are represented as a box–whisker plot per tissue and isotype studied. Regression lines are drawn for each diet group relating length to hydropathy score. There is a negative correlation between CDRH3 length and hydropathy score for all tissues and isotypes examined. The effect length on hydropathy is seen equally in both HFD and RD groups. SI, small intestine; SPLN, spleen; VAT, visceral adipose tissue.

PowerPoint slide

Higher proportion of unmutated IgA in HFD

We next systematically assessed somatic hypermutation frequencies in the IGHV gene segments associated with each antibody isotype in the HFD and RD mouse tissues. For this analysis, all of the IGH rearrangements from mice in the HFD or RD groups were aggregated into tissue- and isotype-specific sets, and histograms of the IGHV mutation rate distribution were generated (Figure 3a). There was a significant difference in the IgM mutation distribution between HFD and RD mice in the small intestine, expressed as the proportion of reads per mouse with <1% mutation frequency (Figure 3b). IgA in the HFD group also displayed a higher proportion of unmutated reads in the spleen and small intestine. A trend toward higher proportions of unmutated reads was also noted in IgG in the spleen and small intestine of HFD mice, but these differences did not rise to significance when corrected for multiple hypothesis testing. The biological impact of elevated levels of isotype-switched but unmutated antibodies in the case of HFD mice is unknown. It is possible that these antibodies may have been stimulated by altered microbiota of mice receiving the HFD. We did not measure autoreactive or polyreactive properties of the antibodies in the HFD mice, or identify the antigens that they bind, but we note there is a precedent in the context of certain human autoimmune diseases, including lupus, for unmutated isotype-switched antibodies to be polyreactive and autoreactive.16, 17, 18

Figure 3
figure 3

High-fat diet (HFD) mice have lower antibody mutation levels compared with regular diet (RD) mice. (a) All mice were grouped into either HFD or RD groups. IGHV gene mutation frequencies were calculated for each unique IGH sequence by taking the total number of mutations found in the IGHV gene segment, dividing by its full nucleotide length and expressing as a fraction (along the x axis) in the histogram plot. Mutation frequencies were calculated for each tissue and isotype. (b) The proportion of IGHV gene reads below a 1.0% mutation threshold was determined for each mouse on a tissue and isotype basis, and shown in the box–whisker plot. Significant comparisons after multiple hypotheses testing (MHT) correction are indicated by asterisks (*). HFD mice showed statistically significant higher proportion of reads below 1.0% mutation in IgA in the spleen (0.44 vs. 0.26, P<0.007) and small intestine (0.35 vs. 0.20, P<0.007), as well as IgM in the small intestine (0.78 vs. 0.70, P<0.02). SI, small intestine; SPLN, spleen; VAT, visceral adipose tissue.

PowerPoint slide

B cells expressing convergent antibodies are highly enriched in the VAT of mice

“Public” or “convergent” antibody and T-cell receptor (TCR) sequences have been reported in a variety of human and mouse immunological studies, and can be the result of antigen-driven selection of similar Ig or TCR paratope sequences in different individuals, in spite of the great diversity of the B-cell and TCR repertoires.14, 19 Given the prominent role of B cells in the development of IR in the HFD model, we hypothesized that there might be antigen-driven selection for convergent IGH sequences in HFD mice, and that some of these antibodies might be pathogenic. We performed a pairwise comparison of the IGH sequences derived from each mouse with the IGH sequences in all the other mice, in a tissue-specific manner, to search for similar IGH gene rearrangements shared between mice. We defined convergent sequences as those using the same IGHV and IGHJ gene segments and having identical CDRH3 length, and at least 90% similarity of CDRH3 nucleotide sequence. To normalize for the sequencing depth obtained from each mouse tissue, the number of related sequences between the two samples being compared was divided by the total number of comparisons (i.e., the product of the number of unique sequences for the two samples). A heatmap summarizing these results is presented in Figure 4a. Strikingly, we find a consistent pattern of enrichment for similar IGH sequences in the adipose tissues, particularly in mice fed the HFD. Most of these convergent rearrangements were expressed as IgM, with occasional IgG and IgA members. We next examined the somatic mutation status of the convergent adipose tissue sequences found in all mice and compared them with the convergent adipose tissue sequences found only in HFD mice but not RD mice (Figure 4b). In the HFD-specific convergent rearrangements, there was an increase in the proportion of somatically mutated reads. This result is particularly interesting in light of the finding that the overall mutation rate in HFD is decreased, as shown in Figure 3b. Overall, the findings suggest that B cells expressing similar IGH rearrangements can be found in the VAT of all mice, but that there are also convergent sequences unique to the VAT of HFD mice, and these show more evidence of somatic mutation and possible affinity maturation. This could indicate the presence of an antigen-driven process in response to a shared antigen or antigens in the VAT of HFD mice.

Figure 4
figure 4

Convergent immunoglobulin M (IgM) antibody gene rearrangements in the adipose tissue of different mice. (a) A pairwise comparison of IGH reads within and between all mice and tissues was carried out to identify highly similar (“convergent”) sequences shared between mice. Each individual mouse’s Ig repertoire, when compared with itself, showed numerous highly similar sequences shared between different tissue compartments, as is expected from the presence of members of the same B-cell clones distributed within the mouse’s body, and is seen in the large regions of lighter-colored squares along the diagonal in the heatmap. Convergent reads shared between different mice were also detected, particularly in the visceral and brown adipose tissues, with occasional overlap also seen in the small intestinal tissues, among all mice in both the high-fat diet (HFD) and regular diet (RD) groups. The proportion of convergent sequences in each comparison was corrected for sequencing read depth by dividing the total number of similar reads found between the two samples by the product of total number of reads compared in each specimen. This quotient is represented in the heatmap. Tissues are abbreviated as: BAT, brown adipose tissue; COL, colon; DUO, duodenum; ILE, ileum; JEJ, jejunum; LIV, liver; PBMC, peripheral blood mononuclear cell; PERC, peritoneal cavity cell; SPLN, spleen; VAT, visceral adipose tissue. (b) Convergent VAT reads in HFD mice have elevated proportions of mutated sequences. Mutation status of convergent visceral adipose tissue reads found in all mice (both HFD and RD mice) was compared with convergent visceral adipose tissue reads found only in HFD mice. Convergent visceral adipose tissue reads that were identified only in HFD mice showed a higher proportion of mutated reads compared with convergent visceral adipose tissue found in all mice (0.60 vs. 0.20, P<0.002). *P-value <0.05.

PowerPoint slide

To further characterize these convergent antibodies, we clustered these convergent IGH sequences by single linkage, requiring identical IGHV and IGHJ gene usage, identical CDRH3 length, and at least 90% similarity between CDRH3 nucleotide sequences. The convergent rearrangements found in normal VAT repertoire grouped into 11 distinct clusters. In addition, five distinct clusters were identified in HFD VAT convergent reads. A summary of the sequence features of the convergent IGH rearrangements found in all mice, and those present specifically in the HFD mice, is presented in Table 1a, b and c.

Table 1a Nonspecific convergent antibody clones in visceral adipose tissue
Table 1b High-fat diet-specific convergent antibody clones in visceral adipose tissue
Table 1c High-fat diet convergent antibody clones distribution

Discussion

The adaptive immune system has been selected in evolution to respond sensitively to a variety of potential infectious disease threats to the host organism, but misdirected immune responses triggered by environmental exposures can also give rise to pathological consequences, as in the case of food allergies and postinfectious autoimmune complications such as rheumatic heart disease, Guillain–Barre syndrome, and cold agglutinin autoimmune hemolytic anemia. It is less clear what effect the large number of nonpathogen-associated antigen exposures encountered by the host organism may have on the adaptive immune system, particularly in tissues in intimate contact with the environment, such as the gastrointestinal tract or lungs. Several lines of evidence indicate that obesity-associated IR induced by dietary changes may be initiated or worsened by adaptive immune responses. For example, changes to TCR repertoires can indicate clonal expansion of T cells that have been selected by antigenic stimuli,20 and HFD-fed mouse VAT TCRs have been reported to have a restricted repertoire compared with TCRs from lean VAT.21, 22 VAT T cells in diet induced obese ovalbumin (OVA)-specific OT-2 TCR transgenic mice develop a highly biased TCR-Vα repertoire to non-OVA antigen(s), suggesting a non-OVA antigen-driven T-cell response.21

The HFD mouse model enables a test of the hypothesis that a simple dietary intervention that alters the composition of biomolecules and antigens encountered via the gastrointestinal tract can shape adaptive immune cell populations and antigen receptor repertoires in wild-type mice. We analyzed the repertoires of antibodies expressed by B cells in mice fed HFD to measure any systematic changes in the immune system under these controlled conditions. Our findings indicate that some aspects of the B-cell repertoires in mice exposed to HFD remain relatively unchanged, such as the frequency of usage of IGHV, IGHD, and IGHJ gene segments. In contrast, HFD is associated with a prominent selection for B cells expressing IgA with short CDRH3s and hydrophobic amino acid CDRH3 composition. B cells expressing IgA antibodies with short, hydrophobic CDRH3s were most prominent in the small intestine, spleen, and VAT, showing that the finding is present systemically in secondary lymphoid tissues, as well as in the gastrointestinal mucosal tissues and adipose tissues that may be most directly affected by the high-fat content of the diet. The specificity of these findings for IgA-expressing B cells, as opposed to those expressing IgM or IgG, could be consistent with a dysregulated immune reaction triggered by HFD, initiated in the gastrointestinal tract where IgA-expressing B cells predominate, and then extending to VAT sites and secondary lymphoid tissues.

The finding that the adaptive immune system alterations induced by HFD manifest most prominently in the IgA-expressing B cells of the gut is particularly interesting. It is possible that hydrophobic lipid-containing antigens in the gut or associated with adipose tissues lead to the selection of B cells expressing hydrophobic antibodies in these sites. Because mouse antibody CDRH3 length and hydrophobicity are negatively correlated, it is also possible that the HFD is leading to selection for B cells with shorter CDRH3 regions, rather than specifically acting on the hydrophobicity itself. In humans, it is well established that B cells with greater evidence of prior antigenic exposures, such as highly mutated memory B cells, show selection for shorter CDRH3 lengths, although the mechanistic reasons for this are not well understood.23, 24, 25 Another potential basis for the B-cell repertoire changes we have observed in HFD mice could be the impact of the diet on the intestinal microbiome. The intestines are the site of food absorption, and host an enormous, diverse commensal microbe population, but are also a major site of attack by pathogens. IgA, as the most abundant antibody class in the intestinal mucosal secretions,26 plays crucial roles in protecting the body from invasive pathogens and also regulating metabolism by effects on nonpathogen microbes in ways that are often intertwined.27 There is substantial evidence that obesity and type 2 diabetes are associated with an altered gut microbiome.28 Cohousing, transplantation of fecal microbiota, and antibiotic treatment have shown a causal relationship between altering gut microbiota and affecting obesity and metabolism in mouse models.29, 30, 31 Given that noninvasive commensal gut bacteria can induce production of bacteria-specific IgA by mucosal B cells,32 it is possible that changes in the microbiome associated with HFD consumption can be recognized by the gut-associated B cells and alter the IgA repertoire.

We identified a set of related antibody IGH sequences that were found in the VAT of all mice analyzed in the study, both HFD and RD. These were predominantly expressed as IgM and can be grouped into 11 distinct clusters, each having multiple unique CDRH3 members. The presence of convergent IGH indicates that the normal B-cell repertoire in the VAT of mice has stereotypical features, similar to those of tissue-specific gamma-delta T-cell populations previously reported in various tissue sites such as the skin, lung, or intestines in mice.33, 34 Interestingly, there were five distinct sets of related IGH rearrangements that were specific to the VAT of HFD mice, and these showed a higher mutation rate than the convergent rearrangements seen in the VAT repertoires of both HFD and RD mice. We hypothesize that these convergent VAT B cells found exclusively in HFD mice could be involved in the development of IR, as it has been previously demonstrated that early after the initiation of HFD, B cells accumulate in the VAT, and are required for the development of IR in conjunction with T cells and macrophages.6 These data suggest that although there are normal resident VAT B cells with stereotyped antibody IGH sequences that constitute the normal VAT repertoire, HFD feeding may stimulate a distinct set of B cells to proliferate and undergo somatic hypermutation, potentially in response to autoantigen stimulation. It is also possible that such B cells may migrate into the VAT from elsewhere in the body in response to the HFD.

Taken together, our findings demonstrate that high-fat dietary changes lead to systemic immunoglobulin repertoire alterations, with many of these changes preferentially affecting intestinal and visceral adipose tissues, raising the possibility of a gastrointestinal–VAT immune axis shaped by the diet or other environmental antigens. The VAT in healthy mice contains a highly convergent, shared B-cell repertoire that is perturbed by increased fat consumption. These findings provide a framework for better understanding the role that B cells play in diet-induced obesity and the development of insulin resistance. Although a simple dietary intervention leads to the changes observed, additional investigations are necessary to determine the mechanistic details causing the immunological disturbances. As mentioned above, immune system changes may be driven by alterations in the microbiota of the HFD mice. Furthermore, in humans, it has been shown that obesity and increased body mass index can perturb the immune system, leading to increased susceptibility to viral and bacterial infections.35, 36, 37, 38, 39, 40 As HFD causes many and varied biological perturbations including obesity, contributions from other physiological changes associated with diet are potentially involved. Future work identifying the antigen targets of the B-cell lineages induced by HFD, and the biological effects of secreted antibodies from those cells, or alterations of the interactions of B cells with other immune cell types, should further extend these findings.

Methods

Mice. Wild-type and HFD-fed male C57BL/6 mice were purchased from the Jackson Laboratory (Sacramento, CA) at 24 weeks of age. Animals were rested in our facility for 3 days and killed at 24 weeks of age. All mice were maintained in a specific pathogen-free, temperature-controlled environment on a 12 h light and dark cycle. Mice at the Jackson Laboratory were fed a diet of mouse chow for the first 6 weeks of life, then either continued with the regular mouse chow diet (RD; 15 kcal% fat; LabDiet, St. Louis, MO) or were switched to a HFD (60 kcal% fat; Research Diets, New Brunswick, NJ, for 18 weeks), and were maintained on Teklad Sani-Chips bedding (Harlan, Indianapolis, IN). The Stanford University institutional animal care and use committee approved all protocols.

Tissue isolation and processing. Peripheral blood was collected via retro–orbital bleed into a potassium EDTA Vacutainer tube, transferred to a microcentrifuge tube, and snap frozen in liquid nitrogen. Peritoneal cavity cells were harvested by injecting 7 ml of phosphate-buffered saline with 2 mM EDTA and 2% fetal calf serum into the peritoneal cavity. Spleen, VAT, duodenum, ileum, jejunum, colon, liver, and brown adipose tissue were harvested whole and snap frozen. Samples were stored at −80 °C.

RNA isolation. Tissue (75 mg) was homogenized in 750 μl Trizol (Invitrogen, Carlsbad, CA) with stainless steel beads (Qiagen, Hilden, Germany) in a microtube homogenizer (Beadbug, Benchmark Scientific, Edison, NJ) for 30 s at 4 °C. RNA was isolated as previously described.7

IGH amplification and sequencing. IGH DNA libraries for sequencing were generated from complementary DNA prepared from total RNA isolated from mouse tissues. Complementary DNA was generated from 600 ng of total RNA from each sample, using random hexamer primers and Superscript II (Invitrogen) reverse transcriptase. For PCR amplifications, multiplexed primers targeting the mouse framework region 1 (FR1) of the IGHV were used in combination with each isotype-specific primer. IgG isotype-specific primers were adapted from previously published primers.41 Isotype primers targeted constant region exon 1 of IgA, IgG, or IgM. Amplification of each isotype was performed in a separate PCR reaction to avoid formation of inter-isotype chimeric PCR products. Each sample was barcoded with 8-mer primer-encoded nucleotide sequences on both the 5′ and 3′ ends of the amplicons. A list of primers used is shown (Supplementary Table S1). PCR amplification was performed in two steps. In the first step, amplicons were generated using primers with the partial Illumina (San Diego, CA) adapter, barcode and locus-specific sequence. A final 30 μl volume was used in the first PCR amplification. A second PCR was then performed to complete the Illumina adapter sequence and to ensure that final PCR product was not amplified to saturation; we used 1 μl from the first PCR product with a final 20 μl volume. The conditions for the first step PCR were 35 cycles at: melt 94 °C for 30 s, anneal at 56 °C for 45 s, and extend at 72 °C for 90 s. Conditions of the second PCR step were 12 cycles at: melt 94 °C for 30 s, anneal at 60 °C for 45 s, and extend at 72 °C for 90 s. Amplified product size was verified by electrophoresis with a 1.5% agarose gel. Products were then pooled, purified by agarose gel electrophoresis and extraction, and sequenced with an Illumina MiSeq instrument.

Analysis of IGH sequence data. Paired-end sequences were merged, demultiplexed based on exact matching to primer sequences and primer-encoded barcode sequences to determine sample assignment, and primer-trimmed. Next, the variable (IGHV), diversity (IGHD), and joining (IGHJ) gene segment usage in each rearrangement, as well as mutations from germline, was identified using the IgBlast program.8 Isotype identity was established by exact match to non-primer-encoded sequence within exon 1 of the sequenced constant region. The number of reads and clones for each mouse, tissue type, and isotype are presented in Supplementary Tables S2 and S3. CDRH3 was determined as the sequence flanked on the 5′ end by the end of FR3 as identified by IgBlast, and on the 3′ by FR4 boundary determined by position weight matrix search trained to identify the WGQG motif and sequence variants. To avoid overinterpretation of sequencing errors as somatic mutations in IGH gene rearrangements, a threshold of 1% mutation in the IGHV gene segment was used to distinguish mutated sequences from unmutated sequences. Exactly matching IGH nucleotide sequences from the same sample were collapsed into a single read for further analysis.