Abstract
B-cell responses result in clonal expansion, and can occur in a variety of tissues. To define how B-cell clones are distributed in the body, we sequenced 933,427 B-cell clonal lineages and mapped them to eight different anatomic compartments in six human organ donors. We show that large B-cell clones partition into two broad networks—one spans the blood, bone marrow, spleen and lung, while the other is restricted to tissues within the gastrointestinal (GI) tract (jejunum, ileum and colon). Notably, GI tract clones display extensive sharing of sequence variants among different portions of the tract and have higher frequencies of somatic hypermutation, suggesting extensive and serial rounds of clonal expansion and selection. Our findings provide an anatomic atlas of B-cell clonal lineages, their properties and tissue connections. This resource serves as a foundation for studies of tissue-based immunity, including vaccine responses, infections, autoimmunity and cancer.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 /Â 30Â days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Rent or buy this article
Prices vary by article type
from$1.95
to$39.95
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
Accession codes
References
McKean, D. et al. Generation of antibody diversity in the immune response of BALB/c mice to influenza virus hemagglutinin. Proc. Natl. Acad. Sci. USA 81, 3180–3184 (1984).
Berek, C., Berger, A. & Apel, M. Maturation of the immune response in germinal centers. Cell 67, 1121–1129 (1991).
Weigert, M.G., Cesari, I.M., Yonkovich, S.J. & Cohn, M. Variability in the lambda light chain sequences of mouse antibody. Nature 228, 1045–1047 (1970).
Jacob, J., Kelsoe, G., Rajewsky, K. & Weiss, U. Intraclonal generation of antibody mutants in germinal centres. Nature 354, 389–392 (1991).
Nossal, G.J. & Lederberg, J. Antibody production by single cells. Nature 181, 1419–1420 (1958).
Schroeder, H.W. Jr. & Cavacini, L. Structure and function of immunoglobulins. J. Allergy Clin. Immunol. 125 (Suppl. 2), S41–S52 (2010).
Kroese, F.G., de Waard, R. & Bos, N.A. B-1 cells and their reactivity with the murine intestinal microflora. Semin. Immunol. 8, 11–18 (1996).
Lindner, C. et al. Diversification of memory B cells drives the continuous adaptation of secretory antibodies to gut microbiota. Nat. Immunol. 16, 880–888 (2015).
Rhee, K.J., Sethupathi, P., Driks, A., Lanning, D.K. & Knight, K.L. Role of commensal bacteria in development of gut-associated lymphoid tissues and preimmune antibody repertoire. J. Immunol. 172, 1118–1124 (2004).
Benitez, A. et al. Differences in mouse and human nonmemory B cell pools. J. Immunol. 192, 4610–4619 (2014).
Steiniger, B.S. Human spleen microanatomy: why mice do not suffice. Immunology 145, 334–346 (2015).
Thome, J.J. et al. Spatial map of human T cell compartmentalization and maintenance over decades of life. Cell 159, 814–828 (2014).
Sathaliyawala, T. et al. Distribution and compartmentalization of human circulating and tissue-resident memory T cell subsets. Immunity 38, 187–197 (2013).
Thome, J.J. et al. Early-life compartmentalization of human T cell differentiation and regulatory function in mucosal and lymphoid tissues. Nat. Med. 22, 72–77 (2016).
Vallangeon, B.D., Tyer, C., Williams, B. & Lagoo, A.S. Improved detection of diffuse large B-cell lymphoma by flow cytometric immunophenotyping-Effect of tissue disaggregation method. Cytometry B Clin. Cytom. 90, 455–461 (2016).
Sakano, H., Kurosawa, Y., Weigert, M. & Tonegawa, S. Identification and nucleotide sequence of a diversity DNA segment (D) of immunoglobulin heavy-chain genes. Nature 290, 562–565 (1981).
Glanville, J. et al. Precise determination of the diversity of a combinatorial antibody library gives insight into the human immunoglobulin repertoire. Proc. Natl. Acad. Sci. USA 106, 20216–20221 (2009).
Zhang, B., Meng, W., Luning Prak, E.T. & Hershberg, U. Discrimination of germline V genes at different sequencing lengths and mutational burdens: A new tool for identifying and evaluating the reliability of V gene assignment. J. Immunol. Methods 427, 105–116 (2015).
Hershberg, U. & Luning Prak, E.T. The analysis of clonal expansions in normal and autoimmune B cell repertoires. Phil. Trans. R. Soc. Lond. B 370, 20140239 (2015).
Yaari, G. & Kleinstein, S.H. Practical guidelines for B-cell receptor repertoire sequencing analysis. Genome Med. 7, 121 (2015).
Colwell, R.K. et al. Models and estimators linking individual-based and sample based rarefaction, extrapolation and comparison of assemblages. J. Plant Ecol. 5, 3–21 (2012).
Goldin, L.R., McMaster, M.L. & Caporaso, N.E. Precursors to lymphoproliferative malignancies. Cancer Epidemiol. Biomarkers Prev. 22, 533–539 (2013).
Rosenfeld, A.M., Meng, W., Luning Prak, E.T. & Hershberg, U. ImmuneDB: a system for the analysis and exploration of high-throughput adaptive immune receptor sequencing data. Bioinformatics 33, 292–293 (2017).
Sheneman, L., Evans, J. & Foster, J.A. Clearcut: a fast implementation of relaxed neighbor joining. Bioinformatics 22, 2823–2824 (2006).
Schwartz, G.W., Shokoufandeh, A., Ontanon, S. & Hershberg, U. Using a novel clumpiness measure to unite data with metadata: Finding common sequence patterns in immune receptor germline V genes. Pattern Recognit. Lett. 74, 24–29 (2016).
Yuvaraj, S. et al. Evidence for local expansion of IgA plasma cell precursors in human ileum. J. Immunol. 183, 4871–4878 (2009).
Gowans, J.L. & Knight, E.J. The route of re-circulation of lymphocytes in the rat. Proc. R. Soc. Lond. B Biol. Sci. 159, 257–282 (1964).
McDermott, M.R. & Bienenstock, J. Evidence for a common mucosal immunologic system. I. Migration of B immunoblasts into intestinal, respiratory, and genital tissues. J. Immunol. 122, 1892–1898 (1979).
Rudikoff, S., Pawlita, M., Pumphrey, J. & Heller, M. Somatic diversification of immunoglobulins. Proc. Natl. Acad. Sci. USA 81, 2162–2166 (1984).
Sablitzky, F., Wildner, G. & Rajewsky, K. Somatic mutation and clonal expansion of B cells in an antigen-driven immune response. EMBO J. 4, 345–350 (1985).
Briney, B.S., Willis, J.R., Finn, J.A., McKinney, B.A. & Crowe, J.E. Jr. Tissue-specific expressed antibody variable gene repertoires. PLoS One 9, e100839 (2014).
Spencer, J., Barone, F. & Dunn-Walters, D. Generation of immunoglobulin diversity in human gut-associated lymphoid tissue. Semin. Immunol. 21, 139–146 (2009).
Dunn-Walters, D.K., Isaacson, P.G. & Spencer, J. Sequence analysis of human IgVH genes indicates that ileal lamina propria plasma cells are derived from Peyer's patches. Eur. J. Immunol. 27, 463–467 (1997).
Uduman, M. et al. Detecting selection in immunoglobulin sequences. Nucleic Acids Res. 39, W499–W504 (2011).
Hershberg, U., Uduman, M., Shlomchik, M.J. & Kleinstein, S.H. Improved methods for detecting selection by mutation analysis of Ig V region sequences. Int. Immunol. 20, 683–694 (2008).
Holtmeier, W., Hennemann, A. & Caspary, W.F. IgA and IgM V(H) repertoires in human colon: evidence for clonally expanded B cells that are widely disseminated. Gastroenterology 119, 1253–1266 (2000).
Masahata, K. et al. Generation of colonic IgA-secreting cells in the caecal patch. Nat. Commun. 5, 3704 (2014).
Berkowska, M.A. et al. Circulating human CD27-IgA+ memory B cells recognize bacteria with polyreactive Igs. J. Immunol. 195, 1417–1426 (2015).
Khan, T.A. et al. Accurate and predictive antibody repertoire profiling by molecular amplification fingerprinting. Sci. Adv. 2, e1501371 (2016).
Gupta, N.T. et al. Hierarchical clustering can identify B cell clones with high confidence in ig repertoire sequencing data. J. Immunol. 198, 2489–2499 (2017).
Ralph, D.K. & Matsen, F.A. IV Likelihood-based inference of B cell clonal families. PLoS Comput. Biol. 12, e1005086 (2016).
van Dongen, J.J. et al. Design and standardization of PCR primers and protocols for detection of clonal immunoglobulin and T-cell receptor gene recombinations in suspect lymphoproliferations: report of the BIOMED-2 Concerted Action BMH4-CT98-3936. Leukemia 17, 2257–2317 (2003).
Meng, W. et al. Trials and tribulations with VH replacement. Front. Immunol. 5, 10 (2014).
Vander Heiden, J.A. et al. pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires. Bioinformatics 30, 1930–1932 (2014).
Schirmer, M. et al. Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform. Nucleic Acids Res. 43, e37 (2015).
Schwartz, G.W. & Hershberg, U. Germline amino acid diversity in B cell receptors is a good predictor of somatic selection pressures. Front. Immunol. 4, 357 (2013).
Schwartz, G.W. & Hershberg, U. Conserved variation: identifying patterns of stability and variability in BCR and TCR V genes with different diversity and richness metrics. Phys. Biol. 10, 035005 (2013).
Bolotin, D.A. et al. MiXCR: software for comprehensive adaptive immunity profiling. Nat. Methods 12, 380–381 (2015).
Shugay, M. et al. VDJtools: unifying post-analysis of T cell receptor repertoires. PLoS Comput. Biol. 11, e1004503 (2015).
Acknowledgements
We thank the Human Immunology Core facility at the University of Pennsylvania for assistance with cell preparations and sequencing support. We thank Y. Louzoun, M. Cancro and R. Thomas Jr. for discussions. We thank the organ donor families and the LiveOnNY transplant staff and coordinators. This work was supported by NIH P01 AI106697 (D.L.F., U.H., M.J.S., W.M., B.Z., A.M.R., D.R., D.J.C., N.M., T.G. and E.T.L.P.), P30-CA016520 (E.T.L.P.) and F31AG047003 (J.J.C.T.). G.W.S. was funded by the US Department of Education Graduate Assistance in Areas of National Need (GAANN) program, CFDA Number: 84.200.
Author information
Authors and Affiliations
Contributions
W.M. designed and performed experiments, developed methods, analyzed data, prepared figures and helped write the manuscript. B.Z., G.W.S. and A.M.R. developed methods for data analysis and visualization, analyzed data, prepared figures and helped revise the manuscript. D.R. performed experiments and helped revise the manuscript. J.J.C.T., D.J.C., N.M., H.L., A.L.F. and T.G. were involved in donor recruitment, organ recovery and helped revise the manuscript. D.L.F. directs the organ donor tissue resource for acquisition of tissues and helped write the manuscript. M.J.S. designed experiments, contributed ideas and helped write the manuscript. E.T.L.P. and U.H. planned the study. U.H. developed methods for data analysis and visualization, designed the data analysis and helped write the manuscript. E.T.L.P. designed experiments, contributed ideas to the data analysis, oversaw the overall study and wrote the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Integrated supplementary information
Supplementary Figure 1 Rarefaction Analysis and Sampling.
a, Rarefaction plots show the expected levels of coverage for different subsamples of sequencing libraries for repertoires of clone size of at least 2 (darkest blue line), 10 (second darkest blue), 20 (red, the cut-off chosen for the overlap analysis), 30 (light blue), 40 (lighter blue) or 50 and greater (light grey) in the repertoire of D207. Separate plots are shown for each tissue. b, Sampling needed for clones of different sizes and coverage thresholds in D207. Bars indicate the number of additional libraries needed to sample 0.5,0.75,0.9 and 0.95 (fractions) of all clones at different clone size cut-offs. Colors of the bars indicate the tissue. PBL = peripheral blood; BM = bone marrow; SPL = spleen; MLN = mesenteric lymph node.
Supplementary Figure 2 Cosine Similarity Analysis.
a, Box plots of distribution of cosine similarity between all sample pairs within a tissue. Similarity is assessed for C20 clones, C20 clones without the top 10 clones in each tissue, all clones, or all clones without the top 10 copy number clones in each tissue. Boxes represent the first and third quartiles bisected by the median. Whiskers represent the most extreme data excluding outliers, where outliers (dots) are data beyond the third or first quartile by a distance exceeding 1.5 times the inter-quartile interval. The C20 clone panel for D207 corresponds to Fig. 1b in the main manuscript. b, The cosine similarity of clonal populations between tissue pairs in D181 and D207 with and without blood clones. Each section represents a tissue and each arrow within each section represents the level of overlap (cosine similarity) between that tissue and clones from other tissues. Longer arrows indicate higher value of cosine similarity indicating more overlap of clonal populations between the tissues. Top: all C20 clones (same as Fig. 1c). Bottom: C20 overlap without blood clones; clones that have sequences in the blood were computationally removed from all tissues.
Supplementary Figure 3 Analysis of Clonal Networks in D207 Using Different Clone Size Definitions.
Clonal overlap was evaluated using cosine similarities between different tissue pairs, as described in the Online Methods. Each wedge represents a tissue and each arrow represents the cosine similarity between the tissues (arrow colors are indicated in the legend, on the left). Three different definitions of clone size are used for the analysis: a, the copy number of all sequences in a clone in a given tissue; b, the total number of unique sequence instances in a clone in a given tissue; and c, the definition that we used in the manuscript (same image as Fig. 1c) - the number of independent sequencing library samples in which the clone appears, in a tissue.
Supplementary Figure 4 Clone Sizes Grouped by Clone Category.
Sizes of large clones (in binned clone sizes of sequence instances, see Online Methods), grouped by tissue distribution category in D181 (left) and D207 (right).
Supplementary Figure 5 Large Clone Tissue Distribution Types in D207.
C20 clones (including clones in the blood): a, Single tissue clones; b, Two-tissue clones; c, Regional (found in 3-5 tissues) and d, Global clones (found in 6-8 tissues). Tissues that contain only a single instance of a clone are not counted towards the distribution pattern of that clone. Each line is a clone. The position along the line represents a tissue. The size of the circles at each position represents the total number of instances the clones have in each tissue. The colored wedges represent the fraction of sequencing libraries that contain at least one instance of the clone in each tissue. A sample clone is shown for each tissue distribution pattern. The number to the left of each line is that clone’s unique clone ID in http://immunedb.com/tissue-atlas. The number to the right is the number of different clones with this tissue distribution pattern. p = peripheral blood; b = bone marrow; s = spleen; l = lung; m = mesenteric lymph node; j = jejunum; i= ileum; c = colon.
Supplementary Figure 6 Large Clone Tissue Distribution Types in D181.
C20 clones (including clones in the blood): a, Single tissue clones; b, Two-tissue clones; c, Regional (found in 3-5 tissues) and d, Global clones (found in 6-8 tissues). Tissues that contain only a single instance of a clone are not counted towards the distribution pattern of that clone. Each line is a clone. The position along the line represents a tissue. The size of the circles at each position represents the total number of instances the clones have in each tissue. The colored wedges represent the fraction of sequencing libraries that contain at least one instance of the clone in each tissue. A sample clone is shown for each tissue distribution pattern. The number to the left of each line is that clone’s unique clone ID in http://immunedb.com/tissue-atlas. The number to the right is the number of different clones with this tissue distribution pattern. p = peripheral blood; b = bone marrow; s = spleen; l = lung; m = mesenteric lymph node; j = jejunum; i= ileum; c = colon.
Supplementary Figure 7 Effects of Clone Size Thresholding on Clone Tissue Membership.
a, Similar blood and GI tract networks are seen at different clone size cut-offs. Overlapping clones are plotted at different clone size cut-offs (greater than or equal to 0.01%, 0.005% and 0.001% of total copies of all of the sequencing libraries within each donor). Each horizontal line represents a clone. Networks of overlapping clones were visualized using MiXCR and VDJtools software (see Online Methods). Only clones from D207 that overlapped between two or more tissues are shown. PBL = peripheral blood; BM = bone marrow; SPL = spleen; Lun = lung; MLN = mesenteric lymph node; Jej = jejunum; Ile = ileum; Col = colon. b, Breadth of tissue membership using different CDR3 identity thresholds. The tissue distribution of C20 clones in the two most deeply sequenced donors was studied using: 85% amino acid identity in CDR3 (the cut-off used in the main manuscript), 90% amino acid identity and no requirement for any CDR3 identity (“bucket”). All of the sequences were collapsed at these different CDR3 similarity thresholds into clones. Then C20 clones were analyzed for their tissue representation: single tissue, double (2 tissues), regional (3-5 tissues) and global (6-8 tissues). The C20 clones were recalculated for each threshold. c, Comparable partitioning of clones from D207 into blood and GI tract networks at two different CDR3 amino acid similarity thresholds. Networks of C20 clones with 85% CDR3 identity (left image is from Fig. 1c in the main manuscript) and 90% CDR3 amino acid identity (right panel) were analyzed using the cosine statistic (see Online Methods). Cosine similarity for two-tissue comparisons is plotted using the color scheme defined in the legend.
Supplementary Figure 8 Blood and GI Tract Networks are Present in All Donors.
Clonal overlap was evaluated in all donors with clones that comprised 0.01% or more of total copies. Each horizontal line represents a clone. Networks of overlapping clones were visualized using MiXCR and VDJtools software (see Online Methods). Only clones that overlapped between two or more tissues are shown. The MLN sample in D168 was hypocellular. PBL = peripheral blood; BM = bone marrow; SPL = spleen; Lun = lung; MLN = mesenteric lymph node; Jej = jejunum; Ile = ileum; Col = colon.
Supplementary Figure 9 Clonal Lineage Networks in D207 with and without Blood Clones.
Each horizontal line represents a clone with at least 20 unique sequence instances in one tissue that overlaps with at least one other tissue, each column represents a tissue. All sub plots are identical, but are colored differently to highlight clones that reside in different tissues. In each instance, the clones that overlap in a specific tissue are colored as indicated by the tissue listed at the top of each panel. All other clones are denoted in black. a, Blood tissue clones highlighted; b, GI tract clones highlighted. Top row: all C20 clones. Bottom row: blood C20 clones computationally excluded (any clone with sequences found in the blood is removed from the analysis). Clones are ordered by their tissue membership.
Supplementary Figure 10 Clonal Lineage Networks in D181 with and without Blood Clones.
Each horizontal line represents a clone with at least 20 unique sequence instances in one tissue that overlaps with at least one other tissue, each column represents a tissue. All sub plots are identical, but are colored differently to highlight clones that reside in different tissues. In each instance, the clones that overlap in a specific tissue are colored as indicated by the tissue listed at the top of each panel. All other clones are denoted in black. a, Blood tissue clones highlighted; b, GI tract clones highlighted. Top row: all C20 clones. Bottom row: blood C20 clones computationally excluded (any clone with sequences found in the blood is removed from the analysis). Clones are ordered by their tissue membership.
Supplementary Figure 11 Sequence Mixing within Clonal Lineages.
a (inset), Numbers of clones used for the clumpiness analysis. Shown are the numbers of clones with nodes in each pair of tissues. This analysis is restricted to C20 clones in D207; these numbers correspond to the clones analyzed in Fig. 3b. b, Sequence mixing within clonal lineages varies between different tissue types. The frequency distributions of clumpiness values between two tissues for all clones with nodes in each tissue pair (combinations are listed at the top of each plot). This analysis was performed on C20 clones in D207.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–11 and Supplementary Tables 1 and 2 (PDF 1371 kb)
Rights and permissions
About this article
Cite this article
Meng, W., Zhang, B., Schwartz, G. et al. An atlas of B-cell clonal distribution in the human body. Nat Biotechnol 35, 879–884 (2017). https://doi.org/10.1038/nbt.3942
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nbt.3942
This article is cited by
-
The rise of big data: deep sequencing-driven computational methods are transforming the landscape of synthetic antibody design
Journal of Biomedical Science (2024)
-
Adaptive immune receptor repertoire analysis
Nature Reviews Methods Primers (2024)
-
Revolutionizing Synthetic Antibody Design: Harnessing Artificial Intelligence and Deep Sequencing Big Data for Unprecedented Advances
Molecular Biotechnology (2024)
-
Current advancements in B-cell receptor sequencing fast-track the development of synthetic antibodies
Molecular Biology Reports (2024)
-
Human intestinal B cells in inflammatory diseases
Nature Reviews Gastroenterology & Hepatology (2023)