Abstract
Precisely characterizing the breakpoints of copy number variants (CNVs) is crucial for assessing their functional impact. However, fewer than 10% of known germline CNVs have been mapped to the single-nucleotide level. We characterized the sequence breakpoints from a dataset of all CNVs detected in three unrelated individuals in previous array-based CNV discovery experiments. We used targeted hybridization-based DNA capture and 454 sequencing to sequence 324 CNV breakpoints, including 315 deletions. We observed two major breakpoint signatures: 70% of the deletion breakpoints have 1−30 bp of microhomology, whereas 33% of deletion breakpoints contain 1−367 bp of inserted sequence. The co-occurrence of microhomology and inserted sequence is low (10%), suggesting that there are at least two different mutational mechanisms. Approximately 5% of the breakpoints represent more complex rearrangements, including local microinversions, suggesting a replication-based strand switching mechanism. Despite a rich literature on DNA repair processes, reconstruction of the molecular events generating each of these mutations is not yet possible.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Mills, R.E. et al. An initial map of insertion and deletion (INDEL) variation in the human genome. Genome Res. 16, 1182–1190 (2006).
Levy, S. et al. The diploid genome sequence of an individual human. PLoS Biol. 5, e254 (2007).
Kim, P.M. et al. Analysis of copy number variants and segmental duplications in the human genome: evidence for a change in the process of formation in recent evolutionary history. Genome Res. 18, 1865–1874 (2008).
Wyman, C. & Kanaar, R. DNA double-strand break repair: all's well that ends well. Annu. Rev. Genet. 40, 363–383 (2006).
Hastings, P.J., Lupski, J.R., Rosenberg, S.M. & Ira, G. Mechanisms of change in gene copy number. Nat. Rev. Genet. 10, 551–564 (2009).
Iliakis, G. et al. Mechanisms of DNA double strand break repair and chromosome aberration formation. Cytogenet. Genome Res. 104, 14–20 (2004).
Lieber, M.R. The mechanism of human nonhomologous DNA end joining. J. Biol. Chem. 283, 1–5 (2008).
Inoue, K. & Lupski, J.R. Molecular mechanisms for genomic disorders. Annu. Rev. Genomics Hum. Genet. 3, 199–242 (2002).
Bennardo, N., Cheng, A., Huang, N. & Stark, J.M. Alternative-NHEJ is a mechanistically distinct pathway of mammalian chromosome break repair. PLoS Genet. 4, e1000110 (2008).
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
Hastings, P.J., Ira, G. & Lupski, J.R. A microhomology-mediated break-induced replication model for the origin of human copy number variation. PLoS Genet. 5, e1000327 (2009).
Okou, D.T. et al. Microarray-based genomic selection for high-throughput resequencing. Nat. Methods 4, 907–909 (2007).
Albert, T.J. et al. Direct selection of human genomic loci by microarray hybridization. Nat. Methods 4, 903–905 (2007).
Conrad, D. et al. Origins and functional impact of copy number variation in the human genome. Nature advance online publication, doi:10.1038/nature08516 (7 October 2009).
Korbel, J.O. et al. Paired-end mapping reveals extensive structural variation in the human genome. Science 318, 420–426 (2007).
Redon, R. et al. Global variation in copy number in the human genome. Nature 444, 444–454 (2006).
Tuzun, E. et al. Fine-scale structural variation of the human genome. Nat. Genet. 37, 727–732 (2005).
Pique-Regi, R. et al. Sparse representation and Bayesian detection of genome copy number alterations from microarray data. Bioinformatics 24, 309–318 (2008).
Wong, Z., Wilson, V., Patel, I., Povey, S. & Jeffreys, A.J. Characterization of a panel of highly variable minisatellites cloned from human DNA. Ann. Hum. Genet. 51, 269–288 (1987).
Lee, J.A., Carvalho, C.M. & Lupski, J.R.A. DNA replication mechanism for generating nonrecurrent rearrangements associated with genomic disorders. Cell 131, 1235–1247 (2007).
Carvalho, C.M. et al. Complex rearrangements in patients with duplications of MECP2 can occur by fork stalling and template switching. Hum. Mol. Genet. 18, 2188–2203 (2009).
Zhang, F. et al. The DNA replication FoSTeS/MMBIR mechanism can generate genomic, genic and exonic complex rearrangements in humans. Nat. Genet. 41, 849–853 (2009).
Jobling, M.A. et al. A selective difference between human Y-chromosomal DNA haplotypes. Curr. Biol. 8, 1391–1394 (1998).
Sharp, A.J. Emerging themes and new challenges in defining the role of structural variation in human disease. Hum. Mutat. 30, 135–144 (2009).
Tian, D. et al. Single-nucleotide mutation rate increases close to insertions/deletions in eukaryotes. Nature 455, 105–108 (2008).
Kidd, J.M. et al. Mapping and sequencing of structural variation from eight human genomes. Nature 453, 56–64 (2008).
Campbell, P.J. et al. Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing. Nat. Genet. 40, 722–729 (2008).
Bois, P. & Jeffreys, A.J. Minisatellite instability and germline mutation. Cell. Mol. Life Sci. 55, 1636–1648 (1999).
Lindsay, S.J., Khajavi, M., Lupski, J.R. & Hurles, M.E. A chromosomal rearrangement hotspot can be identified from population genetic variation and is coincident with a hotspot for allelic recombination. Am. J. Hum. Genet. 79, 890–902 (2006).
Myers, S., Freeman, C., Auton, A., Donnelly, P. & McVean, G. A common sequence motif associated with recombination hot spots and genome instability in humans. Nat. Genet. 40, 1124–1129 (2008).
Bai, J. & Perron, P. Computation and analysis of multiple structural change models. J. Appl. Econom. 18, 1–22 (2003).
Zeileis, A., Kleiber, C., Kramer, W. & Hornik, K. Testing and dating of structural changes in practice. Comput. Stat. Data Anal. 44, 109–123 (2003).
Huber, W., Toedling, J. & Steinmetz, L.M. Transcript mapping with high-density oligonucleotide tiling arrays. Bioinformatics 22, 1963–1970 (2006).
Ning, Z., Cox, A.J. & Mullikin, J.C. SSAHA: a fast search method for large DNA databases. Genome Res. 11, 1725–1729 (2001).
Kent, W.J. BLAT–the BLAST-like alignment tool. Genome Res. 12, 656–664 (2002).
Staden, R., Beal, K.F. & Bonfield, J.K. The Staden package, 1998. Methods Mol. Biol. 132, 115–130 (2000).
Wheeler, D.A. et al. The complete genome of an individual by massively parallel DNA sequencing. Nature 452, 872–876 (2008).
Acknowledgements
This research was funded by Wellcome Trust grant 077014/Z/05/Z. We thank the Wellcome Trust Sanger Institute for informatics and sequencing support, H. Lam for sharing sequenced CNV breakpoint locations and colleagues at NimbleGen for support and discussions.
Author information
Authors and Affiliations
Contributions
D.F.C. and M.E.H. designed the study. L.M. and D.J.T. performed pulldown and sequencing experiments. D.F.C., B.B., C.B., S.L. and M.E.H. analyzed the data. C.B. performed validation experiments. D.F.C. and M.E.H. wrote the paper with contributions from C.B., B.B. and C.L.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–5 and Supplementary Note (PDF 203 kb)
Supplementary Table 1
CNV breakpoint locations (XLS 110 kb)
Supplementary Table 2
PCR validation primers (XLS 42 kb)
Rights and permissions
About this article
Cite this article
Conrad, D., Bird, C., Blackburne, B. et al. Mutation spectrum revealed by breakpoint sequencing of human germline CNVs. Nat Genet 42, 385–391 (2010). https://doi.org/10.1038/ng.564
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/ng.564
This article is cited by
-
Evaluation of genetic risk of apparently balanced chromosomal rearrangement carriers by breakpoint characterization
Journal of Assisted Reproduction and Genetics (2024)
-
Prevalence and mechanisms of somatic deletions in single human neurons during normal aging and in DNA repair disorders
Nature Communications (2022)
-
Targeting the RHOA pathway improves learning and memory in adult Kctd13 and 16p11.2 deletion mouse models
Molecular Autism (2021)
-
Rare genetic variants in the gene encoding histone lysine demethylase 4C (KDM4C) and their contributions to susceptibility to schizophrenia and autism spectrum disorder
Translational Psychiatry (2020)
-
Genomic profiling of Acute lymphoblastic leukemia in ataxia telangiectasia patients reveals tight link between ATM mutations and chromothripsis
Leukemia (2017)