Increasingly powerful sequencing technologies are ushering in an era of personal genome sequences and raising the possibility of using such information to guide medical decisions. Genome resequencing also promises to accelerate the identification of disease-associated mutations. Roughly 98% of the human genome is composed of repeats and intergenic or non–protein-coding sequences. Thus, it is crucial to focus resequencing on high-value genomic regions. Protein-coding exons represent one such type of high-value target. We have developed a method of using flexible, high-density microarrays to capture any desired fraction of the human genome, in this case corresponding to more than 200,000 protein-coding exons. Depending on the precise protocol, up to 55–85% of the captured fragments are associated with targeted regions and up to 98% of intended exons can be recovered. This methodology provides an adaptable route toward rapid and efficient resequencing of any sizeable, non-repeat portion of the human genome.
This is a preview of subscription content, access via your institution
Open Access articles citing this article.
Scientific Reports Open Access 22 June 2021
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Rent or buy this article
Prices vary by article type
Prices may be subject to local taxes which are calculated during checkout
Topol, E.J. & Frazer, K.A. The resequencing imperative. Nat. Genet. 39, 439–440 (2007).
Greenman, C. et al. Patterns of somatic mutation in human cancer genomes. Nature 446, 153–158 (2007).
Futreal, P.A., Wooster, R. & Stratton, M.R. Somatic mutations in human cancer: insights from resequencing the protein kinase gene family. Cold Spring Harb. Symp. Quant. Biol. 70, 43–49 (2005).
Margulies, M. et al. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437, 376–380 (2005).
Bentley, D.R. Whole-genome re-sequencing. Curr. Opin. Genet. Dev. 16, 545–552 (2006).
Sebat, J. et al. Large-scale copy number polymorphism in the human genome. Science 305, 525–528 (2004).
Gunderson, K.L., Steemers, F.J., Lee, G., Mendoza, L.G. & Chee, M.S. A genome-wide scalable SNP genotyping assay using microarray technology. Nat. Genet. 37, 549–554 (2005).
Ren, B. et al. Genome-wide location and function of DNA binding proteins. Science 290, 2306–2309 (2000).
Barski, A. et al. High-resolution profiling of histone methylations in the human genome. Cell 129, 823–837 (2007).
Johnson, D.S., Mortazavi, A., Myers, R.M. & Wold, B. Genome-wide mapping of in vivo protein-DNA interactions. Science 316, 1497–1502 (2007).
Robertson, G. et al. Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nat. Methods 4, 651–657 (2007).
Sjoblom, T. et al. The consensus coding sequences of human breast and colorectal cancers. Science 314, 268–274 (2006).
Cleary, M.A. et al. Production of complex nucleic acid libraries using highly parallel in situ oligonucleotide synthesis. Nat. Methods 1, 241–248 (2004).
Pruitt, K.D., Tatusova, T. & Maglott, D.R. NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 35, D61–D65 (2007).
International Human Genome Sequencing Consortium. Finishing the euchromatic sequence of the human genome. Nature 431, 931–945 (2004).
International HapMap Consortium. A haplotype map of the human genome. Nature 437, 1299–1320 (2005).
Morgulis, A., Gertz, E.M., Schaffer, A.A. & Agarwala, R. WindowMasker: window-based masker for sequenced genomes. Bioinformatics 22, 134–141 (2006).
The authors thank M.Q. Zhang and A. Smith for their help in the read mapping and analysis, J. Silva for providing the MCF10A cell line DNA, and M. Rooks, S. McCarthy and members of the McCombie and Hannon laboratories for helpful discussion. G.J.H. is an Investigator of the Howard Hughes Medical Institute and is supported in part by a kind gift from Kathryn W. Davis and major support from the Stanley Foundation. Purchase of instrumentation and this work were supported in part by grants from the US National Science Foundation and National Institutes of Health (M.Q. Zhang, G.J.H. and W.R.M.).
T.J.A., M.N.M., S.W.S., C.M.M. and M.J.R. are employees of Nimblegen, Inc.
About this article
Cite this article
Hodges, E., Xuan, Z., Balija, V. et al. Genome-wide in situ exon capture for selective resequencing. Nat Genet 39, 1522–1527 (2007). https://doi.org/10.1038/ng.2007.42
This article is cited by
Cereal Research Communications (2022)
Beyond monogenetic rare variants: tackling the low rate of genetic diagnoses in predominantly antibody deficiency
Cellular & Molecular Immunology (2021)
Scientific Reports (2021)
Genetic etiologic analysis in 74 Chinese Han women with idiopathic premature ovarian insufficiency by combined molecular genetic testing
Journal of Assisted Reproduction and Genetics (2021)