Abstract
Targeting genomic loci by massively parallel sequencing requires new methods to enrich templates to be sequenced. We developed a capture method that uses biotinylated RNA 'baits' to fish targets out of a 'pond' of DNA fragments. The RNA is transcribed from PCR-amplified oligodeoxynucleotides originally synthesized on a microarray, generating sufficient bait for multiple captures at concentrations high enough to drive the hybridization. We tested this method with 170-mer baits that target >15,000 coding exons (2.5 Mb) and four regions (1.7 Mb total) using Illumina sequencing as read-out. About 90% of uniquely aligning bases fell on or near bait sequence; up to 50% lay on exons proper. The uniformity was such that ∼60% of target bases in the exonic 'catch', and ∼80% in the regional catch, had at least half the mean coverage. One lane of Illumina sequence was sufficient to call high-confidence genotypes for 89% of the targeted exon space.
This is a preview of subscription content, access via your institution
Relevant articles
Open Access articles citing this article.
-
Extensive pedigrees reveal the social organization of a Neolithic community
Nature Open Access 26 July 2023
-
Exploiting a targeted resistome sequencing approach in assessing antimicrobial resistance in retail foods
Environmental Microbiome Open Access 29 March 2023
-
Core genome sequencing and genotyping of Leptospira interrogans in clinical samples by target capture sequencing
BMC Infectious Diseases Open Access 14 March 2023
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Rent or buy this article
Prices vary by article type
from$1.95
to$39.95
Prices may be subject to local taxes which are calculated during checkout





References
Margulies, M. et al. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437, 376–380 (2005).
Shendure, J. et al. Accurate multiplex polony sequencing of an evolved bacterial genome. Science 309, 1728–1732 (2005).
Bentley, D.R. et al. Accurate whole genome sequencing using reversible terminator chemistry. Nature 456, 53–59 (2008).
Smith, D.R. et al. Rapid whole-genome mutational profiling using next-generation sequencing technologies. Genome Res. 18, 1638–1642 (2008).
Ley, T.J. et al. DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome. Nature 456, 66–72 (2008).
Wang, J. et al. The diploid genome sequence of an Asian individual. Nature 456, 60–66 (2008).
Wheeler, D.A. et al. The complete genome of an individual by massively parallel DNA sequencing. Nature 452, 872–876 (2008).
Dahl, F., Gullberg, M., Stenberg, J., Landegren, U. & Nilsson, M. Multiplex amplification enabled by selective circularization of large sets of genomic DNA fragments. Nucleic Acids Res. 33, e71 (2005).
Albert, T.J. et al. Direct selection of human genomic loci by microarray hybridization. Nat. Methods 4, 903–905 (2007).
Dahl, F. et al. Multigene amplification and massively parallel sequencing for cancer mutation discovery. Proc. Natl. Acad. Sci. USA 104, 9387–9392 (2007).
Fredriksson, S. et al. Multiplex amplification of all coding sequences within 10 cancer genes by Gene-Collector. Nucleic Acids Res. 35, e47 (2007).
Hodges, E. et al. Genome-wide in situ exon capture for selective resequencing. Nat. Genet. 39, 1522–1527 (2007).
Okou, D.T. et al. Microarray-based genomic selection for high-throughput resequencing. Nat. Methods 4, 907–909 (2007).
Porreca, G.J. et al. Multiplex amplification of large sets of human exons. Nat. Methods 4, 931–936 (2007).
Krishnakumar, S. et al. A comprehensive assay for targeted multiplex amplification of human DNA sequences. Proc. Natl. Acad. Sci. USA 105, 9296–9301 (2008).
Clamp, M. et al. Distinguishing protein-coding and noncoding genes in the human genome. Proc. Natl. Acad. Sci. USA 104, 19428–19433 (2007).
Nilsson, M. et al. Padlock probes: circularizing oligonucleotides for localized DNA detection. Science 265, 2085–2088 (1994).
Hardenbol, P. et al. Multiplexed genotyping with sequence-tagged molecular inversion probes. Nat. Biotechnol. 21, 673–678 (2003).
Dohm, J.C., Lottaz, C., Borodina, T. & Himmelbauer, H. Substantial biases in ultra-short read data sets from high-throughput DNA sequencing. Nucleic Acids Res. 36, e105 (2008).
Quail, M.A. et al. A large genome center's improvements to the Illumina sequencing system. Nat. Methods 5, 1005–1010 (2008).
Frazer, K.A. et al. A second generation human haplotype map of over 3.1 million SNPs. Nature 449, 851–861 (2007).
Lovett, M., Kere, J. & Hinton, L.M. Direct selection: a method for the isolation of cDNAs encoded by large genomic regions. Proc. Natl. Acad. Sci. USA 88, 9628–9632 (1991).
Parimoo, S., Patanjali, S.R., Shukla, H., Chaplin, D.D. & Weissman, S.M. cDNA selection: efficient PCR approach for the selection of cDNAs encoded in large chromosomal DNA fragments. Proc. Natl. Acad. Sci. USA 88, 9623–9627 (1991).
Bashiardes, S. et al. Direct genomic selection. Nat. Methods 2, 63–69 (2005).
Jaffe, D.B. et al. Whole-genome sequence assembly for mammalian genomes: Arachne 2. Genome Res. 13, 91–96 (2003).
Acknowledgements
We thank the staff of the Broad Institute Genome Sequencing Platform and Genetic Analysis Platform for generating sequencing and genotyping data. This work was supported by National Human Genome Research Institute grant HG03067-05 (to E.S.L.) and funds of the Broad Institute.
Author information
Authors and Affiliations
Contributions
A.M. and P.R. developed the wet lab protocol. J.M., W.B., T.F., C.R., S.G. and D.B.J. developed computational tools and analyzed data. E.M.L. synthesized the 200mer oligodeoxynucleotide pools. G.G. and S.F. prepared and sequenced fragment libraries. A.G., E.S.L and C.N. designed and directed the project and wrote the paper.
Corresponding author
Ethics declarations
Competing interests
E.M.L. is employed by Agilent technologies, Inc., and Agilent reagents are used in the research presented in this article. A.G., E.S.L. and C.N. are named as inventors on a patent application on selection of nucleic acids by solution hybridization to synthetic oligonucleotide baits.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–4 and Supplementary Tables 1 and 2 (PDF 467 kb)
Supplementary Data 1
List of 15,565 targeted exons (TXT 258 kb)
Supplementary Data 2
List of 22,000 synthetic oligodeoxynucleotides for exon capture (TXT 5629 kb)
Supplementary Data 3
List of 10,000 synthetic oligodeoxynucleotides for regional capture (TXT 2825 kb)
Rights and permissions
About this article
Cite this article
Gnirke, A., Melnikov, A., Maguire, J. et al. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat Biotechnol 27, 182–189 (2009). https://doi.org/10.1038/nbt.1523
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nbt.1523
This article is cited by
-
Core genome sequencing and genotyping of Leptospira interrogans in clinical samples by target capture sequencing
BMC Infectious Diseases (2023)
-
Exploiting a targeted resistome sequencing approach in assessing antimicrobial resistance in retail foods
Environmental Microbiome (2023)
-
A genetic locus complements resistance to Bordetella pertussis-induced histamine sensitization
Communications Biology (2023)
-
Pembrolizumab alone and pembrolizumab plus chemotherapy in previously treated, extrapulmonary poorly differentiated neuroendocrine carcinomas
British Journal of Cancer (2023)
-
Beyond assembly: the increasing flexibility of single-molecule sequencing technology
Nature Reviews Genetics (2023)