Rapid genome sequencing with short universal tiling probes

Pihlak, Arno; Baurén, Göran; Hersoug, Ellef; Lönnerberg, Peter; Metsis, Ats; Linnarsson, Sten

doi:10.1038/nbt1405

Article
Published: 25 May 2008

Rapid genome sequencing with short universal tiling probes

Arno Pihlak^1,2,
Göran Baurén²^nAff3,
Ellef Hersoug^1,2,
Peter Lönnerberg^1,2,
Ats Metsis^1,2 &
…
Sten Linnarsson^1,2

Nature Biotechnology volume 26, pages 676–684 (2008)Cite this article

1149 Accesses
44 Citations
13 Altmetric
Metrics details

Abstract

The increasing availability of high-quality reference genomic sequences has created a demand for ways to survey the sequence differences present in individual genomes. Here we describe a DNA sequencing method based on hybridization of a universal panel of tiling probes. Millions of shotgun fragments are amplified in situ and subjected to sequential hybridization with short fluorescent probes. Long fragments of 200 bp facilitate unique placement even in large genomes. The sequencing chemistry is simple, enzyme-free and consumes only dilute solutions of the probes, resulting in reduced sequencing cost and substantially increased speed. A prototype instrument based on commonly available equipment was used to resequence the Bacteriophage λ and Escherichia coli genomes to better than 99.93% accuracy with a raw throughput of 320 Mbp/day, albeit with a significant number of small gaps attributed to losses in sample preparation.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: A massively parallel DNA display platform based on *in situ* RCA.**

**Figure 2: Probe design and characterization.**

**Figure 3: Fragments aligned to the reference genome in the Bacteriophage λ assembly.**

**Figure 4: Probabilistic basecalling algorithm.**

**Figure 5: The depth of coverage along the *E. coli* chromosome was strongly skewed toward the origin of replication.**

**Figure 6: Assembly statistics for the *E. coli* genome.**

Advanced preparation of fragment libraries enabled by oligonucleotide-modified 2′,3′-dideoxynucleotides

Article Open access 16 March 2022

Long-read sequencing for identification of insertion sites in large transposon mutant libraries

Article Open access 03 March 2022

The chemistry of next-generation sequencing

Article 16 October 2023

Accession codes

Accessions

GenBank/EMBL/DDBJ

References

Sanger, F., Nicklen, S. & Coulson, A.R. DNA sequencing with chain-terminating inhibitors. Proc. Natl. Acad. Sci. USA 74, 5463–5467 (1977).
Article CAS Google Scholar
Prober, J.M. et al. A system for rapid DNA sequencing with fluorescent chain-terminating dideoxynucleotides. Science 238, 336–341 (1987).
Article CAS Google Scholar
Luckey, J.A. et al. High speed DNA sequencing by capillary electrophoresis. Nucleic Acids Res. 18, 4417–4421 (1990).
Article CAS Google Scholar
Venter, J.C. et al. Environmental genome shotgun sequencing of the Sargasso Sea. Science 304, 66–74 (2004).
Article CAS Google Scholar
The International HapMap Consortium. A haplotype map of the human genome. Nature 437, 1299–1320 (2005).
Klein, R.J. et al. Complement factor H polymorphism in age-related macular degeneration. Science 308, 385–389 (2005).
Article CAS Google Scholar
Maraganore, D.M. et al. High-resolution whole-genome association study of Parkinson disease. Am. J. Hum. Genet. 77, 685–693 (2005).
Article CAS Google Scholar
Margulies, M. et al. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437, 376–380 (2005).
Article CAS Google Scholar
Shendure, J. et al. Accurate multiplex polony sequencing of an evolved bacterial genome. Science 309, 1728–1732 (2005).
Article CAS Google Scholar
Blazej, R.G., Kumaresan, P. & Mathies, R.A. Microfabricated bioprocessor for integrated nanoliter-scale Sanger DNA sequencing. Proc. Natl. Acad. Sci. USA 103, 7240–7245 (2006).
Article CAS Google Scholar
Bennett, S.T., Barnes, C., Cox, A., Davies, L. & Brown, C. Toward the 1,000 dollars human genome. Pharmacogenomics 6, 373–382 (2005).
Article CAS Google Scholar
Shendure, J., Mitra, R.D., Varma, C. & Church, G.M. Advanced sequencing technologies: methods and goals. Nat. Rev. Genet. 5, 335–344 (2004).
Article CAS Google Scholar
Brenner, S. et al. Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays. Nat. Biotechnol. 18, 630–634 (2000).
Article CAS Google Scholar
Ghadessy, F.J., Ong, J.L. & Holliger, P. Directed evolution of polymerase function by compartmentalized self-replication. Proc. Natl. Acad. Sci. USA 98, 4552–4557 (2001).
Article CAS Google Scholar
Mitra, R.D. & Church, G.M. In situ localized amplification and contact replication of many individual DNA molecules. Nucleic Acids Res. 27, e34 (1999).
Article CAS Google Scholar
Bing, D.H. et al. Bridge amplification: a solid phase PCR system for the amplification and detection of allelic differences in single copy genes. Genetic Identity Conference Proceedings, Seventh International Symposium on Human Identification, Scottsdale, AZ, September 18–20, 1996 (Promega Corp., Madison, WI, 1996).
Google Scholar
Braslavsky, I., Hebert, B., Kartalov, E. & Quake, S.R. Sequence information can be obtained from single DNA molecules. Proc. Natl. Acad. Sci. USA 100, 3960–3964 (2003).
Article CAS Google Scholar
Hyman, E.D. A new method of sequencing DNA. Anal. Biochem. 174, 423–436 (1988).
Article CAS Google Scholar
Ronaghi, M., Karamohamed, S., Pettersson, B., Uhlen, M. & Nyren, P. Real-time DNA sequencing using detection of pyrophosphate release. Anal. Biochem. 242, 84–89 (1996).
Article CAS Google Scholar
Metzker, M.L. et al. Termination of DNA synthesis by novel 3′-modified-deoxyribonucleoside 5′-triphosphates. Nucleic Acids Res. 22, 4259–4267 (1994).
Article CAS Google Scholar
Canard, B. & Sarfati, R.S. DNA polymerase fluorescent substrates with reversible 3′-tags. Gene 148, 1–6 (1994).
Article CAS Google Scholar
Hinds, D.A. et al. Whole-genome patterns of common DNA variation in three human populations. Science 307, 1072–1079 (2005).
Article CAS Google Scholar
Drmanac, R., Petrovic, N., Glisin, V. & Crkvenjakov, R. Sequencing of megabase plus DNA by hybridization: theory of the method. Genomics 4, 114–128 (1989).
Article CAS Google Scholar
Drmanac, S. et al. Accurate sequencing by hybridization for DNA diagnostics and individual genomics. Nat. Biotechnol. 16, 54–58 (1998).
Article CAS Google Scholar
Bains, W. & Smith, G.C. A novel method for nucleic acid sequence determination. J. Theor. Biol. 135, 303–307 (1988).
Article CAS Google Scholar
Lysov, Y.P., Florent'ev, V.L., Khorlin, A.A., Khrapko, K.R. & Shik, V.V. Determination of the nucleotide sequence of DNA using hybridization with oligonucleotides. A new method. Dokl. Akad. Nauk SSSR 303, 1508–1511 (1988).
CAS PubMed Google Scholar
Lizardi, P.M. et al. Mutation detection and single-molecule counting using isothermal rolling-circle amplification. Nat. Genet. 19, 225–232 (1998).
Article CAS Google Scholar
Koshkin, A.A. et al. LNA (Locked Nucleic Acids): synthesis of the adenine, cytosine, guanine, 5-methylcytosine, thymine and uracil bicyclonucleoside monomers, oligomerisation, and unprecedented nucleic acid recognition. Tetrahedron 54, 3607–3630 (1998).
Article CAS Google Scholar
Donachie, W.D. The cell cycle of Escherichia coli. Annu. Rev. Microbiol. 47, 199–230 (1993).
Article CAS Google Scholar
Ewing, B. & Green, P. Base-calling of automated sequencer traces using phred. ii. Error probabilities. Genome Res. 8, 186–194 (1998).
Article CAS Google Scholar
Shamir, R. & Tsur, D. Large scale sequencing by hybridization. J. Comput. Biol. 9, 413–428 (2002).
Article CAS Google Scholar
Drmanac, R. et al. Sequencing by hybridization (SBH): advantages, achievements, and opportunities. Adv. Biochem. Eng. Biotechnol. 77, 75–101 (2002).
CAS PubMed Google Scholar
Arratia, R., Martin, D., Reinert, G. & Waterman, M.S. Poisson process approximation for sequence repeats, and sequencing by hybridization. J. Comput. Biol. 3, 425–463 (1996).
Article CAS Google Scholar
Pe'er, I., Arbili, N. & Shamir, R. A computational method for resequencing long DNA targets by universal oligonucleotide arrays. Proc. Natl. Acad. Sci. USA 99, 15492–15496 (2002).
Article CAS Google Scholar
Whiteford, N. et al. An analysis of the feasibility of short read sequencing. Nucleic Acids Res. 33, e171 (2005).
Article Google Scholar
Lander, E.S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
Article CAS Google Scholar
Church, G., Shendure, J. & Porreca, G. Sequencing thoroughbreds. Nat. Biotechnol. 24, 139 (2006).
Article CAS Google Scholar
Williams, R. et al. Amplification of complex gene libraries by emulsion PCR. Nat. Methods 3, 545–550 (2006).
Article CAS Google Scholar

Download references

Acknowledgements

We thank M. Nilsson and U. Landegren for early advice on RCA; J. Sagemark for initial bioinformatics analysis of the feasibility of the concept; P. Bérubé for the E. coli DNA preparation; M. Belouchi, P. van Eerdewegh, J. Hooper, B. Houle, R. Paulussen for helpful discussions; and P. Ernfors for advice and discussions. This work was supported by Swedish Research Council grant 522-2006-6511.

Author information

Göran Baurén
Present address: Present address: GE Healthcare, Björkgatan 30, SE-751 84 Uppsala, Sweden.,

Authors and Affiliations

Department of Medical Biochemistry and Biophysics, Laboratory for Molecular Neurobiology, Karolinska Institutet, Scheeles väg 1, Stockholm, SE-171 77, Sweden
Arno Pihlak, Ellef Hersoug, Peter Lönnerberg, Ats Metsis & Sten Linnarsson
Genizon Svenska AB, Nobels väg 12A, SE-171 77, Stockholm, Sweden
Arno Pihlak, Göran Baurén, Ellef Hersoug, Peter Lönnerberg, Ats Metsis & Sten Linnarsson

Authors

Arno Pihlak
View author publications
You can also search for this author in PubMed Google Scholar
Göran Baurén
View author publications
You can also search for this author in PubMed Google Scholar
Ellef Hersoug
View author publications
You can also search for this author in PubMed Google Scholar
Peter Lönnerberg
View author publications
You can also search for this author in PubMed Google Scholar
Ats Metsis
View author publications
You can also search for this author in PubMed Google Scholar
Sten Linnarsson
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.P. developed the short LNA probes, participated in the development of sample preparation, the rolling-circle arrays and the hybridization cycle. G.B. participated in the development of sample preparation and the hybridization cycle. A.M. participated in the development of sample preparation and the rolling-circle arrays. P.L. participated in the development of image analysis and basecalling software and algorithms. E.H. developed the custom Peltier assembly, built the instrument and participated in the development of instrument control and image analysis software. S.L. conceived of the concept of shotgun SBH, participated in probe design, the development of sample preparation, rolling-circle arrays, hybridization cycle, and of the instrument control, image analysis and basecalling software; analyzed the experiments, directed the research and drafted the manuscript.

Corresponding author

Correspondence to Sten Linnarsson.

Ethics declarations

Competing interests

The authors are former employees of Genizon Svenska AB, an affiliate of Genizon Biosciences Inc. (Montreal), which has also funded the research. The authors may under certain circumstances stand to receive royalty payments or similar benefits related to the technology presented in the paper.

Supplementary information

Supplementary Text and Figures

Supplementary Figs. 1–3, Tables 1–3 (PDF 478 kb)

Supplementary Data

Supplementary Data (XLS 527 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pihlak, A., Baurén, G., Hersoug, E. et al. Rapid genome sequencing with short universal tiling probes. Nat Biotechnol 26, 676–684 (2008). https://doi.org/10.1038/nbt1405

Download citation

Received: 09 October 2007
Accepted: 29 April 2008
Published: 25 May 2008
Issue Date: June 2008
DOI: https://doi.org/10.1038/nbt1405

This article is cited by

An Algorithm for Sequencing by Hybridization Based on an Alternating DNA Chip
- Marcin Radom
- Piotr Formanowicz
Interdisciplinary Sciences: Computational Life Sciences (2018)
Single-molecule mechanical identification and sequencing
- Fangyuan Ding
- Maria Manosas
- Vincent Croquette
Nature Methods (2012)
Estimating accuracy of RNA-Seq and microarrays with proteomics
- Xing Fu
- Ning Fu
- Philipp Khaitovich
BMC Genomics (2009)
The challenges of sequencing by synthesis
- Carl W Fuller
- Lyle R Middendorf
- Dmitri V Vezenov
Nature Biotechnology (2009)
High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies
- Jean-Marc Aury
- Corinne Cruaud
- Patrick Wincker
BMC Genomics (2008)

Rapid genome sequencing with short universal tiling probes

Abstract

Access options

Similar content being viewed by others

Advanced preparation of fragment libraries enabled by oligonucleotide-modified 2′,3′-dideoxynucleotides

Long-read sequencing for identification of insertion sites in large transposon mutant libraries

The chemistry of next-generation sequencing

Accession codes

Accessions

GenBank/EMBL/DDBJ

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Supplementary Text and Figures

Supplementary Data

Rights and permissions

About this article

Cite this article

This article is cited by

An Algorithm for Sequencing by Hybridization Based on an Alternating DNA Chip

Single-molecule mechanical identification and sequencing

Estimating accuracy of RNA-Seq and microarrays with proteomics

The challenges of sequencing by synthesis

High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies

Next-generation sequencing-by-hybridization

Search

Quick links

Abstract

Access options

Similar content being viewed by others

Accession codes

Accessions

GenBank/EMBL/DDBJ

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links