Targeted capture and massively parallel sequencing of 12 human exomes

Ng, Sarah B.; Turner, Emily H.; Robertson, Peggy D.; Flygare, Steven D.; Bigham, Abigail W.; Lee, Choli; Shaffer, Tristan; Wong, Michelle; Bhattacharjee, Arindam; Eichler, Evan E.; Bamshad, Michael; Nickerson, Deborah A.; Shendure, Jay

doi:10.1038/nature08250

Letter
Published: 16 August 2009

Targeted capture and massively parallel sequencing of 12 human exomes

Sarah B. Ng¹,
Emily H. Turner¹,
Peggy D. Robertson¹,
Steven D. Flygare¹,
Abigail W. Bigham²,
Choli Lee¹,
Tristan Shaffer¹,
Michelle Wong¹,
Arindam Bhattacharjee⁴,
Evan E. Eichler^1,3,
Michael Bamshad²,
Deborah A. Nickerson¹ &
…
Jay Shendure¹

Nature volume 461, pages 272–276 (2009)Cite this article

18k Accesses
1439 Citations
78 Altmetric
Metrics details

Abstract

Genome-wide association studies suggest that common genetic variants explain only a modest fraction of heritable risk for common diseases, raising the question of whether rare variants account for a significant fraction of unexplained heritability^1,2. Although DNA sequencing costs have fallen markedly³, they remain far from what is necessary for rare and novel variants to be routinely identified at a genome-wide scale in large cohorts. We have therefore sought to develop second-generation methods for targeted sequencing of all protein-coding regions (‘exomes’), to reduce costs while enriching for discovery of highly penetrant variants. Here we report on the targeted capture and massively parallel sequencing of the exomes of 12 humans. These include eight HapMap individuals representing three populations⁴, and four unrelated individuals with a rare dominantly inherited disorder, Freeman–Sheldon syndrome (FSS)⁵. We demonstrate the sensitive and specific identification of rare and common variants in over 300 megabases of coding sequence. Using FSS as a proof-of-concept, we show that candidate genes for Mendelian disorders can be identified by exome sequencing of a small number of unrelated, affected individuals. This strategy may be extendable to diseases with more complex genetics through larger sample sizes and appropriate weighting of non-synonymous variants by predicted functional impact.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

Figure 1: **Minor allele frequency and coding indel length distributions.**

Figure 2: **Direct identification of the causal gene for a monogenic disorder by exome sequencing.**

Systematic analysis of paralogous regions in 41,755 exomes uncovers clinically relevant variation

Article Open access 27 October 2023

A structural variation reference for medical and population genetics

Article Open access 27 May 2020

GATK-gCNV enables the discovery of rare copy number variants from exome sequencing data

Article 21 August 2023

References

Cohen, J. C. et al. Multiple rare alleles contribute to low plasma levels of HDL cholesterol. Science 305, 869–872 (2004)
Article ADS CAS Google Scholar
Frazer, K. A., Murray, S. S., Schork, N. J. & Topol, E. J. Human genetic variation and its contribution to complex traits. Nature Rev. Genet. 10, 241–251 (2009)
Article CAS Google Scholar
Shendure, J. & Ji, H. Next-generation DNA sequencing. Nature Biotechnol. 26, 1135–1145 (2008)
Article CAS Google Scholar
The International HapMap Consortium. A haplotype map of the human genome. Nature 437, 1299–1320 (2005)
Toydemir, R. M. et al. Mutations in embryonic myosin heavy chain (MYH3) cause Freeman-Sheldon syndrome and Sheldon-Hall syndrome. Nature Genet. 38, 561–565 (2006)
Article CAS Google Scholar
Sjoblom, T. et al. The consensus coding sequences of human breast and colorectal cancers. Science 314, 268–274 (2006)
Article ADS Google Scholar
Olson, M. Enrichment of super-sized resequencing targets from the human genome. Nature Methods 4, 891–892 (2007)
Article CAS Google Scholar
Hodges, E. et al. Genome-wide in situ exon capture for selective resequencing. Nature Genet. 39, 1522–1527 (2007)
Article CAS Google Scholar
National Center for Biotechnology Information. Consensus CDS protein set <http://www.ncbi.nlm.nih.gov/projects/CCDS> (2009)
Ng, P. C. et al. Genetic variation in an individual human exome. PLoS Genet. 4, e1000160 (2008)
Article Google Scholar
Kidd, J. M. et al. Mapping and sequencing of structural variation from eight human genomes. Nature 453, 56–64 (2008)
Article ADS CAS Google Scholar
Bentley, D. R. et al. Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456, 53–59 (2008)
Article ADS CAS Google Scholar
Li, H., Ruan, J. & Durbin, R. Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 18, 1851–1858 (2008)
Article CAS Google Scholar
Campbell, P. J. et al. Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing. Nature Genet. 40, 722–729 (2008)
Article CAS Google Scholar
Ewing, B. & Green, P. Base-calling of automated sequencer traces using Phred. II. Error probabilities. Genome Res. 8, 186–194 (1998)
Article CAS Google Scholar
Turner, E. H., Lee, C., Ng, S. B. & Shendure, J. Massively parallel exon capture and library-free resequencing across 16 individuals. Nature Methods 6, 315–316 (2009)
Article CAS Google Scholar
Kidd, J. M. et al. Haplotype sorting using human fosmid clone end-sequence pairs. Genome Res. 18, 2016–2023 (2008)
Article CAS Google Scholar
Albert, T. J. et al. Direct selection of human genomic loci by microarray hybridization. Nature Methods 4, 903–905 (2007)
Article CAS Google Scholar
Wheeler, D. A. et al. The complete genome of an individual by massively parallel DNA sequencing. Nature 452, 872–876 (2008)
Article ADS CAS Google Scholar
Wang, J. et al. The diploid genome sequence of an Asian individual. Nature 456, 60–65 (2008)
Article ADS CAS Google Scholar
Levy, S. et al. The diploid genome sequence of an individual human. PLoS Biol. 5, e254 (2007)
Article Google Scholar
Ley, T. J. et al. DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome. Nature 456, 66–72 (2008)
Article ADS CAS Google Scholar
Boyko, A. R. et al. Assessing the evolutionary impact of amino acid mutations in the human genome. PLoS Genet. 4, e1000083 (2008)
Article Google Scholar
Sunyaev, S. et al. Prediction of deleterious human alleles. Hum. Mol. Genet. 10, 591–597 (2001)
Article CAS Google Scholar
Yngvadottir, B. et al. A genome-wide survey of the prevalence and evolutionary forces acting on human nonsense SNPs. Am. J. Hum. Genet. 84, 224–234 (2009)
Article CAS Google Scholar
Olson, M. V. When less is more: gene loss as an engine of evolutionary change. Am. J. Hum. Genet. 64, 18–23 (1999)
Article CAS Google Scholar
Cohen, J. et al. Low LDL cholesterol in individuals of African descent resulting from frequent nonsense mutations in PCSK9 . Nature Genet. 37, 161–165 (2005)
Article CAS Google Scholar
Jones, S. et al. Exomic sequencing identifies PALB2 as a pancreatic cancer susceptibility gene. Science 324, 217 (2009)
Article ADS CAS Google Scholar
Siva, N. 1000 Genomes project. Nature Biotechnol. 26, 256 (2008)
Article Google Scholar
Kryukov, G. V., Shpunt, A., Stamatoyannopoulos, J. A. & Sunyaev, S. R. Power of deep, all-exon resequencing for discovery of human trait genes. Proc. Natl Acad. Sci. USA 106, 3871–3876 (2009)
Article ADS CAS Google Scholar

Download references

Acknowledgements

For discussions or assistance with genotyping data, we thank P. Green, J. Akey, R. Patwardhan, G. Cooper, J. Kidd, D. Gordon, J. Smith, I. Stanaway and M. Rieder. For assistance with project management, computation, data management and submission, we thank E. Torskey, S. Thompson, T. Amburg, B. McNally, S. Hearsey, M. Shumway and L. Hillier. For Human1M-Duo genotype data on HapMap samples, we thank Illumina. Our work was supported in part by grants from the National Institutes of Health/National Heart Lung and Blood Institute, the National Institutes of Health/National Human Genome Research Institute, National Institutes of Health/National Institute of Child Health and Human Development, and the Washington Research Foundation. S.B.N. is supported by the Agency for Science, Technology and Research, Singapore. E.H.T. and A.W.B. are supported by a training fellowship from the National Institutes of Health/National Human Genome Research Institute. E.E.E. is an investigator of the Howard Hughes Medical Institute.

Author Contributions The project was conceived and experiments planned by S.B.N., E.H.T., A.B., E.E.E., M.B., D.A.N. and J.S. Experiments were performed by S.B.N., E.H.T., C.L. and M.W. Algorithm development and data analysis were performed by S.B.N., P.D.R., S.D.F., A.W.B., T.S., M.B., D.A.N. and J.S. The manuscript was written by S.B.N. and J.S. All aspects of the study were supervised by J.S.

Author information

Authors and Affiliations

Department of Genome Sciences,,
Sarah B. Ng, Emily H. Turner, Peggy D. Robertson, Steven D. Flygare, Choli Lee, Tristan Shaffer, Michelle Wong, Evan E. Eichler, Deborah A. Nickerson & Jay Shendure
Department of Pediatrics, University of Washington,
Abigail W. Bigham & Michael Bamshad
Howard Hughes Medical Institute, Seattle, Washington 98195, USA ,
Evan E. Eichler
Agilent Technologies, Santa Clara, California 95051, USA ,
Arindam Bhattacharjee

Authors

Sarah B. Ng
View author publications
You can also search for this author in PubMed Google Scholar
Emily H. Turner
View author publications
You can also search for this author in PubMed Google Scholar
Peggy D. Robertson
View author publications
You can also search for this author in PubMed Google Scholar
Steven D. Flygare
View author publications
You can also search for this author in PubMed Google Scholar
Abigail W. Bigham
View author publications
You can also search for this author in PubMed Google Scholar
Choli Lee
View author publications
You can also search for this author in PubMed Google Scholar
Tristan Shaffer
View author publications
You can also search for this author in PubMed Google Scholar
Michelle Wong
View author publications
You can also search for this author in PubMed Google Scholar
Arindam Bhattacharjee
View author publications
You can also search for this author in PubMed Google Scholar
Evan E. Eichler
View author publications
You can also search for this author in PubMed Google Scholar
Michael Bamshad
View author publications
You can also search for this author in PubMed Google Scholar
Deborah A. Nickerson
View author publications
You can also search for this author in PubMed Google Scholar
Jay Shendure
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Sarah B. Ng or Jay Shendure.

Ethics declarations

Competing interests

COMPETING INTERESTS: A.B. is an employee of Agilent Technologies. Agilent supplies arrays that can be used for exome capture as described.

Additional information

The authors declare competing financial interests: details accompany the full-text HTML version of the paper at www.nature.com/nature.

Supplementary information

Supplementary Information

This file contains Supplementary Figures 1-6 with Legends and Supplementary Tables 1-5. (PDF 161 kb)

Supplementary Data 1

This file lists intervals within the targeted exome that were excluded from consideration based on poor anticipated mappability with 76 bp single-end reads. (TXT 211 kb)

Supplementary Data 2

This file lists the fraction of targeted coding bases in each gene that were covered in each of 12 individuals (either with >=1x coverage or with sufficient coverage to variant call). (TXT 2828 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ng, S., Turner, E., Robertson, P. et al. Targeted capture and massively parallel sequencing of 12 human exomes. Nature 461, 272–276 (2009). https://doi.org/10.1038/nature08250

Download citation

Received: 05 June 2009
Accepted: 29 June 2009
Published: 16 August 2009
Issue Date: 10 September 2009
DOI: https://doi.org/10.1038/nature08250

This article is cited by

Adoptive neoantigen-reactive T cell therapy: improvement strategies and current clinical researches
- Ruichen Huang
- Bi Zhao
- Wei Zhang
Biomarker Research (2023)
A Novel Frameshift Microdeletion of the TEX12 Gene Caused Infertility in Two Brothers with Nonobstructive Azoospermia
- Minh Duc Bui
- Thi Lan Anh Luong
- Van Hai Nong
Reproductive Sciences (2023)
Toripalimab combined with lenvatinib and GEMOX is a promising regimen as first-line treatment for advanced intrahepatic cholangiocarcinoma: a single-center, single-arm, phase 2 study
- Guo-Ming Shi
- Xiao-Yong Huang
- Jian Zhou
Signal Transduction and Targeted Therapy (2023)
Rabbit targeted genomic sequences after heterologous hybridization using human exome
- Nathalie Iannuccelli
- Julien Sarry
- Julie Demars
BMC Research Notes (2022)
Splicing mutations in the CFTR gene as therapeutic targets
- Karine Deletang
- Magali Taulan-Cadars
Gene Therapy (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Targeted capture and massively parallel sequencing of 12 human exomes

Abstract

Access options

Similar content being viewed by others

Systematic analysis of paralogous regions in 41,755 exomes uncovers clinically relevant variation

A structural variation reference for medical and population genetics

GATK-gCNV enables the discovery of rare copy number variants from exome sequencing data

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Supplementary Data 1

Supplementary Data 2

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

Rights and permissions

About this article

Cite this article

This article is cited by

Adoptive neoantigen-reactive T cell therapy: improvement strategies and current clinical researches

A Novel Frameshift Microdeletion of the TEX12 Gene Caused Infertility in Two Brothers with Nonobstructive Azoospermia

Toripalimab combined with lenvatinib and GEMOX is a promising regimen as first-line treatment for advanced intrahepatic cholangiocarcinoma: a single-center, single-arm, phase 2 study

Rabbit targeted genomic sequences after heterologous hybridization using human exome

Splicing mutations in the CFTR gene as therapeutic targets

Comments

Search

Quick links

Abstract

Access options

Similar content being viewed by others

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

PowerPoint slides

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links