Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Selective multiplexed enrichment for the detection and quantitation of low-fraction DNA variants via low-depth sequencing

Abstract

DNA sequence variants with allele fractions below 1% are difficult to detect and quantify by sequencing owing to intrinsic errors in sequencing-by-synthesis methods. Although molecular-identifier barcodes can detect mutations with a variant-allele frequency (VAF) as low as 0.1% using next-generation sequencing (NGS), sequencing depths of over 25,000× are required, thus hampering the detection of mutations at high sensitivity in patient samples and in most samples used in research. Here we show that low-frequency DNA variants can be detected via low-depth multiplexed NGS after their amplification, by a median of 300-fold, using polymerase chain reaction and rationally designed ‘blocker’ oligonucleotides that bind to the variants. Using an 80-plex NGS panel and a sequencing depth of 250×, we detected single nucleotide polymorphisms with a VAF of 0.019% and contamination in human cell lines at a VAF as low as 0.07%. With a 16-plex NGS panel covering 145 mutations across 9 genes involved in melanoma, we detected low-VAF mutations (0.2–5%) in 7 out of the 19 samples of freshly frozen tumour biopsies, suggesting that tumour heterogeneity could be notably higher than previously recognized.

This is a preview of subscription content, access via your institution

Access options

Rent or buy this article

Prices vary by article type

from$1.95

to$39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Allele enrichment with mBDA enables the detection of rare variants using low-depth sequencing.
Fig. 2: Quantitation of variant VAFs based on observed VRF values from mBDA libraries.
Fig. 3: Detection and quantitation of variants with a low VAF using mBDA NGS.
Fig. 4: Determination of contaminant identity based on mBDA NGS data.
Fig. 5: Detection of cell-line contamination using mBDA with qPCR.
Fig. 6: Validation of mBDA panels on clinical samples.

Similar content being viewed by others

Data availability

The main data supporting the results in this study are available within the paper and its Supplementary Information. All requests for raw and analysed data will be reviewed by the Legal Department of Rice University to verify whether the request is subject to any intellectual property or confidentiality constraints. Requests for patient-related data not included in the paper will not be considered. Data can be shared for non-commercial research purposes via a material transfer agreement.

Code availability

All requests for code will be reviewed by the Legal Department of Rice University to verify whether the request is subject to any intellectual property or confidentiality constraints. Custom code can be shared for non-commercial research purposes via a material transfer agreement.

References

  1. Goodwin, S., McPherson, J. D. & McCombie, W. R. Coming of age: ten years of next-generation sequencing technologies. Nat. Rev. Genet. 17, 333–351 (2016).

    Article  CAS  PubMed  Google Scholar 

  2. Mardis, E. R. Next-generation sequencing platforms. Annu. Rev. Anal. Chem. 6, 287–303 (2013).

    Article  CAS  Google Scholar 

  3. Schirmer, M. et al. Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform. Nucleic Acids Res. 43, e37 (2015).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  4. Schirmer, M., DaAmore, R., Ijaz, U. Z., Hall, N. & Quince, C. Illumina error profiles: resolving fine-scale variation in metagenomic sequencing data. BMC Bioinform. 17, 125 (2016).

    Article  CAS  Google Scholar 

  5. Kinde, I., Wu, J., Papadopoulos, N., Kinzler, K. W. & Vogelstein, B. Detection and quantification of rare mutations with massively parallel sequencing. Proc. Natl Acad. Sci. USA 108, 9530–9535 (2011).

    Article  PubMed  PubMed Central  Google Scholar 

  6. Newman, A. M. et al. Integrated digital error suppression for improved detection of circulating tumor DNA. Nat. Biotechnol. 34, 547–555 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Schmitt, M. W. et al. Detection of ultra-rare mutations by next-generation sequencing. Proc. Natl Acad. Sci. USA 109, 14508–14513 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Schwaederle, M. C. et al. Utility of genomic assessment of blood-derived circulating tumor DNA (ctDNA) in patients with advanced lung adenocarcinoma. Clin. Cancer Res. 23, 5101–5111 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Ma, C. X. et al. Neratinib efficacy and circulating tumor DNA detection of HER2 mutations in HER2 nonamplified metastatic breast cancer. Clin. Cancer Res. 23, 5687–5695 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Hata, A. N. et al. Tumor cells can follow distinct evolutionary paths to become resistant to epidermal growth factor receptor inhibition. Nat. Med. 22, 262–269 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Cohen, J. D. et al. Detection and localization of surgically resectable cancers with a multi-analyte blood test. Science 359, 926–930 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Zitvogel, L., Ma, Y., Raoult, D., Kroemer, G. & Gajewski, T. F. The microbiome in cancer immunotherapy: diagnostic tools and therapeutic strategies. Science 359, 1366–1370 (2018).

    Article  CAS  PubMed  Google Scholar 

  13. Zhernakova, A. et al. Population-based metagenomics analysis reveals markers for gut microbiome composition and diversity. Science 352, 565–569 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Fisher, R. A., Gollan, B. & Helaine, S. Persistent bacterial infections and persister cells. Nat. Rev. Microbiol. 15, 453–464 (2017).

    Article  CAS  PubMed  Google Scholar 

  15. Merker, M. et al. Whole genome sequencing reveals complex evolution patterns of multidrug-resistant Mycobacterium tuberculosis Beijing strains in patients. PLoS ONE 8, e82551 (2013).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  16. Hindson, B. J. et al. High-throughput droplet digital PCR system for absolute quantitation of DNA copy number. Anal. Chem. 83, 8604–8610 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Watanabe, M. et al. Ultra-sensitive detection of the pretreatment EGFR T790M mutation in non-small cell lung cancer patients with an EGFR-activating mutation using droplet digital PCR. Clin. Cancer Res. 21, 3552–3560 (2015).

    Article  CAS  PubMed  Google Scholar 

  18. Didelot, A. et al. Competitive allele specific TaqMan PCR for KRAS, BRAF and EGFR mutation detection in clinical formalin fixed paraffin embedded samples. Exp. Mol. Pathol. 92, 275–280 (2012).

    Article  CAS  PubMed  Google Scholar 

  19. Milbury, C. A., Li, J. & Makrigiorgos, G. M. Ice-COLD-PCR enables rapid amplification and robust enrichment for low-abundance unknown DNA mutations. Nucleic Acids Res. 39, e2 (2010).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  20. Wu, L. R., Chen, S. X., Wu, Y., Patel, A. A. & Zhang, D. Y. Multiplexed enrichment of rare DNA variants via sequence-selective and temperature-robust amplification. Nat. Biomed. Eng. 1, 714–723 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. International HapMap Project (National Human Genome Research Institute, 2012); https://www.genome.gov/10001688/international-hapmap-project/

  22. 1000 Genomes Project (The International Genome Sample Resource, 2021); http://www.internationalgenome.org/

  23. Potapov, V. & Ong, J. L. Examining sources of error in PCR by single-molecule sequencing. PLoS ONE 12, e0169774 (2017).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  24. Volokhov, D. V., Graham, L. J., Brorson, K. A. & Chizhikov, V. E. Mycoplasma testing of cell substrates and biologics: review of alternative non-microbiological techniques. Mol. Cell. Probes. 25, 69–77 (2011).

    Article  CAS  PubMed  Google Scholar 

  25. Capes-Davis, A. et al. Check your cultures! A list of cross-contaminated or misidentified cell lines. Int. J. Cancer 127, 1–8 (2010).

    Article  CAS  PubMed  Google Scholar 

  26. Qu, K. et al. Detection of BRAF V600 mutations in metastatic melanoma: comparison of the Cobas 4800 and Sanger sequencing assays. J. Mol. Diagn. 15, 790–795 (2013).

    Article  CAS  PubMed  Google Scholar 

  27. Zehir, A. et al. Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. Nat. Med. 23, 703–713 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Beltran, H. et al. Targeted next-generation sequencing of advanced prostate cancer identifies potential therapeutic targets and disease heterogeneity. Eur. Urol. 63, 920–926 (2013).

    Article  CAS  PubMed  Google Scholar 

  29. Russo, M. et al. Tumor heterogeneity and lesion-specific response to targeted therapy in colorectal cancer. Cancer Discov. 6, 147–153 (2016).

    Article  CAS  PubMed  Google Scholar 

  30. McGranahan, N. & Swanton, C. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168, 613–628 (2017).

    Article  CAS  PubMed  Google Scholar 

  31. Dagogo-Jack, I. & Shaw, A. T. Tumour heterogeneity and resistance to cancer therapies. Nat. Rev. Clin. Oncol. 15, 81–94 (2018).

    Article  CAS  PubMed  Google Scholar 

  32. Romano, G. et al. A preexisting rare PIK3CAE545K subpopulation confers clinical resistance to MEK plus CDK4/6 inhibition in NRAS melanoma and is dependent on S6K1 signaling. Cancer Discov. 8, 556–567 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Tate, J. G. et al. COSMIC: the catalogue of somatic mutations in cancer. Nucleic Acids Res. 47, D941–D947 (2018).

    Article  PubMed Central  CAS  Google Scholar 

  34. Pel, J. et al. Nonlinear electrophoretic response yields a unique parameter for separation of biomolecules. Proc. Natl Acad. Sci. USA 106, 14796–14801 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Kang, S. et al. Targeted sequencing with enrichment PCR: a novel diagnostic method for the detection of EGFR mutations. Oncotarget 6, 13742–13749 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  36. Song, C. et al. Elimination of unaltered DNA in mixed clinical samples via nuclease-assisted minor-allele enrichment. Nucleic Acids Res. 44, e146 (2016).

    PubMed  PubMed Central  CAS  Google Scholar 

  37. Lee, S. H. et al. CUT-PCR: CRISPR-mediated, ultrasensitive detection of target DNA using PCR. Oncogene 36, 6823–6829 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Alexandrov, L. B. et al. The repertoire of mutational signatures in human cancer. Nature 578, 94–101 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Barretina, J. et al. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483, 603–607 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Sharma, S. V., Haber, D. A. & Settleman, J. Cell line-based platforms to evaluate the therapeutic efficacy of candidate anticancer agents. Nat. Rev. Cancer 10, 241–253 (2010).

    Article  CAS  PubMed  Google Scholar 

  41. Kikuchi, T. et al. Human iPS cell-derived dopaminergic neurons function in a primate Parkinson’s disease model. Nature 548, 592–596 (2017).

    Article  CAS  PubMed  Google Scholar 

  42. Schweppe, R. E. et al. Deoxyribonucleic acid profiling analysis of 40 human thyroid cancer cell lines reveals cross-contamination resulting in cell line redundancy and misidentification. J. Clin. Endocrinol. Metab. 93, 4331–4341 (2008).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Freedman, L. P. et al. Reproducibility: changing the policies and culture of cell line authentication. Nat. Methods 12, 493–497 (2015).

    Article  CAS  PubMed  Google Scholar 

  44. Huang, Y., Liu, Y., Zheng, C. & Shen, C. Investigation of cross-contamination and misidentification of 278 widely used tumor cell lines. PLoS ONE 12, e0170384 (2017).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  45. Horbach, S. P. & Halffman, W. The ghosts of HeLa: how cell line misidentification contaminates the scientific literature. PLoS ONE 12, e0186281 (2017).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  46. Martincorena, I. et al. High burden and pervasive positive selection of somatic mutations in normal human skin. Science 348, 880–886 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Do, H. & Dobrovic, A. Sequence artifacts in DNA from formalin-fixed tissues: causes and strategies for minimization. Clin. Chem. 61, 64–71 (2015).

    Article  CAS  PubMed  Google Scholar 

  48. Chen, G., Mosier, S., Gocke, C. D., Lin, M. T. & Eshleman, J. R. Cytosine deamination is a major cause of baseline noise in next-generation sequencing. Mol. Diagn. Ther. 18, 587–593 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Goldberg, S. B. et al. Early assessment of lung cancer immunotherapy response via circulating tumor DNA. Clin. Cancer Res. 24, 1872–1880 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

This work was supported by NIH grant nos R01CA203964 and R01CA233364, and CPRIT grant no. RP180147 to D.Y.Z. We thank J. Nie for proofreading assistance and G. Bao for providing access to his BioRad QX200 digital droplet PCR instrument. We thank Nuprobe for providing early access VarMap NSCLC kits for cfDNA testing.

Author information

Authors and Affiliations

Authors

Contributions

P.S. and D.Y.Z. conceived the project. P.S., S.X.C. and Y.H.Y. performed the mBDA sequence design for the cell-line contamination panels. L.Y.C. and P.D. performed the mBDA sequence design for the cancer panels. A.A.P. provided clinical cfDNA samples and performed the comparison deep sequencing experiments. P.S. and Y.H.Y. performed experiments and analysed qPCR data. P.S. and A.P. performed NGS experiments. P.S., S.X.C. and D.Y.Z. analysed the NGS data. P.S. and D.Y.Z. wrote the manuscript with input from all authors.

Corresponding author

Correspondence to David Yu Zhang.

Ethics declarations

Competing interests

There are patents pending on the BDA (patent number EP3146080B1) and mBDA (patent number WO2019164885A1) methods used in this work. P.S., S.X.C., L.Y.C. and P.D. declare competing interests in the form of consulting for Nuprobe USA. A.A.P. declares a competing interest in the form of consulting for Nuprobe USA as well as consulting for and equity ownership in Binary Genomics. D.Y.Z. declares a competing interest in the form of consulting for and equity ownership in Nuprobe and Torus Biosystems as well as consulting for Avenge Bio.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary methods, figures and tables.

Reporting Summary

Supplementary Dataset

Design of mBDA for alleles of 80 SNPs.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Song, P., Chen, S.X., Yan, Y.H. et al. Selective multiplexed enrichment for the detection and quantitation of low-fraction DNA variants via low-depth sequencing. Nat Biomed Eng 5, 690–701 (2021). https://doi.org/10.1038/s41551-021-00713-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s41551-021-00713-0

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing