Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Specter: linear deconvolution for targeted analysis of data-independent acquisition mass spectrometry proteomics

Abstract

Mass spectrometry with data-independent acquisition (DIA) is a promising method to improve the comprehensiveness and reproducibility of targeted and discovery proteomics, in theory by systematically measuring all peptide precursors in a biological sample. However, the analytical challenges involved in discriminating between peptides with similar sequences in convoluted spectra have limited its applicability in important cases, such as the detection of single-nucleotide polymorphisms (SNPs) and alternative site localizations in phosphoproteomics data. We report Specter (https://github.com/rpeckner-broad/Specter), an open-source software tool that uses linear algebra to deconvolute DIA mixture spectra directly through comparison to a spectral library, thus circumventing the problems associated with typical fragment-correlation-based approaches. We validate the sensitivity of Specter and its performance relative to that of other methods, and show that Specter is able to successfully analyze cases involving highly similar peptides that are typically challenging for DIA analysis methods.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Specter uses linear algebra to formally deconvolute MS2 spectra derived from cofragmented precursors.
Figure 2: Total ion chromatograms calculated by Specter are as accurate as those from manual targeted analysis of DIA data in terms of both identification and quantification.
Figure 3: The false discovery rate of Specter is inherently <5%.
Figure 4: Specter chromatograms of groups of synthetic peptides with highly similar spectra.
Figure 5: Specter distinguished close positional isomers with overlapping chromatographic profiles in a phosphoproteomics data set.
Figure 6: Analysis of a public data set by Specter and five other software tools.

Similar content being viewed by others

References

  1. Tabb, D.L. et al. Repeatability and reproducibility in proteomic identifications by liquid chromatography-tandem mass spectrometry. J. Proteome Res. 9, 761–776 (2010).

    Article  CAS  Google Scholar 

  2. Picotti, P. & Aebersold, R. Selected reaction monitoring–based proteomics: workflows, potential, pitfalls and future directions. Nat. Methods 9, 555–566 (2012).

    Article  CAS  Google Scholar 

  3. Röst, H.L. et al. OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data. Nat. Biotechnol. 32, 219–223 (2014).

    Article  Google Scholar 

  4. Tsou, C.-C. et al. DIA-Umpire: comprehensive computational framework for data-independent acquisition proteomics. Nat. Methods 12, 258–264 (2015).

    Article  CAS  Google Scholar 

  5. Wang, J. et al. MSPLIT-DIA: sensitive peptide identification for data-independent acquisition. Nat. Methods 12, 1106–1108 (2015).

    Article  CAS  Google Scholar 

  6. Bilbao, A. et al. Ranking fragment ions based on outlier detection for improved label-free quantification in data-independent acquisition LC-MS/MS. J. Proteome Res. 14, 4581–4593 (2015).

    Article  CAS  Google Scholar 

  7. Bruderer, R. et al. Extending the limits of quantitative proteome profiling with data-independent acquisition and application to acetaminophen-treated three-dimensional liver microtissues. Mol. Cell. Proteomics 14, 1400–1410 (2015).

    Article  CAS  Google Scholar 

  8. Bern, M. et al. Deconvolution of mixture spectra from ion-trap data-independent-acquisition tandem mass spectrometry. Anal. Chem. 82, 833–841 (2010).

    Article  CAS  Google Scholar 

  9. Navarro, P. et al. A multicenter study benchmarks software tools for label-free proteome quantification. Nat. Biotechnol. 34, 1130–1136 (2016).

    Article  CAS  Google Scholar 

  10. Schulz-Trieglaff, O. et al. Statistical quality assessment and outlier detection for liquid chromatography-mass spectrometry experiments. BioData Min. 2, 4 (2009).

    Article  Google Scholar 

  11. Frewen, B.E., Merrihew, G.E., Wu, C.C., Noble, W.S. & MacCoss, M.J. Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries. Anal. Chem. 78, 5678–5684 (2006).

    Article  CAS  Google Scholar 

  12. Ritter, G.L., Lowry, S.R., Isenhour, T.L. & Wilkins, C.L. Factor analysis of the mass spectra of mixtures. Anal. Chem. 48, 591–595 (1976).

    Article  CAS  Google Scholar 

  13. Likic´, V.A. Extraction of pure components from overlapped signals in gas chromatography-mass spectrometry (GC-MS). BioData Min. 2, 6 (2009).

    Article  Google Scholar 

  14. Nikolskiy, I., Mahieu, N.G., Chen, Y.-J., Tautenhahn, R. & Patti, G.J. An untargeted metabolomic workflow to improve structural characterization of metabolites. Anal. Chem. 85, 7713–7719 (2013).

    Article  CAS  Google Scholar 

  15. Du, X. & Zeisel, S.H. Spectral deconvolution for gas chromatography mass spectrometry-based metabolomics: current status and future perspectives. Comput. Struct. Biotechnol. J. 4, e201301013 (2013).

    Article  Google Scholar 

  16. Lam, H., Deutsch, E.W. & Aebersold, R. Artificial decoy spectral libraries for false discovery rate estimation in spectral library searching in proteomics. J. Proteome Res. 9, 605–610 (2010).

    Article  CAS  Google Scholar 

  17. Cheng, C.-Y., Tsai, C.-F., Chen, Y.-J., Sung, T.-Y. & Hsu, W.-L. Spectrum-based method to generate good decoy libraries for spectral library searching in peptide identifications. J. Proteome Res. 12, 2305–2310 (2013).

    Article  CAS  Google Scholar 

  18. Du, X. et al. Linear discriminant analysis-based estimation of the false discovery rate for phosphopeptide identifications. J. Proteome Res. 7, 2195–2203 (2008).

    Article  CAS  Google Scholar 

  19. Shastry, B.S. SNPs in disease gene mapping, medicinal drug development and evolution. J. Hum. Genet. 52, 871–880 (2007).

    Article  CAS  Google Scholar 

  20. Lawrence, R.T., Searle, B.C., Llovet, A. & Villén, J. Plug-and-play analysis of the human phosphoproteome by targeted high-resolution mass spectrometry. Nat. Methods 13, 431–434 (2016).

    Article  CAS  Google Scholar 

  21. Rosenberger, G. et al. Inference and quantification of peptidoforms in large sample cohorts by SWATH-MS. Nat. Biotechnol. 35, 781–788 (2017).

    Article  CAS  Google Scholar 

  22. Malecz, N., Foisner, R., Stadler, C. & Wiche, G. Identification of plectin as a substrate of p34cdc2 kinase and mapping of a single phosphorylation site. J. Biol. Chem. 271, 8203–8208 (1996).

    Article  CAS  Google Scholar 

  23. Bouameur, J.-E. et al. Phosphorylation of serine 4,642 in the C-terminus of plectin by MNK2 and PKA modulates its interaction with intermediate filaments. J. Cell Sci. 126, 4195–4207 (2013).

    Article  CAS  Google Scholar 

  24. McQueen, P. et al. Whole cell, label free protein quantitation with data independent acquisition: quantitation at the MS2 level. Proteomics 15, 16–24 (2015).

    Article  CAS  Google Scholar 

  25. MacLean, B. et al. Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics 26, 966–968 (2010).

    Article  CAS  Google Scholar 

  26. Gillet, L.C. et al. Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: a new concept for consistent and accurate proteome analysis. Mol. Cell. Proteomics 11, O111.016717 (2012).

    Article  Google Scholar 

  27. Yang, W. & Zurbenko, I. Kolmogorov-Zurbenko filters. Wiley Interdiscip. Rev. Comput. Stat. 2, 340–351 (2010).

    Article  Google Scholar 

  28. Deutsch, E.W. File formats commonly used in mass spectrometry proteomics. Mol. Cell. Proteomics 11, 1612–1621 (2012).

    Article  Google Scholar 

  29. Frewen, B. & MacCoss, M.J. Using BiblioSpec for creating and searching tandem MS peptide libraries. Curr. Protoc. Bioinformatics 20, 13.7.1–13.7.12 (2007).

    Google Scholar 

  30. Rappsilber, J., Mann, M. & Ishihama, Y. Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips. Nat. Protoc. 2, 1896–1906 (2007).

    Article  CAS  Google Scholar 

  31. Abelin, J.G. et al. Reduced-representation phosphosignatures measured by quantitative targeted MS capture cellular states and enable large-scale comparison of drug-induced phenotypes. Mol. Cell. Proteomics 15, 1622–1641 (2016).

    Article  CAS  Google Scholar 

  32. Vizcaíno, J.A. et al. 2016 update of the PRIDE database and its related tools. Nucleic Acids Res. 44, D447–D456 (2016).

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the NIH (grant U54 HG008097 to J.D.J.). The authors thank S. Tenzer and L. Gillet for providing details of the data sets for the LFQbench study, and K. Clauser for many insightful discussions.

Author information

Authors and Affiliations

Authors

Contributions

R.P. conceived of the methodology, created the software, performed the analyses, and wrote the manuscript. S.A.M. conceived and carried out the experiments involving similar synthetic peptides and DIA-DDA comparison, and helped revise the manuscript. A.S.V.J. carried out the experiments and manual analysis of the synthetic phosphopeptide spike-in experiments. J.D.E. and J.G.A. developed the DIA acquisition method. M.J.M., S.A.C., and J.D.J. provided laboratory resources and guidance on the manuscript.

Corresponding author

Correspondence to Jacob D Jaffe.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–4, Supplementary Notes 1–6 and Supplementary Tables 1–6 (PDF 4934 kb)

Life Sciences Reporting Summary (PDF 133 kb)

Source data

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Peckner, R., Myers, S., Jacome, A. et al. Specter: linear deconvolution for targeted analysis of data-independent acquisition mass spectrometry proteomics. Nat Methods 15, 371–378 (2018). https://doi.org/10.1038/nmeth.4643

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/nmeth.4643

This article is cited by

Search

Quick links

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research