Identification of post-translational modifications by blind search of mass spectra

Tsur, Dekel; Tanner, Stephen; Zandi, Ebrahim; Bafna, Vineet; Pevzner, Pavel A

doi:10.1038/nbt1168

Letter
Published: 27 November 2005

Identification of post-translational modifications by blind search of mass spectra

Dekel Tsur¹^na1,
Stephen Tanner²^na1,
Ebrahim Zandi³,
Vineet Bafna¹ &
…
Pavel A Pevzner¹

Nature Biotechnology volume 23, pages 1562–1567 (2005)Cite this article

2198 Accesses
212 Citations
4 Altmetric
Metrics details

Abstract

Most tandem mass spectrometry (MS/MS) database search algorithms perform a restrictive search that takes into account only a few types of post-translational modifications (PTMs) and ignores all others. We describe an unrestrictive PTM search algorithm, MS-Alignment, that searches for all types of PTMs at once in a blind mode, that is, without knowing which PTMs exist in nature. Blind PTM identification makes it possible to study the extent and frequency of different types of PTMs, still an open problem in proteomics. Application of this approach to lens proteins resulted in the largest set of PTMs reported in human crystallins so far. Our analysis of various MS/MS data sets implies that the biological phenomenon of modification is much more widespread than previously thought. We also argue that MS-Alignment reveals some uncharacterized modifications that warrant further experimental validation.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

SPECTRUM – A MATLAB Toolbox for Proteoform Identification from Top-Down Proteomics Data

Article Open access 02 August 2019

Abdul Rehman Basharat, Kanzal Iman, … Safee Ullah Chaudhary

Identification of modified peptides using localization-aware open search

Article Open access 13 August 2020

Fengchao Yu, Guo Ci Teo, … Alexey I. Nesvizhskii

PepQuery2 democratizes public MS proteomics data for rapid peptide searching

Article Open access 18 April 2023

Bo Wen & Bing Zhang

References

Shu, H., Chen, S., Bi, Q., Mumby, M. & Brekken, D.L. Identification of phosphoproteins and their phosphorylation sites in the wehi-231 b lymphoma cell line. Mol. Cell. Proteomics 3, 279–286 (2004).
Article CAS Google Scholar
Cantin, G.T. & Yates, J.R. Strategies for shotgun identification of post-translational modifications by mass spectrometry. J. Chromatogr. A. 1053, 7–14 (2004).
Article CAS Google Scholar
Yates, J.R., Eng, J.K. & McCormack, A.L. Mining genomes: correlating tandem mass spectra of modified and unmodified peptides to sequences in nucleotide databases. Anal. Chem. 67, 3202–3210 (1995).
Article CAS Google Scholar
Pevzner, P.A., Dančík, V. & Tang, C.L. Mutation-tolerant protein identification by mass spectrometry. J. Comput. Biol. 7, 777–787 (2000).
Article CAS Google Scholar
Pevzner, P.A., Mulyukov, Z., Dancik, V. & Tang, C.L. Efficiency of database search for identification of mutated and modified proteins via mass spectrometry. Genome Res. 11, 290–299 (2001).
Article CAS Google Scholar
Searle, B.C. et al. High-throughput identification of proteins and unanticipated sequence modifications using a mass-based alignment algorithm for MS/MS de novo sequencing results. Anal. Chem. 76, 2220–2230 (2004).
Article CAS Google Scholar
Han, Y., Ma, B. & Zhang, K. SPIDER: software for protein identification from sequence tags with de novo sequencing error. J. Bioinform. Comput. Biol. 3, 697–716 (2005).
Article CAS Google Scholar
Hansen, B.T., Davey, S.W., Ham, A.J. & Liebler, D.C. P-mod: an algorithm and software to map modifications to peptide sequences using tandem MS data. J. Proteome Res. 4, 358–368 (2005).
Article CAS Google Scholar
Tang, W.H. et al. Discovering known and unanticipated protein modifications using MS/MS database searching. Anal. Chem. 77, 3931–3946 (2005).
Article CAS Google Scholar
Searle, B.S. et al. Identification of protein modifications using MS/MS de novo sequencing and the Opensea alignment algorithm. J. Proteome Res. 4, 546–554 (2005).
Article CAS Google Scholar
MacCoss, M.J., Wu, C.C. & Yates, J.R. Probability-based validation of protein identifications using a modified SEQUEST algorithm. Anal. Chem. 74, 5593–5599 (2002).
Article CAS Google Scholar
Keller, A. et al. Experimental protein mixture for validating tandem mass spectral analysis. OMICS 6, 207–212 (2002).
Article CAS Google Scholar
Tanner, S. et al. Inspect: fast and accurate identification of post-translationally modified peptides from tandem mass spectra. Anal. Chem. 77, 4626–4639 (2005).
Article CAS Google Scholar
Craig, R. & Beavis, R.C. A method for reducing the time required to match protein sequences with tandem mass spectra. Rapid Commun. Mass Spectrom. 17, 2310–2316 (2003).
Article CAS Google Scholar
Yates, J.R., Eng, J.K., McCormack, A.L. & Schieltz, D. Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. Anal. Chem. 67, 1426–1436 (1995).
Article CAS Google Scholar
Tabb, D.L. et al. Statistical characterization of ion trap tandem mass spectra from doubly charged tryptic peptides. Anal. Chem. 75, 1155–1163 (2003).
Article CAS Google Scholar
Perkins, D.N., Pappin, D.J., Creasy, D.M. & Cottrell, J.S. Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 20, 3551–3567 (1999).
Article CAS Google Scholar
Nesvizhskii, A.I., Keller, A., Kolker, E. & Aebersold, R. A statistical model for identifying proteins by tandem mass spectrometry. Anal. Chem. 75, 4646–4658 (2003).
Article CAS Google Scholar
Razumovskaya, J. et al. A computational method for assessing peptide-identification reliability in tandem mass spectrometry analysis with sequest. Proteomics 4, 961–969 (2004).
Article CAS Google Scholar
Frank, A., Tanner, S.W., Bafna, V. & Pevzner, P.A. Peptide sequence tags for fast database search in mass-spectrometry. J. Proteome Res. 4, 1287–1295 (2005).
Article CAS Google Scholar
Elias, J.E., Gibbons, F.D., King, O.D., Roth, F.P. & Gygi, S.P. Intensity-based protein identification by protein learning from a library of tandem mass spectra. Nat. Biotechnol. 22, 214–219 (2004).
Article CAS Google Scholar
Havilio, M., Haddad, Y. & Smilansky, Z. Intensity-based statistical scorer for tandem mass spectrometry. Anal. Chem. 75, 435–444 (2003).
Article CAS Google Scholar
Anderson, D.C., Li, W., Payan, D.G. & W.S., Noble A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: support vector machine classification of peptide MS/MS spectra and SEQUEST scores. J. Proteome Res. 2, 137–146 (2003).
Article CAS Google Scholar
Geer, L.Y. et al. Open mass spectrometry search algorithm. J. Proteome Res. 3, 958–964 (2004).
Article CAS Google Scholar

Download references

Acknowledgements

This project was supported by National Institutes of Health grant NIGMS 1-R01-RR16522. We are grateful to Brian Searle and Larry David for making their lens data set available and to Larry David, Katalin Medzihradszky and Philip Wilmarth for many useful discussions. Production of the lens data set was supported by National Eye Institute grant EY007755. This research was supported in part by the UCSD FWGrid Project, NSF Research Infrastructure Grant Number EIA-0303622. Production of the IKKb data set was supported by NIH grant R01GM65325 and by the Pew Scholars Program.

Author information

Dekel Tsur and Stephen Tanner: These authors contributed equally to this work.

Authors and Affiliations

Department of Computer Science and Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, 92093-0404, California, USA
Dekel Tsur, Vineet Bafna & Pavel A Pevzner
Bioinformatics Program, University of California, San Diego, 9500 Gilman Drive, La Jolla, 92093-0419, California, USA
Stephen Tanner
Molecular Microbiology and Immunology, School of Medicine, Univ. Southern California, 2011 Zonal Avenue, HMR 401, Los Angeles, 90033, California, USA
Ebrahim Zandi

Authors

Dekel Tsur
View author publications
You can also search for this author in PubMed Google Scholar
Stephen Tanner
View author publications
You can also search for this author in PubMed Google Scholar
Ebrahim Zandi
View author publications
You can also search for this author in PubMed Google Scholar
Vineet Bafna
View author publications
You can also search for this author in PubMed Google Scholar
Pavel A Pevzner
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Stephen Tanner.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tsur, D., Tanner, S., Zandi, E. et al. Identification of post-translational modifications by blind search of mass spectra. Nat Biotechnol 23, 1562–1567 (2005). https://doi.org/10.1038/nbt1168

Download citation

Received: 06 June 2005
Accepted: 20 October 2005
Published: 27 November 2005
Issue Date: 01 December 2005
DOI: https://doi.org/10.1038/nbt1168

This article is cited by

Proteome-Wide Analyses Reveal the Diverse Functions of Lysine 2-Hydroxyisobutyrylation in Oryza sativa
- Chao Xue
- Zhongying Qiao
- Zhiyun Gong
Rice (2020)
Identification of modified peptides using localization-aware open search
- Fengchao Yu
- Guo Ci Teo
- Alexey I. Nesvizhskii
Nature Communications (2020)
Increased diversity of peptidic natural products revealed by modification-tolerant database search of mass spectra
- Alexey Gurevich
- Alla Mikheenko
- Pavel A. Pevzner
Nature Microbiology (2018)
Informed-Proteomics: open-source software package for top-down proteomics
- Jungkap Park
- Paul D Piehowski
- Sangtae Kim
Nature Methods (2017)
Metabolic regulation of gene expression through histone acylations
- Benjamin R. Sabari
- Di Zhang
- Yingming Zhao
Nature Reviews Molecular Cell Biology (2017)

Identification of post-translational modifications by blind search of mass spectra

Abstract

Access options

Similar content being viewed by others

SPECTRUM – A MATLAB Toolbox for Proteoform Identification from Top-Down Proteomics Data

Identification of modified peptides using localization-aware open search

PepQuery2 democratizes public MS proteomics data for rapid peptide searching

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Supplementary Fig. 1

Supplementary Fig. 2

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Supplementary Methods (PDF 92 kb)

Rights and permissions

About this article

Cite this article

This article is cited by

Proteome-Wide Analyses Reveal the Diverse Functions of Lysine 2-Hydroxyisobutyrylation in Oryza sativa

Identification of modified peptides using localization-aware open search

Increased diversity of peptidic natural products revealed by modification-tolerant database search of mass spectra

Informed-Proteomics: open-source software package for top-down proteomics

Metabolic regulation of gene expression through histone acylations

Search

Quick links

Abstract

Access options

Similar content being viewed by others

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links