Mass spectrometry searches using MASST

Wang, Mingxun; Jarmusch, Alan K.; Vargas, Fernando; Aksenov, Alexander A.; Gauglitz, Julia M.; Weldon, Kelly; Petras, Daniel; da Silva, Ricardo; Quinn, Robert; Melnik, Alexey V.; van der Hooft, Justin J. J.; Caraballo-Rodríguez, Andrés Mauricio; Nothias, Louis Felix; Aceves, Christine M.; Panitchpakdi, Morgan; Brown, Elizabeth; Di Ottavio, Francesca; Sikora, Nicole; Elijah, Emmanuel O.; Labarta-Bajo, Lara; Gentry, Emily C.; Shalapour, Shabnam; Kyle, Kathleen E.; Puckett, Sara P.; Watrous, Jeramie D.; Carpenter, Carolina S.; Bouslimani, Amina; Ernst, Madeleine; Swafford, Austin D.; Zúñiga, Elina I.; Balunas, Marcy J.; Klassen, Jonathan L.; Loomba, Rohit; Knight, Rob; Bandeira, Nuno; Dorrestein, Pieter C.

doi:10.1038/s41587-019-0375-9

Correspondence
Published: 01 January 2020

Mass spectrometry searches using MASST

Mingxun Wang^1,2,
Alan K. Jarmusch¹,
Fernando Vargas ORCID: orcid.org/0000-0001-5847-2439^1,3,
Alexander A. Aksenov ORCID: orcid.org/0000-0002-9445-2248¹,
Julia M. Gauglitz¹,
Kelly Weldon ORCID: orcid.org/0000-0003-1064-8153^1,4,
Daniel Petras ORCID: orcid.org/0000-0002-6561-3022¹,
Ricardo da Silva¹,
Robert Quinn^1,5,
Alexey V. Melnik¹,
Justin J. J. van der Hooft ORCID: orcid.org/0000-0002-9340-5511^1,6,
Andrés Mauricio Caraballo-Rodríguez ORCID: orcid.org/0000-0001-5499-2728¹,
Louis Felix Nothias¹,
Christine M. Aceves¹,
Morgan Panitchpakdi¹,
Elizabeth Brown¹,
Francesca Di Ottavio⁷,
Nicole Sikora¹,
Emmanuel O. Elijah¹,
Lara Labarta-Bajo³,
Emily C. Gentry¹,
Shabnam Shalapour⁸,
Kathleen E. Kyle ORCID: orcid.org/0000-0002-2837-1124⁹,
Sara P. Puckett¹⁰,
Jeramie D. Watrous ORCID: orcid.org/0000-0001-9124-6783¹¹,
Carolina S. Carpenter⁴,
Amina Bouslimani¹,
Madeleine Ernst¹,
Austin D. Swafford ORCID: orcid.org/0000-0001-5655-8300⁴,
Elina I. Zúñiga³,
Marcy J. Balunas ORCID: orcid.org/0000-0003-2374-4048¹⁰,
Jonathan L. Klassen⁹,
Rohit Loomba^4,12,
Rob Knight ORCID: orcid.org/0000-0002-0975-9019^4,13,14,
Nuno Bandeira^4,14,15 &
…
Pieter C. Dorrestein ORCID: orcid.org/0000-0002-3003-1030^1,4,8,13

Nature Biotechnology volume 38, pages 23–26 (2020)Cite this article

9406 Accesses
133 Citations
118 Altmetric
Metrics details

Subjects

Access through your institution

Buy or subscribe

To the Editor — We introduce a web-enabled mass spectrometry (MS) search engine, named Mass Spectrometry Search Tool (MASST; https://masst.ucsd.edu). By enabling searches of all small-molecule tandem MS (MS/MS) data in public metabolomics repositories, we posit that MASST will unlock these resources for clinical, environmental and natural product applications.

Introduced in 1990, a tool for discovering related protein or gene sequences named Basic Local Alignment Search Tool (BLAST) enabled researchers to query entire public sequence data repositories through a web interface (WebBLAST; https://blast.ncbi.nlm.nih.gov/Blast.cgi)¹. WebBLAST is one of the most widely cited and used bioinformatics tools because it permits any researcher to answer simple questions, such as ‘is a protein or DNA sequence common or rare?’. In the early days of public gene and protein databases, metadata, which include descriptions of sample, population or technical details, were limited. No deposition standards existed, except for the Short Read Archive and European Nucleotide Archive, which include experimental details for sequencing, instrumental details and sample description, such as the source of a sample. The current status of much MS data in the public domain is reminiscent of the DNA databanks of the 1990s. To increase usage and unlock the potential of openly available MS resources, we set out to build an infrastructure to enable WebBLAST for MS.

This is a preview of subscription content, access via your institution

Relevant articles

Open Access articles citing this article.

From MS/MS library implementation to molecular networks: Exploring oxylipin diversity with NEO-MSMS
- Anis Elloumi
- , Lindsay Mas-Normand
- … Jean-Marie Galano
Scientific Data Open Access 13 February 2024
Bile salt hydrolase catalyses formation of amine-conjugated bile acids
- Bipin Rimal
- , Stephanie L. Collins
- … Andrew D. Patterson
Nature Open Access 07 February 2024
microbeMASST: a taxonomically informed mass spectrometry search tool for microbial metabolomics data
- Simone Zuffa
- , Robin Schmid
- … Pieter C. Dorrestein
Nature Microbiology Open Access 05 February 2024

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: MASST search, reporting and match visualization.**

Data availability

All data used for testing and validating MASST are deposited in GNPS/MassIVE. MASST is a web-based application that is embedded in GNPS, which is a community service in which all public data are public. All data underlying figures present in the Supplementary Note are included as Supplementary Data 1 and 2. We cannot provide server installation, software engineers or administrator support for individual installations of MASST. The MASST platform is built as a workflow on top of the web repository workflow platform ProteoSAFe (https://github.com/CCMS-UCSD/ProteoSAFe). Each step of the MASST query is written in Python. Web rendering of the results is displayed by ProteoSAFe in the browser.

Code availability

For those who wish to build out MASST and recruit their own programmers, software engineers and system administrators, we have deposited the code at github (https://github.com/CCMS-UCSD/GNPS_Workflows/tree/master/search_single_spectrum). The standalone MASST query interface is written in Python and Flask with a web front end written in HTML and JavaScript. It is open source (https://github.com/mwang87/GNPS_MASST) and released under an LGPL-3 license.

References

Altschul, S. F. et al. J. Mol. Biol. 215, 403–410 (1990).
Article CAS Google Scholar
Watrous, J. et al. Proc. Natl Acad. Sci. USA 109, 1743–1752 (2012).
Article Google Scholar
Rasche, F. Anal. Chem. 83, 1243–1251 (2011).
Article CAS Google Scholar
Lai, Z. et al. Nat. Methods 15, 53–56 (2018).
Article CAS Google Scholar
Chong, J. et al. Nucleic Acids Res. 46, W486–W494 (2018).
Article CAS Google Scholar
Tautenhahn, R. et al. Anal. Chem. 84, 5035–5039 (2012).
Article CAS Google Scholar
Wishart, D. S. et al. Nucleic Acids Res. 46, D608–D617 (2018).
Article CAS Google Scholar
Aksenov, A. A. et al. Nat. Rev. Chem. 1, 0054 (2017).
Article CAS Google Scholar
Perez-Riverol, Y. et al. Nat. Biotechnol. 35, 406–409 (2017).
Article CAS Google Scholar
Rocca-Serra, P. et al. Metabolomics 12, 14 (2016).
Article Google Scholar
Wang, M. et al. Nat. Biotechnol. 34, 828–837 (2016).
Article CAS Google Scholar
Kirchner, M. et al. J. Proteome Res. 9, 2762–2763 (2010).
Article CAS Google Scholar
Kessner, D. et al. Bioinformatics 24, 2534–2536 (2008).
Article CAS Google Scholar
Horai, H. et al. J. Mass Spectrom. 45, 703–714 (2010).
Article CAS Google Scholar
Sawada, Y. et al. Phytochemistry 82, 38–45 (2012).
Article CAS Google Scholar
Otogo N’Nang, E. et al. Org. Lett. 20, 6596–6600 (2018).
Article Google Scholar
Schymanski, E. L. et al. Metabolites 3, 517–538 (2013).
Article CAS Google Scholar
Kyle, J. E. et al. Bioinformatics 33, 1744–1746 (2017).
Article CAS Google Scholar
Haug, K. et al. Nucleic Acids Res. 41, D781–D786 (2013).
Article CAS Google Scholar
Sud, M. et al. Nucleic Acids Res. 44, D463–D470 (2016).
Article CAS Google Scholar
Mungall, C. J. et al. Genome Biol. 13, R5 (2012).
Article Google Scholar
Schriml, L. M. et al. Nucleic Acids Res. 47, D955–D962 (2019).
Article CAS Google Scholar
Bolyen, E. et al. Nat. Biotechnol. 37, 852–857 (2019).
Article CAS Google Scholar
Gonzalez, A. et al. Nat. Methods 15, 796–798 (2018).
Article CAS Google Scholar
Jarmusch, A.K. et al. Preprint at bioRxiv https://doi.org/10.1101/750471 (2019).
Sumner, L. W. et al. Metabolomics 3, 211–221 (2007).
Article CAS Google Scholar

Download references

Acknowledgements

Conversion of data from different repositories was supported by R03 CA211211 on reuse of metabolomics data. The development of a user-friendly interface was in part supported by Gordon and Betty Moore Foundation through grant GBMF7622. The UC San Diego Center for Microbiome Innovation supported the campus wide SEED grant awards for data collection that enabled the development of much of this infrastructure. A.K.J. thanks the American Society for Mass Spectrometry for the 2018 Postdoctoral Career Development Award. We acknowledge C. O’Donovan and K. Haug for help with navigating the MetaboLights data repository. J.V.D.H. was supported by a ASDI eScience grant (ASDI.2017.030) from the Netherlands eScience Center (NLeSC). E.I.Z. and L.L.-B. were supported by NIH grants AI081923 and AI113923. A.M.C.R., K.E.K., S.P.P., J.L.K., M.J.B. and P.C.D. were supported by NSF grant IOS-1656475. A.B. was supported by National Institute of Justice Award 2015-DN-BX-K047. F.V. was supported by the Department of Navy, Office of Naval Research Multidisciplinary University Research Initiative (MURI) Award, award number N00014-15-1-2809. D.P. was supported by the German Research Foundation (DFG) with grant PE 2600/1. Additional support for data acquisition and data storage was provided by P41 GM103484 Center for Computational Mass Spectrometry, Instrument support though NIH S10RR029121. R.L. is supported by NIH grants R01DK106419, 5P42ES010337 and 5UL1TR001442, and NIH K01DK116917 to J.D.W. The development of the web interface and harmonization with Qiita was in part supported by the Sloan Foundation.

Author information

Authors and Affiliations

Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
Mingxun Wang, Alan K. Jarmusch, Fernando Vargas, Alexander A. Aksenov, Julia M. Gauglitz, Kelly Weldon, Daniel Petras, Ricardo da Silva, Robert Quinn, Alexey V. Melnik, Justin J. J. van der Hooft, Andrés Mauricio Caraballo-Rodríguez, Louis Felix Nothias, Christine M. Aceves, Morgan Panitchpakdi, Elizabeth Brown, Nicole Sikora, Emmanuel O. Elijah, Emily C. Gentry, Amina Bouslimani, Madeleine Ernst & Pieter C. Dorrestein
Ometa Labs LLC, San Diego, CA, USA
Mingxun Wang
Division of Biological Sciences, University of California San Diego, La Jolla, CA, USA
Fernando Vargas, Lara Labarta-Bajo & Elina I. Zúñiga
Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
Kelly Weldon, Carolina S. Carpenter, Austin D. Swafford, Rohit Loomba, Rob Knight, Nuno Bandeira & Pieter C. Dorrestein
Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
Robert Quinn
Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
Justin J. J. van der Hooft
Faculty of Bioscience and Technology for Food, Agriculture, and Environment, University of Teramo, Teramo, TE, Italy
Francesca Di Ottavio
Department of Pharmacology, School of Medicine, University of California San Diego, La Jolla, CA, USA
Shabnam Shalapour & Pieter C. Dorrestein
Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
Kathleen E. Kyle & Jonathan L. Klassen
Division of Medicinal Chemistry, Department of Pharmaceutical Sciences, University of Connecticut, Storrs, CT, USA
Sara P. Puckett & Marcy J. Balunas
Department of Medicine, University of California San Diego, San Diego, California, USA
Jeramie D. Watrous
Division of Gastroenterology, University of California San Diego, La Jolla, CA, USA
Rohit Loomba
Department of Pediatrics, University of California San Diego, La Jolla, CA, USA
Rob Knight & Pieter C. Dorrestein
Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA
Rob Knight & Nuno Bandeira
Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
Nuno Bandeira

Authors

Mingxun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Alan K. Jarmusch
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Vargas
View author publications
You can also search for this author in PubMed Google Scholar
Alexander A. Aksenov
View author publications
You can also search for this author in PubMed Google Scholar
Julia M. Gauglitz
View author publications
You can also search for this author in PubMed Google Scholar
Kelly Weldon
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Petras
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo da Silva
View author publications
You can also search for this author in PubMed Google Scholar
Robert Quinn
View author publications
You can also search for this author in PubMed Google Scholar
Alexey V. Melnik
View author publications
You can also search for this author in PubMed Google Scholar
Justin J. J. van der Hooft
View author publications
You can also search for this author in PubMed Google Scholar
Andrés Mauricio Caraballo-Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Louis Felix Nothias
View author publications
You can also search for this author in PubMed Google Scholar
Christine M. Aceves
View author publications
You can also search for this author in PubMed Google Scholar
Morgan Panitchpakdi
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth Brown
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Di Ottavio
View author publications
You can also search for this author in PubMed Google Scholar
Nicole Sikora
View author publications
You can also search for this author in PubMed Google Scholar
Emmanuel O. Elijah
View author publications
You can also search for this author in PubMed Google Scholar
Lara Labarta-Bajo
View author publications
You can also search for this author in PubMed Google Scholar
Emily C. Gentry
View author publications
You can also search for this author in PubMed Google Scholar
Shabnam Shalapour
View author publications
You can also search for this author in PubMed Google Scholar
Kathleen E. Kyle
View author publications
You can also search for this author in PubMed Google Scholar
Sara P. Puckett
View author publications
You can also search for this author in PubMed Google Scholar
Jeramie D. Watrous
View author publications
You can also search for this author in PubMed Google Scholar
Carolina S. Carpenter
View author publications
You can also search for this author in PubMed Google Scholar
Amina Bouslimani
View author publications
You can also search for this author in PubMed Google Scholar
Madeleine Ernst
View author publications
You can also search for this author in PubMed Google Scholar
Austin D. Swafford
View author publications
You can also search for this author in PubMed Google Scholar
Elina I. Zúñiga
View author publications
You can also search for this author in PubMed Google Scholar
Marcy J. Balunas
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan L. Klassen
View author publications
You can also search for this author in PubMed Google Scholar
Rohit Loomba
View author publications
You can also search for this author in PubMed Google Scholar
Rob Knight
View author publications
You can also search for this author in PubMed Google Scholar
Nuno Bandeira
View author publications
You can also search for this author in PubMed Google Scholar
Pieter C. Dorrestein
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.C.D. and M.W. came up with the concept of MASST. M.W. and N.B. performed the engineering to enable MASST. M.W., A.V.M., A.K.J., J.J.J.v.d.H., J.M.G., M.P., E.O.E., K.W., C.M.A., F.D.O., E.B., A.B., R.Q., M.C., N.S. and S.S. curated metadata. F.V., J.M.G., L.L.-B., K.W., E.B., A.A.A., R.Q., M.C. and C.S.C. generated data for the manuscript. E.C.G. synthesized the bile acids. P.C.D., M.W., D.P., J.D.W., M.J., L.F.N., J.M.G., E.I.Z., L.L.-B., K.E.K., S.P.P., A.M.C.R., A.V.M., F.V., K.W., A.A.A. and S.S. performed experiments and/or analysis for Box 1. P.C.D., D.P., L.F.N., J.J.J.v.d.H., J.M.G., A.A.A., A.M.C.R., F.V., K.W., A.B., F.D.O., M.E. and R.d.S. tested the MASST infrastructure and downloaded public data. P.C.D., N.B., E.I.Z., R.L., R.K., A.D.S., M.J.B. and J.L.K. provided supervision and funding for the project. P.C.D., A.K.J., D.P., J.J.v.d.H., M.E., J.M.G., A.A.A., A.M.C.R., R.K., J.L.K., L.F.N., N.B. and M.W. wrote and edited the manuscript.

Corresponding author

Correspondence to Pieter C. Dorrestein.

Ethics declarations

Competing interests

M.W. is the founder of Ometa Labs LLC and consults for Sirenas, and P.C.D. is on the scientific advisory board of Sirenas.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, M., Jarmusch, A.K., Vargas, F. et al. Mass spectrometry searches using MASST. Nat Biotechnol 38, 23–26 (2020). https://doi.org/10.1038/s41587-019-0375-9

Download citation

Published: 01 January 2020
Issue Date: January 2020
DOI: https://doi.org/10.1038/s41587-019-0375-9

This article is cited by

Genome sequencing and molecular networking analysis of the wild fungus Anthostomella pinea reveal its ability to produce a diverse range of secondary metabolites
- R. Iacovelli
- T. He
- K. Haslinger
Fungal Biology and Biotechnology (2024)
From MS/MS library implementation to molecular networks: Exploring oxylipin diversity with NEO-MSMS
- Anis Elloumi
- Lindsay Mas-Normand
- Jean-Marie Galano
Scientific Data (2024)
Synthesizing and identifying potential biomarkers to explore uncharted biochemistry

Nature (2024)
The changing metabolic landscape of bile acids – keys to metabolism and immune regulation
- Ipsita Mohanty
- Celeste Allaband
- Pieter C. Dorrestein
Nature Reviews Gastroenterology & Hepatology (2024)
Fast mass spectrometry search and clustering of untargeted metabolomics data
- Mihir Mongia
- Tyler M. Yasaka
- Hosein Mohimani
Nature Biotechnology (2024)

Mass spectrometry searches using MASST

Subjects

Relevant articles

From MS/MS library implementation to molecular networks: Exploring oxylipin diversity with NEO-MSMS

Bile salt hydrolase catalyses formation of amine-conjugated bile acids

microbeMASST: a taxonomically informed mass spectrometry search tool for microbial metabolomics data

Access options

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Supplementary Note

Supplementary Data for Supplementary Note Box Figure 1a

Supplementary Data for Supplementary Note Box Figure 1c,d

Supplementary Data for Supplementary Note Box Figure 2a

Rights and permissions

About this article

Cite this article

This article is cited by

Genome sequencing and molecular networking analysis of the wild fungus Anthostomella pinea reveal its ability to produce a diverse range of secondary metabolites

From MS/MS library implementation to molecular networks: Exploring oxylipin diversity with NEO-MSMS

Synthesizing and identifying potential biomarkers to explore uncharted biochemistry

The changing metabolic landscape of bile acids – keys to metabolism and immune regulation

Fast mass spectrometry search and clustering of untargeted metabolomics data

Search

Quick links

Subjects

Relevant articles

Access options

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links