To the Editor — The importance of providing open and transparent solutions to use, share, inspect and reproduce mass spectrometry (MS) data analysis is becoming more apparent. To address these challenges, we developed a collaborative interactive web application, the GNPS Dashboard (https://gnps-lcms.ucsd.edu), to enable remote and synchronous collaborative research in a common analysis environment for MS data analysis (similar to collaborative text editing in browser-based word processors, such as Google Docs).
The disruptions to office and laboratory workspaces resulting from the COVID-19 pandemic, including campus closures, telework and stay-at-home orders, increased the need for online collaborative approaches to perform scientific research. Most solutions for the analysis of MS data are geared for exploration on a local workstation that lacks synchronous collaborative data exploration, requiring specific software packages, knowledge of data storage locations, familiarity with file transfer protocols and often conversion of data into compatible formats1,2,3,4. Commercial data exploration solutions usually lack universality and are costly, creating additional hurdles to data accessibility, preventing collaborators, reviewers and readers of scientific publications from inspecting MS data to verify interpretations. However, commercial software offers deep access to proprietary vendor-specific data formats and unique data analysis capabilities, for example, creation of private databases. More broadly, as scientific data are becoming more complex, scientific reproducibility is a challenge. Therefore, there is a growing mandate by both funding bodies and journals to make mass spectrometry data (and other data) adherent to FAIR (findable, accessible, interoperable and reusable).
To empower remote collaborations or classroom teaching, the GNPS Dashboard includes leader–follower synchronization (real-time updates from one lead user) and fully collaborative synchronization (real-time updates from multiple users). Followers can disconnect the synchronization to continue the analysis from where the leader left off without needing to reload the data. The GNPS Dashboard leader–follower paradigm has already been used in classroom teaching by at least five institutions, including undergraduate institutions (Supplementary Note 1 and 2). The fully collaborative synchronization enables multi-user visualization and data exploration similar to online synchronous collaborative document editing. For example, users can initiate a collaborative session with two or more people on any web-accessible device (providing a dashboard collaboration start link, instructions, video tutorial and video application). A snapshot of the collaborative work is automatically created and can be shared with collaborators or included in publications. The final state of the analysis is saved together with every discrete action, enabling inspection of the evolution of data analysis.
The GNPS Dashboard includes tools that facilitate the exploration of liquid and gas chromatography–mass spectrometry data for collaborative examination, and hands-on teaching of MS concepts using private and publicly available MS data, including files stored in the MS data repositories GNPS/MassIVE5, MetaboLights6, ProteomeXchange7 and Metabolomics Workbench8 (Fig. 1; providing a link to instructions and video tutorial). All public data from compatible repositories can be selected using the GNPS dataset explorer (https://gnps-explorer.ucsd.edu/). Files not deposited in public MS repositories can be uploaded through a drag-and-drop option for file transfer. mzXML, mzML, CDF and raw formats are compatible with GNPS Dashboard. Through deep linking from the GNPS platform, the GNPS Dashboard serves as a data explorer and hub for further data analysis, including Molecular Networking5, gas chromatography–mass spectrometry deconvolution9, in silico annotation and Mass Spectrometry Search Tool (MASST10) (Supplementary Note 2).
The GNPS Dashboard has been integrated into the web-accessible MS ecosystem. The top and left panels show how the GNPS Dashboard can ingress datasets from public metabolomics resources: MetaboLights, Metabolomics Workbench and GNPS/MassIVE; and from proteomics resources: PRIDE and MassIVE, private user uploads and private GNPS analysis tasks. The bottom panel shows that users can launch downstream analysis on their data directly out of the GNPS Dashboard. Finally, the right panel shows how the GNPS Dashboard and its visualizations can interact with the wider community with reproducible URL or QR code link outs from any analysis and teaching/synchronous collaboration modes for real-time interactivity.
The GNPS Dashboard’s visualization tool enables inspection of total ion chromatograms, retention time versus m/z heat map, extracted ion chromatograms, tandem mass spectra for inspection/visualization of individual compounds and quantitative comparison of the peak abundances of two groups as boxplots — all with publication-quality figures. The dashboard aids peer review of scientific manuscripts and inspection of public quantitative MS data to validate published results. Beyond visualization and analysis of MS data, the dashboard supports the development of other bioinformatics tools that do not have their own web-enabled user interfaces. Documentation and tutorials are available in Supplementary Note 1 and Supplementary Fig. 1.
The GNPS Dashboard encodes the visualization in a shareable URL, which empowers users to share the exact same visualization with colleagues, thus reducing miscommunication and improving data transparency, for example, during remote meetings. Every visualization and analysis result can be shared via a URL that will re-launch the original data visualization on their device, along with the history of the analysis (up to 1,000 steps per session). Users can share these links with collaborators and embed them in publications, presentations or social media posts (for example, Example Tweet). The final visualization can also be shared as a quick response (QR) code. Anyone with a link or QR code can build on the analysis and re-share their additions. Links and QR codes will remain valid for data that have been archived in a public repository.
Together, the capabilities of the GNPS Dashboard should help improve MS data accessibility and interoperability, lower barriers of entry for MS data analysis in the research environment and the classroom, encourage data transparency and sharing, and strengthen the reproducibility of MS data analysis.
Code availability
While the GNPS Dashboard is accessible as a free public web service, it is possible to locally install the GNPS Dashboard to function with local data sources, making collaborative analysis and sharing possible privately within an institution when necessary. The source code is available through the GNPS web environment and GitHub, enabling quick installation on local servers. The GNPS Dashboard source code can be found on GitHub: https://github.com/mwang87/GNPS_LCMSDashboard under a modified UCSD BSD License. The GNPS Dataset Explorer source code can be found on GitHub: https://github.com/mwang87/GNPS_DatasetExplorer under an MIT License. Further implementation details can be found in Supplementary Methods.
References
Pluskal, T. et al. BMC Bioinform. 11, 395 (2010).
Tsugawa, H. et al. Nat. Methods 12, 523–526 (2015).
Röst, H. L. et al. Nat. Methods 13, 741–748 (2016).
Huang, Y.-C. et al. J. Cheminform. 13, 8 (2021).
Wang, M. et al. Nat. Biotechnol. 34, 828–837 (2016).
Haug, K. et al. Nucleic Acids Res. 48, D440–D444 (2020).
Vizcaíno, J. A. et al. Nat. Biotechnol. 32, 223–226 (2014).
Sud, M. et al. Nucleic Acids Res. 44, D463–D470 (2016).
Aksenov, A. A. et al. Nat. Biotechnol. 39, 169–173 (2021).
Wang, M. et al. Nat. Biotechnol. 38, 23–26 (2020).
Acknowledgements
This work was, in part, supported by the National Institutes of Health (NIH) with grant numbers U19AG063744, U2CDK119886, OT2 OD030544, GM107550, R03CA211211, R24GM127667, 1R01LM013115 and P41GM103484, the National Science Foundation (NSF) with grants IOS-1656475 and ABI 1759980, and the Gordon and Betty Moore Foundation with grant GBMF7622. D.P. was supported by the Deutsche Forschungsgemeinschaft through the CMFI Cluster of Excellence (EXC 2124). V.V.P. was supported by NIH grant R35GM128690. N.B. and B.P. were supported by NIH grant P41-GM103484. M.T.M. was supported by NSF grant CHE-1845230. D.B. was supported by NSF grant DUE 16-25354. The I.M laboratory was supported by grants from the European Research Council (grant number 640384) and from the Israel Science Foundation (ISF number 1947/19). T.R.N., B.B. and K.L. were supported by the US Department of Energy Joint Genome Institute, a DOE Office of Science User Facility, which is supported by the Office of Science of the US Department of Energy under contract number DE-AC02-05CH11231. A.W.P. and D.A.C.J. were supported by NIH grant R00 GM 118762. This research was supported in part by the Intramural Research Program of the National Institute of Environmental Health Sciences of the NIH (ES103363-01). Furthermore, we would like to thank T. de Rond, L.-I. McCall, W. Bittremieux, K. Weldon and E. Gentry for testing the software and suggesting updates. We would like to thank V. Sharma, for working on the standalone MSView app during her Masters degree research 12 years ago, which provided some visualization inspiration for the GNPS Dashboard. We thank C. O’Donovan and the team at MetaboLights for developing a well-documented API. Lastly, we thank all members of the research community who make their data publicly accessible, which contributes to open, transparent and reproducible science.
Author information
Authors and Affiliations
Contributions
M.W. and D.P. conceived the project. M.W. developed the software for GNPS Dashboard and GNPS Dataset Explorer. M.W., D.P., V.V.P. and P.C.D. provided guidance and supervision. M.W., D.P., V.V.P., K.N.M., A.T.A., A.K.J., B.P., D.A.C.J., A.W.P., C.M.S., M.T., N.G., N.B. and P.C.D. tested and offered user feedback for the GNPS Dashboard. S.S. and E.F. implemented the APIs in Metabolomics Workbench to support GNPS Dashboard. T.N., B.B. and K.L. facilitated making metabolomics data in Genome Portal available in GNPS Dashboard. M.W., D.P., V.V.P., K.N.M., A.T.A., A.K.J., S.A.J., D.A., B.P., D.A.C.J., A.W.P., C.M.S., M.T., A.E.A., R.S., N.B., S.B., M.T.M., D.B.O., M.N.H., R.G., M.M.M., I.M., J.H., F.V.C., J.M.D., N.G., N.B., R.L.N. and P.C.D. wrote use cases for the GNPS Dashboard Supplemental Information. P.C.D. provided funding for the project. P.C.D., M.W., V.V.P. and D.P. wrote the draft manuscript. All authors edited and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
P.C.D. is a scientific advisor of Sirenas, Galileo and Cybele, and scientific advisor and co-founder of Ometa Labs LLC and Enveda with approval by the UC San Diego. M.W. is a founder of Ometa Labs LLC. T.R.N. is an advisor of Brightseed Bio.
Additional information
Peer review information Nature Methods thanks Juan Antonio Vizcaino, Jean-Luc Wolfender and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Supplementary Information
Supplementary Information
Supplemental Notes 1 and 2, Supplementary Methods, Data Availability, Supplementary Fig. 1.
Rights and permissions
About this article
Cite this article
Petras, D., Phelan, V.V., Acharya, D. et al. GNPS Dashboard: collaborative exploration of mass spectrometry data in the web browser. Nat Methods 19, 134–136 (2022). https://doi.org/10.1038/s41592-021-01339-5
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41592-021-01339-5
This article is cited by
-
Eco-Metabolomics Applied to the Chemical Ecology of Poison Frogs (Dendrobatoidea)
Journal of Chemical Ecology (2023)
-
Quick tips for re-using metabolomics data
Nature Cell Biology (2022)
-
Listeria monocytogenes exposed to antimicrobial peptides displays differential regulation of lipids and proteins associated to stress response
Cellular and Molecular Life Sciences (2022)