Inter-laboratory mass spectrometry dataset based on passive sampling of drinking water for non-target analysis

Schulze, Bastian; van Herwerden, Denice; Allan, Ian; Bijlsma, Lubertus; Etxebarria, Nestor; Hansen, Martin; Merel, Sylvain; Vrana, Branislav; Aalizadeh, Reza; Bajema, Bernard; Dubocq, Florian; Coppola, Gianluca; Fildier, Aurélie; Fialová, Pavla; Frøkjær, Emil; Grabic, Roman; Gago-Ferrero, Pablo; Gravert, Thorsten; Hollender, Juliane; Huynh, Nina; Jacobs, Griet; Jonkers, Tim; Kaserzon, Sarit; Lamoree, Marja; Le Roux, Julien; Mairinger, Teresa; Margoum, Christelle; Mascolo, Giuseppe; Mebold, Emmanuelle; Menger, Frank; Miège, Cécile; Meijer, Jeroen; Moilleron, Régis; Murgolo, Sapia; Peruzzo, Massimo; Pijnappels, Martijn; Reid, Malcolm; Roscioli, Claudio; Soulier, Coralie; Valsecchi, Sara; Thomaidis, Nikolaos; Vulliet, Emmanuelle; Young, Robert; Samanipour, Saer

doi:10.1038/s41597-021-01002-w

Download PDF

Data Descriptor
Open access
Published: 24 August 2021

Inter-laboratory mass spectrometry dataset based on passive sampling of drinking water for non-target analysis

Bastian Schulze¹,
Denice van Herwerden ORCID: orcid.org/0000-0003-1940-9415²,
Ian Allan³,
Lubertus Bijlsma ORCID: orcid.org/0000-0001-7005-8775⁴,
Nestor Etxebarria⁵,
Martin Hansen ORCID: orcid.org/0000-0002-4663-8742⁶,
Sylvain Merel⁷,
Branislav Vrana⁸,
Reza Aalizadeh⁹,
Bernard Bajema¹⁰,
Florian Dubocq¹¹,
Gianluca Coppola¹²,
Aurélie Fildier¹³,
Pavla Fialová⁸,
Emil Frøkjær⁶,
Roman Grabic¹⁴,
Pablo Gago-Ferrero¹⁵,
Thorsten Gravert ORCID: orcid.org/0000-0003-1194-267X⁶,
Juliane Hollender ORCID: orcid.org/0000-0002-4660-274X¹⁶,
Nina Huynh¹⁷,
Griet Jacobs¹⁸,
Tim Jonkers ORCID: orcid.org/0000-0002-6415-6173¹⁹,
Sarit Kaserzon¹,
Marja Lamoree¹⁹,
Julien Le Roux¹⁷,
Teresa Mairinger^16,20,
Christelle Margoum⁷,
Giuseppe Mascolo²¹,
Emmanuelle Mebold²²,
Frank Menger ORCID: orcid.org/0000-0002-1808-8835²³,
Cécile Miège⁷,
Jeroen Meijer ORCID: orcid.org/0000-0002-4551-6823¹⁹,
Régis Moilleron¹⁷,
Sapia Murgolo²¹,
Massimo Peruzzo¹³,
Martijn Pijnappels²⁴,
Malcolm Reid³,
Claudio Roscioli ORCID: orcid.org/0000-0002-0857-4143²⁵,
Coralie Soulier²⁶,
Sara Valsecchi ORCID: orcid.org/0000-0003-2200-3475²⁵,
Nikolaos Thomaidis ORCID: orcid.org/0000-0002-4624-4735⁹,
Emmanuelle Vulliet¹¹,
Robert Young ORCID: orcid.org/0000-0001-7485-0604^6,27 &
…
Saer Samanipour ORCID: orcid.org/0000-0001-8270-6979^2,3

Scientific Data volume 8, Article number: 223 (2021) Cite this article

4336 Accesses
18 Citations
12 Altmetric
Metrics details

Subjects

Abstract

Non-target analysis (NTA) employing high-resolution mass spectrometry is a commonly applied approach for the detection of novel chemicals of emerging concern in complex environmental samples. NTA typically results in large and information-rich datasets that require computer aided (ideally automated) strategies for their processing and interpretation. Such strategies do however raise the challenge of reproducibility between and within different processing workflows. An effective strategy to mitigate such problems is the implementation of inter-laboratory studies (ILS) with the aim to evaluate different workflows and agree on harmonized/standardized quality control procedures. Here we present the data generated during such an ILS. This study was organized through the Norman Network and included 21 participants from 11 countries. A set of samples based on the passive sampling of drinking water pre and post treatment was shipped to all the participating laboratories for analysis, using one pre-defined method and one locally (i.e. in-house) developed method. The data generated represents a valuable resource (i.e. benchmark) for future developments of algorithms and workflows for NTA experiments.

Measurement(s)	chemical • drinking water
Technology Type(s)	high resolution mass spectrometry • non-target analysis • Interlaboratory
Factor Type(s)	method
Sample Characteristic - Environment	laboratory environment

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.15028665

Integrating different tools and technologies to advance drinking water quality exposure assessments

Article 08 August 2023

Monitoring pollution pathways in river water by predictive path modelling using untargeted GC-MS measurements

Article Open access 28 June 2023

Reliability of urinary charged metabolite concentrations in a large-scale cohort study using capillary electrophoresis-mass spectrometry

Article Open access 01 April 2021

Background & Summary

Non-target analysis (NTA) using high-resolution mass spectrometry (HRMS) is the most comprehensive approach for the screening and discovery of organic compounds/chemicals of emerging concern (CECs) in complex environmental samples^{1,2,3,4,5,6,7}. This strategy is a bottom up approach with minimum a priori assumptions and/or knowledge regarding the samples and the CECs^5,8,9,10. The recent surge in the number of newly identified CECs in different environmental compartments is a testimonial to the power of this technology^11,12,13.

The main drawback of NTA is its complexity. Long and laborious processes increase the likelihood of false positive and false negative results, and the resulting data is difficult to interpret and can suffer from poor reproducibility^{1,7,8,14,15,16,17,18}. Additionally, the wide variety of chemicals with different physico-chemical properties and variable concentration ranges make NTA an extremely challenging task^4,19.

Several sophisticated algorithms and workflows have been developed in the past decade to tackle the complexity of NTA data^{3,5,20,21,22,23}. Each of these approaches attempted to address one or more steps in the process, from noise removal, over peak picking to identification and finally communication of the confidence level of the identified CECs. While these algorithms have shown increasing success, there still remain some challenges with NTA and the underlying assumptions. Accordingly, recent studies have highlighted that further improvements are needed to be able to generate reproducible results^8,15,24,25.

International collaborative efforts, such as raw data sharing (e.g. FAIR Principles^26,27), novel CEC sharing (e.g. NormaNews²⁸), community based spectral libraries (e.g. MassBank EU²⁹ and MoNA), and former inter-laboratory studies (ILSs) have shown great potential in highlighting these shortfalls. Such work is crucial to steer future research and ensure success^{9,12,13,21,28}. An example of such effort was the NTA collaborative trial organized by Norman Network^9,10, which promoted a clear reporting strategy of the confidence levels in the identified CECs in complex samples within the environmental chemistry field^9,10.

In this study, we present the data collected during an international collaborative ILS organized through the Norman Network. This study aimed at assessing the uncertainty in the identified compounds caused by different NTA workflows and spectral databases. We used a passive sampling (PS) strategy to sample river water at a drinking water intake, and post treatment for the production of drinking water. This will enable us to sample substances (i) present in river water at the drinking water intake, (ii) present in drinking water, as well identify those (iii) removed during treatment and (iv) generated during drinking water treatment.

PS enables to pre-concentrate chemicals from a complex matrix while leaving behind a significant proportion of unwanted matrix affecting the performance of the analysis, therefore allowing to sample chemicals often found at trace-levels in water and in the environment^30,31. PS devices accumulate chemicals through diffusion over time when deployed in water (Table 1). When uptake kinetics are known and understood, it is possible to relate the amount of a chemical accumulated in a sampler to its time-averaged concentration in water for the period of deployment through a sampling rate, an equivalent volume of water extracted by the sampler per unit of time. This allows the collection of a more representative sample than when using spot sampling.

Table 1 Overview of samples provided to participants.

Full size table

All participants in this ILS were given the same “ready for injection” samples consisting of the two surface water and two drinking water PS extracts. They were requested to analyze these PS extracts using one pre-defined method for liquid chromatography (LC) and mass spectrometry (MS), Table 2. Participants were also required to analyze the samples via their “in-house” methods. The participants were allowed the freedom to use any data processing strategy for the identification of CECs in those samples (details are provided in section Experimental Design).

Table 2 Gradient programme for the pre-defined method using a 150 mm × 2.1 mm column with 1.8 μm particles, with a flowrate of 0.4 ml/min.

Full size table

This dataset includes the raw data from 21 different laboratories (Fig. 1), using different reversed-phase LC (RPLC) columns and instrumentations (Tables 3 and 4, respectively), chromatographic gradients, and MS acquisitions. This dataset provides the means to assess the impact of method transfer, chromatographic conditions, and data processing workflows on the results of NTA approaches. Additionally, this dataset may, potentially, be used as a benchmark for future development of algorithms for NTA workflows.

Table 3 Columns used for the pre-defined method and the in-house method, as indicated by the participant; the number of participants using the same column is indicated in brackets.

Full size table

Table 4 Manufacturers and models of MS instruments used by the participants.

Full size table

Methods

Experimental design

The participants were given a set of samples and were asked to perform a complete NTA/extended suspect screening workflow (i.e. RPLC coupled with HRMS) using two different experimental approaches. A harmonized approach, hereafter referred to as pre-defined method (details are provided in the section “pre-defined method”) and an individually developed in-house method. Finally, the participants were requested to provide: the raw data (i.e. vendor format), the converted files (mzMXL format), the raw feature list associated with each sample, the top 50 identified features including the level of confidence in the identifications, based on the Schymanski scale⁹.

The experimental design (Fig. 2) allows the systematic assessment of the impact of the method transfer, chromatography approach, and data acquisition on the explored chemical space. Furthermore, it enables the evaluation of the impact of different data processing strategies on the identified features and the level of confidence associated to the identifications according to the tiered levels proposed by Schymanski et al.^9,10.

Sample set

A sample set including six vials was shipped to all 21 participants of this ILS. The vials consisted of four extracts from passive samplers (PS), one procedural blank, and a mixture of internal standards (IS) for the retention time modeling developed by the University of Athens (Table 1)³².

To generate the samples, integrative sampling at both the input and the output of a drinking water treatment plant (i.e. the source river water and drinking water, respectively) was carried out in June 2019 using Horizon Atlantic® HLB-L disks (47 mm diameter) with exposure times of 2 and 4 days. A detailed description of the sampling and sample preparation is given in supporting information (SI).

HLB disks were chosen as the receiving phase as the HLB sorbent has been shown to be suitable for the sampling of a variety of substances with different physicochemical properties, including compounds with low logD/Kow or ionized at river/drinking water pH^33,34. However, limitations exist, particularly in the very low logD range.

In order to increase the sampling rate of the chemicals into the PS, the samplers were placed in a “dynamic” PS device (DPS), consisting of an electrically driven large volume water pumping device coupled to a PS exposure cell³⁵. A total of 26 discs were exposed to treated drinking water and the same number to source water.

The exposed HLB disks, once in the lab, were kept in the freezer at −20 °C until extraction. The frozen disks were then freeze-dried for 48 hours to remove any water residues. The dried disks were spiked with six isotope-labeled ISs (i.e. Caffeine-13C3, Nicotine-D4, Cotinine-D3, Simazine-D10, Carbamazepine-D10, and Diuron-D6) prior to extraction. The spiking level was set at 50 ng/mL of each IS in the final extract, assuming a 100% recovery.

For the extraction, we used a previously tested wide-scope approach, which consisted of three consecutive extractions of the PS with 200 mL of acetone for 24 hr. The final extracts, then, were solvent exchange to methanol, following the US EPA method 3570³⁶. The extracts of 13 different PS were pooled together to obtain homogeneous samples/extracts that were shipped to the participants. We also included a procedural blank, which consisted of the pooled extracts of seven PS brought to the field without being exposed to water.

In order to estimate the approximate volume of water sampled by the HLB disk PS, 0.5 mm thick silicone sheets made of AlteSil™ (Altec, UK) with the exposed surface area of 200 cm² were co-deployed in the DPS next to the HLB disks. Prior to exposure, the silicone sheets were spiked with 14 performance reference compounds (PRCs)³⁵. The loss of PRCs from silicone during exposure was applied to calculate the mass transfer coefficient (MTC) of PRCs through the water boundary layer (WBL). The estimated volumes of water extracted by HLB disks are based on the assumption of the same WBL-controlled MTC in silicone and HLB disks, by the approach shown by Vrana et al.³⁵ and were 190 and 346 L for the river exposures for 2 and 4 days, 160 and 295 L for the drinking water exposures.

Considering the 40 times dilution used to prepare individual vials for distribution to participating laboratories, vials 1 to 4 contained equivalent volumes of water sampled by the HLB discs of 4.8, 8.7, 4.0 ad 7.4 L, respectively (Table 1). This can be applied to substances accumulated linearly over the exposure time by the HLB discs under boundary-layer controlled uptake. This is not necessarily applicable to all substances and samples.

Pre-defined method

The pre-defined method consisted of two components, namely: 1) LC separation and 2) mass spectrometric data acquisition. The participants were given detailed instructions on how to set up the exact conditions on their instruments. Additionally, the participants were given access to a set of tools (e.g. method transfer and column chemistry assessment - https://find.waters.com/ColumnCoach/existingcolumn/column) to find the closest conditions to the instructions in the pre-defined method³⁷.

1)
Separation: the mobile phase consisted of 5 mM aqueous ammonium formate adjusted to pH 3 (A) and acetonitrile (ACN) containing 0.1% formic acid (B) used in a 15 min gradient elution (Table 2). The flow was given at 0.4 ml/min for a 150 mm × 2.1 mm × 1.8 μm C18 (Acquity UPLC BEH C18). The injection volume for this method was set to 5 µL. The pre-defined method was a wide scope generic method for analysis of pharmaceuticals and pesticides in environmental samples^16,38.
2)
Data acquisition: the participants were requested to run their samples in a data-independent acquisition mode. The mass range was set between 50 and 900 Da. For the quadrupole-time-of-flight (QTOF) instruments the sampling rate was limited to 2 Hz while for Orbitrap instruments the participants were advised to run their samples with maximum of 60,000 resolution. For the SWATH experiments, the number of windows was limited to 10. The method required acquisition in positive-polarity mode using an electrospray-ionization source. For the pseudo MS2 spectra, the participants were asked to perform their experiments either using a collision energy of 10–45 eV as a ramp or an average collision energy of 30–40 eV.

It should be noted that the observed diversity in the pre-defined method may or may not translate into the same level of diversity in the explored chemical space. In fact this can only be assessed by systematic re-processing of the data using open source/access workflows, which will be the subject of future studies by the consortium.

Variations in the pre-defined method

Twenty out of the 21 participants, depending on the availability of instrumentations in their labs, implemented some changes in the pre-defined method, employing the provided tools by the organizers.

Four out of 21 participants opted for reconstitution of the extracts into ACN/water 50/50, ACN/water 14/86, methanol/water 10/90 or methanol, prior to the separation. One participant mixed the sample with mobile phase A (see above) at a 1:1 volume ratio.

As for the injection volumes, 16 participants followed the instructions in the pre-defined method (i.e. 5 µL) while three participants opted for 10 µL injection volume. One of the participants performed a large volume injection of 200 µL.

All participants used C18 columns for both the pre-defined and in-house methods. 14 out of 21 participants used the 0.4 mL/min flow rate suggested by the pre-defined method while six participants used different flow rates between 0.21 and 0.3 mL/min and one of 1 mL/min. Additionally, it should be noted that not all the participants employed the same column for both pre-defined and in-house methods (Table 3).

In terms of mass spectrometers, 8 Thermo Fisher Scientific, 6 Waters, 3 Bruker, 3 Sciex and 1 Agilent instruments were used for the data acquisition. All eight Thermo Fisher Scientific instruments were Orbitraps, while all other instruments relied on the time-of-flight (TOF) principle (Table 4). All the participants were asked to perform the analysis via data independent acquision (DIA). This acquisition mode of accurate-mass full-scan spectra under different collision induced dissociation conditions within a single injection is known under different names depending on the manufacturer: MSE in the case of Waters, broadband collision-induced dissociation (bbCID) in the case of Bruker, or all-ions MS/MS in the case of Agilent. Likewise for Q-Orbitrap instruments from Thermo, this type of acquisition is also possible, and known as All Ion Fragmentation (AIF) or variable Data-Independent Analysis (vDIA). An alternative DIA was Sequential Window Acquisition of All Theoretical Mass Spectra (SWATH-MS), which maily performed via AB Sciex instruments.

For the acquisition mode and collision energies, all participants followed the instructions provided. As for the mass range, 17 used the range defined in the pre-defined method whereas four participants used 75–950, 100–900, or 77–1000 Da.

In-house method

The participants were also asked to analyze the extracts using an in-house method. Six out of 21 participants decided to reconstitute the sample in new solvent, five of which in methanol/water (4.8%, 10%, 20%, 50% and 100% methanol) and one in ACN/water 1/99. Injection volumes, in this case, ranged from 2 μL to 1000 μL, with 16 institutes using 5 μL, similar to the pre-defined method, and two using 10 μL.

Commonly applied flow rates ranged from 0.2 mL/min to 0.5 ml/min with the median at 0.375 mL/min while there were two cases with 1 mL/min and 0.0002 mL/min (nano LC). Two participants used flow gradients (0.2–0.48 ml/min and 0.3–0.4 ml/min).

As for column chemistries, we observed a greater diversity in the used columns when compared to the pre-defined method. More specialized columns were used for the in-house method. However, some participants used the same column for both methods (Table 3). The used mobile phases, varied mostly in the concentration of additives: water with ammonium acetate or formate (up to 10 mM) and formic acid (up to 0.1%) for mobile phase A and methanol or ACN, pure or with the same additives, for mobile phase B. All participants used gradients for elution, mostly running between 20 and 30 minutes.

The starting masses for the scan range ranged from 30 Da to 100 Da with a median of 60 Da, while the end of the scan range was between 800 and 1300 Da with the median at 925 Da. Collision energies were the same as the ones used for the pre-defined method except for slight changes (experimental details are provided in Record 3).

Data conversion

The participants were asked to convert the datasets to mzXML³⁹ format prior to submission using either the vendor provided tools or MSConvert implemented via ProteoWizard⁴⁰. All the users employed the MSConvert 32− or 64-bit. Additionally, a minimum absolute intensity of 50 counts was applied to all mzXML files as the intensity threshold, in order to reduce the size of the converted data.

Data Records

This ILS resulted in three different data records – i.e. mzXML files related to the pre-defined method, mzXML files related to the in-house method, and the experimental conditions in XLSX format.

Record 1

includes the mzXML files for both the samples related to the river water and drinking water with exposure times of 2 and 4 days analyzed via the pre-defined method. This record includes 157 mzXML files^41,42.

Record 2

includes the mzXML files for both the samples related to the river water and drinking water with exposure times of 2 and 4 days analyzed via the own method. This record includes 134 mzXML files^42,43.

Record 3

consists of an excel file that includes the details of participants and the experimental conditions associated with each mzXML file, including the software packages used for the data acquisition and pre-processing⁴⁴.

Record GNPS

additionally, all the mzXML files from records 1 and 2 are also available for download or analysis via GNPS⁴².

Technical Validation

We used the detection of IS, their mass accuracy, retention factor (i.e. run-time normalized retention time), and their intensity to assess the quality of the generated data by the participants. The extracted ion chromatograms (XIC) of the IS were generated using a mass window of ±0.01 Da. The peak of each IS was identified as the maximum signal in the selected XIC. Cases where we were not able to identify the IS signal as the most prominent signal, were considered as not detected. The measured mass (i.e. the median mass of three scans around apex), retention time, and intensity of apex were recorded for quality assurance of the data.

For 98% of datasets, the participants were able to detect all six IS, independently from the method used (i.e. pre-defined versus in-house). For two cases analyzed via the pre-defined method only three out of six ISs were detected while this was the case only for one chromatogram via own method, respectively. Higher detection frequencies were obtained for the in-house method than for the pre-defined method for all IS (Fig. 3). Diuron-D6 had the lowest detection frequency of 92% for pre-defined method whereas nicotine-D4 was the IS with the lowest detection frequency of 96% via the in-house method (Fig. 4). Diuron-D6 was the latest eluting IS with pre-defined method whereas nicotine-D4 was the IS with the smallest retention time for both methods (Figs. 5 and 6).

When looking at the mass error of IS, for both methods, a median of ±12 ppm was observed (Figs. 5 and 6). Overall, the mass errors were distributed between −7 mDa and +10 mDa, for both methods. We did not observe a statistically significant correlation between the retention time, m/z value and the measured mass error.

As for the intensities of the IS a median intensity of 10⁶ was measured across the datasets and the methods. The variance in the intensities of around five orders of magnitude appeared to be independent from the method used for the analysis.

Usage Notes

Often, when developing new tools for NTA, they are restricted to a limited number of datasets usually generated in one laboratory. These datasets rarely represent the actual diversity in potential future datasets that should be processed using these tools. In this data descriptor, we present a dataset which covers a wide range of instruments and instrumental/analysis conditions. Additionally, the dataset in this study provides three different levels of chemical complexity (i.e. the number of potential chemical constituents) in blank, drinking water, and river water, respectively. Such a dataset can be used for the assessment of the impact of method transfer on the explored chemical space of the samples as well as the quality of the generated spectra. The records made available with this data descriptor constitute a valuable resource for the future development of NTA algorithms and workflows, for example by providing a collection of successfully identified compounds and retention time indices obtained over a wide range of instruments and analytical conditions. Altogether, these can be used by individual laboratories for evaluation of their own practices.

When using this data, the records 1 and 2 have the following name structure: InstituteID_(nr)_pd.mzXML for the pre-defined method and InstituteID_(nr)_own.mzXML for the in-house method. These files can be opened with almost any open source software for MS data.

Code availability

The script for the extraction of the IS signal and the plots in this data descriptor is available in this repository: https://bitbucket.org/Denice_van_Herwerden/ils-validation/src/main/.

References

Schulze, B. et al. An assessment of Quality Assurance/Quality Control Efforts in High Resolution Mass Spectrometry Non-Target Workflows for Analysis of Environmental Samples. Trends Anal. Chem. 133, 116063 (2020).
Bletsou, A. A., Jeon, J., Hollender, J., Archontaki, E. & Thomaidis, N. S. Targeted and non-targeted liquid chromatography-mass spectrometric workflows for identification of transformation products of emerging pollutants in the aquatic environment. Trends Anal. Chem. 66, 32–44 (2015).
Article CAS Google Scholar
Martínez-Bueno, M. J., Gómez Ramos, M. J., Bauer, A. & Fernández-Alba, A. R. An overview of non-targeted screening strategies based on high resolution accurate mass spectrometry for the identification of migrants coming from plastic food packaging materials. Trends Anal. Chem. 110, 191–203 (2019).
Article Google Scholar
Milman, B. L. & Zhurkovich, I. K. The chemical space for non-target analysis. Trends Anal. Chem. 97, 179–187 (2017).
Article CAS Google Scholar
Oberacher, H. & Arnhard, K. Current status of non-targeted liquid chromatography-tandem mass spectrometry in forensic toxicology. Trends Anal. Chem. 84, Part B, 94–105 (2016).
Article Google Scholar
Albergamo, V. et al. Nontarget Screening Reveals Time Trends of Polar Micropollutants in a Riverbank Filtration System. Environ. Sci. Technol. 53, 7584–7594 (2019).
Article ADS CAS Google Scholar
Samanipour, S., Martin, J. W., Lamoree, M. H., Reid, M. J. & Thomas, K. V. Letter to the Editor: Optimism for Nontarget Analysis in Environmental Chemistry. Environ. Sci. Technol. 53, 5529–5530 (2019).
Article ADS CAS Google Scholar
Hohrenk, L. L. et al. Comparison of Software Tools for Liquid Chromatography–High-Resolution Mass Spectrometry Data Processing in Nontarget Screening of Environmental Samples. Anal. Chem. 92, 1898–1907 (2020).
Article CAS Google Scholar
Schymanski, E. L. et al. Identifying Small Molecules via High Resolution Mass Spectrometry: Communicating Confidence. Environ. Sci. Technol. 48, 2097–2098 (2014).
Article ADS CAS Google Scholar
Schymanski, E. L. et al. Non-target screening with high-resolution mass spectrometry: critical review using a collaborative trial on water analysis. Anal. Bioanal. Chem. 407, 6237–55 (2015).
Article CAS Google Scholar
Tian, Z. et al. A ubiquitous tire rubber–derived chemical induces acute mortality in coho salmon. Science 37, 185–189 (2021).
Article ADS Google Scholar
Ulrich, E. M. et al. EPA’s non-targeted analysis collaborative trial (ENTACT): genesis, design, and initial findings. Anal. Bioanal. Chem. 411, 853–866 (2019).
Article CAS Google Scholar
Rostkowski, P. et al. The strength in numbers: comprehensive characterization of house dust using complementary mass spectrometric techniques. Anal. Bioanal. Chem. 411, 1957–1977 (2019).
Article CAS Google Scholar
Hollender, J., Schymanski, E. L., Singer, H. P. & Ferguson, P. L. Nontarget Screening with High Resolution Mass Spectrometry in the Environment: Ready to Go? Environ. Sci. Technol. 51, 11505–11512 (2017).
Article ADS CAS Google Scholar
Hites, R. A. & Jobst, K. J. Response to “Letter to the Editor: Optimism for Nontarget Analysis in Environmental Chemistry”. Environ. Sci. Technol. 53, 5531–5533 (2019).
Article ADS CAS Google Scholar
Samanipour, S., Reid, M. J. & Thomas, K. V. Statistical Variable Selection: An Alternative Prioritization Strategy during the Nontarget Analysis of LC-HR-MS Data. Anal. Chem. 89, 10, 5585–5591 (2017).
Samanipour, S., Reid, M. J., Bæk, K. & Thomas, K. V. Combining a Deconvolution and a Universal Library Search Algorithm for the Nontarget Analysis of Data-Independent Acquisition Mode Liquid Chromatography−High-Resolution Mass Spectrometry Results. Environ. Sci. Technol. 52, 4694–4701 (2018).
Article ADS CAS Google Scholar
Samanipour, S. et al. Machine learning combined with non-targeted LC-HRMS analysis for a risk warning system of chemical hazards in drinking water: A proof of concept. Talanta 195, 426–432 (2019).
Article CAS Google Scholar
Escher, B. I., Stapleton, H. M. & Schymanski, E. L. Tracking complex mixtures of chemicals in our changing environment. Science 367, 388–392 (2020).
Article ADS CAS Google Scholar
Gosetti, F., Mazzucco, E., Gennaro, M. C. & Marengo, E. Contaminants in water: non-target UHPLC/MS analysis. Environ. Chem. Lett. 14, 51–65 (2016).
Article CAS Google Scholar
Alygizakis, N. A. et al. NORMAN digital sample freezing platform: A European virtual platform to exchange liquid chromatography high resolution-mass spectrometry data and screen suspects in “digitally frozen” environmental samples. Trends Anal. Chem. 115, 129–137 (2019).
Article CAS Google Scholar
Blaženović, I., Kind, T., Ji, J. & Fiehn, O. Software Tools and Approaches for Compound Identification of LC-MS/MS Data in Metabolomics. Metabolites 8, 31 (2018).
Article Google Scholar
Chow, C. W. K. et al. Development of smart data analytics tools to support wastewater treatment plant operation. Chemom. Intell. Lab. Syst. 177, 140–150 (2018).
Article CAS Google Scholar
Brodsky, L., Moussaieff, A., Shahaf, N., Aharoni, A. & Rogachev, I. Evaluation of Peak Picking Quality in LC−MS Metabolomics Data. Anal. Chem. 82, 9177–9187 (2010).
Article CAS Google Scholar
Samanipour, S., O’Brien, J. W., Reid, M. J. & Thomas, K. V. Self Adjusting Algorithm for the Nontargeted Feature Detection of High Resolution Mass Spectrometry Coupled with Liquid Chromatography Profile Data. Anal. Chem. 91, 10800–10807 (2019).
Article CAS Google Scholar
Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3, 160018 (2016).
Article Google Scholar
FAIR Principles. GO FAIR https://www.go-fair.org/fair-principles/.
Alygizakis, N. A. et al. Exploring the Potential of a Global Emerging Contaminant Early Warning Network through the Use of Retrospective Suspect Screening with High-Resolution Mass Spectrometry. Environ. Sci. Technol. 52, 5135–5144 (2018).
Article ADS CAS Google Scholar
MassBank consortium and its contributors. MassBank/MassBank-data: Release version 2020.06. Zenodo https://doi.org/10.5281/zenodo.3903207 (2020).
Vrana, B. et al. Passive sampling techniques for monitoring pollutants in water. Trends Anal. Chem. 24, 845–868 (2005).
Article CAS Google Scholar
Miège, C. et al. Position paper on passive sampling techniques for the monitoring of contaminants in the aquatic environment – Achievements to date and perspectives. Trends Environ. Anal. Chem. 8, 20–26 (2015).
Aalizadeh, R., Nika, M.-C. & Thomaidis, N. S. Development and application of retention time prediction models in the suspect and non-target screening of emerging contaminants. J. Hazard. Mater. 363, 277–285 (2019).
Article CAS Google Scholar
Samanipour, S. et al. Assessing sample extraction efficiencies for the analysis of complex unresolved mixtures of organic pollutants: A comprehensive non-target approach. Anal. Chim. Acta 1025, 92–98 (2018).
Article CAS Google Scholar
Samanipour, S. et al. The effect of extraction methodology on the recovery and distribution of naphthenic acids of oilfield produced water. Sci. Total Environ. 652, 1416–1423 (2019).
Article ADS Google Scholar
Vrana, B. et al. Mobile dynamic passive sampling of trace organic compounds: Evaluation of sampler performance in the Danube River. Sci. Total Environ. 636, 1597–1607 (2018).
Article ADS CAS Google Scholar
US EPA, O. EPA Method 3570 (SW-846): Microscale Solvent Extraction (MSE). US EPA https://www.epa.gov/esam/epa-method-3570-sw-846-microscale-solvent-extraction-mse (2019).
Samanipour, S. NORMAN Collaborative Trial on Passive Sampling and Non-target Screening (NTS) Instruction file. University of Amsterdam https://doi.org/10.21942/uva.14501100.v2 (2021).
Samanipour, S. et al. Two stage algorithm vs commonly used approaches for the suspect screening of complex environmental samples analyzed via liquid chromatography high resolution time of flight mass spectroscopy: A test study. J. Chromatogr. A 1501, 68–78 (2017).
Deutsch, E. W. File Formats Commonly Used in Mass Spectrometry. Proteomics. Mol. Cell. Proteomics MCP 11, 1612–1621 (2012).
Article Google Scholar
Chambers, M. C. et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 30, 918–920 (2012).
Article CAS Google Scholar
Samanipour, S. NORMAN Collaborative Trial on Passive Sampling and Non-target Screening (NTS) - Pre-defined method (individual files). figshare https://doi.org/10.21942/uva.14447595 (2021).
Samanipour, S. Inter-laboratory dataset from a collaborative trial for future use in the development of non-target analysis. MassIVE https://doi.org/10.25345/C5JV6C (2021).
Samanipour, S. NORMAN Collaborative Trial on Passive Sampling and Non-target Screening (NTS) - Own method (individual files). figshare https://doi.org/10.21942/uva.14455965.v2 (2021).
Samanipour, S. NORMAN Collaborative Trial on Passive Sampling and Non-target Screening (NTS) - Metadata. figshare https://doi.org/10.21942/uva.13562267.v3 (2021).

Download references

Acknowledgements

This work was supported by the NORMAN network. MH acknowledge funding from the Danish Environmental Protection Agency (MST-667-00207) and The Aarhus University Research Foundation (AUFF-T-2017-FLS-7-4). The authors are also thankful to Biotage for providing the passive samplers and Helena Švecová for contribution during the sampling.

Author information

Authors and Affiliations

Queensland Alliance for Environmental Health Sciences (QAEHS), The University of Queensland, 202. Cornwall Street, QLD 4102, Woolloongabba, Australia
Bastian Schulze & Sarit Kaserzon
Van ‘t Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands
Denice van Herwerden & Saer Samanipour
Norwegian Institute for Water Research (NIVA), Gaustadalléen 21, 0349, Oslo, Norway
Ian Allan, Malcolm Reid & Saer Samanipour
Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Avda. Vincent Sos Baynat, s/n, 12071, Castelló de la Plana, Castellón, Spain
Lubertus Bijlsma
Plentzia Marine Station, Department of Analytical Chemistry, University of the Basque Country, Areatza Pasealekua, 48620, Plentzia, Basque Country, Spain
Nestor Etxebarria
Aarhus University, Department of Environmental Science, Environmental Metabolomics Lab, Frederiksborgvej 399, 4000, Roskilde, Denmark
Martin Hansen, Emil Frøkjær, Thorsten Gravert & Robert Young
INRAE, UR RiverLy, F-69625, Villeurbanne, France
Sylvain Merel, Christelle Margoum & Cécile Miège
Masaryk University, Faculty of Science, RECETOX, Kamenice 753/5, 625 00, Brno, Czech Republic
Branislav Vrana & Pavla Fialová
National and Kapodistrian University of Athens, Athens, Greece
Reza Aalizadeh & Nikolaos Thomaidis
Vitens N.V., Oude Veerweg 1, Zwolle, 8001 BE, The Netherlands
Bernard Bajema
Man-Technology-Environment Research Centre, School of Science and Technology, Örebro University, Fakultetsgatan 1, 701 82, Örebro, Sweden
Florian Dubocq & Emmanuelle Vulliet
Eurolab Srl, Via Monsignore Rodolfi 22, IT-36022, Cassola, VI, Italy
Gianluca Coppola
Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100, Villeurbanne, France
Aurélie Fildier & Massimo Peruzzo
University of South Bohemia in České Budějovice, Faculty of Fisheries and Protection of Waters, South Bohemian Research Center of Aquaculture and Biodiversity of Hydrocenoses, Zátiší 728/II, CZ-389 25, Vodňany, Czech Republic
Roman Grabic
Institut Català de Recerca de l’Aigua (ICRA) Catalan Institute for Water Research, Edifici H2O - Parc Científic i Tecnològic Universitat de Girona Carrer Emili Grahit, 101 E- 17003, Girona, Spain
Pablo Gago-Ferrero
Eawag, Swiss Federal Institute of Aquatic Science and Technology, 8600, Duebendorf, Switzerland
Juliane Hollender & Teresa Mairinger
Univ Paris Est Creteil, Ecole des Ponts, LEESU, F-94010, Creteil, France
Nina Huynh, Julien Le Roux & Régis Moilleron
Flemish Institute for Technological Research (VITO), Unit Separation and Conversion Technology, Boeretang 200, 2400, Mol, Belgium
Griet Jacobs
Department of Environment & Health, Faculty of Science, Amsterdam Institute of Molecular and Life Sciences, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081, Amsterdam, HV, The Netherlands
Tim Jonkers, Marja Lamoree & Jeroen Meijer
Department of Chemistry, University of Natural Resources and Life Sciences—BOKU Vienna, Muthgasse 18, 1190, Vienna, Austria
Teresa Mairinger
Consiglio Nazionale delle Ricerche, Istituto di Ricerca Sulle Acque, Via De Blasio 5, 70132, Bari, Italy
Giuseppe Mascolo & Sapia Murgolo
OSU-EFLUVE, Univ Paris Est Creteil, CNRS, F-94010, Creteil, France
Emmanuelle Mebold
Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences (SLU), SE-75007, Uppsala, Sweden
Frank Menger
Ministry of Infrastructure and Water Management, Rijkswaterstaat, Zuiderwagenplein 2, 8224, Lelystad, AD, Netherlands
Martijn Pijnappels
Instituto di Ricerca Sulle Acque, Consiglio Nazionale delle Ricerche, Via Mulino 19, IT- 20861, Brugherio, MB, Italy
Claudio Roscioli & Sara Valsecchi
BRGM, F-45060, Orleans, France
Coralie Soulier
Colorado State University, Soil and Crop Sciences Department, Plant Sciences C117, Fort Collins, CO, 80523, United States
Robert Young

Authors

Bastian Schulze
View author publications
You can also search for this author in PubMed Google Scholar
Denice van Herwerden
View author publications
You can also search for this author in PubMed Google Scholar
Ian Allan
View author publications
You can also search for this author in PubMed Google Scholar
Lubertus Bijlsma
View author publications
You can also search for this author in PubMed Google Scholar
Nestor Etxebarria
View author publications
You can also search for this author in PubMed Google Scholar
Martin Hansen
View author publications
You can also search for this author in PubMed Google Scholar
Sylvain Merel
View author publications
You can also search for this author in PubMed Google Scholar
Branislav Vrana
View author publications
You can also search for this author in PubMed Google Scholar
Reza Aalizadeh
View author publications
You can also search for this author in PubMed Google Scholar
Bernard Bajema
View author publications
You can also search for this author in PubMed Google Scholar
Florian Dubocq
View author publications
You can also search for this author in PubMed Google Scholar
Gianluca Coppola
View author publications
You can also search for this author in PubMed Google Scholar
Aurélie Fildier
View author publications
You can also search for this author in PubMed Google Scholar
Pavla Fialová
View author publications
You can also search for this author in PubMed Google Scholar
Emil Frøkjær
View author publications
You can also search for this author in PubMed Google Scholar
Roman Grabic
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Gago-Ferrero
View author publications
You can also search for this author in PubMed Google Scholar
Thorsten Gravert
View author publications
You can also search for this author in PubMed Google Scholar
Juliane Hollender
View author publications
You can also search for this author in PubMed Google Scholar
Nina Huynh
View author publications
You can also search for this author in PubMed Google Scholar
Griet Jacobs
View author publications
You can also search for this author in PubMed Google Scholar
Tim Jonkers
View author publications
You can also search for this author in PubMed Google Scholar
Sarit Kaserzon
View author publications
You can also search for this author in PubMed Google Scholar
Marja Lamoree
View author publications
You can also search for this author in PubMed Google Scholar
Julien Le Roux
View author publications
You can also search for this author in PubMed Google Scholar
Teresa Mairinger
View author publications
You can also search for this author in PubMed Google Scholar
Christelle Margoum
View author publications
You can also search for this author in PubMed Google Scholar
Giuseppe Mascolo
View author publications
You can also search for this author in PubMed Google Scholar
Emmanuelle Mebold
View author publications
You can also search for this author in PubMed Google Scholar
Frank Menger
View author publications
You can also search for this author in PubMed Google Scholar
Cécile Miège
View author publications
You can also search for this author in PubMed Google Scholar
Jeroen Meijer
View author publications
You can also search for this author in PubMed Google Scholar
Régis Moilleron
View author publications
You can also search for this author in PubMed Google Scholar
Sapia Murgolo
View author publications
You can also search for this author in PubMed Google Scholar
Massimo Peruzzo
View author publications
You can also search for this author in PubMed Google Scholar
Martijn Pijnappels
View author publications
You can also search for this author in PubMed Google Scholar
Malcolm Reid
View author publications
You can also search for this author in PubMed Google Scholar
Claudio Roscioli
View author publications
You can also search for this author in PubMed Google Scholar
Coralie Soulier
View author publications
You can also search for this author in PubMed Google Scholar
Sara Valsecchi
View author publications
You can also search for this author in PubMed Google Scholar
Nikolaos Thomaidis
View author publications
You can also search for this author in PubMed Google Scholar
Emmanuelle Vulliet
View author publications
You can also search for this author in PubMed Google Scholar
Robert Young
View author publications
You can also search for this author in PubMed Google Scholar
Saer Samanipour
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The core group of authors included Bastian Schulze, Denice van Herwerden, Ian Allan, Lubertus Bijlsma, Nestor Etxebarria, Martin Hansen and Sylvain Merel, the project was conceptualized by Ian Allen, Pavla Fialová, Roman Grabic, Juliane Hollender, Christelle Margoum, Cécile Miège, Malcolm Reid, Saer Samanipour and Branislav Vrana. The report was written by Bastian Schulze, Denice van Herwerden and Saer Samanipour and reviewed by all involved authors. Data was contributed by Bastian Schulze, Lubertus Bijlsma, Bernard Bajema, Nestor Etxebarria, Martin Hansen, Sylvain Merel, Reza Aalizadeh, Florian Dubocq, Gianluca Coppola, Aurélie Fildier, Emil Frøkjær, Roman Grabic, Pablo Gago-Ferrero, Thorsten Gravert, Juliane Hollender, Nina Huynh, Tim Jonkers, Sarit Kaserzon, Marja Lamoree, Julien Le Roux, Teresa Mairinger, Christelle Margoum, Giuseppe Mascolo, Emmanuelle Mebold, Frank Menger, Cécile Miège, Jeroen Meijer, Régis Moilleron, Sapia Murgolo, Griet Jacobs, Massimo Peruzzo, Martijn Pijnappels, Claudio Roscioli, Coralie Soulier, Sara Valsecchi, Nikolaos Thomaidis, Emmanuelle Vulliet and Robert Young.

Corresponding author

Correspondence to Saer Samanipour.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Suplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and permissions

About this article

Cite this article

Schulze, B., van Herwerden, D., Allan, I. et al. Inter-laboratory mass spectrometry dataset based on passive sampling of drinking water for non-target analysis. Sci Data 8, 223 (2021). https://doi.org/10.1038/s41597-021-01002-w

Download citation

Received: 14 January 2021
Accepted: 12 July 2021
Published: 24 August 2021
DOI: https://doi.org/10.1038/s41597-021-01002-w

This article is cited by

Predicting RP-LC retention indices of structurally unknown chemicals from mass spectrometry data
- Jim Boelrijk
- Denice van Herwerden
- Saer Samanipour
Journal of Cheminformatics (2023)
Complementary methods for structural assignment of isomeric candidate structures in non-target liquid chromatography ion mobility high-resolution mass spectrometric analysis
- Masoumeh Akhlaqi
- Wei-Chieh Wang
- Anneli Kruve
Analytical and Bioanalytical Chemistry (2023)
Novel nontarget LC-HRMS-based approaches for evaluation of drinking water treatment
- Petra Nováková
- Helena Švecová
- Roman Grabic
Environmental Monitoring and Assessment (2023)
NORMAN guidance on suspect and non-target screening in environmental monitoring
- Juliane Hollender
- Emma L. Schymanski
- Martin Krauss
Environmental Sciences Europe (2023)
Critical assessment of chromatographic metadata in publicly available metabolomics data repositories
- Eva-Maria Harrieder
- Fleming Kretschmer
- Michael Witting
Metabolomics (2022)

Subjects

Abstract

Similar content being viewed by others

Background & Summary

Methods

Experimental design

Sample set

Pre-defined method

Variations in the pre-defined method

In-house method

Data conversion

Data Records

Record 1

Record 2

Record 3

Record GNPS

Technical Validation

Usage Notes

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links