The U.S. Geological Survey (USGS) maintains a place-based research program in San Francisco Bay (USA) that began in 1969 and continues, providing one of the longest records of water-quality measurements in a North American estuary. Constituents include salinity, temperature, light extinction coefficient, and concentrations of chlorophyll-a, dissolved oxygen, suspended particulate matter, nitrate, nitrite, ammonium, silicate, and phosphate. We describe the sampling program, analytical methods, structure of the data record, and how to access all measurements made from 1969 through 2015. We provide a summary of how these data have been used by USGS and other researchers to deepen understanding of how estuaries are structured and function differently from the river and ocean ecosystems they bridge.
Machine-accessible metadata file describing the reported data (ISA-tab format)
Background & Summary
On April 10 and 11, 1969 oceanographers from the U.S. Geological Survey (USGS) conducted the first hydrographic research cruise along the salinity gradient of San Francisco Bay (SFB)—one of the largest estuaries on the west coast of the Americas. Although it was not the researchers’ original intention, that survey launched an observational program that continues and expanded into a program of long-term ecosystem research that has contributed to the development of estuarine oceanography as a scientific discipline. In that era little was known about how estuaries function as transitional ecosystems between land and sea, where seawater and fresh water meet. Early USGS studies focused on: estuarine circulation where surface waters flow seaward over a landward-flowing bottom layer1; sediment accumulation in an estuarine turbidity maximum2; geomorphology3; marsh vegetation and land forms4; biogeochemistry of nutrients, oxygen and carbon5; benthic invertebrate communities6; urban pollution7, and its flushing by river inflows8.
Over time the research expanded into new domains to measure, model and understand: tidal circulation and transport processes9,
Central to this research is a core set of measurements repeated over time at a network of sampling sites (Fig. 1, Table 1) spaced along the estuarine salinity gradient. The San Francisco Bay system has been a useful place for studying estuarine dynamics because it includes two different estuary types. South Bay (stations 21–36) is an urbanized marine lagoon, and North Bay (stations 15–657) is the estuary of California’s two largest rivers, the Sacramento and San Joaquin. Central Bay connects South and North Bays to each other and to the coastal Pacific Ocean (Fig. 1). Thus, one goal of USGS research has been to compare two different estuary types36. The data set includes measurements of salinity, temperature, suspended particulate matter, light penetration, dissolved oxygen, chlorophyll-a as an indicator of phytoplankton biomass, and concentrations of dissolved inorganic N, P and Si. The sampling program maps longitudinal and vertical distributions of these estuarine properties and captures their variability at seasonal, annual and decadal time scales.
The data described here were collected for one research purpose—to measure and understand how an estuarine ecosystem changes in response to human activities and the climate system. However, we recognized from the beginning of this effort that the data have value beyond this one purpose. We have encouraged and supported use of these data by others, and the diversity of applications of this data set has been both surprising and gratifying. We illustrate this diversity with examples of scientific articles (Table 2 (available online only)) that used the data for purposes we could not have imagined, ranging across disciplines of archaeology, geochemistry, hydrodynamics, ecotoxicology, conservation biology, sediment dynamics, and biology of organisms from microbes to seabirds. Some of these publications were collaborations with visiting scientists, postdocs and graduate students. Others were done independently of USGS research. The collective knowledge accumulated from this research over decades has contributed to the global understanding of estuaries as ecosystems situated where land, ocean, atmosphere and people converge. Our purpose here is to widen accessibility of these data so their value continues to grow.
USGS water-quality studies in San Francisco Bay include two types of measurements: (1) laboratory analyses of discrete water samples collected aboard ship (chlorophyll-a, dissolved oxygen, suspended particulate matter, dissolved inorganic nutrients), and (2) shipboard or submersible sensors to measure salinity, temperature, chlorophyll fluorescence, dissolved oxygen, turbidity, and light attenuation. The analyses of discrete water samples were used to calibrate the chlorophyll fluorescence, dissolved oxygen, and turbidity sensors, with individual calibrations for each sampling cruise, and often separate calibrations for each bay region. Therefore, the data record includes both discrete measurements (e.g., Discrete_Chlorophyll-a) and sensor-based in-situ measurements (e.g., Calculated_Chlorophyll-a).
From 1969 through March 1987 the discrete water samples were collected by submersible pump that delivered bay water to a shipboard fluorometer, nephelometer, thermistor, and conductivity sensor37. Vertical profiles were obtained by lowering the pump to prescribed depths, typically 0, 2, 5, 10, 20 m. Since April 1987 the discrete water samples have been collected near surface (~1.5 m) by pump and ~1 m above bottom with a Niskin bottle, and vertical profiles of salinity and temperature have been obtained with a Sea-Bird Electronics SBE-9 CTD. In 1988 turbidity, chlorophyll-a fluorescence, and photosynthetically active radiation (PAR) sensors were added to the CTD package, and in 1993 a dissolved oxygen sensor was added. In 2002 we started using a Sea-Bird Electronics SBE-9plus CTD. The individual sensors on the CTDs changed over time as new technologies emerged (see below). The CTD system is lowered through the water column at a rate<1 m s−1, collecting >24 samples/meter for Seabird sensors and >5 samples/meter for third party sensors. The values we report are averages over 1-m depth bins centered at the depth reported (i.e., CTD values are means of all measurements made 0.5 m above and 0.5 m below the reported depth).
This data set was acquired as a component of a research program whose goals evolved over time, so the frequency, spatial coverage, and makeup of water-quality measurements varied from year to year. We characterize sampling effort for five constituents, measured as the number of samples binned by station, month, and year (Fig. 2). Sampling effort was greatest in South Bay and during March and April, reflecting a key research objective to follow dynamics and ecological and biogeochemical consequences of the spring phytoplankton bloom30. The record reveals multi-year gaps in measurements of SPM and dissolved oxygen; that chlorophyll-a measurements first began in 1977; and that nutrient (e.g., phosphate) concentrations were measured less frequently than other constituents (Fig. 2). Sampling became more regular, and all constituents were measured each cruise starting in 1993 when this program became incorporated into the Regional Monitoring Program for Water Quality in San Francisco Bay (http://www.sfei.org/rmp).
Pump depth was determined using a pressure transducer with an accuracy of +/− 1 m. Readings at zero meters are representative of a 0.4 m intake depth. Beginning March 1987, depth was determined with a Paroscientific Digiquartz (http://www.paroscientific.com/) pressure transducer as part of a Sea-Bird Electronics CTD package.
Chlorophyll-a measurements began in 1977, and methods changed as new instrumentation and widely-accepted standard methods emerged. Samples were collected onto Gelman GFF (glass fiber) filters and pigments were extracted with 90% acetone. The absorbance of the extracts was measured with a Varian 635D spectrophotometer following Strickland and Parsons38. Chlorophyll-a concentrations were calculated using the SCOR-UNESCO trichromatic equations39. Beginning in 1983, we used Lorenzen’s40 spectrophotometric equations. In 1992 we began using a Hewlett Packard 8452A diode array spectrophotometer. In 1999 we began measuring chlorophyll-a concentrations fluorometrically using the acidification method on a Turner Designs TD-700 fluorometer calibrated with chlorophyll-a standard41,42. Since 2011 we have used a Turner Designs Trilogy fluorometer. After each method change we compared results of the older and newer approach on replicate samples across a range of chlorophyll-a concentrations to verify that bias was not introduced as new instruments and methods were used.
Vertical profiles of chlorophyll-a were derived from calculated concentrations based on calibrations of an in-vivo fluorometer done each cruise to account for variability of phytoplankton species assemblages and the relationship between chlorophyll-a and fluorescence43. Calibrations were linear regressions of Discrete_Chlorophyll-a (above) and in-vivo fluorescence measured initially with a Turner Designs Model 10 fluorometer connected to a pumped stream of bay water. Beginning in 1988, profiles were obtained with a SeaTech fluorometer connected to a Sea-Bird Electronics CTD. This was replaced with a Turner Designs SCUFA fluorometer in 2002, and with a Turner Designs C7 fluorometer in May 2004.
Water samples for dissolved oxygen measurement were collected into 300-ml BOD bottles that were filled from the bottom and allowed to overflow at least 3 times their volume. Winkler reagents38 were added immediately and bottles were stored capped with water in their cap-wells. In the laboratory, 100.2 ml of acidified sample was titrated manually following Carpenter44. Beginning in 1993, the samples were analyzed with a Metrohm 686 titroprocessor autotitrator38 using the potentiometric titration method of Granéli and Granéli45. Potassium iodate standardization of the sodium thiosulfate was conducted (Knapp et al., 1991). In 2007 the autotitrator was replaced with a Metrohm Titrino 798.
In 1993 we added a Sea-Bird Electronics SBE-13 sensor to the CTD package to obtain vertical profiles of dissolved oxygen. The sensor was calibrated prior to each cruise with 100% and zero saturation endpoints, and additionally with Discrete_DO measurements (above) each cruise. In 2002 we began using a Sea-Bird Electronics SBE-43 oxygen sensor calibrated each cruise with Discrete_DO measurements.
Suspended particulate matter was measured gravimetrically as mass collected onto pre-weighed 0.45 μm pore-size silver filters (1969–1984) or polycarbonate 0.4 μm pore-size membrane filters (1993–2015). A correction was made for mass of salt retained on the filter46.
Vertical profiles of suspended particulate matter were calculated concentrations derived from individual calibrations of a nephelometer or optical backscatter sensor each cruise. Calibrations were linear regressions of Discrete_SPM (above) and voltage output from a Turner Designs Model 10 fluorometer configured as a nephelometer connected to the pumped stream of bay water37. Beginning 1993, SPM profiles were obtained with a D&A Instrument Company (now Campbell Scientific) OBS-3 optical backscatter sensor as part of a Sea-Bird Electronics CTD package.
PAR (mol quanta m−2 s−1) was measured using a Li-Cor Biosciences LI-192 underwater quantum sensor (1977–1982, 1988–2015). The light extinction coefficient (k) was computed from the slope of the regression of ln(PAR) against water depth. Measurements were initially made at 6–7 depths per station. From 1983 through 1987 the light extinction coefficient was computed from Secchi depth SD (m) using an empirical relationship derived for San Francisco Bay: k=0.4+109/SD47. Beginning in 1988, the LI-192 sensor was deployed as part of a Sea-Bird Electronics CTD package collecting at least 28 measurements per meter, generating high-resolution vertical profiles of PAR.
Salinity was initially measured with an Industrial Instruments RS5-3 induction salinometer (accuracy of 0.3 PSU), and beginning December 1969 with a CM2 model 516 CTD probe48. Output from that probe was verified each cruise with 6–12 duplicate water samples analyzed in the laboratory with a Beckman RS7-B salinometer calibrated with Copenhagen water (agreement of 0.2 PSU)48. Beginning July 1974, salinity was measured with an electrodeless induction salinometer, with outputs validated each cruise with duplicate samples run on the Beckman salinometer (agreement of 0.05 PSU)48. In March 1987 we began measuring salinity with a Sea-Bird Electronics SBE-4 conductivity sensor as part of the CTD package, and since 2002 with a SBE-4C conductivity sensor. Salinity was computed from conductivity, temperature, and pressure48.
Temperature was initially measured with linear thermistors calibrated at ice point and near 20 °C each cruise49. Beginning March 1987, temperature was measured with a Sea-Bird Electronics SBE-3 temperature sensor as part of the CTD package, and since 2002 with a SBE-3plus temperature sensor.
Samples for dissolved inorganic nutrient analyses were filtered through polycarbonate 0.4-μm pore size membrane filters into bottles previously acid washed with 10% HCL. From 1971–2003, most sample bottles were pre-washed with acetone and 2.5 meq l−1 sodium bicarbonate instead of 10% HCL, with blank analyses confirming undetectable nutrient concentration. Sample filtrates were either analyzed immediately, refrigerated and analyzed within 48 h, or frozen until analysis. Frozen samples were allowed to thaw at room temperature for at least 14 hours before analysis. Samples were analyzed with a Technicon II AutoAnalyzer for: dissolved silica using Technicon Industrial Method 105-71WB50, dissolved reactive phosphate using the method of Atlas et al.51 with ascorbic acid as a reductant, nitrate+nitrite using Technicon Industrial Method AII 100-70 W52, and ammonium using the method of Solorzano53 and starting in 1980 with color development at 37 °C following Berg and Abdullah54. Standards for each analyte were prepared in artificial river water and artificial seawater38. Beginning in April 1991, the ammonium method was modified to improve precision as detailed in Hager46. Beginning in 2006, nutrients were stored frozen until analysis by the Richard Dugdale laboratory at San Francisco State University using a Bran and Luebbe AutoAnalyzer II for all nutrients except ammonium, which was analyzed by spectrophotometer. Nitrate, nitrite, and phosphate were determined according to Whitledge et al.55. Silicate was determined with Bran Luebbe AutoAnalyzer Method No. G-177-96 (ref. 56). Ammonium was determined with the method of Solorzano53. Beginning in March 2014, nutrients were analyzed by the USGS National Water Quality Laboratory with a Thermo Scientific Aquakem 600 automated discrete analyzer using methods of Fishman and Friedman57 for nitrite, phosphate, and silicate, the method of Patton and Kryskalla58 for nitrate, and the Solorzano method53 for ammonium with a salt correction factor applied59.
The dataset includes the following fields for each record:
Date: format MM/DD/YY
Depth: sampled depth below the surface (m)
Discrete_Chlorophyll-a: chlorophyll-a measured in a water sample (μg l−1)
Calculated_Chlorophyll-a: chlorophyll-a calculated from in-vivo fluorescence (μg l−1)
Discrete_Oxygen: dissolved oxygen measured in a water sample (mg l−1)
Calculated_Oxygen: dissolved oxygen calculated from an oxygen sensor (mg l−1)
Discrete_SPM: suspended particulate matter measured in a water sample (mg l−1)
Calculated_SPM: suspended particulate matter calculated from a turbidity sensor (mg l−1)
Extinction_Coefficient: light extinction coefficient (m−1)
Salinity: Practical Salinity Units (PSU)
Temperature: water temperature (°C)
Nitrite: nitrite concentration (μM)
Nitrate+Nitrite: sum of nitrate and nitrite concentration (μM)
Ammonium: ammonium concentration (μM)
Phosphate: phosphate concentration (μM)
Silicate: silicate concentration (μM)
Data record 1
The dataset includes 210,826 records, each representing a water sample from a unique date, station, and depth. All measurements made between 4/10/69 and 12/16/15 are available in one csv file (SanFranciscoBayWaterQualityData1969-2015v3.csv) uploaded to the USGS ScienceBase repository (Data Citation 1: U.S. Geological Survey https://doi.org/10.5066/F7TQ5ZPR). An xml-formatted metadata file is also available at that repository.
Results from each sampling cruise were examined carefully by at least two members of the research team to ensure that all values fell within expected ranges, to verify that calibration regressions were an acceptable basis for computing quantities from shipboard sensor measurements, to ensure completeness of each cruise data report, and to verify that values transcribed from field notes were accurate. The complete 1969–2015 data set was validated with three steps: (1) range tests to ensure that the measured values fell within ranges that are plausible and consistent with knowledge of San Francisco Bay and other estuaries; (2) pattern tests of time series of all measurements to ensure they followed plausible and understandable patterns of variability over time; (3) pattern tests of all measurements by sampling station to ensure they followed plausible and understandable spatial patterns.
Sea-Bird Electronics sensors were calibrated annually by the manufacturer and have initial accuracies of: temperature=±0.001 °C, conductivity=±0.0003 mS m−1, pressure=±0.015% of full range, dissolved oxygen=±2% of saturation (http://www.seabird.com). Li-Cor LI192 sensors were calibrated by the manufacturer and sensitivity is typically 4 μA per 1,000 μmol m−2 s−1 (https://www.licor.com). Cruise-specific calibrations of shipboard fluorometers, nephelometer/optical backscatter, and oxygen sensors yielded highly significant (P<10−16) linear relationships between all discrete and calculated concentrations of chlorophyll-a, SPM and DO (Fig. 3). Median absolute deviations between discrete and calculated concentrations were: 0.40 μg l−1 for chlorophyll-a; 2.10 mg l−1 for SPM; 0.10 mg l−1 for DO. Linear regressions yielded residual standard errors between discrete and calculated concentrations of: 1.36 μg l−1 for chlorophyll-a; 8.2 mg l−1 for SPM; 0.16 mg l−1 for DO (Fig. 3).
Discrete chlorophyll-a values are mean concentrations in replicate (2, 3, or 4) aliquots from each sample. If the replicate results differed by more than 10% of their mean the results were not included in the data set. The mean coefficient of variation between replicate aliquots from 3,564 chlorophyll-a samples collected between 2005 and 2013 was 2.4%. Agreement between all replicates was within the recommended guideline for the method: >90% of the coefficients of variation (CV) between samples are <5% (ref. 42). Discrete suspended particulate matter precision was 1%-10%. Analytical precision of the potentiometric DO method is <0.3% (ref. 45).
Nutrients analysed at the USGS Menlo Park (USGS-MP) laboratory from 1969–2005 had a typical precision of 0.02–0.2 μM for ammonium, 0.04–0.17 μM for nitrate+nitrite, 0.01–0.05 μM for nitrite, 0.01–0.05 μM for phosphate, and 0.06–1.0 μM for silicate46,60,
As a preliminary step in the 2014 transition from SFSU to the USGS National Water Quality Laboratory (USGS-NWQL), we collected triplicate water samples along the salinity gradient of San Francisco Bay to compare analyses by SFSU, USGS-NWQL, and the Chesapeake Biological Laboratory (CBL) as an independent laboratory. We continued analysis of duplicate samples by USGS-NWQL and CBL through 2015. We compare results of the three laboratories in Fig. 5. The USGS-NWQL has the following minimum reporting levels: 0.7 μM ammonium, 0.1 μM nitrite and phosphate, 0.7 μM nitrate+nitrite when total<10 μM, 2.9 μM when total >10 μM, and 1.0 μM silicate. Replicate samples are intermittently analysed by USGS-NWQL to measure precision. Replicates have mean coefficients of variation<5% for all nutrients: nitrite=3.1%, nitrate+nitrite=2.3%, ammonium=4.6%, phosphate=1.6%, and silicate=0.01%.
Although nutrient methods changed over time, routine analyses of blanks and standards confirmed that methods changes did not reduce analytical precision or accuracy.
This Data Descriptor identifies a csv file that contains the complete record of USGS water-quality measurements made in San Francisco Bay from 1969–2015. Users may prefer to access the data from our project web page that includes a database from which queries can be made to select and download subsets of the full data record (https://sfbay.wr.usgs.gov/access/wqdata/index.html ). This web page also provides visual displays of water-quality spatial variability for each sampling cruise, and more detail about the research project and team members.
How to cite this article: Schraga, T. S. & Cloern, J. E. Water quality measurements in San Francisco Bay by the U.S. Geological Survey, 1969–2015. Sci. Data 4:170098 doi: 10.1038/sdata.2017.98 (2017).
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Cloern, J. E., & Schraga, T. S. U.S. Geological Survey https://doi.org/10.5066/F7TQ5ZPR (2016)
Long-term USGS research in San Francisco Bay has been supported by: USGS National Research Program; USGS Priority Ecosystem Science; Regional Monitoring Program for Water Quality in San Francisco Bay; and the San Francisco Bay Nutrient Management Strategy.