Abstract
Permafrost degradation leads to considerable changes in river ecosystems. The Eastern Siberian River Chemistry (ESRC) database was constructed to create a spatially extensive river chemistry database to assess climate warming-induced changes in freshwater systems in permafrost-dominated eastern Siberia. The database includes 9487 major ion (Na+, K+, Ca2+, Mg2+, Cl−, SO42− and HCO3−) data of chemical results from 1434 water samples collected mainly in six large river basins in eastern Siberia spanning 1940–2019. Data were obtained from public databases, scientific literature in English and Russian, and researchers and were formatted with a consistent table structure. The database is transparent and reproducible. Climate variable (air temperature and precipitation) data, discharge data, trace element concentration data, and isotope data at the basin and subbasin scales are also provided. This database enhances knowledge about the water chemistry of the permafrost region, especially in eastern Siberia, where data are scarce. The database will be useful to those assessing spatiotemporal changes in river water chemistry associated with permafrost degradation or other environmental stressors in a warmer climate.
Measurement(s) | water chemistry |
Technology Type(s) | chromatography and spectrometry |
Sample Characteristic - Environment | hydrochemistry |
Sample Characteristic - Location | Eastern Siberia |
Similar content being viewed by others
Background & Summary
The Arctic Ocean accounts for only 1% of the global ocean volume, while it receives more than 10% of global river discharge (~ 4300 km3 per year)1,2 from ~ 15% of the global land surface3. Surface water from Arctic and sub-Arctic river basins is generally fresh4 with low concentrations of dissolved ions. Over the past several decades, the Arctic freshwater system has experienced significant changes5 due to accelerated climate warming and an intensified hydrological cycle as well as human activities across the terrestrial pan-Arctic6,7,8.
The chemical compositions of river water are the result of natural processes and anthropogenic influences9. Progressive increases in major ion delivery to the Arctic and sub-Arctic freshwater systems are highly associated with permafrost degradation in a warmer climate10. Permafrost degradation enhances infiltration, increases groundwater storage, and drives deeper flow paths11, leading to increasing contributions of highly mineralized groundwater to streamflow. As a result, Arctic freshwater is shifting from a mineral-poor surface water-dominated river system to a mineral-rich groundwater system12. Our understanding of the response of the Arctic freshwater system to permafrost degradation is mainly based on river water chemistry observations in western Siberia13.
The water chemistry database in western Siberia is relatively rich, especially for the Ob River, with sampling dating back to the 1930s13, and is constantly replenished14,15,16. In contrast, water chemistry data in eastern Siberia are relatively sparse. Early data on water chemistry in eastern Siberia were published mainly in the Russian literature and were difficult to access. In fact, the water chemistry of eastern Siberia was continuously observed and studied by scholars in the former Soviet Union during the 1940s and 1950s e.g., Bochkarev17. In the 1990s, research for two PhD theses was conducted to systematically study the water chemistry of the Lena River18 and the other rives of eastern Siberia19. After 2000, the Arctic Great Rivers Observatory (ArcticGRO), which originated from the Pan-Arctic River Transport of Nutrients, Organic Matter, and Suspended Sediments (PARTNERS) project, provides open-access water chemistry data of the Lena and Kolyma Rivers from 2003. However, water chemistry data for other rivers (e.g., Angara, Selenga, Yana and Indigirka) are still limited.
The objective of this study was to combine existing eastern Siberian river chemistry datasets into a single database that can help assess climate effects on freshwater chemistry in permafrost-dominated regions. Data obtained from public databases, researchers, and the literature, including English and Russian articles and dissertations, were combined to create a georeferenced database with 9487 water chemistry results for 1434 samples collected from rivers across eastern Siberia (Fig. 1). A shapefile that delineated polygons for river basins was constructed to accompany the chemistry database. This database also included climate variables such as air temperature and precipitation at the basin scale. The database is transparent and reproducible and can be useful to assess the responses of freshwater systems to climate change in permafrost-dominated regions.
Methods
Data acquisition
Google Scholar, Scopus, and eLIBRARY.RU, as well as public data sources, were searched using the term “water chemistry” in Eastern Siberia. In total, 1434 multisource data, including major ions, were obtained from both published datasets and unpublished field studies (Table 1). Among these data, (1) 159 datasets were from the ArcticGRO water quality data20 and the GLObal RIver CHemistry (GLORICH) databases21; (2) 928 water chemistry data were sourced from 10 published studies in both English22,23,24,25,26 and Russian17,18,27,28,29); and (3) 347 unpublished datasets were provided by Gabysheva O.I. and Wang P. Chemical analyses of the waters sampled by research groups led by Gabysheva O.I. and Wang P. were performed at the laboratory of the Institute for Biological Problems of Cryolithozone and the Baikal Institute of Nature Management (Siberian Branch, Russian Academy of Sciences), respectively, following the methodology described by Semenov30.
For the 347 unpublished datasets, water samples were collected in pre-cleaned polypropylene bottles and immediately filtered through disposable sterile Sartorius filter elements (pore size 0.45 μm). The first 50 mL of the filtrate was discarded. The filtered solutions for cation and trace element analysis were acidified (pH = 2) with ultrapure double-distilled HNO3, stored in HDPE bottles prewashed with 1 M HCl and rinsed with Milli-Q deionized water. Filtered water samples for anions were not acidified and stored in High Density Polyethylene (HDPE) bottles prewashed according to the procedure described above for cations. Some components were analysed directly at the sampling sites; the remaining samples were fixed according to the analysis procedure and transported in a refrigerated box at 1–3 °C. Anions (Cl−, SO42−, HCO3−) were determined by high-performance liquid chromatography (HPLC), and cations (Ca2+, Mg2+, K+ and Na+) were analysed by flame atomic-absorption spectrometry.
We consolidated all collected data for major dissolved ions (Na+, K+, Ca2+, Mg2+, Cl−, SO42− and HCO3−) in eastern Siberian rivers and divided them into 7 categories according to the spatial distribution in six major river basins and out-of-basin areas (named Angara, Selenga-Baikal, Lena, Yana, Indigirka, Kolyma and Eastern Siberia in the “Basin” attribute) and eliminated duplicate data.
Unit conversion
All of the original water chemical data included major ions (Na+, K+, Ca2+, Mg2+, Cl−, SO42− and HCO3−) without alteration other than standardization of units to mg/L. Based on Lesch31 and EWT Water Technology (https://www.ewt-wasser.de/en/knowledge/concentration-quantities-unit-conversions.html#Umre%204/5), the atomic weight (AW) and valence (V) were used in the conversion relationships between ppm, mmol/L, mEq/L and mg/L (Table 2):
The inorganic total dissolved solids (TDS) were determined by the sum of seven major ions (Na+, K+, Ca2+, Mg2+, Cl−, SO42− and HCO3−) expressed in mg/L. Among the ArcticGRO datasets20, the SO42− concentrations (117 datasets) were obtained by multiplying the concentration of sulfur (mg S/L) by three, and the HCO3− concentrations (147 datasets) were calculated from the alkalinity based on the ratio of equivalent weights32 and marked as “cal_alk” in the attribute “Note”:
Anion HCO3− in 151 groups of data from Huh, et al.24, Huh, et al.26, and GLORICH21 were determined by the charge balance method from the other ions, which was marked as “cal_ib” in the “Note” attribute.
Ionic charge balance controls
To control the data quality of water samples, the ionic charge balance technique was used in this study since the concentrations of all negatively charged ions should be equal to the sum of the positively charged ions in each sample. The ion balance (IB) was determined as follows WMO33:
where Ci is the concentration of ion type i in a specific sample (mEq/L); IS is the sum of all ion concentrations (mEq/L); ID is the difference between the sum of the cation concentrations and the sum of the anion concentrations (mEq/L); and IB is the ratio of ID to IS, representing both systematic and random errors during the measurements.
As a result, 122 samples (8.5% of the total samples) with absolute values of IB greater than 10 were excluded from this study, and in 48 samples, some ions were absent (marked as “imbalance” and “absent” in the “IB” attribute, respectively). As a result, 1264 samples were considered reasonable for further analysis.
Normal distribution assessment
The normality assumption is assessed using skewness and kurtosis and applies to both small and large samples34 for the 1264 sets of TDS data. The skewness (γ1) and kurtosis (γ2) describe the degree of asymmetry in a distribution and the extent to which the density of observations differs from the probability density of the normal curve35:
where n represents the sample size with a value of xi, \(\bar{{x}}\) is the mean value and SD is the standard deviation.
A z-test is applied, and z scores can be obtained by dividing the skew values or excess kurtosis by their standard errors34:
where SEγ1 and SEγ2 are the standard errors of skewness and kurtosis, respectively.
The normality test results with positive skew values and positive excess kurtosis from IBM SPSS software (https://www.ibm.com/analytics/spss-statistics-software) show that the dataset of TDS values does not follow the normal distribution (Table 3), as the z score is larger than ± 1.96 (α = 0.05).
Outlier detection
The 1264 sets of TDS data varied widely (12–2586 mg/L). Tukey’s method36 applies to both symmetric and skewed data and detects more outliers for data that do not follow a normal distribution, unlike the standard deviation (SD) method (Mean ± 2 SD, Mean ± 3 SD)37. Since Tukey’s method makes no distributional assumptions about the data37, outliers in this study were detected by Tukey’s 3 IQR (interquartile range) method. The IQR is known as the difference between the first quartile (Q1) and the third quartile (Q3)38:
The samples were detected as potential outliers and possible outliers by inner fences with a 1.5 IQR interval and outer fences with a 3 IQR37,39, respectively.
Inner fences are situated at a distance of 1.5 IQR below Q1 and above Q3:
The intervals with 3 IQR are called outer fences and are located below Q1 and above Q3 at 3 IQR distances:
The outlier detection results (Table 4) show that 4.4% and 8.2% of the 1264 TDS data account for the possible outliers and potential outliers, respectively.
Subbasin selection
The subbasin boundaries used in this study were extracted from the HydroBASINS shapefile40, which follows the rule that at every location where two river branches meet, each has an individual upstream area that exceeds a certain size threshold (i.e., 100 km2). The rule still allows smaller subbasins to occur, and we selected the 6th-level basin for this database according to the data volume and sampling density.
In total, 218 subbasins were selected from a total of 776 subbasins in the eastern Siberia region (including its six major basins) where the sampling sites were located (Fig. 2). Each subbasin was named with a unique code in ObjectID together with average river water chemistry and the climatic factors (temperature (T), precipitation (P) and potential evaporation (PET)) at subbasin scales. T and PET were derived from the Climate Research Unit (CRU) 4.04 dataset41, and P was obtained from the Global Precipitation Climatology Centre (GPCC) dataset42 at a resolution of 0.5° from 1901 to 2019.
Meteorological data processing
We clipped the meteorological data (.nc file) using subbasin boundaries and then pre-processed the data to filter out the missing data. The monthly precipitation data (mm/month) of the year are summed to obtain the annual precipitation (mm/year). The same was true for the daily potential evaporation data, which should be multiplied by the number of days of each year. After that, we averaged the meteorological data within each subbasin.
Data Records
The dataset is publicly available at figshare43. The water chemistry database consists of the following 3 categories and associated listed files:
Category 1: Boundary data
This folder contains the boundaries of eastern Siberia and its six major river basins (Angara, Selenga-Baikal, Lena, Yana, Indigirka and Kolyma) with the river system, which consist of four shp files.
Eastern_Siberia_boundary.shp
Basin_boundary.shp
Subbasin_boundary.shp
River_system.shp
Category 2: Water chemistry data
This folder contains the full river water chemistry database, which consists of a csv file with all total dissolved solids (TDS) and major ions (Na+, K+, Ca2+, Mg2+, Cl−, SO42− and HCO3−), as well as related information (basins, coordinates, sample period, data source, permafrost type, and lithology), basic climatic (temperature and precipitation) and discharge data for each sample (Sample_ID). This folder also contains a sample summary csv file, which provides the maximum, minimum, mean, standard deviation and number information for the ion concentrations and TDS in each river basin.
Samples_database.csv
Samples_summary.csv
Category 3: Meteorology data
This folder contains the climatic information (temperature, precipitation, and potential evaporation) for the 218 subbasins (named ObjectID) on a yearly scale from 1901 to 2019. Each of the 3 files contains a group of 25942 data with 3332 missing values denoted as −9999.
tmp.csv
pre.csv
pet.csv
Technical Validation
Quality assurance for the 1434 unique datasets from each independent source was separated into two stages (Fig. 3): (1) Import and standardization and (2) Screening by chemical and statistical methods.
Import and standardization
Data extracted from different sources (manuscripts, online databases and field work reports) were input into an initial data file according to corresponding attributes without alteration. After the multisource data were assembled, an initial check of transcription errors and the modification of input errors (e.g., decimal point mislocation, incorrect placement of variables, and character error) were conducted. Then, standardization and unit conversion were carried out for original water chemical data by parametric conversion (i.e., determining the concentration of hydrogen carbonate by alkalinity and determining the concentration of sulfate by sulfur concentration) and conversion of units into mg/L. Duplicate data were then screened by comparing the coordinates and times of the datasets. Ten percent of random data were selected from our database for validation to eliminate errors during the whole import and standardization process.
Screening by chemical and statistical methods
We compared the original TDS values from data sources (i.e., literature and database) with the calculated TDS by the sum of major ions to ensure the rationality of the original ion concentration data. Forty-eight of all datasets were missing ions and marked as “absent” in the “IB” column of the “Samples_database.csv” file. Then, we performed charge balance across all datasets and identified 122 total samples that did not meet the ion balance (marked as “imbalance” in the “IB” column of the “Samples_database.csv” file). The remaining 1264 sets of data were explored by normal distribution assessment and outlier detection methods, and the input and processing of outliers were then verified. Both the inner and outer fences of outliers were determined for the 1264 TDS datasets, and outliers with high mineralization of river water appear in only the Angara, Lena and Selenga-Baikal River basins due to different karst processes. Finally, 150 datasets were selected randomly from the final database twice, and the final validation was conducted by people not involved in the data collection process.
Meteorological data validation
To ensure the reliability of the meteorological data, the gridded data were compared with the observation data from the meteorological stations. The validation of monthly gridded data against the observed data (Fig. 4) showed a good performance of CRU temperature products (MAE = 1.41 °C, RMSE = 2.33 °C, NSE = 0.98, R2 = 0.98, n = 159889) and GPCC precipitation products (MAE = 1.99 mm, RMSE = 5.95 mm, NSE = 0.97, R2 = 0.97, n = 147825).
Usage Notes
The Eastern Siberian River Chemistry (ESRC) database includes the boundaries of eastern Siberia, its six river basins and the 218 subbasins in which water samples were taken. In addition to the sampling information, this database also includes 1434 samples of 7 major ion concentrations, total dissolved solids (TDS), climatic factors (temperature and precipitation), lithology, permafrost, sampling information, and annual air temperature, precipitation, potential evaporation, and discharge data for each subbasin during the period from 1901–2019.
Water chemistry datasets
-
1)
Samples_database.csv
Sample_ID - Unique sampling data identifier. The code consists of 2 parts:
-
The first part represents the region: SE - Selenga-Baikal; AN - Angara; LE - Lena; YA - Yana; IN - Indigirka; KO - Kolyma; ES - Eastern Siberia.
-
The second part represents the sample numbers in each basin.
Data - Sampling date in the format YYYY-MM-DD, nondaily sample dates are blank.
Year - Sampling years.
Month - Sampling month: 1 - January; 2 - February; 3 - March; 4 - April; 5 - May; 6 - June; 7 - July; 8 - August; 9 - September; 10 - October; 11 - November; 12 - December; 7–8 - July to August; 1–12 - annual average data, which do not correspond to a certain month.
La - Latitude in unit of decimal degrees.
Lo - Longitude in unit of decimal degrees.
Ca2+[mg/L] - Calcium in units of milligrams per litre (mg/L).
Mg2+[mg/L] - Magnesium in units of milligrams per litre (mg/L).
K+[mg/L] - Potassium in units of milligrams per litre (mg/L).
Na+[mg/L] - Sodium in units of milligrams per litre (mg/L).
Cl−[mg/L] - Chloride in units of milligrams per litre (mg/L).
SO42−[mg/L] - Sulfate in units of milligrams per litre (mg/L).
HCO3−[mg/L] - Hydrogen carbonate in units of milligrams per litre (mg/L).
TDS[mg/L] - Total dissolved solids (mg/L) calculated by the sum of seven major ions (Na+, K+, Ca2+, Mg2+, Cl−, SO42− and HCO3−).
pH - pH value of the sampled water.
River - Name of sampled river in English.
Basin - Basin to which the sampling location belongs. The basin boundaries are derived from the HydroBASINS shapefile40.
Subbasin - Subbasin to which the sampling location belongs. The subbasin boundaries are derived from the 6th-level HydroBASINS shapefile40, and the subbasin code corresponds to the ObjectID attribute of Subbasins_boundary.shp in this database.
T-annual[°C] - Annual 2-m air temperature (°C) for the corresponding year at the sampling location, which is calculated based on the gridded monthly average 2-m temperature data with a resolution of 0.5° obtained from the Climate Research Unit time series41 (CRU TS) v. 4.04.
T-monthly[°C] - Monthly average 2-m air temperature (°C) for the month of sampling at the corresponding location, which is sourced from the CRU TS v. 4.04 with 0.5° resolution41.
P-annual[mm] - Annual precipitation for the corresponding year at the sampling location in units of millimetres (mm), which is calculated based on monthly precipitation data with 0.5° grid resolution prepared by the Global Precipitation Climatology Centre (GPCC)42.
P-monthly[mm] - Monthly mean precipitation (mm) for the month of sampling at the corresponding location, which is sourced from the GPCC42 with a resolution of 0.5°.
Lithology - Lithology type of the sampling position based on the new global lithological map database GLiM44: mt – metamorphic rocks; pa - acid plutonic rocks; pb - basic plutonic rocks; sc - carbonate sedimentary rocks; sm - mixed sedimentary rocks; ss - siliciclastic sedimentary rocks; su - unconsolidated sediments; va - acid volcanic rocks; vb - basic volcanic rocks; vi - intermediate volcanic rocks. Blank means no lithological data are available, and wb represents a water body.
Permafrost type - 5 permafrost types based on the Northern Hemisphere Permafrost data45: 1 - continuous; 2 - discontinuous; 3 - sporadic; 4 - isolated patches; 5 - none. Blank means no permafrost data.
Citation - Data source. The numbers correspond to “No.” in Table 1: 1 - ArcticGRO20; 2 - GLORICH21; 3 - Georgiadi et al.22; 4 - Huh et al.24; 5 - Kuzmin et al.23; 6 - Huh and Edmond, 199925; 7 - Huh et al.26; 8 - Berkin et al.27; 9 - Bochkarev, 195917; 10 - Grebenshchikova et al.28; 11 - Sidorov, 199229; 12 - Shpakova, 199918; 13 - Gabysheva O.I.; 14 - Wang P..
IB – Ionic balance results checked by the charge balance method. Samples with absent ions are marked as “absent”; samples with an ion balance (IB) greater than 10 are marked as “imbalance”.
Note - Remarks during the data acquisition. The HCO3 concentrations that were calculated from alkalinity and determined by the charge balance method are marked as “cal_alk” and “cal_ib”, respectively.
Discharge[m3/s] - Daily discharge data (m3/s).
Ori_ID - The original sample ID from the data sources.
Li[mg/L] - Lithium in units of milligrams per litre (mg/L).
Sr[mg/L] - Strontium in units of milligrams per litre (mg/L).
As[mg/L] - Arsenic in units of milligrams per litre (mg/L).
Ba[mg/L] - Barium in units of milligrams per litre (mg/L).
Si[mg/L] - Silicon in units of milligrams per litre (mg/L).
87Sr/86Sr - 87Sr/86Sr ratios.
δ 18O-H2O[‰] - Oxygen isotope values of water in units of ‰.
δ 2H-H2O[‰] - Hydrogen isotope values of water in units of ‰.
-
2)
Samples_summary.csv
Basin - Basin to which the sampling location belongs. The basin boundaries are derived from the HydroBASINS shapefile40.
Attribute - water chemistry parameters include 7 major ions and the total dissolved solids in units of milligrams per litre (mg/L): Ca2+ - calcium; Mg2+ - magnesium; K+ - potassium; Na+ - sodium; Cl–chloride; SO42–sulfate; HCO3–hydrogen carbonate; TDS - total dissolved solids.
Statistical variable - The main statistical results for different hydrochemical parameters in each basin: Max - maximum; Min - minimum; Mean - average value; Std - standard deviation; n - number of samples.
Meteorology datasets
The datasets contain 3 files, tmp.csv, pre.csv and pet.csv, which are air temperature (T, °C), precipitation (P, mm/yr), and potential evaporation (PET, mm/yr) data, respectively. Each file has similar data with two main attributes: Subbasin ID (Subbasin) and yearly data average (1901 to 2019).
Subbasin ID - named by the unique code of the subbasin according ObjectID, including a total number of 218 subbasins.
Yearly data average - Named by the corresponding year of average annual temperature (precipitation or potential evaporation) within each subbasin from 1901 to 2019. We denote a missing value as −9999.
Code availability
Within the repository, we also provide code for extracting climate data of each subbasin from the Climatic Research Unit at the University of East Anglia (http://www.cru.uea.ac.uk/) and the Global Precipitation Climatology Centre (https://climatedataguide.ucar.edu/climate-data/gpcc-global-precipitation-climatology-centre) in the code folder.
♦ The shp folder contains 218 subbasin boundary shp files.
♦ The downloaded input data are stored in 3 nc files with annual average temperature, annual precipitation, and annual potential evaporation data (1901–2019) in yearmean_cru_ts4.04.1901.2019.tmp.dat.nc, yearmean_GPCC_1901–2019_05.nc, and yearmean_cru_ts4.04.1901.2019.pet.dat.nc, respectively.
♦ The code for extracting data from the .nc file to the .xlsx file was written in Python, and extract_tmp-nc_to_xlsx.py, extract_precip-nc_to_xlsx.py, and extract_pet-nc_to_xlsx.py were used to extract temperature, precipitation, and evaporation data, respectively.
♦ The output data will be stored in .xlsx format (multi_yr_tmp_subbasins6_1901–2019.xlsx, multi_yr_precip_subbasins6_1901–2019.xlsx, multi_yr_pet_subbasins6_1901–2019.xlsx for temperature, precipitation, and potential evaporation data, respectively) in folders tmp_clip_sub6, precip_clip_sub6 and PET_clip_sub6 after running the code.
Change history
10 January 2023
A Correction to this paper has been published: https://doi.org/10.1038/s41597-023-01927-4
References
Holmes, R. M. et al. Seasonal and Annual Fluxes of Nutrients and Organic Matter from Large Rivers to the Arctic Ocean and Surrounding Seas. Estuaries and Coasts 35, 369–382, https://doi.org/10.1007/s12237-011-9386-6 (2012).
Shiklomanov, A. et al. in Arct. Hydrology, Permafr. Ecosyst. (eds D., Yang & D. L., Kane) 703–738 (Springer International Publishing, 2021).
Feng, D. et al. Recent changes to Arctic river discharge. Nat. Commun. 12, 6917, https://doi.org/10.1038/s41467-021-27228-1 (2021).
Serreze, M. C. et al. The large-scale freshwater cycle of the Arctic. Journal of Geophysical Research: Oceans 111, https://doi.org/10.1029/2005jc003424 (2006).
White, D. et al. The arctic freshwater system: Changes and impacts. Journal of Geophysical Research: Biogeosciences 112, https://doi.org/10.1029/2006JG000353 (2007).
Rawlins, M. A. et al. Analysis of the Arctic System for Freshwater Cycle Intensification: Observations and Expectations. J. Clim. 23, 5715–5737, https://doi.org/10.1175/2010JCLI3421.1 (2010).
Labat, D., Goddéris, Y., Probst, J. L. & Guyot, J. L. Evidence for global runoff increase related to climate warming. Adv. Water Resour. 27, 631–642, https://doi.org/10.1016/j.advwatres.2004.02.020 (2004).
Wang, P. et al. Potential role of permafrost thaw on increasing Siberian river discharge. Environ. Res. Lett. 16, 034046, https://doi.org/10.1088/1748-9326/abe326 (2021).
Khatri, N. & Tyagi, S. Influences of natural and anthropogenic factors on surface and groundwater quality in rural and urban areas. Front. Life Sci. 8, 23–39, https://doi.org/10.1080/21553769.2014.933716 (2015).
Colombo, N. et al. Review: Impacts of permafrost degradation on inorganic chemistry of surface fresh water. Glob. Planet. Change 162, 69–83, https://doi.org/10.1016/j.gloplacha.2017.11.017 (2018).
Walvoord, M. A. & Striegl, R. G. Increased groundwater to stream discharge from permafrost thawing in the Yukon River basin: Potential impacts on lateral export of carbon and nitrogen. Geophys. Res. Lett. 34, https://doi.org/10.1029/2007gl030216 (2007).
Frey, K. E., Siegel, D. I. & Smith, L. C. Geochemistry of west Siberian streams and their potential response to permafrost degradation. Water Resources Research 43, https://doi.org/10.1029/2006WR004902 (2007).
Holmes, R. M. et al. Climate Change Impacts on the Hydrology and Biogeochemistry of Arctic Rivers. (2013).
Pokrovsky, O. S. et al. Impact of permafrost thaw and climate warming on riverine export fluxes of carbon, nutrients and metals in Western Siberia. Water (Switzerland) 12, https://doi.org/10.3390/w12061817 (2020).
Pokrovsky, O. S. et al. Permafrost coverage, watershed area and season control of dissolved carbon and major elements in western Siberian rivers. Biogeosciences 12, 6301–6320, https://doi.org/10.5194/bg-12-6301-2015 (2015).
Pokrovsky, O. S. et al. Trace element transport in western Siberian rivers across a permafrost gradient. Biogeosciences 13, 1877–1900, https://doi.org/10.5194/bg-13-1877-2016 (2016).
Bochkarev, P. F. Hydrochemistry of the rivers of Eastern Siberia [in Russian: Гидрохимия рек Восточной Сибири]. 155 (Irkutsk, Vostochno-Sibirskoe knizhnoe izdatel’stvo, 1959).
Shpakova, R. Formation of the water quality of the Lena River in the modern period (in Russian: Формирование качества воды реки Лена в современный период). Dissertation (Thesis) for the Degree of Candidate of Geographical Sciences (1999).
Huh, Y. The fluvial geochemistry of the rivers of Eastern Siberia and implications for the effect of climate on weathering. PhD Thesis, MIT/WHOI Joint Program in Oceanography, Cambridge, MA, p.236 (1998).
Holmes, R. M., McClelland, J. W., Tank, S. E., Spencer, R. G. M. & Shiklomanov, A. I. Arctic Great Rivers Observatory. Water Quality Dataset, Version 20211118. https://www.arcticgreatrivers.org/data. (2021).
Hartmann, J., Lauerwald, R. & Moosdorf, N. A Brief Overview of the GLObal RIver Chemistry Database, GLORICH. Procedia Earth Planet. Sci. 10, 23–27, https://doi.org/10.1016/j.proeps.2014.08.005 (2014).
Georgiadi, A. G., Tananaev, N. L. & Dukhova, L. A. Hydrochemical regime of the Lena River in August 2018. Okeanologiya [Oceanology] 59, 881–884, https://doi.org/10.31857/S0030-1574595881-884 (2019).
Kuzmin, M. I., Tarasova, E. N., Mamontova, E. A., Mamontov, A. A. & Kerber, E. V. Seasonal and interannual variations of water chemistry in the headwater streams of the Angara River (Baikal) from 1950 to 2010. Geochem. Int. 52, 523–532, https://doi.org/10.1134/S0016702914070040 (2014).
Huh, Y., Tsoi, M.-Y., Zaitsev, A. & Edmond, J. M. The fluvial geochemistry of the rivers of Eastern Siberia: I. tributaries of the Lena River draining the sedimentary platform of the Siberian Craton. Geochimica et. Cosmochimica Acta 62, 1657–1676, https://doi.org/10.1016/S0016-7037(98)00107-0 (1998).
Huh, Y. & Edmond, J. M. The fluvial geochemistry of the rivers of Eastern Siberia: III. Tributaries of the Lena and Anabar draining the basement terrain of the Siberian Craton and the Trans-Baikal Highlands. Geochimica et. Cosmochimica Acta 63, 967–987, https://doi.org/10.1016/S0016-7037(99)00045-9 (1999).
Huh, Y., Panteleyev, G., Babich, D., Zaitsev, A. & Edmond, J. M. The fluvial geochemistry of the rivers of Eastern Siberia: II. Tributaries of the Lena, Omoloy, Yana, Indigirka, Kolyma, and Anadyr draining the collisional/accretionary zone of the Verkhoyansk and Cherskiy ranges. Geochimica et. Cosmochimica Acta 62, 2053–2075, https://doi.org/10.1016/S0016-7037(98)00127-6 (1998).
Berkin, N. С., Makarov, A. А. & Rusinek, O. Т. Baikal studies: textbook (in Russian). 291 (Irkutsk: Irkutsk State University Press, 2009).
Grebenshchikova, V. I., Zagorulko, N. A. & Pastukhov, M. V. Monitoring of the ionic composition of water at the Angara river from lake Baikal (in Russian: Мониторинговые исследования ионного состава воды истока р. Ангары (озеро Байкал)). Water: Chem. Ecol. 4, 2–8 (2011).
Sidorov, I. S. The special features in the formation of the hydr~hemicai regime of the Lena mouth area and the southeastern part of the Laptev Sea (in Russian: Особенности формирования гидрохимического режима устьевой области р. Лена и юго-восточной части моря Лаптевых). Summary of Candidate of Geograph. Sci. Thesis, Rostov-on-Don, 26 pp. (1992).
Semenov, A. D. E. Manual on chemical analysis of terrestrial surface water (in Russian: Руководство по химическому анализу поверхностных вод суши). Gidrometeoizdat: Leningrad, Russia, 541 p. (1977).
Lesch, S. M. Technical Note: A Short Note on Calculating the Adjusted SAR Index. T Asabe v. 52, 493-496-2009 v.2052 no.2002 (2009).
Whittemore, D. O. Geochemical identification of saline water from an irrigation well in north-central Stafford County, Kansas. 6 (Kansas Geological Survey, Lawrence, KS, United States, 2012).
WMO. Manual for the GAW Prec data quality objectives and standard operating procedures. (WMO, 2004).
Kim, H.-Y. Statistical notes for clinical researchers: assessing normal distribution (2) using skewness and kurtosis. rde 38, 52–54, https://doi.org/10.5395/rde.2013.38.1.52 (2013).
Hopkins, K. D. & Weeks, D. L. Tests for Normality and Measures of Skewness and Kurtosis: Their Place in Research Reporting. Educ. Psychological Meas. 50, 717–729, https://doi.org/10.1177/0013164490504001 (1990).
Tukey, J. W. Exploratory Data Analysis. (Addison-Wesley, Reading, MA, 1977).
Seo, S. A review and comparison of methods for detecting outliers in univariate data sets, University of Pittsburgh, (2006).
Saleem, S., Aslam, M. & Shaukat, M. R. J. P. J. O. S. A Review and Empirical Comparison of Univariate Outlier Detection Methods. Pak. J. Stat. 37, 447–462 (2021).
Langroudi, G. & Panjalizadeh Marseh, B. Detection of Outlier in Flood Observations: A Case Study of Tamer Watershed. Res. J. Recent. Sci. 4, 150–153 (2015).
Lehner, B. & Grill, G. Global river hydrography and network routing: baseline data and new approaches to study the world’s large river systems. Hydrol. Process. 27, 2171–2186, https://doi.org/10.1002/hyp.9740 (2013).
Harris, I., Osborn, T. J., Jones, P. & Lister, D. Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset. Sci. Data 7, 109, https://doi.org/10.1038/s41597-020-0453-3 (2020).
Schneider, U., Becker, A., Finger, P., Rustemeier, E. & Ziese, M. GPCC full data monthly product version 2020 at 0.5°: monthly land-surface precipitation from rain-gauges built on GTS-based and historical data. Global Precipitation Climatology Centre at Deutscher Wetterdienst: Offenbach, Germany, https://doi.org/10.5676/DWD_GPCC/FD_M_V2020_050 (2020).
Wang, P. et al. Water chemistry dataset in eastern Siberian rivers, figshare, https://doi.org/10.6084/m9.figshare.c.5831975.v1 (2022).
Hartmann, J. & Moosdorf, N. The new global lithological map database GLiM: A representation of rock properties at the Earth surface. Geochemistry, Geophysics, Geosystems 13, https://doi.org/10.1029/2012GC004370 (2012).
Obu, J. et al. Northern Hemisphere permafrost map based on TTOP modelling for 2000–2016 at 1 km2 scale. Earth-Science Rev. 193, 299–316, https://doi.org/10.1016/j.earscirev.2019.04.023 (2019).
Acknowledgements
This research was funded by the NSFC-RSF (42061134017 and 21-47-00008), the National Natural Science Foundation of China (42001098), the Science and Technology Basic Resources Investigation Programme of China (No. 2017FY101302), the Strategic Priority Research Programme of the Chinese Academy of Sciences (No. XDA2003020101), and the state assignment of the Ministry of Science and Higher Education of the Russian Federation (theme No. 0297-2021-0026, reg. No. AAAA-A21-121012190036-6). The authors gratefully acknowledge the Chief Editor, Guy Jones, the Handing Editor, Julien Emile-Geay, and two anonymous reviewers for their valuable comments and suggestions that have led to the substantial improvement of the earlier version of this manuscript.
Author information
Authors and Affiliations
Contributions
P.W., S.L. and Q.H. designed and constructed the database and wrote the manuscript. O.G., Z.L., J.Z., E.K., Y.L., T. B. and R.S. performed database proofing. V.G., S.P. and N.F. assisted in database design and manuscript preparation.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Liu, S., Wang, P., Huang, Q. et al. A database of water chemistry in eastern Siberian rivers. Sci Data 9, 737 (2022). https://doi.org/10.1038/s41597-022-01844-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-022-01844-y