Abstract
Simulating the carbon-water fluxes at more widely distributed meteorological stations based on the sparsely and unevenly distributed eddy covariance flux stations is needed to accurately understand the carbon-water cycle of terrestrial ecosystems. We established a new framework consisting of machine learning, determination coefficient (R2), Euclidean distance, and remote sensing (RS), to simulate the daily net ecosystem carbon dioxide exchange (NEE) and water flux (WF) of the Eurasian meteorological stations using a random forest model or/and RS. The daily NEE and WF datasets with RS-based information (NEE-RS and WF-RS) for 3774 and 4427 meteorological stations during 2002–2020 were produced, respectively. And the daily NEE and WF datasets without RS-based information (NEE-WRS and WF-WRS) for 4667 and 6763 meteorological stations during 1983–2018 were generated, respectively. For each meteorological station, the carbon-water fluxes meet accuracy requirements and have quasi-observational properties. These four carbon-water flux datasets have great potential to improve the assessments of the ecosystem carbon-water dynamics.
Similar content being viewed by others
Background & Summary
The eddy-covariance flux stations provide reliable ecosystem-scale measurements of the carbon and energy fluxes at a high temporal resolution1. They have become crucial tools to generate observation datasets to verify and benchmark the Earth surface models2,3. In particular, it is possible to construct a carbon-water flux simulation model from the station-scale to the regional- or global-scale by means of a large-scale eddy covariance4 measurement network (e.g. Fluxnet, AmeriFlux and ChinaFlux). However, the existing flux stations are sparsely and unevenly distributed and yield rather discontinuous observation data1. This restricts studies on the carbon-water fluxes at a large-scale3, for example in Eurasia, where a strong spatial heterogeneity is exhibited on complex terrains. The meteorological stations, in contrast, are densely spread around the world with long-term continuous observation data5, which could have great potential to mine the more extensive carbon-water flux information, particularly combined with machine learning (ML) and remote sensing (RS). This could greatly offset the limitations of the flux station-based observations.
Machine learning is increasingly used to extract the patterns and insights from big geospatial data6. Many studies have focused on the comparative evaluation of different ML algorithms and have found the accuracy performance of the same algorithm varies in different research contexts7,8,9,10. The data-driven ML algorithms are similar to the encapsulated complex empirical algorithms, which demonstrate a high simulation accuracy3,11. But the ML algorithms are still influenced by the quality, processing methods, and spatio-temporal representativeness of the data12,13,14. Compared with the process-based land surface or ecosystem models, the ML has a higher carbon-water flux simulation accuracy1,6. However, when transferred to other site or regional or spatial (grid) scales, the applicability of both the ML models and process models need to be evaluated due to the distinct spatio-temporal heterogeneity. That is to say, there is no guarantee that these models are applicable to all sites, grids or regions. If this evaluation of the model applicability is not considered, the simulation results will generate new uncertainties. This issue has become a major problem affecting the simulation accuracy of the carbon-water fluxes at different scales.
In this study, ML with flux observations was used to build carbon-water flux simulation models (random forest model, RFM) to simulate the carbon-water fluxes of the meteorological stations in Eurasia. We proposed a framework to evaluate the applicability of the model transfer and to build a bridge from the flux stations to the meteorological stations. We used this framework to generate four carbon-water flux datasets for the Eurasian meteorological stations. Due to the high precision, these datasets could be regarded as quasi-observational at the site level, which might be used to assess the simulation accuracy of the regional- or global-scale ecosystem carbon-water fluxes based on the ecosystem or land surface or remote sensing or atmospheric inversion models. Our study can, therefore, benefit terrestrial water management and carbon dynamic assessments.
Methods
The RFM was constructed based on the Eurasian flux stations. We built a total of 3,600 RFMs at site scale in accordance with the classification of the flux stations. The simulation accuracy of these RFMs at each flux station in the test set was validated by the spatial cross-validation, thus generating thousands of determination coefficients (R2) at test stations. According to the third law of geography15, the factors (variables) similarity between the test station and the training set of the RFM determines the similarity between their fluxes, that is, the R2 of the RFM at the test station are determined. The similarity between the datasets composed of the same factors could be characterized by the Euclidean distance. Based on the R2 and Euclidean distance, the R2 simulation model (RSM) was built by using multiple linear regression (MLR) to evaluate the applicability of RFM on meteorological stations. So that the RFMs can be reasonably transferred to meteorological stations to simulate the carbon-water fluxes. Figure 1 shows the detailed flowchart of the data processing, RFM construction and RFM transfer to the meteorological stations.
Data processing
We selected 156 flux stations in Eurasia from five different landscape types (Fig. 2a), as detailed in the flux station information file16. For the flux stations from the National Tibetan Plateau Data Center (NTPDC)17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133 and European Fluxes Database Cluster (EFDC)134,135,136,137 (http://www.europe-fluxdata.eu/home), the flux data from one hour before (and after) rainfall were excluded. The data collected at 10-min or 30-min intervals were interpolated using the marginal distribution sampling (MDS) method in REddyProc138. All final data were converted into daily data. For the flux stations from FLUXNET139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162, the data were extracted with quality control values ≥ 0.8 for the net ecosystem carbon dioxide exchange (NEE) and latent heat fluxes (LE). The water fluxes (WF) were converted from LE (W/m2) with a conversion factor of 0.408 × 10−6 163,164,165. For the selected 6856 meteorological stations from the Global Surface Summary of the Day dataset in the National Centers for Environmental Information (https://www.ncei.noaa.gov/metadata/geoportal/rest/metadata/item/gov.noaa.ncdc%3AC00516/html#), the vapour pressure deficit variable was calculated using the air temperature and dew point temperature. The downward shortwave radiation (DSR) of meteorological stations for 2002–2020 and 1983–2018 were extracted from the GLASS dataset166,167 and the dataset of high-resolution global surface solar radiation168,169 from the NTPDC, respectively. For the remote sensing (RS) variables (including the fraction of the photosynthetically active radiation extracted from the MCD15A3H data170, enhanced vegetation index, land surface water index and surface reflectance for the Moderate Resolution Imaging Spectroradiometer bands 1–7 extracted from the MOD09GA data171), a linear interpolation was carried out for the missing data with continuous missing days < 8165,172. Terrain and soil variables were extracted from the MERIT DEM data173 and the HWSD data174, respectively. High quality RS variables, terrain variables and soil variables averaged over a 500-meter spatial extent centered on the station were integrated into the flux stations and meteorological stations (Table S1).
Due to the significant spatial heterogeneity of the earth’s surface, the flux stations and meteorological stations were divided into nine categories according to the following four strategies. The first is based on the International Geosphere-Biosphere Programme classification from the MCD12Q1 data175, including Wetland (i.e. permanent wetlands), Cropland (i.e. croplands and cropland/natural vegetation mosaics), Grassland (i.e. grasslands, savannas, woody savannas) and Forest (i.e. evergreen needleleaf forests, evergreen broadleaf forests, deciduous needleleaf forests, deciduous broadleaf forests, and mixed forests). The second is based on the continents, e.g. Asia and Europe. The third is the arid and non-arid regions classified by the dryland dataset176. This dataset identified areas with an aridity index (AI) less than 0.65 as drylands, which were described as arid in this study, and the remaining areas (i.e. AI ≥ 0.65) were classified as non-arid regions. The fourth comprises entire Eurasia, that is, overall. We used the datasets of nine categories from flux stations as input to RFM, and the detail is presented in the division of flux stations file177.
RFM construction
In this work, the random forest method6,178 was used to construct the RFM using the scikit-learn library in Python 3.7.6 (https://pypi.org/project/scikit-learn/1.0.1/). The RFM is a combination model based on independent regression trees179, of which the predictions were made by averaging the results across all regression trees. The random-search optimizer180 was applied to identify the optimal hyperparameter settings for the RFM (Table S2). In addition, the most important step in the RFM construction is the k-fold cross-validation (CV)181. Suppose the flux station dataset is composed of data from n flux stations and it might generate k sub datasets (D1, D2, …, Dk) if it is equally divided into k parts. Here, each sub dataset is a test set, which is composed of data from the m = n/k flux stations, while the remaining sub datasets constitute training sets, which are composed of data from the (n-m) flux stations. Thus, each training set could be used to establish the RFM for the flux simulation; consequently, a total of k models (RFM1, RFM2, …, RFMk) were built. Each RFM is tested and verified at each flux station in the test set. Furthermore, the R2 is calculated for each case. Hence, the R2 amount that could be generated after each RFM training and test is given by k × m = k × n/k = n. The flux station dataset (n) requires multiple (p) repetitions of the k-fold division to avoid the contingency of the station division. In this way, k × p RFMs could be constructed and the number of R2 that could be generated is n × p.
According to the above-mentioned principles, we used the 10-time (split) 10-fold CV (i.e. p = 10 and k = 10) to set up 100 RFMs for each of the nine categories under two scenarios: with and without RS variables in the model building (i.e. RFM-RS and RFM-WRS), respectively. That is, we constructed 900 RFM-RS models to simulate NEE and WF, respectively. And we also constructed 900 RFM-WRS models to simulate NEE and WF, respectively. Each RFM was validated at each flux station in the corresponding test set, and the R2 was generated to represent the validation accuracy of the RFM. The R2 also represents the applicability of these models in the test flux stations. The higher the R2 of the model on the test flux station, the more suitable the model is for the specific flux station, that is, the more similar the data characteristics of the training set for building the model are to the data characteristics of the flux station according to the third law of geography.
RFM transfer to the meteorological stations
We screened available RFMs for meteorological stations by using the RSM, which was used to evaluate the RFM applicability on the meteorological stations. The framework for the evaluation was designed (Fig. 1) as follows:
① Euclidean distances of the influencing factors between test flux stations and training sets of RFMs.
The R2 of each RFM is determined by the similarity of the influencing factors between the training set and the test set181. This could be characterized by the Euclidean distance182.
For a specific factor affecting the RFM, the Euclidean distance (ds) between a flux station in the test set (test station) and a flux station in the training set (training station) is expressed as:
where x represents one of the factors influencing the carbon-water fluxes in the training station (Table S1), y denotes the corresponding influencing factor in the test station, and t is the sample size of the factor. The data of the training station and test station were averaged on the same day (day of the year, DOY) for multiple years, respectively. Then, the two stations could be matched day by day based on DOY to ensure that these have the same daily data sample size.
For the factor j influencing the RFM, the final Euclidean distance (d) is the average of all n-m ds between the test station and each of the n-m training stations, which is the Euclidean distance (dj) of the factor j between this test station and the training set of the RFM (Eq. 2). In the same way, the Euclidean distances of all influencing factors are produced and denoted as d1, …, dw-1, dw. In this way, the R2 of the RFM tested in the test station from the test set and the Euclidean distances d1, …, dw-1, dw between this test station and the training set for building the RFM constitute a complete data sample (Fig. S3). Similarly, all test stations could generate such samples, which constitute a dataset with a quantity equal to 10 × n × 9. Samples of the same category are integrated into one dataset, that is, nine datasets produced under nine categories, i.e. Dataset 1 in Fig. 1.
where n represents the number of the flux stations of each category and m illustrates the number of the flux stations in each test set.
② Construction of the RSM.
Based on Dataset 1, the RSM is constructed using MLR183 for nine categories under NEE and WF scenarios, of which each one is expressed as:
where a0, a1, a2, …, aw-1, aw are regression coefficients and d1, d2, …, dw-1, dw indicate the Euclidean distances of the factors influencing the carbon-water fluxes between the test station and the training set.
③ Euclidean distances of the influencing factors between meteorological stations and training sets of RFMs.
The same processes of ① are applied to the meteorological stations so as to calculate the Euclidean distance for the influencing factors between each RFM training set and meteorological station for each category under two scenarios, which yields a large dataset, i.e. Dataset 2 in Fig. 1.
④ Prediction of the R2 of the RFM transfer to the meteorological stations.
Before a RFM is transferred to a specific meteorological station, the RSM could predict the R2 value on the station using Dataset 2 in Fig. 1. Only if predicted R2 ≥ 0.5, its corresponding RFM might be transferred to the corresponding meteorological stations. Otherwise, the RFM was assumed to be not applicable to the meteorological station. If there was more than one RFM applicable to a meteorological station, the RFM corresponding to the maximum predicted R2 was screened as the model that could be linked to the meteorological station. Not all meteorological stations did have a RFM which is applicable to the meteorological station.
⑤ Carbon-water flux simulation of the meteorological stations.
For the meteorological stations in Eurasia that could be linked with an applicable RFM, the corresponding RFM can be used to simulate the daily carbon-water fluxes and to build high-precision carbon-water flux datasets of the Eurasian meteorological stations to analyze the carbon-water dynamics. These datasets184 consist of two essential building blocks: (i) datasets related to remote sensing, including the net ecosystem carbon dioxide exchange (NEE-RS) and water fluxes (WF-RS) simulated by the RFM-RS; (ii) the net ecosystem carbon dioxide exchange (NEE-WRS) and water fluxes (WF-WRS) simulated by the RFM-WRS.
Data Records
Our carbon-water flux datasets184 are available at figshare (https://doi.org/10.6084/m9.figshare.21347721.v3). The data record contains two daily carbon dioxide flux datasets (NEE-RS and NEE-WRS) and two daily water flux datasets (WF-RS and WF-WRS) of the Eurasian meteorological stations. The coverage period of the NEE-RS and WF-RS has been recorded from 2002 to 2020 and the one of NEE-WRS and WF-WRS from 1983 to 2018. The data of each meteorological station was deposited separately in the CSV file format under the dataset folders. The file name indicates the identification number of the meteorological station in the meteorological station information file185 (https://doi.org/10.6084/m9.figshare.23695920.v2). The list of flux stations used in this study and the details of flux station division used for the RFM construction are shown in the flux station information file16 (https://doi.org/10.6084/m9.figshare.23899701.v1) and the division of flux stations file177 (https://doi.org/10.6084/m9.figshare.23695980.v2) stored at figshare, respectively. In addition, the details of the RSM construction are presented in the RSMs information file186 (https://doi.org/10.6084/m9.figshare.23899785.v1) deposited at figshare. The file specific fields are as follows:
Carbon-water flux datasets file (.csv)
-
(1)
id: Identification of the meteorological station.
-
(2)
lon: Longitude of the meteorological station.
-
(3)
lat: Latitude of the meteorological station.
-
(4)
year: Year of the data record.
-
(5)
month: Month of the data record.
-
(6)
day: Day of the data record.
-
(7)
doy: Day of the year.
-
(8)
NEE: Net ecosystem carbon dioxide exchange (g C m−2 d−1).
-
(9)
WF: Water fluxes (mm d−1).
Meteorological station information file (.xlsx)
-
(1)
Identification of meteorological station.
-
(2)
Station name: Name of the meteorological station.
-
(3)
Longitude: Longitude of the meteorological station.
-
(4)
Latitude: Latitude of the meteorological station.
-
(5)
Elevation: Elevation of the meteorological station (m).
-
(6)
Continent: Continent of the meteorological station.
-
(7)
Drought situation: Drought situation of the meteorological station.
-
(8)
Landscape: Landscape of the meteorological station.
-
(9)
Data source: Data source of the meteorological station.
-
(10)
Classification of simulated NEE-RS: Accuracy classification of NEE-RS for meteorological stations (1: low quality, R2 < 0.5; 2: moderate quality, 0.5 ≤ R2 < 0.7; 3: high quality, R2 ≥ 0.7).
-
(11)
Classification of simulated NEE-WRS: Accuracy classification of NEE-WRS for meteorological stations (1: low quality, R2 < 0.5; 2: moderate quality, 0.5 ≤ R2 < 0.7; 3: high quality, R2 ≥ 0.7).
-
(12)
Classification of simulated WF-RS: Accuracy classification of WF-RS for meteorological stations (1: low quality, R2 < 0.5; 2: moderate quality, 0.5 ≤ R2 < 0.7; 3: high quality, R2 ≥ 0.7).
-
(13)
Classification of simulated WF-WRS: Accuracy classification of WF-WRS for meteorological stations (1: low quality, R2 < 0.5; 2: moderate quality, 0.5 ≤ R2 < 0.7; 3: high quality, R2 ≥ 0.7).
Flux station information file (.xlsx)
-
(1)
Identification of flux stations.
-
(2)
Flux station: Name of the flux station.
-
(3)
Longitude: Longitude of the flux station.
-
(4)
Latitude: Latitude of the flux station.
-
(5)
Elevation: Elevation of the flux station (m).
-
(6)
Continent: Continent of the flux station.
-
(7)
Drought situation: Drought situation of the flux station.
-
(8)
Landscape: Landscape of the flux station.
-
(9)
Study period: Study period of the flux station used in this study.
-
(10)
Data source: Data source of the flux station.
Division of flux stations file (.xlsx)
-
(1)
Categories: Category of the flux station.
-
(2)
Number of flux station: Number of flux stations under each category.
-
(3)
Split: Identification of divisions for 10-fold cross-validation on flux stations.
-
(4)
Fold: Identification of folds for cross-validation on flux stations.
-
(5)
Identification of flux station: List of identifications for flux stations under each fold.
RSMs information file (.xlsx)
-
(1)
Models: Name of the RSM.
-
(2)
Categories: Category of the RSM.
-
(3)
N: Number of samples used by the RSM.
-
(4)
R2rsm: Determination coefficient of the RSM.
-
(5)
Adj. R2rsm: Adjusted determination coefficient of the RSM.
-
(6)
F-statistic: F-statistic of the RSM.
-
(7)
P value: Significance probability of the RSM.
-
(8)
RSMs: Equation of the RSM.
Technical Validation
Model validation
The R2 and RMSE (root mean square error) were used to evaluate the performance of the RFM to simulate the NEE-RS (NEE simulated by RFM-RS), WF-RS (WF simulated by RFM-RS), NEE-WRS (NEE simulated by RFM-WRS) and WF-WRS (WF simulated by RFM-WRS)184. The model’s simulation accuracy for the WF was much higher than for NEE under each category and the performance of the RFM-RS was also better than the RFM-WRS (Fig. 2b, Table S3). For the WF simulation of the RFM-RS and RFM-WRS under each category, the percentage of the models with R2 ≥ 0.5 in the test sets was larger than 70%, while for the NEE simulation, it was lower than 50% (Fig. 2b). For the category ‘overall’, the RFM generally indicated a high simulation accuracy (Fig. 2b, Table S3). The simulation accuracy of the RFM was generally higher in Asia and the arid regions than in Europe and the non-arid regions. For the cropland and forest, the RFMs demonstrated the highest simulation accuracy under the scenarios of NEE-RS and NEE-WRS; for the grassland and wetland, the RFMs demonstrated the highest simulation accuracy under the WF-RS and WF-WRS scenarios.
The box plots (Fig. 3, Fig. S1) present the simulation performance of the RFM for NEE and WF in 10-time 10-fold CVs, with each box representing the R2 distribution for the test flux stations in each split (time). The simulation accuracy of the same RFM for different test flux stations varied widely, indicating that the RFM cannot be applied to all flux stations and that not all stations could match at least one available RFM model. The maximum R2 distribution for each flux station was observed in a 10-time 10-fold CV (Table S4). The proportion of the flux stations with R2 ≥ 0.5 of the RFM test measured 60.9%, 46.2%, 89.7% and 88.5% under the NEE-RS, NEE-WRS, WF-RS and WF-WRS scenarios, respectively.
Accuracy pre-assessment of the carbon-water flux simulation at the meteorological stations
Using the MLR model in which the Euclidean distances were independent variables and R2 concerned a dependent variable, the RSMs were constructed for different categories under different scenarios, as shown in the RFMs information file186. By using the RSM to simulate the R2 of the RFM at the test flux stations, the overall accuracies (of the RSM) for a correct classification of R2 under nine categories amounted to 80.1%, 84.0%, 91.0% and 89.1% for the NEE-RS, NEE-WRS, WF-RS and WF-WRS, respectively (Fig. S2). This might prove that the RSM is reliable and could be utilized to predict the accuracy of the RFMs applied to the meteorological stations.
Finally, the RFMs were transferred to all meteorological stations in Eurasia and the R2 was predicted for every meteorological station for each category under two scenarios (Fig. 4a–d, Table S5). In this study, the criteria for screening the RFM imply that the RFM corresponding to the highest predicted R2 of a given meteorological station and its R2 ≥ 0.5 was screened as the simulation model of the carbon-water fluxes for the meteorological station. The percentages of the meteorological stations in Eurasia were 84.5%, 68.2%, 99.1% and 98.7% for the NEE-RS, NEE-WRS, WF-RS and WF-WRS, respectively, in which the RFMs met the above-mentioned criteria (Fig. 4e). The RFMs have much higher applicable percentages and seem more accurate for the WF simulation than the NEE simulation at the meteorological stations. The RFM models of the forest and grassland categories were highly applicable and more accurate regarding the NEE and WF simulation of the meteorological stations than for cropland or wetland (Table S5).
The input data for the RFM were primarily derived from the meteorological stations’ observations and remote sensing data. Moreover, machine learning models (such as RFM) have the advantage and the predictive ability in the non-linear relation fitting and have been proven in the application research of relevant geoscience6, which is generally superior to linear regression, ecosystem process models, remote sensing inversion models, etc. Therefore, the carbon-water flux datasets of the meteorological stations generated in this study demonstrate a relatively high accuracy and constitute the attribute of quasi-observation, which might be considered as a quasi-observational dataset. They could be applied as benchmark data to verify the simulation results produced by the process-based models or remote sensing inversion models related to the carbon-water fluxes, which overcomes the challenge of insufficient observational data on the carbon-water fluxes165,187.
Spatio-temporal patterns of the Eurasian NEE and WF
We have further investigated the spatio-temporal distribution of the mean daily values of NEE and WF simulated by the RFM-RS during the period March-November from 2003 to 2020 and simulated by the RFM-WRS from 1984 to 2018. The meteorological stations with at least 30 data volumes of each spring, summer and autumn were used for statistical analysis. The mean daily values of the NEE-RS, NEE-WRS, WF-RS and WF-WRS at the meteorological stations are −3.9~0.7 g C m−2 d−1, −2.6~0.4 g C m−2 d−1, 0.8~3.8 mm d−1 and 0.5~4.3 mm d−1, respectively (Fig. 5a,b,d,e). The spatial distribution of these mean daily NEE fluxes reveals that the ecosystem carbon dioxide loss had increased in Eurasia during 2003–2020, with 457 carbon dioxide loss stations during this period, which means 178 more than from 1984 to 2018 (Fig. 5a,b). The daily average NEE (generally presented as net carbon dioxide uptake) has shown an increasing trend from 1984 to 2002, while a slightly decreasing tendency from 2003 to 2020, with slight fluctuations during these two periods (Fig. 5c). The temporal variation of the WF has demonstrated a rising trend with a distinct fluctuation from 1984 to 2020 (Fig. 5f). The differences between RS and WRS products might be caused by the differences in the input DSR dataset, the RFMs, and the number of meteorological stations (Fig. 5c,f).
Comparison of the NEE and WF with other carbon-water flux products
We further compared the NEE and WF datasets with those from FLUXCOM1,3, GOSAT L4A (https://data2.gosat.nies.go.jp/) and MODIS (MOD16A2 Version 6)188 (Fig. 6, Table S6). The WF from the other products were converted from the LE. Our NEE and WF datasets, as well as the fluxes from the other products, were converted into a monthly scale. The months with the same number of stations for each product were selected for a comparison. All data show a similar seasonal variation, with high carbon-water fluxes in summer and low during winter (Fig. 6c,d). The distributions of the carbon-water fluxes from FLUXCOM and NEE from GOSAT are relatively discrete (Fig. 6a,b). The multi-year monthly averages of the NEE (NEE-RS = −0.31 g C m−2 d−1 and NEE-WRS = −0.34 g C m−2 d−1) and WF (WF-RS= +1.57 mm d−1 and WF-WRS= +1.48 mm d−1) simulated herein are less than those from FLUXCOM (NEE = −0.61 g C m−2 d−1, and WF= +1.79 mm d−1), whereas the averages are larger than those from GOSAT (NEE = −0.20 g C m−2 d−1) and MODIS (WF= +1.51 mm d−1). The WF from MODIS were almost consistent with our results (Fig. 6d). Because of the fact that the carbon-water flux datasets of the meteorological stations (generated by the RFM in this study) could be considered as “quasi-observational data”, the Eurasian carbon-water fluxes from FLUXCOM may be overestimated, while the NEE from GOSAT could rather be underestimated (Fig. 6c,d).
Code availability
The code189 to generate the carbon-water flux datasets is available at figshare (https://doi.org/10.6084/m9.figshare.21510183.v2).
References
Jung, M. et al. Scaling carbon fluxes from eddy covariance sites to globe: Synthesis and evaluation of the FLUXCOM approach. Biogeosciences 17, 1343–1365 (2020).
Ciais, P. et al. Five decades of northern land carbon uptake revealed by the interhemispheric CO2 gradient. Nature 568, 221–225 (2019).
Jung, M. et al. The FLUXCOM ensemble of global land-atmosphere energy fluxes. Sci. Data 6, 1–14 (2019).
Jung, M. et al. Compensatory water effects link yearly global land CO2 sink changes to temperature. Nature 541, 516–520 (2017).
Wang, R. et al. Recent increase in the observation-derived land evapotranspiration due to global warming. Environ. Res. Lett. 17, 024020 (2022).
Reichstein, M. et al. Deep learning and process understanding for data-driven Earth system science. Nature 566, 195–204 (2019).
Li, X. et al. Intercomparison of six upscaling evapotranspiration methods: From site to the satellite pixel. J. Geophys. Res.-Atmos. 123, 6777–6803 (2018).
Shi, H. et al. Evaluation of water flux predictive models developed using eddy-covariance observations and machine learning: a meta-analysis. Hydrol. Earth Syst. Sci. 26, 4603–4618 (2022).
Shi, H. et al. Variability and uncertainty in flux-site-scale net ecosystem exchange simulations based on machine learning and remote sensing: a systematic evaluation. Biogeosciences 19, 3739–3756 (2022).
Xu, T. et al. Evaluating different machine learning methods for upscaling evapotranspiration from flux towers to the regional scale. J. Geophys. Res.-Atmos. 123, 8674–8690 (2018).
Bzdok, D., Nichols, T. E. & Smith, S. M. Towards algorithmic analytics for large-scale datasets. Nat. Mach. Intell. 1, 296–306 (2019).
Fatima, M. & Pasha, M. Survey of machine learning algorithms for disease diagnostic. J. Intell. Learn. Syst. Appl. 9, 1 (2017).
Ghahramani, Z. Probabilistic machine learning and artificial intelligence. Nature 521, 452–459 (2015).
Jordan, M. I. & Mitchell, T. M. Machine learning: Trends, perspectives, and prospects. Science 349, 255–260 (2015).
Zhu, A. X., Lu, G., Liu, J., Qin, C. Z. & Zhou, C. Spatial prediction based on Third Law of Geography. Ann. GIS 24, 225–240 (2018).
Xie, M. Flux station information. figshare https://doi.org/10.6084/m9.figshare.23899701.v1 (2023).
Che, T., Xu, Z., Ren, Z., Tan, J. & Zhang, Y. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Zhangye wetland station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270677 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Huazhaizi desert steppe station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270680 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (an observation system of Meteorological elements gradient of Sidaoqiao Superstation, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270698 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of mixed forest station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270681 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of mixed forest station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270689 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Huazhaizi station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270684 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Sidaoqiao superstation, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270685 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Zhangye wetland station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270690 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of mixed forest station, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.271403 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (an observation system of Meteorological elements gradient of Sidaoqiao Superstation, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.271542 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Sidaoqiao superstation, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Geogra.tpdc.271440 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Zhangye wetland station, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Geogra.tpdc.271437 (2021).
Liu, S., Che, T., Xu, Z., Ren, Z. & Zhang, Y. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Zhangye wetland station, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Geogra.tpdc.271441 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of mixed forest station, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.271399 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (an observation system of Meteorological elements gradient of A’rou Superstation, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270694 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of A’rou Superstation, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270691 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Dashalong station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270757 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Dashalong station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270687 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Jingyangling station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270682 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Jingyangling station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270688 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Yakou station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270678 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Yakou station, 2019). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270686 (2020).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (an observation system of Meteorological elements gradient of A’rou Superstation, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.271415 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of A’rou Superstation, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.271406 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Jingyangling station, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.271400 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Jingyangling station, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Geogra.tpdc.271409 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Yakou station, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.271398 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Yakou station, 2020). National Tibetan Plateau Data Center https://doi.org/10.11888/Geogra.tpdc.271408 (2021).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Dashalong station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270779 (2019).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an automatic weather station of Sidaoqiao mixed forest station, 2013). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.184.2014.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an automatic weather station of Sidaoqiao mixed forest station, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.261.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of Hydrometeorological observation network (an automatic weather station of Sidaoqiao mixed forest station, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.318.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of mixed forest station, 2013). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.197.2014.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of mixed forest station, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.241.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of mixed forest station, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.301.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydro-meteorological observation network (automatic weather station of Huazhaizi Desert Steppe Station, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.257.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Huazhaizi desert steppe station, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.325.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Huazhaizi desert Station, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.240.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Huazhaizi desert station, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.305.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an observation system of meteorological elements gradient of Sidaoqiao superstation, 2013). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.187.2014.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an observation system of Meteorological elements gradient of Sidaoqiao Superstation, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.264.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an observation system of meteorological elements gradient of Sidaoqiao Superstation, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.321.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Sidaoqiao superstation, 2013). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.200.2014.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Sidaoqiao superstation, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.245.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Sidaoqiao superstation, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.304.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Zhangye wetland station, 2013). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.192.2014.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Zhangye wetland station, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.265.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Zhangye wetland station, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.327.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Zhangye wetland station, 2013). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.206.2014.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Zhangye wetland Station, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.307.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an automatic weather station of Sidaoqiao mixed forest station, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.461.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of mixed forest station, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.450.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Huazhaizi desert steppe station, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.459.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Huazhaizi desert station, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.448.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an observation system of meteorological elements gradient of Sidaoqiao Superstation, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.463.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Sidaoqiao superstation, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.451.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Zhangye wetland station, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.465.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Zhangye wetland Station, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.453.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an automatic weather station of Sidaoqiao mixed forest station, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.21.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of mixed forest station, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.10.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Huazhaizi desert steppe station, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.19.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Huazhaizi desert station, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.8.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of Hydrometeorological observation network (an observation system of Meteorological elements gradient of Sidaoqiao Superstation, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.23.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Sidaoqiao superstation, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.11.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Zhangye wetland station, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.25.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Zhangye wetland Station, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.13.2018.db (2018).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of mixed forest station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270771 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of mixed forest station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270785 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of the Heihe River Basin integrated observatory network (automatic weather station of Huazhaizi desert steppe station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270773 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Huazhaizi station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270787 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of the Heihe River Basin integrated observatory network (automatic weather station of Jingyangling station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270770 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (an observation system of meteorological elements gradient of Sidaoqiao superstation, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270780 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of the Heihe River Basin integrated observatory network (eddy covariance system of Sidaoqiao superstation, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270782 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Zhangye wetland station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270768 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Zhangye wetland station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270783 (2019).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an observation system of meteorological elements gradient of A’rou Superstation, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.248.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an observation system of meteorological elements gradient of A’rou superstation, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.308.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of A’rou Superstation, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.235.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of A’rou Superstation, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.296.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (Dashalong automatic meteorological station, 2013). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.178.2014.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Dashalong station, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.251.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Dashalong station, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.310.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of Hydrometeorological observation network (eddy covariance system of Dashalong station, 2013). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.195.2014.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Dashalong station, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.238.2015.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Dashalong station, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.297.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Yakou station, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.315.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Yakou station, 2015). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.298.2016.db (2016).
Liu, S. et al. HiWATER: Dataset of hydro-meteorological observation network (automatic weather station of Dashalong station, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.456.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Dashalong station, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.447.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Yakou station, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.464.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Yakou station, 2016). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.452.2017.db (2017).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (an observation system of meteorological elements gradient of A’rou Superstation, 2017). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270897 (2018).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of A’rou Superstation, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.5.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of hydro-meteorological observation network (automatic weather station of Dashalong station, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.16.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Dashalong station, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.7.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (automatic weather station of Yakou station, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.14.2018.db (2018).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Yakou station, 2017). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.12.2018.db (2018).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (an observation system of meteorological elements gradient of A’rou Superstation, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270777 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of A’rou superstation, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270775 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (eddy covariance system of Dashalong station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270788 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of the Heihe River Basin integrated observatory network (eddy covariance system of Jingyangling station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270784 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of Heihe integrated observatory network (automatic weather station of Yakou station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270769 (2019).
Liu, S. et al. Qilian Mountains integrated observatory network: Dataset of the Heihe River Basin integrated observatory network (eddy covariance system of Yakou station, 2018). National Tibetan Plateau Data Center https://doi.org/10.11888/Meteoro.tpdc.270781 (2019).
Liu, S., Li, X. & Xu, Z. HiWATER: Dataset of flux observation matrix (automatic meteorological station of No.1) of the MUlti-Scale Observation EXperiment on Evapotranspiration over heterogeneous land surfaces 2012 (MUSOEXE-12). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.059.2013.db (2016).
Liu, S., Li, X. & Xu, Z. HiWATER: The multi-scale observation experiment on evapotranspiration over heterogeneous land surfaces 2012 (MUSOEXE-12)-dataset of flux observation matrix (No.1 eddy covariance system). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.080.2013.db (2016).
Liu, S., Li, X. & Xu, Z. HiWATER: Dataset of flux observation matrix (automatic meteorological station of No.17) of the MUlti-Scale Observation EXperiment on Evapotranspiration over heterogeneous land surfaces 2012 (MUSOEXE-12). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.075.2013.db (2016).
Liu, S., Li, X. & Xu, Z. HiWATER: The multi-scale observation experiment on evapotranspiration over heterogeneous land surfaces (MUSOEXE-12)-dataset of flux observation matrix (No.17 eddy covariance system) from Mar to Sep, 2012. National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.095.2013.db (2016).
Liu, S. et al. HiWATER: Dataset of hydrometeorological observation network (eddy covariance system of Zhangye wetland Station, 2014). National Tibetan Plateau Data Center https://doi.org/10.3972/hiwater.246.2015.db (2016).
Liu, S. & Xu, Z. Multi-scale surface flux and meteorological elements observation dataset in the Hai River Basin (Daxing site-automatic weather station) (2008-2010). National Tibetan Plateau Data Center https://doi.org/10.3972/haihe.004.2013.db (2016).
Liu, S. & Xu, Z. Multi-scale surface flux and meteorological elements observation dataset in the Hai River Basin (Daxing site - eddy covariance system) (2008-2010). National Tibetan Plateau Data Center https://doi.org/10.3972/haihe.005.2013.db (2016).
Che, T. et al. Integrated hydrometeorological, snow and frozen-ground observations in the alpine region of the Heihe River Basin, China. Earth Syst. Sci. Data 11, 1483–1499 (2019).
Jia, Z., Liu, S., Xu, Z., Chen, Y. & Zhu, M. Validation of remotely sensed evapotranspiration over the Hai River Basin, China. J. Geophys. Res.-Atmos. 117, D13113 (2012).
Liu, S. et al. The heihe integrated observatory network: a basin‐scale land surface processes observatory in China. Vadose Zone J. 17, 1–21 (2018).
Liu, S. et al. Upscaling evapotranspiration measurements from multi-site to the satellite pixel scale over heterogeneous land surfaces. Agric. For. Meteorol. 230-231, 97–113 (2016).
Liu, S. et al. A comparison of eddy-covariance and large aperture scintillometer measurements with respect to the energy balance closure problem. Hydrol. Earth Syst. Sci. 15, 1291–1306 (2011).
Liu, S., Xu, Z., Zhu, Z., Jia, Z. & Zhu, M. Measurements of evapotranspiration from eddy-covariance systems and large aperture scintillometers in the Hai River Basin, China. J. Hydrol. 487, 24–38 (2013).
Xu, Z. et al. Intercomparison of surface energy flux measurement systems used during the HiWATER‐MUSOEXE. J. Geophys. Res.-Atmos. 118, 13140–13157 (2013).
Dušek, J., Faußer, A., Stellner, S. & Kazda, M. Stems of Phragmites australis are buffering methane and carbon dioxide emissions. Sci. Total Environ. 882, 163493 (2023).
Foltýnová, L., Fischer, M. & McGloin, R. P. Recommendations for gap-filling eddy covariance latent heat flux measurements using marginal distribution sampling. Theor. Appl. Climatol. 139, 677–688 (2020).
Granier, A., Bréda, N., Longdoz, B., Gross, P. & Ngao, J. Ten years of fluxes and stand growth in a young beech forest at Hesse, North-eastern France. Ann. For. Sci. 65, 704 (2008).
Kivalov, S. N. et al. Addressing effects of environment on eddy-covariance flux estimates at a Temperate Sedge-Grass Marsh. Bound.-Layer Meteor. 186, 217–250 (2023).
Wutzler, T. et al. Basic and extensible post-processing of eddy covariance flux data with REddyProc. Biogeosciences 15, 5015–5030 (2018).
Aurela, M., Laurila, T., Hatakka, J., Tuovinen, J.-P. & Rainne, J. FLUXNET2015 RU-Tks Tiksi. FLUXNET https://doi.org/10.18140/FLX/1440244 (2016).
Bernhofer, C. et al. FLUXNET2015 DE-Spw Spreewald. FLUXNET https://doi.org/10.18140/FLX/1440220 (2016).
Bernhofer, C. et al. FLUXNET2015 DE-Tha Tharandt. FLUXNET https://doi.org/10.18140/FLX/1440152 (2016).
Chen, S. FLUXNET2015 CN-Du2 Duolun_grassland. FLUXNET https://doi.org/10.18140/FLX/1440140 (2016).
Dong, G. FLUXNET2015 CN-Cng Changling. FLUXNET https://doi.org/10.18140/FLX/1440209 (2016).
Ibrom, A. & Pilegaard, K. FLUXNET2015 DK-Sor Soroe. FLUXNET https://doi.org/10.18140/FLX/1440155 (2016).
Klatt, J., Schmid, H., Mauder, M. & Steinbrecher, R. FLUXNET2015 DE-SfN Schechenfilz Nord. FLUXNET https://doi.org/10.18140/FLX/1440219 (2016).
Kosugi, Y. & Takanashi, S. FLUXNET2015 MY-PSO Pasoh Forest Reserve (PSO). FLUXNET https://doi.org/10.18140/FLX/1440240 (2016).
Kotani, A. FLUXNET2015 JP-MBF Moshiri Birch Forest Site. FLUXNET https://doi.org/10.18140/FLX/1440238 (2016).
Kotani, A. FLUXNET2015 JP-SMF Seto Mixed Forest Site. FLUXNET https://doi.org/10.18140/FLX/1440239 (2016).
Li, Y. FLUXNET2015 CN-Ha2 Haibei Shrubland. FLUXNET https://doi.org/10.18140/FLX/1440211 (2016).
Maximov, T. FLUXNET2015 RU-SkP Yakutsk Spasskaya Pad larch. FLUXNET https://doi.org/10.18140/FLX/1440243 (2016).
Olesen, J. FLUXNET2015 DK-Fou Foulum. FLUXNET https://doi.org/10.18140/FLX/1440154 (2016).
Pastorello, G. et al. The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data. Sci. Data 7, 225, https://doi.org/10.1038/s41597-020-0534-3 (2020).
Pilegaard, K. & Ibrom, A. FLUXNET2015 DK-Eng Enghave. FLUXNET https://doi.org/10.18140/FLX/1440153 (2016).
Sachs, T., Wille, C., Larmanou, E. & Franz, D. FLUXNET2015 DE-Zrk Zarnekow. FLUXNET https://doi.org/10.18140/FLX/1440221 (2016).
Schneider, K. & Schmidt, M. FLUXNET2015 DE-Seh Selhausen. FLUXNET https://doi.org/10.18140/FLX/1440217 (2016).
Shao, C. FLUXNET2015 CN-Du3 Duolun Degraded Meadow. FLUXNET https://doi.org/10.18140/FLX/1440210 (2016).
Shao, C. FLUXNET2015 CN-Sw2 Siziwang Grazed (SZWG). FLUXNET https://doi.org/10.18140/FLX/1440212 (2016).
Shi, P., Zhang, X. & He, Y. FLUXNET2015 CN-Dan Dangxiong. FLUXNET https://doi.org/10.18140/FLX/1440138 (2016).
Tang, Y., Kato, T. & Du, M. FLUXNET2015 CN-HaM Haibei Alpine Tibet site. FLUXNET https://doi.org/10.18140/FLX/1440190 (2016).
Wang, H. & Fu, X. FLUXNET2015 CN-Qia Qianyanzhou. FLUXNET https://doi.org/10.18140/FLX/1440141 (2016).
Zhang, J. & Han, S. FLUXNET2015 CN-Cha Changbaishan. FLUXNET https://doi.org/10.18140/FLX/1440137 (2016).
Zhou, G. & Yan, J. FLUXNET2015 CN-Din Dinghushan. FLUXNET https://doi.org/10.18140/FLX/1440139 (2016).
Anapalli, S. S. et al. Quantifying soybean evapotranspiration using an eddy covariance approach. Agric. Water Manage. 209, 228–239 (2018).
Xie, M. et al. Simulation of site-scale water fluxes in desert and natural oasis ecosystems of the arid region in Northwest China. Hydrol. Process. 35, e14444 (2021).
Zhang, C. et al. A framework for estimating actual evapotranspiration at weather stations without flux observations by combining data from MODIS and flux towers through a machine learning approach. J. Hydrol. 603, 127047 (2021).
Liang, S. et al. The global land surface satellite (GLASS) product suite. Bull. Amer. Meteorol. Soc. 102, 1–37 (2020).
Zhang, X., Liang, S., Zhou, G., Wu, H. & Zhao, X. Generating Global Land Surface Satellite incident shortwave radiation and photosynthetically active radiation products from multiple satellite data. Remote Sens. Environ. 152, 318–332 (2014).
Tang, W. Dataset of high-resolution (3 hour, 10 km) global surface solar radiation (1983–2018). National Tibetan Plateau Data Center https://cstr.cn/18406.11.Meteoro.tpdc.270112 (2019).
Tang, W., Yang, K., Qin, J., Li, X. & Niu, X. A 16-year dataset (2000-2015) of high-resolution (3 h, 10 km) global surface solar radiation. Earth Syst. Sci. Data 11, 1905–1915 (2019).
Myneni, R., Knyazikhin, Y. & Park, T. MCD15A3H MODIS/Terra+Aqua Leaf Area Index/FPAR 4-day L4 Global 500m SIN Grid V006. NASA EOSDIS Land Processes DAAC https://doi.org/10.5067/MODIS/MCD15A3H.006 (2015).
Vermote, E. & Wolfe, R. MOD09GA MODIS/Terra Surface Reflectance Daily L2G Global 1kmand 500m SIN Grid V006. NASA EOSDIS Land Processes DAAC https://doi.org/10.5067/MODIS/MOD09GA.006 (2015).
Fang, B., Lei, H., Zhang, Y., Quan, Q. & Yang, D. Spatio-temporal patterns of evapotranspiration based on upscaling eddy covariance measurements in the dryland of the North China Plain. Agric. For. Meteorol. 281, 107844 (2020).
Yamazaki, D. et al. A high-accuracy map of global terrain elevations. Geophys. Res. Lett. 44, 5844–5853 (2017).
FAO/IIASA/ISRIC/ISSCAS/JRC. Harmonized World Soil Database (version 1.2). FAO, Rome, Italy and IIASA, Laxenburg, Austria https://webarchive.iiasa.ac.at/Research/LUC/External-World-soil-database/HTML (2012).
Friedl, M. & Sulla-Menashe, D. MCD12Q1 MODIS/Terra+Aqua Land Cover Type Yearly L3 Global 500m SIN Grid V006. NASA EOSDIS Land Processes DAAC https://doi.org/10.5067/MODIS/MCD12Q1.006 (2019).
Sorensen, L. A spatial analysis approach to the global delineation of dryland areas of relevance to the CBD Programme of Work on Dry and Subhumid Lands. UNEP-WCMC https://www2.unep-wcmc.org/resources-and-data/a-spatial-analysis-approach-to-the-global-delineation-of-dryland-areas-of-relevance-to-the-cbd-programme-of-work-on-dry-and-subhumid-lands (2007).
Xie, M. Division of flux stations. figshare https://doi.org/10.6084/m9.figshare.23695980.v2 (2023).
Lopatin, J., Dolos, K., Hernández, H., Galleguillos, M. & Fassnacht, F. Comparing generalized linear models and random forest to model vascular plant species richness using LiDAR data in a natural forest in central Chile. Remote Sens. Environ. 173, 200–210 (2016).
Biau, G. Analysis of a random forests model. J. Mach. Learn. Res. 13, 1063–1095 (2012).
Bergstra, J., Komer, B., Eliasmith, C., Yamins, D. & Cox, D. D. Hyperopt: a python library for model selection and hyperparameter optimization. Comput. Sci. Discov. 8, 014008 (2015).
Ploton, P. et al. Spatial validation reveals poor predictive performance of large-scale ecological mapping models. Nat. Commun. 11, 1–11 (2020).
Patel, S. P. & Upadhyay, S. H. Euclidean distance based feature ranking and subset selection for bearing fault diagnosis. Expert Syst. Appl. 154, 113400 (2020).
Lever, J., Krzywinski, M. & Altman, N. Points of significance: Principal component analysis. Nat. Methods 14, 641–643 (2017).
Xie, M. Carbon-water flux datasets of Eurasian meteorological stations. figshare https://doi.org/10.6084/m9.figshare.21347721.v3 (2022).
Xie, M. Meteorological station information. figshare https://doi.org/10.6084/m9.figshare.23695920.v2 (2023).
Xie, M. RSMs information. figshare https://doi.org/10.6084/m9.figshare.23899785.v1 (2023).
Jiang, F. et al. A 10-year global monthly averaged terrestrial net ecosystem exchange dataset inferred from the ACOS GOSAT v9 XCO 2 retrievals (GCAS2021). Earth Syst. Sci. Data 14, 3013–3037 (2022).
Running, S., Mu, Q. & Zhao, M. MOD16A2 MODIS/Terra Net Evapotranspiration 8-Day L4 Global 500m SIN Grid V006. NASA EOSDIS Land Processes DAAC https://doi.org/10.5067/MODIS/MOD16A2.006 (2017).
Xie, M. Code for mining carbon-water flux information at Eurasian meteorological stations. figshare https://doi.org/10.6084/m9.figshare.21510183.v2 (2022).
Acknowledgements
This research was supported by the Tianshan Talent Cultivation (Grant No. 2022TSYCLJ0001), the Key Projects of the Natural Science Foundation of Xinjiang Autonomous Region (Grant No. 2022D01D01), the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDA20060302), the High-End Foreign Experts Project and the China Scholarship Council. Mingjuan Xie was supported by a grant from the program of China Scholarship Council (ICPIT–International Cooperative Program for Innovative Talents, Grant No. 202110630005) during her stay in Ghent University, Gent, Belgium. Andrej Varlagin was supported by Russian Science Foundation (project 21-14-00209). Iris Feigenwinter was funded by the EU project SUPER-G (Grant No. 774124) and the SNF project ICOS-CH (Grant No. 20F120_198227). Ankit Shekhar acknowledges funding by the ETH Zürich project FEVER ETH-27 19-1. The University of Padova (AP, FM, LT) carried out the study within the Agritech National Research Center and received funding from the European Union Next-GenerationEU (PIANO NAZIONALE DI RIPRESA E RESILIENZA (PNRR) – MISSIONE 4 COMPONENTE 2, INVESTIMENTO 1.4 – D.D. 1032 17/06/2022, CN00000022). The part of this work was carried out within the framework of the state assignment of the Ministry of Education and Science of Russia under the project “Study of biogeochemical cycles and adaptive reactions of plants of boreal and arctic ecosystems of northeastern Russia”, AAAA-A21-121012190034-2. We thank the many people who provided the metadata for this research. The MCD15A3H data, the MOD09GA data, the MCD12Q1 data and the MOD16A2 data were provided by the NASA EOSDIS Land Processes DAAC. The GOSAT dataset was obtained via the GOSAT Data Archive Service, and the GOSAT project is a joint effort promoted by the JAXA, the NIES and MOE. The DSR dataset during 1983–2018 and some of the flux station data were provided by the National Tibetan Plateau Data Center (http://data.tpdc.ac.cn). Some of the flux station data were shared by the European Fluxes Database Cluster (http://www.europe-fluxdata.eu/home) and the FLUXNET, whose data providers include Adriano Conte, Adrien Jacotot, Aino Korrensalo, Alexander Knohl, Anders Lindroth, Andrea Pitacco, Andreas Ibrom, Andrej Varlagin, Ankit Shekhar, Annalea Lohila, Anne De Ligne, Arnaud Carrara, Aurore Brut, Axel Don, Ayumi Kotani, Bart Kruijt, Benjamin Loubet, Bernard Heinesch, Bogdan Chojnicki, Carlo Calfapietra, Carole Helfter, Caroline Vincke, Casimiro Pio, Changliang Shao, Christian Bernhofer, Christian Brümmer, Christian Markwitz, Christian Wille, Christoph Ammann, Claire Campbell, Cristina Gimeno, Dan Yakir, Daniel Berveiller, Daniela Franz, Dario Papale, Denis Loustau, Donatella Spano, Edoardo Cremonese, Eeva-Stiina Tuittila, Eiko Nemitz, Eric Ceschia, Eric Grehan, Eric Larmanou, Fabio Turco, Fanny Kittler, Fatima LAGGOUN, Filipe Costa e Silva, Francesco Mazzenga, Franco Meggio, Franco Miglietta, Francois Gastal, Franziska Koebsch, Frédéric Bornet, Frédéric Guibal, Frederik Schrader, Gang Dong, Gary J. Lanigan, Georg Niedrist, Georg Wohlfahrt, Gerald Jurasinski, Gerard Kiely, Giorgio Matteucci, Giovanni Manca, Giuseppe Scarascia Mugnozza, Guillaume Simioni, Guoyi Zhou, Hans Peter Schmid, Huimin Wang, Ignacio Goded, Iris Feigenwinter, Ivan Janssens, Thomas Gruenwald, Jan Elbers, Jan Segers, Janina Klatt, Janusz Olejnik, Jean-Christophe Calvet, Jean-Marc Limousin, Jean-Pierre Delorme, Jiri Dusek, Joachim Jansen, Joao Santos Pereira, Joel Leonard, Jørgen E. Olesen, Johan Neirynck, John Moncrieff, Juha Hatakka, Juha-Pekka Tuovinen, Junhua Yan, Junhui Zhang, Jutta Holst, Juuso Rainne, Kari Minkkinen, Karl Schneider, Katerina Havrankova, Katja KLUMPP, Kim Pilegaard, Ladislav Šigut, Leif Klemedtsson, Lenka Foltynova, Louis Gourlez de la Motte, Luca Tezza, Lukas Hörtnagl, Lutz Merbold, Marek Urbaniak, Mana Gharun, Margaret Anderson-Dunn, Marian Pavelka, Marilyn Roland, Marius Schmidt, Mark A. Sutton, Markus Hehn, Marta Galvagno, Mathias Goeckede, Mathilde Jammet, Matthew Saunders, Matthew Wilkinson, Matthias Cuntz, Matthias Mauder, Michal Heliasz, Michel Vennetier, Mika Aurela, Mika Korkiakoski, Mike Jones, Mingyuan Du, Mirco Migliavacca, Monique Carnol, Nadia Vendrame, Natalia Kowalska, Nelius Foley, Nicola Arriga, Nina Buchmann, Olaf Kolle, Olivier Marloie, Paolo Stefani, Pasi Kolari, Patrick Crill (Dept of Geological Sciences, Stockholm University, Sweden), Paul G. Leahy, Pauline Buysse, Pavel Alekseychik, Peili Shi, Per Weslien, Radek Czerny, Radoslaw Juszczak, Rainer Steinbrecher, Regine Maier, Rémy Soubie, Richard Harding, Rober Falcimagne, Robert Clement, Satoru Takanashi, Sebastien Gogo, Shijie Han, Shiping Chen, Silvano Fares, Sofia Cerasoli, Tanguy Manise, Tarek El-Madany, Thomas Friborg, Tim De Meulder, Timo Vesala, Tiphaine Tallec, Tiziano Sorgi, Tomomichi Kato, Torsten Sachs, Trofim Maximov, Tuomas Laurila, Umberto Morra di Cella, Uta Moderow, Valerio Moretti, Vincenzo Magliulo, Wilma Jans, Xianzhou Zhang, Xiaoli Fu, Yanhong Tang, Yi Wang, Yingnian Li, Yongtao He, Yoshiko Kosugi, Zoltan Nagy, and Zsolt Csintalan.
Author information
Authors and Affiliations
Contributions
Geping Luo conceived the work. Mingjuan Xie, Xiaofei Ma and Geping Luo initiated this research and were responsible for the work integrity as a whole. Mingjuan Xie and Xiaofei Ma performed the formal analysis and the calculations and drafted the manuscript. Geping Luo, Mingjuan Xie, Xiaofei Ma, Yuangang Wang, Chaofan Li, Haiyang Shi, Xiuliang Yuan, Chunbo Chen, Wenqiang Zhang, Chen Zhang, Qing Ling, Ruixiang Gao, Yu Zhang and Friday Uchenna Ochege were responsible for the data collection and analysis. Geping Luo, Amaury Frankl, Philippe De Maeyer, and Olaf Hellwich contributed to the resources and manuscript review. Nina Buchmann, Iris Feigenwinter, Jørgen E. Olesen, Radoslaw Juszczak, Adrien Jacotot, Aino Korrensalo, Andrea Pitacco, Andrej Varlagin, Ankit Shekhar, Annalea Lohila, Arnaud Carrara, Aurore Brut, Bart Kruijt, Benjamin Loubet, Bernard Heinesch, Bogdan Chojnicki, Carole Helfter, Caroline Vincke, Changliang Shao, Christian Bernhofer, Christian Brümmer, Christian Wille, Eeva-Stiina Tuittila, Eiko Nemitz, Franco Meggio, Gang Dong, Gary J. Lanigan, Georg Niedrist, Georg Wohlfahrt, Guoyi Zhou, Ignacio Goded, Thomas Gruenwald, Janusz Olejnik, Joachim Jansen, Johan Neirynck, Juha-Pekka Tuovinen, Junhui Zhang, Katja KLUMPP, Kim Pilegaard, Ladislav Šigut, Leif Klemedtsson, Luca Tezza, Lukas Hörtnagl, Marek Urbaniak, Marilyn Roland, Marius Schmidt, Mark A. Sutton, Markus Hehn, Matthew Saunders, Matthias Mauder, Mika Aurela, Mika Korkiakoski, Mingyuan Du, Nadia Vendrame, Natalia Kowalska, Paul G. Leahy, Pavel Alekseychik, Peili Shi, Per Weslien, Shiping Chen, Silvano Fares, Thomas Friborg, Tiphaine Tallec, Tomomichi Kato, Torsten Sachs, Trofim Maximov, Umberto Morra di Cella, Uta Moderow, Yingnian Li, Yongtao He, and Yoshiko Kosugi contributed to providing the flux station data and manuscript proofreading.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Xie, M., Ma, X., Wang, Y. et al. Monitoring of carbon-water fluxes at Eurasian meteorological stations using random forest and remote sensing. Sci Data 10, 587 (2023). https://doi.org/10.1038/s41597-023-02473-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02473-9