Introduction

Freshwater ecosystems are significant sources of methane (CH4), and natural lakes contribute approximately 70% of the freshwater CH4 emission, which is disproportionate to the small area they occupy1. CH4 production in lakes mainly occurs from anoxic sediment2,3,4, and can reach the atmosphere via four emission pathways: ebullitive flux, diffusion, storage flux and emissions from aquatic vegetation. The diffusive flux and, to a lesser extent, ebullition are the most studied of these emission pathways to date1.

Ebullition is considered to be the most important contributor to emissions from surface waters1, especially in shallow lakes (<20 m). Released bubbles of CH4 correspond to a direct vertical flux from the anoxic sediment to the atmosphere, with little physical or chemical loss or any biological interaction within the water column5. A large proportion of CH4 within the water column is oxidized by bacteria6 with oxidation rates varying from between 15 to 90%7,8. CH4 that escapes oxidation and reaches the upper mixed layer of the water column is then emitted by diffusive flux. Storage flux occurs when dissolved CH4, which has accumulated in the water column during periods of stratification, when CH4 builds up in the anoxic layer, or when ice formation prevents gas exchanges with the atmosphere, is suddenly emitted by diffusion. These sudden emissions may be triggered by lake overturn9,10 or by melting of ice11,12. Finally, the importance of CH4 flux through emergent macrophytes is directly related to the presence, abundance and morpho-physiological characteristics of marginal vegetation in the aquatic ecosystem13. It is the only emission pathway that doesn’t originate directly from the so-called “open water” parts of the lakes.

Due to the increasing atmospheric concentration of CH4 and its impact on climate14,15, CH4 emissions from lakes have become more and more frequently measured1,16,17,18 in recent years. The main focus of these studies has been to establish values of emission per unit area and time19,20,21,22, but the estimates generally include only one or two emission pathways, assuming the other fluxes to be negligible. This assumption limits the possibility of accurately scaling up the values to estimate total CH4 emission rates from lakes at a regional or global scale23. Additionally, the drivers that can influence the total flux, or each individual pathway, are not well understood, although some environmental and methodological driving factors have been identified23,24,25. Temperature and precipitation, for example, were suggested as important regulators for CH4 emissions during one modeling exercise24. Lake area, water depth, the concentrations of total phosphorous and dissolved organic carbon have also been proposed as regulators for CH4 emissions at high latitudes23. Finally, the impact of methodological and biophysical drivers has not been assessed at regional or global scales and estimates of CH4 emissions at these scales still have large uncertainties.

Determining the environmental and methodological factors that are most related to the dynamics of each different flux can provide an important basis to predict and to understand the large differences in CH4 emissions that occur between different lakes and between different compartments of the same ecosystem. Thus, this study aims to determine the driving factors related to each CH4 emission pathway from the open waters of lakes at a large scale. Additionally, we aim to quantify the uncertainties associated with the assumption that some emission pathways can be neglected.

Results

Climate and landscape: importance of external drivers

Based on data from 297 lakes (Fig. 1), we observed that CH4 emissions were significantly different across climatic zones (p < 0.001, Fig. 2). The lowest values were found in boreal and north temperate zones whilst higher values were found in the tropics and the south temperate zones (Fig. 2). The statistical difference between the values for different climatic zones was found for all CH4 emission pathways (p < 0.05 for storage fluxes, p < 0.001 for diffusive, total, and ebullitive fluxes, Table S1). Total fluxes of CH4 were found to be significantly and negatively correlated with minimum monthly air temperature (p < 0.01; Table S1) when it was considered as a single factor. Total fluxes of CH4 were also significantly correlated with minimum and maximum monthly air temperature when analyzed in interaction with other variables (Table S1). Diffusive fluxes of CH4 were significantly and positively correlated with minimum and maximum monthly air temperature (p < 0.001) and with all the temperature-related variables when analyzed in interaction with other variables (Table S1). The ebullition and storage fluxes of CH4 were also significantly correlated with minimum monthly air temperature (p < 0.05 and p < 0.001, respectively) but higher minimum monthly air temperature was found to increase ebullition whereas the opposite effect was found for storage flux. Precipitation related variables were also important driving factors, mainly for diffusive and storage fluxes. Storage flux significantly decreased with increasing annual average precipitation (p < 0.001). Finally, the regression tree analysis performed for the total and diffusive flux on the second subgroup (i.e. those for which information on dissolved organic carbon (DOC), phosphorus concentration (TP) and maximum depth was available) demonstrated that precipitation was an important driver of CH4 emissions (Fig. 3).

Figure 1
figure 1

Map showing the location of the lakes. The map was created using R. 3.2.3 (https://www.R-project.org/). The rworldmap packages was used to draw the map (http://cran.r-project.org/web/packages/rworldmap).

Figure 2
figure 2

Boxplot of methane emission estimates for each climatic zone.

Figure 3
figure 3

Regression trees performed on the CH4 emissions estimated for the second subgroup of lakes (see SI): (a) Total flux and (b) Diffusive flux. Each split in the tree represents a yes or no answer to the question shown in the node box. The nodes also list the mean values of CH4 flux and the number of lakes (n). The circles are terminal nodes. Landscape = Is the landscape around the lake composed by: *needle-leaved evergreen tree cover + cultivated and managed areas, broadleaved deciduous tree cover or closed-open evergreen shrub cover; **closed-open herbaceous cover, cultivated and managed or a mosaic; §broadleaved deciduous tree cover or closed-open evergreen shrub cover; §§Sparse herbaceous or sparse shrub, mosaic, closed-open herbaceous cover or snow? Min. monthly precipitation = annual mean of minimum monthly precipitation during the sampling year (mm d−1). DOC = dissolved organic carbon. TP = total phosphorous.

Analysis of the first subgroup of lakes showed that the surrounding landscape significantly affected all of the CH4 emissions pathways except for storage (p < 0.001). Regression tree analysis for the total and diffusive fluxes highlights the importance of the surrounding landscape as a driving factor for these CH4 emission pathways (Fig. 3b). However, this result should be treated with caution. The classification of lakes by landscape type often tends to group together lakes which lie within the same geographical area, and so the landscape effect we observed might just be an artifact created by this grouping.

The importance of lake characteristics

The analysis of the first subgroup which consists of all lakes for which there is information on area, indicated that area, when considered as a single factor, was only an important variable for ebullition and the total flux (p < 0.01 and p < 0.05, respectively Table S1). Nevertheless, significant interactions (p < 0.001) were observed between area and landscape, or between area and average temperature, for diffusive flux. A significant interaction between area and landscape was also observed (p < 0.001) for ebullition.

Through analysis of the second subgroup of lakes, for which DOC, TP and depth are considered as explanatory variables, DOC was highlighted as an important driving factor for total and diffusive CH4 emissions (Table S1). The maximum depth was another variable related to lake characteristics selected by the models for total, and diffusive fluxes of CH4 (Table S1), with fluxes found to be positively correlated with it. Total phosphorous, a proxy for eutrophication, only significantly affected the flux when in combination with other variables and then only for the diffusive flux. The regression tree (Fig. 3b) highlights the importance of total phosphorous as a lake characteristic influencing the values of emission, with higher emissions via diffusion occurring when total phosphorous content was high. DOC was always positively correlated with emissions either as single factor or in interaction, except for case of storage (Fig. 4).

Figure 4
figure 4

Relationships between the CH4 flux and dissolved organic carbon (DOC, mg C L−1) for the second subgroup of lakes (see SI): (a) total flux; (b) diffusive flux; (c) ebullitive flux. The total flux is the sum of the different fluxes measured for each lake.

CH4 emission and its dependency on the methodology

The diffusive flux was the most frequently estimated pathway in the studies used to compile the database (Table S2) and was, therefore, the pathway most used (both individually and as part of another flux component) to estimate the total flux. The ebullitive flux, on the other hand, was the pathway estimated least often. However, for the cases where all three emission pathways were estimated for the same lake, all the pathways made important contributions to the total flux, with 30.9% coming from the diffusive flux, 35.7% from the storage flux and 33.4% from ebullition (Fig. 5a).

Figure 5
figure 5

Boxplot of methane emission estimates presented according to estimation method and emission pathway. (a) Contribution of each flux separately and all possible combinations to open water CH4 emission when all three types of flux were measured (n = 24). Diff. Flux = diffusive flux, Stor. = storage, Ebul. = ebullition. Bars combining the three fluxes represent the real total flux of an ecosystem. (b) Differences between CH4 fluxes (diffusive and ebullitive) measured using different methods. DC = dissolved concentration; SF = submersed funnel and FC = floating chamber. Different letters above the boxes mean there is a significant difference.

The use of different methods to estimate the same CH4 emission pathway appeared to be a source of significant differences in total CH4 flux estimation. Using the methods as explanatory variables in the linear models, we found significant statistical differences (p < 0.001) between the different methods used to estimate ebullitive fluxes. Ebullitive fluxes estimated by the submerged funnel gas method, are 6 times lower on average than those measured with floating chambers (Fig. 5b).

Discussion

Here, we showed that CH4 emissions from lakes are correlated to several environmental drivers such as climate related variables (Fig. 6). We also observed different emission patterns in different climatic zones. For example, we found that higher emission rates were reported in climatic zones with higher mean air temperatures (i.e. the tropics and south temperate zones Fig. 2). Therefore, the observed latitudinal pattern of emission is probably related to temperature26. This temperature effect is probably related to the role of temperature as a methanogenesis driving factor27,28. Additionally, we found that precipitation significantly affected the diffusive and storage emissions of CH4. This effect is probably due to the impact of precipitation on lake mixing. These important climate dependencies were not found when different aquatic ecosystems, including lakes, estuaries, oceans, rivers and wetlands were all analyzed together29. The absence of any climate dependency in that study is probably explained by the large variations in the CH4 emission data, which are reduced when the analysis is restricted to natural lakes only. CH4 emissions from natural wetlands have been linked to variations in environmental factors such as temperature and precipitation24. Furthermore, higher precipitation generally induces higher runoff and drainage, which impact the local dissolved organic carbon concentration and, ultimately, the lateral fluxes30. We may therefore expect higher external inputs, mainly of organic substance, in areas with high precipitation.

Figure 6
figure 6

The different emission pathways and the main drivers. Green driver names indicate a positive correlation whereas red indicates a negative correlation. The correlation refers to the analysis done with the second subset of lake data. *Not analyzed in this study. **Significantly affected by the methods used to measure the flux.

Total phosphorous concentration seems to be a factor leading to differences in CH4 emissions (Table S1), but its importance is mainly restricted to total and diffusive emissions (Fig. 3). Lakes with high total phosphorous concentration appear to be major sources of CH4 emissions. According to the regression tree analysis, the diffusion flux in lakes with total phosphorous concentration higher than 0.9 mol L−1 was 5 times higher than in lakes with total phosphorous concentration less than 0.9 mol L−1. This relationship may be significant as currently the major impacts of human activities affecting lakes are increases in nutrients and organic matter concentration18,31,32. If these factors impact lake CH4 dynamics, they can be important drivers of global change in aquatic ecosystems33.

Eutrophication, caused by anthropogenic sources is caused by a large input of limiting nutrients and a consequent increase in primary production. The subsequent sinking and decay of algal blooms can have direct effects on methanogenesis - providing a greater supply of substrate - as well as indirect effects, due to a reduction in dissolved oxygen concentration, especially in the hypolimnion34. Thus the trophic state impacts CH4 concentrations in water and sediment16,34,35,36,37 as well as the CH4 emission to the atmosphere16,18. However, it is important to note that lakes grouped by the first split in Fig. 3 are mainly from the same region. Consequently, the high levels of CH4 emission observed in this group might be due to other, separate, environmental characteristics that are specifically related to this region.

The fact that area was a significant explanatory variable, mainly when interactions with other variables such as landscape or average precipitation are considered, suggests that the effects of some factors driving CH4 emission might vary according to lake area. In particular, the significant interactions between landscape and area suggest that larger lakes are less sensitive to the effect of landscape than smaller lakes. Furthermore, large ebullitive emissions are reported in the littoral zone, which represents the shallower part of lakes23,38,39,40.

In our database, DOC was an important driver of the diffusive emissions, with the relationship between DOC and CH4 emission always being positive (Fig. 4, Table S1). This relationship is consistent with the available literature, showing that DOC is an indicator of the substrate availability for methanogenesis41,42,43. DOC may come from autochthonous sources (algae exudation, microbial decomposition products) or from allochthonous sources (terrestrial organic matter). Both sources are substrate for methanogenesis33,44. Anthropogenic activities may also increase DOC leaching from soil to aquatic ecosystems32,45,46, which can represent an additional source of energy for microbial populations47,48. As terrestrial DOC supply can increase methanogenesis rates33, it could also influence CH4 emission rates.

The uncertainties on the different emission pathways are considerable and we have identified some aspects of the methodology which can help to explain such large variability. The large imbalance in CH4 flux measurements between diffusion and the other mechanisms might be due to the development of easy-to-use techniques for measuring the diffusion flux. These techniques make use of mathematical equations involving the gas concentration in the water column and in the air, and other environmental variables such as water temperature and wind speed to estimate the flux49. Since it was first proposed, this simple, indirect measurement method has been widely applied21,23,28. Currently, we also have the possibility of making direct field measurements of CH4 diffusive emissions. This is mainly done with the use of floating chambers, which is inexpensive and simple50, and has become the most common technique for measuring gas fluxes in lakes1,10,19. However, the exact details of the implementation of the method can vary extensively.

The observation that different estimates of CH4 emissions are obtained when different measurement methods are applied is in agreement with a previous finding at Lake Rostee in Switzerland25 and at a lake in Finland51, where lower estimates for diffusive flux were obtained using the indirect approach (i.e. CH4 dissolved concentration in water column). It should be noted that, in our case, when the two lake groups are divided according to the estimation method used, there is also a distinction between the two groups in terms of depth. Lakes for which the CH4 emission flux was measured directly were, on average, shallower. Results which appeared to be related to the sampling method used might, instead, have been related to the lake depth. Indeed, shallow lakes do generally present higher CH4 emission rates23. We also observed a significant effect of the method used to estimate ebullition flux on the values obtained. In this case the effect might be due to the fact that the floating chambers used to estimate ebullition flux also captured the diffusion flux.

We showed that, when the three emission pathways were estimated for the same lake, storage and ebullition, which were the least often measured pathways of CH4 emission, were the main contributor to the total flux. The diffusive flux was the least important contributor to the total flux, although it was the CH4 emission pathway estimated most often. The assumption that measuring only the diffusion flux is a good surrogate for the total flux23 may lead to significant underestimates of the CH4 flux to the atmosphere. For instance, in lakes where all three pathways were estimated, assuming that the pathways other that diffusion are negligible would lead to an underestimation of the total flux of up to 277%.

The lack of ebullition and storage flux measurements may be closely linked to both emission characteristics and technical difficulties. Emissions by bubbles and storage flux have high temporal variability and, in contrast to the diffusive flux, cannot be measured continuously. According to our database, the ebullitive flux was the least often measured component, even though it was the main contributor towards the total flux. It should be noted that storage flux only occurs in a limited number of lakes as it requires water column stratification or surface layer freezing. Ebullition is a much more commonly occurring process, since most of the lakes of the world are shallow52.

Diffusion, the most commonly estimated pathway of CH4 emission (265 in a total of 297 estimates), makes a limited contribution to the total flux, highlighting the importance of measuring all the other emission pathways despite the technical challenges involved. Additionally, systematic measurements of environmental variables and lake characteristics, such as DOC, TP, etc., that are closely related to the CH4 emissions of lakes, would make it easier to investigate the mechanisms that influence CH4 emission from lakes at regional or global scales. Finally, if we are to be able to explain the substantial differences found in the emission fluxes between different lakes and accurately estimate the CH4 fluxes from lake surfaces to the atmosphere, the methods used to measure CH4 fluxes require thorough evaluation.

Methods

Database

A database was created by compiling estimates of the CH4 emission from lakes of different climatic zones around the world using “CH4” or “methane” and “lake” or “wetland” as keywords in the web of knowledge in January and February 2018. Unpublished data of estimates of diffusive flux from 5 tropical coastal lagoons available in the thesis document of Marinho 2013 from the Institute of Biophysics Carlos Chagas Filho at the Federal University of Rio de Janeiro (Table S1) were also included in the database. We gathered data from the literature for each ecosystem, using studies which reported the contribution of at least one of the CH4 fluxes from open waters (i.e. diffusive flux, ebullition and storage). We chose studies for which data from individual lakes could be extracted and we therefore excluded studies which present only average values for groups of lakes. In total, there were 55 studies with data from 297 lakes (Fig. 1), published between 1977 and 2017, which clearly separate the different flux components (Table S2). To extract data from published figures, we used the free software Engauge Digitizer 4.1. CH4 emission estimates are generally reported in studies as daily or annual rates, and less often as hourly fluxes. When the CH4 flux was measured as a daily rate we extrapolated the data to estimate an annual rate. Extrapolation of daily CH4 emissions into an annual rate may create large uncertainties if the sampling period is not representative. Nevertheless, this method has been successfully used in previous studies1.

Here, we use the term “total flux” to represent the sum of the measured fluxes for each lake. This number can be an underestimate of the actual total flux since most of the studies consider only one or two of the emission pathways. Of the 297 lakes included in the database, all three flux components were measured for only 30 lakes.

Ancillary data

To better understand the factors controlling each of the different CH4 emission pathways, several variables were collected. The geographic coordinates, area, maximum depth, DOC and TP were obtained from each study or from related studies on the same lake. Information on the landscape surrounding each lake, including the type of vegetation cover and/or land use, was obtained either directly from the relevant study, or from the global land cover 2000 database53. The distinction between managed and unmanaged areas was based on the global land cover 2000 database classification by considering that “artificial surfaces” and “cultivated and managed areas” where the two managed classes. Climate-related information for each lake (minimum, maximum and average monthly precipitation and air temperature) was obtained by using the relevant datapoints from the global climate data set CRUNCEP v5 (Viovy et al., pers. com.) at a resolution of 0.5° × 0.5°. This data set combines the CRU-TS254 monthly climatology, covering the period 1901–2013, with the NCEP reanalysis, starting from 1948. The climatic zones were defined based on the latitude of each lake; boreal if higher than 60°N; north temperate between 30°N and 60°N; tropical between 30°S and 30°N; and south temperate between 60°S and 30°S.

Four categorical variables related to the methods used to estimate the CH4 fluxes were added, as follows. The variable “estimation component” describes the emission pathways that were used to estimate the total CH4 flux. The variable “estimation method”, describes the method used to estimate CH4 emissions: floating chamber and dissolved CH4 concentration for diffusive and storage fluxes; floating chambers and bubble traps for ebullitive emission. Thus, the total emission estimated by a given study might involve a different combination of methods than the total emission estimated another study. The variable “starting year” indicates when the lakes were sampled, and the variable “time in months” provides the length of the sampling events. These temporal aspects are important because CH4 emissions are temporally dynamic with occasional episodes of high emissions which might not be representative of the annual rates18.

Statistical methods

The relationships between CH4 emission and lake characteristics, environmental variables and methodological approaches were investigated using two statistical methods: Stepwise linear regression, and regression tree analysis using the software R 3.0.2. The regression trees provide a complementary, non-parametric approach55 and are used here to rank the importance of the lake characteristics on CH4 emission. To reduce the heterogeneity of the variances and meet the assumptions of multiple linear regression, the data were log-transformed prior to performing the stepwise regressions.

Both statistical methods were applied first on 293 lakes for total flux (four were excluded from the complete database because of missing information), diffusion was measured over 261 lakes, ebullition over 93 lakes and storage flux over 87. In this first step, to ensure we were analyzing the most complete database, we did not use explanatory variables related to lake characteristics and landscape, but only the following explanatory variables: estimation component; method of estimation; time in months; starting year; climatic zone; and climatic information data. For subsequent analyses, to help understand the factors controlling CH4 emissions, the database was split into two subgroups. The first subgroup consisted of all lakes for which there was information on area (205, 176, 55 and 93 lakes for total, diffusive, ebullitive and storage fluxes, respectively). The second subgroup included all lakes for which there was information on DOC, TP and maximum depth (95, 89, 49 and 27 lakes for total, diffusive, ebullitive and storage fluxes, respectively). For the second subgroup, ebullition flux was excluded from the analysis, as there were not enough DOC concentration data available. The database and the scripts are available upon request to the corresponding author.