Improved global 250 m 8-day NDVI and EVI products from 2000–2021 using the LSTM model

Xiong, Changhao; Ma, Han; Liang, Shunlin; He, Tao; Zhang, Yufang; Zhang, Guodong; Xu, Jianglei

doi:10.1038/s41597-023-02695-x

Data Descriptor
Open access
Published: 14 November 2023

Scientific Data volume 10, Article number: 800 (2023)

Abstract

Satellite vegetation index (VI) products, such as the normalized difference vegetation index (NDVI) and enhanced vegetation index (EVI), have been widely used. However, they are severely contaminated by clouds and other factors and can provide false signals of the surface vegetation conditions. In this study, new global seamless 250 m, eight-day NDVI and EVI products for 2000–2021 were developed from Moderate Resolution Imaging Spectroradiometer (MODIS) surface reflectance data using a long short-term memory (LSTM) neural network method. High-quality, globally representative time series VI samples were constructed to train the model using a combination of the Savitzky-Golay (SG) filter, Global LAnd Surface Satellite (GLASS) leaf area index (LAI) fitting, and upper envelope methods. To evaluate the proposed method and the 250 m VI products, the MODIS VI product (MOD13Q1) was used as the reference for inter-comparisons with four widely used VI reconstruction methods. Assuming that the high-quality MODIS VI data represent the true values, the root mean square errors (RMSEs) of the NDVI and EVI generated by the LSTM model are 0.0734 and 0.0509, respectively.

Background & Summary

Satellite products of vegetation indices (VIs) have been widely used for various purposes, including vegetation change monitoring^1,2, vegetation phenology extraction^3,4, terrestrial carbon circulation modelling^5,6, dynamic environmental simulations^7,8, and land coverage and change detection^9,10.

Among them, the normalized difference vegetation index (NDVI) calculated from the near infrared band (NIR) and visible red band (RED) obtained by optical satellites is one of the most popular indices¹¹.

Similar to the NDVI, the enhanced vegetation index (EVI) minimizes the canopy background variations and maintains its sensitivity under dense vegetation conditions. The EVI also uses the blue band (BLUE) to remove residual atmospheric contamination caused by smoke and thin sub-pixel clouds¹².

However, for satellite-derived VI data, it is almost impossible to avoid adverse observation conditions such as clouds and sensor failure^12,13,14,15, which greatly hinder the application of VI products. It is important to reconstruct the contaminated and missing data and obtain spatiotemporally continuous VI products^16,17.

The VI reconstruction methods can be divided into two categories: spatial-based and temporal-based methods. Spatial-based methods are most commonly used for the reconstruction of remote sensing images. Due to the high correlation between adjacent pixels, spatial interpolation from neighbouring valid pixels is effective under certain circumstances. Representative methods include the linear interpolation method¹⁸ and kriging interpolation¹⁹. However, when a spatial-based reconstruction method is applied to remote sensing images with strong contamination or heterogeneity, its effectiveness is greatly reduced²⁰. Therefore, many reconstruction methods based on VI time series have been proposed. In the past few decades, dozens of time series VI smoothing methods²¹ have been developed. These methods can be divided into three categories according to their principles: (1) methods based on temporal information; (2) methods based on frequency information; and (3) hybrid methods. In the first category, some methods use sliding windows to filter information, such as the best index slope extraction (BISE) algorithm²² and the Savitzky-Golay (SG) filter²³. Others use functions to fit local time series, such as the asymmetric Gaussian (AG) function²⁴, double logistic (DL) function²⁵, or cubic spline polynomial²⁶. Among the frequency information-based methods, the representative methods include the harmonic analysis of time series^27,28 and the wavelet method²⁹. In addition, there are many methods that use other principles³⁰, such as the temporospatial filter (TSF)³¹ and the search and fill algorithm with moving offset method (SFA-MOM)³².

However, for traditional time-series-based VI reconstruction methods, whether the method is based on temporal information or frequency information, the reconstructed VIs are significantly affected by the parameter settings. For specific situations in different regions, it is difficult to obtain consistently excellent results using traditional time-series-based VI reconstruction methods^30,33,34. VI reconstruction methods that rely on both temporal and spatial information involve intricate algorithm designs and require substantial computational resources. When applied to extensive regions, these methods tend to exhibit low computational efficiency²⁰. Limited by the above conditions, the existing methods cannot efficiently generate spatiotemporally continuous global VI products.

In recent years, with the increase in satellite data, deep learning neural networks have been widely used in remote sensing data processing, such as the classification of land types^35,36, land surface biomass inversion³⁷, and land surface temperature (LST) products^38,39. The superb learning performance and computational efficiency of neural networks make it possible to efficiently generate global products. The long short-term memory (LSTM) network can effectively process both long-term and short-term temporal information and output the results⁴⁰. Since its emergence, the LSTM has been used to predict and retrieve temporal information in many fields, such as computer vision, semantic recognition, acoustics, and remote sensing^41,42,43,44. Recently, the version 6 leaf area index (LAI) product⁴⁵ in the Global LAnd Surface Satellite (GLASS) products suite⁴⁶ was produced using the LSTM, and high-quality results have been obtained⁴⁵. The LSTM model has been proven to be effective in estimating continuous LAI data even when high-quality surface reflectance input is absent for a long period.

LAI and VI both function as indicators of plant growth, but they have distinct definitions and capture different aspects of vegetation dynamics. LAI represents the number of leaf layers per unit area, while VI, calculated directly from satellite reflectance, mainly reflects greenness and photosynthetically active biomass. These differences make each index suitable for specific applications and analyses. Owing to its direct relationship with vegetation structure, LAI is widely used in various climate and land surface models. On the other hand, NDVI is more user-friendly and can provide vegetation information for a wide range of applications, particularly among non-specialist users.

Therefore, this study aimed to develop an LSTM-based global VI reconstruction method similar to that of the GLASS V6 LAI. The core idea of this method is to create a globally representative sample based on LAI products and existing reconstruction methods. The high-quality VI time series of the sample pixels are reconstructed using a combination of LAI fitting, the upper envelope method, and the SG filter, and the results are used to train the LSTM model. The trained LSTM model produces spatiotemporally continuous global VI products.

Methods

In this study, a deep learning method was used to reconstruct the 250 m VI time series. The overall workflow of this research is shown in Fig. 1. First, the existing VI reconstruction methods and LAI products were used to reconstruct high-quality VI time series of the sample pixels. This step was the prerequisite for the model training. Second, the reconstructed samples were used to train the model. The final step was product production and evaluation.

In this study, the NDVI and EVI were calculated as follows:

$$NDVI=\left(NIR-RED\right)/\left(NIR+RED\right)$$

(1)

$$EVI=2.5\ast \left(NIR-RED\right)/\left(NIR+6\ast RED-7.5\ast BLUE+1\right)$$

(2)
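As a minimal sketch of Eqs. (1) and (2), the two indices can be computed with NumPy as shown below. The function names and the sample reflectances are hypothetical, and the inputs are assumed to be surface reflectances already scaled to the 0–1 range (the additive constant 1 in Eq. (2) only makes sense in those units).

```python
import numpy as np

def ndvi(nir, red):
    """Eq. (1): normalized difference vegetation index."""
    nir, red = np.asarray(nir, float), np.asarray(red, float)
    return (nir - red) / (nir + red)

def evi(nir, red, blue):
    """Eq. (2): enhanced vegetation index with the coefficients 2.5, 6, 7.5 and 1."""
    nir, red, blue = (np.asarray(b, float) for b in (nir, red, blue))
    return 2.5 * (nir - red) / (nir + 6.0 * red - 7.5 * blue + 1.0)

# Hypothetical reflectances for a single vegetated pixel.
print(ndvi(0.40, 0.05), evi(0.40, 0.05, 0.03))
```

Both functions also accept arrays, so a whole tile or time series can be processed in one call.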

Four satellite data products were used. The GLASS LAI and MODIS surface reflectance products were used to create high-quality VI time series for sample pixels. The MODIS surface reflectance products were also used for training the LSTM model and producing the global NDVI and EVI products. The MODIS VI products were used for the inter-comparisons.

Surface reflectance data

To obtain VI time series with a higher temporal resolution, the MOD09Q1 and MOD09A1 version 6 products from 2014 to 2015, obtained from the MODIS sensor on board the Terra satellite, were used^47,48. The MOD09Q1 product provides the surface spectral reflectance in bands 1 and 2 at a 250 m resolution and is corrected for atmospheric conditions such as gases, aerosols, and Rayleigh scattering. Along with the two surface reflectance bands, a quality layer is also included⁴⁹. For each pixel, a value is selected from all of the acquisitions within the 8-day composite on the basis of a high observation coverage, a low view angle, the absence of clouds or cloud shadows, and aerosol loading. In this study, the red band (250 m surface reflectance band 1 (620–670 nm)) and near infrared (NIR) band (250 m surface reflectance band 2 (841–876 nm)) from the product were used to calculate the NDVI.

When calculating the EVI, the blue band (500 m surface reflectance band 3 (459–479 nm)) provided by MOD09A1 was used. This band is also used to calculate the EVI in the MODIS VI product (MOD13Q1). In addition, because MOD13Q1 has been widely used, the same calculation formulas and parameter settings were adopted. MOD13Q1 was also used as the reference for comparison in the evaluation.

After the VIs were calculated, basic processing of the results was conducted. When the value of the surface reflectance was negative and the solar zenith angle was greater than 85°, the calculated VI values were invalid, and the value was set to −0.2⁴⁸. The red band, near infrared band, and blue band were also used as input data for the LSTM model training and calculation.

LAI data

The GLASS V6 LAI data were used to fit the original VI time series, which is one of the methods used to generate high-quality VI time series of the sample pixels⁴⁵. GLASS V6 LAI is a globally seamless spatiotemporal LAI product, derived from three distinct LAI datasets using a deep learning model⁴⁵. This approach allows it to assimilate data from multiple sensors and effectively correct for weather-induced degradation. Compared with the previous iteration, GLASS V6 LAI delivers greater reliability and consistency in depicting vegetation dynamics, even in the presence of cloud cover or adverse meteorological conditions. In this study, we leveraged these attributes of GLASS LAI during the construction of the vegetation index samples. Our objective in incorporating GLASS LAI data was to mitigate the prolonged impact of cloud cover and other weather-related factors that could compromise the quality of the NDVI data. The GLASS V6 LAI dataset was downloaded from http://www.glass.umd.edu/LAI/MODIS/250m/ (last access: 3 November 2022; Ma and Liang, 2022d).

VIs data

The MOD13Q1 product provides VIs data with a spatial resolution of 250 m and a temporal resolution of 16 days. It contains two vegetation index layers⁵⁰. The first is the NDVI, and the second is the EVI, which minimizes the canopy background variations and maintains its sensitivity under dense vegetation conditions. This product provides a detailed quality assurance (QA) layer and a summary QA layer to measure the availability and accuracy of the vegetation index layers. The QA indicates if a pixel is affected by cloud contamination or covered by snow/ice. In the evaluation, the VIs and related QA information provided by MOD13Q1 were used to compare the results of the LSTM method with those of other methods.

Global representative samples

Training samples are very important in deep learning research, and they directly determine the scope of the application of the model and its universality⁵¹. In previous studies, 52997 sample pixels distributed around the world were selected⁴⁵. They were selected based on global time series LAI clustering analysis and the least difference criterion, as well as by assuming that the LAI values of three widely used products with the lowest mean square errors (MSEs) represented the true values of specific pixels. These sample pixels can represent different vegetation types, different observation qualities, and different geographical locations around the world at the same time, which meets the needs of reconstructing VI time series.

The LSTM’s superb learning performance allowed us to efficiently obtain results with a quality similar to that of the high-quality VI time series of the sample pixels⁵². Thus, other auxiliary data can be introduced when building the samples to ensure their quality. During production, the LSTM does not need the auxiliary data, which reduces the demand for data and computing resources. For the VI time series, the contamination caused by clouds and snow is the main cause of the low-quality data^12,13,15. Although there were low-quality data with high values, this phenomenon was relatively scarce in the NDVI time series. For the EVI time series, a simple threshold method was used to remove these data. In this study, it was assumed that the data with higher values in the NDVI time series and the pre-processed EVI time series were of high quality, so the high-quality VI time series were similar to the upper envelope of the original VI time series. In addition, the VIs are indices that represent the vegetation status, and many studies have indicated that VIs are significantly and positively related to the LAI^53,54,55. Therefore, it was assumed that the seasonal patterns of increase and decrease of the high-quality VI values follow a pattern similar to that of the LAI values.

In order to make the results consistent with the above assumptions, four steps were used to create high-quality VI time series of the sample pixels.

The first step was to conduct the SG filtering process to roughly remove the contaminated values. SG filtering can be interpreted as a weighted moving average, with the weights given by a polynomial fit within the window. It was designed to fit the upper envelope and to describe the changing patterns of the parameter through an iterative process. According to the user guide provided by TIMESAT²⁴, the half-width of the smoothing window of the SG filter was set to 12 (number of values per year/4), and the number of iterations was set to 2. Specifically, when a value in the original time series differed from the SG filtering result, in either direction, by more than the threshold, the value was labelled as invalid. In this study, the threshold was set to 0.2 for the EVI time series and 0.4 for the NDVI time series, and the invalid value was set to −0.2.
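The flagging rule of this first step can be sketched as follows. This is an illustration under stated assumptions, not the TIMESAT implementation: the SG weights are derived from a plain least-squares polynomial fit, the polynomial order (2), the edge padding, and the way the two iterations are applied are all choices made here, and the function names are hypothetical.

```python
import numpy as np

def savgol_smooth(y, half_width=12, polyorder=2):
    """Savitzky-Golay smoothing using least-squares polynomial weights."""
    x = np.arange(-half_width, half_width + 1)
    A = np.vander(x, polyorder + 1, increasing=True)
    weights = np.linalg.pinv(A)[0]      # weights that estimate the window centre
    padded = np.pad(np.asarray(y, float), half_width, mode="edge")
    return np.convolve(padded, weights[::-1], mode="valid")

def flag_outliers(vi, threshold, fill=-0.2, n_iter=2):
    """Set values deviating from the SG curve by more than `threshold` to `fill`."""
    vi = np.asarray(vi, float).copy()
    idx = np.arange(vi.size)
    valid = np.ones(vi.size, dtype=bool)
    for _ in range(n_iter):
        # Bridge already-flagged points so they do not distort the next pass.
        bridged = np.interp(idx, idx[valid], vi[valid])
        smooth = savgol_smooth(bridged)
        valid &= np.abs(vi - smooth) <= threshold
    vi[~valid] = fill
    return vi

# Synthetic two-year NDVI series (46 values per year) with one cloud-like drop.
t = np.arange(92)
series = 0.5 + 0.2 * np.sin(2 * np.pi * t / 46)
series[11] = 0.0
cleaned = flag_outliers(series, threshold=0.4)   # 0.4 for NDVI, 0.2 for EVI
```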

The second step was the NDVI-LAI fitting. The GLASS LAI data were introduced to fit the VI values that had been processed using the SG filter. For each VI time series, an exponential function was fitted using the least squares method to establish the relationship between the LAI value and the high-quality VI value, and a new VI time series was then predicted from the LAI time series. The function can be expressed as

$$VI=c-{e}^{(a\ast LAI+b)}$$

(3)

Equation (3) was used because the VIs were saturated when the LAI value was high. This saturation phenomenon led not only to nonlinearity of the regression, but also to inconsistencies between the trend patterns of the VIs and LAI at high values. In the third step, the upper envelope method was used to effectively solve this problem.
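One way to fit Eq. (3) is sketched below. The source states only that the least squares method was used, so the specific strategy here is an assumption: the offset c is scanned over a grid, and for each candidate c the parameters a and b follow from a linear fit, because log(c − VI) = a·LAI + b. The function names, grid bounds, and synthetic data are hypothetical.

```python
import numpy as np

def fit_vi_lai(lai, vi, n_grid=500):
    """Fit VI = c - exp(a * LAI + b) by scanning c and solving (a, b) linearly."""
    lai, vi = np.asarray(lai, float), np.asarray(vi, float)
    best = None
    for c in np.linspace(vi.max() + 1e-3, vi.max() + 0.5, n_grid):
        # For a fixed c, log(c - VI) is a straight line in LAI.
        a, b = np.polyfit(lai, np.log(c - vi), 1)
        sse = np.sum((c - np.exp(a * lai + b) - vi) ** 2)
        if best is None or sse < best[0]:
            best = (sse, a, b, c)
    return best[1:]

def predict_vi(lai, a, b, c):
    """Evaluate Eq. (3) for a LAI series."""
    return c - np.exp(a * np.asarray(lai, float) + b)

# Synthetic, noise-free example with a saturating VI-LAI relationship.
lai = np.linspace(0.2, 4.0, 40)
vi = 0.9 - np.exp(-0.8 * lai - 0.5)
a, b, c = fit_vi_lai(lai, vi)
vi_fitting = predict_vi(lai, a, b, c)
```

A general nonlinear least-squares solver would serve equally well; the grid scan is used here only to keep the sketch dependency-free.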

As an auxiliary method, the principle of the upper envelope method is very simple. It can roughly be summarized as follows. If a value in the time series is higher than the previous and next value, this value is determined to be an envelope value. Then, the value that is not selected will be linearly interpolated according to the previous and next envelope value. A new time series is synthesized by replacing the value in the results of the NDVI-LAI fitting method with the corresponding higher value in the results of the upper envelope method. This can be expressed as follows:

$$NDV{I}_{syn}=max\left(NDV{I}_{fitting},serie{s}_{env}\right)$$

(4)

$NDV{I}_{fitting}$ is the NDVI time series predicted from the LAI time series, and $serie{s}_{env}$ is the upper envelope time series obtained in the third step. Finally, the SG filtering method is used to process the synthetic time series to obtain the final high-quality VI time series of the sample pixels.
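The envelope construction and Eq. (4) can be sketched as follows. The function names are hypothetical, and keeping the two endpoints as envelope values is an assumption made here so that the interpolation is defined at the ends of the series.

```python
import numpy as np

def upper_envelope(series):
    """Linearly interpolate between the local maxima of a time series."""
    y = np.asarray(series, float)
    idx = np.arange(y.size)
    is_peak = np.zeros(y.size, dtype=bool)
    # An envelope value is higher than both its previous and next value.
    is_peak[1:-1] = (y[1:-1] > y[:-2]) & (y[1:-1] > y[2:])
    is_peak[[0, -1]] = True   # assumption: endpoints are kept as anchors
    return np.interp(idx, idx[is_peak], y[is_peak])

def synthesize(vi_fitting, vi_series):
    """Eq. (4): element-wise maximum of the LAI-fitted series and the envelope."""
    return np.maximum(np.asarray(vi_fitting, float), upper_envelope(vi_series))

env = upper_envelope([0.2, 0.6, 0.1, 0.7, 0.3])
syn = synthesize([0.5] * 5, [0.2, 0.6, 0.1, 0.7, 0.3])
```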

Most of the samples in this study were processed using the above steps, but there were two exceptions.

First, for the time series with a minimum LAI of greater than 5 in two years, due to the severe saturation phenomenon, the LAI fitting fails, and there are even negative correlations between the VIs and the LAI. In order to deal with this phenomenon, a in Eq. (3) is limited to being greater than zero, and the data in the time series with higher values than the result of the first fitting are used for the second fitting. The remaining steps are consistent with the above descriptions.

Second, for pixels located at high latitudes that are covered by snow and ice for a long time in winter, their time series data are also inconsistent with the trend of the LAI because of the extremely low NDVI values caused by the snow and ice. These extremely low NDVI values are consistent with the actual situation of the surface, so even if they have different trends from the LAI, they should be retained. Equation (5) was used to deal with the above problem.

$$NDVI=\left(NDV{I}_{ori}+0.2\right)/0.4\ast NDV{I}_{syn}+\left(1-\left(NDV{I}_{ori}+0.2\right)/0.4\right)\ast NDV{I}_{ori}$$

(5)

SG filtering is applied to the original NDVI time series. For values of less than 0.2 in the filtering result, Eq. (5) is used for the processing. $NDV{I}_{ori}$ is the corresponding original NDVI, and $NDV{I}_{syn}$ is the corresponding synthetic NDVI mentioned above. Note that the coefficients in Eq. (5) for processing sample pixels in ice and snow regions were determined through empirical analysis and experimentation. This step enables the reconstructed NDVI to retain the low-value characteristics caused by long-term snow and ice cover.
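A minimal sketch of Eq. (5) follows. The weight (NDVI_ori + 0.2)/0.4 runs from 0 at the fill value −0.2 to 1 at 0.2, so very low original values are kept and values near 0.2 move towards the synthetic series. The function name and sample values are hypothetical, and no clipping of the weight is applied because the source does not describe any.

```python
import numpy as np

def blend_snow(ndvi_ori, ndvi_syn, ndvi_ori_sg, low=0.2):
    """Apply Eq. (5) where the SG-filtered original NDVI is below `low`."""
    ndvi_ori = np.asarray(ndvi_ori, float)
    ndvi_syn = np.asarray(ndvi_syn, float)
    w = (ndvi_ori + 0.2) / 0.4                     # weight of the synthetic series
    blended = w * ndvi_syn + (1.0 - w) * ndvi_ori  # Eq. (5)
    return np.where(np.asarray(ndvi_ori_sg, float) < low, blended, ndvi_syn)

# Hypothetical values: deep snow, partial snow, and a snow-free date.
out = blend_snow(ndvi_ori=[-0.2, 0.0, 0.6],
                 ndvi_syn=[0.3, 0.3, 0.7],
                 ndvi_ori_sg=[-0.1, 0.1, 0.6])
```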

LSTM deep learning model training

The LSTM network is an improved structure of a recurrent neural network (RNN), which is a feed forward network with a feedback loop and internal memory⁵⁶. When low-quality data are used in the training process, the RNN can use its own structure to deal with these shortcomings⁵⁷. As an improved structure of the RNN, the LSTM network performs better regarding the problems of exploding and vanishing gradients⁵⁸. More importantly, the LSTM has a long-term memory compared to the RNN, with an input gate, output gate, and forget gate in each layer of the network. More details are provided in a previous study⁴⁵.

Datasets for model training

For training the LSTM model, the surface reflectance data for 2014 and 2015 from MOD09A1 and MOD09Q1 were collected. The high-quality VI time series created were considered to be the target of the model training. While training, the sample pixels were randomly divided into three parts. Eighty percent of the sample pixels were used to train the model, 10% were used to optimize the model, and 10% were used to verify and evaluate the quality of the model and results.
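The 80/10/10 split can be sketched as below. The seed, the function name, and the rounding of the subset sizes are assumptions; the pixel count is the number of global sample pixels reported above.

```python
import numpy as np

def split_pixels(n_pixels, seed=0):
    """Randomly split pixel indices into training, optimization and evaluation sets."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_pixels)
    n_train = int(0.8 * n_pixels)
    n_opt = int(0.1 * n_pixels)
    return (order[:n_train],
            order[n_train:n_train + n_opt],
            order[n_train + n_opt:])

train_idx, opt_idx, eval_idx = split_pixels(52997)
```

Splitting by pixel rather than by individual observation keeps every time series entirely inside one subset, so the evaluation pixels stay independent of the training data.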

Model training

The deep learning model developed in this study is based on Python 3.8 (PyTorch). The model comprises an input layer, a three-layer LSTM with eight hidden nodes each, an activation function layer, and an output layer. Within the activation function layer, Gaussian error linear units (GELU) are employed as the activation function, which is known for its high performance in neural networks and its capacity to enhance the network’s ability to fit nonlinear relationships. Thanks to the LSTM’s inherent characteristics, the original surface reflectance data can be directly used for model training without additional pre-processing steps. The model is iterated and optimized with the Adam optimizer, with the learning rate dynamically adjusted between 0.0001 and 0.00000001 using the lr_scheduler function provided by PyTorch. This method causes the learning rate to drop from the maximum value to the minimum value following the shape of a sinusoidal function, which makes the optimization more accurate. The three bands used in the VI formulas are used as the input data (i.e., the red band and near infrared band (b1 and b2 from MOD09Q1) and the blue band (b3 from MOD09A1)). In order to improve the training efficiency, the time span of the sample is shortened as much as possible. Two years of time series data contain both intact vegetation growth cycles and the information linking one year to the next. Therefore, two years is considered the most appropriate time span. Data from 2014 to 2015 were used in this study.
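The architecture described above can be sketched in PyTorch as follows. This is an illustration, not the authors' code. Several details are assumptions: the class name, the two-channel output (NDVI and EVI), the mean squared error loss, the number of epochs, the random stand-in data, and the use of `CosineAnnealingLR` as the scheduler that lowers the learning rate from 1e-4 to 1e-8 along a sinusoidal curve.

```python
import torch
from torch import nn

class VILSTM(nn.Module):
    """Three-layer LSTM (8 hidden nodes), GELU activation, linear output."""
    def __init__(self, n_bands=3, hidden=8, n_layers=3, n_out=2):
        super().__init__()
        self.lstm = nn.LSTM(n_bands, hidden, num_layers=n_layers, batch_first=True)
        self.act = nn.GELU()
        self.out = nn.Linear(hidden, n_out)

    def forward(self, x):                 # x: (batch, time, bands)
        h, _ = self.lstm(x)
        return self.out(self.act(h))      # (batch, time, n_out)

model = VILSTM()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
n_epochs = 10                             # hypothetical; not given in the text
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(
    optimizer, T_max=n_epochs, eta_min=1e-8)

# Random stand-ins: 4 pixels, 92 eight-day steps (two years), red/NIR/blue.
x = torch.rand(4, 92, 3)
target = torch.rand(4, 92, 2)
for _ in range(n_epochs):
    optimizer.zero_grad()
    loss = nn.functional.mse_loss(model(x), target)
    loss.backward()
    optimizer.step()
    scheduler.step()                      # anneal the learning rate once per epoch
```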

NDVI and EVI products evaluation

The obtained VI time series were evaluated based on three aspects. Due to the lack of true observations of VIs, most recent studies have completed the quantitative part of the evaluation by generating a reference time series. The mean values of high-quality observations acquired on the same date in a long time series are calculated, and these mean values are used to generate a high-quality reference time series. This method ignores the case where the pixels change during the long period of time. Moreover, the reference time series generated does not exist in practice, and even when noise is artificially added to this reference time series, the LSTM cannot learn such a feature during training. Thus, this method was not applicable in this study. Instead, we quantitatively and qualitatively evaluated the LSTM products using MOD13Q1 and by visually analysing the time series and spatial characteristics.

MOD13Q1 was calculated from MOD09Q1 using the constrained view angle-maximum value composite method (CV-MVC) to obtain VI values that were considered to have the highest quality within 16 days. MOD13Q1 also provides a layer that describes the data quality (i.e., summaryQA), in which the best and useful data are set to 0 and 1, while the snow/ice-covered and cloud-contaminated data are set to 2 and 3. It was assumed that the reconstructed VIs with a better quality should be more consistent with the best and useful data in MOD13Q1.
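Selecting the reference observations from the summaryQA layer can be sketched as below, keeping only the values 0 and 1 that mark the best and useful data. The function name and the sample arrays are hypothetical.

```python
import numpy as np

def reliable_mask(summary_qa):
    """True where summaryQA marks the best or useful data (values 0 and 1)."""
    return np.isin(np.asarray(summary_qa), (0, 1))

qa = np.array([0, 1, 2, 3, -1])              # -1 stands for fill/no data
ndvi_mod13 = np.array([0.71, 0.65, 0.10, 0.22, -0.30])
reference = ndvi_mod13[reliable_mask(qa)]    # observations used for comparison
```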

In order to compare the LSTM method with other reconstruction methods, four commonly used VI reconstruction methods were applied in this study, namely, the SG filter (SG), double logistic (DL), asymmetric Gaussian (AG), and Whittaker smoother (WT) methods. Among them, the SG, DL, and AG are provided by TIMESAT version 3.3²⁴. The WT can be regarded as a penalized least squares method, which balances fidelity to the data against a penalty on the roughness of the smooth curve⁵⁹. According to the user guide provided by TIMESAT, the half-width of the smoothing window of the SG was set to 12 (number of values per year/4), and the number of iterations was set to 2. For the DL and AG, the seasonality parameter was set to 0.5. For the WT, according to previous research⁶⁰, the default smoothing parameter was set to 2.
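The penalized least squares idea behind the WT can be sketched in a few lines. The smoother minimizes the sum of squared residuals plus λ times the sum of squared differences of the smooth curve, which leads to a linear system. The second-order difference penalty and the dense-matrix solve are assumptions made for this illustration; λ = 2 is the smoothing parameter quoted above.

```python
import numpy as np

def whittaker(y, lam=2.0, order=2):
    """Solve (I + lam * D'D) z = y, where D is the `order`-th difference matrix."""
    y = np.asarray(y, float)
    D = np.diff(np.eye(y.size), n=order, axis=0)
    return np.linalg.solve(np.eye(y.size) + lam * D.T @ D, y)

noisy = np.array([0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0])
smooth = whittaker(noisy)
```

A larger λ gives a smoother curve at the cost of fidelity, which is why the reconstructed VIs from this method depend on the parameter setting.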

The root mean square error (RMSE) and coefficient of determination (R²) were used to evaluate the performances of the five methods. The RMSE reflects the fidelity of the reconstruction results compared to the high-quality data in MOD13Q1. The R² value is between 0 and 1, and its value reflects the adequacy of the independent variable’s interpretation of the dependent variable. R² is one of the most common indexes used to evaluate regression models. These two parameters can be calculated as follows:

$$RMSE=\sqrt{\frac{1}{n}{\sum }_{i=1}^{n}{\left({y}_{resul{t}_{i}}-{y}_{o{b}_{i}}\right)}^{2}}$$

(6)

$${R}^{2}=\frac{{\sum }_{i=1}^{n}{\left({y}_{resul{t}_{i}}-{y}_{average}\right)}^{2}}{{\sum }_{i=1}^{n}{\left({y}_{o{b}_{i}}-{y}_{average}\right)}^{2}}$$

(7)

n is the total number of observations with good quality. ${y}_{o{b}_{i}}$ is the value of an observation, ${y}_{resul{t}_{i}}$ is the value of the corresponding reconstruction result, and y_average is the average value of the observations of high-quality data in MOD13Q1.
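The two metrics can be sketched as follows. The RMSE uses the conventional root-mean-square definition, and R² follows Eq. (7) as written, that is, the variation of the reconstruction about the observed mean divided by the variation of the observations about the same mean. Function names and sample values are hypothetical.

```python
import numpy as np

def rmse(result, obs):
    """Root mean square error between reconstruction and good-quality observations."""
    result, obs = np.asarray(result, float), np.asarray(obs, float)
    return np.sqrt(np.mean((result - obs) ** 2))

def r_squared(result, obs):
    """Eq. (7): explained variation over total variation about the observed mean."""
    result, obs = np.asarray(result, float), np.asarray(obs, float)
    mean_obs = obs.mean()
    return np.sum((result - mean_obs) ** 2) / np.sum((obs - mean_obs) ** 2)

obs = np.array([0.2, 0.5, 0.8])
result = np.array([0.2, 0.5, 0.6])
print(rmse(result, obs), r_squared(result, obs))
```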

Two other parameters were used as a reference to evaluate the results⁶¹. One was the upper envelope rate, which is defined as the ratio of the reconstruction results with values lower than the original observations. For the NDVI, better reconstruction results should have a lower envelope rate. The other was the distance between the reconstruction results and the original observations, which is defined as the average of the absolute value of the difference between all of the reconstruction results and the original observations. It measures the fidelity of the reconstructed results compared to the original observations, and good reconstruction results should not have large distance values.
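Both auxiliary parameters follow directly from their definitions, as in this sketch. The function names and sample values are hypothetical, and whether invalid original values are excluded beforehand is not specified in the text, so no masking is applied here.

```python
import numpy as np

def upper_envelope_rate(result, original):
    """Fraction of reconstructed values that fall below the original observations."""
    result, original = np.asarray(result, float), np.asarray(original, float)
    return np.mean(result < original)

def mean_distance(result, original):
    """Mean absolute difference between reconstruction and original observations."""
    result, original = np.asarray(result, float), np.asarray(original, float)
    return np.mean(np.abs(result - original))

result = [0.5, 0.6, 0.7, 0.8]
original = [0.6, 0.5, 0.7, 0.2]
rate = upper_envelope_rate(result, original)
dist = mean_distance(result, original)
```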

The above parameters were calculated using independent sample pixels on a global scale reserved for evaluation. In addition to quantitative evaluation, in this study, qualitative evaluation was also conducted. This was performed at the selected sample pixels and in the selected spatial area.

Data production

The production of the products continues to be based on PyTorch. By inputting the near-infrared band and red band of MOD09Q1 and the blue band of MOD09A1 into the trained LSTM model and performing calculations, time-series results can be obtained. Then, the results are organized into the same projection and tiling as the MODIS reflectance product through matrix transformation. These products use the same land-sea mask as GLASS V6 LAI, and the ocean part will be assigned invalid values.

Data Records

The improved global NDVI and EVI products from 2000–2021 are freely available from figshare. The 250 m 8-day GLASS VI products for the first day in 2000 are freely available at https://doi.org/10.6084/m9.figshare.22220050 and https://doi.org/10.6084/m9.figshare.22220125^62,63. We have also aggregated them to coarser resolutions (0.05° 8-day, 0.1° monthly, and 0.25° monthly; https://doi.org/10.6084/m9.figshare.22267048⁶⁴). The 250 m data are in the sinusoidal projection, whereas the 0.05°, 0.1°, and 0.25° data are in the geographic latitude and longitude coordinate system. The data files are provided in Hierarchical Data Format-Earth Observing Systems (HDF-EOS) format. The 250 m 8-day NDVI files are named in the following format: “GLASS13D01.V10.AYYYYDDD.hHHvVV.yyyyddd.hdf”, where “GLASS13D01”, “V10”, “YYYY”, “DDD”, and “hHHvVV” denote the product name, version number, year, Julian day of the year (doy), and MODIS tile ID (horizontal and vertical tile indices), respectively. The lowercase letters “yyyyddd” represent the year and doy of the processing date. For the EVI files, the product name is “GLASS14D01”. Additional information, such as the scale factor and value range, is stored in the files.
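The naming convention can be parsed with a regular expression, as in the sketch below. The pattern, the function name, and the example file name are hypothetical illustrations built from the format string given above.

```python
import re
from datetime import date, timedelta

PATTERN = re.compile(
    r"^(?P<product>GLASS1[34]D01)\.(?P<version>V\d+)"
    r"\.A(?P<year>\d{4})(?P<doy>\d{3})"
    r"\.h(?P<h>\d{2})v(?P<v>\d{2})\.(?P<processed>\d{7})\.hdf$"
)

def parse_name(name):
    """Split a GLASS VI file name into its documented components."""
    m = PATTERN.match(name)
    if m is None:
        raise ValueError(f"unexpected file name: {name}")
    year, doy = int(m["year"]), int(m["doy"])
    return {
        "index": "NDVI" if m["product"] == "GLASS13D01" else "EVI",
        "version": m["version"],
        "date": date(year, 1, 1) + timedelta(days=doy - 1),  # doy is 1-based
        "tile": (int(m["h"]), int(m["v"])),
        "processed": m["processed"],
    }

# Hypothetical file name following the documented pattern.
info = parse_name("GLASS13D01.V10.A2000065.h12v04.2022138.hdf")
```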

Technical Validation

In this section, the LSTM-reconstructed VIs are displayed on the global scale. A quantitative comparison and analysis of the performance of the LSTM method and the other reconstruction methods are conducted based on MOD13Q1. In terms of the qualitative analysis, the reconstruction methods are compared in both space and time.

Global reconstruction results

Figure 2 shows the global reconstructed VIs (VI reconstruction results) and the MOD13Q1 product. As can be seen, the spatial distribution of the LSTM-reconstructed VIs is the same as that of the MOD13Q1 product.

When the VIs are calculated, clouds and snow present during the observation process often lead to low-quality data. The global distributions of clouds and snow have very significant spatial and temporal characteristics. According to a previous study, the tropical regions near the equator and the regions at mid-high latitudes have the highest proportion of low-quality observations⁶⁵. In these areas, the reconstruction of VIs is more difficult. Therefore, the north-eastern part of South America and Canada were selected for detailed analysis. The LSTM-reconstructed VIs and MOD13Q1 for these areas are shown in Figs. 3, 4.

In Figs. 3, 4, it can be clearly seen that when the observations were affected by clouds and/or snow, the spatial distributions of the VIs became very uneven. Even though MOD13Q1 calculates relatively high-quality VIs by reducing the time resolution to 16 days and using the CV-MVC algorithm, in the summaryQA layer in MOD13Q1, when the quality of the observation is low, the NDVI will have low values in the area with clouds and/or snow, while the EVI will contain a large amount of noise in the corresponding area. The above phenomena are consistent with our predictions, and the LSTM reconstruction method also effectively eliminates these problems.

Quantitative evaluation

Figure 5 shows the global distribution of the selected evaluation pixels, and the results are presented in Table 1.

Table 1 Evaluation of the LSTM method compared with other methods.

A total of 5301 evaluation pixels are shown in Fig. 5, which are mainly distributed on the surfaces covered by vegetation, and the vegetation types are diverse. Such sample pixels meet the requirements for our research.

Table 1 shows the good performance of the LSTM method. For the different latitudes and VIs, the LSTM method achieved the minimum RMSE and maximum R², demonstrating that the LSTM-reconstructed VIs were the most consistent with the high-quality observation data of the MOD13Q1 product. For the other two parameters, the upper envelope rate and the distance, the LSTM method achieved the expected effect. For the reconstruction of the NDVI, the LSTM method had the lowest upper envelope rate, indicating that its results were closest to the upper envelope of the original time series. The distance results obtained for all of the methods were relatively similar, indicating that the LSTM-reconstructed VIs have fidelities similar to those of the other methods. Among the four conventional methods, the SG method usually achieved the best results. In addition, the WT method has been widely used to reconstruct the EVI, and Table 1 shows that it achieved better results than the SG filter in reconstructing the EVI at middle and high latitudes. Overall, the SG and WT methods were second only to the LSTM method. In the subsequent qualitative evaluation, we compare these two methods with the LSTM method.

Qualitative evaluation

Due to their superior performances compared with the other methods, in this section we focus on comparing the SG and WT methods with the LSTM method. The evaluation is carried out from two aspects: the VI time series curves of representative pixels and the spatial distributions of the reconstruction results. The representative pixels and areas selected for display are guided by the previous section; that is, the pixels and areas with large differences in the results of the different reconstruction methods are displayed to highlight the advantages and disadvantages of these methods.

Evaluation based on the curves of the representative pixels

This section displays the VI time series curves with the greatest differences according to the differences in the reconstruction results of the different methods. Curves with similar shapes are not displayed repeatedly in the following figures.

Figure 6 shows the performances of the three methods in reconstructing the NDVI. Compared with the LSTM method, the WT and SG methods are limited in the following two cases. First, in tropical areas, heavy cloud cover and poor observation conditions mean that no high-quality data are available for long periods. This causes the SG and WT reconstructions to be underestimated, and almost no temporal filtering method can solve this problem. Second, at middle and high latitudes, snow and the observation angle produce a large number of invalid values in the original surface reflectance data. In this case, the SG and WT reconstructions are inevitably affected by the filling value, and their quality depends strongly on the filling method. In this experiment, the SG and WT reconstructions are significantly underestimated because the filling value was set to −0.2. In contrast, because it incorporates the LAI information, the LSTM method can effectively eliminate these adverse effects and reconstruct a high-quality NDVI time series even when a large number of low-quality observations and invalid values persist for a long time.
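The effect of the filling value on a temporal filter can be illustrated with a small sketch. It applies a plain 5-point quadratic Savitzky-Golay kernel (the standard coefficients −3, 12, 17, 12, −3 divided by 35) to a hypothetical NDVI series in which three consecutive composites are replaced by the −0.2 filling value. The window length, the synthetic series, and the single-pass filter are assumptions chosen for illustration; the SG implementation used in the comparison may differ.

```python
# 5-point quadratic Savitzky-Golay smoothing kernel (standard coefficients)
SG5 = (-3 / 35, 12 / 35, 17 / 35, 12 / 35, -3 / 35)
FILL_VALUE = -0.2  # filling value quoted in the text for invalid observations

def sg_smooth(series):
    """Apply the 5-point SG kernel; the two samples at each end are left unchanged."""
    out = list(series)
    for i in range(2, len(series) - 2):
        out[i] = sum(c * series[i + k - 2] for k, c in enumerate(SG5))
    return out

# Hypothetical NDVI series: true value 0.8 throughout, three invalid composites
true_ndvi = [0.8] * 11
observed = list(true_ndvi)
for i in (4, 5, 6):
    observed[i] = FILL_VALUE

smoothed = sg_smooth(observed)
bias_at_gap = smoothed[5] - true_ndvi[5]  # negative: the filter follows the fill value
```

Because the kernel is a weighted average of neighbouring samples, a run of filling values as long as most of the window pulls the smoothed value far below the true NDVI, which is the underestimation described above.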

For the EVI in Fig. 7, the situation is similar to that for the NDVI. In the two cases mentioned above, the low-quality EVI data often appear as noise with either high or low values. This creates challenges for the LSTM method and the other methods because the goal of reconstruction is no longer to reconstruct the upper envelope curve. Even so, because the LSTM method incorporates the LAI, its results still recover the variation trend of the EVI with the LAI. For the other two methods, in addition to the problems encountered when reconstructing the NDVI, the low-quality data with high values have a large impact on the final EVI results, thus reducing their quality.

Evaluation based on the spatial distribution

In this section, each experimental area corresponds to 400 × 400 pixels in a tile of the MOD09A1 product. The selected areas are located in the centers of tile h28v06 and tile h10v08. The original VIs and LSTM-reconstructed VIs corresponding to the first half of 2014 in these regions are displayed. The EVI reconstruction results obtained using the WT method and the NDVI reconstruction results obtained using the SG method are also displayed. According to the quality file provided by MOD09Q1, the cloudy portion of the original vegetation indices is set to −1.
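The selection of the display area can be sketched as follows. The snippet computes the bounds of a 400 × 400 window centred in a tile and sets cloudy pixels to −1. The function names are hypothetical, the grids are plain nested lists, and the cloud flags are assumed to have been decoded from the quality layer beforehand; decoding the quality bits is not shown.

```python
CLOUD_VALUE = -1.0  # value assigned to cloudy pixels in the text

def center_window(n_rows, n_cols, size=400):
    """Return (row_start, row_stop, col_start, col_stop) of a size x size window centred in the tile."""
    if size > n_rows or size > n_cols:
        raise ValueError("window larger than tile")
    r0 = (n_rows - size) // 2
    c0 = (n_cols - size) // 2
    return r0, r0 + size, c0, c0 + size

def crop_and_mask(vi, cloudy, size=400):
    """Crop the central window from a VI grid and set cloudy pixels to CLOUD_VALUE.

    vi and cloudy are equally shaped nested lists; cloudy holds booleans
    (assumed to be derived from the quality file in a separate step).
    """
    r0, r1, c0, c1 = center_window(len(vi), len(vi[0]), size)
    return [
        [CLOUD_VALUE if cloudy[r][c] else vi[r][c] for c in range(c0, c1)]
        for r in range(r0, r1)
    ]
```

For a grid of 4800 × 4800 pixels, `center_window(4800, 4800)` gives rows and columns 2200 to 2600 (stop index exclusive); the tile dimensions are passed in rather than fixed so that the same helper applies to grids of other sizes.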

Figures 8 and 9 show that the SG and WT methods are partially effective in reconstructing the VIs. However, consistent with the conclusions in the previous section, when the observations are contaminated by continuous clouds, the quality of the SG and WT results cannot meet the requirements. The corresponding area in Fig. 8 is located in southeastern China and has a subtropical monsoon climate. This makes the region rich in precipitation in spring and summer, which manifests as the large number of low-value areas in Fig. 8a. At DOY 57 and DOY 121, the region experienced two periods of consecutive low-quality observations, each lasting more than 21 days.
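Periods of consecutive low-quality observations such as these can be located with a simple run-length scan over the 8-day composites. The sketch below is illustrative only: the function name is hypothetical, the low-quality flags are assumed to be given per composite, and each composite is counted as 8 days.

```python
COMPOSITE_DAYS = 8  # the products are 8-day composites

def low_quality_runs(doys, low_quality, min_days=21):
    """Return (start_doy, n_days) for each run of consecutive low-quality
    composites whose total duration exceeds min_days."""
    runs = []
    start, length = None, 0
    for doy, bad in zip(doys, low_quality):
        if bad:
            if start is None:
                start = doy
            length += 1
        else:
            if start is not None and length * COMPOSITE_DAYS > min_days:
                runs.append((start, length * COMPOSITE_DAYS))
            start, length = None, 0
    # Close a run that extends to the end of the series
    if start is not None and length * COMPOSITE_DAYS > min_days:
        runs.append((start, length * COMPOSITE_DAYS))
    return runs
```

With 8-day composites, three consecutive low-quality composites already span 24 days, so any run of three or more composites exceeds the 21-day threshold, while isolated low-quality composites are ignored.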

In Fig. 9, the area shown is located in the northeastern part of South America. The observations in this area were disturbed by perennial clouds and precipitation, which is reflected in the EVI as a large amount of discrete noise. Figure 9b shows that the WT method cannot effectively remove the noise pixels. For the first 73 days of the year, low-quality values still exist in the results of the WT method.

Usage Notes

This study presents a method for reconstructing global 250 m NDVI and EVI using LSTM networks and MODIS surface reflectance data. Specifically, high-quality VI time series were generated for 52997 pixels by leveraging MODIS surface reflectance data from 2014–2015 and the GLASS V6 Leaf Area Index (LAI). Due to methodological assumptions, this product theoretically exhibits the highest quality in areas covered by vegetation. In areas devoid of vegetation, such as deserts or urban zones, the LAI value typically remains extremely low or approaches zero. GLASS LAI offers data for these regions, as it can furnish insights into the presence of sparse vegetation and the dynamics of vegetation cover changes over time. Moreover, the temporal curve of this product generally aligns with the LAI trend while preserving its inherent characteristics. In addition, the low observation quality in high latitude regions may affect the product to some extent, but this only occurs at the beginning and end of the year and the impact is minimal.

Code availability

The Python code for generating and processing the data can be accessed through GitHub (https://github.com/Xiongkovsky/glass_vis_lstm_code).

References

Zhang, Y. L., Song, C. H., Band, L. E., Sun, G. & Li, J. X. Reanalysis of global terrestrial vegetation trends from MODIS products: Browning or greening? Remote Sensing of Environment. 191, 145–155 (2017).
Zeng, L. L., Wardlow, B. D., Xiang, D. X., Hu, S. & Li, D. R. A review of vegetation phenological metrics extraction using time-series, multispectral satellite data. Remote Sensing of Environment. 237, 20 (2020).
Buyantuyev, A. & Wu, J. G. Urbanization diversifies land surface phenology in arid environments: Interactions among vegetation, climatic variation, and land use pattern in the Phoenix metropolitan region, USA. Landsc. Urban Plan. 105, 149–159 (2012).
Pastor-Guzman, J., Dash, J. & Atkinson, P. M. Remote sensing of mangrove forest phenology and its environmental drivers. Remote Sensing of Environment. 205, 71–84 (2018).
Guan, X. B., Shen, H. F., Li, X. H., Gan, W. X. & Zhang, L. P. A long-term and comprehensive assessment of the urbanization-induced impacts on vegetation net primary productivity. Sci. Total Environ. 669, 342–352 (2019).
Tucker, C. J. & Sellers, P. J. Satellite remote-sensing of primary production. Int J Remote Sens. 7, 1395–1416 (1986).
Zhao, J. et al. Time-lagged response of vegetation dynamics to climatic and teleconnection factors. Catena. 189, 12 (2020).
Tong, X. W. et al. Quantifying the effectiveness of ecological restoration projects on long-term vegetation dynamics in the karst regions of Southwest China. International Journal of Applied Earth Observation and Geoinformation. 54, 105–113 (2017).
Hu, T. X. et al. Mapping fine-scale human disturbances in a working landscape with Landsat time series on Google Earth Engine. Isprs Journal of Photogrammetry and Remote Sensing. 176, 250–261 (2021).
Jia, K. et al. Land cover classification of finer resolution remote sensing data integrating temporal features from time series coarser resolution data. Isprs Journal of Photogrammetry and Remote Sensing. 93, 49–55 (2014).
Holben, B. N., Tucker, C. J. & Fan, C. J. Spectral assessment of soybean leaf-area and leaf biomass. Photogramm. Eng. Remote Sens. 46, 651–656 (1980).
Huete, A. et al. Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sensing of Environment. 83, 195–213 (2002).
Kong, D. D., Zhang, Y. Q., Gu, X. H. & Wang, D. G. A robust method for reconstructing global MODIS EVI time series on the Google Earth Engine. Isprs Journal of Photogrammetry and Remote Sensing. 155, 13–24 (2019).
Li, X. H. et al. Cloud removal in remote sensing images using nonnegative matrix factorization and error correction. Isprs Journal of Photogrammetry and Remote Sensing. 148, 103–113 (2019).
Shen, H. F. et al. Missing Information Reconstruction of Remote Sensing Data: A Technical Review. IEEE Geosci. Remote Sens. Mag. 3, 61–85 (2015).
Shao, Y., Lunetta, R. S., Wheeler, B., Iiames, J. S. & Campbell, J. B. An evaluation of time-series smoothing algorithms for land-cover classifications using MODIS-NDVI multi-temporal data. Remote Sensing of Environment. 174, 258–265 (2016).
Gutman, G., Skakun, S. & Gitelson, A. Revisiting the use of red and near-infrared reflectances in vegetation studies and numerical climate models. Science of Remote Sensing. 4 (2021).
Xu, L. L., Li, B. L., Yuan, Y. C., Gao, X. Z. & Zhang, T. A Temporal-Spatial Iteration Method to Reconstruct NDVI Time Series Datasets. Remote Sensing. 7, 8906–8924 (2015).
Bhattacharjee, S., Mitra, P. & Ghosh, S. K. Spatial Interpolation to Predict Missing Attributes in GIS Using Semantic Kriging. Ieee Transactions on Geoscience and Remote Sensing. 52, 4771–4780 (2014).
Li, S. et al. High-quality vegetation index product generation: A review of NDVI time series reconstruction techniques. International Journal of Applied Earth Observation and Geoinformation. 105 (2021).
Geng, L. Y. et al. Comparison of Eight Techniques for Reconstructing Multi-Satellite Sensor Time-Series NDVI Data Sets in the Heihe River Basin, China. Remote Sensing. 6, 2024–2049 (2014).
Viovy, N., Arino, O. & Belward, A. S. The best index slope extraction (bise) - a method for reducing noise in ndvi time-series. Int J Remote Sens. 13, 1585–1590 (1992).
Chen, J. et al. A simple method for reconstructing a high-quality NDVI time-series data set based on the Savitzky-Golay filter. Remote Sensing of Environment. 91, 332–344 (2004).
Jonsson, P. & Eklundh, L. Seasonality extraction by function fitting to time-series of satellite sensor data. Ieee Transactions on Geoscience and Remote Sensing. 40, 1824–1832 (2002).
Beck, P. S. A., Atzberger, C., Hogda, K. A., Johansen, B. & Skidmore, A. K. Improved monitoring of vegetation dynamics at very high latitudes: A new method using MODIS NDVI. Remote Sensing of Environment. 100, 321–334 (2006).
Chen, J. M., Deng, F. & Chen, M. Z. Locally adjusted cubic-spline capping for reconstructing seasonal trajectories of a satellite-derived surface parameter. Ieee Transactions on Geoscience and Remote Sensing. 44, 2230–2238 (2006).
Yang, G., Shen, H. F., Zhang, L. P., He, Z. Y. & Li, X. H. A Moving Weighted Harmonic Analysis Method for Reconstructing High-Quality SPOT VEGETATION NDVI Time-Series Data. Ieee Transactions on Geoscience and Remote Sensing. 53, 6008–6021 (2015).
Zhou, J., Jia, L. & Menenti, M. Reconstruction of global MODIS NDVI time series: Performance of Harmonic ANalysis of Time Series (HANTS). Remote Sensing of Environment. 163, 217–228 (2015).
Lu, X. L., Liu, R. G., Liu, J. Y. & Liang, S. L. Removal of noise by wavelet method to generate high quality temporal data of terrestrial MODIS products. Photogramm. Eng. Remote Sens. 73, 1129–1139 (2007).
Chu, D. et al. Long time-series NDVI reconstruction in cloud-prone regions via spatio-temporal tensor completion. Remote Sensing of Environment. 264 (2021).
Fang, H. L. et al. Developing a spatially continuous 1 km surface albedo data set over North America from Terra MODIS products. J. Geophys. Res.-Atmos. 112, 20 (2007).
Padhee, S. K. & Dutta, S. Spatio-Temporal Reconstruction of MODIS NDVI by Regional Land Surface Phenology and Harmonic Analysis of Time-Series. GISci. Remote Sens. 56, 1261–1288 (2019).
Cai, Z., Jönsson, P., Jin, H. & Eklundh, L. Performance of Smoothing Methods for Reconstructing NDVI Time-Series and Estimating Vegetation Phenology from MODIS Data. Remote Sensing. 9 (2017).
Zhou, J., Jia, L., Menenti, M. & Gorte, B. On the performance of remote sensing time series reconstruction methods – A spatial comparison. Remote Sensing of Environment. 187, 367–384 (2016).
Zhang, C. et al. A hybrid MLP-CNN classifier for very fine resolution remotely sensed image classification. Isprs Journal of Photogrammetry and Remote Sensing. 140, 133–144 (2018).
Zhao, W. Z. & Du, S. H. Learning multiscale and deep representations for classifying remotely sensed imagery. Isprs Journal of Photogrammetry and Remote Sensing. 113, 155–165 (2016).
Zhang, Y. Z., Ma, J., Liang, S. L., Li, X. S. & Liu, J. D. A stacking ensemble algorithm for improving the biases of forest aboveground biomass estimations from multiple remotely sensed datasets. GISci. Remote Sens. 59, 234–249 (2022).
Tan, J. C. et al. Deep Learning Convolutional Neural Network for the Retrieval of Land Surface Temperature from AMSR2 Data in China. Sensors. 19 (2019).
Wu, P. H., Yin, Z. X., Yang, H., Wu, Y. L. & Ma, X. S. Reconstructing Geostationary Satellite Land Surface Temperature Imagery Based on a Multiscale Feature Connected Convolutional Neural Network. Remote Sensing. 11 (2019).
Gamboa, J. C. B. Deep learning for time-series analysis. arXiv preprint arXiv:1701.01887. (2017).
Gers, F. A., Schmidhuber, J. & Cummins, F. Learning to forget: Continual prediction with LSTM. Neural Computation. 12, 2451–2471 (2000).
Graves, A. & Schmidhuber, J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks. 18, 602–610 (2005).
Graves, A., Jaitly, N., Mohamed, A.-R. & Ieee. in IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). 273–278 (2013).
Greff, K., Srivastava, R. K., Koutnik, J., Steunebrink, B. R. & Schmidhuber, J. LSTM: A Search Space Odyssey. Ieee Transactions on Neural Networks and Learning Systems. 28, 2222–2232 (2017).
Ma, H. & Liang, S. Development of the GLASS 250-m leaf area index product (version 6) from MODIS data using the bidirectional LSTM deep learning model. Remote Sensing of Environment. 273 (2022).
Liang, S. et al. The Global Land Surface Satellite (GLASS) Product Suite. Bulletin of the American Meteorological Society. 102, E323–E337 (2021).
Vermote, E. MOD09Q1 MODIS/Terra Surface Reflectance 8-Day L3 Global 250m SIN Grid V006. NASA EOSDIS Land Processes DAAC https://doi.org/10.5067/MODIS/MOD09Q1.006 (2015).
Vermote, E. MOD09A1 MODIS/Terra Surface Reflectance 8-Day L3 Global 500m SIN Grid V006. NASA EOSDIS Land Processes DAAC https://doi.org/10.5067/MODIS/MOD09A1.006 (2015).
Roy, D. P. et al. The MODIS Land product quality assessment approach. Remote Sensing of Environment. 83, 62–76 (2002).
Didan, K., Munoz, A. B., Solano, R., Huete, A. MOD13Q1 MODIS/Terra Vegetation Indices 16-Day L3 Global 250m SIN Grid V006. NASA EOSDIS Land Processes DAAC https://doi.org/10.5067/MODIS/MOD13Q1.006 (2015).
Schmidhuber, J. Deep learning in neural networks: An overview. Neural Networks. 61, 85–117 (2015).
Yu, Y., Si, X. S., Hu, C. H. & Zhang, J. X. A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures. Neural Computation. 31, 1235–1270 (2019).
Fan, L., Gao, Y., Bruck, H. & Bernhofer, C. Investigating the relationship between NDVI and LAI in semi-arid grassland in Inner Mongolia using in-situ measurements. Theoretical and Applied Climatology. 95, 151–156 (2009).
Fensholt, R., Sandholt, I. & Rasmussen, M. S. Evaluation of MODIS LAI, fAPAR and the relation between fAPAR and NDVI in a semi-arid environment using in situ measurements. Remote Sensing of Environment. 91, 490–507 (2004).
Johnson, L. F. Temporal stability of an NDVI-LAI relationship in a Napa Valley vineyard. Australian Journal of Grape and Wine Research. 9, 96–101 (2003).
Lyu, H., Lu, H. & Mou, L. Learning a Transferable Change Rule from a Recurrent Neural Network for Land Cover Change Detection. Remote Sensing. 8 (2016).
Thireou, T. & Reczko, M. Bidirectional Long Short-Term Memory Networks for predicting the subcellular localization of eukaryotic proteins. Ieee-Acm Transactions on Computational Biology and Bioinformatics. 4, 441–446 (2007).
Colah. Understanding LSTM Networks http://colah.github.io/posts/2015-08-Understanding-LSTMs/ (2015).
Atzberger, C. & Eilers, P. H. C. A time series for monitoring vegetation activity and phenology at 10-daily time steps covering large parts of South America. International Journal of Digital Earth. 4, 365–386 (2011).
Atkinson, P. M., Jeganathan, C., Dash, J. & Atzberger, C. Inter-comparison of four models for smoothing satellite sensor time-series data to estimate vegetation phenology. Remote Sensing of Environment. 123, 400–417 (2012).
Julien, Y. & Sobrino, J. A. Comparison of cloud-reconstruction methods for time series of composite NDVI data. Remote Sensing of Environment. 114, 618–625 (2010).
Xiong, C. A global land surface 250-m 8-day NDVI product (2000_001). Figshare https://doi.org/10.6084/m9.figshare.22220050 (2023).
Xiong, C. A global land surface 250-m 8-day EVI product (2000_001). Figshare https://doi.org/10.6084/m9.figshare.22220125 (2023).
Xiong, C. A global land surface NDVI and EVI product (0.1°, 0.25°). Figshare https://doi.org/10.6084/m9.figshare.22267048 (2023).
Liu, R., Shang, R., Liu, Y. & Lu, X. Global evaluation of gap-filling approaches for seasonal NDVI with considering vegetation growth trajectory, protection of key point, noise resistance and curve stability. Remote Sensing of Environment. 189, 164–179 (2017).

Acknowledgements

This research was funded by the National Natural Science Foundation of China (No. 42301397). This research was also supported by the Open Research Program of the International Research Center of Big Data for Sustainable Development Goals (Grant No. CBAS2022ORPO1). The authors acknowledge data support from the National Earth System Science Data Centre, National Science & Technology Infrastructure of China (http://www.geodata.cn).

Author information

Authors and Affiliations

School of Remote Sensing and Information Engineering, Wuhan University, Hubei, 430010, China
Changhao Xiong, Tao He, Yufang Zhang, Guodong Zhang & Jianglei Xu
JC STEM Lab of Quantitative Remote Sensing, Department of Geography, The University of Hong Kong, Hong Kong, China
Han Ma & Shunlin Liang
International Center for China Development Studies, The University of Hong Kong, Hong Kong, China
Han Ma

Contributions

C.X. performed data collection, investigation, methodology, evaluation and writing (original draft); H.M. performed conceptualization, supervision and writing (review and editing); S.L. performed supervision and writing (review); G.Z., Y.Z. and J.X. participated in data resources and distribution; T.H. participated in data production.

Corresponding author

Correspondence to Han Ma.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Xiong, C., Ma, H., Liang, S. et al. Improved global 250 m 8-day NDVI and EVI products from 2000–2021 using the LSTM model. Sci Data 10, 800 (2023). https://doi.org/10.1038/s41597-023-02695-x

Received: 19 March 2023
Accepted: 27 October 2023
Published: 14 November 2023
DOI: https://doi.org/10.1038/s41597-023-02695-x