Mapping of secondary forest age in China using stacked generalization and Landsat time series

Zhang, Shaoyu; Xu, Hanzeyu; Liu, Aixia; Qi, Shuhua; Hu, Bisong; Huang, Min; Luo, Jin

doi:10.1038/s41597-024-03133-2

Download PDF

Data Descriptor
Open access
Published: 16 March 2024

Mapping of secondary forest age in China using stacked generalization and Landsat time series

Shaoyu Zhang¹,
Hanzeyu Xu²,
Aixia Liu³,
Shuhua Qi ORCID: orcid.org/0000-0002-0708-373X¹,
Bisong Hu¹,
Min Huang¹ &
…
Jin Luo¹

Scientific Data volume 11, Article number: 302 (2024) Cite this article

1423 Accesses
Metrics details

Subjects

Abstract

A national distribution of secondary forest age (SFA) is essential for understanding the forest ecosystem and carbon stock in China. While past studies have mainly used various change detection algorithms to detect forest disturbance, which cannot adequately characterize the entire forest landscape. This study developed a data-driven approach for improving performances of the Vegetation Change Tracker (VCT) and Continuous Change Detection and Classification (CCDC) algorithms for detecting the establishment of forest stands. An ensemble method for mapping national-scale SFA by determining the establishment time of secondary forest stands using change detection algorithms and dense Landsat time series was proposed. A dataset of national secondary forest age for China (SFAC) for 1 to 34 and with a 30-m spatial resolution was produced from the optimal ensemble model. This dataset provides national, continuous spatial SFA information and can improve understanding of secondary forests and the estimation of forest carbon storage in China.

Ghost roads and the destruction of Asia-Pacific tropical forests

Article Open access 10 April 2024

FSC-certified forest management benefits large mammals compared to non-FSC

Article Open access 10 April 2024

Hotspots of biogeochemical activity linked to aridity and plant traits across global drylands

Article 12 April 2024

Background & Summary

Secondary forests represent forest or wood ecosystems that have recovered from disturbance following regeneration or plantation¹. Secondary forests dominate the forest landscape and play a crucial role in ecosystem health². Therefore, understanding the structure characteristics of secondary forests is important for developing forest conservation policies³. For example, stand age is an indicator of the forest ecosystem with ecological relevance^4,5,6. However, few past studies on secondary forests have less interest in stand age since the main focuses have been on the extent and range of forest disturbance. Therefore, there is a need to estimate secondary forest age (SFA) to improve understanding of the function of secondary forests in the national-scale terrestrial ecosystem⁷.

Remote sensing technologies offer low-cost, efficient, and easily accessible data at multiple spatial and temporal scales, and these data provide exciting possibilities for investigating the resources, composition, and functions of forests⁸. There are currently three typical categories of methods for mapping SFA: (1) derivation from the land cover datasets; (2) classification from remote-sensed images; and (3) retrieving the establishment time of secondary forest stands. The choice of derivation method is heavily dependent on the accuracy and period of continuous land cover datasets, and these datasets can be difficult to obtain at a large scale^1,9,10. Classification-based SFA mapping offers the opportunity to differentiate between mature and secondary forests. This approach typically uses high-resolution satellite imagery, including SPOT-5, WorldView-3, and ALOS PALSAR^11,12,13. Time series data of land disturbance can be utilized for mapping the age of tree crops (e.g., oil palm, rubber) based on change detection algorithms^14,15,16. Therefore, historical SFA can be estimated by retrieving the establishment time of the secondary forest using time series change detection algorithms.

Many time series change detection algorithms have been proposed for analyzing the historical dynamics of forests^17,18,19. Some previous estimates of SFA by monitoring forest stands after disturbance at regional scales^{1,20,21,22,23} using time series segment algorithms, such as the Vegetation Change Tracker (VCT)^24,25 and Continuous Change Detection and Classification (CCDC)²⁶. However, the use of a single approach to retrieve the time of stand establishment is inadequate due to the complexity of the terrain, the range of forest types, and the basic algorithm logic. There has been limited focus and progress on the recovery of the secondary forest age since most algorithms have been designed for detecting forest disturbance²⁷. In addition, the application of these algorithms to different regions has highlighted their continued inconvenience and uncertainty. For example, the widely-used VCT remains difficult to apply at large scales and the CCDC continues to overestimate change due to its sensitivity to subtle changes²⁸. Thus, there remains a need to improve the understanding and accuracy of change detection algorithms to allow the precise mapping of regional- and national-scale SFA.

Forest accounts for 23.04% of the total area of China (2021) (http://www.forestry.gov.cn/). This high coverage of forest in China can be mainly attributed to afforestation and forest recovery efforts over the past decades, including programs to return cultivated land into forest and the closing hillsides to facilitate afforestation. Estimations of the distribution, density, structure, and pattern of secondary forests are key for understanding the role and function of secondary forests within the wider forest ecosystem in China. The aims of the present study were to (1) develop a data-driven method for VCT and CCDC on detecting the establishment of forest stands; (2) design a novel SFA estimation method using stacked generalization and Landsat time series; (3) use the optimal ensemble method to produce a national-scale mapping of SFA for China for 2020; (4) assess the accuracy of SFAC²⁹ with validation samples, statistical data, and other datasets.

Methods

Data processing

Landsat time series data

The present study used Landsat Collection 2 Level-1 data to retrieve the establishment time of secondary forest stands. All available surface reflectance images, including Landsat 5 Thematic Mapper (5TM), 7 Enhanced Thematic Mapper Plus (7ETM+), and 8 Operational Land Imager (8OLI) images from 1986 to 2021 were provided by the United States Geological Survey (USGS) (available at https://earthexplorer.usgs.gov/) and were obtained using Google Earth Engine (GEE)³⁰. Clouds, cloud shadows, and snow were filtered using CFMask algorithms³¹ in GEE. The stacks of the annual composite were obtained using the Best Available Pixel (BAP) method³² and were used as input data for the LandTrendr (LT) and VCT algorithms described below. The stacks of all available images were prepared as inputs for the Moving Average Change Detection (MACD) and CCDC algorithms described below.

New reference forest map

The present study produced a new reference forest map for China from three land cover products for 2020³³ (Fig. 1a). These datasets included the World Cover 2020 (ESA-2020) (available at https://esa-worldcover.org/en), ESRI 2020 Land Cover (ESRI-2020) (available at https://livingatlas.arcgis.com/landcover), and the GlobeLand30 version of V2020 (GLC-2020) (available at http://www.globallandcover.com/). The ESA-2020 dataset is a global land cover map with an overall accuracy for Asia 2020 of 80.7% at a 10-m resolution³⁴. The ESRI-2020 co-released by ESRI and Microsoft Planetary Computer platform is a global land cover map with a 10-m resolution³⁵. GLC-2020 was created by a research group in China and is a global land cover product with a 30-m resolution and an overall accuracy for 2020 of 85.72%^36,37. The “forest” category is all provided in these land cover products, though from inconsistent definitions (i.e., tree cover percentage >15% and tree height >3 m in ESA-2020, vegetation cover with trees >30% in GLC-2020). Within the production of the reference forest map for China, a pixel was assumed to represent forest when the same pixel in at least two land cover products showed forest properties, thereby decreasing the uncertainties of classification of forests at a large scale^38,39.

Figure 1 shows the distribution of the forest baseline map and the multiple sub-study districts in China. These sub-study districts include eight forestry projects and 31 provinces: the three-north shelterbelt program (TN); afforestation program for Taihang Mountain (TH); shelterbelt program for Liaohe river (LH); shelterbelt program for the middle reaches of the Yellow river (Yellow); the shelterbelt program for Huaihe river and Taihu lake (HT); the shelterbelt program for the upper and middle reaches of the Yangtze river (Yangtze); the shelterbelt program for the Pearl river (Pearl); the coastal shelterbelt program (Coastal). The provinces of China and adjacent areas included in the present study are Anhui (AH), Beijing (BJ), Chongqing (CQ), Fujian (FJ), Guangdong (GD), Gansu (GS), Guangxi (GX), Guizhou (GZ), Hebei (HB), Henan (HeN), Heilongjiang (HLJ), Hainan (HN), Hubei (HuB), Hunan (HuN), Inner Mongolia (IM), Jilin (JL), Jiangsu (JS), Jiangxi (JX), Liaoning (LN), Ningxia (NX), Qinghai (QH), Sichuan (SC), Shandong (SD), Shanghai (SH), Shannxi (SNX), Shanxi (SX), Tibet, Tianjin (TJ), Taiwan (TW), Xinjiang (XJ), Yunnan (YN), Zhejiang (ZJ).

Candidate stable and secondary forest maps

Candidate stable and secondary forest maps were prepared for validation against samples and input into the algorithms. The European Space Agency Climate Change Initiative-Land Cover (ESA_CCI-LC) project provides a consistent annual global land cover map with a 300-m spatial resolution for 1992 to 2020⁴⁰ (available at https://climate.esa.int/en/projects/land-cover/). The stable and secondary forest maps were individually derived based on ESA_CCI-LC by yearly overlaying¹⁰. Pixels of stable forest represented the forest in 1986 was always there from 1986 to 2020 without clear-cut or regrowth, whereas pixels of secondary forest were identified as the newly occurred forest including the natural forest regrowth and artificial afforestation.

Validation Samples

The 2,072 samples of secondary and 3,000 samples of stable forest, respectively were used to assess the accuracy of the results produced by each algorithm and ensemble⁴¹ (Fig. 2). Samples for validation were selected randomly from the 7th National Forest Resources Inventory (NFRI) and were compared to the secondary forest maps produced above. The candidate points were visually examined using “Landsat Time Series Explorer”, a shared Application on GEE (https://jstnbraaten.users.earthengine.app/view/landsat-timeseries-explorer). In addition, historical imagery from Google Earth (https://earth.google.com/), GF-6 panchromatic/multispectral (PMS) images (a high-resolution Chinese satellite) (https://data.cresda.cn/#/2dMap) helped to distinguish stable and secondary forest samples. A total of 2,072 validation samples of secondary forest age ranging from 1 to 34 were defined by the re-interpreted approach mentioned above.

Over 3,000 candidates of stable forests were randomly sampled from stable forest maps for validation. The classification of these samples of stable forest was ensured by filtering through many public land cover products. As shown in Table 1, these datasets included AGLC-2000-2015, GLC_FCS, FNF, GLC, CLUD, and GFCC. The categorization of the samples as stable forests was ensured by processing using Python, ArcGIS 10.6, and GEE. The 3,000 samples of the stable forest were then completed after manually removing pixels not following the rules: first, the patch should have pure, intact forest cover and satisfy the definition of forest in the Food and Agriculture Organization of the United Nations (FAO)⁴²; second, the point sample of the forest should be located in the center of the forest patch.

Table 1 Land cover products used for determining stable forest samples.

Full size table

Detection of establishment times of secondary forest stands

The ages of secondary forest stands were determined by detecting the times of the newest stand establishment using change detection algorithms and Landsat time series data. The present study selected four basic algorithms, namely threshold-based moving average change detection (MACD)⁴³, LandTrendr (LT)⁴⁴, VCT, and CCDC algorithms, to detect the establishment times of secondary forest stands. These algorithms were chosen due to their relative advantages in large-scale analysis, performance, convenience, and efficiency, as well as their use in previous studies for estimating the SFAs of specific forests or trees^{9,23,43,45,46,47}. MACD is a thresholding method in which changes are defined as large deviations from the set threshold. The bare soil index (BSI) with a threshold of 0 was used for detecting the stand establishment. LT identifies gradual changes (mainly recovery) in time series by temporal segmentation and linear regression^44,48. VCT was used to detect the forest regrowth based on the Integrated Forest Z-score (IFZ) threshold^24,25. CCDC algorithm can fit a curve for each pixel with harmonic model and historical time-series Landsat images and capture changes by comparing model prediction with satellite observation²⁶. The Normalized Burn Ratio (NBR) index was widely used to detect the forest dynamic, and it was also used as an input parameter for LT and CCDC. The MACD, LT, VCT, and CCDC were used to identify the establishment times of the secondary forest stands. The establishment time was then converted to forest age in 2020.

Data-driven VCT

The VCT suffers various disadvantages, including complex computation, the need for forest samples, and the difficulty of application at a large scale. Therefore, the present study applied a data-driven approach to facilitate the online use of VCT in GEE. The core index, integrated forest z-score (IFZ), was used with VCT to detect the forest dynamic⁴⁹. The IFZ index needed in VCT was calculated as:

$$F{Z}_{i}=\frac{{b}_{i}-{\bar{b}}_{i}}{S{D}_{i}},$$

(1)

$$IFZ=\sqrt{\frac{1}{NB}\mathop{\sum }\limits_{i=1}^{NB}{\left(F{Z}_{i}\right)}^{2}},$$

(2)

where the RED, short-wave infrared 1 (SWIR1), and short-wave infrared 1 (SWIR2) bands in Landsat were needed to construct the forest z-score (FZ) and IFZ indices. The ${\bar{b}}_{i}$ and SD_i are the mean and standard deviation of the band i spectral values of the forest samples within the image, respectively, and NB is the number of total bands.

Many forest samples were needed to calculate the b_i and SD_i of forest pixels. However, there are differences in structure and spectral properties among different forest types, such as deciduous, mixed forest, open forest, and evergreen forests⁵⁰, as well as among different climate zones, such as temperate, semiarid, and arid zones²⁴. Therefore, the use of b_i and SD_i as reference values for the entire study region is inaccurate, particularly at a national scale. In addition, when applying the calculation to many samples, the diversity of forest types hinders the application of the VCT algorithm at a large scale. Secondly, there is a need for a flexible and accurate threshold within the determination of forest recovery. Different IFZ thresholds have been applied among different past studies on forest change. For example, thresholds of 2.5⁵¹, 3^52,53, 4.5⁵⁴, 4^55,56, and 6.5 have been applied in semiarid regions²⁴, whereas many other studies do not mention the threshold used^{23,57,58,59,60}. Although a subtle detail, the IFZ threshold is of importance, particularly when working at a national scale.

The current study proposed a data-driven approach for the application of the VCT at a national scale that is more efficient. The steps of the approach are: (1) The samples of stable and secondary forests were filtered based on various conditions, including an area >4,500 m², random selection, and data conversion⁶¹ (Fig. 3a). (2) A grid with a spatial resolution of 3° was created for the entire study region, thereby overcoming the challenge of mass operation using VCT in the study area. (3) Over 100 forest samples were randomly selected to determine IFZ from the stable forest map produced above. The forest points from the new forest map for 2020 were substituted when no samples in the stable forest map existed in one grid. (4) The samples from the secondary forest map produced above were used to calculate the threshold of IFZ. A pixel was characterized as a forest pixel when the value in the VCT time series was below the threshold for two consecutive years⁵⁵.

RF for CCDC

The Random Forest (RF) algorithm for CCDC (CCDC_RF) was developed by introducing the RF model to identify breakpoints of CCDC. The entire CCDC time series was segmented into several shorter time series. These time series were used to produce multiple breakpoints related to changes in land cover. The frequent occurrence of many breakpoints over the entire period at one pixel indicated the strong sensitivity of CCDC for change detection⁶². The false identification of breakpoints by the CCDC was inevitable due to the effects of subtle disturbance, degeneration, and insects on forests at the pixel scale⁶³, and not all breaks identified by CCDC represented changes in land cover. The present study aimed to identify breakpoints that only represented afforestation or the transition of non-forest land cover to forest for detecting the establishment of forest stands.

The large quantity of information provided by the breakpoints generated by the CCDC allowed the secondary classification of breakpoints. The coefficients of the fitting model and the variable derived from the three harmonics for each segment were used to classify land cover⁶⁴. The present study aimed to generate a CCDC_RF method in which RF⁶⁵ is used to classify and validate the breakpoints identified by CCDC. The steps used in this process included: (1) All samples were assumed to be correct when they were detected by all four basic algorithms described above at a consistent time (±1 year). False samples were identified as samples for which the results of the algorithms were not consistent with the latest CCDC breakpoints. (2) True and false samples numbering 3,850 and 3,189, respectively were used to train and test the RF for identifying CCDC breakpoints (available at https://doi.org/10.6084/m9.figshare.22224037.v2)⁶⁶ (Fig. 3b). The RF was used to train the classifier model and the latest CCDC breakpoint was used to classify the breakpoints for the entire time series. The maximum number of segments in the CCDC was set to 6, which is sufficient to represent changes in land cover in the time series⁶⁷.

Ensemble rule

Two rules were used to construct the ensembles. In the first rule, each ensemble was constructed using stacked generalization in which the ahead result was masked from the result of the backward algorithm. For example, the VCT + MACD presents that the MACD’s results were the baseline, and the change results from VCT were then kept in the pixels where MACD detected no changes. This rule was applied to the VCT + MACD, CCDC_RF_OLB + VCT, VCT + CCDC_RF_OLB, and VCT + CCDC_RF_ALB ensembles as well as in VCT + CCDC_RF_ALB + 2 out of 4 (VCR2) ensembles. The ensemble models used in the present study were designed according to the results using individual algorithms for the Detection of establishment times of secondary forest stands. In the remaining 2 out of 4, a forest pixel was needed by at least 2 of the 4 basic algorithms. Within CCDC_RF_OLB, only the latest break was used in the CCDC_RF. In addition, CCDC_RF_ALB was constructed through secondary classification for all breakpoints in the time series.

Accuracy assessment

The 2,072 and 3,000 samples of secondary and stable forests, respectively were determined using a labor-intensive exercise. The confusion matrix was obtained based on the validation samples, with the quantitative metrics calculated for each basic algorithm and ensemble, including the overall accuracy (OA), producer’s accuracy (PA), and user’s accuracy (UA). The correct rate (CR) was also used to assess the results of different methods. CR was calculated as the number of correct examples divided by the total detected examples in each class. Figure 4 shows a schematic representing the processes followed in the current study for mapping SFA for China.

Comparison with Statistical data

The statistical data of planted forest area and natural forest area was used for comparison with the area of secondary forest and stable forest produced in this study. The statistical afforestation data of China in 2018 was obtained from the China Forest Resources Report (2014–2018) (available at http://www.forestry.gov.cn/). Statistical areas of planted forests would be as the reference data in Chinese provinces with the hypothesis that the planted forest investigated was mainly planted from 1987 to 2018. TW was not included in the provinces investigated in the National Forest Resources Report. In addition, we obtained data on the recovery of forest cover in China for each decade from 1990 to 2020 from FAO⁶⁸. The increase in China’s forest cover from FAO arises from two elements-planted forest area and naturally regenerated forests, consistent with the forest definition detected in this study.

Comparison with other maps

The SFA_CCI⁶⁹, SFA_MODIS⁷⁰, SFA_CLCD⁷¹, and global map of planting years of plantations (GPYP)⁷² were used for comparison with the secondary forest age for China (SFAC)²⁹. Since not many SFA products can be compared to SFAC²⁹, the SFA data used within the comparison were derived from continuous land cover products, for example, ESA-CCI, MCD12Q1, CLCD(Table 2), and GPYP⁷². The global map of planting years of plantations (GPYP) can be downloaded on the figshare (https://doi.org/10.6084/m9.figshare.19070084.v1)⁷² GPYP is in a GeoTIFF format with the 30-meter spatial resolution by recording gridded species types and planting years of global plantations. The Global 1 km forest age datasets (SFA_BGI) can be obtained at https://doi.org/10.17871/ForestAgeBGI.2021⁷³. This SFA_BGI provides an ensemble of global estimation of 1 km global forest age in 2010 using forest inventories, biomass, and climate data.

Table 2 Land cover datasets of forest age used for inter-comparison.

Full size table

SFA_BGI from 1990 to 2010 was also compiled for comparison with our results. The presented study counted the surface fraction heat plots of the secondary forest according to the age period and at the same spatial resolution of the corresponding reference SFA data in 0.05° spatial grids (Table 2). This approach allowed inconsistencies in spatial and temporal scales to be avoided.

Data Records

The SFAC dataset produced in the current study can be freely downloaded from figshare (https://doi.org/10.6084/m9.figshare.21792557.v2)²⁹. The dataset produced in 2022 represents forest age for China in 2020. The data includes 20 files named ‘’sfa_china_2020” with tiff format in a zip. Values from 1 to 34 in the “Age” band represent the age of the forest, where values of 36 and 0 indicate a forest age >34 (not a specific pixel-scale age) and non-forest, respectively. At the same time, the age of 34 to 1 represents the year of forest regrowth ranging from 1987 to 2020. The spatial extent of the dataset includes mainland China and Taiwan but excludes the South China Sea islands. The map is defined in the WGS84 projection and has a 30-m spatial resolution.

The external data used in this paper included the forest map and validation datasets. The new forest base map in 2020 used in our study is available at https://doi.org/10.6084/m9.figshare.22223854.v1³³. The stable and secondary forest validation samples can also be obtained from https://doi.org/10.6084/m9.figshare.22223911.v1⁴¹. The stable and secondary forest samples used for the calculation of VCT are available at https://doi.org/10.6084/m9.figshare.22223956.v1⁶¹. The training and test data for CCDC can be accessed at https://doi.org/10.6084/m9.figshare.22224037.v2⁶⁶.

The three SFA datasets derived from the Moderate Resolution Imaging Spectroradiometer (MODIS), ESA_CCI-LC, and CLCD products for inter-comparison data can be viewed at respectively: (SFA_MODIS: https://doi.org/10.6084/m9.figshare.22225969.v1)⁷⁰, (SFA_CCI: https://doi.org/10.6084/m9.figshare.22225993.v1)⁶⁹, (SFA_CLCD: https://doi.org/10.6084/m9.figshare.22225930.v1)⁷¹. We provided more access online from GEE in Supplementary DataRecords.

Technical Validation

Accuracy assessment

The results of the accuracy assessment showed that using the ensemble method improved accuracies (Table 3). Among the individual algorithms, VCT showed the best performance, with a PA of 72.13%, UA of 49.71%, OA of 71.61%, and mean CR of 75.51%. In contrast, the CCDC achieved the worst results, with a PA of 60.18%, UA of 48.79%, OA of 65.89%, and the lowest mean CR of 63.53%. The LT achieved asymmetric results, with the highest PA of 89.17%, the lowest UA of 20.27%, an OA of 66.42%, and a mean CR of 64.85%. The MACD provided results of intermediate accuracy, with a PA of 57.14%, UA of 63.9%, OA of 65.67%, and mean CR of 80.82%. The UA and PA results of the VCT and CCDC_RF_ALB algorithms were more balanced compared to those for the other single algorithms. The accuracy assessment results for single algorithms suggested that some ensembles were created based on stacked generalization or aggregation.

Table 3 The results of the accuracy assessment of each individual and ensemble algorithm.

Full size table

The performances of the ensemble models exceeded those of the individual algorithms. The VCT + CCDC_RF_ALB and CCDC_RF_ALB + VCT ensemble algorithms were constructed in a different order and achieved higher performance than their respective single algorithms. Ensemble algorithm 3 of 4 achieved the highest PA and CR for secondary forests of 95.12% and 97.85%, respectively, and the lowest OA of 13.18%. Ensemble algorithm 2 of 4 produced a PA, UA, OA, and mean CR of 81.53%, 49.86%, 74.90%, and 84.42%, respectively. The proposed ensemble of VCT + CCDC_RF_ALB + 2 of 4 (VCR2) obtained a balanced PA of 68.26%, UA of 66.12%, OA of 73.60%, and mean CR of 82.96% (Table 3). Among the assessed ensemble models, the present study used the superior VCR2 to determine the establishment times of secondary forest stands.

Comparisons with Statistical data

The secondary forest and stable forest produced in this study show a good consistency compared with the statistical data although these data originated from a different standard. There was less difference between the secondary forest area and the statistical area of planted forest in AH, FJ, GD, GX, GX, HLJ, JX, SNX, and ZJ provinces, etc, while a large difference existed in CQ, HB, HeN, IM, LN, SD provinces, etc (Fig. 5a). The secondary forest area with an area of 6.53 × 10⁷ ha had a slight underestimation compared with the statistical area of planted forest with an area of 7.96 × 10⁷ in China. On the other hand, a large difference did not exist in provinces between stable forest area and statistical natural forest area, excepting the HuN, IM, Tibet, and YN provinces (Fig. 5c). The good results were found in the correlations that R² = 0.6 (Fig. 5b), and R² = 0.71 (Fig. 5d), respectively, demonstrating the reliability of our results too. This study indicated that the forest has increased by 2.039 × 10⁷ ha, 1.928 × 10⁷ ha, and 1.978 × 10⁷ ha in 1990–2000, 2000–2010, and 2010–2020, respectively (Table 4, Supplementary Fig. 2). There was only a 5.25% difference in the total area of secondary forest in SFAC from 1990 to 2020 compared to that in FAO (Table 4). Other results, especially the SFA_BGI data, have a big difference compared with the FAO data.

Table 4 Estimated area of secondary forest for each period in this study (SFAC)²⁹, Food and Agriculture Organization of the United Nations (FAO)⁶⁸, SFA_CCI⁶⁹, SFA_MODIS⁷⁰, and global map of planting years of plantations (GPYP)⁷², SFA_BGI⁷³.

Full size table

Comparisons with other maps

The maps produced in the current study showed a positive relationship between secondary forest age for China (SFAC)²⁹ and reference datasets, although there were some large differences. It should be considered that none of the four inter-comparison products chosen in the present study can be considered reflective of reality as these data were not created specifically for SFA. As shown in Fig. 6, the relationships between SFAC²⁹ and SFA_CCI⁶⁹, SFA_MODIS⁷⁰, SFA_CLCD⁷¹, and GPYP⁷² achieved R² values of 0.382, 0.233, 0.457, and 0.408, respectively. The SFAC²⁹ indicated underestimation within all reference SFA data. The comparison between SFAC²⁹ and SFA_CLCD⁷¹ showed higher consistency, whereas that between SFAC²⁹ and the two datasets with lower spatial resolutions, SFA_CCI⁶⁹ and SFA_MODIS⁷⁰, showed lower consistency. Some products’ accuracy, spatial resolution, time domain, and methods heavily influenced the low relationships between our results and other derived SFA maps. The results indicated that the non-thematic data heavily underestimated the SFA distribution compared with our results. Our estimated area of secondary forest was closest to the statistics from FAO compared with other SFA data.

The present study further identified spatial differences between these datasets among the three sub-regions presenting dense secondary forests in northeastern, southeastern, and southwestern China (Fig. 7). Various spatial differences were identified when comparing SFAC²⁹ to the other SFA maps at a fine spatial scale. Consistency in spatial patterns of secondary forests was identified with the comparison between SFAC²⁹ and SFA_MODIS⁷⁰ in regions A and B, as well as between SFAC²⁹ and SFA_CLCD⁷¹ in regions A and C. The SFAC²⁹ provided more detailed long-term descriptions of secondary forests at a 30-m resolution compared to that provided by the low-resolution SFA_CCI⁶⁹ and SFA_MODIS⁷⁰. In addition, despite their higher spatial resolution of 30 m and extended age span datasets, SFA_CLCD⁷¹ and GPYP⁷² underestimated the secondary forest cover. The imperfections in SFA_CCI and SFA_MODIS stem from the low accuracy and spatial resolution of the ESA_CCI and MODIS products. The low comparison consistency in SFA_CLCD and GPYP with SFAC originated from the classification in CLCD and the only used LT algorithm in GPYP. The results showed that the secondary forest identified by SFAC²⁹ covered virtually all areas of secondary forest identified in the four reference SFA maps.

Usage Notes

The national-scale 30-m SFAC²⁹ product provides SFA for China over the past 34 years which previous studies have not been able to achieve. This SFAC²⁹ dataset can potentially provide information to support forest ecosystem research, including forest biomass, forest carbon sequestration, and forest dynamics.

The variable thresholds of data-driven VCT

As shown in Fig. 8, the results of the present study showed a heterogeneous IFZ threshold pattern across mainland China and Taiwan. This result could be attributed to region-specific differences in forest ecosystems due to different geographical and climatic conditions. The relatively large and variable IFZ thresholds of Xin Jiang and Tibet could be mainly attributed to their extremely cold climates. The IFZ threshold of western China reached a maximum of 11.4, indicating the establishment of forest stands at a IFZ that was lower than the threshold at the pixel scale. One major reason for the above results is the sparse coverage of unique forest species in the extremely cold regions, such as Picea asperata and Abies fabri, whereas the lower IFZ threshold in eastern and southern China can be attributed to fast-growing dense forests in the subtropical climate zone.

As expected, the mean IFZ threshold decreased in intervals along the longitude 75° to 135° E, indicating a strong negative correlation (r = −0.75, p < 0.001) (Fig. 8b). However, the trend in mean latitudinal threshold differed from that of the longitudinal threshold. There was an increasing trend in the IFZ threshold along latitude 16.82° to 37.82° N. This result could be attributed to the dense forest cover at low latitudes, with a strong positive correlation (r = 0.93, p < 0.01). On the other hand, Fig. 8c shows a decreasing trend in the IFZ threshold along the latitude 37.82° to 52.82° N with a negative correlation (r = −0.89, p < 0.05). This result can be attributed to the thick forest in the Greater Khingan Mountains, an important forest reserve in northeast China. The variations in the IFZ threshold across the entire study area demonstrated stand establishment of different grades at the growth stage in every interval.

The performance of the improved CCDC_RF

The CCDC_RF achieved a higher performance in monitoring the establishment of forest stands compared to the CCDC algorithm. The CCDC_RF_OLB achieved an OA, UA, PA, and mean CR of 71.27%, 38.08%, 81.93%, and 69.99%, respectively, exceeding the performance of the single CCDC without the RF model. The CCDC_RF_ALB achieved the best single algorithm performance among the assessed algorithms with an OA of 72.12%, UA of 41.17%, PA of 81.39%, and mean CR of 69.56%. The addition of all breakpoints resulted in obvious improvements in the PA and UA. However, the mean CR achieved by the CCDC_RF_ALB was lower than that of VCT and CCDC_RF_OLB. As shown in Supplementary Fig. 5, the spatial distribution of results produced by the CCDC_RF was similar to that of the original CCDC. However, careful observation revealed that secondary forests shown in Supplementary Fig. 5b had lower coverage than those shown in Supplementary Fig. 5a. This result can be attributed to the retention of true secondary forest due to the removal of false breakpoint information by the RF model.

Advantages and limitations

The ensemble approach for mapping SFA proposed in the current study produced a distribution of SFAC²⁹ that could not be obtained by any single direct or indirect mapping method (Fig. 7). Various past studies have successfully identified the spatial distributions of secondary forest, mature forest, and non-forest land cover using various classification schemes^21,74,75,76. While patch size has been shown to be an important indicator influencing classification accuracy⁷⁷, the present study is the first to detect patch size based on a time-series approach⁷⁸, thereby explaining the higher area of secondary forest identified in the present study compared to that in other SFA datasets (land cover from classification). On the other hand, the change detection algorithms showed major differences within their use in a time-series approach due to their basic logic, the density of observation data, and the designed target¹⁹, thereby limiting their use in detection of forest dynamics⁷⁹. The method proposed in the present study provides an improved forest coverage output compared to that provided by single algorithms.

In comparison to common applications using the traditional VCT and CCDC algorithms, the present study developed novel data-driven approaches for VCT and CCDC and applied the RF model to the outputs of the CCDC. The proposed approach showed improved results (Fig. 7, Supplementary Fig. 5). In particular, the results of the CCDC_RF confirmed that it is unreasonable to directly use the outputs of the CCDC for determining secondary forest stands (Table 3). The results of CCDC_RF indicated that it decreased omission and commission errors through application of secondary classification based on many samples defined from the four algorithms. At the same time, the coef_INTP, RMSE, MAG, and coef_SIN variables in outputs of CCDC and topographical factors contributed greatly to the result obtained for CCDC_RF (Fig. 9). CCDC_RF obtained accuracy and Kappa values of 0.98, and 0.96, respectively based on validation against 2,166 samples, higher than the assessment based on validation samples. Theoretically, the CCDC_RF map should provide results that are an improvement over the assessment (Table 3) under the assumption that the whole samples co-defined by the four algorithms for output classification of the CCDC were correct. The discrepancy in assessment can be attributed to errors in the validation samples, despite being manually assessed. Thus, it can be argued that the SFAC²⁹ map produced using the optimal ensemble had a higher accuracy with an OA of 73.60% and a mean CR of 82.96% (Table 3).

The PA, UA, and OA from the final ensemble were lower than the accuracy assessment of common classification results. Not only the two types, secondary forests, and stable forests were present, but also the stand establishment time of secondary forests in the temporal domain was determined in this study. Therefore, the CR was also used in this study to show the accuracy of the maps. Actually, the accuracies of secondary and stable forest in the final map were 76.71% and 89.20%. This SFAC product provides a finer description of spatial and temporal domains compared with other maps related to forest age. Overall, our result was better and more reliable now from the validation and comparison based on validation examples, statistical data, and other products. However, the age estimation of the old forest needs further exploration in future work.

Code availability

The codes used in data generation and processing are in two formats, JavaScript used in GEE and Python. The codes are available in GitHub at: (https://github.com/Zhangshaoy/SFAC.git). Each repository includes a guide for the use of the codes. An online visualization map using the GEE experimental app is also provided: (https://zsy11600.users.earthengine.app/view/sfac).

References

Fujiki, S., Okada, K., Nishio, S. & Kitayama, K. Estimation of the stand ages of tropical secondary forests after shifting cultivation based on the combination of WorldView-2 and time-series Landsat images. ISPRS Journal of Photogrammetry and Remote Sensing 119, 280–293 (2016).
Article ADS Google Scholar
Fischer, J., Lindenmayer, D. B. & Manning, A. D. Biodiversity, Ecosystem Function, and Resilience: Ten Guiding Principles for Commodity Production Landscapes. Frontiers in Ecology and the Environment 4, 80–86 (2006).
Article Google Scholar
Trejo, I. & Dirzo, R. Deforestation of seasonally dry tropical forest: a national and local analysis in Mexico. Biological Conservation 94, 133–142 (2000).
Article Google Scholar
Wang, D., Sun, G. & Guo, Z. A case study on estimating natural forest age with DBH distribution and forest growth model. Ecology and Evironment 17, 1999–2003 (2008).
Google Scholar
Song, C. & Woodcock, C. E. A regional forest ecosystem carbon budget model: Impacts of forest age structure and landuse history. Ecological Modelling 164, 33–47 (2003).
Article CAS Google Scholar
Abbasi, A. O. et al. Spatial database of planted forests in East Asia. Sci Data 10, 480 (2023).
Article PubMed PubMed Central Google Scholar
Zhang, Y., Yao, Y., Wang, X., Liu, Y. & Piao, S. Mapping spatial distribution of forest age in China. Earth and Space Science 4, 108–116 (2017).
Article ADS Google Scholar
Chambers, J. Q. et al. Regional ecosystem structure and function: ecological insights from remote sensing of tropical forests. TREENS in Ecology and Evolution 22, 411–423 (2007).
Google Scholar
Chen, B. et al. Identifying Establishment Year and Pre-Conversion Land Cover of Rubber Plantations on Hainan Island, China Using Landsat Data during 1987–2015. Remote Sensing 10, 1240 (2018).
Article ADS Google Scholar
Silva Junior, C. H. L. et al. Benchmark maps of 33 years of secondary forest age for Brazil. Sci Data 7, 269 (2020).
Article PubMed PubMed Central Google Scholar
Dibs, H., Idrees, M. O. & Alsalhin, G. B. A. Hierarchical classification approach for mapping rubber tree growth using per-pixel and object-oriented classifiers with SPOT-5 imagery. Egyptian Journal of Remote Sensing and Space Science 20, 21–30 (2017).
Article Google Scholar
Rizeei, H. M., Shafri, H. Z. M., Mohamoud, M. A., Pradhan, B. & Kalantar, B. Oil palm counting and age estimation from WorldView-3 imagery and LiDAR data using an integrated OBIA height model and regression analysis. Journal of Sensors 2018, 1–13 (2018).
Article Google Scholar
Shimada, M. et al. New global forest/non-forest maps from ALOS PALSAR data (2007–2010). Remote Sensing of Environment 155, 13–31 (2014).
Article ADS Google Scholar
Chen, B. et al. Estimation of rubber stand age in typhoon and chilling injury afflicted area with Landsat TM data: A case study in Hainan Island, China. Forest Ecology and Management 274, 222–230 (2012).
Article ADS Google Scholar
Li, Z. & Fox, J. M. Mapping rubber tree growth in mainland Southeast Asia using time-series MODIS 250 m NDVI and statistical data. Applied Geography 32, 420–432 (2012).
Article Google Scholar
Xiao, C., Li, P. & Feng, Z. A renormalized modified normalized burn ratio (RMNBR) index for detecting mature rubber plantations with Landsat-8 OLI in Xishuangbanna, China. Remote Sensing Letters 10, 214–223 (2019).
Article Google Scholar
Aakala, T. Forest fire histories and tree age structures in Värriö and Maltio Strict Nature Reserves, northern Finland. Boreal Env. Res 23, 209–219 (2018).
Google Scholar
Chen, Y., Runping, S., Dawei, Y. U., Ronggao, L. I. U. & Jingming, C. Forest disturbance monitoring based on the time- series trajectory of remote sensing index. Journal of Remote Sensing 4619, 1246–1263 (2013).
Google Scholar
Zhu, Z. Change detection using landsat time series: A review of frequencies, preprocessing, algorithms, and applications. ISPRS Journal of Photogrammetry and Remote Sensing 130, 370–384 (2017).
Article ADS Google Scholar
Yang, X. et al. Forest age mapping based on multiple-resource remote sensing data. Environ Monit Assess 192, 734 (2020).
Article PubMed Google Scholar
Carreiras, J. M. B., Jones, J., Lucas, R. M. & Shimabukuro, Y. E. Mapping major land cover types and retrieving the age of secondary forests in the Brazilian Amazon by combining single-date optical and radar remote sensing data. Remote Sensing of Environment 194, 16–32 (2017).
Article ADS Google Scholar
Chen, D., Loboda, T. V., Krylov, A. & Potapov, P. V. Mapping stand age dynamics of the Siberian larch forests from recent Landsat observations. Remote Sensing of Environment 187, 320–331 (2016).
Article ADS Google Scholar
Diao, J. et al. Use of vegetation change tracker, spatial analysis, and random forest regression to assess the evolution of plantation stand age in Southeast China. Annals of Forest Science 77, 27 (2020).
Article Google Scholar
Huang, C. et al. An automated approach for reconstructing recent forest disturbance history using dense Landsat time series stacks. Remote Sensing of Environment 114, 183–198 (2010).
Article ADS Google Scholar
Thomas, N. E. et al. Validation of North American Forest Disturbance dynamics derived from Landsat time series stacks. Remote Sensing of Environment 115, 19–32 (2011).
Article ADS Google Scholar
Zhu, Z. & Woodcock, C. E. Continuous change detection and classification of land cover using all available Landsat data. Remote Sensing of Environment 144, 152–171 (2014).
Article ADS Google Scholar
DeVries, B. et al. Tracking disturbance-regrowth dynamics in tropical forests using structural change detection and Landsat time series. Remote Sensing of Environment 169, 320–334 (2015).
Article ADS Google Scholar
Awty-Carroll, K., Bunting, P., Hardy, A. & Bell, G. An Evaluation and Comparison of Four Dense Time Series Change Detection Methods Using Simulated Data. Remote Sensing 11, 2779 (2019).
Article ADS Google Scholar
Zhang, S. et al. Mapping of secondary forest age in China using stacked generalization and Landsat time series. figshare https://doi.org/10.6084/m9.figshare.21792557.v2 (2023).
Gorelick, N. et al. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sensing of Environment 202, 18–27 (2017).
Article ADS Google Scholar
Zhu, Z. & Woodcock, C. E. Object-based cloud and cloud shadow detection in Landsat imagery. Remote Sensing of Environment 118, 83–94 (2012).
Article ADS Google Scholar
White, J. C. et al. Pixel-Based Image Compositing for Large-Area Dense Time Series Applications and Science. Canadian Journal of Remote Sensing 40, 192–212 (2014).
Article ADS Google Scholar
Zhang, S. et al. New Forest Map 2020 in China. figshare https://doi.org/10.6084/m9.figshare.22223854.v1 (2023).
Zanaga, D. et al. ESA WorldCover 10 m 2020 v100. Zenodo https://doi.org/10.5281/zenodo.5571936 (2021).
Karra, K. et al. Global land use / land cover with Sentinel 2 and deep learning. 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS pp, 4704–4707 (2021).
Chen, J., Ban, Y. & Li, S. China: Open access to Earth land-cover mapGlobeLand30. Nature 514, 434–434 (2014).
Article ADS CAS Google Scholar
Chen, J. et al. Global land cover mapping at 30m resolution: A POK-based operational approach. ISPRS Journal of Photogrammetry and Remote Sensing 103, 7–27 (2015).
Article ADS Google Scholar
Zhang, S. et al. Mapping the Age of Subtropical Secondary Forest Using Dense Landsat Time Series Data: An Ensemble Model. Remote Sensing 15, 2067 (2023).
Article ADS Google Scholar
Shang, R. et al. China’s current forest age structure will lead to weakened carbon sinks in the near future. The Innovation 4, 100515 (2023).
Article PubMed PubMed Central Google Scholar
Bontemps, S. et al. Consistent Global Land Cover Maps For Climate Modelling Communities: Current Achievements Of The ESA’ Land Cover CCI. Proceedings of the ESA living planet symposium, Edimburgh 13, 9–13 (2013).
Google Scholar
Zhang, S. et al. Validation Samples for SFAC. figshare https://doi.org/10.6084/m9.figshare.22223911.v1 (2023).
Keenan, R. J. et al. Dynamics of global forest area: Results from the FAO Global Forest Resources Assessment 2015. Forest Ecology and Management 352, 9–20 (2015).
Article Google Scholar
Danylo, O. et al. A map of the extent and year of detection of oil palm plantations in Indonesia, Malaysia and Thailand. Sci Data 8, 96 (2021).
Article PubMed PubMed Central Google Scholar
Kennedy, R. E., Yang, Z. & Cohen, W. B. Detecting trends in forest disturbance and recovery using yearly Landsat time series: 1. LandTrendr - Temporal segmentation algorithms. Remote Sensing of Environment 114, 2897–2910 (2010).
Article ADS Google Scholar
Li, D., Ju, W., Fan, W. & Gu, Z. Estimating the age of deciduous forests in northeast China with Enhanced Thematic Mapper Plus data acquired in different phenological seasons. Journal of Applied Remote Sensing 8, 083670 (2014).
Article ADS Google Scholar
Xiao, C., Li, P., Feng, Z. & Liu, X. An updated delineation of stand ages of deciduous rubber plantations during 1987-2018 using Landsat-derived bi-temporal thresholds method in an anti-chronological strategy. International Journal of Applied Earth Observation and Geoinformation 76, 40–50 (2019).
Article ADS Google Scholar
Beckschäfer, P. Obtaining rubber plantation age information from very dense Landsat TM & ETM + time series data and pixel-based image compositing. Remote Sensing of Environment 196, 89–100 (2017).
Article ADS Google Scholar
Kennedy, R. E. et al. Spatial and temporal patterns of forest disturbance and regrowth within the area of the Northwest Forest Plan. Remote Sensing of Environment 122, 117–133 (2012).
Article ADS Google Scholar
Huang, C. et al. Automated masking of cloud and cloud shadow for forest change analysis using Landsat images. International Journal of Remote Sensing 31, 5449–5464 (2010).
Article ADS Google Scholar
Fagan, M. E. et al. Mapping pine plantations in the southeastern U.S. using structural, spectral, and temporal remote sensing data. Remote Sensing of Environment 216, 415–426 (2018).
Article ADS Google Scholar
Liu, L. et al. Mapping afforestation and deforestation from 1974 to 2012 using Landsat time-series stacks in Yulin District, a key region of the Three-North Shelter region, China. Environmental monitoring and assessment 185, 9949–9965 (2013).
Article PubMed Google Scholar
Zhang, Y., Shen, W., Li, M. & Lv, Y. Integrating landsat time series observations and corona images to characterize forest change patterns in a mining region of nanjing, eastern china from 1967 to 2019. Remote Sensing 12, 1–21 (2020).
Google Scholar
Fang, L., Yang, J., Zhang, W., Zhang, W. & Yan, Q. Combining allometry and landsat-derived disturbance history to estimate tree biomass in subtropical planted forests. Remote Sensing of Environment 235, 111423 (2019).
Article Google Scholar
Hu, S. Y., Pang, Y., Meng, S. L. & Yue, C. R. Annual Forest Disturbance Detection Using Time Series Landsat 8 OLI Data. Forest Research 33, 65–72 (2020).
Google Scholar
Zhihao, Z., Wanli, B. & Tao, J. An Improved VCT Long Time Series Forest Disturbance Method. Journal of Geomatics Science and Technology 38, 38–43 (2021).
Google Scholar
Novo-Fernández, A. et al. Landsat time series analysis for temperate forest cover change detection in the Sierra Madre Occidental, Durango, Mexico. International Journal of Applied Earth Observation and Geoinformation 73, 230–244 (2018).
Article ADS Google Scholar
Shen, W., Li, M. & Wei, A. Spatio-temporal variations in plantation forests’ disturbance and recovery of northern Guangdong Province using yearly Landsat time series observations (1986–2015). Chinese Geographical Science 27, 600–613 (2017).
Article Google Scholar
Chen, Y., Yu, G., Zhao, T., Xiao, M. & Yao, W. Assessing the river habitat suitability and effects of introduction of exotic fish species based on anecohydraulic model system. Ecological Informatics 45, 59–69 (2018).
Article Google Scholar
Masek, J. G. et al. United States Forest Disturbance Trends Observed Using Landsat Time Series. Ecosystems 16, 1087–1104 (2013).
Article Google Scholar
Qiu, J. et al. Quantifying Forest Fire and Post-Fire Vegetation Recovery in the Daxin’anling Area of Northeastern China Using Landsat Time-Series Data and Machine Learning. Remote Sensing 13, 792 (2021).
Article ADS Google Scholar
Zhang, S. et al. Secondary forest samples for VCT. figshare https://doi.org/10.6084/m9.figshare.22223956.v1 (2023).
Xie, S., Liu, L., Zhang, X. & Yang, J. Mapping the annual dynamics of land cover in Beijing from 2001 to 2020 using Landsat dense time series stack. ISPRS Journal of Photogrammetry and Remote Sensing 185, 201–218 (2022).
Article ADS Google Scholar
Zhang, Y. et al. Mapping causal agents of disturbance in boreal and arctic ecosystems of North America using time series of Landsat data. Remote Sensing of Environment 272, 112935 (2022).
Article Google Scholar
Pasquarella, V. J., Holden, C. E. & Woodcock, C. E. Improved mapping of forest type using spectral-temporal Landsat features. Remote Sensing of Environment 210, 193–207 (2018).
Article ADS Google Scholar
Breiman, L. Random Forests. Machine Learning 45, 5–32 (2001).
Article Google Scholar
Zhang, S. et al. Samples for ccdc. figshare https://doi.org/10.6084/m9.figshare.22224037.v2 (2023).
Arévalo, P., Bullock, E. L., Woodcock, C. E. & Olofsson, P. A Suite of Tools for Continuous Land Change Monitoring in Google Earth Engine. Frontiers in Climate 2, 576–740 (2020).
Article Google Scholar
FAO. Global Forest Resources Assessment 2020: Main report. Rome. https://doi.org/10.4060/ca9825en (2020).
Zhang, S. et al. SFA_CCI_CHINA. figshare https://doi.org/10.6084/m9.figshare.22225993.v1 (2023).
Zhang, S. et al. SFA_MODIS_CHINA. figshare https://doi.org/10.6084/m9.figshare.22225969.v1 (2023).
Zhang, S. et al. SFA_CLCD_CHINA. figshare https://doi.org/10.6084/m9.figshare.22225930.v1 (2023).
Du, Z. et al. A global map of planting years of plantations. Sci Data 9, 141 (2022).
Article PubMed PubMed Central Google Scholar
Besnard, S. et al. Mapping global forest age from forest inventories, biomass and climate data. Earth System Science Data 13, 1–22 (2021).
Article Google Scholar
Kimes, D. S., Nelson, R. F., Salas, W. A. & Skole, D. L. Mapping secondary tropical forest and forest age from SPOT HRV data. International Journal of Remote Sensing 20, 3625–3640 (1999).
Article ADS Google Scholar
Nelson, R. F., Kimes, D. S., Salas, W. A. & Routhier, M. Secondary Forest Age and Tropical Forest Biomass Estimation Using Thematic Mapper Imagery. BioScience 50, 419 (2000).
Article Google Scholar
Zhang, Q. et al. Deriving stand age distribution in boreal forests using SPOT VEGETATION and NOAA AVHRR imagery. Remote Sensing of Environment 91, 405–418 (2004).
Article ADS Google Scholar
Lechner, A. M., Stein, A., Jones, S. D. & Ferwerda, J. G. Remote sensing of small and linear features: Quantifying the effects of patch size and length, grid position and detectability on land cover mapping. Remote Sensing of Environment 113, 2194–2204 (2009).
Article ADS Google Scholar
Carreiras, J. M. B., Jones, J., Lucas, R. M. & Gabriel, C. Land Use and Land Cover Change Dynamics across the Brazilian Amazon: Insights from Extensive Time-Series Analysis of Remote Sensing Data. PLoS ONE 9, e104144 (2014).
Article ADS PubMed PubMed Central Google Scholar
Healey, S. P. et al. Mapping forest change using stacked generalization: An ensemble approach. Remote Sensing of Environment 204, 717–728 (2018).
Article ADS Google Scholar
Xiaochong, X., Bingjie, L., Xiaoping, X., Xia, L. & Qian, S. Mapping annual global land cover changes at a 30m resolution from 2000 to 2015. National Remote Sensing Bulletin 25, 1896–1916 (2021).
Google Scholar
Liangyun, L., Xiao, Z., Xidong, C., Yuan, G. & Jun, M. GLC_FCS30-2020: Global Land Cover with Fine Classification System at 30m in 2020 (v1.2) [Dataset]. Zenodo, https://doi.org/10.5281/zenodo.4280923 (2020).
Liu, J. et al. Spatio-temporal patterns and characteristics of land-use change in China during 2010-2015. Journal of Geographical Sciences 73, 789–802 (2018).
Google Scholar
Liu, J. et al. Spatial and temporal patterns of China’s cropland during 1990–2000: An analysis based on Landsat TM data. Remote Sensing of Environment 98, 442–456 (2005).
Article ADS Google Scholar
Sexton, J. O. et al. Global, 30-m resolution continuous fields of tree cover: Landsat-based rescaling of MODIS vegetation continuous fields with lidar-based estimates of error. International Journal of Digital Earth 6, 427–448 (2013).
Article ADS Google Scholar
Sulla-Menashe, D. & Friedl, M. A. User Guide to Collection 6 MODIS Land Cover (MCD12Q1 and MCD12C1) Product. Usgs: Reston Va, 18 (2018).
Yang, J. & Huang, X. The 30 m annual land cover dataset and its dynamics in China from 1990 to 2019. Earth System Science Data 13, 3907–3925 (2021).

Download references

Acknowledgements

The research was supported by the National Natural Science Foundation of China (No. 41867012), and the Water Conservancy Science and Technology Project of Jiangxi Province, China (No. 202124ZDKT25).

Author information

Authors and Affiliations

Key Laboratory of Poyang Lake Wetland and Watershed Research (Ministry of Education), School of Geography and Environment, Jiangxi Normal University, Nanchang, 330022, China
Shaoyu Zhang, Shuhua Qi, Bisong Hu, Min Huang & Jin Luo
School of Geography, Nanjing Normal University, Nanjing, 210023, China
Hanzeyu Xu
Land Satellite Remote Sensing Application Center, Ministry of Natural Resources, Beijing, 10048, China
Aixia Liu

Authors

Shaoyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hanzeyu Xu
View author publications
You can also search for this author in PubMed Google Scholar
Aixia Liu
View author publications
You can also search for this author in PubMed Google Scholar
Shuhua Qi
View author publications
You can also search for this author in PubMed Google Scholar
Bisong Hu
View author publications
You can also search for this author in PubMed Google Scholar
Min Huang
View author publications
You can also search for this author in PubMed Google Scholar
Jin Luo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.H.Q. proposed and conceived the topic of this study. S.Y.Z. was the primary author and Performer in the experiments. X.H.Z.Y. provided help in the data processing. A.X.L., B.S.H., J.L., and M.H. provided instructions on writing articles.

Corresponding author

Correspondence to Shuhua Qi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, S., Xu, H., Liu, A. et al. Mapping of secondary forest age in China using stacked generalization and Landsat time series. Sci Data 11, 302 (2024). https://doi.org/10.1038/s41597-024-03133-2

Download citation

Received: 13 January 2023
Accepted: 11 March 2024
Published: 16 March 2024
DOI: https://doi.org/10.1038/s41597-024-03133-2

Subjects

Abstract

Similar content being viewed by others

Ghost roads and the destruction of Asia-Pacific tropical forests

FSC-certified forest management benefits large mammals compared to non-FSC

Hotspots of biogeochemical activity linked to aridity and plant traits across global drylands

Background & Summary

Methods

Data processing

Landsat time series data

New reference forest map

Candidate stable and secondary forest maps

Validation Samples

Detection of establishment times of secondary forest stands

Data-driven VCT

RF for CCDC

Ensemble rule

Accuracy assessment

Comparison with Statistical data

Comparison with other maps

Data Records

Technical Validation

Accuracy assessment

Comparisons with Statistical data

Comparisons with other maps

Usage Notes

The variable thresholds of data-driven VCT

The performance of the improved CCDC_RF

Advantages and limitations

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplemental Information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links