First comprehensive quantification of annual land use/cover from 1990 to 2020 across mainland Vietnam

Phan, Duong Cao; Trung, Ta Hoang; Truong, Van Thinh; Sasagawa, Taiga; Vu, Thuy Phuong Thi; Bui, Dieu Tien; Hayashi, Masato; Tadono, Takeo; Nasahara, Kenlo Nishida

doi:10.1038/s41598-021-89034-5

Download PDF

Article
Open access
Published: 11 May 2021

First comprehensive quantification of annual land use/cover from 1990 to 2020 across mainland Vietnam

Duong Cao Phan^1,2,
Ta Hoang Trung³,
Van Thinh Truong⁴,
Taiga Sasagawa¹,
Thuy Phuong Thi Vu⁵,
Dieu Tien Bui⁶,
Masato Hayashi⁷,
Takeo Tadono⁷ &
…
Kenlo Nishida Nasahara⁸

Scientific Reports volume 11, Article number: 9979 (2021) Cite this article

9427 Accesses
41 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Extensive studies have highlighted a need for frequently consistent land cover information for interdisciplinary studies. This paper proposes a comprehensive framework for the automatic production of the first Vietnam-wide annual land use/land cover (LULC) data sets (VLUCDs) from 1990 to 2020, using available remotely sensed and inventory data. Classification accuracies ranged from 85.7 ± 1.3 to 92.0 ± 1.2% with the primary dominant LULC and 77.6 ± 1.2% to 84.7 ± 1.1% with the secondary dominant LULC. This confirmed the potential of the proposed framework for systematically long-term monitoring LULC in Vietnam. Results reveal that despite slight recoveries in 2000 and 2010, the net loss of forests (19,940 km²) mainly transformed to croplands over 30 years. Meanwhile, productive croplands were converted to urban areas, which increased approximately ten times. A threefold increase in aquaculture was a major driver of the wetland loss (1914 km²). The spatial–temporal changes varied, but the most dynamic regions were the western north, the southern centre, and the south. These findings can provide evidence-based information on formulating and implementing coherent land management policies. The explicitly spatio-temporal VLUCDs can be benchmarks for global LULC validation, and utilized for a variety of applications in the research of environmental changes towards the Sustainable Development Goals.

Not just crop or forest: an integrated land cover map for agricultural and natural areas

Article Open access 26 January 2024

Annual Impervious Surface Data from 2001–2020 for West African Countries: Ghana, Togo, Benin and Nigeria

Article Open access 18 July 2024

Integrated usage of historical geospatial data and modern satellite images reveal long-term land use/cover changes in Bursa/Turkey, 1858–2020

Article Open access 31 May 2022

Introduction

Information about land use/land cover (LULC) and its dynamic changes are fundamental to a variety of studies on environmental issues¹ such as climate change², drought and flood³, and carbon emissions⁴. That is, frequently updated accurate LULC products provide policymakers with a profound understanding of the complex interplay between land use/cover change (LUCC) and its risk, which helps to inform coherent policies for the sustainable management of land resources^5,6,7.

The ready availability of remote sensing data and computing technologies opens a great era in cost-effective mapping LULC at a broad scale. Numerous algorithms have been developed to improve LULC classification, e.g. Spatial Temporal Adaptive Algorithm⁸, Automatic Land Cover Classification Method⁹, and Apply Change-vector Analysis in Posterior Probability Space¹⁰. Together with the development of these complex algorithms, special projects have been designed for large-scale land cover assessment. For example, at the 10-m spatial resolution, several attempts have been made to publish 13-category LULC maps of Europe^11,12 and global LULC maps¹³ using a great set of Sentinel MSI images. For a coarser spatial resolution (30 m), there are quality multi-category LULC products, including the National Databases of the United States^14,15,16,17, and the GlobeLand30 global product of 10-category LULC¹⁸. Nonetheless, owing to the computational restriction and the limitations of representative reference data to train and test classifiers, these products have not reflected consistently and frequently the detailed patterns and characteristics of LULC at local or national scales^19,20. Also, due to the predefined research periods and differences in the land cover classification systems (LCCSs), these products seldom meet the prime requirement of projects’ specific objectives.

Recently, a remarkable performance in cloud computing has advanced LULC observation sciences. For example, the National Aeronautics and Space Administration Earth Exchange (NASA-NEX) and Amazon Web Service (AWS) allow analysts to access and process the NASA Earth Observation (EO) data on the cloud²¹. More importantly, Google Earth Engine (GEE) provides an outstanding cloud computing platform with open access to a variety of EO data. Thanks to the potential of big data processing of these platforms, researchers have completed extensive studies at greater extent, for example, on urban change monitoring²², cultivated land mapping²³, and forest disturbance detection²⁴. Multi-category land cover products were also produced such as a 13-category land cover map of South-east Asia covering 11 nations²⁵. Although the overall accuracy of such products reaches up to 86%, the authors identify limitations regarding the insufficiency of high-quality reference data for time-series analyses²⁶. Therefore, very few studies have conducted for multi-temporal LULC mapping at a broad scale.

The quantity and quality of training data play an essential role in the production of LULC maps. Yet, collecting sufficient and precise training data requires considerable efforts, especially at large scales and multiple periods²⁷. Several attempts have been proposed methods that allow to collect cost-effectively high-quality training data. Bagan et al. (2019) extracted training data from a previous land cover map and utilized them for mapping new LULC products²⁸. While the authors applied a bi-temporal spectral measurement to decrease the bias of extracted training data, the accuracy of these data may not be ensured due to the inherent classification errors of the previous maps²⁹. To enhance the effectiveness of training data collection, Huang et al. (2020) used spectral similarity and distance indicators to detect the changed and unchanged training sites, and thus kept the unchanged ones as migrated training data²⁷. The measurement was applied for the availability of Landsat TM images. Results showed that the accuracies of the migrated training data obtained over 92.98% and the classification map which used the migrated training data had a similar overall accuracy of 71.42% to that used ground-truth data in 2010. Nevertheless, these results were validated by outdated maps, namely the ESA global CCI land cover data sets, which may contain inherent classification errors. In addition, using the Sentinel MSI images, Ghorbanian et al. (2020) employed the same approach to migrate Iran-wide training data from 2017 to 2019²⁶. The classified map that utilized the migrated training data obtained a great accuracy of 91.35%. Despite the potential of the automatic training migration method, it is still not known whether this method can be applied for multi-sensor data sources such as the different Landsat sensors or the harmonized Landsat and Sentinel images²⁷. Therefore, an exploration of the potential training migration method for multi-sensor remote sensing data is integral for a time-series assessment of multi-category LULC dynamics at a large scale.

Given the ideas, this research aim is to explore the potential training migration method for multi-sensor remote sensing data and then produce the first Vietnam-wide annual land use/cover data sets (VLUCDs) from 1990 to 2020 as a case study. In Vietnam, remotely sensed data have been utilized to produced quality LULC products, but most products cover a small area of the country or a few predefined periods^30,31,32. The previous inter-provincial LULC data sets were seven-category LULC maps for the central and southern Vietnam in 2007 and 2017, and the northern Vietnam in 2007 and 2015²⁰. More recently, Vietnam-wide maps were produced to map annual forest cover from 2015 to 2019^19,33. Despite the potential of these products, due to the primary focus on forest monitoring, the classification accuracy of non-forest LULC categories may be insufficient for other applications. Meanwhile, there has been a highly dynamic LUCC which varies among different regions in Vietnam. Despite the report of continuous net forest gain by the Ministry of Agricultural and Rural Development (MARD)³⁴, a systematically comprehensive review has reported forest loss in Vietnam^35,36. We do not know exactly the rates and patterns of changes at the nation scale. Hence, timely, accurate, and comprehensive LULC products can provide a profound understanding of LUCC patterns and processes. This information can supports policymakers in forming crucial decisions on sustainable development and resource management. The maps may be benchmarks for quantifying regional and global land cover products.

The central novelty of this paper is to propose a new framework for the automatic nationwide annual LULC monitoring and provide the results of the first VLUCDs and LUCC over the recent three decades. There are major tasks: (1) Data preparation; (2) Design a proper LCCS and reference data; (2) Proposing a consistent framework for the automatic production of the VLUCDs; (3) Creating and validating the VLUCDs, and; (4) Detecting profound changes in LULC since 1990. We developed a new random-forest-based classification approach to classify the wide availability of Landsat Thematic Mapper (TM), Enhanced Thematic Mapper Plus (ETM+) and Operational Land Imager (OLI), and Sentinel C-band Synthetic Aperture Radar Ground Range Detected (SAR GRD) and MultiSpectral Instrument (MSI) time-series images over the study period. We describe fully our implementation in the method section.

Results

The accuracy of the first VLUCDs

Utilizing ground-based data and all the freely available remotely sensed images, we have provided a coherent method and the results of the first VLUCDs. The proposed method generated consistently spatio-temporal LULC maps, using a definitive LCCS designed with reference to end-users’ recommendations and a standard LCCS³⁷. For a visual presentation, level-1 VLUCDs of the year 1990 and 2020 are presented in Fig. 1. The 5-year-interval maps (1990, 1995, 2000, 2005, 2010, 2015, and 2020) of level-2 VLUCDs are presented in Fig. 2. The level-1 and level-2 VLUCDs included ten categories of primary dominant land use/cover (PDLC) and eighteen categories of secondary dominant land use/cover (SDLC), respectively (see detail in Supplementary Table S1).

The reliability of the VLUCDs was evaluated by using both visual interpretations and statistical approaches. Based on high-resolution satellite images in Google Earth, we found that the VLUCDs were clear and noise-free. A confusion matrix method with stratified random sampling (1050 points/LULC category) was utilized to independently validate classification accuracy. Statistical metrics were measured, namely producer accuracy (PA), user accuracy (UA), F1 score, overall accuracy (OA), standard error (SE), and kappa coefficient (KC). These metrics of the level-1 and level-2 LULC maps of the year 2020 are fully described in Tables 1 and 2. Meanwhile, overall obtained OA, KC, and uncertainty of the level-1 and level-2 VLUCDs are presented briefly in Figs. 3 and 4. Uncertainty of measurement was estimated with a 95% confidence interval. Specifically, the OA of the level-1 and level-2 LULC maps ranged from 77.6 ± 1.2% to 84.7 ± 1.1%, and 85.7 ± 1.3% to 92.0 ± 1.2% over the study period, respectively.

Table 1 Confusion matrix of the 2020 Vietnam-wide land use/cover map (Level 1) created from the integration of Landsat OLI, Sentinel SAR GRD and MSI satellite images with the random-forest-based algorithm. PA: Producer accuracy (%); PA: Producer accuracy (%); UA: User accuracy (%); SEM: Standard error of the mean for UA; F1: F1 score; Overall accuracy: 91.6%, and Kappa coefficient: 90.7 (%). RL: Residence; RP: Rice paddies; CL: Cropland; GL: Grassland; BL: Barren land; SL: Scrubland; FL: Forest land; WL: Wetland; OW: Open water; AC: Aquaculture.

Full size table

Table 2 Confusion matrix of the 2020 Vietnam-wide LULC map (Level 2) created from the integration of Landsat OLI, and Sentinel SAR GRD and MSI satellite images with the random-forest-based algorithm. PA: Producer accuracy (%); UA: User accuracy (%); SEM: Standard error of the mean for UA; F1: F1 score; Overall accuracy: 84.7%, and Kappa coefficient: 83.8 (%). R1: Residence 1; R2: Residence 2; RP: Rice paddies; WC: Woody crops; OC: Other crops; IC: In-house crops; GL: Grassland; BL: Barren land; SL: Scrubland; DBF: Deciduous broadleaf forest; EBR: Evergreen broadleaf forest; ENF: Evergreen needleleaf forest; PL: Plantation land; MF: Mangrove forest; IW: Inland wetland; OW: Open water; AC: Aquaculture; BA: Bamboo areas.

Full size table

With the numerous LULC types and long-term observation, these results constituted an outstanding achievement³⁸. For the detailed LULC products (Level 2), open water and mangrove had the highest levels of accuracy, accounting for over 96% in both the PA and UA. This successful classification may be explained by the benefits of spectral indices (Table 4) such as the NDWI, WRI, and NDPI, which could distinguish open water from land, aquaculture ponds, and others, whereas mangrove can be accurately identified with the MVI³⁹. This was followed by rice paddies, which had an accuracy of above 90%. It seems possibly that rice is frequently cultivated in flat terrain, where is not be affected by topographic problems such as the shadows of mountainsides. The spectral reflectance of rice is also stable⁴⁰. Although the model could separate forests from others, it tended to misclassifying different forests. Another limitation is to classify plantation forests from woody crops, which is also found by numerous studies^41,42. Likewise, the model could not entirely divide the different types of residential areas, but it showed a clear separation of the residential areas from others. To increase the accuracy of the maps for further analyses, we combined these mixed categories. This combination obtained an increase in accuracy of approximately 6% with a few losses of detail in LULC types.

Distribution and trend of LUCC

Change detection was conducted to comprehend LUCC patterns and processes. To this end, the level-1 (PDLC) Vietnam-wide annual LULC data sets (L1-VLUCDs) was utilized for further analysis in this study. Although the annual maps are integral to obtain the process of LULC dynamic changes in Vietnam, the five-year-interval land cover products in 1990, 1995, 2000, 2005, 2010, 2015, and 2020 were utilized to acquire a more profound change visualization. A post-classification analysis method was employed to measure the spatio-temporal LUCC and the percentage of changes.

The spatial distributions and the patterns of Vietnam LULC are shown in Fig. 1. The temporal distribution of the net changes in LULC from 1990 to 2020 is presented in Fig. 5. The most dominant LULC was forests, accounting for approximately half area of the entire country. This was followed by croplands (16.3%), rice fields (14.2%), and open water (including parts of saltwater, 8.1%). Grassland and scrubland occupied a relatively similar proportion (2.8%) while the smallest LULC was residential areas (1.3%).

What can be clearly seen in Fig. 5 is the steady decline of forest area from 170,458 km² in 1990 to 150,517 km² in 2020. The area of wetlands experienced a slight increase from 4404 km² in 1900 to 5138 km² in 1995, followed by a continual decrease to 2490 km² in 2020. In contrast, there was a sharp increase in the area of aquaculture and residential land, accounting for approximately three and ten times over the three decades. The area of open water showed a slight rise while there was a small fluctuation in the area of the other land types.

Meanwhile, Fig. 6 reveals the highly dynamic change in LULC in Vietnam. The most considerable change was the area of urban land, which increased by about 50% over each five-year interval. Interestingly, the graph shows substantially opposite trends between aquaculture and wetlands. The area of aquaculture decreased 8% whereas the wetlands expanded 17% from 1990 to 1995, followed by a 50% increase in aquaculture but a 22% decrease in wetlands by 2000. Forest cover had undergone an up-and-down variation by 2015, but it has presented a remarkable drop until now. It is noticed that the percentage of change in the forest cover was insignificant, but its dynamic areas were remarkable (Fig. 7).

Major spatio-temporal LULC dynamics

Vietnam’s LULC has experienced a considerable change over the past 30 years. Figure 1c shows the spatio-temporal dynamic changes. The north and south were the most dynamic areas, especially the western north and the most south. The dynamic conversions among different LULC can be seen in Fig. 7. There was a fundamentally dynamic conversion between forests and croplands. Forest areas remarkably reduced while residential and aquaculture land significantly increased. To easily visualize the LULC transformation, we created additional data in Fig. 8. It is noticed that a considerable proportion of forests was converted into croplands while a major driver of wetland loss resulted in the expansion of aquaculture. Residential lands mainly expanded on the areas of rice, croplands, and barren lands, which are located nearby coastlines. To acquire a more detailed visualization of change patterns, few hotspot regions were extracted throughout the country to discuss the change pattern and processes.

Discussion

Large-scale annual LULC information is integral for understanding the land dynamic process, thus supporting the strategies of land management. In reviewing the literature, little consistent multi-spatio-temporal LULC data was found on a national or regional scale. In this study, a comprehensive framework is developed to produce consistently Vietnam-wide annual LULC data sets, using remote sensing and ground-based data. Results show that surface reflectance images can provide a coherent time-series data set as long as they are atmospherically corrected. Surprisingly, although all available Landsat images of the entire year are utilized, there are data gaps due to cloud and shadow masking areas. These gaps might affect the accuracy of classified maps even though gaps have been filled by ancillary data such as terrain indices. This issue may be explained by the fact that Vietnam is one of the cloudiest countries in the world⁴³. However, the harmonious blend of Landsat OLI, Sentinel SAR GRD and MSI has filled such missing-data gaps since 2015, which can also improve the accuracy of mapping (Figs. 3, 4). Besides, the training migration model significantly reduced the cost and efforts in collecting training data.

Regarding change patterns, the rapid development of urbanization is considered an essential interest in Vietnam. The expansion of urban areas has frequently occurred in the capital and regional capitals, namely Hanoi, Hai Phong, Da Nang, Ho Chi Minh (HCM), and Can Tho cities. Herein, we analyse the process of change in HCM as a typical example. As shown in Fig. 9a, the growth of urban land has remarkably increased since the 1990s. This may be explained by the fact that the introduction of new policies known as “Renovation” (1986), which has promoted the development of socio-economic factors, followed by a massive population migration to cities⁴⁴. The urbanization has primarily taken place on croplands, which agrees with the findings of previous studies^30,44,45,46. The development is predicted to accelerate over developing regions, which causes the loss of croplands, and thus may threaten sustainability and livelihoods⁴⁵.

Another considerable change is the uncontrolled development of agricultural and aquaculture land, especially in the Vietnamese Mekong Delta (VMD). As part of the third-largest basin in the world, the VMD plays an integral role in the contribution of agricultural products, due to its favourable natural condition of 700 km coastline and a dense network of rivers. In fact, rice and shrimp have contributed to a vital position in Vietnam’s economic development for decades^47,48. However, the intensification of uncontrolled aquatic farming has caused significant changes in LULC across the region, especially along coastal zones since 2000 (Fig. 9b). There are several possible explanations for this finding. In 2000 and 2001, the Vietnamese government proposed resolutions 09/ND-CP and 1116, which replaced low-value (e.g. rice) to high-value (e.g. shrimp and fruit) agricultural production, encouraging farmers to transform certain coastal areas into aquatic production. Also, due to the high profits of shrimp (200,000 VND/kg) in comparison to traditional crops such as rice (5000 VND/kg)³⁰, numerous inland areas were converted into aquatic farming. Local people illegally cut mangrove forests to expand aquaculture in several coastal regions⁴⁹. Since 2010, there was not only an increase in aquaculture, but also a significant conversion of other croplands into rice paddies (Fig. 9b). These changes resulted in the formulation of another policy aiming at the increase of intensive rice and fish farming in 2012^50,51. These findings indicate that the development of socio-economic policies is considered as the primary reasons driving LUCC. Land policies, therefore, should be formed and implemented in serious consideration of regional socio-economic and environmental development.

Contrary to the expectations, this study indicates a net area loss of forests instead of the constant increase reported by the Vietnamese Ministry of Agriculture and Rural Development (MARD). Despite the net forest regrowth in 2000 and 2010, the forest cover has undergone a decrease in recent years. Also, the area of forests in this study is greater than the data reported by the MARD. These inconsistencies are due to several reasons, especially the difference of forest definitions. The MARD excluded agricultural (e.g., rubber), aqua-cultural ecosystems, scattered trees, bamboos, and palms, etc. from forests³⁴. These non-forest lands, covering a relatively large area of the country (e.g., 10 km² of rubber only; 2017), were highly dynamic¹⁹ but not fully reported by the MARD. In 2008, the revised definition of forests set a minimum of 10% tree cover as forests, instead of 30% tree cover in the previous definition³⁴. Since 2016, they started to include certain agricultural lands managed by the Vietnam Administration of Forestry in the forest lands but without forest cover⁵¹. These revisions likely resulted in an increase in the reporting data of forests. Furthermore, our results show that deforestation occurred in numerous regions. Figure 9c presents a representative example of forest loss in the central highlands. There has been a constant decrease in forest cover due to the expansion of rice paddies, barren lands, and croplands. This finding of forest loss corroborates the discoveries of a great deal of previous work in LULC observation covering Vietnam^52,53.

Regarding limitations of this study, we could not estimate the benefit of the individual sensor’s characteristics although the harmonious blend of the Landsat OLI, Sentinel SAR GRD and MSI images can fill data gaps and improve the accuracy of mapping classification. Moreover, instead of using ground-truth data, we validated the annual maps of the year 1990 to 2014 with the data collected by visual interpretation. Although great and careful efforts were applied in the collection procedure, errors might not be inevitable due to the restricted high-resolution images in Google Earth, especially before 2000. In addition, we have utilized the random forest algorithm to eliminate the less important input features, but there are relatively numerous remaining features, resulting in a high computational cost. For future work, the nationwide annual multi-category LULC maps and overall change detection were successfully developed, but further research should be undertaken to investigate the drivers of LULC changes in more detail for individual land cover. Finally, deep learning neural networks are expected to be applied for large-scale LULC mapping.

Conclusion

Regularly updated and accurate LULC information is fundamental to interdisciplinary studies. The recent advancement of remote sensing and computational science has improved the mapping capacity of LULC. This study set out to develop a new framework for automatically monitoring nationwide annual LULC and provide the first VLUCDs over the past 30 years. To this end, we utilized ground-based data, the informative Landsat TM, ETM + and OLI, and Sentinel SAR GRD and MSI images, after comprehensively assembled and preprocessed on the GEE platform. Then, we developed a new random-forest-based method and an automatic training migration model (ATMM) to map Vietnam LULC. The obtained VLUCDs had overall accuracies ranging from 85.7 ± 1.3% to 92.0 ± 1.2% with the ten primary dominant land use/cover and 77.6 ± 1.2% to 84.7 ± 1.1% with the eighteen secondary dominant land use/cover. This confirms the potential of the proposed framework for systematically long-term monitoring of LULC in Vietnam. Results reveal that there was a decrease in the area of forests (19,940 km²) and wetlands (1914 km²) whereas the area of aquaculture and urban increased approximately three and ten times over the three decades, respectively. The deforestation was mainly due to the expansion of croplands, which were in return replaced by numerous built-up areas. The rapid growth of aquaculture was considered as a main driver of wetland loss. The explicit spatio-temporal benchmark of the VLUCDs can be utilized for a tremendous variety of applications in the research of environmental changes towards the Sustainable Development Goals. In addition, the ATMM allows analysts to remarkably save time, cost, and labour for collecting sufficient and representative training data. This proposed method is possible to apply for a multi-temporal LULC assessment at a broader scale.

Materials and methods

The overall method is presented in Fig. 10 with major steps: (1) Data preparation; (2) Defining a proper LULC classification system and reference data; (3) Proposing a consistent framework for the automatic production of Vietnam-wide annual LULC data sets from 1990 to 2020; (4) Creating and validating the VLUCDs, and; (5) Generating major change pattern and processes of LULC over the past three decades.

Study area

The study area is mainland Vietnam with a population of 97 million people (2018; Fig. 11). The country covers an area of over 300,000 km² including the Red River Delta, and the Mekong River Delta which is the third-largest delta in the world. The topography of Vietnam is diverse (up to 3300 m altitude) with over 75% of the total area being hills and mountains. These areas are covered by mainly tropical rainforests. Climate is changeable but dominated by a tropical monsoon type with mean annual humidity of 84%, mean annual rainfall from 1200 to 3000 mm, and mean annual temperature from 5 to 37 °C⁵⁴. The complex patterns of climate and topography create the rich biodiversity and landscape heterogeneity of Vietnam’s LULC. Nonetheless, there are the identifying characteristics of LULC in different regions. While the southern region is principally occupied with rice, orchards and aquaculture lands, the northern region is primarily covered by forests and plantations, except for the Red River Delta. Dominant LULC types in the northern centre are evergreen broadleaf forests and annual croplands whereas woody crops, deciduous broadleaf forests, and evergreen needle-leaf forests dominate the southern centre. In this study, to reduce the complexity of the landscape information, we divided the whole country into five main regions and separately classified for each region. These regions are presented in Fig. 11.

Land cover classification system

Defining a standard land cover classification system (LCCS) is a crucial step in the practical land cover assessment. It should be delineated precisely depending on the objectives of users and the availability of mapping resources. Most LULC maps employ the theory and framework of the International Geosphere-Biosphere Programme (IGBP)⁵⁵, the Land Cover Classification System (LCCS)³⁷, and the Coastal Change Analysis Program (C-CAP) Land Cover Classifications⁵⁶. Meanwhile, the most updated LCCS of previous LULC products, covering entire Vietnam, includes 18 land cover categories⁵⁷. However, some categories are inappropriate for Vietnam’s LULC. For example, snow and ice do not exist, while one cropland category does not represent the diverse croplands in Vietnam. Although detailed classifications of high and low developed built-up areas play a fundamental role in urban planning and management for the rapid urbanization of Vietnam, they are not in the previous LULC products. In this study, therefore, a new LULC classification system or topology was developed by remaining the appropriate categories of the Food and Agriculture Organization (FAO) LCCS and adding new proper categories based on the local biophysical environment and end-users’ recommendations in Vietnam. First, we classified a 10-category system of primary dominant land use/cover (PDLC/Level-1). The PDLC was then separated into more detailed land types to generate an 18-category system of secondary dominant land use/cover (SDLC/Level-2). We found that this system is appropriate for practically mapping and applications. The categories and descriptions of the system are presented in Supplementary Table S1.

Remote sensing data

Multi-sensor remote sensing data were used in this study. The data were pre-processed and derived from the GEE. The data were re-projected to Universal Transverse Mercator (UTM) projection (Zone 47-49 N and WGS-84 datum) and then resembled into a 30-m spatial resolution using a bicubic interpolation method⁵⁸. The Geospatial Data Abstraction Library (GDAL), the Geographic Resources Analysis Support System, and Python were utilized for these processing tasks. Specifically, the data included the United States Geological Survey (USGS) Landsat TM, ETM+, and OLI Surface Reflectance Tier 1 with a 30-m spatial resolution, Sentinel MSI Level-2A and SAR GRD with a 10-m spatial resolution. The Landsat and Sentinel MSI have been atmospherically corrected while each scene of Sentinel SAR GRD was preprocessed using Sentinel-1 Toolbox for thermal noise removal, radiometric calibration, terrain correction using the Shuttle Radar Topography Mission (SRTM)⁵⁹, and then converting to decibels⁶⁰. Landsat ETM + images, after the Scan Line Corrector failure in 2003, were removed from this study since the failure may result in inconsistently time-series comparison. Over 99% of the datasets from the GEE archive are reported to have high geometric accuracy with the error being less than half a pixel²⁷. Otherwise, the images were eliminated from our image collection to reduce the obvious bias of further analysis.

For reliable and consistent time-series analysis, further processing is integral. For the optical data, to reduce illumination impacts from elevation, aspect and slope, the topographic correction was performed using the Modified Sun-Canopy-Sensor Topographic Correction algorithm⁶¹. While the Landsat Ecosystem Disturbance Adaptive Processing System (LEDAPS) ⁶² was applied to perform atmospheric correction for Landsat TM and ETM+, the Land Surface Reflectance Code (LaSRC)⁶³ was adopted for Landsat OLI. All Landsat images were masked and removed clouds, cloud shadows and saturation pixels utilizing the Function of Mask (CFMASK)⁶⁴. Sen2Cor was adopted to correct atmospheric issues and mask clouds for Sentinel MSI⁶⁵. Finally, because of the different solar and view angles of Landsat OLI and Sentinel MSI, normalizing the bidirectional reflectance distribution function (BRDF) was applied for the data. Although numerous approaches have developed for BRDF correction, the recent technique generated by Roy et al.⁶⁶ is frequently utilized due to its reliability and effective implementation⁶⁷. This method, therefore, was employed for the BRDF correction of all selected optical images in this study. For the Sentinel SAR GRD data, a further process was speckle filtering. The filtering was done using Lee filter, which is superior due to its capacity of maintaining point targets, edge, linear spaces and texture information⁶⁸.

To increase the availability of cloud-free composite data, the harmonization of different Landsat satellite sensor images, and the Landsat OLI with Sentinel MSI into a congruent time-series was desirable for a cloudy region such as Vietnam. The harmonization allows to accurately compare across all years and to measure the spectral similarity and spectral distance between different years. The measurement of spectral similarity and spectral distance was applied for an automatic training migration model, which was described in the following sections. A linear transformation with band-respective coefficients was applied for the harmonization of Landsat TM and ETM + spectral feature to OLI spectral feature⁶⁹. In the meantime, the harmonized Landsat OLI and Sentinel MSI images were processed by employing a method developed by Claverie et al.⁷⁰. The band-respective coefficients with slope and intercept image constants are presented in Table 3. After that, we generated composites of seven bands including blue, green, red, nir, swir 1, swir 2 and thermal bands for two seasons, the dry season from April to September and the wet season from October to March of the following year. These composites were adopted to measure a variety of covariates, which is represented in the following paragraphs. In addition, seasonal composites of VV and VH polarization in ascending and descending orbits of Sentinel SAR GRD were handled in this research.

Table 3 Band-respective coefficients are defined with slope and intercept image constants and used for the harmonized Landsat OLI and Sentinel MSI images.

Full size table

Satellite-based covariate calculation

In this section, a series of covariates was calculated from the band composites. For the optical data, we calculated the medoid⁷¹ and the standard deviation for the six bands (Table 3). Following the successful application of numerous features extracted from original satellite image bands, this study also added the medoid of the 20th and 80th percentile⁵⁷ of the six bands into the seasonal composites to detect the seasonal changes in the biophysical environment.

The ratios between spectral bands were calculated; they are blue/green, red/blue, red/green, red/nir and nir/(red*swir 1). Besides, a great number of spectral indices were also measured from Landsat TM, ETM + and OLI, and Sentinel MSI images (Eqs. (1–17); Table 4). In addition, we calculated the seasonal mean of VH, VV, and the normalized difference between VH and VV polarizations from the Sentinel SAR GRD images. Finally, we generated seasonal composite collections of covariates.

Table 4 Spectral indices derived from Landsat TM, ETM + and OLI, and Sentinel MSI satellite images to enhance the accurate performance of Vietnam-wide annual LULC mapping from 1990 to 2020.

Full size table

Ancillary data sets

Extensive research has shown that ancillary information can improve the accurate performance of LULC classification^20,82,83. In this study, we first added terrain indices including slope, aspect and elevation. These indices were computed from ALOS Global Digital Surface Model or “ALOS World 3D-30 m (AW2D30)”⁸⁴. Also, distance to rivers, coastlines, transport systems and buildings, and soil types were included in the covariate collections. The buildings and transport systems were generated from the OpenStreetMap, while soil types and river networks were extracted from the OpenDevelopmentMekong⁸⁶.

Reference data

Reference data of 18 LULC categories (Supplementary Table S1) was created from field surveys, provincial LULC statistics, and visual interpretations. We conducted nationwide comprehensive surveys in 2015, 2016, 2018, 2019, and 2020 to collect 3078, 2659, 10,550, 41,986 and 32,853 reference samples, respectively. Along with these ground-observed data, previous outdated LULC maps^19,20,87, provincial LULC statistics and high-resolution satellite images available in the Google Earth were also considered. Herein, we generated approximately 9360 polygons of single homogeneous LULC types (Fig. 11) throughout the country for each year from 2015 to 2020. From these polygons, we extracted up to 120,000 reference pixels (points) for each of the years. Meanwhile, due to the non-availability of ground-truth data, reference data from 1990 to 2014 were collected using provincial LULC statistics, the natural-color images of Landsat TM, ETM + and OLI, Sentinel MSI, and high-resolution satellite images available in the Google Earth. For each year, we randomly extracted 1050 points per LULC category for validating the classification models and the others were used for training the classification model.

The reference data of the year 2020 were utilized not only for creating and validating the LULC map of the year 2020 but also for implementing the automatic training migration model from this reference year to any target years. The migration model is described in more detail in the following section.

Automatic training migration model

Training data is tremendously essential in mapping LULC; however, collecting sufficiently accurate training samples is challenging, especially for large-scale areas, long-term history analyses, and data-scarce environments such as Vietnam⁷³. If training data are not collected consistently, it can result in misclassification or low accuracies⁷⁴. Thus, it is paramount to propose a practical approach for training data collections.

In this study⁠, we utilized an automatic model to migrate from the reference data of a reference year to target years. The method had three essential steps. First, we created a set of training data from a reference year (2020). Then, for each pixel, we computed its surface reflectance values from its corresponding Landsat TM, ETM +, and OLI images of the reference year and target years. The surface reflectance values of six bands (Table 3) were utilized for the measurement of Euclidean distance (ED)⁷⁵ and spectral angle distance (SAD)⁷⁶. Finally, with the ED (Eq. 18) and SAD (Eq. 19), we distinguished changed pixels and unchanged pixels by running a trial and error model to determine thresholds. Although the thresholds can be estimated by analysts, the experimented thresholds of ED and SAD in this study was 0.05 and 0.95, respectively. These thresholds were successfully applied to migrate the training data of the year 2020 to target years. The unchanged pixels were preserved and utilized as training data for the target years.

$$ED = \sqrt {\mathop \sum \limits_{i = 1}^{N} \left( {X_{i} - Y_{i} } \right)^{2} }$$

(18)

$$SAD = arccos\left( {\frac{{\mathop \sum \nolimits_{i = 1}^{N} X_{i} Y_{i} }}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{N} \left( {X_{i} } \right)^{2} \mathop \sum \nolimits_{i = 1}^{N} \left( {Y_{i} } \right)^{2} } }}} \right)$$

(19)

where X is spectral signature vector of an image pixel in the reference year; Y is spectral signature vector of an image pixel in the target year; N is the number of image bands (N = 6).

Machine learning modelling

After completing the data preprocessing task, we generated covariate collections, including (1) optical-image-based covariates in dry seasons and (2) in wet seasons, and (3) SAR-based covariates in dry seasons and (4) in wet seasons. The ancillary information was also added to these covariate collections. It is worth noting that some of these covariates or features may not significantly contribute to the enhancement of classification performance while overabundant features can affect the performance speed or run out of computing capacity of the classification model. Hence, random forest algorithm⁷⁷ was employed to estimate important features. We removed some less important features and kept essential features which were represented in Supplementary Table S2.

For classification, we applied a random forest algorithm for several reasons. First, it has previously been observed that the random forest algorithm can handle principal drawbacks that a single-tree-based method may face such as an over-fitting and non-optimal solution⁷⁸. Also, the random forest shows the outperformance of its rivals such as fuzzy adaptive resonance theory-supervised predictive mapping (Fuzzy ARTMAP), support vector machine (SVM), artificial neural network (ANN), Mahalanobis distance (MD), and spectral angle mapper (SAM)³⁹.

A new random-forest-based approach was developed in this study. Unlike the common use of single-time classification, for each pixel, we independently estimated prior probability values belonging to each of the specified land covers for each of the covariate collections. These prior probability values were then joined to create a set of posterior probability values. The largest value of the posterior probabilities corresponding to a specific land cover was utilized to label the predicted land cover. However, the predicted prior probability of a pixel, for example, p(C_k), might reach almost zero or zero because of ‘No data’ of that pixel at that covariate collection. If this occurs, the posterior probability of that pixel will be nearly zero or zero. That is, although the prior probability of most other collections equals 100% voting for a specified land cover, the probability product of this pixel might be almost zero, causing misclassification. Hence, the prior probability of a pixel corresponding to a particular land cover must not be extremely tiny. To this end, Eq. (20) was developed to adjust the prior probability values while posterior probability values were calculated as Eq. (21).

$$p^{,} \left( {C_{k} } \right) = c*p\left( {C_{k} } \right) + \frac{1 - c}{N}$$

(20)

$$p_{c} \left( {C_{k} } \right) = \mathop \prod \limits_{i = 1}^{I} p_{i}^{,} \left( {C_{k} } \right)$$

(21)

where p’(C_k) is the adjustment of prior probability value of a land cover C_k; c is a constant value (c = 0.7) which was evaluated by trial and error experiments in this study; N is the number of land cover categories (N = 18); p_c(C_k) is the posterior probability value of category C_k; and I is the number of covariate collections (I = 4).

The random-forest-based model was performed using Scikit-Learn 0.22 and Python 3.8.5. Since the input data of multi-sensor image bands and covariates dramatically varied over the 30-year period, we could not optimize all the parameters of the random forest algorithm. The number of trees (n_estimators) in the forest and the size of the random subsets of features (max_features), however, are highly recommended to be adjusted⁷⁷. Using RandomizedSearchCV in the Scikit-Learn, we found that n_estimators = 200 (trees) and max_features = 8 were optimal in this work. The other parameters were set as the default values.

Accuracy assessment

Following the wide-ranging recommendations of instruction manuals^88,89, a statistic-based testing data set (Reference data section) was independently generated to estimate the accuracy of final LULC products. We utilized a stratified sampling (1050 points/LULC category) method and a confusion matrix to assess Vietnam-wide annual LULC products from 1990 to 2020. The matrix produced profound accuracy metrics, namely overall accuracy (OA), user accuracy (UA), standard error (SE), and kappa coefficient (KC). The uncertainty of accuracy was measured with a 95% confidence interval. These metrics are fully described in Tables 1 and 2.

Change analysis

The analysis of changes in LULC is to measure the differences including spatio-temporal dynamic patterns, the magnitude, and rate of variations observed over the study period. First, we estimated the diversity of LULC within each 30-m pixel width by counting the number of times that LULC changes over 30 years (Fig. 1c). We then estimated the area of each LULC within a five-year interval from 1990 to 2020 to observe the trend of LULC change (Fig. 5). We also computed the percentage of net change (Eq. 22) and then rescaled the percentage to a rank between 0 and 100% to monitor the most dynamic LULC (Fig. 6). Finally, we employed a Sankey diagram to emphasize the major transfers of LULC⁹⁰.

$$p = \left( {\frac{{A_{t2} - A_{t1} }}{{A_{t1} *\left( {t_{2} - t_{1} } \right)}}} \right)*100$$

(22)

where p is the percentage of net change; and $A_{t1}$ and $A_{t2}$ (km²) are the area of the LULC type in the observation years $t_{1}$ and $t_{2} ,$ respectively (t₁ < t₂).

References

Gómez, C., White, J. C. & Wulder, M. A. Optical remotely sensed time series data for land cover classification: A review. ISPRS J. Photogramm. Remote. Sens. 116, 55–72 (2016).
Article ADS Google Scholar
Dale, V. H. The relationship between land-use change and climate change. Ecol. Appl. 7, 753–769 (1997).
Article Google Scholar
Dai, A. Increasing drought under global warming in observations and models. Nat. Clim. Chang. 3, 52–58 (2013).
Article ADS Google Scholar
Houghton, R. A. et al. Carbon emissions from land use and land-cover change. Biogeosciences 9, 5125–5142 (2012).
Article ADS CAS Google Scholar
Lambin, E. F. et al. The causes of land-use and land-cover change: Moving beyond the myths. Glob. Environ. Chang. 11, 261–269 (2001).
Article Google Scholar
Song, X. P. et al. Global land change from 1982 to 2016. Nature 560, 639–643 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Grimm, N. B. et al. Global change and the ecology of cities. Science 319, 756–760 (2008).
Article ADS CAS PubMed Google Scholar
Utkin, L. V. et al. A weighted random survival forest. Knowl. Based Syst. 177, 136–144 (2019).
Article Google Scholar
Gašparović, M., Zrinjski, M. & Gudelj, M. Automatic cost-effective method for land cover classification (ALCC). Comput. Environ. Urban Syst. 76, 1–10 (2019).
Article Google Scholar
Hu, Y., Dong, Y. & Batunacun. ,. An automatic approach for land-change detection and land updates based on integrated NDVI timing analysis and the CVAPS method with GEE support. ISPRS J. Photogramm. Remote Sens. 146, 347–359 (2018).
Article ADS Google Scholar
ESA. Sentinel-2 Mission Requirements Document. Earth (2007).
Main-Knorn, M. et al. Sen2Cor for Sentinel-2. In 3 (2017). https://doi.org/10.1117/12.2278218.
Gong, P. et al. Stable classification with limited sample: Transferring a 30-m resolution sample set collected in 2015 to mapping 10-m resolution global land cover in 2017. Sci. Bull. 64, 370–373 (2019).
Article Google Scholar
Homer, C., Huang, C., Yang, L., Wylie, B. & Coan, M. Development of a 2001 National Land-Cover Database for the United States. Photogramm. Eng. Remote. Sens. 70, 829–840 (2004).
Article Google Scholar
Fry, J. A. et al. Completion of the 2006 national land cover database for the conterminous united states. Photogramm. Eng. Remote Sens. 77, 858–864 (2011).
Google Scholar
Homer, C. et al. Completion of the 2011 national land cover database for the conterminous United States—Representing a decade of land cover change information. Photogramm. Eng. Remote Sens. 81, 345–354 (2015).
Google Scholar
Yang, L. et al. A new generation of the United States National Land Cover Database: Requirements, research priorities, design, and implementation strategies. ISPRS J. Photogramm. Remote Sens. 146, 108–123 (2018).
Article ADS Google Scholar
Chen, J. et al. Global land cover mapping at 30 m resolution: A POK-based operational approach. ISPRS J. Photogramm. Remote Sens. 103, 7–27 (2015).
Article ADS Google Scholar
Hoang, T. T., Truong, V. T., Hayashi, M., Tadono, T. & Nasahara, K. N. New JAXA high-resolution land use/land cover map for Vietnam aiming for natural forest and plantation forest monitoring. Remote Sens. 12, 2707 (2020).
Phan, D. C., Trung, T. H., Nasahara, K. N. & Tadono, T. JAXA high-resolution land use/land cover map for Central Vietnam in 2007 and 2017. Remote Sens. 10, 1406 (2018).
Article ADS Google Scholar
Nemani, R. Nasa earth exchange: Next generation earth science collaborative. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. XXXVIII-8/, 17–17 (2012).
Mugiraneza, T., Nascetti, A. & Ban, Y. Continuous monitoring of urban land cover change trajectories with landsat time series and landtrendr-google earth engine cloud computing. Remote Sens. 12, 2883 (2020).
Jin, Y., Liu, X., Yao, J., Zhang, X. & Zhang, H. Mapping the annual dynamics of cultivated land in typical area of the Middle-lower Yangtze plain using long time-series of Landsat images based on Google Earth Engine. Int. J. Remote Sens. 41, 1625–1644 (2020).
Article ADS Google Scholar
Hu, Y. & Hu, Y. Detecting forest disturbance and recovery in Primorsky Krai, Russia, using annual landsat time series and multi-source land cover products. Remote Sens. 12, 129 (2020).
Article Google Scholar
Miettinen, J., Shi, C. & Liew, S. C. 2015 Land cover map of Southeast Asia at 250 m spatial resolution. Remote Sens. Lett. 7, 701–710 (2016).
Article Google Scholar
Ghorbanian, A. et al. Improved land cover map of Iran using Sentinel imagery within Google Earth Engine and a novel automatic workflow for land cover classification using migrated training samples. ISPRS J. Photogramm. Remote Sens. 167, 276–288 (2020).
Article ADS Google Scholar
Huang, H. et al. The migration of training samples towards dynamic global land cover mapping. ISPRS J. Photogramm. Remote Sens. 161, 27–36 (2020).
Article ADS Google Scholar
Bagan, H. & Yamagata, Y. Land-cover change analysis in 50 global cities by using a combination of Landsat data and analysis of grid cells. Environ. Res. Lett. 9, 064015 (2014).
Radoux, J. et al. Automated training sample extraction for global land cover mapping. Remote Sens. 6, 3965–3987 (2014).
Article ADS Google Scholar
Tran, H., Tran, T. & Kervyn, M. Dynamics of land cover/land use changes in the Mekong Delta, 1973–2011: A Remote sensing analysis of the Tran Van Thoi District, Ca Mau Province, Vietnam. Remote Sens. 7, 2899–2925 (2015).
Article ADS Google Scholar
Chi, V. K. et al. Land transitions in Northwest Vietnam: An integrated analysis of biophysical and socio-cultural factors. Hum. Ecol. 41, 37–50 (2013).
Article Google Scholar
Villamor, G. B., Catacutan, D. C., Truong, V. A. T. & Thi, L. D. Tree-cover transition in Northern Vietnam from a gender-specific land-use preferences perspective. Land Use Policy 61, 53–62 (2017).
Article Google Scholar
Truong, V. T. et al. JAXA annual forest cover maps for Vietnam during 2015–2018 Using ALOS-2/PALSAR-2 and auxiliary data. Remote Sens. 11, 2412 (2019).
Article ADS Google Scholar
And, R. D. of A. Vietnam’S Modified Submission on Refreence Levels for Redd+ Results Based Payments Under Unfccc. https://redd.unfccc.int/files/vietnam_frl_modified__submission_final_for_posting.pdf (2016).
Xu, X., Jain, A. K. & Calvin, K. V. Quantifying the biophysical and socioeconomic drivers of changes in forest and agricultural land in South and Southeast Asia. Glob. Chang. Biol. 25, 2137–2151 (2019).
Article ADS PubMed Google Scholar
Hansen, M. C. et al. High-resolution global maps of 21st-century forest cover change. Science (80- ) 342, 850–853 (2013).
Article ADS CAS Google Scholar
Di Gregorio, A., and Jansen, L. J. M. Land Cover Classification System (LCCS): Classification Concepts and User Manual. Fao http://www.fao.org/3/x0596e/x0596e00.htm (2000).
Van Thinh, T., Phan, D. C., Nasahara, K. N. & Tadono, T. How does land use/land cover map’s accuracy depend on number of classification classes? Sci. Online Lett. Atmos. 15, 28–31 (2019).
Google Scholar
Talukdar, S. et al. Land-use land-cover classification by machine learning classifiers for satellite observations—a review. Remote Sens. 12, 1135 (2020).
Article ADS Google Scholar
Song, S., Gong, W., Zhu, B. & Huang, X. Wavelength selection and spectral discrimination for paddy rice, with laboratory measurements of hyperspectral leaf reflectance. ISPRS J. Photogramm. Remote Sens. 66, 672–682 (2011).
Article ADS Google Scholar
Szantoi, Z., Smith, S. E., Strona, G., Koh, L. P. & Wich, S. A. Mapping orangutan habitat and agricultural areas using Landsat OLI imagery augmented with unmanned aircraft system aerial photography. Int. J. Remote Sens. 38, 2231–2245 (2017).
Article ADS Google Scholar
Hill, R. A. & Thomson, A. G. Mapping woodland species composition and structure using airborne spectral and LiDAR data. Int. J. Remote Sens. 26, 3763–3779 (2005).
Article ADS Google Scholar
Kontgis, C., Schneider, A. & Ozdogan, M. Mapping rice paddy extent and intensification in the Vietnamese Mekong River Delta with dense time stacks of Landsat data. Remote Sens. Environ. 169, 255–269 (2015).
Article ADS Google Scholar
Kontgis, C. et al. Monitoring peri-urbanization in the greater Ho Chi Minh City metropolitan area. Appl. Geogr. 53, 377–388 (2014).
Article Google Scholar
D’Amour, C. B. et al. Future urban land expansion and implications for global croplands. Proc. Natl. Acad. Sci. USA 114, 8939–8944 (2017).
Article CAS Google Scholar
Ha, T. V., Tuohy, M., Irwin, M. & Tuan, P. V. Monitoring and mapping rural urbanization and land use changes using Landsat data in the northeast subtropical region of Vietnam. Egypt. J. Remote Sens. Sp. Sci. 23, 11–19 (2020).
Google Scholar
Nhan, T. Q., Van Ly, L. T. & Tan, L. V. How much do rice farmers earn from their crops? Evidence from a rice-exporting country. J. Agric. Stud. 8, 302 (2020).
Google Scholar
Truong, T. D. & Do, L. H. Mangrove forests and aquaculture in the Mekong river delta. Land Use Policy 73, 20–28 (2018).
Article Google Scholar
Lam-Dao, N., Pham-Bach, V., Nguyen-Thanh, M., Pham-Thi, M.-T. & Hoang-Phi, P. Change detection of land use and riverbank in Mekong Delta, Vietnam using time series remotely sensed data. J. Resour. Ecol. 2, 370–374 (2011).
Google Scholar
Ha, T. T. P., van Dijk, H. & Visser, L. Impacts of changes in mangrove forest management practices on forest accessibility and livelihood: A case study in mangrove-shrimp farming system in Ca Mau Province, Mekong Delta, Vietnam. Land Use Policy 36, 89–101 (2014).
Article Google Scholar
Le, T. N., Bregt, A. K., van Halsema, G. E., Hellegers, P. J. G. J. & Nguyen, L. D. Interplay between land-use dynamics and changes in hydrological regime in the Vietnamese Mekong Delta. Land Use Policy 73, 269–280 (2018).
Article Google Scholar
Khuc, Q. V., Tran, B. Q., Meyfroidt, P. & Paschke, M. W. Drivers of deforestation and forest degradation in Vietnam: An exploratory analysis at the national level. For. Policy Econ. 90, 128–141 (2018).
Article Google Scholar
Meyfroidt, P., Vu, T. P. & Hoang, V. A. Trajectories of deforestation, coffee expansion and displacement of shifting cultivation in the Central Highlands of Vietnam. Glob. Environ. Chang. 23, 1187–1198 (2013).
Article Google Scholar
Ngo-Duc, T., Kieu, C., Thatcher, M., Nguyen-Le, D. & Phan-Van, T. Climate projections for Vietnam based on regional climate models. Clim. Res. 60, 199–213 (2014).
Article Google Scholar
Lindesay, J. A. et al. International geosphere-biosphere programme/international global atmospheric chemistry SAFARI-92 field experiment: Background and overview. J. Geophys. Res. Atmos. 101, 23521–23530 (1996).
Article ADS CAS Google Scholar
Klemas, V. V., Dobson, J. E., Ferguson, R. L. & Haddad, K. D. A coastal land cover classification system for the NOAA Coastwatch Change Analysis Project. J. Coast. Res. 9, 862–872 (1993).
Google Scholar
Saah, D. et al. Primitives as building blocks for constructing land cover maps. Int. J. Appl. Earth Obs. Geoinf. 85, 101979 (2020).
Google Scholar
Keys, R. G. Cubic convolution interpolation for digital image processing. IEEE Trans. Acoust. 29, 1153–1160 (1981).
Article MathSciNet MATH Google Scholar
Farr, T. G. et al. The shuttle radar topography mission. Rev. Geophys. 45, RG2004 (2007).
Filipponi, F. Sentinel-1 GRD preprocessing workflow. Proceedings 18, 11 (2019).
Google Scholar
Soenen, S. A., Peddle, D. R. & Coburn, C. A. SCS+C: A modified sun-canopy-sensor topographic correction in forested terrain. IEEE Trans. Geosci. Remote Sens. 43, 2148–2159 (2005).
Article ADS Google Scholar
Saleous, N. & Kutler, J. LEDAPS Calibration, Reflectance, Atmospheric Correction Preprocessing Code, Version 2 Summary: Data Citation: Model Product Description: Oak Ridge National Laboratory Distributed Active Archive Center http://daac.ornl.gov/cgi-bin/dsviewer.pl?ds_id=1146 (2012) 10.3334/ORNLDAAC/1146.
Vermote, E., Justice, C., Claverie, M. & Franch, B. Preliminary analysis of the performance of the Landsat 8/OLI land surface reflectance product. Remote Sens. Environ. 185, 46–56 (2016).
Article ADS PubMed Google Scholar
Foga, S. et al. Cloud detection algorithm comparison and validation for operational Landsat data products. Remote Sens. Environ. 194, 379–390 (2017).
Article ADS Google Scholar
Louis, J. et al. Sentinel-2 Global Surface Reflectance Level-2a Product Generated with Sen2Cor. In International Geoscience and Remote Sensing Symposium (IGARSS) 8522–8525 (2019). https://doi.org/10.1109/IGARSS.2019.8898540.
Roy, D. P. et al. A general method to normalize Landsat reflectance data to nadir BRDF adjusted reflectance. Remote Sens. Environ. 176, 255–271 (2016).
Article ADS Google Scholar
Roy, D. P. et al. Examination of Sentinel-2A multi-spectral instrument (MSI) reflectance anisotropy and the suitability of a general method to normalize MSI reflectance to nadir BRDF adjusted reflectance. Remote Sens. Environ. 199, 25–38 (2017).
Article ADS Google Scholar
Lee, J. S., Ainsworth, T. L., Wang, Y. & Chen, K. S. Polarimetric SAR speckle filtering and the extended sigma filter. IEEE Trans. Geosci. Remote Sens. 53, 1150–1160 (2015).
Article ADS Google Scholar
Roy, D. P. et al. Characterization of Landsat-7 to Landsat-8 reflective wavelength and normalized difference vegetation index continuity. Remote Sens. Environ. 185, 57–70 (2016).
Article ADS PubMed Google Scholar
Claverie, M. et al. The Harmonized Landsat and Sentinel-2 surface reflectance data set. Remote Sens. Environ. 219, 145–161 (2018).
Article ADS Google Scholar
Flood, N. Seasonal composite landsat TM/ETM+ Images using the medoid (a multi-dimensional median). Remote Sens. 5, 6481–6500 (2013).
Article ADS Google Scholar
Kaufman, Y. J. & Tanré, D. Atmospherically Resistant Vegetation Index (ARVI) for EOS-MODIS. IEEE Trans. Geosci. Remote Sens. 30, 261–270 (1992).
Article ADS Google Scholar
Saah, D. et al. Land cover mapping in data scarce environments: Challenges and opportunities. Front. Environ. Sci. 7, 150 (2019).
Article Google Scholar
Li, C., Wang, J., Wang, L., Hu, L. & Gong, P. Comparison of classification algorithms and training sample sizes in urban land classification with landsat thematic mapper imagery. Remote Sens. 6, 964–983 (2014).
Article ADS Google Scholar
Richards, J. A. Remote sensing digital image analysis: An introduction. Remote Sensing Digital Image Analysis: An Introduction vol. 9783642300 (2013).
Kruse, F. A. et al. The spectral image processing system (SIPS)-interactive visualization and analysis of imaging spectrometer data. Remote Sens. Environ. 44, 145–163 (1993).
Article ADS Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article MATH Google Scholar
He, Y., Lee, E. & Warner, T. A. A time series of annual land use and land cover maps of China from 1982 to 2013 generated using AVHRR GIMMS NDVI3g data. Remote Sens. Environ. 199, 201–217 (2017).
Article ADS Google Scholar
Zhao, H. & Chen, X. Use of normalized difference bareness index in quickly mapping bare areas from TM/ETM+. In International Geoscience and Remote Sensing Symposium (IGARSS) vol. 3 1666–1668 (IEEE, 2005).
Baloloy, A. B., Blanco, A. C., Raymund Rhommel, R. R. C. & Nadaoka, K. Development and application of a new mangrove vegetation index (MVI) for rapid and accurate mangrove mapping. ISPRS J. Photogramm. Remote Sens. 166, 95–117 (2020).
Article ADS Google Scholar
García, M. J. L. & Caselles, V. Mapping burns and natural reforestation using thematic mapper data. Geocarto Int. 6, 31–37 (1991).
Article Google Scholar
Wright, C. & Gallant, A. Improved wetland remote sensing in Yellowstone National Park using classification trees to combine TM imagery and ancillary environmental data. Remote Sens. Environ. 107, 582–605 (2007).
Article ADS Google Scholar
Hutchinson, C. F. Techniques for combining Landsat and ancillary data for digital classification improvement. Photogramm. Eng. Remote Sens. 48, 123–130 (1982).
Google Scholar
Tadono, T. et al. Precise global DEM generation by ALOS PRISM. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. II–4, 71–76 (2014).
Article Google Scholar
Jokar Arsanjani, J., Zipf, A., Mooney, P. & Helbich, M. OpenStreetMap in GIScience. OpenStreetMap in GIScience: Experiences, Research, Applications (2015). https://doi.org/10.1007/978-3-319-14280-7.
Open Development Mekong. OpenDevelopmentMekong. East-West Management Institute https://data.opendevelopmentmekong.net/organization/6f37a27d-2790-4b9a-8570-a36cb1d8108f?res_format=KML (2015).
Truong, V. T. et al. JAXA annual forest cover maps for Vietnam during 2015–2018 Using ALOS-2/PALSAR-2 and auxiliary data. Remote Sens. 11 (2019).
Olofsson, P. et al. Good practices for estimating area and assessing accuracy of land change. Remote Sens. Environ. 148, 42–57 (2014).
Article ADS Google Scholar
Congalton, R. G. A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens. Environ. 37, 35–46 (1991).
Article ADS Google Scholar
Schmidt, M. The Sankey diagram in energy and material flow management—part II: Methodology and current applications. J. Ind. Ecol. 12, 173–185 (2008).
Article Google Scholar
Zha, Y., Gao, J. & Ni, S. Use of normalized difference built-up index in automatically mapping urban areas from TM imagery. Int. J. Remote Sens. 24, 583–594 (2003).
Article ADS Google Scholar
Lacaux, J. P., Tourre, Y. M., Vignolles, C., Ndione, J. A. & Lafaye, M. Classification of ponds from high-spatial resolution remote sensing: Application to Rift Valley Fever epidemics in Senegal. Remote Sens. Environ. 106, 66–74 (2007).
Article ADS Google Scholar
Tucker, C. J. Red and photographic infrared linear combinations for monitoring vegetation. Remote Sens. Environ. 8, 127–150 (1979).
Article ADS Google Scholar
McFeeters, S. K. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. Int. J. Remote Sens. 17, 1425–1432 (1996).
Article ADS Google Scholar
Huete, A. R. A soil-adjusted vegetation index (SAVI). Remote Sens. Environ. 25, 295–309 (1988).
Article ADS Google Scholar
Penuelas, J., Baret, F. & Filella, I. Semi-empirical indices to assess carotenoids/chlorophyll a ratio from leaf spectral reflectance. Photosynthetica 31, 221–230 (1995).
CAS Google Scholar
Jayamanna, S., Kawamura, M. & Tsujiko, Y. Relation between social and environmental conditions in colombo, sri lanka and the urban index estimated by satellite remote sensing data. Int. Arch. Photogram. Remote Sens. 31, 321–326 (1996).
Google Scholar
Shen, L. & Li, C. Water body extraction from Landsat ETM+ imagery using adaboost algorithm. In 2010 18th International Conference on Geoinformatics, Geoinformatics 2010 (2010). https://doi.org/10.1109/GEOINFORMATICS.2010.5567762.
Perry, C. R. & Lautenschlager, L. F. Functional equivalence of spectral vegetation indices. Remote Sens. Environ. 14, 169–182 (1984).
Article ADS Google Scholar
As-syakur, A. R., Adnyana, I. W. S., Arthana, I. W. & Nuarsa, I. W. Enhanced built-UP and bareness index (EBBI) for mapping built-UP and bare land in an urban area. Remote Sens. 4, 2957–2970 (2012).
Article ADS Google Scholar
Liu, H. Q. & Huete, A. Feedback based modification of the NDVI to minimize canopy background and atmospheric noise. IEEE Trans. Geosci. Remote Sens. 33, 457–465 (1995).
Article ADS Google Scholar
Gitelson, A. A., Viña, A., Ciganda, V., Rundquist, D. C. & Arkebauer, T. J. Remote estimation of canopy chlorophyll content in crops. Geophys. Res. Lett. 32, 1–4 (2005).
Article CAS Google Scholar

Download references

Acknowledgements

The authors would like to thank the developers of the GEE for providing satellite images and a high computational platform for the big data processing. This work was supported from the Ecosystem Group of Earth Observation Research Centre (EORC) of the JAXA and the Project for Human Resource Development Scholarship by Japanese Grant Aid (JDS), Japan International Cooperation Agency (JICA) https://doi.org/10.13039/501100004532.

Author information

Authors and Affiliations

Graduate School of Science and Technology, University of Tsukuba, Tennoudai 1-1-1, Tsukuba, Ibaraki, 305-8572, Japan
Duong Cao Phan & Taiga Sasagawa
Hydraulic Construction Institute, Vietnam Academy for Water Resources, No. 3, Alley 95, Chua Boc Street, Dong Da district, Hanoi, 116765, Vietnam
Duong Cao Phan
Department of Survey, Mapping and Geographic Information, Ministry of Natural Resources and Environment, 2 Dang Thuy Tram Street, Hanoi, 100000, Vietnam
Ta Hoang Trung
VNU Center for Development in Hoa Lac, Vietnam National University, Hanoi, Thach Hoa Commune, Thach That District, Hanoi, 155500, Vietnam
Van Thinh Truong
Forest Inventory and Planning Institute (FIPI), Ministry of Agriculture and Rural Development (MARD), Vinh Quynh, Thanh Tri, Hanoi, 100000, Vietnam
Thuy Phuong Thi Vu
GIS Group, Department of Business and IT, University of South-Eastern Norway, Gullbringvegen 36, N-3800, Bø i Telemark, Norway
Dieu Tien Bui
Earth Observation Research Center, Japan Aerospace Exploration Agency (JAXA), 2-1-1 Sengen, Tsukuba, Ibaraki, 305-8505, Japan
Masato Hayashi & Takeo Tadono
Faculty of Life and Environmental Sciences, University of Tsukuba, Tennoudai 1-1-1, Tsukuba, Ibaraki, 305-8572, Japan
Kenlo Nishida Nasahara

Authors

Duong Cao Phan
View author publications
You can also search for this author in PubMed Google Scholar
Ta Hoang Trung
View author publications
You can also search for this author in PubMed Google Scholar
Van Thinh Truong
View author publications
You can also search for this author in PubMed Google Scholar
Taiga Sasagawa
View author publications
You can also search for this author in PubMed Google Scholar
Thuy Phuong Thi Vu
View author publications
You can also search for this author in PubMed Google Scholar
Dieu Tien Bui
View author publications
You can also search for this author in PubMed Google Scholar
Masato Hayashi
View author publications
You can also search for this author in PubMed Google Scholar
Takeo Tadono
View author publications
You can also search for this author in PubMed Google Scholar
Kenlo Nishida Nasahara
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.C.P. the conception, design of the work, the acquisition, analysis, and interpretation of data, the creation of new software used in this work, and the original draft; T.H.T. software, investigation, and visualization; V.T.T. validation, investigation, and the original draft; T.S. and T.P.T.V. the original draft and investigation; D.T.B., M.H., T.T. and K.N.N. writing – editing and substantively revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Duong Cao Phan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Phan, D.C., Trung, T.H., Truong, V.T. et al. First comprehensive quantification of annual land use/cover from 1990 to 2020 across mainland Vietnam. Sci Rep 11, 9979 (2021). https://doi.org/10.1038/s41598-021-89034-5

Download citation

Received: 22 November 2020
Accepted: 20 April 2021
Published: 11 May 2021
DOI: https://doi.org/10.1038/s41598-021-89034-5

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Not just crop or forest: an integrated land cover map for agricultural and natural areas

Annual Impervious Surface Data from 2001–2020 for West African Countries: Ghana, Togo, Benin and Nigeria

Integrated usage of historical geospatial data and modern satellite images reveal long-term land use/cover changes in Bursa/Turkey, 1858–2020

Introduction

Results

The accuracy of the first VLUCDs

Distribution and trend of LUCC

Major spatio-temporal LULC dynamics

Discussion

Conclusion

Materials and methods

Study area

Land cover classification system

Remote sensing data

Satellite-based covariate calculation

Ancillary data sets

Reference data

Automatic training migration model

Machine learning modelling

Accuracy assessment

Change analysis

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links