Main

Global population growth and increasing standards of living inevitably cause the expansion and intensification of global agricultural land use to fulfil growing demands for food, biofuel and other commodities1,2,3. In turn, agriculture expansion and intensification threaten ecosystem functioning and lead to species extinction through habitat loss and fragmentation3,4,5,6. The United Nations’ 2030 Sustainable Development Goals (SDGs) call for balancing increasing agricultural production with maintenance of ecosystem services7. Implementation of SDGs to improve food security, protect freshwater and terrestrial ecosystems, and mitigate climate change requires national policies and international cooperation that are based on consistent, independent and timely data on agriculture extent and productivity8,9. Spatiotemporally consistent satellite observations provide the most accurate and cost-effective solution for global agricultural, land-use mapping and monitoring10. Satellite data have been shown to enable national and global agriculture mapping11,12,13,14,15,16,17. However, no globally consistent, multidecadal, cropland time-series data at locally relevant spatial resolutions (30 m per pixel) exist to date.

In the present study, we present a global cropland extent and change dataset that can contribute to monitoring national and global progress towards SDGs. We define cropland as land used for annual and perennial herbaceous crops for human consumption, forage (including hay) and biofuel. Perennial woody crops, permanent pastures and shifting cultivation are excluded from the definition. The fallow length is limited to 4 years for the cropland class. Our definition is largely consistent with the arable land category reported by the Food and Agriculture Organization (FAO) of the UN18. To create the cropland dataset, we utilized the consistently processed 30 m spatial resolution Landsat satellite data archive19 from 2000 to 2019. The Landsat time-series data were transformed into multitemporal metrics that characterize land surface phenology. These metrics were used as independent variables for a machine-learning classification to map global cropland extent. The classification models were locally calibrated using extensive training data collected by visual interpretation of freely available, high-spatial-resolution satellite images. We used a probability sample, stratified based on the Landsat-based global cropland maps, to estimate cropland area and its associated uncertainty, and to analyse pathways of land-use conversion. Sample reference data were collected through visual interpretation of Landsat time-series data and higher-spatial-resolution satellite images. Cropland maps were integrated with the Moderate Resolution Imaging Spectroradiometer (MODIS)-derived annual net primary production (NPP)20 as a proxy variable for analysing crop productivity. The analysis was performed in 4-year epochs (2000–2003, 2004–2007, 2008–2011, 2012–2015 and 2016–2019). We created one cropland map per epoch (five maps in total), hereafter referred to by the last year of the interval (for example, the 2019 map represents the 2016–2019 epoch).

Results and discussion

Cropland area and change

Using probability sample data, we estimated the 2019 global cropland area to be 1,244.2 ± 62.7 Mha (the uncertainty represents the 95% confidence interval (CI)). Of the global cropland area, 55% is in Eurasia, 17% in Africa, 16% in North and Central America, 9% in South America and 3% in Australia and New Zealand (Table 1; see Extended Data Fig. 1 for region boundaries). During the first two decades of the twenty-first century, global cropland area increased by 101.9 ± 45.1 Mha, equivalent to 9% of the 2003 cropland area (Fig. 1). The largest cropland expansion was observed in Africa (by 53.2 ± 39.4 Mha, or 34%). South America had the largest relative cropland gain (by 37.1 ± 8.7 Mha, or 49%). Australia and New Zealand, as well as south-west Asia, displayed moderate cropland expansion (<0% of the 2003 area). North America, Europe and north and south-east Asia featured small net cropland area change but pronounced gross cropland gain and loss, which balanced each other at the continental scale.

Table 1 The regional and global map-based and sample-based cropland and cropland dynamics areas
Fig. 1: Global cropland area (map based and sample based) and annual cropland NPP.
figure 1

Cropland area was mapped for each 4-year interval. Sample analysis was performed only for the first and the last intervals. The MODIS-based annual NPP represents a 4-year average within the cropland map for the corresponding time interval. The error bars for sample-based cropland area estimates represent the 95% CI whereas the error bars for NPP represent 1 s.d. of annual values within the time interval.

Source data

From 2003 to 2019, the global population increased by 21% from 6.4 billion to 7.7 billion21. As a result, global per-capita cropland area decreased by 10%, from 0.18 ha per person in 2003 to 0.16 ha per person in 2019. An increase in per-capita cropland area was observed only in South America, while it decreased in all other continents (Fig. 2). The largest relative decrease of per-capita cropland area was observed in south-west Asia (by 19%). South-east Asia had the smallest 2019 per-capita cropland area (0.08 ha per person), whereas Australia and New Zealand had the largest (1.34 ha per person).

Fig. 2: Total and per-capita cropland area change, 2003–2019, per geographic region.
figure 2

The size of the bubbles reflects regional 2019 cropland area.

Source data

A comparison of our 2003 and 2019 sample-based cropland area estimates with national 2003 and 2018 arable land area reported by the FAO22 (Extended Data Fig. 2) shows sound agreements (R2 of 0.94 and 0.98 for the years 2003 and 2019, respectively). Our sample-based cropland area estimate is smaller (by 16% and 11% for 2003 and 2019, respectively) compared with FAO arable land area. The FAO arable land change confirms our findings: both datasets demonstrate a net increase in global cropland area, with Africa and South America showing the largest net gains. The net loss of arable land area in North America, Europe and north and south-west Asia reported by the FAO was not confirmed by our results.

Attribution of cropland gains and losses

Of the total 2019 cropland area, 217.5 ± 37.7 Mha (17%) represents new cropland established since 2003. In South America and Africa, this proportion is the highest (39% and 34%, respectively). Half of the new croplands replaced natural woody and herbaceous vegetation (49% of gross cropland gain area; Table 2). Of that total, 11% represents dryland conversion through irrigation, mostly found in south-west and south-east Asia and North America. The largest proportions of natural vegetation conversion to croplands (excluding dryland irrigation) were found in Africa (79% of all gross cropland gain area), south-east Asia (61%) and South America (39%). Cropland expansion is a major factor of forest loss and wildland fragmentation4,23,24, which illustrates a conflict with SDG 15, specifically, the SDG’s targets to halt deforestation and degradation of natural habitats9. The other half of cropland expansion (51%) was due to pasture conversion and recultivation of abandoned arable land. Nearly all cropland expansion in Australia, New Zealand, Europe and northern Asia was found within pastures and long fallows (with no crop cultivation for >4 years). In North and South America, cropland expansion through the conversion of pastures and long fallows was more common (75% and 61%, respectively) than through clearing of natural vegetation24,25.

Table 2 Relative importance of different types of land-use conversions for cropland establishment (gain) and abandonment (loss), estimated from sample reference data

Abandonment or conversion to other land uses affected 10% of the 2003 cropland area (115.5 ± 24.1 Mha). Of that area, 52% was either converted into pastures or abandoned (Table 2); such conversions may be temporary and followed by crop recultivation years later. Industrial and residential construction and infrastructure development were the second largest driver of gross cropland loss, responsible for 16% of the total cropland area reduction. In south-east Asia, 35% of cropland reduction was due to urban sprawl. A portion (13%) of 2003 cropland was converted to permanent woody crops or aquaculture, with the highest proportion of such transitions in south-east Asia (28%). Flooding caused by surface water increase, water erosion and reservoir construction affected the cropland area on all continents (3% total reduction). The remaining 16% of cropland reduction represented tree plantations or restoration of natural vegetation after cropland abandonment.

Cropland dynamics on the continental and national scales

The global Landsat-based, cropland map time-series is complementary to the sample analysis in characterizing global area dynamics (Fig. 3). The sample analysis showed high accuracy of the global cropland maps with variability between regions and lower accuracies for change dynamics (Table 3). The cropland map time-series allowed us to disaggregate change over time and conduct national-scale analyses.

Fig. 3: Global cropland extent and change, 2000–2019.
figure 3

The map shows the proportion of stable cropland, cropland expansion and cropland reduction within 0.025° × 0.025° grid cells. The original cropland map time-series has a spatial resolution of 0.00025° per pixel, approximately 30 m at the Equator. Country boundaries are from GADM (https://gadm.org).

Table 3 Regional and global map accuracy metrics

Global cropland expansion accelerated over the past two decades, with a near doubling of the annual expansion rate from 5.1 MHa per year to 9.0 Mha per year (Table 4). The change in annual cropland expansion rates highlights differences between cropland establishment in Africa and South America. In Africa, cropland expansion accelerated from 2004–2007 to 2016–2019, with a more than twofold increase in annual expansion rates. In contrast, cropland expansion in South America decelerated by 2019, with an annual expansion rate reduced by almost half compared with the 2004–2007 interval.

Table 4 Map-based annual cropland area change

At the national level, the USA had the largest cropland area by 2019, closely followed by India and China (Supplementary Table 4). The largest net cropland increases were found in Brazil (by 23.1 Mha, or 77% increase over year 2003 cropland area) and India (by 15.5 Mha or 13%). The largest cropland area reductions were found in Russia (by 5.7 Mha, or 6% decrease over year 2003 cropland area) and Cuba (by 0.5 Mha or 28%).

Per-country cropland area derived from our 2019 satellite-based map can be compared against the FAO’s arable land estimates for 2018 (ref. 22) (R2 of 0.97; Extended Data Fig. 3a) and with 100-m cropland fraction mapped by the Copernicus Moderate Dynamic Land Cover v.3 dataset17 (R2 of 0.96; Extended Data Fig. 3b). The differences between national cropland estimates for selected countries may be attributed to different factors. We suggest that, in Russia, where crop abandonment is widespread and not fully documented, the arable land is overestimated by the FAO. In Brazil and Paraguay, the Copernicus cropland fraction dataset shows almost twice the size of cropland area compared with our estimate. This overestimation is partly due to misclassification of pastures as croplands by the Copernicus dataset.

Cropland NPP change

The global MODIS-derived annual NPP within the cropland area (Extended Data Fig. 4) increased by 25% between 2003 and 2019 (from 4.4 Pg C year−1 to 5.5 Pg C year−1; Fig. 1). South America had the highest NPP increase (by 0.38 Pg C year−1, or 88%) followed by Africa (by 0.29 Pg C year−1, or 50%) (Table 5). The per-capita annual cropland NPP also increased globally by 3.5%, balancing the per-capita cropland area reduction. Two processes contributed to the global cropland NPP increase, namely the increase in cropland area and the increase in crop primary production per unit area. We found that the mean NPP per unit area within stable croplands (croplands presented over the entire 2000–2019 interval) increased by 10%, from 402 g C m−2 year−1 in 2003 to 442 g C m−2 year−1 in 2019. The highest NPP increase within stable croplands was found in South America (from 528 g C m−2 year−1 in 2003 to 730 g C m−2 year−1 in 2019, or by 25%). The NPP gain within stable croplands explains 34% of the total cropland NPP increase from 2003 to 2019. Although NPP cannot be used to directly estimate yields, as a measure of vegetation productivity it can be used as an indicator of intensification of crop production.

Table 5 Average annual NPP within cropland area

Discussion

The 2019 global cropland map (Extended Data Fig. 5) shows that global cropland distribution and dynamics do not follow national boundaries, but rather reflect agricultural potential, population and land-use history. Major lowland regions of the world have been converted to homogeneous agricultural landscapes, including the Great Plains in North America, the Pampas in South America, the Pontic steppe in Europe, north China and the Manchurian Plains in east Asia, the Indo-Gangetic Plain in south Asia, parts of the Sahel region in Africa and south-east Australia. Cropland expansion in South America occurred synchronously in Brazil, Argentina, Paraguay, Bolivia and Uruguay. A similar pattern of simultaneous cropland expansion was observed within Sahelian and Central African countries. In south-west and south-east Asia, cropland gain was mostly found in drylands, whereas tree plantations, orchards, aquaculture and urban areas replaced former croplands in China and the Lower Mekong countries. In Russia, the massive cropland abandonment in the north26 was partly compensated by the recent cropland expansion in the southern steppe regions, primarily through fallow land recultivation. The cross-boundary distribution of major cropland areas and synchronous cropland dynamics illustrate the importance of international cooperation to ensure global progress towards SDGs.

Global cropland maps provide spatial context on national, cross-boundary and local cropland dynamics, reflecting the history of land tenure, national policies and abrupt events such as natural and man-made disasters (Extended Data Fig. 6). In eastern Europe, the Baltic states and Russia’s Kaliningrad region featured cropland expansion through recultivation of long fallows abandoned after the breakdown of the USSR, whereas the cropland area in neighbouring Poland and Belarus was relatively stable. Cereal, forage and hay production land of the northern Great Plains has different dynamics within Canada, where we observed land abandonment or conversion to permanent pastures, and the USA, where land management has been intensified. The irrigated croplands in Saudi Arabia declined after the depletion of groundwater resources and the implementation of state policies to discourage water-intensive crop production27. The 30-m spatial resolution of the cropland maps supports the analysis of local dynamic factors, for example, cropland abandonment after radioactive contamination following the 2011 nuclear disaster on the Fukushima Daiichi nuclear power plant in Japan.

Changes in total and per-capita mapped cropland area from 2003 to 2019 demonstrate the variability of national responses to the need for increased food production to feed a growing population (Extended Data Fig. 7). For most countries with moderate cropland area gains, we observed small decreases in per-capita cropland area. In many African nations (for example, Cameroon, Chad, Tanzania and Uganda, among others) the relatively large cropland area increases compensated for population growth and resulted in small changes in per-capita cropland area. In other countries, cropland increase was not adequate to follow population growth, causing a substantial decrease of cropland per capita (for example, in Ethiopia, Nigeria, Pakistan, Senegal and Tajikistan). Per-capita cropland area decreased almost twofold in Niger, which experienced high population growth and slow cropland expansion. Per-capita cropland area reduction can be an indicator of food insecurity in poor countries that rely on subsistence agriculture, whereas rich countries like Saudi Arabia can compensate for cropland area decline with food imports28,29. Several African countries with rapid cropland increase (Angola, Cote d’Ivoire, Democratic Republic of the Congo, Mozambique and Zambia) and South American countries with industrial export-oriented agricultural expansion (Brazil, Bolivia, Paraguay and Uruguay) increased per-capita cropland area. The Baltic states of Lithuania and Latvia had the largest increase of cropland per capita due to cropland gain through recultivation of agricultural lands abandoned in the 1990s, coupled with a sharp population decline (>20% reduction since 2000). Despite their small size, these countries are among the top 15 global wheat exporters.

More than three-quarters (77%) of the global population live in regions with per-capita cropland area and cropland NPP below the year 2019 global average (Extended Data Fig. 8). The lowest per-capita 2019 cropland NPP was in south-west Asia (40% of the global average), which has decreased by 7% since 2003. This finding is aligned with the decrease of per-capita cereal production in western Asia by 14% from 2000 to 2019, reported by the FAO22. The per-capita cropland area in south-east Asia in 2019 was half the global average. In contrast, the per-capita cropland areas and cropland NPP in North America, Europe and north Asia in 2019 were twice the global average. South America had nearly threefold higher per-capita cropland NPP than the global average, and it has increased by 59% since 2003. Although per-capita cropland area and cropland NPP has decreased by >10% in Australia and New Zealand since 2003, the region still led the world in both measures in 2019. Regions with per-capita cropland area and cropland NPP above the global average include the largest grain- and soybean-exporting countries (Australia, Argentina, Brazil, USA and Russia).

Regional accuracies (Table 3) highlight the limitations of the Landsat-based cropland maps. North and South America, which are dominated by large-scale industrial farming, have the highest accuracies. In Europe, Asia and Africa, the global map underestimates cropland area due to spatial resolution limitations in mapping heterogeneous landscapes. Cropland maps in Australia and New Zealand overestimate cropland area due to the inclusion of intensively managed permanent pastures, which are not always separable from crops using Landsat data. In addition, mapping change was shown to be more difficult, with accuracies generally lower across all regions. A probability-based sample analysis is the recommended good practice approach30 to estimating the extent of and change in land cover and land use, including croplands. The global cropland map time-series enables a higher sampling efficiency through stratification at the subnational, national and global scales. The difference between our sample-based and map-based cropland area estimates and the arable land area reported by the FAO partly related to the definitional inconsistency. The FAO country reports may include unused arable land and other agricultural land uses10,18, whereas our estimates represent the actively cultivated cropland area.

The annual MODIS NPP is the only publicly available, globally consistent data that reflect recent changes in primary production within croplands. NPP correlates with crop yield, biomass production and carbon sequestration, although variation of crop types and management practices precludes direct estimation of the crop yield from cropland NPP31. The MODIS dataset has been shown to underestimate NPP compared with process-based model estimations, especially for irrigated crops32. The difference in spatial resolution between Landsat-based cropland maps and MODIS-based NPP data may impede the analysis of cropland primary production within heterogeneous landscapes. Our Landsat-based, cropland extent time-series data can provide a useful input for improved NPP and crop yield modelling at higher spatial resolution and with better precision.

High-resolution, satellite-based synoptic data on cropland extent and change provide the basis for tracking progress towards sustainable food production and reduction of the environmental impact of agriculture expansion, as well as for applying crop condition monitoring to support decision-making33. Cropland extent is a key variable required to estimate emissions from agriculture and is, therefore, a part of the essential climate variables required for monitoring and modelling the Earth’s climate34. Locally relevant cropland map time-series enable the monitoring of land-use conversion within high-conservation-value ecosystems and protected areas35. The cropland extent map, integrated with other high spatial and temporal resolution data, such as forest change36 and surface water extent37, can provide a comprehensive overview of human-induced environmental change, which supports assessing the progress of individual countries towards SDGs.

Methods

Cropland-mapping extent and time intervals

The global boundaries for the cropland mapping were informed by the US Geological Survey (USGS) Global Food Security-Support Analysis Data at 30 m (GFSAD)11. The cropland mapping extent was defined using the geographic 1° × 1° grid. We included every 1° × 1° grid cell that contains cropland area according to the GFSAD. Small islands were excluded due to the absence of Landsat geometrically corrected data (Supplementary Fig. 1).

The cropland mapping was performed at 4-year intervals (2000–2003, 2004–2007, 2008–2011, 2012–2015 and 2016–2019). Use of a long interval (rather than a single year) increased the number of clear-sky satellite observations in the time-series, which improves representation of land-surface phenology and the accuracy of cropland detection. For each 4-year interval, we mapped an area as cropland if a growing crop was detected during any of these years. In this way, we implemented the criterion of the maximum fallow length: if an area was not used as cropland for >4 years, it was not included in the cropland map for the corresponding time interval.

Landsat data

We employed the global 16-day normalized surface reflectance Landsat Analysis Ready Data (Landsat ARD19) as input data for cropland mapping. The Landsat ARD were generated from the entire Landsat archive from 1997 to 2019. The Landsat top-of-atmosphere reflectance was normalized using globally consistent MODIS surface reflectance as a normalization target. Individual Landsat images were aggregated into 16-day composites by prioritizing clear-sky observations.

For each 4-year interval, we created a single annualized gap-free 16-day observation time-series. For each 16-day interval, we selected the observation with the highest near-infrared reflectance value (to prioritize observations with the highest vegetation cover) from 4 years of Landsat data. Observations contaminated by haze, clouds and cloud shadows, as indicated by the Landsat ARD quality layer, were removed from the analysis. If no clear-sky data were available for a 16-day interval, we filled the missing reflectance values using linear interpolation.

The annualized, 16-day time-series within each 4-year interval were transformed into a set of multitemporal metrics that provide consistent land-surface phenology inputs for global cropland mapping. Metrics include selected ranks, inter-rank averages and amplitudes of surface reflectance and vegetation index values, and surface reflectance averages for selected land-surface phenology stages defined by vegetation indices (that is, surface reflectance for the maximum and minimum greenness periods). The multitemporal metrics methodology is provided in detail19,38. The Landsat metrics were augmented with elevation data39. In this way, we created spatially consistent inputs for each of the 4-year intervals. The complete list of input metrics is presented in Supplementary Table 1.

Global cropland mapping

Global cropland mapping included three stages that enabled extrapolation of visually delineated cropland training data to a temporally consistent, global cropland map time-series using machine learning. At all three stages, we employed bagged decision tree ensembles40 as a supervised classification algorithm that used class presence and absence data as the dependent variables, and a set of multitemporal metrics as independent variables at a Landsat ARD pixel scale. The bagged decision tree results in a per-pixel cropland probability layer, which has a threshold of 0.5 to obtain a cropland map.

The first stage consisted of performing individual cropland classifications for a set of 924 Landsat ARD 1° × 1° tiles for the 2016–2019 interval (Supplementary Fig. 1). The tiles were chosen to represent diverse global agriculture landscapes. Classification training data (cropland class presence and absence) were manually selected through visual interpretation of Landsat metric composites and high-resolution data from Google Earth. An individual supervised classification model (bagged decision trees) was calibrated and applied to each tile.

At the second stage, we used the 924 tiles that had been classified as cropland/other land and the 2016–2019 metric set to train a series of regional cropland mapping models. The classification was iterated by adding training tiles and assessing the results until the resulting map was satisfactory. We then applied the regional models to each of the preceding 4-year intervals, thus creating a preliminary time-series of global cropland maps.

At the third stage, we used the preliminary global cropland maps as training data to generate temporally consistent global cropland data. As the regional models applied at the second stage were calibrated using 2016–2019 data alone, classification errors may arise due to Landsat data inconsistencies before 2016. The goal of this third stage was to create a robust spatiotemporally consistent set of locally calibrated cropland detection models. For each 1° × 1° Landsat ARD tile (13,451 tiles total), we collected training data for each 4-year interval from the preliminary cropland extent maps within a 3° radius of the target tile, with preference to select stable cropland and non-cropland pixels as training. Training data from all intervals were used to calibrate a single decision tree ensemble for each ARD tile. The per-tile models were then applied to each time interval, and the results were post-processed to remove single cropland class detections and omissions within time-series and eliminate cropland patches <0.5 ha. Manual masks to remove map artefacts (for example, cropland overestimation over temperate wetlands and flooded grasslands) were applied in some regions to improve the map quality. The final global cropland map time-series are available at https://glad.umd.edu/dataset/croplands.

Sample analysis

The sample analysis had two objectives: to estimate cropland area and its associated uncertainty and to assess cropland map accuracy. Sample interpretation and sample-based analysis were done only for the start (2003) and the end (2019) of the cropland-mapping interval. Accuracies of intermediate cropland maps (2007, 2011 and 2015) were not assessed, but were considered to be similar to those of the 2003 and 2019 maps due to implementation of the same classification model and consistently processed Landsat data41. The analysis was performed separately for each of the seven regions outlined in Extended Data Fig. 1, as well as globally. The regional boundaries were aligned with national boundaries to enable comparison with national data. Only land pixels were considered; pixels labelled as permanent water and snow/ice in the Landsat ARD data quality layer were excluded. In each region, we selected five strata based on the map time-series corresponding to stable croplands, cropland gain and loss, possible cropland omission area and other lands (Supplementary Tables 2 and 3). The possible cropland omission stratum (stratum 4) includes areas where omission errors are probable, specifically pixels that were not mapped as cropland and either (1) were identified as crops by the GFSAD11 or (2) had the decision tree-based cropland probability between 0.1 and 0.5. We randomly selected 100 sample units (Landsat data pixels) from each stratum (500 samples pixels per region, 3,500 in total).

Sample interpretation was performed visually using available remotely sensed data time-series, including Landsat ARD 16-day data, composites of selected multitemporal metrics and high-resolution images provided by Google Earth (Supplementary Fig. 2). Each sample pixel was interpreted by two experts independently and the disagreements were discussed and resolved by the research team. The interpretation legend includes the 2003–2019 cropland dynamics categories and land-use transition types. The sample reference data and interpretation results are available at https://glad.umd.edu/dataset/croplands.

Area estimation

The sample-based area estimation was performed following previously published methods42,43. The 2003 and 2019 total cropland area, stable crops, gross cropland loss and gain, and net change were estimated within each region separately, and for the entire world using equation (1). The area and the total number of Landsat pixels for each region and each stratum are provided in Supplementary Table 3. For each of the 100 sample pixels sampled in each stratum, pu was defined by class presence, for example, for 2003 cropland, pu = 0 (2003 cropland absence) or pu = 1 (2003 cropland presence). The pu was defined similarly for the 2019 cropland, stable crop, gross cropland loss and gain classes. For the net cropland area change, pu had values of 1 (cropland gain), −1 (cropland loss) and 0 (no change).

$$\hat A = \mathop {\sum}\nolimits_{h = 1}^H {A_h\bar p_h}$$
(1)

where \(\hat A\) is the estimated cropland/cropland change area,

Ah the area of stratum h,

H the number of sampling strata,

\(\bar p_h = \frac{{\mathop {\sum }\nolimits_{u \in h} p_u}}{{n_h}}\)the mean cropland/cropland change proportion of samples in stratum h; and

nh the sample size (number of sample pixels) in stratum h.

The s.e.m. of the area was estimated from the variances of cropland (or cropland dynamics category) class values of pu for sample pixels in each stratum using equation (2). The 95% CI was obtained by multiplying s.e.m. by 1.96:

$${\mathrm{s.e.}}\left( {\hat A} \right) = \sqrt {\mathop {\sum }\limits_{h = 1}^H A_h^2\left( {1 - \frac{{n_h}}{{N_h}}} \right)\frac{{s_{ph}^2}}{{n_h}}}$$
(2)

where \({\mathrm{s.e.}}\left( {\hat A} \right)\) is the s.e.m. of cropland/cropland change class area and

\(s_{ph}^2 = \frac{{\mathop {\sum }\nolimits_{u \in h} \left( {p_u - \bar p_h} \right)^2}}{{n_h - 1}}\) the sample variance for stratum h.

Proportion of land-use trajectories

We analysed the land-use trajectories of cropland loss and gain using reference sample data within cropland gain and loss strata only. Inclusion of sample pixels from other strata where cropland change was detected would have inflated the area of land-use trajectories that these pixels represent (that is, if a sample pixel from a stable cropland stratum was interpreted as cropland gain due to forest clearing, including the proportion of forest clearing from this large stratum, it will dominate the total regional estimate). The proportion of each land-use trajectory (within cropland gain and loss separately) was estimated from the sample and reported as the percentage of the total gain or loss along with its s.e.m. (Table 2). A combined ratio estimator for stratified random sampling43 was employed to estimate the percentages (equation (3)).

$$\hat R = \frac{{\mathop {\sum }\nolimits_{h = 1}^H A_h\bar y_h}}{{\mathop {\sum }\nolimits_{h = 1}^H A_h\bar x_h}} \times 100$$
(3)

where: \(\hat R\) is the estimated class proportion expressed as a percentage;

H the number of sampling strata;

Ah the area of stratum h;

\(\bar y_h = \frac{{\mathop {\sum}\nolimits_{u \in h} {y_u} }}{{n_h}}\) the sample mean of the yu values in stratum h, where yu = 1 if pixel u is classified as belonging to a specific transition in the reference sample interpretation, and yu = 0 otherwise; and

\(\bar x_h = \frac{{\mathop {\sum}\nolimits_{u \in h} {x_u} }}{{n_h}}\) the sample mean of the xu values in stratum h, where xu = 1 if pixel u is classified as any cropland loss/gain in the reference sample interpretation, and xu = 0 otherwise.

The s.e.m. of the estimated ratio of class proportion expressed as percentage was calculated using equation (4):

$${\mathrm{s.e.}}\left( {\hat R} \right) = \sqrt {\frac{1}{{\hat X^2}}\mathop {\sum }\limits_{h = 1}^H A_h^2\left( {1 - \frac{{n_h}}{{N_h}}} \right)\left( {s_{yh}^2 + \hat R^2s_{xh}^2 - 2\hat Rs_{xyh}} \right)/n_h} \times 100$$
(4)

where: \({\mathrm{s.e.}}\left( {\hat R} \right)\) is the s.e.m. of the estimated proportion expressed as a percentage;

Nh the total number of pixels in stratum h;

nh number of sample pixels in stratum h;

\(\hat X = \mathop {\sum }\limits_{h = 1}^H A_h\bar x_h\) the estimated total area of cropland loss/gain expressed in area units; and

\(s_{yh}^2\) and \(s_{xh}^2\) the sample variances in stratum h; and sxyh the sample covariance in stratum h estimated as follows:

$$s_{yh}^2 = \mathop {\sum}\nolimits_{u \in } {\left( {y_u - \bar y_h} \right)^2/\left( {n_h - 1} \right)}$$
$$s_{xh}^2 = \mathop {\sum}\nolimits_{u \in h} {\left( {x_u - \bar x_h} \right)^2/\left( {n_h - 1} \right)}$$
$$s_{xyh} = \mathop {\sum}\nolimits_{u \in h} {\left( {y_u - \bar y_h} \right)} \left( {x_u - \bar x_h} \right)/\left( {n_h - 1} \right).$$

Map accuracy

The map accuracy metrics include overall accuracy (the proportion of correctly mapped sample pixels), user’s accuracy of the cropland class (which reflects the cropland class commission) and producer’s accuracy of the cropland class (which reflects the cropland class omission)42. All accuracy metrics and respective s.e.m.s are presented as percentages (Table 3).

To estimate overall accuracy, we defined yu = 1 if pixel u is classified correctly and yu = 0 if pixel u is classified incorrectly. The estimator for overall accuracy is then expressed by equation (5), and s.e.m. for overall accuracy is computed using equation (6).

$$\hat O = \frac{{\mathop {\sum }\nolimits_{h = 1}^H N_h\bar y_h}}{N} \times 100$$
(5)

where: \(\hat O\) is the estimated overall accuracy, expressed as a percentage; H the number of sampling strata; Nh the total number of pixels in stratum h; N the total number of pixels in the reporting region; and \(\bar y_h = \mathop {\sum }\limits_{u \in h} y_u/n_h\) the sample mean of the yu values in stratum h.

$${\mathrm{s.e.}}\left( {\hat O} \right) = \sqrt {\frac{1}{{N^2}}\mathop {\sum}\nolimits_{h = 1}^H {N_h^2\left( {1 - n_h/N_h} \right)s_{yh}^2/n_h} } \times 100$$
(6)

where \({\mathrm{s.e.}}\left( {\hat O} \right)\) is the s.e.m. of the overall accuracy, expressed as percentage; nh the number of sample pixels in stratum h; and \(s_{yh}^2\) the sample variance: \(s_{yh}^2 = \mathop {\sum }\limits_{u \in h} \left( {y_u - \bar y_h} \right)^2/(n_h - 1).\) For estimating user’s accuracy of the croplands class, we defined yu = 1 if sample pixel u is correctly mapped as cropland, otherwise yu = 0, and xu = 1 if sample pixel u is mapped cropland, otherwise xu = 0. For the producer’s accuracy, we defined yu = 1 if sample pixel u is correctly mapped as cropland, otherwise yu = 0, and xu = 1 if sample pixel u is interpreted as cropland, otherwise xu = 0. The estimator of the user’s accuracy and producer’s accuracy was then expressed as a ratio estimator (equation (7)) and their s.e.m. calculated using equation (8), which are similar to equations (3) and (4), except that the strata were weighted by their total number of pixels (Nh) rather than the areas (Ah) for the purposes of map accuracy assessment (with pixel being the primary mapping unit):

$$\hat R = \frac{{\mathop {\sum }\nolimits_{h = 1}^H N_h\bar y_h}}{{\mathop {\sum }\nolimits_{h = 1}^H N_h\bar x_h}} \times 100$$
(7)

where \(\hat R\) is the estimated user’s/producer’s accuracy, expressed as a percentage.

$${\mathrm{s.e.}}\left( {\hat R} \right) = \sqrt {\frac{1}{{\hat X^2}}\mathop {\sum }\limits_{h = 1}^H N_h^2\left( {1 - \frac{{n_h}}{{N_h}}} \right)\left( {s_{yh}^2 + \hat R^2s_{xh}^2 - 2\hat Rs_{xyh}} \right)/n_h} \times 100$$
(8)

where \({\mathrm{s.e.}}\left( {\hat R} \right)\) is the s.e.m. of the estimated user’s/producer’s accuracy, expressed as a percentage.

\(\hat X = \mathop {\sum }\limits_{h = 1}^H N_h\bar x_h.\)

Cropland NPP

The cropland NPP was evaluated using the globally consistent Collection 6 MODIS-based, annual year-end gap-filled NPP product (MOD17A3HGF20). The product provides the sum of total daily NPP through the year at a 500-m spatial resolution (kg C m−2 year−1). The annual NPP data were resampled to our Landsat ARD data grid and were overlaid with the corresponding 4-year cropland maps to calculate total and per-unit area NPP for each region and each year. We used average annual NPP for each 4-year interval, except for the 2000–2003 interval, where a 3-year average was used instead to avoid using the year 2000 when MODIS data were incomplete. The s.d. of the annual estimates is provided as an uncertainty metric.

National total and per-capita cropland area and cropland NPP

For the national cropland area analysis, we used public geographic information systems (GIS) country boundaries from GADM (https://gadm.org).

We employed the 2019 Revision of World Population Prospects21 to calculate global, regional and national population for years 2003 and 2019. As the boundaries of analysis regions (Extended Data Fig. 1) are aligned with country boundaries, we were able to summarize the regional population totals from national data. The population data were related to our sample-based (for global and regional estimates) and map-based (for national estimates) cropland area to estimate per-capita cropland area and change. Similarly, we related regional cropland NPP to population data to estimate per-capita cropland NPP for 2003 and 2019.

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.