Background & Summary

Often featured among promising climate change mitigation measures, NT systems, including conservation agriculture (CA), contribute to environmental preservation and sustainable agricultural production1,2. NT is expected to mitigate soil degradation, improve soil structure and water retention properties3,4,5. Several studies indicate that this cropping system can provide a large range of positive environmental externalities such as increased biodiversity, enhanced carbon sequestration and improved soil quality through an increase in soil organic matter6,7,8,9,10. However, the productivity of NT systems compared to conventional cropping systems is still controversial. Since the productivity of NT depends on several interacting factors such as climatic conditions11, soil characteristics1,12, and other agricultural management activities13,14,15,16,17,18,19, the potential of NT to increase agricultural productivity remains highly uncertain.

Several studies1,12,20,21,22 have been conducted to synthetize the current evidence on the productivity in NT systems. Some of these studies relied on global datasets including results of field experiments comparing NT and CT cropping systems over a large range of soil and climate conditions. However, these datasets do not include the most recent published experiments, and provide no or limited information on soil characteristics, climate variables, and management practices. In particular, information on fertilizer inputs, weed and pest control, and intra- and inter- annual climatic variability are frequently missing. Other studies comparing NT and CT rely on a limited number of experiments, are only conducted at a regional scale, or did not make their data fully available23,24,25. Thus, a global dataset reporting findings from the most recent field experiments and including information about a wide range of climatic parameters, soil characteristics and agricultural management practices is still lacking.

To address this gap, we present an updated and extended dataset comparing CT and NT productivity including the most recently published experimental studies, and a detailed description of their environmental characteristics and management practices. Our dataset contains 4403 paired (NT vs. CT) yield observations collected between 1980 and 2017 for eight major staple crops in 50 countries. For each experiment, we provide information on soil texture, pH, the year and month of crop planting and harvesting, the location of the experiment, fertilization, weed and pest control practices, crop type, crop rotation, crop residue management, and crop irrigation. Besides soil characteristics and information on management practices, we also report a large range of climate variables derived from several external databases. These include precipitation, potential evapotranspiration, average temperature, maximum temperature, and minimum temperature during the crop growing season. This dataset can prove useful to disentangle the effects of soil, climate and agronomic drivers of crop yields when comparing NT with CT systems.

Methods

Data collection

The literature search was done in February 2020 using the following keywords ‘Conservation agriculture/No-till/No tillage/Zero tillage’ & ‘Yield/Yield change’ in the websites ‘ScienceDirect’, ‘Science Citation Index (web of science)’. A total of 1012 potentially relevant papers were identified by reviewing the title and abstract, and these papers were then screened according to the procedure summarized in Fig. 1. Papers not reporting yield data for CT and NT systems were excluded, as well as papers reporting experiments on reduced tillage (RT) systems. Papers reporting only mean yield data across different years or sites were also excluded. We then checked whether information on fertilization, weed and pest control, crop irrigation, crop rotation and crop residue management were reported for both CT and NT practices. After these screening and selection steps, all relevant data were manually extracted from the selected papers, including general information about the paper, location and year of the experiment, the number of years under NT when the crop was sown, soil characteristics, crop growing season, crop type, crop management practices and crop yield of CT and NT. However, due to a large number of missing data, the crop growing season, climatic variables and soil characteristics were finally collected through several external databases (Online-only Table 1). The growing season information was generated from a crop calendar database26,27 based on the crop type and the locations of the experiments reported in the papers. The precipitation, average temperature in the growing season were extracted from the UDel_AirT_Precip data provided by NOAA/OAR/ESRL PSL28. The maximum and minimum air temperature during the growing season were generated from CPC Global Temperature data provided by NOAA/OAR/ESRL PSL29 and the potential evapotranspiration data over the growing season were extracted from GLEAM database30,31. Soil textures were collected from the HWSD database32 using the locations of the experimental sites reported in the selected papers (see Online-only Table 1 for additional details). The experiments for which it was not possible to obtain the requested information from the external databases were excluded. The final dataset includes the results extracted from 413 papers (published between 1983 to 2020), 4403 paired yield observations from NT and CT for 8 major crop species, including 370 observations for barley (232 observations for spring barley and 138 for winter barley), 94 observations for cotton, 1690 observations for maize, 195 observation for rice, 160 observations for sorghum, 583 observations for soybean, 61 observations for sunflower, 1250 observations for wheat (1041 observations for winter wheat and 209 observations for spring wheat) in 50 countries from 1980 to 2017 (Fig. 2).

Fig. 1
figure 1

Flow chart of paper collection and selection.

Fig. 2
figure 2

Distribution of experimental sites. The size of the circle indicates the number of observations, and the colours indicates the crop species.

Data Records

All data are available on the figshare repository33, which can be accessed through the link: https://doi.org/10.6084/m9.figshare.12155553. Four files are provided:

  1. 1.

    “Database.csv” includes the data.

  2. 2.

    “Summary of the database.docx”, includes the summary of dataset which explains the definition or assumption for each column in the dataset.

  3. 3.

    “List of references.docx” reports the references of the studies from which data were extracted.

  4. 4.

    “Code.zip”, includes all the codes used in this study.

Online-only Table 1 shows the metadata of our dataset. Six main categories of data are provided:

Category I covers authors, publishing journal and the publishing year.

Category II reports general information about the experiments, including country, location (villages or cities), latitude, longitude of experiment site, soil type and pH at experimental sites, number of replicates, crop types, the initial year of NT practice, crop planting/harvesting month/year, and the period since the initial year of NT practice.

Category III covers information about agricultural management activities under both NT and CT systems (data availabilities of those activities were shown in Fig. 3):

  • Crop rotation with at least 3 crops involved (based on the crop rotation principle of CA defined by FAO34): “Yes”, “No”, “Not reported”. The details of crop sequence are also provided.

  • Soil cover: “Yes”, “No”, “Mixed”, “Not reported”. Details of residue management for the previous crops are also provided.

  • Weed and pest control: “Yes”, “No”, “Mixed”, “Not reported”.

  • Field fertilization: “Yes”, “No”, “Mixed”, “Not reported”. The details of N input and other fertilizer input are also provided.

  • Crop irrigation: “Yes”, “No”, “Mixed”, “Not reported”. The details of the amount of water applied are also provided.

Fig. 3
figure 3

Data availability and break-down of the different crop management practices and NT implementation periods reported in the dataset.

Category IV contains detailed information about the experiment site, experiment setting, management activities, depending on the papers, it may also include the type and quantity used of fertilizer, herbicide, or pesticide.

Category V corresponds to data related to crop yield. It includes the paired crop yields under CT (YieldCT) and NT (YieldNT) systems. The relative yield changes is defined as \(\frac{Yiel{d}_{NT}\,-\,Yiel{d}_{CT}}{Yiel{d}_{CT}}\). The column “Yield increase with NT” reports whether the differences between YieldCT and YieldNT are positive or negative (“Yes” indicates that crop yield is increased with NT practice, while “No” indicates that yield is not increased).

Category VI includes data extracted from the external databases, including crop growing season, climate variables (including precipitation, potential evapotranspiration, minimum/average/maximum temperature) during the growing season, and soil texture.

Crop growing season is defined by a start month and end month, which were extracted from the external crop calendar databases26 of spring barley, winter barley, cotton, maize, rice, sorghum, soybean, sunflower, spring wheat and winter wheat based on the crop type and study sites. Data on the crop calendar corresponds to averaged data and does not change intra-annually, thus the growing season extracted may be different from the actual growing season.

The climatic variables from the external databases are:

  • Accumulated precipitation (P) during the growing season (sum of monthly precipitations during the growing season),

  • Accumulated potential evapotranspiration (E) (sum of monthly evapotranspiration rates during the growing season),

  • Precipitation balance (PB), defined as PB = P – E12,

  • Average air temperature (Tave) during the growing season,

  • Maximum air temperature (Tmax): the maximum value among the daily temperatures in the growing season,

  • Minimum air temperature (Tmin): the minimum value among the daily temperatures in the growing season.

Soil texture was extracted from an external database based32 on the experiments’ locations. In total seven texture classes were included: sandy loam, loam, silt loam, sandy clay loam, clay loam, sandy clay and clay.

Technical Validation

To ensure the reliability of the information collected from the papers, we carefully checked and compared all the collected data with the original paper several times. Quality control of the database was conducted based on outlier detection. For each crop, the outliers of crop yield in CT system and NT system were identified based on the Interquartile Rule35 outlier detection method. For each crop species, an interquartile range (IQR) is defined as the difference between the first and third quartile of crop yield, and a threshold is calculated by adding 1.5 IQR to the third quartile. Any yield data beyond this threshold is flagged as an outlier for the crop species considered. The ratio of crop yield in NT and CT systems \(\left(\frac{Yiel{d}_{NT}}{Yiel{d}_{CT}}\right)\) were also calculated. All outliers and all the observations with a ratio higher than 2 were checked and compared with the values reported in the original papers one more time.

The crop yield values reported in our dataset are consistent with results of previous published studies. Comparing crop yield data of NT and CT, the adoption of NT practice overall leads to a yield decrease (Fig. 4a). A similar trend of crop production decrease with NT was reported in previous studies1,12,20,36. We also find that the combination of NT with crop rotation and soil cover (known as CA) trends to increase crop yield compared to NT practice without rotation and soil cover (Fig. 4a), which is also in line with previous studies1,16,37. Further analysis conducted on each crop confirms that NT tends to decrease the yield of maize1, rice21, and wheat1 (Fig. 4b). The productivity of NT is found higher under dry conditions compared with wetter conditions (Fig. 4c), and similar trends were reported in previous studies1,12.

Fig. 4
figure 4

Comparison of crop yield between NT and CT systems. The boxplots indicate the distributions (min, 1st quartile, median, 3rd quartile, max) of the log yield ratio of NT to CT. The mean log yield ratios of NT to CT were calculated based on a linear mixed effect model and marked as the red diamonds in the boxplots. Statistical tests were conducted to test the significance of the estimated values, ***indicates P-value < 0.001, **indicates P-value < 0.01, *indicates P-value < 0.05, indicates P-value < 0.1. Plot (a) shows the mean log ratios for different types of NT systems vs. CT systems. NToverall represents all the experiments involving NT systems in the dataset, NT-R-SC represents the NT systems without crop rotation and without soil cover, NT+R+SC represents the CA systems or NT systems with both crop rotation and soil cover, and CT is the corresponding control in the experiments. Plot (b) shows the mean log ratios for different crop species. S. indicates spring, while W. indicates winter. Plot (c) shows the mean log ratios for different levels of PBs, corresponding to different level of water stress.

Usage notes

Our dataset can be used to analyze the factors influencing the productivity of NT (or CA) vs. CT. It is possible to train machine learning models to predict the probability of yield increase with NT (or CA) system (e.g. Supplementary Figs. 1 and 2) or the range of yield changes resulting based on the soil type, climate and agronomic inputs provided by this dataset. Global maps of probability of yield increase with NT (or CA) or the range of yield changes can be generated based on the outputs of machine learning models trained with our dataset, and enable policy-makers or agricultural advisors to identify the most promising regions for CA implementation. Details of how to train machine leaning models with our dataset are provided in Supplementary Materials38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162,163,164,165,166,167,168,169,170,171,172,173,174,175,176,177,178,179,180,181,182,183,184,185,186,187,188,189,190,191,192,193,194,195,196,197,198,199,200,201,202,203,204,205,206,207,208,209,210,211,212,213,214,215,216,217,218,219,220,221,222,223,224,225,226,227,228,229,230,231,232,233,234,235,236,237,238,239,240,241,242,243,244,245,246,247,248,249,250,251,252,253,254,255,256,257,258,259,260,261,262,263,264,265,266,267,268,269,270,271,272,273,274,275,276,277,278,279,280,281,282,283,284,285,286,287,288,289,290,291,292,293,294,295,296,297,298,299,300,301,302,303,304,305,306,307,308,309,310,311,312,313,314,315,316,317,318,319,320,321,322,323,324,325,326,327,328,329,330,331,332,333,334,335,336,337,338,339,340,341,342,343,344,345,346,347,348,349,350,351,352,353,354,355,356,357,358,359,360,361,362,363,364,365,366,367,368,369,370,371,372,373,374,375,376,377,378,379,380,381,382,383,384,385,386,387,388,389,390,391,392,393,394,395,396,397,398,399,400,401,402,403,404,405,406,407,408,409,410,411,412,413,414,415,416,417,418,419,420,421,422,423,424,425,426,427,428,429,430,431,432,433,434,435,436,437,438,439,440,441,442,443,444,445,446,447,448,449,450.

The crop yield data for 2018 and later can be extracted from the identified papers, but since some key climatic variables are missing in the external database for this time period (in particular, evapotranspiration), those data are not listed in the dataset provided. We will update the dataset once we have the latest data access to the missing climate variables.

Importantly, our dataset could be easily updated using data produced by new experiments. We welcome anyone interested to share data or papers not included in this meta-database to send them to the corresponding author (YS, yang.su@inrae.fr). We will maintain and add the new observations in the future to expand our dataset with the latest experimental data.