A global reference database of crowdsourced cropland data collected using the Geo-Wiki platform

Laso Bayas, Juan Carlos; Lesiv, Myroslava; Waldner, François; Schucknecht, Anne; Duerauer, Martina; See, Linda; Fritz, Steffen; Fraisl, Dilek; Moorthy, Inian; McCallum, Ian; Perger, Christoph; Danylo, Olha; Defourny, Pierre; Gallego, Javier; Gilliams, Sven; Akhtar, Ibrar ul Hassan; Baishya, Swarup Jyoti; Baruah, Mrinal; Bungnamei, Khangsembou; Campos, Alfredo; Changkakati, Trishna; Cipriani, Anna; Das, Krishna; Das, Keemee; Das, Inamani; Davis, Kyle Frankel; Hazarika, Purabi; Johnson, Brian Alan; Malek, Ziga; Molinari, Monia Elisa; Panging, Kripal; Pawe, Chandra Kant; Pérez-Hoyos, Ana; Sahariah, Parag Kumar; Sahariah, Dhrubajyoti; Saikia, Anup; Saikia, Meghna; Schlesinger, Peter; Seidacaru, Elena; Singha, Kuleswar; Wilson, John W

doi:10.1038/sdata.2017.136

Download PDF

Data Descriptor
Open access
Published: 26 September 2017

A global reference database of crowdsourced cropland data collected using the Geo-Wiki platform

Juan Carlos Laso Bayas ORCID: orcid.org/0000-0003-2844-3842¹,
Myroslava Lesiv¹,
François Waldner²,
Anne Schucknecht^3,4,
Martina Duerauer¹,
Linda See¹,
Steffen Fritz¹,
Dilek Fraisl¹,
Inian Moorthy¹,
Ian McCallum¹,
Christoph Perger¹,
Olha Danylo¹,
Pierre Defourny²,
Javier Gallego³,
Sven Gilliams⁵,
Ibrar ul Hassan Akhtar^6,7,
Swarup Jyoti Baishya⁸,
Mrinal Baruah⁸,
Khangsembou Bungnamei⁸,
Alfredo Campos^9,10,
Trishna Changkakati⁸,
Anna Cipriani^11,12,
Krishna Das⁸,
Keemee Das⁸,
Inamani Das⁸,
Kyle Frankel Davis^13,14,
Purabi Hazarika⁸,
Brian Alan Johnson¹⁵,
Ziga Malek¹⁶,
Monia Elisa Molinari¹⁷,
Kripal Panging⁸,
Chandra Kant Pawe ORCID: orcid.org/0000-0001-7926-5367⁸,
Ana Pérez-Hoyos³,
Parag Kumar Sahariah¹⁸,
Dhrubajyoti Sahariah⁸,
Anup Saikia⁸,
Meghna Saikia¹⁹,
Peter Schlesinger ORCID: orcid.org/0000-0002-4861-5202^20,21,
Elena Seidacaru²²,
Kuleswar Singha⁸ &
…
John W Wilson²³

Scientific Data volume 4, Article number: 170136 (2017) Cite this article

6645 Accesses
49 Citations
28 Altmetric
Metrics details

Subjects

Abstract

A global reference data set on cropland was collected through a crowdsourcing campaign using the Geo-Wiki crowdsourcing tool. The campaign lasted three weeks, with over 80 participants from around the world reviewing almost 36,000 sample units, focussing on cropland identification. For quality assessment purposes, two additional data sets are provided. The first is a control set of 1,793 sample locations validated by students trained in satellite image interpretation. This data set was used to assess the quality of the crowd as the campaign progressed. The second data set contains 60 expert validations for additional evaluation of the quality of the contributions. All data sets are split into two parts: the first part shows all areas classified as cropland and the second part shows cropland average per location and user. After further processing, the data presented here might be suitable to validate and compare medium and high resolution cropland maps generated using remote sensing. These could also be used to train classification algorithms for developing new maps of land cover and cropland extent.

Design Type(s)	database creation objective • image analysis objective • citizen science design
Measurement Type(s)	land cover
Technology Type(s)	image analysis
Factor Type(s)
Sample Characteristic(s)	Earth • area of cropland

Machine-accessible metadata file describing the reported data (ISA-Tab format)

A national-scale land cover reference dataset from local crowdsourcing initiatives in Indonesia

Article Open access 17 September 2022

A crowdsourced global data set for validating built-up surface layers

Article Open access 20 January 2022

Outlining where humans live, the World Settlement Footprint 2015

Article Open access 20 July 2020

Background & Summary

The spatial extent of cropland has been mapped from remote sensing via many different initiatives as part of global land cover mapping activities, e.g., GLC-2000¹, MODIS land cover², GlobeLand30³ and the Climate Change Initiative (CCI) of the European Space Agency⁴. Despite the availability of these and many other products, large spatial disagreement on the location and distribution of cropland still exists^5–7.

As such, quality-assured reference data are needed to undertake robust quantitative assessments and detailed comparisons of global products regarding their representation of cropland extent. Reference data sets can be collected in-situ, e.g., the Land Use Cover Area frame Sample (LUCAS) across EU member states⁸, but due to the high costs involved in field surveys, they are more often gathered through interpretation of high or very high resolution satellite imagery. Some of the reference data used to validate different global land cover products are now being made openly available, e.g., through the GOFC-GOLD validation portal⁹. Because these data sets are not specifically tailored to cropland validation, sample sizes are insufficient, making their efficacy in quality assessments questionable, especially given the lack of sensitivity of accuracy indices⁷.

To collect reference samples specifically designed for cropland map validation, we conducted a three-week cropland identification campaign during September, 2016. The campaign was implemented using the Geo-Wiki (http://www.geo-wiki.org/) crowdsourcing tool. A schematic showing the design and implementation of the campaign is illustrated in Fig. 1. A secondary motivation of the campaign was to gain a better understanding of crowdsourced data quality as well as the reasons why volunteers participate in crowdsourcing campaigns.

This campaign builds on previous crowdsourcing campaigns using Geo-Wiki¹⁰, e.g., to validate a map of land availability for biofuels¹¹ and to map wilderness globally¹², while the results from several campaigns were used to produce a global hybrid cropland map¹³, among others. The scope of the early campaigns was generally directed towards improving global land cover and land use reference data¹⁰, whereas the campaign described here focuses specifically on cropland data. In addition to validation, the data presented here also represent a valuable training tool that can be used to develop new land cover or cropland extent maps as well as to train algorithms to produce remote sensing-based products^14,15.

Methods

To develop the cropland validation campaign, cropland per se had to be defined, and a sample of systematically selected areas was generated. At the same time, the Geo-Wiki platform was modified to implement the campaign, the incentive scheme was developed, and the control data for quality assurance were collected. This section describes the main components of the campaign as outlined in Fig. 1.

Cropland definition

In order to distinguish cropland from other classes, the definition used for the campaign follows that of GEOGLAM/JECAM^16,17 in which ‘The annual cropland from a remote sensing perspective is a piece of land of a minimum of 0.25 ha (minimum width of 30 m) that is sowed/planted and harvestable at least once within the 12 months after the sowing/planting date. The annual cropland produces an herbaceous cover and is sometimes combined with some tree or woody vegetation’. According to this GEOGLAM/JECAM definition, perennial crops, agroforestry plantations, palm oil, coffee, tree crops and fallows are not included in the cropland class. The following exceptions to this definition were made:

Sugarcane plantations and cassava crops are included in the cropland class, although they have a longer vegetation cycle and are not planted yearly.
Taken individually, small plots, such as legumes, do not meet the minimum size criterion of the cropland definition. However, when considered as a continuous heterogeneous field, they are included in cropland.

Moreover, greenhouse crops cannot be monitored by remote sensing and are thus excluded from the definition. Note that the use of this definition may lead to underestimation of cropland in the situation where legumes or other crops are planted among tree crops such as fruit and nut trees or where fields were fallow for 1 or more years but still cultivated. This would not be picked up in the visual interpretation of the imagery using Geo-Wiki although the use of Google Earth historical imagery and the Normalized Difference Vegetation Index (NDVI) profiling tool may have helped to identify cropland in the latter situation.

Sampling design

A stratified systematic sampling procedure was applied to generate the sample locations where the validation would take place in frames/cells of 1°×1° (geographic coordinate system with latitude and longitude) across the globe. A given replicate corresponds to a relative location in each frame. The scheme was designed to correct the distortion of the non-equal area projection. These cells serve as an instrument for defining a first-phase sample.

The strata used were derived from the IIASA cropland probability map¹³ with the aim of sampling areas of lower or higher probability of misclassification with different rates. Areas with a cropland probability between 25 and 75% were assumed to be more difficult to classify and were therefore sampled with a higher rate, while areas with very low or very high probability of cropland were sampled at a lower rate as they are easier to classify. Table 1 summarizes the strata and distribution of samples in each stratum. The size for each stratum as well as the calculated weights that should be used for accuracy metrics are also shown.

Table 1 Strata, sample distribution and strata sizes in the cropland validation campaign.

Full size table

The sampling unit was a frame/pixel of 300 m×300 m corresponding to the grid of PROBA-V images and the final number of sampling units was 35,866.

Data collection using Geo-Wiki

The reference data were acquired through a dedicated Geo-Wiki interface (Fig. 2). Once a participant was registered and logged on, he/she could see a sample location where a semi-transparent 300×300 m frame subdivided in 25 grid cells is superimposed on Google Maps imagery (indicated by A in Fig. 2). Users were then asked to click (i.e., shade in yellow) all grid cells covered by more than 50% cropland. Thus, the final values for sampling units (i.e., a 300×300 m frame=one location) were cropland proportions ranging from 0 (absence of cropland) to 100%. When all sub-cells were examined, the user could either click the submit button or the skip button (indicated by B in Fig. 2) and was then shown the next randomly selected sample location. The user could also add comments regarding the observed location and then submit the validation. The cropland definitions were provided to the participants in an introductory video and through an info button in the Geo-Wiki interface. Additional tools and learning materials were provided to the participants to aid their interpretations. For example, in Geo-Wiki it is possible to switch between imagery from Google Maps and Microsoft Bing as well as viewing the location on OpenStreetMap (indicated by C in Fig. 2), which can provide additional useful information. The system registers whether a participant used imagery from Google Maps or not, which is included as a variable in the data set. Any location could also be saved as a keyhole markup language (kml) file for visualization using the desktop version of Google Earth (indicated by D in Fig. 2), which provides historical imagery, 3D viewing capabilities, geotagged photographs from Panoramio, etc. The usage of this feature was also registered in the data set. Participants were asked to use imagery from the latest date possible between Google Maps and Bing. Learning materials were compiled into an online gallery (Fig. 3), which provided the participants with different examples of cropland and non-cropland surfaces (http://www.geo-wiki.org/Application/modules/sigma_validation/sigma_gallery.html). Finally, it is possible to view different time series of vegetation indices, e.g., the NDVI (indicated by E in Fig. 2), obtained from different satellite sensors, i.e., Landsat 7, 8, MODIS and PROBA-V. These indices allowed participants to view the profiles of vegetation change over time at a particular location, which could help with satellite image interpretation, e.g., cropland is often characterized by a rapid increase in NDVI at growing stage after planting and a rapid decline near maturity stage or after harvesting.

**Figure 2: The Geo-Wiki interface (http://www.geo-wiki.org) for collecting cropland information based on image interpretation.**

Feedback was provided to participants as the campaign progressed using the Geo-Wiki Facebook page https://www.facebook.com/GeoWiki, which contained additional examples and a link to the YouTube explanatory video https://youtu.be/PR3xMPPyp-I showing how to use the interface. Participants could request help from experts for images that were difficult to classify and the answers were then posted to Facebook for all to view.

Quality control measures

Out of the total sample locations, 2,000 were randomly selected and validated by a group of three students trained in satellite imagery interpretation. The methodology for validation of control points was the same as for normal locations. These sample locations were compared for consistency, resulting in the removal of 207 sample units where there was disagreement in 3 or more grid cells/sub-pixels between the student validators. Additionally, independent verification was undertaken by experts at the International Institute for Applied Systems Analysis (IIASA) to ensure the quality of the control data set. Experts are members of IIASA staff with a background in remote sensing or geospatial sciences and considerable experience in image classification. This control data set was then used during the campaign, where participants received one control location for every 20 sample locations although this control location could appear at any point during the sequence of 20 samples. Each time a control location was viewed, the submission sent by the participant was compared with the control validation and a quality score was calculated for each participant as shown in Table 2. This, in combination with the amount of validations undertaken, was used to determine the participant’s ranking on the campaign leader board.

Table 2 Quality score calculation per location. Units for agreement are in number of grid cells/sub-pixels per 300 m×300 m location.

Full size table

The campaign aimed to validate all sample locations at least 3 times by different participants. The final result achieved was that the majority of locations (32,287) were validated 4 to 7 times. Control points were validated more often, sometimes more than once by the same person to check for consistency. Despite a technical problem in the middle of the campaign, where some validations done in the middle of the campaign were not recorded, the full sample of validations was obtained.

Incentives and motivations

The top 30 participants (ranked by quality score) had the option to choose between becoming a co-author on a scientific paper or receiving an Amazon gift voucher ranging in value from 50 to 750 EUR (Table 3) depending upon the final position on the campaign leader board. A total of 26 participants chose to be co-author. They were also asked to fill out a survey providing some basic information about themselves and details regarding their motivation in participating in the campaign.

Table 3 Financial rewards offered according to the final ranking of the participants.

Full size table

The same survey as that sent to the top 30 was also sent to the other participants where they were offered the following incentive: they were entered into a draw in which they could win one of two Amazon vouchers of €50 euros. A total of 20 additional answers were received.

From the 1,793 control locations, a further sub-sample of 60 locations was selected and then evaluated independently by three land cover experts at IIASA following the same methodology as a normal participant. These locations were then reviewed for consensus between the experts, creating a gold standard data set. Although the gold standard was not used to calculate the quality score, it is provided here as an additional data set for independent quality and reliability assessment. These 60 locations were evaluated by all participants sequentially in the middle of the campaign, although no notice was given to the participants and no changes to the Geo-Wiki interface were made.

Data Records

The data are presented in six different data records. The first three data records contain all of the grid cells marked as cropland by either the campaign participants (Data record 1, n=1,086,485), the controls from the trained students (Data Record 2, n=8,918) or the gold standard (Data record 3, n=582) and can be found in crop_all.txt, crop_con.txt and crop_exp.txt (Data Citation 1), respectively. The format and information contained in these first three data records is shown in Table 4. Note that when these data correspond to the control data or to data from the experts, the following fields are not present: comment, timestamp, used_gmaps, viewed_ge, and skip_reason. The userid field in Data Record 2 is the number 111,111 and Data Record 3 is the number 222,222.

Table 4 The format and field descriptions of data records containing all grid cells marked as cropland.

Full size table

Additionally, data records 4 to 6 show the information compiled per 300 m×300 m frame and per user, i.e., one record shows the average (mean) cropland from the 25 grid cells from a given user at a given location. Data Record 4 (n=203,515) contains data from all participants, data record 5 (n=1,793) contains the control data from the trained students while Data Record 6 contains the expert data (n=60). These data sets can be found in loc_all.txt, loc_con.txt and loc_exp.txt (Data Citation 1), respectively, while the format and field descriptions are provided in Table 5. As in data sets 2 and 3, the userid field in data record 5 is the number 111,111 and in data record 6 it is the number 222,222.

Table 5 Format and field descriptions of data records containing average (mean) cropland per frame/location and user.

Full size table

Technical Validation

Figure 4 illustrates the origin of the 50 participants who provided information on the post-campaign survey and their familiarity with the regions validated as well as general information. It is clear that the majority of participants were male (68%) with a background in research (62%), highly educated (92%), and between 20 and 39 years of age (72%). The largest number of participants were from India (17) although more than 20 countries were represented. Participants had varying knowledge of different parts of the world although there was no area where participants had zero familiarity. This may reflect the geographical spread of the participants and their backgrounds.

Figure 5a shows data collected during the campaign, expressed as the average (mean) cropland percentage per location and its global distribution. Figure 5b contains the IIASA-IFPRI hybrid cropland percentage map¹³, and it is provided as a reference; in general, the patterns of cropland between the two maps are similar. Figure 5c shows the number of times a location was validated, where the majority of locations were classified at least 3 to 5 times.

**Figure 5: Cropland validation campaign and worldwide spatial distribution of cropland.**

Usage Notes

The primary use of this reference data set is to validate global cropland maps generated using remote sensing that range from 60 to 300 m in resolution. More specifically, the data allows for an extensive spatially explicit validation of the cropland layer due to the rich amount of reference data. A validation exercise is planned for a 300 m cropland map that has been created for agricultural monitoring purposes as part of the FP7-funded SIGMA project (http://www.geoglam-sigma.info/). The data can also be used to train classification algorithms in developing new cropland maps based on remote sensing or to create hybrid cropland maps by fusing together existing cropland products¹³. Finally, it would be possible to use the data for studies about the quality of crowdsourced data.

Additional Information

How to cite this article: Laso Bayas, J.C. et al. A global reference database of crowdsourced cropland data collected using the Geo-Wiki platform. Sci. Data 4:170136 doi: 10.1038/sdata.2017.136 (2017).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Fritz, S. et al. Harmonisation, mosaicing and production of the Global Land Cover 2000 database (Beta Version) 41 Office for Official Publications of the European Communities, (2003).
Google Scholar
Friedl, M. A. et al. MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets. Remote Sensing of Environment 114, 168–182 (2010).
Article ADS Google Scholar
Chen, J. et al. Global land cover mapping at 30 m resolution: A POK-based operational approach. ISPRS Journal of Photogrammetry and Remote Sensing 103, 7–27 (2015).
Article ADS Google Scholar
Bontemps, S. et al. Consistent global land cover maps for climate modelling communities: Current achievements of the ESA’s land cover CCI. in Proceedings of the ESA Living Planet Symposium 2013 (2013).
Fritz, S. et al. Highlighting continued uncertainty in global land cover maps for the user community. Environmental Research Letters 6, 044005 (2011).
Article ADS Google Scholar
Fritz, S., See, L. & Rembold, F. Comparison of global and regional land cover maps with statistical information for the agricultural domain in Africa. International Journal of Remote Sensing 31, 2237–2256 (2010).
Article ADS Google Scholar
Waldner, F., Fritz, S., Di Gregorio, A. & Defourny, P. Mapping Priorities to Focus Cropland Mapping Activities: Fitness Assessment of Existing Global, Regional and National Cropland Maps. Remote Sensing 7, 7959–7986 (2015).
Article ADS Google Scholar
Gallego, F. J. Validation of GIS layers in the EU: getting adapted to available reference data. International Journal of Digital Earth 4, 42–57 (2011).
Article ADS Google Scholar
Tsendbazar, N. E., de Bruin, S. & Herold, M. Assessing global land cover reference datasets for different user communities. ISPRS Journal of Photogrammetry and Remote Sensing 103, 93–114 (2015).
Article ADS Google Scholar
Fritz, S. et al. A global dataset of crowdsourced land cover and land use reference data. Scientific Data 4, 170075 (2017).
Article Google Scholar
Fritz, S. et al. Downgrading recent estimates of land available for biofuel production. Environ. Sci. Technol. 47, 1688–1694 (2013).
CAS PubMed Google Scholar
See, L. et al. Harnessing the power of volunteers, the internet and Google Earth to collect and validate global spatial information using Geo-Wiki. Technological Forecasting and Social Change 98, 324–335 (2015).
Article Google Scholar
Fritz, S. et al. Mapping global cropland and field size. Glob Change Biol 21, 1980–1992 (2015).
Article ADS Google Scholar
Gengler, S. & Bogaert, P. Integrating crowdsourced data with a land cover product: A Bayesian data fusion approach. Remote Sensing 8, 545 (2016).
Article ADS Google Scholar
Waldner, F. et al. A unified cropland layer at 250 m for global agriculture monitoring. Data 1, 3 (2016).
Article Google Scholar
JECAM. JECAM Guidelines for cropland and crop type definition and field data collection version 1. Available from: http://www.jecam.org/JECAM_Guidelines_for_Field_Data_Collection_v1_0.pdf. (2014)
Waldner, F. et al. Towards a set of agrosystem-specific cropland mapping methods to address the global cropland diversity. International Journal of Remote Sensing 37, 3196–3231 (2016).
Article ADS Google Scholar

Data Citations

See, L. PANGAEA https://doi.org/10.1594/PANGAEA.873912 (2017)

Download references

Acknowledgements

The authors would like to thank the 80 volunteers who contributed to the campaign described in this publication. This research was supported by the ERC funded CrowdLand Project (No. 617754) and the SIGMA project (No. 603719).

Author information

Authors and Affiliations

International Institute for Applied Systems Analysis (IIASA), Laxenburg, Austria
Juan Carlos Laso Bayas, Myroslava Lesiv, Martina Duerauer, Linda See, Steffen Fritz, Dilek Fraisl, Inian Moorthy, Ian McCallum, Christoph Perger & Olha Danylo
Université catholique de Louvain (UCL)—Earth and Life Institute, Louvain-la-Neuve, Belgium
François Waldner & Pierre Defourny
European Commission—Joint Research Centre (JRC), Ispra, Italy
Anne Schucknecht, Javier Gallego & Ana Pérez-Hoyos
Department of Atmospheric Environmental Research, Karlsruhe Institute of Technology (KIT), Garmisch-Partenkirchen 82467, Germany
Anne Schucknecht
Vlaamse Instelling voor Technologisch Onderzoek (VITO), Mol, Belgium
Sven Gilliams
COMSATS Institute of Information Technology, Islamabad, Pakistan
Ibrar ul Hassan Akhtar
Pakistan Space and Upper Atmosphere Research Commission (SUPARCO), Islamabad, Pakistan
Ibrar ul Hassan Akhtar
Gauhati University, Guwahati, India
Swarup Jyoti Baishya, Mrinal Baruah, Khangsembou Bungnamei, Trishna Changkakati, Krishna Das, Keemee Das, Inamani Das, Purabi Hazarika, Kripal Panging, Chandra Kant Pawe, Dhrubajyoti Sahariah, Anup Saikia & Kuleswar Singha
Taguay, Córdoba, Argentina
Alfredo Campos
Instituto de Clima y Agua, Instituto Nacional de Tecnología Agropecuaria (INTA), Buenos Aires, Argentina
Alfredo Campos
Dipartimento di Scienze Chimiche e Geologiche, University of Modena and Reggio Emilia, Modena, Italy
Anna Cipriani
Lamont-Doherty Earth Observatory of Columbia University, Palisades, New York, USA
Anna Cipriani
The Earth Institute, Columbia University, New York, USA
Kyle Frankel Davis
The Nature Conservancy, New York, USA
Kyle Frankel Davis
Institute for Global Environmental Strategies, Kamiyamaguchi, Hayama, Japan
Brian Alan Johnson
Vrije Universiteit, Amsterdam, Netherlands
Ziga Malek
Politecnico di Milano, Milano, Italy
Monia Elisa Molinari
B.Borooah College, Guwahati, 781007, India
Parag Kumar Sahariah
Don Bosco College of Engineering and Technology, Guwahati, India
Meghna Saikia
The Tropical Agriculture Research and Higher Education Center (CATIE), Turrialba, Costa Rica
Peter Schlesinger
University of Idaho, Moscow, USA
Peter Schlesinger
TomTom, Amsterdam, Netherlands
Elena Seidacaru
Department of Zoology and Entomology, University of Pretoria, Pretoria, South Africa
John W Wilson

Authors

Juan Carlos Laso Bayas
View author publications
You can also search for this author in PubMed Google Scholar
Myroslava Lesiv
View author publications
You can also search for this author in PubMed Google Scholar
François Waldner
View author publications
You can also search for this author in PubMed Google Scholar
Anne Schucknecht
View author publications
You can also search for this author in PubMed Google Scholar
Martina Duerauer
View author publications
You can also search for this author in PubMed Google Scholar
Linda See
View author publications
You can also search for this author in PubMed Google Scholar
Steffen Fritz
View author publications
You can also search for this author in PubMed Google Scholar
Dilek Fraisl
View author publications
You can also search for this author in PubMed Google Scholar
Inian Moorthy
View author publications
You can also search for this author in PubMed Google Scholar
Ian McCallum
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Perger
View author publications
You can also search for this author in PubMed Google Scholar
Olha Danylo
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Defourny
View author publications
You can also search for this author in PubMed Google Scholar
Javier Gallego
View author publications
You can also search for this author in PubMed Google Scholar
Sven Gilliams
View author publications
You can also search for this author in PubMed Google Scholar
Ibrar ul Hassan Akhtar
View author publications
You can also search for this author in PubMed Google Scholar
Swarup Jyoti Baishya
View author publications
You can also search for this author in PubMed Google Scholar
Mrinal Baruah
View author publications
You can also search for this author in PubMed Google Scholar
Khangsembou Bungnamei
View author publications
You can also search for this author in PubMed Google Scholar
Alfredo Campos
View author publications
You can also search for this author in PubMed Google Scholar
Trishna Changkakati
View author publications
You can also search for this author in PubMed Google Scholar
Anna Cipriani
View author publications
You can also search for this author in PubMed Google Scholar
Krishna Das
View author publications
You can also search for this author in PubMed Google Scholar
Keemee Das
View author publications
You can also search for this author in PubMed Google Scholar
Inamani Das
View author publications
You can also search for this author in PubMed Google Scholar
Kyle Frankel Davis
View author publications
You can also search for this author in PubMed Google Scholar
Purabi Hazarika
View author publications
You can also search for this author in PubMed Google Scholar
Brian Alan Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Ziga Malek
View author publications
You can also search for this author in PubMed Google Scholar
Monia Elisa Molinari
View author publications
You can also search for this author in PubMed Google Scholar
Kripal Panging
View author publications
You can also search for this author in PubMed Google Scholar
Chandra Kant Pawe
View author publications
You can also search for this author in PubMed Google Scholar
Ana Pérez-Hoyos
View author publications
You can also search for this author in PubMed Google Scholar
Parag Kumar Sahariah
View author publications
You can also search for this author in PubMed Google Scholar
Dhrubajyoti Sahariah
View author publications
You can also search for this author in PubMed Google Scholar
Anup Saikia
View author publications
You can also search for this author in PubMed Google Scholar
Meghna Saikia
View author publications
You can also search for this author in PubMed Google Scholar
Peter Schlesinger
View author publications
You can also search for this author in PubMed Google Scholar
Elena Seidacaru
View author publications
You can also search for this author in PubMed Google Scholar
Kuleswar Singha
View author publications
You can also search for this author in PubMed Google Scholar
John W Wilson
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.C.L.B., M.L., F.W., A.S., M.D., L.S., S.F., D.F., I.Mo., I.Mc., C.P. and O.D. contributed to the conception, planning and implementation of the crowdsourcing campaign, including scientific feedback to participants as the campaign ran. J.C.L.B., M.L., F.W. and L.S. wrote the paper while A.S., M.D., S.F., I.Mo., I.Mc., O.D., P.D., J.G., I.u.H.A., A.Ca., A.Ci, K.F.D., B.A.J., Z.M., P.S. and J.W.W. provided useful edits and suggestions. M.D. and C.P. programmed the Geo-Wiki interface and extracted the data from the Geo-Wiki database. The following co-authors (I.u.H.A., S.J.B., M.B., K.B., A.Ca., T.C., A.Ci, K.D, K. Das, I.D., K.F.D., P.H., B.A.J., Z.M., M.E.M., K.P., C.K.P., A.P.-H., P.K.S., D.S., A.S., M.S., P.S., E.S., K.S., and J.W.W.) were ranked in the top 30 after the campaign was finished, having provided the largest amount of high quality reference data during the campaign.

Corresponding author

Correspondence to Juan Carlos Laso Bayas.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

ISA-Tab metadata

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.

Reprints and permissions

About this article

Cite this article

Laso Bayas, J., Lesiv, M., Waldner, F. et al. A global reference database of crowdsourced cropland data collected using the Geo-Wiki platform. Sci Data 4, 170136 (2017). https://doi.org/10.1038/sdata.2017.136

Download citation

Received: 18 April 2017
Accepted: 02 August 2017
Published: 26 September 2017
DOI: https://doi.org/10.1038/sdata.2017.136

This article is cited by

A labelled dataset to classify direct deforestation drivers from Earth Observation imagery in Cameroon
- Amandine Debus
- Emilie Beauchamp
- Emily R. Lines
Scientific Data (2024)
Assessing and addressing the global state of food production data scarcity
- Endalkachew Abebe Kebede
- Hanan Abou Ali
- Kyle Frankel Davis
Nature Reviews Earth & Environment (2024)
A global land cover training dataset from 1984 to 2020
- Radost Stanimirova
- Katelyn Tarrio
- Mark A. Friedl
Scientific Data (2023)
A national-scale land cover reference dataset from local crowdsourcing initiatives in Indonesia
- Hadi
- Ping Yowargana
- Florian Kraxner
Scientific Data (2022)
A crowdsourced global data set for validating built-up surface layers
- Linda See
- Ivelina Georgieva
- Steffen Fritz
Scientific Data (2022)

Subjects

Abstract

Similar content being viewed by others

Background & Summary

Methods

Cropland definition

Sampling design

Data collection using Geo-Wiki

Quality control measures

Incentives and motivations

Data Records

Technical Validation

Usage Notes

Additional Information

References

References

Data Citations

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

ISA-Tab metadata

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links