Background & Summary

Key environmental challenges arising from global change drivers (e.g., land use change, climate change, pollution, (over-)exploitation of resources, invasions) need to be overcome to manage the continuously growing pressures on countries’ natural resources. To address these key challenges and move towards sustainable development, we need to more rapidly provide relevant information on these drivers and their changes. This information can then be used to monitor environmental impacts in near-real time, and assess progress stemming from new policies (such as European directives or national policies), thereby allowing an evaluation of whether these changes have a positive or negative impact on these policies or if new, adapted policies may be formulated1. Earth Observation (EO) data from ground-based, airborne or satellite-borne instruments provide an effective way to monitor environmental changes.

Emerging global trends of (1) free and open data access policies for Landsat and Sentinel data; (2) the increasing provision of Analysis Ready Data (ARD) from EO satellites and (3) provision of open source software for managing and exploiting EO data, enables monitoring environmental changes at various spatial and temporal scales while complementing traditional data sources such as national statistics, administrative data or census information1,2,3. Significant work has recently been done to lower barriers and facilitate the access of end-users to harness the full potential of EO data, and to address mandates, national processes, or reporting obligations4. Earth Observation Data Cubes (EODCs) have emerged as a technology to manage, access and analyse Big EO Data, thereby strengthening connections between data providers, applications and end-users5.

Switzerland, alongside with Australia6, adopted this approach very early and provided an operational EODC, combined with EO Analysis Ready Data (ARD) covering the national territory. The Swiss Data Cube (SDC – http://www.swissdatacube.ch) is an initiative supported by the Swiss Federal Office for the Environment (FOEN) and developed, implemented and operated by the United Nations Environment Program (UNEP)/GRID-Geneva in partnership with the University of Geneva (UNIGE), the University of Zurich (UZH), and the Swiss Federal Institute for Forest, Snow and Landscape Research (WSL). It serves several purposes: (1) automated processing of satellite imagery to transform low level processed data into information products. (2) solving the conformity (i.e., standardization of measurements) issue of Earth Observation7, supporting Multilateral Environmental Agreements; and (3) allowing to test new methodologies for monitoring our environment.

In line with the above purposes, the SDC automatizes the generation of information products by providing easy access to, and tools to analyse, synoptic data facilitating research into such domains as climate, vegetation, agriculture, urban areas, water quality, snow coverage or generally all changes of our biotic, abiotic and human altered environment. Data access to complex data is facilitated by transforming them into a coherent and co-registered space-time format that is ready for analysis. In order to achieve conformity, the SDC allows product comparison and direct access to Swiss national in-situ sampling networks to validate the output8. Finally, SDC serves as a ‘sandbox’, where based on open-science principles information and design of new algorithms and methodologies for monitoring our environment is facilitated for all users. In addition, data producers are invited to ingest their data into the SDC, as well as adding new analytical methods, improving the possibilities of the analyses even further. Finally, the SDC provides substantial potential for the private sector, offering transparent, comprehensive and synoptic EO data for the purpose of self-evaluating carbon footprints or other environmental impact measures.

Methods

The swiss data cube

The Swiss Data Cube (SDC) initiative has started in 2016 under the mandate of the Swiss Federal Office for the Environment (FOEN). UNEP/GRID-Geneva and University of Geneva were mandated to explore the potential of the Data Cube technology initially developed in Australia9 to efficiently exploit freely and openly available satellite EO data; these are increasingly made freely available by agencies including the United States Geological Survey (USGS), the National Aeronautics and Space Agency (NASA) Landsat program, and the European Commission’s Copernicus and Sentinel programs10,11. After a successful initial phase where the SDC had been implemented by the GRID-Geneva and the University of Geneva, the University of Zurich/Remote Sensing Laboratories (UZH/RSL) and the Swiss Federal Institute for Forest, Snow and Landscape Research (WSL), joined the initiative to foster the use of EO data for environmental monitoring at national scale. The objective of the SDC is to deliver unique and near real-time capabilities to access and analyse EO data, enabling more effective responses to problems of national significance. It can provide the long-term and baseline data required to determine trends, quantify past and present changes, and inform future decisions. This near real-time information can be readily used as an evidence base for the design, implementation and evaluation of policies, programs and regulation, as well as for developing policy advice. Ultimately, it can support the Swiss government for environmental monitoring and reporting commitments, while enabling national scientific institutions to benefit from satellite EO data for research and innovation. Indeed, this technology is significantly improving the way non-expert users can work with EO data ready for analysis; for example, it reduces the time and scientific knowledge required to handle satellite imagery by automating the complex tasks of searching, downloading and pre-processing scenes, while at the same time facilitating the processing of large amount of satellite data12.

The SDC is built on the Open Data Cube (ODC – https://www.opendatacube.org) software suite. The ODC is an open source project, initiated by Geoscience Australia, the Commonwealth Scientific and Industrial Research Organization (CSIRO), the USGS, NASA and the Committee on Earth Observations Satellites (CEOS)13. It is designed to provide a framework to access, store, manage, and analyse large quantities of gridded satellite EO data collections. In particular, the ODC enables the cataloguing of large amounts of satellite EO data; it provides a Python-based Application Programming Interface (API) for data analysis and allows the tracking of data provenance for quality control and updates14 (Fig. 1).

Fig. 1
figure 1

Swiss Data Cube general architecture and software components. (adapted from https://medium.com/opendatacube/what-is-open-data-cube-805af60820d7 and https://www.opendatacube.org/overview).

The systematic and regular delivery of Analysis Ready Data (ARD)15 is fundamental to facilitate the generation of usable information products and supporting the development of end-users applications. CEOS defines ARD as “satellite data that have been processed to a minimum set of requirements and organized into a form that allows immediate analysis with a minimum of additional user effort and interoperability both through time and with other datasets”16. ARD reduces the burden of full utilization of satellite data by providing specifications that limits data preparation efforts to generate relevant, consistent, normalized, interoperable data. These specifications save time, efforts and minimizes the cost of pre-processing data while capitalizing on knowledge and expertise of users by allowing them to spend more time in analysing their data, rather than searching and pre-processing them. The requirements concern parameters such as radiometric and geometric calibration, atmospheric correction, and metadata descriptions16. In optical imagery, the ARD level corresponds to surface reflectance products17 whereas in radar imagery it corresponds to radiometrically normalised (terrain-flattened) backscatter18.

Considering the fact that currently data providers such as the USGS Earth Resources Observation and Science (EROS) Center Science Processing Architecture (ESPA) (for Landsat data) and the Copernicus Open Access Hub (for Sentinel data) are not yet commonly generating ARD products (they usually deliver top of the atmosphere reflectance (L1C) while surface reflectance (L2A) archives are not yet complete), a fundamental element to consider is to have standardised and effective methods for generating ARD products and ensuring that all data ingested and stored in a data cube are consistent. Most of the steps of these procedures can be automated to search, download, and pre-process data for various data holdings, and managing different types of data (e.g., Landsat, Sentinel) to generate ARD products. To reach this objective, the Live Monitoring of Earth Surface (LiMES) framework19 has been used to generate Landsat 5,7,812; Sentinel-120; Sentinel-221 ARD products that are stored in the SDC. LiMES is a framework based on composable chains of interoperable services for automated EO data discovery, access and (pre-)processing to convert EO data into environmental monitoring information products (Fig. 2).

Fig. 2
figure 2

Common workflow for generation of optical Analysis Ready Data products.

Data acquisition and analysis ready data products generation

The SDC holds Analysis Ready EO satellite Data since 1984 for the entire Switzerland. This archive, updated on a daily basis, contains (at the time of writing) 12’435 scenes corresponding to a total volume of 5 TB and more than 1000 billion observations/pixels. It stores data from the two largest EO data providers, the United States Landsat program and the European Copernicus Sentinel program22,23. Each satellite characteristics and ARD workflow are described hereafter. The temporal resolution (i.e., revisit time) for Landsat is 16 days, for Sentinel-1 is 6 days (12 per satellite, 6 in tandem)24 and for Sentinel-2 is 5 days (10 days per satellite, 5 in tandem)25.

Landsat 5-7-8 are sun-synchronous satellites jointly operated by the USGS) and NASA26,27. They are essentially designed for land applications such as Earth resources, land surface, environmental monitoring, agriculture and forestry, disaster monitoring and assessment, ice and snow cover28. The Landsat 5 mission lasted from 1984 to 2013 and carried two sensors, the Multispectral Scanner System (MSS) and the Thematic Mapper (TM). Landsat 7 mission started in 1999 and will be decommissioned in 2021. The spacecraft carried the Enhanced Thematic Mapper (ETM+) sensor. The sensor’s Scan-Line Corrector (SLC) failed in July 200329 and approximately 225 of the pixels per scene have since then not been scanned. However, the spatial and spectral quality of the remaining 78% of pixels images remain valid30. Landsat 8 was launched in 2013 and the mission is expected to continue until 202331,32,33. A new generation of sensors were carried on Landsat 8, the Operational Land Imager (OLI) and Thermal Infrared Sensor (TIRS). Further details on these missions can be found in the CEOS Earth Observation Handbook for Landsat 5 (http://database.eohandbook.com/database/missionsummary.aspx?missionID=226), Landsat 7 (http://database.eohandbook.com/database/missionsummary.aspx?missionID=349), and Landsat 8 (http://database.eohandbook.com/database/missionsummary.aspx?missionID=547)15,26,32,34. The Landsat archive is the longest EO program, initiated in 1972, it has been providing continual and consistent observations for almost 50 years32. Since 2008, the complete data archive has been provided under a free and open access policy35,36. This has enabled dense time-series analysis, moving beyond simple diachronic comparison of a set of images, therefore dramatically improving capabilities to monitor environmental changes37. To cover the whole of Switzerland, it requires eight Landsat scenes (Path/Row: 193/027, 194/027, 195/027, 196/027, 193/028, 194/028, 195/028, 196/028) representing an area of latitude 44.9 to 48.7 and longitude 4.1 to 12.8. Data are downloaded as Collection 1/Tier 1 – Level 2 Surface Reflectance encompassing a surface of approximately 185 km by 180 km15. Collection 1/Tier 1 scenes are data with the highest available data quality (e.g., geometric and radiometric corrections) and considered suitable for time-series analysis38 (https://www.usgs.gov/land-resources/nli/landsat/landsat-collection-1). Level 2 corresponds to surface reflectance (i.e., the estimate based on Landsat sensor observations of the fraction of incoming solar radiation reflected from Earth’s surface). These data are corrected for atmospheric perturbations (e.g., aerosol scattering, thin clouds) enabling direct comparison between multiple images and dates. This corresponds to the ARD level. Two different models are applied to generate these ARD products. Landsat 5 and 7 TM are corrected with the Landsat Ecosystem Disturbance Adaptive Processing System (LEDAPS) algorithm39 whereas Landsat 8 OLI applies the Land Surface Reflectance Code (LaSRC) algorithm40. Both models use auxiliary climate data from MODIS and a radiative transfer model to evaluate atmospheric conditions over a given scene (https://www.usgs.gov/land-resources/nli/landsat/landsat-surface-reflectance).

As part of the SDC, a Python script has been implemented to search, download, and ingest Landsat data. This script generates a list of available and not yet ingested scenes for a given coverage and based on the scene ID, the required data are downloaded. Requests for Level 2 data are submitted via an API to the USGS Earth Resources Observation and Science (EROS) Center Science Processing Architecture (ESPA) On Demand Interface (https://espa.cr.usgs.gov). Once data are downloaded, they are directly ingested into the Swiss Data Cube, and a copy of the original data is kept as a backup. This workflow is updated on a weekly basis to ensure that the archive is always up to date. Currently, the number of ingested Landsat scenes correspond to 5643 images (L5: 2467 images; L7: 2146 images; L8: 1030 images), with a total volume of 1.3 TB (L5: 547 GB; L7: 429 GB; L8 347 GB) covering the years from 1984 up to the present day41,42,43. The yearly growth of the archive accounts for ~300 images and ~100 GB.

Sentinel-1 (S1) is a constellation of currently two sun-synchronous satellites launched by the European Space Agency (ESA) in April 2014 (Sentinel-1A) and April 2016 (Sentinel-1B) respectively. Sentinel-1C and -1D are funded and will be launched after 2023. Each satellite carries a C-band Synthetic Aperture Radar (SAR) with 4 modes of operation. The most-commonly operated mode over land, and the one we used as input was the “interferometric wide-swath mode” (IW), which acquires data at VV and VH polarisations over Europe. The sensor is used e.g., to monitor sea ice zones and the arctic environment as well as marine environments. Given a set of single-look-complex (SLC) input data, they can be used to estimate land surface motion over short and long time-scales using synthetic aperture radar interferometry (InSAR). In our case, ground-range-detected (GRD) products were used as input, so retrievals based on the phase-difference were not possible. The satellites can provide mapping in support of humanitarian aid in crisis situations. A Sentinel-1 Next Generation (S1NG) is being planned for the 2030’s. Detailed descriptions of the sensors are provided in the CEOS EO Handbook for Sentinel-1A (http://database.eohandbook.com/database/missionsummary.aspx?missionID=575) and Sentinel-1B (http://database.eohandbook.com/database/missionsummary.aspx?missionID=576)44. To produce Sentinel-1 Analysis Ready Radiometrically Terrain Corrected (RTC) products, ground range detected (GRD) products were first downloaded from the EU open access source scihub.copernicus.eu and multilooking in the ground range and azimuth dimensions (10 looks in each case) was applied to reduce data volume and expedite processing. The Shuttle Radar Topography Mission (SRTM) digital elevation model (DEM) available at 3 arcsecond resolution was used for reference45. The DEM was used as an input to an image simulation of the local contributing area within each radar product image sample. That area was used to normalise the backscatter (rather than the ellipsoid model otherwise typically used) in ground range geometry. Then that normalised backscatter from each available polarisation (VV & VH) was terrain-geocoded (orthorectified) into the chosen map coordinates, producing a level-1 RTC product46. A comparison19 between products from the original UZH RTC software39 against later implementations of the same algorithm showed older editions of the ESA SNAP software introduced artefacts – this issue is being addressed by ESA with the SNAP software maintainers.

Sets of multiple RTC products (each holding VV & VH polarisations) acquired from different orbit tracks within a defined temporal window were then combined using local resolution weighting47 into level-3 multitemporal backscatter composite products. The region covered is latitude 44.5 to 48.5 N, longitude 5.5-11E, filling 20GB per year at 90 m spatial resolution and 6 day temporal spacing48. Higher spatial and temporal resolutions (e.g. Copernicus DEM at 30 m) are planned for the future.

Sentinel-2 (S2) is a constellation of two sun-synchronous satellites operated by ESA respectively from 2015 to 2022 (Sentinel-2A) and 2017–2024 (Sentinel-2B)25. It supports land monitoring related services, including generation of generic land cover maps, risk mapping and fast images for disaster relief, generation of leaf coverage leaf chlorophyll content and leaf water content49,50. Both satellites carry the MultiSpectral Instrument (MSI) and further details can be found at: Sentinel-2 A (http://database.eohandbook.com/database/missionsummary.aspx?missionID=552) and Sentinel-2 B (http://database.eohandbook.com/database/missionsummary.aspx?missionID=553)25,44. They will be completed and replaced by Sentinel-2 C/D where launch is expected from 2021. Switzerland is covered by twelve Sentinel-2 scenes (ID: 31TGM, 31TGN, 32TLR, 32TLS, 32TLT, 32TMR, 32TMS, 32TMT, 32TNS, 32TNT, 32TPS, 32TPT). This corresponds to a spatial extent of latitude 45.0 to 47.9 N, longitude 5.5 to 11.8E. Data are downloaded as L1C products corresponding to 100 × 100 km2 geometrically corrected Top Of the Atmosphere (TOA) tiles25,51. To efficiently download, access, pre-process and ingest data, a python script has been developed to handle these different steps executed in nightly mode, and to make Sentinel-2 ARD available in the Swiss Data Cube. The first operation is to search and download data by querying the Copernicus Open Access Hub as well as Cloud storage facilities such as those provided by Amazon and Google. The implementation supports both the Open Data Protocol API (https://scihub.copernicus.eu/userguide/ODataAPI) and the gsutil tool (https://cloud.google.com/storage/docs/gsutil). The most efficient combination for S2 download speed, stability, storage, and readiness is to access data from the Google Cloud with data in Cloud Optimized GeoTIFF format (https://www.cogeo.org)21. It ensures good performance for data discovery (i.e., generating a list of available tiles) and access (i.e., fast download). Once data are downloaded, they enter a second step to correct the disturbances caused by the atmosphere and generate normalized surface reflectance that correspond to the ARD level (i.e., Level 2 A). For that, we use the ESA Sen2Cor algorithm (http://step.esa.int/main/third-party-plugins-2/sen2cor/)52 version 2.5.5 (2.8.0 version is not used as it is not able to process products before 2017). The algorithm follows two steps to pre-process data: first, it executes an image classification to identify and generate masks of clouds, cloud shadows, snow and water. Second, it applies an atmospheric correction model to convert TOA values into surface reflectance. Sen2Cor provides additional outputs on Aerosol Optical Thickness (AOT) and Water Vapour (WV)53. In the S2 ARD workflow, all bands are pre-processed and ingested except the band 10 that is only used for atmospheric corrections and consequently not relevant for users. It is important to mention that topographic correction is not applied as it creates strong artefacts; this can be seen by simply comparing Level 1 and Level 2 products of the same scene available on Google Cloud Platform (which applied topographic correction with sen2cor 2.8.0). Once data are pre-processed, they are then ingested into the Swiss Data Cube at a 20 m spatial resolution and are henceforward readily available for users. It should be noted that it has been decided to keep a copy of the original downloaded data because S2 data are stored in a rolling archive in the Copernicus Open Access Hub. This allows reprocessing the entire archive in case new atmospheric correction or more accurate topographic correction algorithms become available.

Finally, for users to fully benefit from the different satellite data types available in the Swiss Data Cube, we ensure that all observations (pixels and tiles) perfectly overlap. This means that one 30 m Landsat pixel covers exactly nine 10 m Sentinel-2 pixels without modification of data (e.g., resampling). This is done by extending the geographical extent in order to get latitude and longitude extent with finite multiple of Landsat and Sentinel-2 pixels and tiles. This is an important aspect to consider, allowing users to benefit from both sensors and to develop time-series analysis that uses multi-sensor data in a consistent way. This workflow is updated on a daily basis to ensure that the archive is always up to date. Currently, the number of S2 ingested tiles corresponds to 6792 images for a total volume of 3.6 TB covering the years from 2015 up to the present days54. The yearly growth of the archive accounts for 2000 images and 1 TB.

To ensure consistent and accurate provision of ARD, an important aspect to consider is the need to have a reprocessing strategy. Indeed, data providers can update their specifications (e.g., geometric, atmospheric corrections) and this may influence time-series analysis by breaking the consistency of data. Since the beginning the ARD generation workflow of the SDC is based on the LiMES framework providing a composable chain of interoperable services (Fig. 2)19. It is supported by large storage capacities and high-performance distributed computers providing a scalable, flexible and efficient processing environment for EO data. We have decided to keep a copy of the original data downloaded from the different repositories. One can argue that it duplicates data, but we consider that is important to have a full copy of the archive on our premises for reprocessing data. For example, if in the future we wish to use a different software component for atmospheric corrections, such as FORCE55 or MAJA56, we can easily replace the existing components in the pre-processing chain and then reprocess entirely the archive. Updated ARD (e.g., Landsat 5, Sentinel-2) or data products (e.g., NDVI time-series) are then possibly versioned to ensure the consistency of the different products and disclaimers/description are mentioned in the metadata description. With such an approach we seek to minimize the possible impacts to the wide variety of users’ expertise, many expecting error-free data. Moreover, as further described in the technical validation section, each datasets have quality information to help users to determine their suitability for specific applications.

Data Records

The final dataset includes all Landsat 5-7-8, Sentinel-1, and Sentinel-2 processed at the Analysis Ready Data level. The five collections (Table 1) are freely available and accessible (for registered users) using the Python Application Programming Interface (API) at: http://sdc.unepgrid.ch:8080 or the web-based Graphical User Interface: http://sdc.unepgrid.ch.

Table 1 ARD collections description stored in the SDC.

A static copy of each collection is made available at the University of Geneva Research Data repository (https://yareta.unige.ch)41,42,43,48,54.

Each collection is described with according metadata following the ISO19115 standard and description are available at: https://geonetwork.swissdatacube.org

Landsat 5: https://geonetwork.swissdatacube.org/geonetwork/srv/eng/catalog.search?node=srv#/metadata/4dc0defe-68c3-4546-8608-33c5c8351c11

Landsat 7: https://geonetwork.swissdatacube.org/geonetwork/srv/eng/catalog.search?node=srv#/metadata/f475c001-1bee-4b53-9a70-dd34a80f29cf

Landsat 8: https://geonetwork.swissdatacube.org/geonetwork/srv/eng/catalog.search?node=srv#/metadata/e1ad9b5d-2287-4cd0-9b89-08ab4cf627f6

Sentinel-1: https://geonetwork.swissdatacube.org/geonetwork/srv/eng/catalog.search?node=srv#/metadata/a7f13bda-7df4-40ec-a059-73bf21553eb3

Sentinel-2: https://geonetwork.swissdatacube.org/geonetwork/srv/eng/catalog.search?node=srv#/metadata/fbc005c9-9168-47af-beb2-0862e8325622

In addition, metadata and data are available as Open Geospatial Consortium (OGC) standards web services endpoints to ensure interoperable discovery, visualization and download.

These endpoints enable users to access and/or integrate these datasets in their desktop, web-based clients (Fig. 3) or own specific analysis workflows.

Fig. 3
figure 3

An example of SDC data visualized in the Swiss Data Cube Viewer.

The Swiss Data Cube can be easily updated with other satellite EO data collections (e.g., MODIS, Sentinel-5P). As new data will be organized and pre-processed following the protocols presented in this paper, new data steams can be readily included.

Technical Validation

All downloaded data are checked by a data curator before being pre-processed and ingested. Each satellite data scene/tile should be documented through specific metadata that provides sufficient information for pre-processing and ingestion.

Landsat data are provided by USGS with Quality Assessment (QA) information to help users to determine their suitability for specific applications. An 8-bit LandsatLook Quality Image and 16-bit Quality Assessment Band57 are also included. Details on each file are described at: https://www.usgs.gov/land-resources/nli/landsat/landsat-collection-1-level-1-quality-assessment-band. Level 2 products are generated by the USGS from level 1 product and using the official LEDAPS/LASRC algorithm. “A Pixel Quality Assurance (pixel_qa) band is provided with all Landsat Surface Reflectance-derived Spectral Indices. The band is in unsigned 16-bit format, values are bit-packed and provide information pertaining to a pixel condition of fill, clear, water, cloud shadow, snow, cloud (yes/no), cloud confidence and cirrus cloud confidence (Landsat 8 only)” https://www.usgs.gov/land-resources/nli/landsat/landsat-sr-derived-spectral-indices-pixel-quality-band.

Sentinel-1: The geometry of Sentinel-1 data is well calibrated: data can be geocoded without any tiepoints to an accuracy of a few centimetres58,59, far better than is possible with optical sensors. Sentinel-1 radiometric stability is monitored and calibrated within the Sentinel-1 Mission Performance Centre60. A review of the quality of multiple implementations of radiometric terrain correction (RTC) processing was published in 201920. The quality of the terrain correction and radiometric corrections depends on the quality of the input DEM made available. Participants in the Copernicus programme (currently not including Switzerland) are able to access a world-wide high-quality DEM with 30 m resolution, with 10 m models available in some regions.

Sentinel-2 processed to level 2 A with Sen2Cor comes with a scene classification (similar to the pixel_qa band provided by USGS for Landsat data) with 12 classes as described at: https://earth.esa.int/web/sentinel/technical-guides/sentinel-2-msi/level-2a/algorithm. It helps identifying pixels that are saturated or defective as well as cloud and cloud shadows that may affect the quality of the images. In addition, ESA provides a monthly status of the quality of Sentinel-2 data through Data Quality Reports (DQR). These reports document geometric and radiometric performances against initial sensor specifications together with observed anomalies and issues. DQR are available at: https://sentinels.copernicus.eu/web/sentinel/data-product-quality-reports

Usage Notes

The Analysis Ready Data provided by the Swiss Data Cube (SDC) contributes to provide information that are synoptic, consistent and spatially explicit, and therefore ideal to monitor environmental changes across the country. These can be provided at pixel level, or aggregated at various administrative levels (communes, cantons or national scale). It can contribute to national policies1,61, support Sustainable Development Goals62,63, and monitor land changes64,65 (e.g., snow, agriculture, urban, biodiversity, water quality,…). Ultimately, in-situ measurements (e.g., on-the-ground sensors) are essential and should be used in conjunction to calibrate and validate generated remotely-sensed data products.

Currently, the SDC is funded on a project-based approach; in the near future, however, a more sustainable funding mechanism is called for, one possibility being through a support for a national digital infrastructure. To reach this objective, several steps are envisioned to demonstrate the effectiveness of the SDC. The first phase is completed as researchers from UNEP/GRID-Geneva, University of Geneva, University of Zurich and the Swiss Federal Institute for Forest, Snow and Landscape (WSL) are able to use this technology for environmental monitoring in Switzerland. In a second phase, we will encourage more beneficiaries at the Federal, Cantonal and Communal level to test the SDC as an information service for decision makers. We believe that a collaborative, open-science approach66, freely accessible to everyone from both the public and private sectors, will greatly pay back all investments and exceed them by providing validated information services. Researchers are also supporting other countries in developing data cube technology and could (accounting for the funds and IT infrastructure) provide data cubes on demand to any location in the world at short notice. Finally, researchers are planning to extend this information service and make it compatible with other data sources, such as cadastral, statistical and meteorological data. The system will provide interfaces to other existing infrastructures. Currently, the approach is based on freely available data such as Landsat and Sentinel. However, the SDC is able to ingest different types of geospatial data provided by various sources… Alternative data cubes on meteorological data, on cadastral and statistical (socio-economic indicators) could be designed by other partners, offering the possibility to cross these data cubes for new types of analysis. This could create a national base digital infrastructure, with significant gain in production, easy analysis, and allow envisioning developing digital replicas of different aspects of the Swiss natural systems (i.e. digital twins).

For researchers, data preparations (i.e., download, stacking the bands, various corrections) are time consuming processes with limited scientific interest. SDC provides “Analysis Ready Data” by automatizing the image processing to a level where the real scientific analysis can start. It removes the “tedious tasks” so that scientists can concentrate on the real value added, i.e., the design of algorithms for retrieving indices, image classifications, trend and time series analysis, environmental assessments, change detections and associated scientific analysis. It offers centralized access to several sensors (so far Landsat, Sentinel-1 and Sentinel-2) all pre-processed using best standards. The quality is therefore granted and homogeneous, ensuring comparability between data. Furthermore, it also allows for the fusion between e.g., radar and passive optical sensor imagery. Finally, the development (on-going) of a solid IT infrastructure based on parallel computing will allow several computing methodologies to be tested. Currently, the main focus in this direction is to port the SDC platform on the High-Performance Computing (HPC) environment available at UNIGE (Baobab cluster - https://baobabmaster.unige.ch/enduser/enduser.html). This will allow different users to work in parallel and have their own individual development environment. The second phase of this initiative is to go even further and improve the processing performances by parallelizing the SDC not only at the data level but also at algorithms level, using specialized parallelization libraries, such as DASK (https://dask.org/).

SDC can provide the basis for environmental protection, which has societal, economic and ecological consequences (e.g. erosion/degradation of agricultural areas), it has implications also in economy, ecology (e.g. biodiversity67), food production and security. Importantly, this technology can be used for quickly assessing the impacts of policies on the ground (e.g., water quality, urban development and agricultural policies) and for production of indices (i.e., drought, vegetation and snow). SDC can be used to support decisions on land planning, and for assessing trends to infer on futures scenarios. Climate change and pressures on ecosystems require timely decisions based on science. SDC provides a complete coverage of the territory and is frequently updated. It is ideal for providing rapid monitoring68 on many topics and it is cost efficient, as most of the transformation of data to information is automatized.

SDC generates the baseline for the service industry to grow. A condition for this growth is reliable and validated data and algorithms from an independent, but highly trustworthy source. SDC could be used as a tool for estimating the impact of environmental damage or environmental hazards on infrastructures in a coherent way and at a national level. This would be beneficial for the (re-)insurance sector and other financial services such as commodity trading, or more generally, every private sector which is affected by environmental issues or relying on earth observational data. Other estimation areas: economic impacts on the agricultural sector, food production and forecasting of the impact of environmental change to potentially reconsider current practices (e.g., water management), funding of start-up ventures, which depend on the trend in snow coverage for example in the building of new ski resorts, etc.

SDC implements available tools to make data discovery, access and processing as interoperable as possible69. It implements all relevant standards promoted by the Open Geospatial Consortium (OGC) such as the Web Map Service (WMS) for data visualization; Web Coverage Service (WCS), Catalog Service for the Web (CSW) and ISO19139 for metadata; and Web Processing Service (WPS) for processing. It will ultimately comply with the FAIR data principles to make data Findable, Accessible, Interoperable and Reusable70,71. This will help to contribute to relevant initiative such the Digital Switzerland72 or the Global Earth Observation System of Systems73 to support decisions and actions through coordinated, comprehensive and sustained EO data and information74.