RaVÆn: unsupervised change detection of extreme events using ML on-board satellites

Růžička, Vít; Vaughan, Anna; De Martini, Daniele; Fulton, James; Salvatelli, Valentina; Bridges, Chris; Mateo-Garcia, Gonzalo; Zantedeschi, Valentina

doi:10.1038/s41598-022-19437-5

Download PDF

Article
Open access
Published: 08 October 2022

RaVÆn: unsupervised change detection of extreme events using ML on-board satellites

Vít Růžička^1,8,
Anna Vaughan²,
Daniele De Martini¹,
James Fulton³,
Valentina Salvatelli^4,8,
Chris Bridges⁵,
Gonzalo Mateo-Garcia^6,8 &
…
Valentina Zantedeschi^7,8

Scientific Reports volume 12, Article number: 16939 (2022) Cite this article

4777 Accesses
8 Citations
124 Altmetric
Metrics details

Subjects

Abstract

Applications such as disaster management enormously benefit from rapid availability of satellite observations. Traditionally, data analysis is performed on the ground after being transferred—downlinked—to a ground station. Constraints on the downlink capabilities, both in terms of data volume and timing, therefore heavily affect the response delay of any downstream application. In this paper, we introduce RaVÆn, a lightweight, unsupervised approach for change detection in satellite data based on Variational Auto-Encoders (VAEs), with the specific purpose of on-board deployment. RaVÆn pre-processes the sampled data directly on the satellite and flags changed areas to prioritise for downlink, shortening the response time. We verified the efficacy of our system on a dataset—which we release alongside this publication—composed of time series containing a catastrophic event, demonstrating that RaVÆn outperforms pixel-wise baselines. Finally, we tested our approach on resource-limited hardware for assessing computational and memory limitations, simulating deployment on real hardware.

Towards global flood mapping onboard low cost satellites with machine learning

Article Open access 31 March 2021

Gonzalo Mateo-Garcia, Joshua Veitch-Michaelis, … Dietmar Backes

In-orbit demonstration of a re-trainable machine learning payload for processing optical imagery

Article Open access 27 June 2023

Gonzalo Mateo-Garcia, Josh Veitch-Michaelis, … Pierre Philippe Mathieu

Tracking vegetation phenology across diverse biomes using Version 2.0 of the PhenoCam Dataset

Article Open access 22 October 2019

Bijan Seyednasrollah, Adam M. Young, … Andrew D. Richardson

Introduction

Satellite observations of the Earth’s surface provide vital data for diverse environmental applications, including disaster management^1,2, landcover change detection³, and ecological⁴, urban⁵ and agricultural⁶ monitoring. Currently, Earth observation (EO) satellites collect and downlink raw or low-compression-rate images for further processing on the ground⁷. Limitations in downlink capacity and speed result in delayed data availability and inefficient use of ground stations. This adversely impacts time-sensitive applications such as disaster management where data is required at low latency to inform time-critical decision making. This problem is set to worsen as the sensing resolution and number of EO satellites in orbit increase⁸, together with further restrictions on the radio-frequency spectrum and licensing availability.

One solution is to identify on-board the most useful data for a particular scenario and prioritise this for rapid downlink. Although on-board processing of payload data using machine learning has long been recognised as a potential method to improve efficiency^9,10, recent advances in hardware and machine learning that make deployment feasible have lead to a resurgence of interest in this area⁷. In recent years supervised classifiers have successfully been tested in orbit to segment clouds^11,12 and floods¹³, with proposed applications such as storm identification¹⁴. Still, supervised classifiers have the significant drawback in that only events of a particular type determined at training time will be identified. The model is therefore unable to generalise to new event types, imager specifications, sensor degradation, scene lighting or local features.

In this work, we present RaVÆn , a new fully-unsupervised novelty-detection model that avoids the limitations inherent in supervised classifiers and is suitable for deployment on remote sensing platforms. We use a Variational Auto-Encoder (VAE)¹⁵ to generate a latent representation of incoming sensor data over a particular region. A novelty score is assigned to this data using the distance in the latent space between representations from consecutive passes. This technique offers a substantial advantage over existing supervised methods as any change between passes can be detected on-board, regardless of the availability of training data for specific event types. Furthermore, even in situations of compound events, this general approach supports the detection of all types of changes present in the region.

We evaluate the performance of this model in detecting changes in land-surface observations from the Sentinel-2 Multispectral Instrument¹⁶ on a dataset of time series of images of natural disasters. Four event types where rapid response by emergency agencies is essential are included: floods, landslides, wildfires and hurricanes. RaVÆn is demonstrated to assign higher novelty scores to regions of known change, and outperforms classical image differencing computer-vision baselines requiring 60$\times$ less onboard storage. We further demonstrate via experiments on constrained hardware that emulates on-board processors that this model is suitable for deployment on a remote sensing platform.

The rest of the paper is organized as follows: in “Background” we frame our proposal in the context of machine learning and satellite onboard deployment, in “Data” we present the datasets that we used for training the model and the test dataset collected for this work which is made public with this paper, in “Methodology” the proposed methodology for change detection and baseline models are presented, “Experimental setup” discusses the metrics and hardware that we used for the experiments; finally, “Results” shows the results of the models and baselines on the annotated dataset and benchmarks the proposed model on constrained hardware and “Conclusion” summarises the conclusions of the paper.

Background

Anomaly detection

The use of VAEs has been explored for unsupervised anomaly detection in¹⁷, where the model reconstruction error is used as anomaly score. Our approach differs in that, instead of basing our predictions on the reconstruction error of a single input, which has been shown in¹⁸ to be an unreliable indicator in the unsupervised context, we consider a sequence of input images from the same location and score them based on their distance in the VAE’s latent space. We could also represent a sequence of images (or extracted tiles) as a single data point, where we would later use methods such as for example Reed-Xiaoli¹⁹, or Mixture of Gaussians to detect anomalous and background sequences. Our problem is however better framed as change detection.

Change detection

The need for annotations of supervised change detection techniques, such as siamese networks in²⁰, can be reduced using active learning approaches as demonstrated in²¹, but then it still lacks in terms of generality. The main challenge of unsupervised change detection is being able to distinguish changes of interest from spurious change due to noise. Many existing approaches^22,23,24 achieve this by combining dimensionality reduction techniques, such as Principal Component Analysis^25,26, and clustering, such as k-means, to detect only relevant change between images of consecutive passes. Approaches based on neural networks (see²⁷ for a review) rely instead on supervised auxiliary tasks, such as semantic segmentation, to extract informative features that are then used to detect change in a time series. Our method leverages neural networks without requiring supervision at any stage. Our chosen method is closest to the work of²⁸, however the focus of on-board deployment is novel.

ML deployment on satellites

Deploying machine learning models onboard remote sensing platforms has been identified as a potential solution to optimize downlinking communication and onboard storage^13,29,30. Most of current public and commercial missions such as e.g. ESA PRISMA³¹ or Maxar’s WorldView-3 acquire images on demand when they are tasked from the ground (specifically when they flight over an specific area of interest). Even global missions, such as Sentinel-1 or Sentinel-2 acquire images only over certain pre-configured conditions (more frequently over Europe and over land locations respectively). Our proposal demonstrates that with relatively simple ML models we can deploy autonomous algorithms to decide onboard which tiles within an acquisition shall be downlinked based on the retrieved observation. With this system we aim to showcase a change of paradigm in Remote Sensing operations: from a regime where sensors acquire and downlink data based on ad-hoc manual configurations to autonomous acquisitions where the sensing platform continuously evaluate if retrieved data has value to prioritise its download or discard it.

Data

As part of this study, we compile and release a new dataset to evaluate the proposed unsupervised change detection models. Images are taken from the Sentinel-2 multi-spectral imager (MSI) instrument¹⁶ (using the L1C processing level of the data) from which we use the ten highest resolution channels with all channels interpolated to the highest resolution of 10m. Training data are taken from the WorldFloods dataset¹³ (Fig. 1a), with a total of 233 scenes and a time series of five images per scene.

The RaVÆn dataset

The validation set consists of 19 scenes captured from Sentinel-2, containing one of four classes of disasters: hurricanes, fire burn scars, landslides and floods (Fig. 1b). We identified events in each of these classes through an extensive search of Sentinel-2 records aided by the Copernicus EMS system³². Each event in the validation set consists of a time series of five images where the first four images are taken before the disaster occurred while the fifth image is taken afterwards. To mitigate the effects of cloud cover, we discarded validation images with greater than 20% cloud cover. Events are only included where all images are within 180 days before and 90 days after the event. For each event a change mask was hand annotated to mark differences between the final two images in the time series, as in Fig. 2. Cloud cover generated using s2cloudless³³ and invalid pixels were also annotated in the change masks. We emphasize that these labels are used for evaluation only.

We describe the statistics of the manually annotated validation dataset in Table 1. While each type of event is represented by a similar amount of locations, the affected area varies significantly depending on the disaster type. Namely the area of burn scars in the Fire dataset has both the largest area of effect and the largest proportion of changed pixels to all non-cloudy pixels (reported as positive ratio).

Table 1 The RaVÆn dataset statistics.

Full size table

Methodology

Preprocessing

Tiles $x^{a,b}$ of $32 \times 32$ pixels—and therefore 320m $\times$ 320m area—are extracted from the Sentinel-2 scenes as shown in Fig. 3 and used as inputs to the considered model. Here, a and b represent the location of the tile in the scene.

The tiles are further normalized by applying a log transform and scaling to constrain them to the $[-1,+1]$ interval using the following transformation for each band:

$$\begin{aligned} x'= & \,log(x)\nonumber \\ x''= & \,2 * \frac{x' - min(x')}{max(x') - min(x')} - 1 \end{aligned}$$

(1)

Values for min and max were selected manually based on visual analysis of the training data distribution and fixed for all experiments. Note that multiplication and subtraction are pixel-wise. These preprocessing steps are consistent with other papers³⁴. On-board satellites, imager sampling and memory interfacing regimes vary and this work omits to match our architecture to any one sampling method.

Model

We employ a Variational Auto-Encoder (VAE) model, as shown in Fig. 3, to learn a low-dimensionality embedding space for tiles $x^{a,b}$ then exploited for change detection. A generic Auto-Encoder (AE) model consists of two networks—called encoder and decoder—usually composed out of convolutional or fully-connected layers of neurons. The encoder network $q_\phi$ learns to project the data from the original domain into a generally lower dimensional “bottleneck” representation—called latent space—while the decoder network $p_\theta$ learns to reconstruct the original data from this latent vector representation. The whole model learns end-to-end and doesn’t require any specific labels, as the task is to reconstruct the original data while obtaining a representative latent space. Given the fact that this latent space is of smaller dimensionality than that of the original data, it can be understood as a compressed space, with only the distinguishing features present in the representation. This learned latent space can be used for further downstream tasks, such as for training with labelled data (in the case of our data, this would for example be in classification of tiles to cloudy or non-cloudy tiles), or for change detection via comparison of the embedded latents, as is further described in this paper. VAEs impose further restrictions on the distribution of the learned latent variables as is detailed in¹⁵. In particular, the VAE imposes a diagonal Gaussian distribution for the embeddings that is parameterized by the mean and standard deviation of each dimension, ${\mathcal {N}}^{a,b} = {\mathcal {N}} (\mu ^{a,b}, \sigma ^{a,b})$. The latent representation $\mu ^{a,b}, \sigma ^{a,b} \in {\mathbb {R}}^n$, where n is called latent size.

Change detection novelty score

At inference, we can drop the decoder $p_\theta$ and use only the trained encoder network $q_\phi$ as a feature extractor to encode individual tiles in their compressed representation, with the advantage of improved robustness to noise and to slight misalignment between tiles³⁵ and reduced computational and memory requirements of storing images from previous passes, which is a critical in a constrained environment.

More formally, given a new tile $x^{a,b}_t$ at time t, we seek to understand if a relevant change has happen from the k-long history of samples at the same location $\{x^{a,b}_{t-k},\dots x^{a,b}_{t-1}\}$. To achieve this, we define a novelty score function $S({\mathbb {R}}^{32\times 32\times 10}) \rightarrow {\mathbb {R}}$ as:

$$\begin{aligned} S(x_t^{a,b}) = \min _{i = 1\ldots k} d(x_{t-i}^{a,b},\, x_t^{a,b}) \end{aligned}$$

(2)

where d$({\mathbb {R}}^{32\times 32\times 10}, {\mathbb {R}}^{32\times 32\times 10}) \rightarrow {\mathbb {R}}$ is an arbitrary difference function between two tiles. We propose to employ the latent representation of the tiles within d and we test three different distance functions in “Results”, the Euclidean and cosine distance between the means $\mu ^{a,b}_t$ and $\mu ^{a,b}_{t-i}$, and the KL divergence between the Gaussian latents ${\mathcal {N}}^{a,b}_t$ and ${\mathcal {N}}^{a,b}_{t-i}$. In Eq. (2) we use the minimum as a function to aggregate the individual distances, with the assumption that it is the last sample of the time-series of tiles, that contains the relevant change. This helps us ignore small fluctuations in the previous tiles.

Compression

In this analysis, we fix the latent size to $n=128$ as initial experiments indicated that larger latent sizes did not yield improved results and that lower values decreased the model performance. This gives us significant compression capabilities when deployed, as, instead of saving the original images or the extracted tiles ($32 \times 32$ image of 10 bands with 12 bits as per the Sentinel-2 radiometric resolution specification), the application simply stores their computed latent representations (in some cases only the encoded mean vector 128 with 16 bit float precision). This approach provides users with 60$\times$ reduction of necessary storage space with the caveat that further compression of the latent vectors is possible³⁶. When comparing this solution against existing satellite practices, the a typical EO satellite predominantly use lossless compression such as CCSDS-123³⁷ or and JPEG-LS³⁸. These achieve compression ratios of approximately 6.5:1 and 2.5:1 respectively. Our solution is lossy and the original raw pixel values cannot be fully reconstructed. Unlike existing lossless systems, our approach ensures accurate information can be utilised quickly and effectively in complementing real-time decision making systems.

Preprint

A shorter version of this paper was previously presented at the Artificial Intelligence for Humanitarian Assistance and Disaster Response Workshop (HADR) at NeurIPS 2021 (virtual)³⁹. The pre-print was peer-reviewed for inclusion in the workshop, which is not archival and does not form part of the NeurIPS conference proceedings. This paper has been updated with follow-up ideas and significantly restructured.

Experimental Setup

Model architecture design

The encoder of our VAE was composed of a series of downsampling blocks. Each downsampling block first had a 2D convolutional layer with kernel size 3, stride size 2, and zero padding of 1, such that the dimensions are halved in the spatial domain. Following this layer, the block also had a sequence of extra 2D convolutional layers (the number extra depth referred to in Table 2). Skip connections were used so that the extra depth convolutional layers formed a residual block. The network could then easily learn to skip these non-downsampling layers. In the residual block, the number of hidden channels and image size were conserved. Each convolution layer used leaky ReLU activations and batch normalisation. Following a given number of downsampling blocks, the result was flattened and further reduced in dimension using a fully connected layer which outputs the mean and log variance. The decoder was essentially the encoder in reverse. The upsampling method used was nearest neighbour upsampling followed by a single convolution. This method was preferred over transpose convolution to avoid checkerboard artefacts^40,41. Last layer of the decoder network uses a linear activation function to allow for reconstruction in the original data range. Finally, for training we use the Adam optimizer and learning rate of 0.001.

Efficiency considerations

To optimize the size of the model and maximize efficiency on constrained devices we conducted a parametric search over both the number of layers and number of units per layer in both the encoder network E and the decoder network D. More precisely we tested three different model architecture configurations (small, medium and large) detailed in Table 2. The main model presented in this paper is denoted as large on Table 2, it used 3 downsampling blocks with 32, 64, 128 channels on each successively smaller scale. A final fully connected layer projects the input to a latent dimension of 128. After each downscale convolution there was a residual block of 2 (extra depth) additional convolutional layers.

Hardware deployment

We use different environments for training the VAE and for inference. For development (training and validation), we use a n1-standard-16 instance on Google Cloud Platform with two NVIDIA Tesla V100 GPUs. In addition we measure the performance of the models on the Xilinx Pynq FPGA board with limited compute power, 650 MHz ARM Cortex-A9 CPU and 512 MB RAM which emulates the resources available on a typical small satellite (motivated by⁴²).

Table 2 Differences in the architecture for different proposed model sizes.

Full size table

Baselines

To compare the performance of this approach to simpler on-board processing methods that does not make use of machine learning, we compare our method to a baseline which compares tiles directly in the input space using the Euclidean or the cosine distance and after applying the same data pre-processing as for the VAE.

Results

Figure 4 shows a qualitative comparison between the VAE model developed in this study and the image differencing baseline. The before image shows a river that floods and therefore changes colour in the after image. The labels and the change scores from our VAE and baseline methods are shown alongside. In this case, the scores were calculated using a history of $k=3$ frames, although only the most recent before frame is shown for brevity. In this example, our method—the cosine embedding—produces a change map that is crisper than the cosine baseline; notably, the small flooded canal can be seen in the cosine embedding image but not in the baseline. In a similar fashion, Fig. 5 shows a qualitative comparison in the case of a burnt-area detection.

The change-score maps, like those in Figs. 4 and 5, were produced for every image in the evaluation set. We use these maps and our labels to calculate the area under the precision-recall curve (AUPRC). We produce the curve tile-wise, so that each individual tile across each image is treated as a positive or negative example of change, rather than treating the full image as one example. This means our quality metric is sensitive to the fact that our evaluation images are not equal; they have different number of tiles and different ratios of positive pixels (as reported in Table 1). We also ignore tiles that have clouds in the after image or in the most recent image before the event. We produce a precision-recall curve for each of the four different event types in our evaluation set and calculate separately the AUPRC.

We further note, that the used AUPRC metric does not require a specific threshold to be selected. In practice, the threshold would be selected based on operational constraints. For example, if only a certain number of tiles were able to be downlinked from a remote sensing platform, then the tiles with the largest change value would be selected for transmission.

Table 3 reports the results of our change detection experiments for all disaster types. We found that cosine distance, applied on the input space or on the embeddings, generally provides the best scores. This is in accordance with other research, which uses the cosine distance for comparisons in the latent space⁴³ or when training contrastive learning methods⁴⁴. For the metrics based in the embedding space, there was some variation between event types. Surprisingly KL-divergence is the lowest-performing metric, and is beaten by both cosine and Euclidean embedding scores in all events, even though these methods do not use the variance values calculated by the VAE. Metrics based on the VAE embedding outperforms the baseline on floods, hurricanes and fires, and reaches similar performance on landslides.

Table 4 shows the effects of including a longer frame history. When three previous images are provided instead of just one, both the embedding and baseline perform better except for the case of landslide dataset where the cosine baseline with memory 1 beats memory 3 with a small margin. The table also shows that our method of detecting significant change based on the embedding space outperforms the baselines in every dataset when $k=3$ by a large margin.

Table 3 AUPRC for baseline and VAE methods with time window $k=1$ (averaged over 5 runs).

Full size table

Table 4 AUPRC for the best performing metrics from Table 3 with and without an extended history k (averaged over 5 runs).

Full size table

Model timings and memory footprint

The purpose of the proposed change detection method is to run onboard a satellite, to be used for filtering or prioritising the image tiles to be downlinked. Therefore, models need to be designed to keep up with the upcoming stream of data on constrained, low-power hardware similar to the available on real remote sensing satellites. Here we report the accuracy and inference time of the different models architectures (see Table 2).

Table 5 AUPRC and timings for different sizes of model (averaged over 5 runs).

Full size table

Table 5 shows the accuracy of a few variations of model size and the time it took to process a 574 $\times$ 509 px image (approx. 5 km $\times$ 5 km at Sentinel-2 10m resolution) whilst running on the CPU of a Xilinx PYNQ. We see the the results of all tested models are comparable and that it is reasonable to aim for the smallest model, which takes only 2.06 s to process the patch. Running onboard the PYNQ means that there is considerable potential to speed up this runtime by a large factor by deploying directly on the FPGA module rather than using the board’s CPU.

Additionally, we report that executing our code on the device left at least 67% of the total RAM available for other processes (we note that this includes any other background processes that would run alongside our code on clean Pynq environment).

Table 6 AUPRC and for models with different latent sizes (averaged over 5 runs).

Full size table

Table 6 shows the experiments with changing the latent size of the default model (denoted as the “Large model” in Table 2) while not altering any other architectural hyper-parameters. We see, that with decreased latent size we encounter drop in performance for most of datasets. We use these results to fix the latent size to $n = 128$ in all other experiments.

Latent space visualisation

To demonstrate the quality of learned embedding space, we show it’s graphical representations using the UMAP⁴⁵ method in Fig. 6. We include both tiles from the image before and from the image after the event. We show that tiles of a certain type cluster together (for example the “flooded water” tiles). We consider this as a possible motivation for follow-up work—further using the latent representations of each tile for downstream tasks. This could be done either by unsupervised clustering, or with weak annotations of pairs of tiles corresponding to select the desired changes (whitelisting) or the changes to be ignored (blacklisting).

Conclusion

In conclusion, we introduce a new method RaVÆn for unsupervised change detection in remote sensing data using a VAE. Our method is evaluated on a new dataset of remote sensing images of disasters which we release for public use with our work. The proposed model outperforms a classical computer vision baseline in all of the tested disaster classes by on average 18% (by 27% in hurricane and by 6% in fire scenarios) in the AUPRC metric when considering three past frames. This demonstrates that RaVÆn is a robust change detection method and suitable for application in improving data acquisition for disaster response. We also confirm that having access to longer temporal series of data can be beneficial when dealing with real-world noisy data (as also shown in⁴⁶)—in particular having access to past three frames instead of just one previous image improves the AUPRC metric on average by 10.6% when using our proposed technique. Finally, we show that while maintaining the model’s performance ($\pm 3\%$) the model size and it’s runtime can be greatly reduced (by 85%) on the Xilinx PYNQ board, which is crucial to demonstrate the possibilities for real world deployment.

Future work could consist of using the obtained latent representations for downstream tasks such as unsupervised clustering or weakly supervised classification of types of changes (to detect or to ignore) and efficient message passing in future constellations of small satellites. Our findings also reveal several exciting research directions, such as improvements on the used metrics for measuring change between encoded latent representations. We would like to explore other unsupervised methods for learning feature extractors, such as general contrastive learning approaches like SimCLR⁴⁴ or methods specific to Remote Sensing data such as SeCo⁴⁷. Methods which allow for better compression of the learned latent vectors³⁶ would also be beneficial for real-world application. Finally, we would also like to explore situations with access to longer temporal series of data, where it would be possible to model cyclic changes which are part of the system’s behaviour and to separate these from other detections.

Data availibility

We are releasing the full annotated evaluation RaVÆn dataset, the code and the pre-trained models alongside this paper at https://github.com/spaceml-org/RaVAEn.

References

Bello, O. M. & Aina, Y. A. Satellite remote sensing as a tool in disaster management and sustainable development: Towards a synergistic approach. Proced. Soc. Behav. Sci. 120, 365–373 (2014).
Article Google Scholar
Huyck, C., Verrucci, E. & Bevington, J. Remote sensing for disaster response: A rapid, image-based perspective. In Earthquake Hazard, Risk and Disasters 1–24 (Elsevier, 2014).
Google Scholar
Mas, J.-F. Monitoring land-cover changes: A comparison of change detection techniques. Int. J. Remote Sens. 20, 139–152 (1999).
Article Google Scholar
Coppin, P., Jonckheere, I., Nackaerts, K., Muys, B. & Lambin, E. Review article digital change detection methods in ecosystem monitoring: A review. Int. J. Remote Sens. 25, 1565–1596 (2004).
Article Google Scholar
Yang, X. X. Urban Remote Sensing: Monitoring, Synthesis and Modeling in the Urban Environment (Wiley, 2021).
Book Google Scholar
Fritz, S. et al. A comparison of global agricultural monitoring systems and current gaps. Agric. Syst. 168, 258–272 (2019).
Article Google Scholar
Kothari, V., Liberis, E. & Lane, N. D. The final frontier: Deep learning in space. In Proceedings of the 21st International Workshop on Mobile Computing Systems and Applications, 45–49 (2020).
Selva, D. & Krejci, D. A survey and assessment of the capabilities of cubesats for earth observation. Acta Astron. 74, 50–68 (2012).
Article Google Scholar
Boucheret, M.-L., Mortensen, I. & Favaro, H. Fast convolution filter banks for satellite payloads with on-board processing. IEEE J. Select. Areas Commun. 17, 238–248 (1999).
Article Google Scholar
Velazco, R., Cheynet, P., Muller, J., Ecoffet, R. & Buchner, S. Artificial neural network robustness for on-board satellite image processing: Results of upset simulations and ground tests. IEEE Trans. Nucl. Sci. 44, 2337–2344 (1997).
Article ADS Google Scholar
Griggin, M., Burke, H., Mandl, D. & Miller, J. Cloud cover detection algorithm for EO-1 Hyperion imagery. In IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2003), vol. 1, 86–89. https://doi.org/10.1109/IGARSS.2003.1293687 (2003).
Giuffrida, G. et al. CloudScout: A deep neural network for on-board cloud detection on hyperspectral images. Remote Sens. 12, 2205 (2020).
Article ADS Google Scholar
Mateo-Garcia, G. et al. Towards global flood mapping onboard low cost satellites with machine learning. Sci. Rep. 11, 7249 (2021).
Article CAS Google Scholar
Hinz, R. et al. Eo-alert: Machine learning-based on-board satellite processing for very-low latency convective storm nowcasting. In EMCWF-ESA Workshop (2020).
Kingma, D. P. & Welling, M. Auto-encoding variational bayes (2013). arXiv:1312.6114.
Drusch, M. et al. Sentinel-2: Esa’s optical high-resolution mission for gmes operational services. Remote Sens. Environ. 120, 25–36 (2012).
Article ADS Google Scholar
Angerhausen, D., Bickel, V. T. & Adam, L. Unsupervised distribution learning for lunar surface technosignature detection. Earth Sp. Sci. Open Arch. 20, 1 (2020).
Google Scholar
Merrill, N. & Eskandarian, A. Modified autoencoder training and scoring for robust unsupervised anomaly detection in deep learning. IEEE Access (2020).
Reed, I. S. & Yu, X. Adaptive multiple-band cfar detection of an optical pattern with unknown spectral distribution. IEEE Trans. Acoust. Speech Signal Process. 38, 1760–1770 (1990).
Article ADS Google Scholar
Caye Daudt, R., Le Saux, B. & Boulch, A. Fully convolutional siamese networks for change detection. 2018 25th IEEE International Conference on Image Processing (ICIP) (2018).
Ruzicka, V., D’Aronco, S., Wegner, J. D. & Schindler, K. Deep active learning in remote sensing for data efficient change detection. In Proceedings of MACLEAN: MAChine Learning for EArth ObservatioN Workshop (ECML/PKDD 2020), vol. 2766 (2020).
Çelik, T. Unsupervised change detection in satellite images using principal component analysis and $k$-means clustering. IEEE Geosci. Remote Sens. Lett. 20, 20 (2009).
Google Scholar
Çelik, T. & Curtis, C. V. Resolution selective change detection in satellite images. In ICASSP (2010).
Cheng, Y., Li, H., Çelik, T. & Zhang, F. Frft-based improved algorithm of unsupervised change detection in SAR images via PCA and k-means clustering. In IGARSS (2013).
Radke, R. J., Andra, S., Al-Kofahi, O. & Roysam, B. Image change detection algorithms: A systematic survey. IEEE Trans. Image Process. 14, 294–307 (2005).
Article ADS MathSciNet Google Scholar
Gong, P. Change detection using principal component analysis and fuzzy set theory. Can. J. Remote Sens. 19, 22–29 (1993).
Article ADS Google Scholar
de Jong, K. L. & Bosman, A. S. Unsupervised change detection in satellite images using convolutional neural networks. In IJCNN (IEEE, 2019).
Kerner, H. R. et al. Toward generalized change detection on planetary surfaces with convolutional autoencoders and transfer learning. IEEE J. Select. Top. Appl. Earth Observ. Remote Sens. 12, 3900–3918 (2019).
Article ADS Google Scholar
Giuffrida, G. et al. The $\phi$-sat-1 mission: The first on-board deep neural network demonstrator for satellite earth observation. IEEE Trans. Geosci. Remote Sens. 60, 1–14 (2021).
Article Google Scholar
Wagstaff, K. L. et al. Enabling onboard detection of events of scientific interest for the europa clipper spacecraft. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2191–2201 (2019).
Cogliati, S. et al. The PRISMA imaging spectroscopy mission: Overview and first performance analysis. Remote Sens. Environ. 262, 112499. https://doi.org/10.1016/j.rse.2021.112499 (2021).
Article ADS Google Scholar
Copernicus emergency management system. Accessed on 4.11.2021.
s2cloudless: Sentinel hub’s cloud detector for sentinel-2 imagery. Accessed on 4.11.2021.
Brown, C. F. et al. Dynamic world, near real-time global 10 m land use land cover mapping. Sci. Data 9, 1–17 (2022).
Article Google Scholar
Saha, S. & Zhu, X. X. Patch-level unsupervised planetary change detection. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2021).
Google Scholar
Ballé, J., Laparra, V. & Simoncelli, E. P. End-to-end optimized image compression. arXiv:1611.01704 (2016).
BLUE BOOK. Low-complexity lossless and near-lossless multispectral and hyperspectral image compression (2019).
Schelkens, P. et al. The jpeg 2000 family of standards. In Wavelet Applications in Industrial Processing VI Vol. 7248 724802 (SPIE, 2009).
Chapter Google Scholar
Ruzicka, V. et al. Unsupervised change detection of extreme events using ML On-board. In Artificial Intelligence for Humanitarian Assistance and Disaster Response Workshop, (NeurIPS 2021), Vancouver, Canada (2021). arXiv: 2111.02995.
Odena, A., Dumoulin, V. & Olah, C. Deconvolution and checkerboard artifacts. Distill 1, e3 (2016).
Article Google Scholar
Wang, S.-Y., Wang, O., Zhang, R., Owens, A. & Efros, A. A. Cnn-generated images are surprisingly easy to spot... for now. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 8695–8704 (2020).
Rapuano, E. et al. An fpga-based hardware accelerator for cnns inference on board satellites: Benchmarking with myriad 2-based solution for the cloudscout case study. Remote Sens. 13, 1518 (2021).
Article ADS Google Scholar
Mikolov, T., Chen, K., Corrado, G. & Dean, J. Efficient estimation of word representations in vector space. arXiv:1301.3781 (2013).
Chen, T., Kornblith, S., Norouzi, M. & Hinton, G. A simple framework for contrastive learning of visual representations. In International Conference on Machine Learning, 1597–1607 (PMLR, 2020).
McInnes, L., Healy, J. & Melville, J. Umap: Uniform manifold approximation and projection for dimension reduction. arXiv:1802.03426 (2018).
Saha, S., Bovolo, F. & Bruzzone, L. Change detection in image time-series using unsupervised lstm. IEEE Geosci. Remote Sens. Lett. 20, 20 (2020).
Google Scholar
Manas, O., Lacoste, A., Giró-i Nieto, X., Vazquez, D. & Rodriguez, P. Seasonal contrast: Unsupervised pre-training from uncurated remote sensing data. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 9414–9423 (2021).

Download references

Acknowledgements

This work has been enabled by Frontier Development Lab (FDL) Europe, a public partnership between the European Space Agency (ESA) Phi-Lab (ESRIN) and ESA Mission Operations (ESOC), Trillium Technologies and the University of Oxford; the project has been also supported by Google Cloud, D-Orbit and Planet. The authors would like to thank all FDL faculty members, Atlm Güne Baydin and Yarin Gal (University of Oxford), Chedy Raissi (INRIA), Brad Neuberg (Planet) and Nicolas Longépé (ESA ESRIN) for discussions and comments throughout the development of this work. G.M.-G. has been partially supported by the Spanish Ministry of Science and Innovation (project PID2019-109026RB-I00, ERDF) and the European Social Fund.

Author information

Authors and Affiliations

University of Oxford, Oxford, UK
Vít Růžička & Daniele De Martini
University of Cambridge, Cambridge, UK
Anna Vaughan
University of Edinburgh, Edinburgh, UK
James Fulton
Microsoft Research, Cambridge, UK
Valentina Salvatelli
University of Surrey, Guildford, UK
Chris Bridges
University of Valencia, Valencia, Spain
Gonzalo Mateo-Garcia
ServiceNow Research, Canada, University College London, London, UK
Valentina Zantedeschi
Frontier Development Lab, Oxford, UK
Vít Růžička, Valentina Salvatelli, Gonzalo Mateo-Garcia & Valentina Zantedeschi

Authors

Vít Růžička
View author publications
You can also search for this author in PubMed Google Scholar
Anna Vaughan
View author publications
You can also search for this author in PubMed Google Scholar
Daniele De Martini
View author publications
You can also search for this author in PubMed Google Scholar
James Fulton
View author publications
You can also search for this author in PubMed Google Scholar
Valentina Salvatelli
View author publications
You can also search for this author in PubMed Google Scholar
Chris Bridges
View author publications
You can also search for this author in PubMed Google Scholar
Gonzalo Mateo-Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Valentina Zantedeschi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

V.R., A.V., D.D.M. and J.F. were the main researchers for this project and share conceivement of the experiments and coding. V.R. is responsible as the first and corresponding author, he was responsible for conducting and evaluating the experiments and publishing the code repository and results. A.V. with G.M.-G. were responsible for collection of the dataset. V.S., C.B., G.M.-G. and V.Z. were the main domain and machine learning supervisors and were responsible for the conception and guidance of the project. C.B. with V.R. contributed with measuring the model on real-world hardware. G.M.-G. and V.Z. also contributed to the codebase. All authors reviewed and contributed to the manuscript.

Corresponding author

Correspondence to Vít Růžička.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Růžička, V., Vaughan, A., De Martini, D. et al. RaVÆn: unsupervised change detection of extreme events using ML on-board satellites. Sci Rep 12, 16939 (2022). https://doi.org/10.1038/s41598-022-19437-5

Download citation

Received: 04 July 2022
Accepted: 29 August 2022
Published: 08 October 2022
DOI: https://doi.org/10.1038/s41598-022-19437-5

This article is cited by

The OPS-SAT case: A data-centric competition for onboard satellite image classification
- Gabriele Meoni
- Marcus Märtens
- Dario Izzo
Astrodynamics (2024)
Semantic segmentation of methane plumes with hyperspectral machine learning models
- Vít Růžička
- Gonzalo Mateo-Garcia
- Andrew Markham
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Towards global flood mapping onboard low cost satellites with machine learning

In-orbit demonstration of a re-trainable machine learning payload for processing optical imagery

Tracking vegetation phenology across diverse biomes using Version 2.0 of the PhenoCam Dataset

Introduction

Background

Anomaly detection

Change detection

ML deployment on satellites

Data

The RaVÆn dataset

Methodology

Preprocessing

Model

Change detection novelty score

Compression

Preprint

Experimental Setup

Model architecture design

Efficiency considerations

Hardware deployment

Baselines

Results

Model timings and memory footprint

Latent space visualisation

Conclusion

Data availibility

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

The OPS-SAT case: A data-centric competition for onboard satellite image classification

Semantic segmentation of methane plumes with hyperspectral machine learning models

Comments

Search

Quick links