Introduction

With the development of sensor technology, the types of remote sensing images are increasing rapidly. Images from different sensors differ markedly in spectrum, spatial resolution and imaging mechanism, and therefore capture different aspects of ground information. A single image can hardly satisfy the need for diversified information, whereas the surface object information obtained from multi-source remote sensing images is to some extent complementary. Studying the joint analysis and utilization of different types of remote sensing images can therefore provide more comprehensive information for surface monitoring [1]. Optical images have high spatial resolution and are easy to interpret, but their imaging quality is vulnerable to weather and illumination conditions. SAR images can be acquired day and night under all weather conditions, can penetrate cloud, fog and other media, and are strongly complementary to optical images, with broad application prospects in both military and civilian fields [2]; however, speckle noise degrades the interpretability of SAR images. The registration of optical and SAR images is therefore of great significance in image processing and analysis, image fusion, computer vision and other fields [3].

Registration algorithms can be divided into intensity-based and feature-based methods. Intensity-based algorithms are simple in principle, but they are not robust to images with large gray-scale differences, such as optical and SAR images, and are strongly affected by scale and rotation changes, so they are ill-suited to optical-SAR registration [4]. Feature-based algorithms adapt well to gray-level differences and to rotation and scale changes; representative examples are SIFT [5] and speeded-up robust features (SURF) [6]. Among them, the SIFT algorithm is widely used in optical image registration [7]. Yu et al. [8] combined spatial feature detection with local frequency-domain description to construct a rotation-invariant amplitude descriptor based on the orientation histogram, and realized the registration of optical and SAR images. Sedaghat et al. [9] addressed the uneven distribution of feature points detected by SIFT in optical remote sensing image registration with an optimization scheme that extracts uniformly distributed feature points using the scale and spatial-position constraints of the features. Dellinger et al. [10, 11] applied SIFT to SAR image registration and proposed the SAR-SIFT algorithm, which counters the speckle noise inherent in SAR images with a new gradient definition: a multi-scale ratio of exponentially weighted averages (ROEWA) operator computes the SAR image gradient, and feature points are detected in a Harris scale space, achieving good registration results.
Based on SIFT, Ma et al. [12] proposed a gradient computation that overcomes the nonlinear gray-level difference between images and matched multi-modal images using the spatial position, scale and orientation information of the feature points; the resulting algorithm, named PSO-SIFT, was successfully applied to the registration of visible and SAR images. Wang et al. [13] constructed an anisotropic scale space with speckle reducing anisotropic diffusion (SRAD) filtering, which reduces the influence of noise on feature extraction and effectively improves the registration accuracy of SAR images. Xiang et al. [14] applied two different gradient operators to the optical and SAR images respectively to extract gradient information consistent across the two modalities, and adopted a multi-scale Harris corner detector to overcome the sensitivity of the traditional Harris detector to scale change; the algorithm is robust to large scale and rotation differences between optical and SAR images. Fan et al. [15] proposed a high-accuracy algorithm for optical-SAR registration in which a nonlinear diffusion operator extracts evenly distributed feature points and a phase-congruency operator constructs the descriptors; however, the algorithm depends heavily on image structure information, and its registration results on images containing little structure are not ideal. Divya et al. [16] proposed a structure-tensor-based SIFT algorithm whose descriptors, applied to SAR image registration, increase the number of correctly matched point pairs and improve the positional accuracy of registration. Ye et al. [17] proposed a novel feature detector that combines point features and patch features to effectively improve the repetition rate and matching accuracy of feature point pairs between visible and SAR images.
Reference [18] overcame the gray-level difference problem and improved registration accuracy by refining the SIFT gradient definition. Deep-learning-based methods usually adopt end-to-end networks to realize image registration; reference [19] learned modality-invariant features and thereby improved registration accuracy.

All the above algorithms improve the registration accuracy and speed of optical and SAR images to some extent, but limitations remain. On the one hand, the nonlinear difference between optical and SAR images is large, so it is difficult to extract highly repeatable features with the same feature detector. On the other hand, these algorithms are not robust to airborne SAR images with heavy noise. To address these problems, we propose an improved SIFT algorithm for registration between SAR and optical images. First, nonlinear diffusion scale spaces of the optical and SAR images are constructed with nonlinear diffusion filtering, and consistent gradient information is computed with a multi-scale Sobel operator and a multi-scale ratio of exponentially weighted averages (ROEWA) operator, respectively. Then, after the first layer of the scale space is removed, the remaining layers are partitioned with an image blocking strategy, and Harris feature points are extracted from the consistent gradient information, yielding stable and uniformly distributed point features. Descriptors are constructed from gradient location-orientation histogram templates and normalized to overcome the nonlinear radiation differences between the images. Finally, correct matching point pairs are obtained with a bilateral fast approximate nearest neighbor (FLANN) search and the random sample consensus (RANSAC) method, and the parameters of the affine transformation model are estimated.
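To make the consistent-gradient step concrete, the sketch below computes gradient magnitude and orientation with a Sobel operator (suited to additive-noise optical images) and a ROEWA operator (a ratio of one-sided exponentially weighted means, suited to multiplicative speckle in SAR images). This is a minimal single-scale numpy sketch, not the paper's implementation; the parameters `alpha` and `radius` and the zero-padded convolutions are illustrative assumptions.

```python
import numpy as np

def conv_1d(img, kernel, axis):
    # separable 1-D convolution along one axis (zero padding at borders)
    return np.apply_along_axis(lambda v: np.convolve(v, kernel, mode="same"), axis, img)

def sobel_gradient(img):
    """Difference-of-averages (Sobel-style) gradient magnitude and orientation."""
    smooth = np.array([1., 2., 1.]) / 4.0
    diff = np.array([1., 0., -1.]) / 2.0
    gx = conv_1d(conv_1d(img, diff, 1), smooth, 0)
    gy = conv_1d(conv_1d(img, diff, 0), smooth, 1)
    return np.hypot(gx, gy), np.arctan2(gy, gx)

def roewa_gradient(img, alpha=2.0, radius=8):
    """ROEWA-style gradient: log-ratio of exponentially weighted means taken
    on either side of each pixel (robust to multiplicative speckle)."""
    t = np.arange(1, radius + 1)
    w = np.exp(-t / alpha)
    w /= w.sum()
    # one-sided kernels of odd length 2*radius+1; the centre pixel is excluded
    k_after = np.concatenate([np.zeros(radius + 1), w])
    k_before = k_after[::-1]
    eps = 1e-8
    gx = np.log((conv_1d(img, k_after, 1) + eps) / (conv_1d(img, k_before, 1) + eps))
    gy = np.log((conv_1d(img, k_after, 0) + eps) / (conv_1d(img, k_before, 0) + eps))
    return np.hypot(gx, gy), np.arctan2(gy, gx)
```

Because the ROEWA response is a ratio rather than a difference, its magnitude is unchanged when the image is scaled by a constant factor, which is why it pairs well with speckled SAR data while the Sobel operator serves the optical image.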

Proposed method

Nonlinear diffusion scale space

The SIFT algorithm uses a Gaussian filter to construct its scale space, but Gaussian filtering preserves edges poorly and blurs the natural boundaries of objects. To address this, the scale space is constructed with nonlinear diffusion filtering [20], and the nonlinear diffusion equation can be expressed as:

$$ L_{t} = div[p(m,n,t)\nabla L] = p(m,n,t)\Delta L + \nabla p \cdot \nabla L $$
(1)

where p(m, n, t) is the diffusion function, div is the divergence operator, and ∇ and Δ are the gradient and Laplacian operators, respectively. Equation (1) is an anisotropic diffusion equation. When p(m, n, t) is a constant, it simplifies to the isotropic diffusion equation:

$$ L_{t} = p(m,n,t)\Delta L $$
(2)

In order to make p(m, n, t) vary with the local features of the image, the function p(m, n, t) is expressed as:

$$ p(m,n,t) = g[||\nabla L(m,n,t)||] $$
(3)

where ||·|| denotes the norm, ∇L(m, n, t) is the gradient of the Gaussian-smoothed image, and g(·) is the edge-stopping function, which takes values between 0 and 1; it not only preserves edges but can also sharpen brightness edges. As the scale increases, the Gaussian scale space loses more and more edge detail, whereas the nonlinear diffusion scale space effectively retains edge details with less blur between adjacent scale images, so features are extracted more effectively, which benefits the subsequent consistent gradient computation.

Partitioning strategy

An uneven distribution of feature points tends to concentrate the finally matched points in a few areas, which is detrimental to estimating the subsequent geometric transformation model. To distribute feature points evenly over the whole image, and thus obtain a more accurate transformation model and higher registration accuracy, a partitioning strategy is introduced in the scale space. The goal of the feature extraction stage is to obtain a sufficient number of relevant point features distributed uniformly over the image and the scale space; the blocking strategy achieves this by extracting feature points from each regular grid cell. Following the idea of the blocking strategy [21], a blocking strategy for the scale space is designed. First, the total number of feature points to extract is specified. Then, each layer image is divided into regular grid cells and the number of feature points per cell is determined. Finally, the first layer of the scale space is excluded, and the feature points extracted block by block are mapped back to their corresponding positions in the original scale space.

The position of each block is determined by the block size and block count, which in turn fixes the positions of its feature points in the original scale space. The first layer of the scale space is the original-resolution image, and most feature points extracted from this layer arise from speckle noise; the randomness of speckle noise disturbs genuine features nearby and leads to mismatches [22]. To solve this problem, feature extraction is omitted for the first layer, which is not divided into blocks; this also speeds up registration to a certain extent. The partitioning strategy chooses the blocking layout according to the image size, determines the grid size of each layer from the relationship between the scale-space coefficients, and can be adjusted to the actual application requirements, which further accelerates the algorithm.
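The per-cell selection at the heart of the blocking strategy can be sketched as follows: candidate feature points are binned into a regular grid, and only the strongest detector responses in each cell are kept, so the retained points cover the whole image. The grid size and quota here are illustrative; the paper derives them from the scale-space coefficients.

```python
import numpy as np

def blockwise_select(points, responses, img_shape, grid=(4, 4), n_total=64):
    """Keep the strongest responses per grid cell for spatial uniformity.
    points: (N, 2) array of (row, col); responses: (N,) detector scores.
    Returns indices into `points` of the retained features."""
    h, w = img_shape
    per_cell = max(1, n_total // (grid[0] * grid[1]))
    cell_h, cell_w = h / grid[0], w / grid[1]
    cell_r = np.minimum((points[:, 0] // cell_h).astype(int), grid[0] - 1)
    cell_c = np.minimum((points[:, 1] // cell_w).astype(int), grid[1] - 1)
    kept = []
    for r in range(grid[0]):
        for c in range(grid[1]):
            idx = np.where((cell_r == r) & (cell_c == c))[0]
            if idx.size:
                # strongest responses first; keep at most the cell quota
                order = idx[np.argsort(responses[idx])[::-1]]
                kept.extend(order[:per_cell].tolist())
    return np.array(sorted(kept))
```

Applying this per scale-space layer (skipping the first, noise-dominated layer) yields the uniform point distribution the transformation model estimation relies on.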

Main orientation assignment and descriptor construction

After the feature points are detected, their location information is first used to refine the detected positions, and then the GLOH descriptor is used to describe each feature. GLOH divides the circular neighborhood of a feature point into three concentric circles, splits the two outer rings into 8 equal sectors, and leaves the central circle undivided, yielding 17 sub-regions. In each sub-region the gradient directions over 0°–180° are quantized into 8 bins, and a gradient-orientation histogram of the feature point is accumulated; the peak of the histogram gives the main orientation of the feature point. This structure is isotropic on average and increases the robustness of the descriptor.
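The 17-bin log-polar layout can be sketched as below: gradient magnitudes are accumulated into 17 spatial bins (1 central disc + 2 rings x 8 sectors), each with 8 orientation bins, and the result is L2-normalized. This is an illustrative sketch, not the paper's implementation: the ring radii are assumptions, and orientations are quantized over 0°–360° here rather than the 0°–180° range the paper uses.

```python
import numpy as np

def gloh_like_descriptor(mag, ori, center, radii=(6, 11, 15), n_ang=8, n_orient=8):
    """GLOH-style descriptor: 17 log-polar spatial bins x 8 orientation bins.
    mag / ori: gradient magnitude and orientation maps; center: (row, col)."""
    cy, cx = center
    r_out = radii[-1]
    desc = np.zeros((1 + 2 * n_ang, n_orient))      # 17 x 8 histogram
    for dy in range(-r_out, r_out + 1):
        for dx in range(-r_out, r_out + 1):
            y, x = cy + dy, cx + dx
            r = np.hypot(dy, dx)
            if r > r_out or not (0 <= y < mag.shape[0] and 0 <= x < mag.shape[1]):
                continue
            if r <= radii[0]:
                spatial = 0                          # central disc, no angular split
            else:
                ring = 1 if r <= radii[1] else 2
                ang = int((np.arctan2(dy, dx) % (2 * np.pi)) / (2 * np.pi) * n_ang) % n_ang
                spatial = 1 + (ring - 1) * n_ang + ang
            ob = int((ori[y, x] % (2 * np.pi)) / (2 * np.pi) * n_orient) % n_orient
            desc[spatial, ob] += mag[y, x]
    v = desc.ravel()
    n = np.linalg.norm(v)
    return v / n if n > 0 else v
```

The normalization at the end is what gives the descriptor its robustness to the nonlinear radiation (contrast) differences between optical and SAR images.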

Feature matching and mismatching elimination

The FLANN algorithm realizes feature matching by searching for the nearest neighbors of feature points under the Euclidean distance. A bilateral FLANN search and matching method is designed, in which I1 is the SAR image and I2 the optical image. Points q and m are the nearest and second-nearest neighbors on I2 of a point p in I1, with corresponding feature vectors Fq, Fm and Fp; points r and s are the nearest and second-nearest neighbors on I1 of a point n in I2, with corresponding feature vectors Fr, Fs and Fn. The main steps are as follows:

  1.

    Use formula (4) to compute the Euclidean distances dpq and dpm from point p to points q and m, and dnr and dns from point n to points r and s:

    $$ \left\{ {\begin{array}{*{20}c} {d_{pq} = \sqrt {\sum\limits_{i = 1}^{64} {[F_{p} (i) - F_{q} (i)]^{2} } } } \\ {d_{pm} = \sqrt {\sum\limits_{i = 1}^{64} {[F_{p} (i) - F_{m} (i)]^{2} } } } \\ \end{array} } \right.,\left\{ {\begin{array}{*{20}c} {d_{nr} = \sqrt {\sum\limits_{i = 1}^{64} {[F_{n} (i) - F_{r} (i)]^{2} } } } \\ {d_{ns} = \sqrt {\sum\limits_{i = 1}^{64} {[F_{n} (i) - F_{s} (i)]^{2} } } } \\ \end{array} } \right. $$
    (4)
  2.

    Calculate the distance ratio R1 and R2:

    $$ R_{1} = \frac{{d_{pq} }}{{d_{pm} }},R_{2} = \frac{{d_{nr} }}{{d_{ns} }} $$
    (5)
  3.

    Compare the distance ratios R1 and R2 with the threshold Th: if R1 < Th, points p and q are matched successfully; otherwise the match fails. Likewise, if R2 < Th, points n and r are matched successfully; otherwise the match fails.

  4.

    Retain the feature point pairs on which the two searches agree, yielding one-to-one feature matching results between the SAR and optical images.
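The four steps above can be sketched as follows. For clarity the approximate FLANN index is replaced by an exhaustive nearest-neighbor search (an assumption; the distances and ratio test are exactly Eqs. (4)–(5)), and the threshold value `th=0.8` is illustrative.

```python
import numpy as np

def ratio_match(desc_a, desc_b, th=0.8):
    """One-directional nearest / second-nearest ratio test.
    Returns {index_in_a: index_in_b} for accepted matches."""
    matches = {}
    for i, f in enumerate(desc_a):
        d = np.linalg.norm(desc_b - f, axis=1)   # Euclidean distances, Eq. (4)
        j, k = np.argsort(d)[:2]                 # nearest and second nearest
        if d[j] < th * d[k]:                     # ratio test R < Th, Eq. (5)
            matches[i] = j
    return matches

def bilateral_match(desc_sar, desc_opt, th=0.8):
    """Keep only pairs on which the SAR->optical and optical->SAR searches agree."""
    fwd = ratio_match(desc_sar, desc_opt, th)
    bwd = ratio_match(desc_opt, desc_sar, th)
    return [(i, j) for i, j in fwd.items() if bwd.get(j) == i]
```

The bilateral consistency check in step 4 is what removes many-to-one matches that a single forward search would accept.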

The RANSAC algorithm is highly robust and is often used to eliminate mismatched point pairs. Four pairs are randomly selected from the pre-matched point pairs to compute the affine transformation model parameters; the distances between the remaining matched pairs and their coordinates transformed by the model are then computed, and wrong matches are eliminated over several sampling iterations to achieve fine matching.
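This fine-matching loop can be sketched with numpy as below. It is an illustrative sketch, not the paper's implementation: a minimal 3-pair sample is used here (the minimum an affine model requires, whereas the paper samples 4 pairs), and `n_iter` and the inlier tolerance `tol` are assumed values.

```python
import numpy as np

def fit_affine(src, dst):
    """Least-squares affine model dst ~= [x, y, 1] @ A from point pairs."""
    X = np.hstack([src, np.ones((len(src), 1))])      # (n, 3)
    A, *_ = np.linalg.lstsq(X, dst, rcond=None)       # (3, 2)
    return A

def ransac_affine(src, dst, n_iter=500, tol=2.0, seed=0):
    """Sample minimal sets, keep the model with the most inliers,
    then refit on all inliers (the fine-matching step)."""
    rng = np.random.default_rng(seed)
    best_inliers = np.zeros(len(src), dtype=bool)
    ones = np.ones((len(src), 1))
    for _ in range(n_iter):
        idx = rng.choice(len(src), 3, replace=False)
        A = fit_affine(src[idx], dst[idx])
        resid = np.linalg.norm(np.hstack([src, ones]) @ A - dst, axis=1)
        inliers = resid < tol
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    A = fit_affine(src[best_inliers], dst[best_inliers])
    return A, best_inliers
```

The returned inlier mask is the set of correct matching point pairs, and `A` holds the affine transformation model parameters used for the final registration.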

Experimental results and analysis

To evaluate the performance of the proposed method, experiments are conducted on three pairs of SAR and optical images. The experiments are implemented in Python 3.6; the network is built with the PyTorch 1.3 deep learning framework, with CUDA 10.0 and cuDNN 7.0 configured for GPU acceleration. The test data cover different characteristics, including different resolutions, incidence angles and seasons. The dataset is described in Table 1. Experimental results are shown in Figs. 1, 2 and 3 and Table 2. Figures 1, 2 and 3 were generated with Python 3.6.10 (Python.org).

Table 1 Detailed description of dataset.
Figure 1
figure 1

(a) Optical image; (b) SAR image; Matches found in pair 1 using (c) PSO-SIFT, (d) OS-SIFT, and (e) the proposed method. The reference image is shown on the left and the sensed image on the right.

Figure 2
figure 2

(a) Optical image; (b) SAR image; Matches found in pair 2 using (c) PSO-SIFT, (d) OS-SIFT, and (e) the proposed method. The reference image is shown on the left and the sensed image on the right.

Figure 3
figure 3

(a) Optical image; (b) SAR image; Matches found in pair 3 using (c) PSO-SIFT, (d) OS-SIFT, and (e) the proposed method. The reference image is shown on the left and the sensed image on the right.

Table 2 Quantitative comparison of the proposed method with other SIFT-based algorithms.

To quantitatively evaluate registration performance, we adopt the root-mean-square error (RMSE) between corresponding matched keypoints, which can be expressed as

$$ {\text{RMSE}} = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\left[ {(x_{i} - x_{i}^{\prime } )^{2} + (y_{i} - y_{i}^{\prime } )^{2} } \right]} } $$
(6)

where (xi, yi) and (xi′, yi′) are the coordinates of the ith matched keypoint pair, and n is the total number of matched points. In addition, the correct matching ratio (CMR) is another effective measure, defined as:

$$ CMR = \frac{{correctMatches}}{{correspondences}} $$
(7)

Here "correspondences" is the number of matches obtained after applying RANSAC, and "correctMatches" is the number of correct matches after false ones are removed. The results of the quantitative evaluation for each method are listed in Table 2.
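Both metrics are straightforward to compute from the matched coordinates; a minimal sketch of Eqs. (6) and (7):

```python
import numpy as np

def rmse(ref_pts, warped_pts):
    """Eq. (6): root-mean-square error between matched keypoint coordinates."""
    p = np.asarray(ref_pts, dtype=float)
    q = np.asarray(warped_pts, dtype=float)
    return float(np.sqrt(np.mean(np.sum((p - q) ** 2, axis=1))))

def cmr(correct_matches, correspondences):
    """Eq. (7): correct matching ratio."""
    return correct_matches / correspondences
```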

From the above experimental results, the following conclusions can be drawn. Table 2 shows that the correct matching rates of the PSO-SIFT [23] and OS-SIFT [24] algorithms are relatively low, whereas the proposed algorithm obtains more, and more uniformly distributed, correct matching point pairs, indicating that it better suppresses the radiation differences between optical and SAR images. Compared with the other two algorithms, the CMR of the proposed algorithm improves by 80.53%, 75.61% and 81.74% on the three image pairs, and the RMSE decreases by 0.6491, 1.0287 and 0.6306, respectively. The reason is that the proposed algorithm builds its scale space with nonlinear diffusion filtering, which effectively preserves image edge features and yields images with clear regional boundaries; it also sharpens brightness edges, making the extracted features more effective and laying the foundation for subsequent feature matching. With the blocking strategy, the feature points are distributed more uniformly over the image; compared with the original algorithm, uniform feature points yield a better transformation model and thus better registration.

Conclusion

In this paper, we propose an improved SIFT algorithm for optical and SAR image registration. First, nonlinear diffusion scale spaces of the optical and SAR images are constructed with nonlinear diffusion filtering, and consistent gradient information is computed with a multi-scale Sobel operator and a multi-scale ROEWA operator, respectively. Then, after the first layer of the scale space is removed, the remaining layers are partitioned with the image blocking strategy, and Harris feature points are extracted from the consistent gradient information to obtain stable, uniformly distributed point features. Descriptors are constructed from gradient location-orientation histogram templates and normalized to overcome the nonlinear radiation differences between the images. Finally, correct matching point pairs are obtained with the bilateral FLANN search and the RANSAC method, and the affine transformation model parameters are estimated. Experimental results demonstrate superior matching performance with respect to state-of-the-art methods. However, the blocking strategy affects running speed to some extent, because the algorithm must partition the scale space when extracting feature points and then map the extracted points back to their corresponding positions in the original scale space. In future work, the algorithm can be further optimized to improve its speed.