Super-resolution of X-ray CT images of rock samples by sparse representation: applications to the complex texture of serpentinite

Omori, Toshiaki; Suzuki, Shoi; Michibayashi, Katsuyoshi; Okamoto, Atsushi

doi:10.1038/s41598-023-33503-6

Download PDF

Article
Open access
Published: 24 April 2023

Super-resolution of X-ray CT images of rock samples by sparse representation: applications to the complex texture of serpentinite

Toshiaki Omori^1,2,3,
Shoi Suzuki¹,
Katsuyoshi Michibayashi⁴ &
…
Atsushi Okamoto⁵

Scientific Reports volume 13, Article number: 6648 (2023) Cite this article

2863 Accesses
4 Citations
2 Altmetric
Metrics details

Subjects

Abstract

X-ray computed tomography (X-ray CT) has been widely used in the earth sciences, as it is non-destructive method for providing us the three-dimensional structures of rocks and sediments. Rock samples essentially possess various-scale structures, including millimeters to centimeter scales of layering and veins to micron-meter-scale mineral grains and porosities. As the limitations of the X-ray CT scanner, sample size and scanning time, it is not easy to extract information on multi-scale structures, even when hundreds meter scale core samples were obtained during drilling projects. As the first step to overcome such barriers on scale-resolution problems, we applied the super-resolution technique by sparse representation and dictionary-learning to X-ray CT images of rock core sample. By applications to serpentinized peridotite, which records the multi-stage water–rock interactions, we reveal that both grain-shapes, veins and background heterogeneities of high-resolution images can be reconstructed through super-resolution. We also show that the potential effectiveness of sparse super-resolution for feature extraction of complicated rock textures.

Digital colloid-enhanced Raman spectroscopy by single-molecule counting

Article 17 April 2024

Mid-infrared wide-field nanoscopy

Article 17 April 2024

Principal component analysis

Article 22 December 2022

Introduction

X-ray computed tomography (X-ray CT) is non-destructive method for providing us the three-dimensional structures. In decades, the applications of X-ray CT to geomaterials have been widely increasing, including rocks, sediments, and meteorites. In particular, the X-ray CT scanner is commonly applied to rock core samples by scientific drilling and/or developments of underground resources^1,2. The downhole profiles of the X-ray CT values is used to extract the basic information on the physical properties of geological formations, and thus CT-value profiles lend themselves to geological interpretations^1,3,4,5. For example, during the Oman drilling projects in 2016-2019, that drilled the crust and mantle sections of Oman ophiolite, the continuous X-ray CT images in total length exceeding 1000 m have been obtained by the onboard operation; the D/V Chikyu using a Discovery CT 750HD (GE Medical Systems)^2,6. Crustal and mantle rocks taken from the Oman drilling projects showed various-scale structures that formed during igneous processes and water–rock interactions, including meter-scale igneous layering to millimeters scale veins to micro-scale mineral grain shapes, and nano-scale pores. However, as the voxel size of CT images at the D/V Chikyu is several hundreds micrometers, it is difficult to obtain the grain-scale information². In contrast, more detailed CT images are obtained by the micro-CT at laboratories or synchrotron-based nano CT scanners at high energy acceleration institutes, although the size of analyzed samples is restricted. The hydrothermal experiments coupled with repeated CT imaging reveals the generations of porosity and evolution of fluid pathways during water–rock reactions^7,8.

Traditionally, interpolation techniques are used to enhance the spatial resolution of images⁹, including bilinear interpolation and higher dimensional interpolation methods such as bicubic interpolation method. Bicubic interpolation uses a simple mathematical cubic function to interpolate data points on a regular two-dimensional grid. These interpolation methods can be applied to various images with low computational cost because they use common simple functions regardless of the target subject. Since these interpolation methods do not consider the characteristics of specific subjects, their simplicity limits their capability to enhance image quality, and they tend to generate excessively smooth images with artifacts⁹.

Super-resolution is a data-driven method based on machine learning for estimating a high-resolution image from a recorded low-resolution image^10,11. In particular, learning-based super-resolution, which learns information about the subject in advance, is able to reconstruct images with higher accuracy by tailoring the learning process to the target image. Recently, super-resolution methods using deep learning methods have been proposed. Various types of deep learning methods have been applied to super-resolution, ranging from convolutional neural networks^12,13 to generative adversarial networks¹⁴. However, the complexity of neural network architectures and the large numbers of network parameters mandate the need for a large amount of data to realize successful learning in most deep learning-based methods^15,16,17,18. Sparse super-resolution is the another method that uses sparse coding to learn image characteristics for super-resolution. Sparse modeling is a framework that is suitable for applications with a sparsity of big data, where there is only a small number of explanatory variables¹⁹. It has been applied in many scientific fields including physics²⁰, astronomy²¹, neuroscience^22,23,24, and earth sciences²⁵. In sparse super-resolution, an image is represented by the product of a dictionary obtained by learning and a coefficient vector so that the number of extracted basis images in the dictionary is minimized¹⁰. Thus, the image is represented under the constraint of a sparse coefficient vector.

In recent years, super-resolution methods have been developed for medical imaging techniques such as CT and magnetic resonance imaging. For the super-resolution of medical images, sparse modeling approaches have been developed, which is a white-box type (i.e., explainable) of machine learning²⁶. For example, basis images, which explain the characteristics of target subject, are explicitly obtained and can be analyzed in sparse modeling approaches. In contrast, most deep learning approaches are the black-box type of machine learning. This means that for scientific purposes, the deep learning approaches have some disadvantages such as low explainability and interpretability owing to their complex mathematical frameworks^27,28,29. For medical CT images, a previous study based on sparse modeling demonstrated that sharp and clear structures such as the sharp and clear boundaries between organs and clear structures in blood vessels are emphasized in the estimated super-resolution images as well as the super-resolution of natural standard images by assuming that only specific frequency elements are considered in the formulation of the super-resolution²⁶. For CT images of rocks, however, it is important to estimate complex structures at high-resolution images such as rough, fine structures as well as macroscopic sharp boundaries such as the boundaries between open cracks and host rocks.

To overcome the scale-resolution problem for rock CT imaging, we propose a super-resolution technique using sparse representation and dictionary-learning. The proposed method can estimate the complex structure of rock rather than sharpening and smoothing the image like conventional interpolation methods. We demonstrated the effectiveness of the proposed method by applying it to the super-resolution of CT images of serpentinized dunite from the Oman ophiolite.

Methods

Serpentinized dunite of Oman ophiolite

In this study, we perform super-resolution of X-ray CT images for serpentinized dunite within the crust-mantle transition zone of the Oman ophiolite, which were suffered from the various stages of hydrothermal alteration. The Oman ophiolite is the best exposed section of oceanic lithosphere, and is located at the southearstern margin of Arabian Peninsula. The ophiolite is composed of pillow to massive submarine basalt, sheeted dike complex, cummulates and gabbros, and upper mantle rocks (dunite and harzburgite^30,31). Hole CM1A was drilled at Wadi Zeeb, northern Sharqiyah ($22^{\circ }54.435^{\prime }\,\hbox {E}, 58^{\circ }20.149^{\prime }\,\hbox {N}$) in 2017 and the CM1A core was described aboard the D/V Chikyu in July to August in 2018. From 0 to 160 m depth, the core consists mainly of gabbroic rocks (olivine gabbro and troctolite), and from 160 to 310 m depth, the CM1A core consists mostly of dunite, which is classified as part of the crust- mantle transition zone. From 310 to 404 m depth, the core consists mainly of harzburgite, which is classified as part of the mantle sequence. Various stages of alteration reactions and veining that occurred over a range of temperatures and fluid infiltration conditions^32,33.

X-ray CT images

We used the X-ray CT images of a serpentinized dunite sample (CM1A-90z02-48-53) taken from the crust-mantle transition zone at the drill site CM1A of the Oman Drilling Project^2,32,34. The core sample of serpentinized dunite used in this study was scanned using a micro-focus X-ray CT scanner (Scan Xmate D225RSS270; Comscantecno) at Tohoku University (Fig. 1a, see Ref.⁸ for detailed information). The voltage was 120 kV, the current was 150 $\upmu \hbox {A}$, and the X-ray spot size was 9 $\upmu \hbox {m}$ (approximately half of 18 W). The pixel matrix was $1856 \times 1856$, and the voxel size was 10 $\upmu \hbox {m}$ (Fig. 1b). As the low-resolution images used in the super-resolution of this study, we also made the artificial degradation figures by averaging the sixteen pixels (Fig. 1c).

The serpentinized dunite sample was completely serpentinized, and no relic of olivine or pyroxenes. The sample is mainly composed of serpentine minerals, brucite, magnetite and Cr-rich spinel³⁴. The matrix parts of the sample show the mesh textures composed of $\hbox {lizardite}+\hbox {brucite}+\hbox {magnetite}$, showing the heterogeneities in scales of the original olivine grain or less. The matrix is cut by the later stage serpentine ($\hbox {antigorite}+\hbox {chrysotile}$) veins with thickness of 1 mm. Around this veins, bright brucite-rich reaction zone is developed with fine grained magnetite³⁴. Cr-rich spinel is a subhedral grains with a size of 30 $\upmu \hbox {m}$. Around the serpentine veins, the rims of the spinel grained are replaced by magnetite and trails of magnetite formed within the blanches of the serpentine veins.

Super-resolution method by sparse expression of rock CT images

Here, we describe the framework of sparse super-resolution for rock CT images. The detailed mathematical formulation is given in the Supplementary Material. In sparse image representation, natural images are expressed by a small number of basis images^19,23,35. Each small area of a natural image called a patch, $\varvec{y}_i \in \{1,2,\ldots ,P \}$ (P: the total number of patches), which can be expressed by basis images $\left\{ \varvec{d}_1, \varvec{d}_2, \ldots ,\varvec{d}_D \right\}$ with a sparse vector (D: the total number of basis images):

$$\begin{aligned} \varvec{y}_i =x_{i,1} \varvec{d}_1 +x_{i,2} \varvec{d}_2 +\cdots +x_{i,D} \varvec{d}_D \end{aligned}$$

(1)

where $\varvec{x}_i =\left\{ x_{i,1},x_{i,2},\ldots ,x_{i,D} \right\}$ is a sparse vector in which most of the elements are zero. In both high-resolution and the corresponding low-resolution images, a sparse vector is assumed to be common as follows (Fig. 2a):

$$\begin{aligned} \varvec{y}_i^{\textrm{high}}= & {} x_{i,1} \varvec{d}_1^{\textrm{high}} +x_{i,2} \varvec{d}_2^{\textrm{high}} +\cdots +x_{i,D} \varvec{d}_D^{\textrm{high}} \end{aligned}$$

(2)

$$\begin{aligned} \varvec{y}_i^{\textrm{low}}= & {} x_{i,1} \varvec{d}_1^{\textrm{low}} +x_{i,2} \varvec{d}_2^{\textrm{low}} +\cdots +x_{i,D} \varvec{d}_D^{\textrm{low}} \end{aligned}$$

(3)

The super-resolution based on sparse representation comprises two steps (Fig. 2b): dictionary learning to obtain basis images by using high-resolution images, and super-resolution to reconstruct a high-resolution image by transferring sparse coefficients obtained from the low-resolution image representation image.

The dictionary learning step obtains a dictionary comprising the basis images $\varvec{D}^{\textrm{high}}=\left\{ \varvec{d}_{1}^{\textrm{high}}, \varvec{d}_{2}^{\textrm{high}},\ldots , \varvec{d}_{D}^{\textrm{high}} \right\}$ from a set of patch images $\varvec{Y}^{\textrm{high}}=\left\{ \varvec{y}_1^{\textrm{high}},\varvec{y}_2^{\textrm{high}},\ldots ,\varvec{y}_{P_{\textrm{DL}}}^{\textrm{high}} \right\}$ where $P_{\textrm{DL}}$ is the total number of patch images used for dictionary learning. We simultaneously optimize a the high-resolution dictionary $\varvec{D}^{\textrm{high}}$ and a matrix with sparse vectors $\varvec{X}$ as follows:

$$\begin{aligned} \left( \varvec{D}_{\textrm{est}}^{\textrm{high}},\varvec{X}_{\textrm{est}} \right) = \arg \min _{\left( \varvec{D}^{\textrm{high}}, \varvec{X} \right) } \Vert \varvec{Y}^{\textrm{high}} - \varvec{D}^{\textrm{high}} \varvec{X} \Vert _2^2 + \lambda \Vert \varvec{X} \Vert _1 \end{aligned}$$

(4)

where the first term represents the discrepancies between the high-resolution patch images $\varvec{Y}^{\textrm{high}}$ and the corresponding reconstructed images $\varvec{D}^{\textrm{high}} \varvec{X}$, and the second term is an $L_1$ regularization term for sparsity condition^36,37. $\lambda$ is a regularization parameter that controls the sparsity. Note that the dictionaries are assumed to have arbitrary frequency elements in the proposed method by Eq. (4), whereas only specific high-frequency elements are considered for the reconstruction in pre-existing methods^10,26. This generalized framework in the proposed method is formulated since various frequency elements should be considered for understanding rock textures, which include both low- and high-frequency elements, whereas specific high-frequency elements are rather important for efficiently obtaining face and object images with clear edges in computer graphics. The low-resolution dictionary $\varvec{D}_{\textrm{est}}^{\textrm{low}}$ is derived from the obtained high-resolution dictionary $\varvec{D}_{\textrm{est}}^{\textrm{high}}$ by using the downsampling matrix $\varvec{L}$ as follows: $\varvec{D}_{\textrm{est}}^{\textrm{low}}=\varvec{L} \varvec{D}_{\textrm{est}}^{\textrm{high}}$.

In the super-resolution step, a sparse vector is estimated that can reconstruct the low-resolution patch images $\tilde{\varvec{Y}}^{\textrm{low}}=\left\{ \tilde{\varvec{y}}_1^{\textrm{low}},\tilde{\varvec{y}}_2^{\textrm{low}},\ldots ,\tilde{\varvec{y}}_{P_{\textrm{SR}}}^{\textrm{low}} \right\}$ ($P_{\textrm{SR}}$: the total number of patch images for super-resolution) in terms of a small number of basis images. For appropriate reconstruction, a matrix with sparse vectors, $\tilde{\varvec{X}}$, is optimized by minimizing the following expression:

$$\begin{aligned} \tilde{\varvec{X}}_{\textrm{est}} = \arg \min _{\tilde{\varvec{X}}} \Vert \varvec{{\tilde{Y}}}^{\textrm{low}}- \varvec{D}_{\textrm{est}}^{\textrm{low}} \tilde{\varvec{X}} \Vert _2^2 +\lambda \Vert \tilde{\varvec{X}} \Vert _1 \end{aligned}$$

(5)

By assuming that high- and low-resolution images have the weight matrix $\tilde{\varvec{X}}$ in common, high-resolution patch images $\tilde{\varvec{Y}}_{\textrm{est}}^{\textrm{high}}$ can be reconstructed from the high-resolution dictionary and estimated weight matrix $\tilde{\varvec{X}}_{\textrm{est}}$ as follows:

$$\begin{aligned} \tilde{\varvec{Y}}_{\textrm{est}}^{\textrm{high}}= \varvec{D}_{\textrm{est}}^{\textrm{high}} \tilde{\varvec{X}}_{\textrm{est}}. \end{aligned}$$

(6)

The high-resolution reconstructed image is further refined by considering reconstruction in both the high- and low-resolution domains. See the Supplementary Material for further details.

Settings for dictionary learning and super-resolution

The dictionary learning involves first creating a high-resolution dictionary. For this study, $P_{\textrm{DL}}=4964$ patches were prepared from six rock CT images used to conduct the dictionary creation. Individual CT images were $1856 \times 1856$ pixels.

The rock CT image used for training was divided into patches, and the average of the CT values for each patch was calculated. The data obtained by subtracting the average value from the CT value for each patch was used for training. The patch size of the high-resolution images was set to $48 \times 48$ pixels after some trial and error, and the number of basis images in the dictionary was set to $D=200$. When the initial high-resolution dictionary $\varvec{D}^{\textrm{high}}$ was fixed, the sparsity coefficients $\varvec{X}$ for the linear combination of high-resolution CT images were estimated. Then, when the estimated sparsity coefficients $\varvec{X}$ were fixed, a high-resolution dictionary $\varvec{D}^{\textrm{high}}$ suitable for sparse representation was obtained by imposing constraints to normalize the scales of the bases. We iterated these two tasks to obtain a high-resolution dictionary $\varvec{D}^{\textrm{high}}$ by using the result when the values converged. A low-resolution dictionary $\varvec{D}^{\textrm{low}}$ was created by downsampling the high-resolution dictionary by a factor of 1/4 using a smoothing filter.

The hyperparameters ($\lambda$, c and $\beta$) for dictionary learning were optimized by setting them to different values and selecting those that resulted in a small reconstruction error.

Results and discussion

Estimation from downsampled low-resolution images

The sparse super-resolution method was applied to estimating high-resolution rock CT images from low-resolution rock CT images (Fig. 1). Figure 3 shows an example of the estimation results: (a) the low-resolution CT images artificially prepared by downsampling the high-resolution CT image, (b) the estimation with bicubic interpolation, (c) estimation with the proposed sparse super-resolution, and (d) the true high-resolution image. First, the fine structures around localized red area is considered, which correspond to spinel grains. With bicubic interpolation, the boundary between red and yellow areas around localized red area was smoothly curved, and the intricate structures around the boundary were lost. In contrast, the proposed sparse super-resolution was able to reconstruct the complex structures around the boundary between the red (spinel grain) and yellow parts (magnetite rims replacing spinel).

Next, we focus on the linear structure in the upper right area of the images in Fig. 3, which corresponds to the serpentine veins. With bicubic interpolation, the boundary between the green and blue areas is overly smooth with a smooth contour. With the proposed sparse super-resolution, the intricate parts of the green and blue areas were reconstructed with a complex structure, which is similar to the true image. For example, some tube-like structures perpendicular to the serpentine veins can be observed in both the true image and image estimated by sparse super-resolution, while such tube-like structures were not reconstructed by bicubic interpolation. Therefore, the proposed sparse super-resolution accurately reconstructed the detailed structures of the serpentine vein and the reaction zone at its boundary.

Finally, the texture in the lower right area the images in Fig. 3 is considered, which corresponds to the mesh structure. The true image shows a complex texture with light blue pixels in a deep blue area. The sparse super-resolution reconstructed such complex structures. In contrast, bicubic interpolation reconstructed smooth textures in a deep blue area. These results show that the proposed sparse super-resolution framework reconstructed textures more accurately than the conventional bicubic interpolation, including spinel grains, their replacement textures, and serpentine veins.

To evaluate the effectiveness of the proposed framework for rock CT images in more detail, histograms of pixels in estimated high-resolution images are shown (middle subfigures of Fig. 3). These histograms may reflect some physical characteristics for a specific area in the rock CT images such as the modal abundances of minerals and porosities. Thus, it is important to evaluate histograms for the similarity between the true and estimated high-resolution images. The histograms for sparse super-resolution showed a smooth peak with pixel values between 1500 and 9000. The histogram for bicubic interpolation showed a sharp peak with pixel values between 2000 and 9000. The histogram for the true high-resolution image showed a smooth peak with pixel values between 1500 and 9000. These results suggest that the proposed sparse super-resolution method reconstructed the distribution of pixel values more precisely than bicubic interpolation.

The spatial distribution of pixel values is also evaluated as shown in the bottom row of Fig. 3. Here the spatial distribution of the pixel values is considered for the high-resolution images over a horizontal distance of $y=70{,}160$. Bicubic interpolation obtained a smoother spatial distribution than sparse super-resolution. When compared with the true high-resolution image, super-resolution accurately reproduced not only the global structure of the distribution (i.e., mineral grains and veins) but also fluctuations in the distribution (i.e., mineral replacement textures).

For a general validation of the proposed method, Fig. 4 shows the results at different position in the CT image. Fine structures in true high-resolution image, such as textures in the blue area and a rough boundary around the red area, were reconstructed by sparse super-resolution, whereas these structures were oversimplified by bicubic interpolation. This tendency can be confirmed in the histogram and pixel values in the cross section with the different methods. For example, the histogram obtained for the sparse super-resolution showed maximal counts, sharpness, and pixel value for maximal counts, which are more similar to the histogram of true high-resolution image than that obtained for the bicubic interpolation method (Fig. 4, middle). These results show that the basis images extracted from rock CT images play an important role in the estimation of complex structure.

Characteristics of dictionary

For dictionary learning, the basis images were initially set randomly. As the dictionary learning proceeded, the spatial features hidden in the training CT image dataset were extracted, as shown in Fig. 5a. Note that the analyses for the basis images can be conducted since the sparse super-resolution is a white-box type method, which is in cotrast to black-box type methods, such as deep learning-based super-resolution methods^27,28,29. All basis images showed not only with large-scale (i.e., low spatial frequency) structures but also small-scale (i.e., high spatial frequency) structures. The coexistence of structures at different scales required enhanced estimation accuracy. We evaluated the estimation accuracy of the proposed method in terms of the peak signal-to-noise ratio (PSNR) and the structural similarity index measure (SSIM)^9,38,39, as shown in Fig. 6. See the Supplementary Material for the definitions of these indices. The PSNR increased with the iterations of dictionary learning but was quite low when the initial randomly set basis images were used. The SSIM also increased with the number of iterations. The improvement in image quality according to these indices corresponds to the adaptation of basis images included in the dictionary.

The basis images were further analyzed by using clustering and dimension reduction techniques. For clustering, the k-means$++$ algorithm was applied to vectors representing basis images⁴⁰. This algorithm avoids the initial-value dependence of the original k-means clustering. To visualize the results in two-dimensional space, a dimension reduction method called the t-distributed stochastic neighbor embedding (t-SNE) method⁴⁰ was applied to the clustering results. As shown in Fig. 5b,c, the basis vectors were separated into eight clusters. The basis images had localized red areas around specific regions and complex textures. The level of localization depended on the cluster. Red areas in cluster 6 were less localized but more dispersed, while the red areas in clusters 1, 2, and 4 were more localized. Cluster 1 had a localized red area (i.e., high CT number) toward the top, whereas cluster 2 had a similar area but toward the bottom.

To investigate the effect of source images on dictionary learning and super-resolution, we conducted dictionary learning using other images included in the standard image datasets called IAPR TC-12 (International Association of Pattern Recognition, Technical Committee 12) collection⁴¹ and SIDBA (Standard Image Data-BAse)⁴², including people, buildings, and landscapes (Fig. 7a). The dictionary obtained from standard image dataset (Fig. 7b) includes clearer boundaries and more straight-line structure than the dictionary obtained from the rock CT images (Fig. 5a). A comparison between the high-resolution images obtained from the two dictionaries (Fig. 8) indicates that the dictionary obtained from the rock CT images (Fig. 8b) provided more accurate high-resolution images than the dictionary obtained from the standard images (Fig. 8c). The high-resolution image reconstructed using the dictionary from rock CT images reproduced fine structures including textures and the boundary around red regions, while the high-resolution image reconstructed using the dictionary from standard images included many sharp structures not seen in the true high-resolution image. The quantitative evaluation indices also demonstrated the superiority of the dictionary obtained from rock CT images, which had a greater PSNR (30.12) than the dictionary obtained from standard images (27.32) as well as SSIM. These results suggest that the basis images extracted by sparse super-resolution include the essential characteristics of the rock textures.

Application to recorded low-resolution images

Here, the proposed method was applied to low-resolution images, which were directly recorded by the CT scanner. The high-resolution images (Fig. 9b,c) were estimated from a low-resolution image (Fig. 9a) directly recorded by the CT scanner. The high-resolution image that was estimated by bicubic interpolation (Fig. 9b) shows a rather simple and clear structure. In contrast, the high-resolution image that was estimated by sparse super-resolution (Fig. 9c) successfully shows more complex structures. For example, in Fig. 9c, the detailed structures of spinel are shown by red areas in the upper-left region, and complex mesh textures are shown by deep blue areas.

The high-resolution image estimated from directly recorded low-resolution image by the sparse super-resolution (Fig. 9c) is found to be more similar to the high-resolution image recorded at almost same position directly (Fig. 9d), compared with that estimated by bicubic interpolation (Fig. 9b). Note that the position of the high-resolution image (Fig. 9d) is almost the same as that of the low-resolution image but is not the same exactly. From these results, the proposed sparse super-resolution is found to be effective for estimating high-resolution rock CT image.

Note that there is still a discrepancy between the high-resolution image that was estimated from the recorded low-resolution image by the sparse super-resolution (Fig. 9c) and the high-resolution image that was recorded at almost the same position (Fig. 9d). This is probably due to the down-sampling matrix simply that was assumed in the present study. A new dictionary learning method for estimating the accurate relationship between high- and low-resolution images with appropriate position correspondence is required to reduce this discrepancy and realize more accurate estimation in future studies.

Concluding remarks

In this study, we have proposed a method to apply sparse super-resolution to rock CT images. In contrast to interpolation algorithms, the proposed method estimates the image by sparse representation and dictionary learning to reconstruct missing information in the low-resolution image. The experimental results showed that sparse super-resolution method was better at reconstructing details of rock CT images than bicubic interpolation. The superiority of the proposed method was quantified by PSNR and SSIM. These results confirmed that the proposed method extracts the important features of rock CT images and obtains better results than conventional interpolation, which may be a significant contributions to practical applications.

The medical CT scanner used for the long geological core analyses provides low resolution images ($>0.1$ voxel size) on the mineral-scale microstructures but is strong tool for quantitative analyses of the continuous geological structures over 100 m. In contrast, the researchers carry out in the detailed microstructural analyses from the limited samples in their labs, by using for example high-resolution X-ray CT. Therefore, if we develop a super-resolution techniques to link the lab micro CT scanner and medical CT in D/Y Chikyu, we can connect the phenomena from nano to micro scale to kilometer scales. This study only shows the super-resolution in the same scanner with different magnification or artificial images created by simple down sampling, it is important to find the way the realistic down sampling that enables to link over the images taken by the different scanners.

Data availability

The original and treated X-ray CT images are available from https://doi.org/10.6084/m9.figshare.5458696. Codes and further data can be made available by the corresponding author (T.O.) upon reasonable request.

References

Tonai, S. et al. A new method for quality control of geological cores by X-ray computed tomography: Application in IODP expedition 370. Front. Earth Sci. 7, 117 (2019).
Article ADS Google Scholar
Kelemen, P. B., Matter, J. M., Teagle, D. A. H., Coggon, J. A. & The Oman Drilling Project Science Team. In Proceedings of the Oman Drilling Project. College Station, TX: International Ocean Discovery Program. (2020).
Støren, E. N., Dahl, S. O., Nesje, A. & Paasche, Ø. Identifying the sedimentary imprint of high-frequency holocene river floods in lake sediments: Development and application of a new method. Quatern. Sci. Rev. 29, 3021–3033 (2010).
Article ADS Google Scholar
Fortin, D. et al. Destructive and non-destructive density determination: Method comparison and evaluation from the Laguna Potrok Aike sedimentary record. Quatern. Sci. Rev. 71, 147–153 (2013).
Article ADS Google Scholar
Reilly, B., Stoner, J. & Wiest, J. Sed CT: Matlab tools for standardized and quantitative processing of sediment core computed tomography (CT) data collected using a medical CT scanner. Geochem. Geophys. Geosyst. 18, 3231–3240 (2017).
Article ADS Google Scholar
Okazaki, K. et al. Major mineral fraction and physical properties of carbonated peridotite (Listvenite) from icdp oman drilling project hole BT1B inferred from X-ray CT core images. J. Geophys. Res. Solid Earth 126, e2021JB022719. https://doi.org/10.1029/2021JB022719 (2021).
Article ADS CAS Google Scholar
Polak, A., Elsworth, D., Liu, J. & Grader, A. S. Spontaneous switching of permeability changes in a limestone fracture with net dissolution. Water Resour. Res.https://doi.org/10.1029/2003WR002717 (2004).
Article Google Scholar
Okamoto, A., Tanaka, H., Watanabe, N., Saishu, H. & Tsuchiya, N. Fluid pocket generation in response to heterogeneous reactivity of a rock fracture under hydrothermal conditions. J. Geophys. Res. 44, 10306–10315. https://doi.org/10.1002/2017GL075476 (2017).
Article Google Scholar
Szeliski, R. Computer Vision: Algorithms and Applications (Springer, 2010).
MATH Google Scholar
Yang, J., Wright, J., Huang, T. S. & Ma, Y. Image super-resolution via sparse representation. IEEE Trans. Image Process. 19, 2861–2873. https://doi.org/10.1109/TIP.2010.2050625 (2010).
Article ADS PubMed MATH MathSciNet Google Scholar
Freeman, W., Jones, T. & Pasztor, E. Example-based super-resolution. IEEE Comput. Graph. Appl. 22, 56–65. https://doi.org/10.1109/38.988747 (2002).
Article Google Scholar
Dong, C., Loy, C. C., He, K. & Tang, X. Learning a deep convolutional network for image super-resolution. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6–12, 2014, Proceedings, Part IV 13, 184–199 (Springer, 2014).
Dong, C., Loy, C. C., He, K. & Tang, X. Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38, 295–307 (2015).
Article Google Scholar
Ledig, C. et al. Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4681–4690 (2017).
Wang, Z., Chen, J. & Hoi, S. C. Deep learning for image super-resolution: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 43, 3365–3387 (2020).
Article Google Scholar
Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning (MIT Press, 2016). http://www.deeplearningbook.org.
Grohs, P. & Kutyniok, G. Mathematical Aspects of Deep Learning (Cambridge University Press, 2022).
Book MATH Google Scholar
Roberts, D. A., Yaida, S. & Hanin, B. The Principles of Deep Learning Theory (Cambridge University Press, 2022).
Book MATH Google Scholar
Starck, J.-L., Murtagh, F. & Fadili, J. Sparse Image and Signal Processing: Wavelets and Related Geometric Multiscale Analysis (Cambridge University Press, 2015).
Book MATH Google Scholar
Gregory, P. Bayesian Logical Data Analysis for the Physical Sciences (Cambridge University Press, 2005).
Book MATH Google Scholar
Honma, M. et al. Imaging black holes with sparse modeling. J. Phys. Conf. Ser. 699, 012006 (2016).
Article Google Scholar
Omori, T. & Hukushima, K. Extracting nonlinear spatiotemporal dynamics in active dendrites using data-driven statistical approach. J. Phys. Conf. Ser. 699, 012011 (2016).
Article Google Scholar
Otsuka, S. & Omori, T. Estimation of neuronal dynamics based on sparse modeling. Neural Netw. 109, 137–146. https://doi.org/10.1016/j.neunet.2018.10.006 (2019).
Article PubMed Google Scholar
Yokoi, M. & Omori, T. Sparse modeling approach for estimating odor pleasantness from multi-dimensional sensor data. In 2020 IEEE 2nd Global Conference on Life Sciences and Technologies (LifeTech), 187–188 (IEEE, 2020).
Kuwatani, T. et al. Sparse isocon analysis: A data-driven approach for material transfer estimation. Chem. Geol. 532, 119345 (2020).
Article ADS CAS Google Scholar
Jiang, C., Zhang, Q., Fan, R. & Hu, Z. Super-resolution CT image reconstruction based on dictionary learning and sparse representation. Sci. Rep. 8, 1–10 (2018).
ADS Google Scholar
Stiglic, G. et al. Interpretability of machine learning-based prediction models in healthcare. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 10, e1379 (2020).
Article Google Scholar
Escalante, H. J. et al. Explainable and Interpretable Models in Computer Vision and Machine Learning (Springer, 2018).
Book Google Scholar
Gunning, D. et al. XAI-explainable artificial intelligence. Sci. Robot. 4, eaay7120 (2019).
Article PubMed Google Scholar
Nicolas, A., Boudier, F., Ildefonse, B. & Ball, E. Accretion of Oman and United Arab Emirates ophiolite-discussion of a new structural map. Mar. Geophys. Res. 21, 147–180 (2000).
Article Google Scholar
Pallister, J. S. & Hopson, C. A. Samail Ophiolite plutonic suite: Field relations, phase variation, cryptic variation and layering, and a model of a spreading ridge magma chamber. J. Geophys. Res. 86, 2593–2644. https://doi.org/10.1029/JB086iB04p02593 (1981).
Article ADS CAS Google Scholar
Yoshida, K. et al. Fluid infiltration through oceanic lower crust in response to reaction-induced fracturing: Insights from serpentinized troctolite and numerical models. J. Geophys. Res. Solid Earth 125, e2020JB020268. https://doi.org/10.1029/2020JB020268 (2020).
Article ADS Google Scholar
Bosch, D. et al. Deep and high-temperature hydrothermal circulation in the oman ophiolite-petrological and isotopic evidence. J. Petrol. 45, 1181–1208 (2004).
Article ADS CAS Google Scholar
Yoshida, K. et al. Geological records of transient fluid drainage into the shallow mantle wedge. Sci. Adv. 9, eade6674. https://doi.org/10.1126/sciadv.ade6674 (2023).
Article CAS PubMed PubMed Central Google Scholar
Ito, M., Kuwatani, T., Oyanagi, R. & Omori, T. Data-driven analysis of nonlinear heterogeneous reactions through sparse modeling and Bayesian statistical approaches. Entropy 23, 824. https://doi.org/10.3390/e23070824 (2021).
Article ADS PubMed PubMed Central MathSciNet Google Scholar
Tibshirani, R. Regression shrinkage and selection via the Lasso. J. R. Stat. Soc. Ser. B (Methodol.) 58, 267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x (1996).
Article MATH MathSciNet Google Scholar
Tomioka, R. & Sugiyama, M. Dual-augmented Lagrangian method for efficient sparse reconstruction. IEEE Signal Process. Lett. 16, 1067–1070. https://doi.org/10.1109/LSP.2009.2030111 (2009).
Article ADS Google Scholar
Huynh-Thu, Q. & Ghanbari, M. Scope of validity of psnr in image/video quality assessment. Electron. Lett. 44, 800–801 (2008).
Article ADS Google Scholar
Wang, Z., Bovik, A. C., Sheikh, H. R. & Simoncelli, E. P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 13, 600–612 (2004).
Article ADS PubMed Google Scholar
Vassilvitskii, S. & Arthur, D. $k$-means$++$: The advantages of careful seeding. In Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, 1027–1035 (2006).
Grubinger, M., Clough, P., Müller, H. & Deselaers, T. The IAPR TC-12 benchmark: A new evaluation resource for visual information systems. In International Workshop OntoImage’2006 Language Resources for Content-Based Image Retrieval, vol. 2 (2006).
Onoe, M. SIDBA: Standard image data base. Multidimensional Image Processing Center Report 79-1 (1979).

Download references

Acknowledgements

This study was financially supported by JSPS KAKENHI Grant Numbers JP15KK0010, JP21H03509 (T.O.), JP16H06347 (K.M.), JP17H02981, JP18K18778, JP22H04932 and JP22H05295 (A.O.) and JST CREST Grant Numbers JPMJCR1755, JPMJCR1861, JPMJCR1914 (T.O.). The authors thank all members of the Oman Drilling Project for the sample collection, descriptions, and X-ray CT scanning onboard Chikyu.

Author information

Authors and Affiliations

Department of Electrical and Electronic Engineering, Graduate School of Engineering, Kobe University, Kobe, 657-8501, Japan
Toshiaki Omori & Shoi Suzuki
Center for Mathematical and Data Sciences, Kobe University, Kobe, 657-8501, Japan
Toshiaki Omori
Center of Optical Scattering Image Science, Kobe University, Kobe, 657-8501, Japan
Toshiaki Omori
Department of Earth and Planetary Sciences, Graduate School of Environmental Studies, Nagoya University, Nagoya, 464-8601, Japan
Katsuyoshi Michibayashi
Department of Environmental Studies for Advanced Society, Graduate School of Environmental Studies, Tohoku University, Sendai, 980-8579, Japan
Atsushi Okamoto

Authors

Toshiaki Omori
View author publications
You can also search for this author in PubMed Google Scholar
Shoi Suzuki
View author publications
You can also search for this author in PubMed Google Scholar
Katsuyoshi Michibayashi
View author publications
You can also search for this author in PubMed Google Scholar
Atsushi Okamoto
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.O. and A.O. designed research; T.O. S.S., and A.O. performed research; T.O. and S.S. developed methodology of sparse super-resolution and analyzed data; T.O. and A.O. wrote the original draft of the manuscript; T.O., K.M, and A.O. reviewed and edited the manuscript; all authors approved the final version of the manuscript.

Corresponding author

Correspondence to Toshiaki Omori.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Omori, T., Suzuki, S., Michibayashi, K. et al. Super-resolution of X-ray CT images of rock samples by sparse representation: applications to the complex texture of serpentinite. Sci Rep 13, 6648 (2023). https://doi.org/10.1038/s41598-023-33503-6

Download citation

Received: 25 December 2022
Accepted: 13 April 2023
Published: 24 April 2023
DOI: https://doi.org/10.1038/s41598-023-33503-6

This article is cited by

Influence analysis of complex crack geometric parameters on mechanical properties of soft rock
- Yang Zhao
- Xin He
- Atsushi Sainoki
International Journal of Coal Science & Technology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.