Automatic segmentation tool for 3D digital rocks by deep learning

Phan, Johan; Ruspini, Leonardo C.; Lindseth, Frank

doi:10.1038/s41598-021-98697-z

Download PDF

Article
Open access
Published: 27 September 2021

Automatic segmentation tool for 3D digital rocks by deep learning

Johan Phan^1,2,
Leonardo C. Ruspini^2,3 &
Frank Lindseth¹

Scientific Reports volume 11, Article number: 19123 (2021) Cite this article

4490 Accesses
13 Citations
Metrics details

Subjects

Abstract

Obtaining an accurate segmentation of images obtained by computed microtomography (micro-CT) techniques is a non-trivial process due to the wide range of noise types and artifacts present in these images. Current methodologies are often time-consuming, sensitive to noise and artifacts, and require skilled people to give accurate results. Motivated by the rapid advancement of deep learning-based segmentation techniques in recent years, we have developed a tool that aims to fully automate the segmentation process in one step, without the need for any extra image processing steps such as noise filtering or artifact removal. To get a general model, we train our network using a dataset made of high-quality three-dimensional micro-CT images from different scanners, rock types, and resolutions. In addition, we use a domain-specific augmented training pipeline with various types of noise, synthetic artifacts, and image transformation/distortion. For validation, we use a synthetic dataset to measure accuracy and analyze noise/artifact sensitivity. The results show a robust and accurate segmentation performance for the most common types of noises present in real micro-CT images. We also compared the segmentation of our method and five expert users, using commercial and open software packages on real rock images. We found that most of the current tools fail to reduce the impact of local and global noises and artifacts. We quantified the variation on human-assisted segmentation results in terms of physical properties and observed a large variation. In comparison, the new method is more robust to local noises and artifacts, outperforming the human segmentation and giving consistent results. Finally, we compared the porosity of our model segmented images with experimental porosity measured in the laboratory for ten different untrained samples, finding very encouraging results.

Reconstructing high fidelity digital rock images using deep convolutional neural networks

Article Open access 11 March 2022

Exploring microstructure and petrophysical properties of microporous volcanic rocks through 3D multiscale and super-resolution imaging

Article Open access 24 April 2023

Generating 3D images of material microstructures from a single 2D image: a denoising diffusion approach

Article Open access 18 March 2024

Introduction

Capturing microscopic properties of porous rocks and their interaction with fluids play an instrumental role in many important domains such as carbon storage, oil and gas recovery, and underground water management. The use of 2D and 3D imaging techniques to capture these properties, known as Digital Rock Analysis (DRA), is becoming a common practice in the above-mentioned industrial applications^1,2,3. However, in order to do any computation, the images need to be segmented into their constituent phases. Image segmentation is the process of labeling voxels into classes, which can be later used for the characterization of physical properties. Several segmentation techniques were developed in the past⁴. Due to the nature of the problem these tools include processes such as thresholding and clustering. However, these methods require a large degree of manual interaction and quality control. Some of the conventional methods are global multi-Otsu thresholding⁵, Marker-controlled Watershed⁶, and converging active contours. In addition, these methods require the use of different types of filters to deal with noise and artifacts. The recent advances in deep-learning technologies based on neural networks have led to the emergence of high-performance automatic segmentation techniques, with the highest accuracy rates on popular benchmarks⁷. For 3D images, deep learning-based segmentation techniques using Convolutional Neural Networks (CNNs)⁸ have been successfully used in many fields such as autonomous driving⁹, point cloud analysis¹⁰, medical image analysis^11,12.

Recently, several forms of multivariant classifiers using machine learning have been used to segment 2D micro-CT rock images^13,14. These last methods require the user to define/paint some areas with the desired labels to train with and then perform a segmentation of the whole image. The high degree of abstraction of deep learning methods has proven to be very effective compared to other segmentation techniques. In DRA, deep learning based segmentation has been successfully used on 2D binary segmentation¹⁵ and multi-mineral segmentation^16,17. These last two works have explored the use of different CNNs architectures, such as SegNet¹⁸, ResNet, UNet, UResNet-3D, etc. These studies concluded that the more complex architecture U-ResNet-3D performed better for the majority of the cases in terms of accuracy and topological similarity properties. As reported in this last work, the authors used a single dataset (1100 $\times $ 1100 $\times $ 2200 voxels) for both training and testing. In this respect, due to the small difference in terms of image properties and gray-scale values within a single sample, it is expected that the more complex the network the higher probability of overfitting the problem. Additionally, as we have experienced, training and testing on the same image certainly lead to poor performance on untrained images and different types of rock.

Due to resolution limitations of current imaging technology (field of view vs. spatial resolution), a typical rock image contains a non-neglectful amount of sub-resolution porosity, i.e. the pores of these regions are below the image resolution. Therefore, a three-phase segmentation is necessary to account for the effect of these areas on flow properties^19,20,21. In this work, unresolved porosity regions are also referred to as micro-phase.

The main goal of this work is to study the possibility of using a CNN model to perform automatic three-phase segmentation of real 3D Xray micro-CT images. This means:

1.
No need for human intervention in the segmentation process.
2.
Perform well with different rock types.
3.
No filtering. Handling typical noises and artifacts, such as Gaussian noise, beam-hardening, ring artifacts.
4.
Perform well with untrained data.
5.
Preserve connectivity and continuity of the different regions (3D information).

Materials and methods

In this work we have developed a new segmentation tool, using deep-learning, which specializes in three-phase segmentation. This tool and other AI tools for Digital Rocks are available under the SmartRocks project (smartrocks.com).

Model architecture

The Neural Network architecture used in this work is inspired by the architecture used by many top teams in the TGS Salt Identification Challenge on Kaggle^22,23. It is based on an improved version of the popular U-net architecture¹² with SE-ResNeXt-50 encoder²⁴ initialized with pre-trained parameters. A general view of all the components in this architecture is presented in Fig. 1. We used 2D convolutional blocks, that takes a stack of consecutive slices where each slice is treated as an input channel. In this way, using a 2D convolution allows us to decrease the computational cost and redirect the resources to train a more complex and deeper architecture. Moreover, we maximize the field of view on the X–Y plane (i.e. rotation plane), which is beneficial to deal with global noises and artifacts (e.g. ring artifacts, beam hardening) due to the nature of image acquisition. It also means that we can use pre-trained blocks trained with large datasets, such as the ImageNet dataset²⁵. The common challenge of using 2D convolutional based network to segment 3D images is to preserve the depth information/connectivity of the orthogonal direction (Z direction in our case). To segment a 3D image, our model iterates over every slice of the image and takes a stack of slices containing the center slice and 7 slices in each direction, where 3 of them are consecutive and 4 are skip slices. Finally, our model produces a probability map for each phase, which is build as an average of 2D probability maps for each layer. The final segmentation is generated by selecting the phase with the highest probability for each voxel. To improve the model’s ability to interpret spatial information (cross-channel), we used a scSE block (Concurrent Spatial and Channel Squeeze and Excitation)²⁶ as our decoder. In order to improve the output connectivity, a deep supervision block i.e., convolution + up-sample layers are added at the end of each decoder block to implement the hyper-column technique²⁷. Another important part of our architecture is a global, center block made of a 1 $\times $ 1 convolutional block, which has a similar function to a dense block with fully connected layers. The goal of this intermediate global block is to give the network the capacity of a global classifier, to handle information such as rock type and non-local artifacts.

Training dataset

Our training dataset consists of 10 3D Xray micro-CT images of different rock types. The properties of the images are listed in Table 1. These images were acquired using different procedures: scanners (e.g. Xradia, Heliscan), filters, exposure times, cleaning/cropping, reconstruction algorithms, etc. The main idea of using a diverse training dataset is to give our model the ability to generalize as much as possible. The ground truth segmentation for each of these images was obtained using different open and commercial software packages for filtering and segmenting (Avizo, Pergeos, ImageJ, Mango). In this way, we compensate for any systematic error in the acquisition, filtering, and segmentation procedures for our training data. All the selected images have low noise level and high-quality, which makes the segmentation step relatively easy. We have experienced that using segmentations from noisy images deteriorates the network’s ability to generalize and produces good results on untrained datasets. For the Reservoir carbonates and Savonnieres samples, the utilized segmented images were obtained from dry/wet based porosity maps^28,29, which allow us to have a proper estimation of open and micro-phase porosities.

In addition, we have used a high degree of data augmentation to improve the robustness and generalization capabilities of the model. Besides the typical data augmentation techniques like rotating, random cropping, adding Gaussian noise and/or Salt and Pepper noise, we have also used more domain-specific noise in our training pipelines to match the real problems such as ring artifacts, stripe artifacts, intensity variations, and local blurring.

Table 1 Our training dataset consists of 10 high-resolution and low noise 3D Xray micro-CT images from different rock types.

Full size table

Validation workflow

In order to evaluate the performance of our segmentation tool, we have used synthetic generated data from known ground truth images. We followed an existing workflow³² for generating synthetic gray-scale images using the Astra-toolbox³³. Figure 2 shows the generation/validation workflow, where the ground truth segmented data get projected and reconstructed with filtered back projection in a parallel-beam geometry using a Ram-Lak filter. In addition, we can perform a proper noise sensitivity analysis since this workflow allows us to add different types of noise during the projection and reconstruction processes, reflecting the nature of noises/artifacts in real images. Figure 3 shows four examples of reconstructions from sinograms with different types of noise/artifacts.

Results

Comparison to traditional methods

We have generated several synthetic gray-scale images from real segmented ground truth images by adding different types of noise in the generation process, as described in the previous section. Figure 4 shows an example of four synthetic reconstruction cases: (a) clean reconstruction without noise, (b) reconstruction with Gaussian noise pre-projection, (c) reconstruction with Gaussian noise post-projection, (d) reconstruction with ring artifacts. For these four cases, we compare the results from our method to conventional segmentation methods: multi-Otsu thresholding and Marker-controlled Watershed. In this figure we show the middle slice of the 3D segmented images. The calculated accuracy and geometrical properties for all these cases are presented in Table 2. For each phase we have measured the total voxel-wise accuracy, IOU (Intersection Over Union). We have also calculated the Euler number (or Euler characteristic), a topological invariant that describes an object in a topological way regardless of the way it bent, and the connected components count, i.e the number of connected regions, using a 26-neighborhood connection algorithm³⁴.

Table 2 Comparison of our model and traditional segmentation methods for several synthetic cases: (a) no noise, (b) pre-projection noise, (c) post-projection noise, and (d) ring artifacts.

Full size table

All methods are able to produce a correct segmentation on the noise-free case, Fig. 4a. However, it is interesting noticing that our method already produced closer Euler and connected counts to the ground truth. The micro-phase region is more disconnected in the case of multi-Otsu and Watershed segmentation. In the next case, Fig. 4b (i.e. Gaussian noise pre-projection), the results show a significant loss in accuracy for Otsu (58.43%) and watershed (55.66%) methods. These methods are very sensitive to Gaussian noise since multi-Otsu is based on global thresholding and watershed is based on neighbor gray-scale values. On the other hand, our method has shown a minor reduction in terms of voxel-wise accuracy (98.75%) and IOU, mostly due to the blurred border between the different phases. In the third case, Fig. 4 (i.e. Gaussian noise post-projection), the results show a significant variation in the fraction of each phase for conventional methods, even for open porosity which was relatively stable in the previous case. Despite the significant increment in the level of noise, our model showed 2% accuracy loss in terms of global accuracy, mostly due to the contrast reduction of interfaces, especially for micro-phase regions. This leads to a significant loss in IOU for the micro-phase, going down to 56%. However, as observed visually, this is a small error due to the small amount of micro-phase in the image (less than 1%). Finally in Fig. 4d, we show the last case (i.e. ring artefacts). The results are similar to the previous cases where multi-Otsu and watershed are really sensitive to the introduced noise performing very badly. As expected, the IOU of micro-phase for our method is lower than for the clean image (49.35%). As observer in this figure, the big local contrast of the gray values makes it harder to identify micro-phase regions. In general, these four cases are a good visual and qualitative description of how robust this type of methods is compared to conventional methods.

Noise sensitivity analysis

In this section, we analyse the accuracy of our model to different types of synthetic noise and artifacts for four different rock samples: a bentheimer sandstone, a reservoir sandstone, an estaillades carbonate, and a middle eastern carbonate. In Fig. 5, we can see a synthetic reconstruction of each of these samples without noise. Additionally, we present pore, micro-phase and solid fractions in Table 3. For each rock image and each noise type we have generated around 50 synthetic new images with different noise levels. We have then processed all of them with our segmentation model and measured the accuracy respect to the ground truth. In the Figs. 6, 7 and 8 we show the Reservoir Sandstone gray-scale image and its corresponding segmentation at four different noise levels for Gaussian noise pre-projection, Gaussian noise post-projection, and ring artifact, correspondingly. As shown, in all the cases the model results do not break, even when the level of noise goes far beyond realistic noise levels, as shown in Fig. 7d. To the authors knowledge there is no segmentation model or algorithm in the literature reporting this level of robustness to noise.

Table 3 Fraction distribution for the different rock images used in the noise sensitivty analysis.

Full size table

To quantify the level of noise introduced in each image we have used PSNR (Peak signal to noise ratio) measured in 8 bits. The PSNR between a reference image f and a test image g of size (X, Y, Z) is defined as:

$$\begin{aligned} {\text {PSNR}}(f, g)=10 \log _{10}\left( 255^{2} / {\text {MSE}}(f, g)\right), \end{aligned}$$

(1)

where

$$\begin{aligned} {\text {MSE}}(f, g)=\frac{1}{XYZ} \sum _{i=1}^{X} \sum _{j=1}^{Y} \sum _{k=1}^{Z}\left( f_{i j k}-g_{i j k}\right) ^{2}, \end{aligned}$$

(2)

so PSNR is an inverse logarithmic scale measured in decibels. The accuracy (IOU) vs. noise level (PSNR) curves for all these cases are shown in Fig. 9. As expected, the accuracy of the segmentation decreases monotonically for increasing levels of noise (lower PSNR). For the sandstone cases, with well-defined grains and pores, we observed that micro-phase is the most sensitive to noise and artifacts, while the accuracy of solid and pore are relatively stable even for extreme noise cases. So as expected, the larger the specific surface of a phase the more sensitive to noise this phase is. Accordingly, for the synthetic carbonate cases, without well-defined grains, the results show a slightly lower accuracy for pore than micro-phase regions. We observed that in these cases, the pores regions are more sensitive to noise than micro-phase regions. In general, for all types of noise we observe three different behaviours: no-influence, correlated-noise and model-breakage. For example, for Gaussian noise pre-projection, the inflection point between no-influence occurs at around PSNR = 35 dB and the model-breakage starts at around PSNR = 8 dB. In the first region it is hard to visually perceive any difference between segmented image and ground truth. In the second region, correlated-noise, we observe differences mainly located at the interfaces, starting from single voxels and small clusters. In the third region, model-breakage, one of the phases is affected significantly due to the added noise, most of the information defining the interfaces is missing. For this last region, small increases on the noise level produce significant accuracy loss. As observed in Fig. 7d,h, even for these extreme noise levels the solution maintains its coherence with the ground truth. When it comes to dealing with ring artifacts, Fig. 9c, it is important noticing that PSNR is less sensitive to these non-local artifacts. This means that for segmentations with similar IOUs, ring artifacts give a bigger visual effect than Gaussian noise. In these cases, as observed in Fig. 8, most of the error is concentrated at the center of the image where the information of local structure is completely missed.

In general terms, these results show that when trained appropriately this type of methods are extremely robust dealing with different types of noise and artifacts. In the next sections, we test these capabilities by evaluating the model for several types of real rock images and by comparing its behaviour to state-of-the-art human-assisted segmented images.

Results on real images

Comparison to human-assisted segmentation

Due to the difficulty of defining an objective ground truth for real images, in this section we have directed our focus on comparing the performance of our model with respect to several human-assisted segmentations using traditional processing workflows. In addition, we investigate the variation of human-assisted segmentations and evaluate its effect on derived properties. We have used two images corresponding to Bentheimer and Berea sandstones. These images are challenging due to local variations in the gray-scale values. However, these images are representative of the general type and level of noises found in micro-CT imaging. We have provided the raw images (as produced by the scanners) to 5 different expert users. Each user chose the tools and workflows they considered more relevant to solve the assigned task. All the users have performed some filtering steps before segmenting the images. They have used conventional filters such as anisotropic diffusion³⁵, beam hardening correction, median filter³⁶, non-local means filter³⁷ and ring artifacts removal; and segmentation methods such as multi-thresholding and marker-based watershed. They used implementations from open source and commercial packages such as: Avizo (TFS), Pergeos (TFS), ImageJ (open) and Mango (ANU).

Figure 10 shows the middle slice of the Bentheimer image and the corresponding segmentations from our model and the five users. In general terms, the two more severe problems in dealing with this image are the typical micro-CT stripes in Z direction (local gray-scale variations) and the high-density mineral artifacts. As observed, all the human-assisted segmentations seem to be very sensitive to these types of noise. This is reflected in a non-neglectful over-estimation of the micro-phase regions, which evidently due to the stochastic nature of the noise, it is not uniform for the different users. The segmentations corresponding to users 2 and 4 are clear examples of this problem. In addition, user 3 seems to have over-segmented the micro-phase in the interfaces between solid and pore. Figure 11 shows a detailed region of this image that has been heavily affected by high-density mineral artifact (local beam hardening). In this particular case, all the human-assisted segmentations reflect the second main problem, mentioned above. For all of the human-assisted segmentation cases we observe a significant over-estimation of the micro-phase around the high-density mineral regions due to a sharp shift in local brightness. In contrast, the segmentation of the model seems to handle these types of noise very well, even if not trained with this type of specific noise.

Figure 12 shows a slide of the gray-scale Berea image and the corresponding segmentation images. In terms of quality, this image has less noise and no significant artifact compared to the Bentheimer case. Figure 13 shows a more detailed crop where we can see a non-neglectful variation in the micro-phase volume for each segmentation. In Table 4, we summarized the measured and calculated properties on these segmented images for each of the users and our model. The calculated properties (i.e. permeability, formation factor and tortuosity) are calculated using a pore network model technique^2,38. As we can see, the differences in the segmentations are reflected in all the properties. A significant variation of the results between the users is observed, most notably for the micro-porosity fraction and permeability, which is a common problem when working with real images as reported in the past³⁹.

In the case of the Bentheimer sandstone, our method produced less micro-phase volume compared to the users, which is mainly due to its robustness to noise and artifacts as mentioned above. Our segmentation also has larger permeability compared to the users. This can be due to the larger open-pores (less micro-porosity) but also by the stripes of noise observed for most of the users, spanning the XY plane of the image. In this case, the users micro-phase volume covers a range of 3.51%. For Berea, the results from the users do not show a clear trend as previously (over-segmenting micro-phase). However, the variations of micro-phase volume are still quite significant ranging from 4.61 to 10.71%. Assuming a micro-phase volume as the users average, these variations translate to 75% variation for Bentheimer and a 86% variation for Berea (these numbers are worse assuming lower volumes). However, it is important to see that the model not only is more robust to noise but, additionally, it can help to reduce the human-usage variations since the model will perform equally independent of who is using it, since it does not require parameters tuning to do the segmentation work.

Table 4 Comparison of petrophysical properties between our method and human-assisted segmentations.

Full size table

Comparison to experimental results

In this section, we compare the results of our model to porosities measured experimentally. We used the dataset presented in Shah et al.⁴⁰, where the authors provided micro-CT images at different resolutions for 10 sandstone and carbonate samples and the corresponding experimental porosity, which was measured through the bulk volume by saturating the plugs with water. It is important to remark that these measurements were done on the whole cylindrical sample volume and the scanned region corresponds to a crop in the middle of the cylinder ($\sim $ 50%). However, as these rocks are relatively homogeneous it is expected that porosity values are in the same range. We used the images with the highest resolution of around 4 µm. These images are all 575 cubic voxels. Figures 14 and 15 show the middle slice of each image and the corresponding segmentation by our model. Table 5 shows the measured experimental porosity and the open and micro-phase fractions from our segmentation. For the sandstone cases, assuming a 50% porosity for micro-phase regions there is less than 1% difference compared to the experimental total porosity. For the carbonates, it is harder to assign a given porosity for the micro-phase since carbonates, in general, have a wider porosity distribution. For example, the Ketton segmentation, shown in Fig. 15f illustrates that most of the micro-phase regions have very close gray-scale value to the solid region and, thus, a lower porosity. Another clear example is the case of Middle Eastern 3 carbonate shown in Fig. 15g, where the micro-phase regions are covering the whole image (96%). In such cases, it would be necessary to use other types of imaging techniques, such as dry/wet based porosity map^28,29, instead of a three-phase segmentation. For the rest of the carbonate cases, the segmentation performed by our model seems visually correct and the porosities are in the same range as the experiments.

Table 5 Comparison between experimental and our model segmentation porosity for several sandstone and carbonate micro-CT-images.

Full size table

Conclusion

We have developed a deep learning based three-phase segmentation model and trained it on multiple 3D micro-CT rock images with a wide range of domain-specific augmentation steps. We then studied our model’s performance on synthetic and real images in terms of accuracy and physical properties. Based on these results, it is clear that our segmentation model is capable of producing high-quality segmentations even when given noisy and low-quality input images. We have further validated our model’s performance with experimental results from other studies on multiple types of rocks. In addition, we compared the performance of our segmentation model to five different expert users. The segmentation results strongly indicated that the model is better at dealing with typical image noises compared to the expert users.

As a summary, the presented results demonstrate the potential capacity of a fully automated segmentation workflow, which would lead to a significant improvement on the image processing step compared to current processing workflows. It is worth mentioning that even if this study focuses on three-phase segmentation, it is relatively simple to extend the number of phases handled by the tool, given the availability of the necessary training data.

References

Blunt, M. et al. Pore-scale imaging and modelling. Adv. Water Resour. 51, 197 (2013).
Article ADS Google Scholar
Ruspini, L., Farokhpoor, R. & Øren, P. Pore-scale modeling of capillary trapping in water-wet porous media: A new cooperative pore-body filling model. Adv. Water Resour. 108, 1–14 (2017).
Article ADS Google Scholar
Bultreys, T. et al. Verifying pore network models of imbibition in rocks using time-resolved synchrotron imaging. Adv. Water Resour. 56, e2019WR026587 (2020).
ADS Google Scholar
Iassonov, P., Gebrenegus, T. & Tuller, M. Segmentation of x-ray computed tomography images of porous materials: A crucial step for characterization and quantitative analysis of pore structures. Water Resour. Res. https://doi.org/10.1029/2009WR008087 (2009).
Article Google Scholar
Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9, 62–66 (1979).
Article Google Scholar
Meyer, F. & Beucher, S. Morphological segmentation. J. Vis. Commun. Image Represent. 1, 21–46. https://doi.org/10.1016/1047-3203(90)90014-M (1990).
Article Google Scholar
Papers with code-semantic segmentation benchmarks. https://paperswithcode.com/task/semantic-segmentation. Accessed 16 Sept 2021 (2020).
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article ADS CAS Google Scholar
Li, Y. et al. Deep learning for lidar point clouds in autonomous driving: A review. In IEEE Transactions on Neural Networks and Learning Systems (2020).
Qi, C. R., Su, H., Mo, K. & Guibas, L. J. Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, 652–660 (2017).
Milletari, F., Navab, N. & Ahmadi, S.-A. V-net: Fully convolutional neural networks for volumetric medical image segmentation. In 2016 Fourth International Conference on 3D Vision (3DV), 565–571 (2016).
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention, 234–241 (2015).
Andrew, M. A quantified study of segmentation techniques on synthetic geological xrm and fib-sem images. Comput. Geosci. 22, 1503 (2018).
Article MathSciNet Google Scholar
Arganda-Carreras, I. et al. Trainable weka segmentation: A machine learning tool for microscopy pixel classification. Bioinformatics 33, 2424 (2017).
Article CAS Google Scholar
Varfolomeev, I., Yakimchuk, I. & Safonov, I. An application of deep neural networks for segmentation of microtomographic images of rock samples. Computers 8, 72 (2019).
Article Google Scholar
Karimpouli, S. & Tahmasebi, P. Segmentation of digital rock images using deep convolutional autoencoder networks. Comput. Geosci. 126, 142–150 (2019).
Article ADS Google Scholar
Da Wang, Y., Shabaninejad, M., Armstrong, R. T. & Mostaghimi, P. Physical accuracy of deep neural networks for 2d and 3d multi-mineral segmentation of rock micro-ct images. Preprint at http://arxiv.org/abs/2002.05322 (2020).
Badrinarayanan, V., Kendall, A. & Cipolla, R. A deep convolutional encoder-decoder architecture for image segmentation. In IEEE, Segnet (2015).
Mehmani, A., Tokan-Lawal, A., Prodanović, M. & Sheppard, A. The effect of microporosity on transport properties in tight reservoirs. In Society of Petroleum Engineers—SPE Americas Unconventional Gas Conference 2011, UGC 2011. https://doi.org/10.2118/144384-MS (2011).
Bultreys, T., Hoorebeke, L. & Cnudde, V. Simulating secondary waterflooding in heterogeneous rocks with variable wettability using an image-based, multiscale pore network modeling. Water Resour. Res. 52, 6833–6850 (2016).
Article ADS Google Scholar
Ruspini, L., Lindkvist, G., Bakke, S., Carnerup, A. & Øren, P. A multi-scale imaging and modeling workflow for tight rocks. In SPE (2016).
Karchevskiy, M., Ashrapov, I. & Kozinkin, L. Automatic salt deposits segmentation: A deep learning approach. Preprint at http://arxiv.org/abs/1812.01429 (2018).
Kaggle. Tgs salt identification challenge (2018).
Hu, J., Shen, L., Albanie, S., Sun, G. & Wu, E. Squeeze-and-excitation networks. IEEE Trans. Pattern Anal. Mach. Intell. https://doi.org/10.1109/TPAMI.2019.2913372 (2019).
Article PubMed Google Scholar
Deng, J. et al. Imagenet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, 248–255 (2009).
Roy, A. G., Navab, N. & Wachinger, C. Concurrent spatial and channel ‘squeeze & excitation ’ in fully convolutional networks. In International Conference on Medical Image Computing and Computer-Assisted Intervention, 421–429 (2018).
Hariharan, B., Arbeláez, P., Girshick, R. & Malik, J. Hypercolumns for object segmentation and fine-grained localization. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, 447–456 (2015).
Arns, C. et al. Petrophysical properties derived from X-ray CT images. APPEA J. 43, 577–586 (2003).
Article CAS Google Scholar
Golab, A. et al. 3d porosity and mineralogy characterization in tight gas sandstones. Lead. Edge 29, 1476 (2010).
Article Google Scholar
Mascini, A., Cnudde, V. & Bultreys, T. Event-based contact angle measurements inside porous media using time-resolved micro-computed tomography. J. Colloid Interface Sci. 572, 354 (2020).
Article ADS CAS Google Scholar
Digital rock portal. https://www.digitalrocksportal.org. Accessed 16 Sept 2021 (2019).
Berg, S., Saxena, N., Shaik, M. & Pradhan, C. Generation of ground truth images to validate micro-ct image-processing pipelines. Lead. Edge 37, 412–420. https://doi.org/10.1190/tle37060412.1 (2018).
Article Google Scholar
van Aarle, W. et al. The astra toolbox: A platform for advanced algorithm development in electron tomography. Ultramicroscopy 157, 35–47. https://doi.org/10.1016/j.ultramic.2015.05.002 (2015).
Article CAS PubMed Google Scholar
Toriwaki, J. & Yonekura, T. Euler number and connectivity indexes of a three dimensional digital picture. FORMA-TOKYO 17, 183–209 (2002).
MathSciNet MATH Google Scholar
Frangakis, A. S. & Hegerl, R. Noise reduction in electron tomographic reconstructions using nonlinear anisotropic diffusion. J. Struct. Biol. 135, 239–250. https://doi.org/10.1006/jsbi.2001.4406 (2001).
Article CAS PubMed Google Scholar
Omer, A. A., Hassan, O. I., Ahmed, A. I. & Abdelrahman, A. Denoising ct images using median based filters: A review. ICCCEEE. https://doi.org/10.1109/ICCCEEE.2018.8515829 (2018).
Article Google Scholar
Liu, B. & Liu, J. Overview of image noise reduction based on non-local mean algorithm. MATEC. https://doi.org/10.1051/matecconf/201823203029 (2018).
Article Google Scholar
Ruspini, L. et al. Multiscale digital rock analysis for complex rocks. Transp. Porous Media. https://doi.org/10.1007/s11242-021-01667-2 (2021).
Article Google Scholar
Baveye, P. et al. Observer-dependent variability of the thresholding step in the quantitative analysis of soil images and X-ray microtomography data. GeoDerma 157, 51 (2010).
Article ADS Google Scholar
Shah, S., Gray, F., Crawshaw, J. & Boek, E. Micro-computed tomography pore-scale study of flow in porous media: Effect of voxel resolution. Adv. Water Resour. 95, 276–287 (2016).
Article ADS Google Scholar

Download references

Acknowledgements

This work was partially supported by the Norwegian Research Council (Grant Number 296093) and the members of the SmartRocks joint industry project (ENI AS, Repsol AS, and Chevron Corporation). We thanks S.M.Shah and E.S.Boek, for providing the data used to compare with the experimental results. We also thank the five anonymous users performing segmentations to compare with our model.

Author information

Authors and Affiliations

Department of Computer Science, NTNU, Trondheim, Norway
Johan Phan & Frank Lindseth
Petricore Norway, Trondheim, Norway
Johan Phan & Leonardo C. Ruspini
Codego AS, Trondheim, Norway
Leonardo C. Ruspini

Authors

Johan Phan
View author publications
You can also search for this author in PubMed Google Scholar
Leonardo C. Ruspini
View author publications
You can also search for this author in PubMed Google Scholar
Frank Lindseth
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.R. and J.P. designed the research. J.P. developed the network architecture and performed image processing. J.P. and L.R. wrote the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Johan Phan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Phan, J., Ruspini, L.C. & Lindseth, F. Automatic segmentation tool for 3D digital rocks by deep learning. Sci Rep 11, 19123 (2021). https://doi.org/10.1038/s41598-021-98697-z

Download citation

Received: 02 March 2021
Accepted: 07 September 2021
Published: 27 September 2021
DOI: https://doi.org/10.1038/s41598-021-98697-z

This article is cited by

Applicability of 2D algorithms for 3D characterization in digital rocks physics: an example of a machine learning-based super resolution image generation
- Sadegh Karimpouli
- Rail Kadyrov
- Erik Hans Saenger
Acta Geophysica (2023)
Workflow Development to Scale up Petrophysical Properties from Digital Rock Physics Scale to Laboratory Scale
- Marco Miarelli
- Augusto Della Torre
Transport in Porous Media (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.