Estimation of pore structure and permeability in tight carbonate reservoir based on machine learning (ML) algorithm using SEM images of Jaisalmer sub-basin, India

Yalamanchi, Pydiraju; Datta Gupta, Saurabh

doi:10.1038/s41598-024-51479-9

Download PDF

Article
Open access
Published: 09 January 2024

Estimation of pore structure and permeability in tight carbonate reservoir based on machine learning (ML) algorithm using SEM images of Jaisalmer sub-basin, India

Pydiraju Yalamanchi¹ &
Saurabh Datta Gupta¹

Scientific Reports volume 14, Article number: 930 (2024) Cite this article

712 Accesses
2 Altmetric
Metrics details

Subjects

Abstract

Analyzing the pore structure in carbonate reservoirs plays a crucial role in predicting fluid flow characteristics within these formations. The goal of the study was to use machine learning techniques for pore structure analysis and estimation of permeability in carbonate reservoirs. We implemented these algorithms by examining 2D scanning electron microscope (SEM) images of carbonate samples from the Jaisalmer sub-basin captured at various magnifications. In the initial stage of the analysis, various binarization algorithms were applied to determine carbonate sample porosity. Among these algorithms, the MaxEntropy algorithm gave a porosity value closely aligned with those obtained through petrography analysis. We employed the watershed algorithm to find the pore network parameters of carbonate samples at various magnifications. We observed that changes in magnification affected pore network parameters, resulting in a reduction in pore size distribution, throat radius, and grain size. Subsequently, we employed the numerical lattice Boltzmann method (LBM) to estimate the permeability of carbonate samples and compared to values derived from well logs. We employed machine learning (ML) algorithms, specifically Artificial Neural Network (ANN) and Support Vector Machine (SVM), to predict the permeability of carbonate samples. The input features for these models were the pore network parameters, while the LBM permeability values served as the output. We examine the prediction performance of these methods against the measured LBM permeability by conducting the error analysis and the coefficient of determination (${R}^{2}$) calculation. Our findings revealed that the ANN models outperformed the SVM models. Specifically, the ANN model displayed an impressive R² value of 0.892, along with root mean square error (RMSE), mean squared error (MSE) and, mean absolute error (MAE) values of 1.927, 3.716 and 1.580, respectively. In contrast, the SVM model yielded an R² value of 0.849, with RMSE, MSE and, MAE values of 2.324, 5.401 and, 2.166 respectively, when assessed on testing data of measured permeability. This study found that ANN is more dependable, robust, and precise than SVM in forecasting carbonate sample permeability.

A case study of petrophysical rock typing and permeability prediction using machine learning in a heterogenous carbonate reservoir in Iran

Article Open access 16 March 2022

Stacked ensemble machine learning for porosity and absolute permeability prediction of carbonate rock plugs

Article Open access 17 June 2023

Utilizing machine learning for flow zone indicators prediction and hydraulic flow unit classification

Article Open access 20 February 2024

Introduction

Pore structure and permeability are crucial in the study of geoscience and petroleum engineering for oil & gas exploration. Pore structure and permeability play a crucial role in simulating fluid flow within the heterogeneous geometry of carbonate porous materials^1,2,3,4,5. To investigate the single and multiphase fluid flow, pore network modeling and its characterization are crucial⁶. Permeability, which describes the flow of fluids through porous media, is one of the most important properties. Pore structure parameters, including porosity, tortuosity, connectivity, pore size, as well as pore shape and aspect ratio^7,8 significantly influenced the permeability of porous media^7,8. Several direct experimental approaches have developed to analyse the pore structure character and permeability of the porous medium. These approaches include mercury injection porosimetry (MIP), nuclear magnetic resonance (NMR), core analysis method developed by Gas Research Institute (GRI), and pulse decay^{9,10,11,12,13,14,15,16}. Despite being frequently employed, these direct experimental measurements have significant limitations in current laboratory operating conditions, such as the involvement of significant time, cost-effectiveness, and a few cores samples¹⁷. These limitations motivate us to develop a new efficient algorithm for analyzing the pore structure characterization and permeability in a porous medium, which can produce reliable results within a short time and be cost effective.

In recent times, researchers widely accept imaging techniques such as X-ray micro-computed tomography and scanning electron microscopy (SEM) for imaging porous media. Incorporating numerical modelling approaches enhances the robustness of this technique to get proper pore structure, permeability, and fluid flow through porous media^1,18,19,20. In between these methodologies, image processing plays a crucial role in characterizing the pore structure of porous media by extracting valuable quantitative information from microscopic images. This enables the precise evaluation of pore size, shape, distribution, and connectivity. This data is vital for understanding the physical properties of porous media, including porosity and permeability⁵. The most well-known direct simulation processes include the finite volume method (FVM)^21,22, finite element methods (FEM)^23,24,25,26, and the lattice Boltzmann method (LBM)^{27,28,29,30,31,32} are commonly employed for modelling porous media and simulating fluid transport process. Indirect strategies like Pore Network Modelling (PNM)^6,33,34,35 and Bundle-of-Tubes³⁶ offer effective means to simulate fluid flow behavior in porous media. These combined approaches contribute to a comprehensive understanding of porous media properties and fluid dynamics. The lattice Boltzmann method (LBM) has developed into the most capable and widely used numerical modelling approach for estimating the permeability and tortuosity of porous media³⁷. It was started by Frisch et al.³⁸ under the name of lattice–gas automata in 1986. The Lattice Boltzmann Method (LBM) is based on the mesoscopic physics of the Lattice Gas Cell Automata (LGCA)³⁹. One can also derive the lattice Boltzmann method (LBM) directly from the discretized Boltzmann equation. The lattice Boltzmann method can predict an image-based throat permeability model with more accuracy. It uses thin sections of 2D SEM images or micro-computed tomography images are the input images²⁹. The LBM simulations are valuable for studying the heterogeneity of carbonate reservoirs. It can accurately model fluid flow in complex, porous media with varying rock properties and enables the exploration of how small-scale variations in the carbonate rock matrix affect fluid flow, including the impact of different pore geometries and connectivity. This information is crucial for optimizing oil and gas recovery strategies, understanding reservoir performance, and predicting the behavior of fluids in heterogeneous carbonate reservoirs. These numerical approaches always require rigorous preparations for the discretization of porous media. To ensure the accuracy of the discretization result, high-quality digital images of porous media are always required. However, getting such images more costs are involved, and it is also time-consuming for geoscience and engineering applications. As a result, standard numerical simulation methods have significant limitations in accurately estimating the pore structure parameters of porous media. Another reason is that carbonate reservoirs are inherently diverse, making it challenging to predict the accurate permeability model using this numerical simulation and empirical methods⁴⁰.

Because of the complexities involved in predicting permeability models for carbonate rocks, including the lack of high-quality images and heterogeneous carbonate samples, some researchers have introduced machine learning (ML) based algorithms. These algorithms provide accurate and reliable approaches to predicting the properties of porous media^41,42,43. There are various ML algorithms focused on the reconstruction of porous media and estimating the pore network parameters, including porosity, permeability, tortuosity, throat radius, pore and grain size, etc. of the porous medium⁴⁴. These algorithms particularly use X-ray micro-computed tomography and 2D SEM images, employing techniques such as Least Square Support Vector Machine (LSSVM), Fuzzy logic, K-means clustering, artificial neural network (ANN), genetic algorithm (GA) and conventional neural network (CNN)^45,46,47,48.

Lu et al.⁴⁹ developed a precise permeability prediction model tailored to distinct pore structure types of Cretaceous carbonate reservoir. The spectral coefficient method effectively discriminates between connected and unconnected pores. The alignment of fractal dimensions with pore structure characteristics serves as a validation of this pore structure classification. Cheng et al.⁵⁰ employed a multiparameter equation derived via multiple regression analysis, comprehensively assessing the influence of pore-throat parameters at different scales on tight sandstone reservoir properties. It accurately predicts permeability and porosity, offering valuable insights for studying pore structure and permeability in tight sandstones. Adegbite et al.⁵¹ used the multiple linear regression analysis and Artificial Neural Network (ANN) to investigate the relationship between porosity, pore radius, throat radius, and permeability. They compared these findings to experimental results got at different levels of mercury saturation. The results revealed that the multiple linear regression technique exhibited the strongest correlation at 35% mercury saturation, whereas the ANN demonstrated a better correlation at 55% mercury saturation. These results highlight the superior performance of the ANN over multiple regression in permeability prediction. The utilization of the Fuzzy Logic method with wireline well log data from carbonate reservoirs in the Middle East yields the most accurate permeability model. These predicted models exhibit an exceptional agreement with core permeability. Rostami et al.⁵² employed various ML algorithms, including radial basis function neural network (RBF-ANN), least square support vector machine (LSSVM), multilayer perception neural network (MLP-ANN), genetic programming (GP) and committee machine intelligent system (CMIS), to achieve a precise estimation of permeability in a heterogenous carbonate reservoir. GP and CMIS models provided the most accurate predictions, showing the highest determination coefficient when the researchers compared the results with core permeability. Zhang et al.⁵³ proposed the conventional neural network (CNN) approach based on the autoencoder (AE) effectively predicts permeability using low-resolution images of porous media where an autoencoder (AE) module trained with unlabelled data and CNN trained with a small amount of labeled data. The results show that this AE-CNN approach outperforms traditional CNN and lattice Boltzmann method (LBM) approaches, with an average R² value of 0.896 and low mean-square errors, showing substantial improvements in prediction accuracy from low-resolution porous media images. Tran et al.⁵⁴ used both ANN and multiple regression to investigate the indirect correlation between pore throat radius, permeability, and porosity of carbonate samples. Compared to multiple regression, ANN exhibited superior performance with a higher correlation factor in predicting permeability. The permeability prediction through numerical viscous flow simulation closely aligns with measured values when using 2D SEM images of porous media⁴. Gohari et al.⁵⁵ successfully extracted pore-network parameters from 2D images of carbonate samples and accurately predicted the true permeability by using the ANN technique. Their finding showed promising results. Predicting 2D permeability from thin section images taken perpendicular to the plane and establishing statistical correlations with the computed 3D permeability of the host volumes resulted in accurate predictions⁵⁶. In this study, we quantified the pore structure and permeability prediction by utilizing scanning electron microscopy (SEM) images taken at various magnifications (× 100, × 150, × 200, and × 300) of carbonate samples from the Kuldhar, Joyan, and Badabag members of Jaisalmer formation. The Jaisalmer limestone formation shows promise for hydrocarbon exploration. However, a lack of comprehensive data and geological complexities limits the comprehensive analysis of this algorithm in the Jaisalmer sub-basin, introducing uncertainties in the study of petrophysical properties of reservoir⁵⁷. We quantified pore structure by estimating key pore network parameters including, porosity, pore size, throat radius, coordination number, and grain size. While the aspect ratio, especially valuable for distinguishing between elongated and spherical pores, prominently applies in scenarios where pore shape is a pivotal factor, the aforementioned parameters play a fundamental role in influencing fluid transport characteristics and permeability in porous media⁵⁸.

The primary aim of this study is to predict the permeability of carbonate samples by using machine learning (ML) algorithms. Many studies have focused on predicting the permeability of 2D SEM images of carbonate samples using machine learning algorithms. Previous investigations have typically compared the predictive models against laboratory-based studies. In our study, we employed both numerical simulation and machine learning (ML) algorithms to estimate the permeability of carbonate samples permeability based on 2D SEM images at various magnifications. We examined the influence of different pore network parameters, such as pore network parameters, such as porosity, average pore radius, average throat radius, average grain size, and average coordination number, in different magnifications. Unlike previous studies that often-compared ML models to laboratory-based studies, we compared ML predictive models with numerical simulation methods. We employed Artificial Neural Network (ANN) and Support Vector Machine (SVM) techniques, using pore network parameters as inputs and lattice Boltzmann method (LBM) simulation results as targets to predict permeability. Our ML models yielded reliable permeability results for carbonate samples. Our study innovated by using the ANN technique, which generated precise and dependable permeability models for carbonate samples, regardless of the availability of core permeability values. The current research customized these methods to address geological and data challenges specific to the oil and gas industry, particularly in Indian carbonate reservoirs, enhancing their relevance and applicability in this context. It outlined a comprehensive workflow in the following section of the manuscript.

A notable innovation in our study is the utilization of the ANN technique, which provided accurate and reliable permeability models for carbonate samples, even where core permeability values were unavailable.

Geology of the study area

The Jaisalmer basin, in the west part of the Aravalli ranges, stands the largest sub basin within the Rajasthan basin, covering an approximate area of 50,000 square kilometres. It’s divided into the northwest and the Barmer basin in the south by faults from the Bikaner-Nagaur basin. The basin exhibits pericratonic characteristics and comprises three depressions known as Shahgarh, Kishangarh and Miajlar depressions⁵⁹. Three major unconformities are present in this basin. These unconformities delineate different stratigraphic sequences in this basin, such as Proterozoic–Early Cambrian, Paleozoic–Mesozoic, Tertiary, and Quaternary periods^60,61,62. Many studies conducted in the Jaisalmer basin which shows the potentiality of hydrocarbon exploration in this basin^61,63,64. The Jaisalmer basin comprises various ranges geological formations identified based on lithostratigraphy, spanning from the Eocene to Jurassic. These formations, namely Bandah, Goru, Habur, Pariwar, Baisakhi Jaisalmer, Lathi, Sanu and Khuiala are further classified into various members⁶⁵. The underlying basement of this basin comprises rocks from the Precambrian era, which predominantly comprise igneous and metamorphic rocks.

Permian rocks are present in the Jaisalmer basin, which is a late Paleozoic–Mesozoic basin with an unconformable relationship with the Proterozoic basement. The basement of the basin primarily comprises the Malani suite, which is composed of Precambrian rocks, including metamorphic rocks. Jaisalmer basin consists thick sequence of sedimentary rocks such as clastic and carbonate formations. The Mesozoic rocks are well exposed in this basin, predominantly composed of limestone, shale, sandstone, and siltstone. The basement primarily comprises Pre-Cambrian rocks, notably the igneous and metamorphic rocks⁶². Several members, namely Hamira, Joyan, Fort, Badabag, Kuldhar, and Jajiya, categorize the Jaisalmer formation lithostratigraphically, which is well-known for its abundance of fossils. We can observe the outcrops of these members around the Jaisalmer city^66,67. Figure 1 shows the different lithostratigraphic members of Jaisalmer formation. In this study, we used 2D SEM images of various carbonate samples of Jaisalmer formation to extract the pore network parameters and permeability prediction. The samples considered for analysis include those from the Hamira, Badabag and Kuldhar members. The lithostratigraphy of these members are illustrated in Table 1. Oolitic, cross-bedded limestone and sandstone are featured in the Jajiya formation. The Kuldhar formation comprises limestones, marls, and greenish shales. The Badabag formation comprises ferruginous sandstones and sandy limestones dating from the Middle to Late Bathonian period. The Joyan and Fort groups are primarily composed of sandstones and cross-bedded limestones. These formations contain corals that developed during the Early Bathonian and Bajocian periods. Last, the Hamira formation comprises limestones, calcareous sandstones, spanning from the Bajocian to the Early Jurassic^67,68,69.

Table 1 Lithostratigraphy of the Jurassic strata of the Jaisalmer formation (modified after Sharma and Pandey⁶⁷).

Full size table

Materials and methods

Data preparation

This study is primarily based on carbonate samples of the Jaisalmer sub basin. We collected these samples from the Hamira, Badabag, and Kuldhar members of the Jaisalmer formation and conducted scanning electron microscope (SEM) tests on them. The SEM images were obtained at four different magnification levels: × 100, × 150, × 200, and × 300 using an acceleration voltage of 20 kV and a resolution of 647 × 486 pixels. 40 2D SEM carbonate samples from Jaisalmer formation were examined. To show the findings of this study, we present two representative carbonate samples, namely S-1 from the Kuldhar member and S-4 from the Badabag member. The selected samples are visible in different magnifications. Data from two wells, named A and B, were used to validate the simulation results. In this study, we used conventional well log data from these wells including gamma-ray (GR), resistivity (RT), density (RHOB), and neutron porosity (NPHI), along with estimated petrophysical logs such as porosity, permeability, and water saturation.

Method

Our study focuses on predicting the permeability of carbonate samples using machine learning (ML) algorithms. Unlike previous studies that primarily compared ML models with laboratory experiments, we introduce a novel approach by comparing ML models with numerical simulation methods, particularly tailored for Indian carbonate reservoirs. Our method involves several key steps as illustrated in Fig. 2 which outlines the sequential steps and process involved. First, we quantitatively estimate the pore network parameters of carbonate samples at various magnifications. These parameters, including porosity, average pore radius, average throat radius, average grain size, and average coordination number, are determined using image processing techniques. Next, we estimate permeability through lattice Boltzmann method (LBM) simulations and validate these results against permeability values derived from well logs. Subsequently, we employed machine learning (ML) algorithms, specifically Artificial Neural Network (ANN) and Support Vector Machine (SVM) techniques, to predict the permeability of carbonate samples. The pore network parameters serve as inputs for these ML models, while the LBM simulation results serve as the target output. To ensure robust model performance, we employed the grid search algorithm and K-fold cross-validation while executing machine learning algorithms. We began by identifying the hyperparameters relevant to the ML models employed in our study. To systematically explore the hyperparameters we used a grid search approach. We defined a set of values for each hyperparameter that we wanted to optimize. During the modeling stage, we adopted the K-fold cross-validation technique to evaluate the performance of different hyperparameter combinations. We employed a K-value of 10 to ensure robustness in our results. We determined the optimal hyperparameters by evaluating the best performance achieved across all folds and selected them as the hyperparameters for our last model. Then we performed feature importance analysis for both ANN and SVM models. While these models do not inherently offer feature importance like decision tree algorithms, we employed distinct methods to access their feature relevance. For ANN, we used permutation importance and, for SVM, feature importance was determined by the magnitude of the feature coefficients. Finally, we identify the most effective ML model by evaluating the coefficient of determination and error matrices. Notably, our study innovatively uses the ANN technique, providing accurate and reliable permeability predictions.

Image processing

Image processing is necessary to analyze the characterization of the pore structure of 2D SEM images. It involves digital adjustments, conversion, and operations to extract valuable information, enhance visibility, and identify objects and patterns. Image processing techniques include image enhancement, noise reduction, and segmentation. Threshold segmentation plays a crucial role in image processing, as the quality of the image directly affects segmentation results. Hence, it is important to perform image enhancement and denoising prior to segmentation⁵.

We accomplish image enhancement by adjusting the grayscale histogram of the image and expanding the dynamic range of grayscale values. The grayscale histogram represents the distribution of grayscale values, with the horizontal axis showing different gray levels and the vertical axis displaying the number of pixels at each gray level. Adjusting the histogram increases the dynamic range of grayscale values and enhances the contrast in the image. In an 8-bit grayscale image, which has 256 intensities, the histogram comprises 256 values representing the pixel distribution. Image denoising is another important process that aims to remove unwanted noise from an image. One effective method is the application of a median filter. The median filter is effective in reducing impulse noise and eliminating salt-and-pepper noise, while preserving the edges of an image. It works by sorting all pixel values within a neighbourhood and replacing the pixel value with the median (middle) value, rather than the average of the surrounding pixel values. Figure 3a illustrates the calculation of the median value of pixel neighbourhood for sample S-3. Image segmentation is a technique that partitions a digital image into distinct subgroups based on specific characteristics, simplifying image complexity, and facilitating further analysis by isolating the desired target. In this study, we focused on segmenting the pores in carbonate samples using a threshold-based segmentation algorithm⁵. The algorithm converts images into a binary format by representing object pixels with a single gray level and background pixels at different levels. Specifically, object pixels are assigned as “black”, while the background is represented as “white”. To perform threshold segmentation, it is essential to determine the threshold value using the following formula, which maps gray-level values to the binary set {0,1}. Equation (1) depicts the segmentation threshold value⁵.

$$\begin{aligned} S\left( {x,y} \right) = & 0,\; if\; f\left( {x,y} \right) < T\left( {x,y} \right) \\ = & 1, if\; f\left( {x,y} \right) \ge T\left( {x,y} \right) . \\ \end{aligned}$$

(1)

S (x, y) represents the value of the generated binary image, f (x, y) denotes the gray level of the original image, and T (x, y) indicates the threshold value of the segmented image at the coordinates (x, y). Figure 3b illustrates the image processing steps we performed on sample S-3.

Threshold-based segmentation algorithms

The threshold-based segmentation algorithms play a pivotal role in image segmentation, serving as a fundamental technique in the field. However, existing algorithms have limitations in effectively handling noisy grayscale images⁵. Researchers have devoted considerable attention to addressing these challenges over the past four decades. Two classifications of threshold segmentation are local thresholding and global thresholding, which involve dividing an image based on specific thresholds. Local thresholding divides the image into smaller sections and determines the threshold value for each section. Global thresholding involves determining a single threshold for the entire image. These algorithms employ various techniques, including histograms, clustering, entropy, and fuzzy logic-based methods to achieve their segmentation goals. In this study, we employed several global threshold algorithms, namely MaxEntropy⁷⁰, Otsu⁷¹, Huang⁷², and Yen⁷³ for image segmentation. This allowed for a comprehensive evaluation of their effectiveness and performance in our specific context.

The MaxEntropy approach is rooted in the maximization of measured information between the object and background within an image, with entropy serving as the metric for information measurement. However, the computational complexity associated with the MaxEntropy algorithm prompted the introduction of the maximum correlation criterion (MCC) by Yen⁷³. This criterion, known as the Yen algorithm, facilitates the calculation of an optimal image threshold. The Huang algorithm proposed by Huang and Wang⁷², aims to reduce the fuzziness measures of an input image in order to determine the optimal threshold value. The concept of fuzziness in this context typically refers to the degree of fuzziness exhibited by a fuzzy set. Fuzziness is quantified using entropy based on the Shannon function from information theory. The Otsu algorithm uses the maximum inter-class variance between background and target images. By maximizing the separability of resulting classes at different gray levels, the Otsu algorithm enables effective threshold determination⁷¹.

Determination of pore network parameters

To comprehend the flow characteristics of porous media, including important factors such as permeability, relative permeability, and fluid flow phenomena, it is crucial to understand the pore structure parameters that occur when the porous material is subjected to pressure differentials. Disparities in pore structure, such as spatial distribution, connectivity types, and pore and throat shapes and sizes, can lead to distinct fluid flow behaviours even when two materials have the same porosity. Therefore, accurate characterization of pore structure is essential, particularly in domains like petroleum and reservoir engineering, where pore network configuration and fluid flow dynamics significantly impact hydrocarbon storage capacity. In this study, we estimated various pore network parameters at different magnifications and thoroughly examined their influence on permeability prediction. In order to determine the 2D pore network parameters of carbonate samples, we employed the watershed algorithm⁷⁴, initially introduced by Baldwin et al.⁷⁵. Implementing this method used MATLAB’s educational image processing tool and the open-source ImageJ software^76,77,78,79. In the watershed algorithm initially binarize the image as solids and voids, then obtain a distance map by calculating the minimum distance from the void to the nearest solid. Using this distance map, we segmented pores based on the concept of water flooding. In the calculated distance map, we considered the brightest colour indicating the lowest points. When simulating water flooding, water accumulates first at the lowest points within each water pool and gradually rises. When water from the different pools meets, the meeting point becomes the boundary between these pools. In this analysis, as all regions are filled with water, a series of meeting points forms a boundary line between the pools. This line is considered as the meeting line between the two pores, and as it coincides with the construction between two pores, it serves as the throat. This distinction allows us to analyze the pore-throat network effectively using 2D images⁸⁰. Figure 4 illustrates the various stages of the watershed segmentation process.

The pore size distribution (PSD) and throat radius are influential parameters in the flow dynamics of porous media. To determine pore-throat sizes in 2D or 3D digital images, a promising approach uses the city-block distance function and watershed segmentation^80,81. The method applies the morphological majority transform function to a binary image, reducing the roughness and noise for accurate PSD determination. The computed city-block distance produces sharp contour lines that delineate pore boundaries, with the brightest lines representing the lowest points. Watershed segmentation then connects marked points to generate ridgelines, enabling identification of the pore area above a specific radius. The contour line where two pores meet shows the throat, representing the constricted region between them.

In a pore network model, the 2D coordination number represents the average number of throats connected to each pore or the number of pore bodies associated with a single pore³. This parameter plays a crucial role in characterizing the pore network and has a substantial influence on the hydraulic conductivity of porous rocks. The average 2D coordination number is determined by calculating the mean number of throats connected to a specific pore. We employ the watershed segmentation algorithm to identify the throats associated with each pore, enabling the computation of this important network property.

To determining the grain size distribution (GSD) from digital images involves the detection of overlapping grains and individual size calculation. In this study, we used a watershed algorithm applied to 2D binary images to determine the GSD. The process involved several steps. The first step was to calculate the Euclidean distance between each pixel and its nearest pore pixels. The H-minima transform prevents over-segmentation by the watershed algorithm. This transform helped identify the ridge line between each pair of local minima in the distance image. Finally, GSD was determined along the principal axes of the images. For the 2D images, the GSD was measured along the vertical axes⁸².

Permeability estimation based on LBM simulation

The goal of this work is to develop a machine learning (ML) based permeability prediction models for 2D SEM images of carbonate samples. However, when core permeability data is unavailable for training the ML models, traditional numerical methods are still necessary. lattice Boltzmann method (LBM) is the most reliable tool for permeability calculation among other traditional numerical methods⁸³. It offers an alternative approach for simulating incompressible fluid flow. In our study, we assume a throat with a random cross-section and a consistent shape along its length to represent single-phase steady-state flow in porous media. Therefore, the permeability of the throat can be determined without simulating the entire length. Equation (2) expresses the lattice Boltzmann equation, which is connected to the Boltzmann equation⁸⁴.

$${f}_{i }\left(x+ {c}_{i}\Delta t, t+\Delta t\right)- {f}_{i}\left(x, t\right) = {\delta }_{i}.$$

(2)

Here, the particle distribution function is ${f}_{i}$, the particle velocity is ${c}_{i}$ in the ith direction, and the collision operator is ${\delta }_{i}$.

In this study, it was expected that the particle distribution would interact with fluid bounce-back boundary conditions. For each time step ($t+\Delta t$) and spatial location $x$, the evolution of the discretized particle distribution function ${f}_{i}$ in accordance with the particle distribution velocities ${c}_{i}$ was determined using the Bhatnagar, Gross, and Krook (BGK) collision model⁸⁵. Equation (3) illustrates this model.

$${f}_{i }\left(x+ {c}_{i}\Delta t, t+\Delta t\right)- {f}_{i}\left(x, t\right)= -\frac{1}{\tau }\left[{f}_{i}\left(x, t\right)- {f}_{i}^{eq}\left(x, t\right)\right].$$

(3)

In this context, $\tau$ represents the dimensionless relaxation time, $\Delta t$ denotes the time step, and the right-hand side term corresponds to the SRT-Bhatnagar, Gross, and Krook (BGK) collision model⁸⁵. The equilibrium distribution function is ${f}_{i}^{eq}\left(x, t\right).$ Equation (4) shows the formulation of the distribution function at equilibrium as a function of velocity (ϑ) in all directions.

$${f}_{i}^{eq}\left(x, t\right)= {\omega }_{i}\rho + \rho {s}_{i}\left(\vartheta \left(x, t\right)\right).$$

(4)

The macroscopic quantities of density $\rho$ and velocity $\rho (\vartheta )$ are calculated using the particle distribution function ${f}_{i}$ as shown in Eqs. (5) and (6).

$$\rho = {\sum }_{i}{f}_{i},$$

(5)

$$\rho \left(\vartheta \right)= {\sum }_{i}{c}_{i}{f}_{i},$$

(6)

and ${s}_{i}(\vartheta )$ is defined in Eq. (7).

$${s}_{i}\left(\vartheta \right)= {\omega }_{i}\left[3\frac{{c}_{i}.\vartheta }{c}+ \frac{9}{2} \frac{{\left({c}_{i}.\vartheta \right)}^{2}}{{c}^{2}}- \frac{3}{2} \frac{\vartheta .\vartheta }{{c}^{2}}\right],$$

(7)

where $\vartheta$ represents the velocity vector, ${\omega }_{i}$ denotes the weight associated with velocity ${c}_{i,}$ and $c$ represents the sound speed.

In this study, we used open-source LBM MATLAB programming to estimate the throat permeability of the porous medium, following the method proposed by Haslam et al.⁸⁶. The study employs a D2Q9 model comprising nine discrete velocity vectors, representing potential fluid flow paths, as shown in Fig. 5. This approach applies to 2D digital images got from scanning electron microscopy (SEM) to determine the throat permeability.

The D2Q9 model has the nine velocity vectors ${c}_{i}$

$${c}_{i}= \left\{\begin{array}{l}\left(0, 0\right)\quad i=0 \\ \left(1, 0\right), \left(0, 1\right), \left(-1, 0\right), \left(0,-1\right) \quad i=1, 2, 3, 4\\ \left(1, 1\right), \left(-1, 1\right), \left(-1, -1\right), \left(1,-1\right) \quad i=5, 6, 7, 8\end{array}.\right.$$

For stationary, nearest and next-nearest vectors, the weight coefficients are ${\omega }_{0}=\frac{4}{9}, {\omega }_{1}= ----={\omega }_{4}=\frac{1}{9}, {\omega }_{5}= ----= {\omega }_{8}=\frac{1}{36}$ respectively.

In this flow simulation, the lattice BGK model applies to solve the steady-state planar Poiseuille equation, which describes the pressure drop in an incompressible fluid flowing through a cylindrical pipe with a constant cross-section under laminar flow conditions. Flow simulation is started by imposing a uniform distribution of vectors at the inlet channels of the geometry. The LBM simulation considers periodic boundary conditions in the flow directions, with the velocity distribution in the outlet channels, set equal to that in the inlet channels. We calculate the LBM permeability of throats in the porous medium as $\left(k= \frac{{r}^{2}}{8}\right)$ for laminar flow, where r represents the throat radius. This formula is independent of tube length and is applicable due to the reasonable assumption of periodic boundary conditions for the considered geometry. The simulation continues until distribution vectors at each channel reach equilibrium, equilibrium showing permeability convergence⁸⁷.

At each iteration, we computed the permeability of throat tubes of the porous medium using Darcy’s law, rearranged as shown in Eq. (8).

$$k= -\frac{\mu U}{\left(\frac{dp}{dx}\right)},$$

(8)

where U is the mean velocity vector in the entire flow domain (toward pressure drop), $k$ is the throat permeability, $\frac{dp}{dx}$ is the pressure gradient, and $\mu$ is the fluid viscosity, Eq. (9) illustrates this calculation.

$$\mu = \frac{\frac{1}{\omega }-0.5}{3}.$$

(9)

Here $\omega$ is a relaxation frequency that is used in the LBM simulation, set to 1 to ensure convergence and minimize errors. In this study, throughout the simulation, the permeability data are recorded as numerical values representing the permeability of the throat tube after multiple iterations. This approach enables the computation of permeability and offers insights into the flow behavior within the porous medium⁸⁸.

Permeability determination from well log data

Several empirical methods have been proposed to determine permeability from well logs, which rely on establishing correlations between porosity, permeability, and irreducible water saturation^89,90,91. In this study, we employed the Timur⁹⁰ relationship to estimate permeability. In this study, we used data from two wells that were equipped with conventional logs, each providing measurements with a resolution of 0.125 m from top to bottom. The logs included Gamma Ray (GR), Resistivity (RT), Density RHOB), and Neutron Porosity (NPHI) logs. These logs were essential in our analysis. Specifically, we leveraged the porosity logs (Density and neutron porosity) to compute the porosity values. These porosity values played a crucial role in determining the irreducible water saturation. To achieve this, we applied a formula introduced by Buckles⁹² and later changed by Holmes et al.⁹³. This changed formula suggested that the product of porosity and irreducible water saturation in a formation remains constant. The following equations were used to illustrate the calculation of permeability based on the Timur⁹⁰ relationship. Equation (10) illustrates the irreducible water saturation calculation.

$${\mathrm{\varnothing }}^{Q}\times {S}_{wirr}=C,$$

(10)

where $\mathrm{\varnothing }$ represents the porosity, expressed as fractions. $Q$ is the porosity exponent, a dimensionless value that can range from 0.8 to 1.3, according to Holmes. In many reservoirs $Q=1$, which corresponds to the original Buckles formula. ${S}_{wirr}$ denotes the irreducible water saturation, also expressed as a fraction. $C$ represents the Buckles constant, a dimensionless value (sandstones = 0.02–0.10, Inter-granular carbonates = 0.01–0.06, Vuggy carbonates = 0.005–0.06). The permeability was estimated using Eq. (11).

$$K=0.136\frac{{\mathrm{\varnothing }}^{4.4}}{{S}_{wirr}^{2}} \left({\text{Timur}}, 1968\right),$$

(11)

where $K$ represents the permeability of formation, expressed as milli Darcy (md), S_wirr represents irreducible water saturation and ∅ represents the porosity.

Machine learning algorithms

Artificial Neural Network (ANN)

Artificial Neural Networks (ANNs) are a class of machine learning models that draw inspiration from the structure and functioning of the human brain. They aim to replicate the processing and interpretation of information observed in biological neurons. Various fields have successfully applied ANNs, including image and speech recognition, natural language processing, and decision-making systems. ANNs gained significant popularity due to their ability to draw inspiration from the structure and functioning of the human brain⁹⁴. An ANN comprises interconnected artificial neurons, referred to as nodes or units. These nodes are organized into layers, typically comprising an input layer, one or more hidden layers, and an output layer (see Fig. 6). Each node receives input signals, performs a computation, and generates an output signal that is then transmitted to nodes in the subsequent layer. The connections between nodes are represented by weights, which determine the significance of input signals in the overall computation. During the training phase, ANN learns to adjust these weights by iteratively processing the training data and comparing the predicted outputs with the desired outputs. This process, known as backpropagation, uses optimization algorithms to minimize the discrepancy between predicted and actual outputs, improving the network’s performance⁹⁵. Determining the number of hidden layers is a key challenge in the application of neural network methods, and various approaches have been proposed to address this issue. Ham and Kostanic⁹⁶ suggested employing a trial-and-error method to determine the optimal number of hidden layers. Several studies have shown that one or two hidden layers can effectively handle complex problems^97,98.

In this study, we used the average pore network parameters as input variables to train an artificial neural network (ANN) model, which comprises a single hidden layer. This configuration enables the prediction of permeability based on the provided parameters. The input parameters comprise porosity, average pore size distribution, average throat size, average grain size, and average 2D coordination number. A stepwise approach was used to determine the optimal number of neurons in the hidden layer. Using a stepwise approach, we determined the optimal number of neurons in the hidden layer by measuring the error at each step while varying the number of neurons from 1 to 20. The ANN is trained using the Levenberg–Marquardt method with the MATLAB neural fitting tool. The trainer conducts the training process iteratively to achieve the best possible results.

Support vector machine (SVM)

Support vector machine (SVM) algorithm, introduced by Vapnik⁹⁹ in 1995, is a widely used machine learning technique for both classification and regression tasks. The name given to the application of SVM for regression is Support Vector Regression (SVR). SVR follows the same principle as support vector classification, aiming to find a mapping function that relates the input features to the target variable. SVM has been successfully applied in various prediction problems across different domains¹⁰⁰. However, unlike traditional regression models, SVR can capture nonlinear relationships between the features and the target variable by utilizing a Kernel function. The kernel function plays a crucial role in SVR by transforming the input features into a higher-dimensional space. By transforming the problem from nonlinear to linear, the optimal solution can be found¹⁰¹. The choice of kernel depends on the dataset and the complexity of the underlying relationship.

Let’s consider a training dataset denoted as $T=\left\{\left({x}_{1},{y}_{1}\right)\dots \left({x}_{n},{y}_{n}\right)\right\}$, where ${x}_{i}$ is the input vector and ${y}_{i}$ is the output vector. The SVM regression problem can be mathematically represented as shown in Eq. (12)¹⁰¹.

$$\begin{aligned} f\left( x \right) = & \mathop \sum \limits_{i = 1}^{n} \left( { \propto_{i} - \propto_{i}^{*} } \right) \cdot \phi_{{(x_{i} )}} \cdot \phi_{\left( x \right)} + b \\ = & \mathop \sum \limits_{i = 1}^{n} \left( { \propto_{i} - \propto_{i}^{*} } \right) \cdot k(x_{i} ,x) + b, \\ \end{aligned}$$

(12)

where ${\propto }_{i}$ and ${\propto }_{i}^{*}$ are the Lagrange multipliers associated with the corresponding input parameters, $k{(x}_{i},x)$ represents the kernel function, ${\varnothing }_{{(x}_{i})}$ and ${\varnothing }_{\left(x\right)}$ represent the transformed feature vectors in the higher dimensional space, and $b$ is the bias term. A typical architecture of SVM regression is depicted in the Fig. 7.

In this study, SVM regression analysis was conducted using various kernels, including linear, cubic, quadratic, fine Gaussian, medium Gaussian, and coarse Gaussian. The root mean square error (RMSE) and mean absolute error (MAE) values were determined for each SVM model with different kernels, and the kernel yielding the lowest RMSE and MAE values were selected as the most appropriate choice. We obtained RMSE and MAE values for each SVM model with different kernels and presented them in Table 2. After careful evaluation, we identified the linear kernel function for permeability prediction in this study. To optimize the SVM model for determining permeability, we employed a grid search approach to determine the most effective tuning parameters. Specifically, we selected the regularization parameter (c) of 10 and kernel-specific parameter gamma (γ) of 0.1 while utilizing the linear kernel function.

Table 2 Comparison of RMSE and MAE error values of different kernel functions of SVM method.

Full size table

Relative influences of the input variables on LBM permeability

Spearman’s rank correlation was used to analyse pore network parameter sensitivity to permeability estimation. Using Eq. (13)¹⁰², we calculated the Spearman’s correlation coefficient (ρ) within the range of − 1 to + 1 to determine the strength and direction of monotonic relationships between input and output variables in our model.

$$\rho = \frac{\sum_{i=1}^{n}\left({T}_{i}-\overline{T }\right)({Q}_{i}-\overline{Q })}{\sqrt{\sum_{i=1}^{n}{({T}_{i}-\overline{T })}^{2} \sum_{i=1}^{n}{({Q}_{i}-\overline{Q })}^{2}}}.$$

(13)

Figure 8 illustrates the impact of input variables on the output variable. Notably, grain size distribution (GSD) exhibited the most influence on LBM permeability, signifying its paramount importance. Conversely, pore size distribution (PSD) demonstrated the least influence, porosity and throat radius (TR) also displayed a significant impact on permeability, underscoring their relevance in the analysis.

Model evaluation metrics

Model evaluation metrics in machine learning are used to assess the performance and quality of the trained model. This metrics provide quantitative measurements that help in understanding how well the model is performing and how accurately it is estimating predictions to meet the desired objectives. In this study, the most common evaluation metrics are used, include the coefficient of determination (${R}^{2}$), Root Mean Square Error (RMSE), Mean Square Error (MSE), and Mean Absolute Error (MAE). Equations (14)–(17) are used to calculate using these parameters.

$${R}^{2}=1- \frac{\sum_{i=1}^{n}{\left({X}_{i}-{Y}_{i}\right)}^{2}}{\sum_{i=1}^{n}{\left(\overline{Y }-{Y}_{i}\right)}^{2}},$$

(14)

$$\mathrm{MSE }= \frac{1}{n}\sum_{i=1}^{n}{({X}_{i}-{Y}_{i})}^{2},$$

(15)

$$\mathrm{RMSE }=\sqrt{\frac{1}{n}\sum_{i=1}^{n}{({X}_{i}-{Y}_{i})}^{2}},$$

(16)

$$\mathrm{MAE }= \frac{1}{n}\sum_{i=1}^{n}\left|{X}_{i}-{Y}_{i}\right|,$$

(17)

where ${X}_{i}$ and ${Y}_{i}$ are the predicted and measured values respectively, $\overline{Y }$ is represents the mean of the actual values and $n$ is the number of samples. According to previous studies, high coefficient of determination and minimal error values are showing high efficiency models.

Results and discussion

Segmentation of carbonate samples

The original 2D scanning electron microscopy (SEM) images of carbonate samples, referred to as S-1 from the Kuldhar member, S-3 from the Hamira member, and S-4 from the Badabag member are shown in Fig. 9 at various magnifications. Increasing the magnification reveals macroscopic cracks within the samples that have a notable impact on carbonate sample permeability. Conversely, lower magnification provides a wider field of view, showing a broader range of pore visibility. To examine the pore characteristics of the carbonate samples, we used all four available threshold algorithms for the grayscale to binary image conversion process to determine the most suitable algorithm. We conducted a gray scale-to-binary image conversion using four threshold algorithms (Huang, Otsu, MaxEntropy, and Yen) to determine the most suitable algorithm. Figure 10 shows the results of binary images, where black and white areas represent pores and the matrix of the carbonate samples, respectively. The MaxEntropy threshold algorithm shows relatively low pore occupancy compared to other threshold algorithms with pores occupying only a small percentage of the images.

Computing the porosity involved analyzing the binary images, counting the pore pixels, and dividing them by the total number of pixels. Figure 11 shows the porosity histograms of carbonate samples S-1 and S-4 at various magnifications (× 100, × 150, × 200, and × 300) got by calculating the grayscale threshold value by four different segmentation algorithms. We observed significant variations in the porosity values among these algorithms. The Otsu and Yen algorithms yielded porosity values ranging from 0.10 to 0.18 for S-1 and 0.14 to 0.19 for S-4. These values exceed the 10% porosity threshold, thus showing the inefficiency of these algorithms compared to petrographic studies. The Huang algorithm exhibited inconsistent behaviour in porosity calculation, resulting in varying outcomes across different samples and magnifications. At × 150 magnification, S-4 showed a maximum porosity of over 0.24, while S-1 had a minimum porosity of 0.12 at × 100 magnification. The Huang algorithm’s limited robustness can be the reason for these disparities.

In contrast, the MaxEntropy algorithm consistently produced porosity values below 0.12 for all samples and magnifications. Specifically, the values are ranging from 0.02 to 0.08 for S-1 and 0.07 to 0.12 for S-4. To validate the accuracy of the threshold segmentation algorithm, we compared the obtained porosity results with measured porosities from petrographic investigations⁶³ (see Table 3), which reported porosity ranges of 0.10 to 0.12 for S-1 and 0.08 to 0.10 for S-4. Overall, the threshold segmentation based on the MaxEntropy algorithm is more reasonable, as it generates porosities for carbonate samples that are closer to the findings of petrographic studies. We identified the MaxEntropy algorithm-generated binary images for further analysis in this study. Figure 12 depicts the binary images generated by the MaxEntropy algorithm, which correspond to the original SEM images of S-1 and S-4 shown in Fig. 9. These binary images were utilized to determine the various pore network parameters, including pore size distribution (PSD), throat radius, grain size distribution (GSD) and 2D coordination number and estimate LBM based permeability for computation of machine learning (ML) algorithms.

Table 3 Comparison of porosity calculated by MaxEntropy algorithm with petrographic studies.

Full size table

Determination of pore network parameters

We effectively examined pore characteristics at the micrometer level by applying the watershed algorithm mentioned in this section = to determine pore network parameters. Figure 13a,b display the pore size distribution variations in carbonate samples S-1 and S-4, observed at magnifications of × 100, × 150, × 200, and × 300, respectively. These plots depict normal distribution curves representing the frequencies of different pore sizes. It is observed that smaller pores are more significantly affected by magnification compared to larger ones. In this study, the smallest detectable pores for S-1 and S-4 have a pore radius of less than 0.48 µm, regardless of magnification. Across all carbonate samples, there is a higher proportion of small-sized pores and a lower proportion of larger pores. The percentage of pores decreases as pore size increases, with larger pores representing only a small fraction of the total. Overall, over 98% of the observed pores in all carbonate samples fall within the range of 0.44 to 20 µm. It is important to note that lower magnification provides a broader range of pore sizes, but some smaller pores may remain undetected. While higher magnification reveals only a portion of the smaller micro-pores, emphasizing the significance of magnification in image analysis. These observations hold value for studying pore structure characterization⁴.

The distribution of throat radius plays a crucial role in determining permeability and controlling fluid flow in carbonate reservoirs. Figure 13c,d illustrate the variation in throat radius for carbonate samples S-1 and S-4 at different magnifications. The peak throat radius shows a relatively narrow distribution, falling within the range of 0.9 to 2.04 µm, and it noticeably diminishes as magnification levels rise. There is an inverse relationship between throat radius and magnification, with an increase in magnification leading to a decrease in the number of throats. Sample S-4 exhibits a greater number of throats with a smaller radius compared to sample S-1. When the throat radius is less than 2 µm, the number of throats increases as magnification decreases, while the number of throats decreases with an increase in the throat radius. Only 2 to 4% of the total number of throats have a radius greater than 18 µm. The study reveals that it predominantly distributed the throat radius in carbonate samples within a range of less than 18 µm. The influence of magnification is more prominent for smaller throat radius but diminishes as the throat radius increases. For a larger throat radius, the curves overlap regardless of magnification.

We analyzed the grain size distribution of carbonate samples S-1 and S-4 by using 2D SEM images at various magnifications. Figure 14a,b depict the variation of grain size distribution at different magnification. The study revealed a limited range of grain sizes, predominantly between 0.8 and 10 µm. Initially, the number of grains increased with grain radius up to 2 µm, after which it decreased. The results showed that the peak radius of sample S-1 was greater than 2 µm at lower magnification but decreased as magnification increased. The impact of magnification was more noticeable for smaller grain sizes and became less significant as the grain radius increased. We observed that lower magnification did not reveal a higher number of grains compared to higher magnifications for the carbonate samples.

Figure 14c,d depict the frequency distribution of carbonate samples S-1 and S-4 at different magnifications, along with their corresponding computed 2D coordination numbers. As magnification increases, pore connectivity within both samples improves significantly. Higher magnification allows for the detection of even the smallest micro throats in the porous media, some less than 0.09 µm in size. As a result, the coordination number at different magnifications reflects varying levels of pore connectivity in the carbonate samples. Lower magnifications reveal larger throats, showing lower pore connectivity, while higher magnification enhances pore connectivity across all samples. The highest pore connectivity occurs within the coordination number range of 1 to 4, and pore connectivity decreases as the coordination number increases. Finally, we observed that the magnification relationship profoundly affects the behaviour of the carbonate reservoir by influencing the pore network parameters. These parameters impact the permeability of porous media. Higher magnification reveals the true intricacies of pore structure, influencing pore size, shape, and connectivity. A well-connected network of larger pores system shows results in the higher permeability.

Permeability calculation using LBM

We estimated the throat permeability of carbonate samples using the lattice Boltzmann method (LBM) simulation. The lattice Boltzmann method (LBM) simulation used a pressure differential assumption across the pore network and determined the pressure through the center of the pores. The fluid flow continuity equation is applied to each pore body to describe the steady-state process. This study only considered a single-phase, incompressible fluid. By applying a linear system of equations, we calculated the pressure of the pore bodies whereas the total flow rate was then computed using the determined pressure. Finally, Darcy’s law was used to calculate the total permeability of the pore network in each carbonate sample.

The permeability of carbonate samples was determined through Lattice Boltzmann Method (LBM) simulations, which achieved iterations convergence. Figures 15 and 16 show convergence curves at different magnifications for samples S-1 and S-4, respectively based on results of LBM simulation. Once the simulation reached equilibrium, the samples’ permeability remained constant even after conducting further iterations. We observed that the LBM permeability of both S-1 and S-4 increased with the number of iterations until a specific value was reached, beyond which it remained constant regardless of magnification. The calculated LBM permeability increased with magnification for most samples. The total permeability values of carbonate samples at various magnifications ranged from 0.92 to 21.42 millidarcies (md), as shown in Table 4. It was observed that the impact of magnification on permeability varies among different samples. In some samples, permeability increases with magnification, while in others, it decreases. This variability can be attributed to rock matrix and presence of smaller pores at higher magnifications. We have computed the relative error values for the numerical simulation of permeability results as illustrated in Fig. 17. These results consistently exhibit low relative errors across all carbonate samples, signifies a close alignment between numerical simulation LBM permeabilities and the measured permeabilities from well logs in the same geographical area. The average permeability values got from well-A and well-B are 10.041md, and 15.561md, respectively. Meanwhile, the average permeabilities simulated through LBM are: 5.418md at × 100 magnification, 7.55md at × 150 magnification, 10.13md at × 200 magnification, and 13.588md at × 300 magnification. These simulation results closely align with the empirically calculated permeabilities, particularly at higher magnification levels. Figure 18 illustrates the empirical permeability calculation got from well logs, which serves to validate the accuracy of the LBM permeability estimations.

Table 4 The measured permeabilities of carbonate samples at various magnifications obtained by LBM simulation technique.

Full size table

Permeability prediction using machine learning (ML)

This study aimed to explore the predictive capabilities of machine learning (ML) models, specifically Support Vector Machines (SVM) and Artificial Neural Networks (ANN), for estimating the permeability of carbonate samples. Five pore network parameters, including porosity, average pore radius, average throat radius, average grain size, and average coordination number, were selected as input features, while the calculated LBM permeability values served as the target output. To ensure reliable ML model performance, high-quality data were essential. Therefore, the dataset was divided into three subsets with 70% allocated for training, 15% for testing, and the remaining 15% for validation purposes. The dataset contained a total of 200 data points, we utilized 140 data points for training, 30 data points for testing, and 30 data points for validation for our ANN and SVM models. Table 5 shows the statistical analysis of the data. To determine the most accurate ML model for permeability prediction, we trained and evaluated the models and used model evaluation metrics.

Table 5 A statistical description of input data used for machine learning prediction.

Full size table

The analysis of estimating the permeability yielded a neural network architecture with 10 neurons for the ANN model. We implemented the input and hidden layers with a linear-type activation function, while the hidden and output layers used a TAN-sigmoidal type activation function. Figure 19 illustrates the visual representation of the several steps taken by the ANN model to estimate the permeability. Figure 20 displays the outcomes of the permeability prediction using the ANN model compared to the measured permeability values for different datasets, including training, testing, validation, and all data. The coefficient of determination (${R}^{2}$) was employed to assess the performance of the ANN model in predicting the permeability. The ${R}^{2}$ values obtained for the training, testing validation, and all data were 0.955, 0.892, 0.908, and 0.921 respectively. These results show a strong correlation between the predicted and measured permeability values. Similarly, the SVM method was utilized with linear kernel function for permeability prediction, and the ${R}^{2}$ values for training, testing, validation and all data subsets were determined as 0.859, 0.849, 0.869, and 0.849 respectively. Figure 21 shows the cross plots of predicted versus measured permeabilities using the SVM method with corresponding ${R}^{2}$ values. The obtained ${R}^{2}$ values indicate a satisfactory level of accuracy and consistency in the prediction of permeability using both ANN and SVM models. The feature importance (Fig. 22) showed that the grain size distribution (GSD) is the most influential feature, and followed by porosity in both models, underlining their significance in determining model predictions. On the other hand, coordination number (CN) exhibits the least influence on the ANN model, while throat radius (TR) has the least impact on the SVM model. Pore size distribution consistently influences both models, contributing to their predictive capabilities.

In order to identify the accuracy of the model is high when the ${R}^{2}$ value approaches 1, showing a strong correlation between predicted and measured values. When the error values are close to zero, it shows that the model prediction is very close to the true values. In this study, using the result obtained for the coefficient of determination (${R}^{2})$ and RMSE, MSE, and MAE values (Table 6). We investigated the efficiency of the ANN and SVM models and selected the appropriate model for permeability prediction. Table 6 shows predictive models for training, testing, validation, and all data have derived the performance indices. Figure 23 shows the error values and ${R}^{2}$ for the permeability, calculated between the measured and predicted values. Table 7 presents some detailed statics for the predictive models generated (all data) using ANN and SVM, as well as statistics for the empirical models generated with LBM and well log data. These statistics show is closely related to the error analysis. According to the findings in Tables 6 and 7, the ANN predictive models outperformed the SVM models in terms of permeability prediction, because of their ability to adapt to complex and non-linear relationships within the data. This is important in carbonate rocks because they formed through a chemical diagenesis process which has more heterogeneity. The predictive ANN models exhibited smaller error values, with an MAE value of 2.166 for testing, and a standard error of 1.874 for testing. Additionally, we computed the error percentage for the predicted permeability values generated by both the ANN and SVM models. The results highlight that the ANN model exhibits a lower error percentage compared to the SVM model. The predicted permeability values generated by our ANN model have been effectively validated against well-log derived permeability values, specifically from well B in one of the study zones. Figure 24a provides a visual representation of this validation, depicting a cross plot between the ML based permeability and well log derived permeability. Remarkably, this comparison reveals highly promising results, with a coefficient of determination exceeding 0.86. This strong correlation underscores the reliability and accuracy of our ANN model’s permeability predictions. The analysis was performed through resampling of the data points of estimated permeability based on well data. Prior to this, the range of minimum and maximum values of well based estimated permeability was restricted in reference to the range of ML based estimated permeability. Figure 24b visually represents this error comparison between the ANN and SVM. However, it is important to note that the choice of model should be based on the specific situation and the required level of accuracy.

Table 6 Comparison of the ${R}^{2}$ values and error metrics of the SVM and ANN models to identify the better accuracy and reliability in predicting permeability.

Full size table

Table 7 Statistics on permeability results obtained by machine learning algorithms and empirical methods.

Full size table

Conclusions

This study aimed to analyze 2D scanning electron microscope (SEM) images of carbonate samples, characterizing their pore structure. We developed a machine learning (ML) algorithm using pore network parameters as input data to predict the permeability. These predictions were compared with LBM simulation results, with the goal of establishing a reliable method for permeability prediction based on 2D SEM image analysis. The following conclusions are made based on the results of this study:

1.
Initially, we investigated the impact of various threshold algorithms on gray scale images to effectively characterize the pore structure. We calculated porosity based on these algorithms, and MaxEntropy algorithm yielded results (0.02 to 0.12) that were closely aligned with petrographic studies (0.10 to 0.12).
2.
We used a watershed algorithm at different magnifications to extract pore network parameters from the 2D SEM images of carbonate samples. At lower radii, we observed a higher number of pores and throats, whereas the number of pores and throats decreased as the radius increased. We also observed this pattern in the grain size distribution.
3.
Magnification significantly influenced the pore network parameters. With increased magnification, the pore radius, throat radius, and grain size decreased. Conversely, the coordination number exhibited the opposite behavior, increasing with higher magnification.
4.
We used Lattice Boltzmann Method (LBM) to estimate the permeability of carbonate samples. The Lattice Boltzmann Method (LBM) confirmed its validity and reliability in determining the permeability of carbonate samples by providing an acceptable value when compared with log-derived permeabilities.
5.
Machine learning (ML) algorithms such as ANN and SVM methods offers a reliable and accurate approaches for permeability prediction models when using image extracted pore network parameters as input features and LBM permeability values as the output model. The evaluation of permeability prediction using the ANN method yielded notable results. The testing data’s coefficient of determination (${R}^{2})$ for the ANN approach was 0.892, with associated error values of 3.716 for MSE, 1.927 for RMSE and 1.580 for MAE. In contrast, when using the SVM method the ${R}^{2}$ for testing data was 0.849, the corresponding error values were 5.401 for MSE, 2.324 for RMSE and 2.166 for MAE. These results indicate that the ANN approach outperforms the SVM method, demonstrating a higher level of accuracy in predicting permeability.

This research provides valuable insights with practical applications in the oil and gas industry, particularly in carbonate reservoirs. This study enhances our comprehension of these reservoirs by accurately predicting the permeability using ML algorithms. These insights can directly improve hydrocarbon exploration and production, optimize reservoir management, especially in carbonate reservoirs. The reliability of LBM simulations for permeability estimation reinforces its applicability, especially when dealing with heterogeneous carbonate samples.

Data availability

The data used in the present study was supported by Dr. Abhayanand Singh Maurya, Department of Earth Sciences at IIT Roorkee, India. It was collected from his published manuscript after proper permission. A suitable acknowledgment and citation of his published manuscripts have been mentioned in this study.

References

Ghasemi, K. et al. Determination of particle shape and size distribution from micro–X-ray CT Scans for petrophysical evaluation and sand control design. In SPE Trinidad and Tobago Section Energy Resources Conference, Port of Spain. One Petro (2018).
Roostaei, M. et al. Comparison of various particle size distribution measurement methods. SPE Res. Eval. Eng. 23, 1159–1179. https://doi.org/10.2118/199335-PA (2020).
Article CAS Google Scholar
Sahimi, M. Flow and Transport in Porous Media and Fractured Rock: From Classical Methods to Modern Approaches (Wiley, 2012).
Google Scholar
Song, S., Ding, Q. & Wei, J. Improved algorithm for estimating pore size distribution from pore space images of porous media. Phys. Rev. E 100, 053314. https://doi.org/10.1103/PhysRevE.100.053314 (2019).
Article ADS CAS PubMed Google Scholar
Song, S. B. et al. Pore structure characterization and permeability prediction of coal samples based on SEM images. J. Nat. Gas Sci. Eng. 67, 160–171 (2019).
Article Google Scholar
Blunt, M. J., Jackson, M. D., Piri, M. & Valvatne, P. H. Detailed physics, predictive capabilities and macroscopic consequences for pore-network models of multiphase flow. Adv. Water Resour. 25(8–12), 1069–1089. https://doi.org/10.1016/S0309-1708(02)00049-0 (2002).
Article ADS CAS Google Scholar
Wang, K. & Sun, W. A semi-implicit discrete-continuum coupling method for porous media based on the effective stress principle at finite strain. Comput. Methods Appl. Mech. Eng. 304, 546–583. https://doi.org/10.1016/j.cma.2016.02.020 (2016).
Article ADS MathSciNet Google Scholar
Dong, H. & Blunt, M. J. Pore-network extraction from micro-computerized-tomography images. Phys. Rev. E 80, 036307 (2009).
Article ADS Google Scholar
Anovitz, L. M. & Cole, D. R. Characterization and analysis of porosity and pore structures. Rev. Mineral. Geochem. 80(1), 61–164. https://doi.org/10.2138/rmg.2015.80.04 (2015).
Article Google Scholar
Clarkson, C. R. et al. Pore structure characterization of North American shale gas reservoirs using USANS/SANS, gas adsorption, and mercury intrusion. Fuels 103, 606–616. https://doi.org/10.1016/j.fuel.2012.06.119 (2013).
Article CAS Google Scholar
Liu, C. et al. Production analysis in shale gas reservoirs based on fracturing-enhanced permeability areas. Sci. China Phys. Mech. Astron. 62(10), 104611. https://doi.org/10.1007/s11433-019-9427-x (2019).
Article ADS MathSciNet Google Scholar
Zamirian, M., Aminian, K., Ameri, S. & Fathi, E. New steady-state technique for measuring shale core plug permeability. In SPE/CSUR Unconventional Resources Conference, Calgary, Canada. https://doi.org/10.2118/171613-MS (2014).
Rezaee, R., Saeedi, A. & Clennell, B. Tight gas sands permeability estimation from mercury injection capillary pressure and nuclear magnetic resonance data. J. Pet. Sci. Eng. 88–89, 92–99. https://doi.org/10.1016/j.petrol.2011.12.014 (2012).
Article CAS Google Scholar
Kamath, J. Evaluation of accuracy of estimation air permeability from mercury injection data. SPE Form. Eval. 7(4), 304–310 (1992).
Article CAS Google Scholar
Alessa, S., Sakhaee-Pour, A., Sadooni, F. N. & Al-Kuwari, H. A. Comprehensive pore size characterization of Midra shale. J. Pet. Sci. Eng. 203, 108576. https://doi.org/10.1016/j.petrol.2021.108576 (2021).
Article CAS Google Scholar
Munch, B. & Holzer, L. Contradicting geometrical concepts in pore size analysis attained with electron microscopy and mercury intrusion. J. Am. Ceram. Soc. 91(12), 4059–4067. https://doi.org/10.1111/j.1551-2916.2008.02736.x (2008).
Article CAS Google Scholar
Akanji, L. T., Nasr, G. G. & Bageri, M. Core-scale characterization of flow in tight Arabian formations. J. Pet. Explor. Prod. Technol. 3(4), 233–241. https://doi.org/10.1007/s13202-013-0062-1 (2013).
Article Google Scholar
Chung, T., Wang, Y. D., Armstrong, R. T. & Mostaghimi, P. Voxel agglomeration for accelerated estimation of permeability from micro-CT images. J. Pet. Sci. Eng 184, 106577. https://doi.org/10.1016/j.petrol.2019.106577 (2020).
Article CAS Google Scholar
Zhang, L., Lu, S., Xiao, D. & Li, B. Pore structure characteristics of tight sandstone in the northern Songliao Basin, China. Mar. Pet. Geol. 88, 170–180 (2017).
Article CAS Google Scholar
Mostaghimi, P., Blunt, M. J. & Bijeljic, B. Computation of absolute permeability on micro-CT images. Math. Geosci. 45(1), 103–125 (2013).
Article MathSciNet Google Scholar
Geiger, S., Roberts, S. G., Matthäi, S. K., Zoppou, C. & Burri, A. Combining finite element and finite volume methods for efficient multiphase flow simulations in highly heterogeneous and structurally complex geologic media. Geofluids 4(4), 284–299. https://doi.org/10.1111/j.1468-8123.2004.00093.x (2004).
Article Google Scholar
Song, Y. S., Chung, K., Kang, T. J. & Youn, J. R. Prediction of permeability tensor for three-dimensional circular braided preform by applying a finite volume method to a unit cell. Compos. Sci. Technol. 64(10–11), 1629–1636 (2004).
Article CAS Google Scholar
Borujeni, A. T., Lane, N. M., Thompson, K. & Tyagi, M. Effects of image resolution and numerical resolution on computed permeability of consolidated packing using LBM and FEM pore-scale simulations. Comput. Fluids 88, 753–763. https://doi.org/10.1016/j.compfluid.2013.05.019 (2013).
Article MathSciNet Google Scholar
Sun, Z., Logé, R. E. & Bernacki, M. 3D finite element model of semi-solid permeability in an equiaxed granular structure. Comput. Mater. Sci. 49(1), 158–170. https://doi.org/10.1016/j.commatsci.2010.04.042 (2010).
Article Google Scholar
Sandino, C., Kroliczek, P., McErlain, D. D. & Boyd, S. K. Predicting the permeability of trabecular bone by micro-computed tomography and finite element modelling. J. Biomech. 47(12), 3129–3134. https://doi.org/10.1016/j.biomech.2014.06.024 (2014).
Article PubMed Google Scholar
White, J. A., Borja, R. I. & Fredrich, J. T. Calculating the effective permeability of sandstone multiscale lattice Boltzmann/finite element simulations. Acta Geotech. 1(4), 195–209. https://doi.org/10.1007/s11440-006-0018-4 (2006).
Article Google Scholar
Chen, L. et al. Pore-scale simulation of multicomponent multiphase reactive transport with dissolution and precipitation. Int. J. Heat Mass Transf. 85, 935–949. https://doi.org/10.1016/j.ijheatmasstransfer.2015.02.035 (2015).
Article CAS Google Scholar
Yin, Y., Qu, Z. G. & Zhang, J. F. Pore-scale prediction of effective mass diffusivity of heterogeneous shale structure using the lattice Boltzmann method. Int. J. Heat Mass Transf. 133, 976–985. https://doi.org/10.1016/j.ijheatmasstransfer.2018.12.184 (2019).
Article Google Scholar
Rabbani, A. & Babaei, M. Hybrid pore network and lattice Boltzmann permeability modelling accelerated by machine learning. Adv. Water Resour. 126, 116–128 (2019).
Article ADS Google Scholar
Keehm, Y., Mukerji, T. & Nur, A. Permeability prediction from thin sections: 3D reconstruction and lattice-Boltzmann flow simulation. Geophys. Res. Lett. 31(4), L04606. https://doi.org/10.1029/2003GL018761 (2004).
Article ADS Google Scholar
Stewart, M. L., Ward, A. L. & Rector, D. R. A study of pore geometry effects on anisotropy in hydraulic permeability using the lattice-Boltzmann method. Adv. Water Resour. 29(9), 1328–1340. https://doi.org/10.1016/j.advwaters.2005.10.012 (2006).
Article ADS Google Scholar
Wang, Y. D., Chung, T., Armstrong, R. T. & Mostaghimi, P. ML-LBM: Machine Learning Aided Flow Simulation in Porous Media. https://doi.org/10.13140/RG.2.2.11248.40961 (2020).
Wu, Y. et al. Pore-scale 3D dynamic modelling and characterization of shale samples: Considering the effects of thermal maturation. J. Geophys. Res. Solid Earth 125(1), e2019JB018309. https://doi.org/10.1029/2019JB018309 (2020).
Article ADS MathSciNet Google Scholar
Fagbemi, S. & Tahmasebi, P. Coupling pore network and finite element methods for rapid modelling of deformation. J. Fluid Mech. 897, 381. https://doi.org/10.1017/jfm.2020.381 (2020).
Article MathSciNet Google Scholar
Hakimov, N., Zolfaghari, A., Kalantari-Dahaghi, A., Negahban, S. & Gunter, G. Pore-scale network modelling of microporosity in low resistivity pay zones of carbonate reservoir. J. Nat. Gas Sci. Eng. 71, 103005. https://doi.org/10.1016/j.jngse.2019.103005 (2019).
Article Google Scholar
Dahle, H. K., Celia, M. A. & Hassanizadeh, S. M. Bundle of tubes model for calculating dynamic effects in the capillary pressure saturation relationship. Transp. Porous Media 58(1–2), 5–22. https://doi.org/10.1007/s11242-004-5466-4 (2005).
Article Google Scholar
Ghassemi, A. & Pak, A. Pore-scale study of permeability and tortuosity for flow through particulate media using Lattice Boltzmann method. Int. J. Numer. Anal. Methods Geomech. 35(8), 886–901. https://doi.org/10.1002/nag.932 (2011).
Article Google Scholar
Frisch, U., Hasslacher, B. & Pomeau, Y. Lattice gas automata for the Navier–Stokes equation. Phys. Rev. Lett. 56, 1505 (1986).
Article ADS CAS PubMed Google Scholar
Higuera, F. J. & Jimenez, J. Boltzmann approach to lattice gas simulations. EPL 9(7), 663–668. https://doi.org/10.1209/0295-5075/9/7/009 (1989).
Article ADS Google Scholar
Chehrazi, A. & Rezaee, R. A systematic method for permeability prediction, a Petro-Facies approach. J. Pet. Sci. Eng. 82(83), 1–16 (2012).
Article Google Scholar
Tahmasebi, P. & Hezarkhani, A. A fast and independent architecture of artificial neural network for permeability prediction. J. Pet. Sci. Eng. 86(87), 118–126 (2012).
Article Google Scholar
Wang, S., Qin, C., Feng, Q., Javadpour, F. & Rui, Z. A framework for predicting the production performance of unconventional resources using deep learning. Appl. Energy 295, 117016. https://doi.org/10.1016/j.apenergy.2021.117016 (2021).
Article Google Scholar
Lu, Y. & Liu, K. Pore structure characterization of eocene low-permeability sandstone via fractal analysis and machine learning: An example from Dongying depression, Bohai Bay Basin, China. ACS Omega 6(7), 11693–11710 (2021).
Article CAS PubMed PubMed Central Google Scholar
Mustafa, A., Tariq, Z., Mahmoud, M. & Abdulraheem, A. Machine learning accelerated approach to infer nuclear magnetic resonance porosity for middle eastern carbonate reservoir. Sci. Rep. 13(1), 3956. https://doi.org/10.1038/s41598-023-30708-7 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Alipour, M., Esatyana, E., Sakhaee-Pour, A., Sadooni, F. N. & Al-Kuwari, H. A. Characterizing fracture toughness using machine learning. J. Pet. Sci. Eng. 200, 108202. https://doi.org/10.1016/j.petrol.2020.108202 (2021).
Article CAS Google Scholar
Goswami, S., Anitescu, C., Chakraborty, S. & Rabczuk, T. Transfer learning enhanced physics informed neural network for phase-field modelling of fracture. Theor. Appl. Fract. Mech. 106, 102447. https://doi.org/10.1016/j.tafmec.2019.102447 (2020).
Article Google Scholar
Rabbani, A., Assadi, A., Kharrat, R., Dashti, N. & Ayatollahi, S. Estimation of carbonates permeability using pore network parameters extracted from thin section images and comparison with experimental data. J. Nat. Gas Sci. Eng. 42, 85–98. https://doi.org/10.1016/j.jngse.2017.02.045 (2017).
Article Google Scholar
Alzubaidi, F., Mostaghimi, P., Swietojanski, P., Clark, S. & Armstrong, R. T. Automated lithology classification from drill core images using conventional neural networks. J. Pet. Sci. Eng. 197, 107933 (2021).
Article CAS Google Scholar
Lu, H. et al. Pore structure characteristics and permeability prediction model in a cretaceous carbonate reservoir, North Persian Gulf Basin. Geofluids 2021, 8876679 (2021).
Article Google Scholar
Cheng, Y. et al. Pore structure and permeability characterization of tight sandstone reservoirs: From a multiscale perspective. Energy Fuels 37(13), 9185–9196 (2023).
Article MathSciNet CAS Google Scholar
Adegbite, J. O., Belhaj, H. & Bera, A. Investigations on the relationship among the porosity, permeability and pore throat size of transition zone samples in carbonate reservoirs using multiple regression analysis, artificial neural network and adaptive neuro-fuzzy interface system. Pet. Res. 6(4), 321–332 (2021).
Google Scholar
Rostami, A., Baghban, A., Mohammadi, A. H., Hemmati, A. S. & Habibzadeh, S. Rigorous prognostication of permeability of heterogeneous carbonate oil reservoirs: Smart modelling and correlation development. Fuel 236, 110–123 (2019).
Article CAS Google Scholar
Zhang, H. et al. Permeability prediction of low-resolution porous media images using autoencoder-based conventional neural network. J. Pet. Sci. Eng. 208, 109589. https://doi.org/10.1016/j.petrol.2021.109589 (2022).
Article ADS CAS Google Scholar
Tran, H., Kasha, A., Sakhaee-Pour, A. & Hussein, I. Predicting carbonate formation permeability using machine learning. J. Pet. Sci. Eng. 195, 107581. https://doi.org/10.1016/j.petrol.2020.107581 (2020).
Article CAS Google Scholar
Gohari, M. S. J., Niri, M. E. & Ghiasi-Freez, J. Improving permeability estimation of carbonate rocks using extracted pore network parameters: A gas field case study. Acta Geophys. 69, 1–19 (2021).
Google Scholar
Saxena, N. et al. Imaging and computational considerations for image computed permeability: Operating envelope of digital rock physics. Adv. Water Resour. 116, 127–144 (2018).
Article ADS Google Scholar
Yalamanchi, P. & Datta Gupta, S. Selection of a suitable rock mixing method for computing Gardner’s constant through machine learning (ML) approach to estimate the compressional velocity: A study from Jaisalmer sub basin, India. Pure Appl. Geophys. 178, 1825–1844. https://doi.org/10.1007/s00024-021-02733-y (2021).
Article ADS Google Scholar
Zhang, S., Zou, C., Peng, C., Yan, L. & Wu, X. Pore structure and its effect on acoustic velocity and permeability of reef-shoal carbonates in the Tarim Basin, Northwest China. J. Geophys. Eng. 19(6), 1340–1354 (2022).
Article Google Scholar
Awasthi, A. M. Geophysical exploration in Jaisalmer Basin—A case history. Geohorizons 1, 1–6 (2002).
Google Scholar
Rao, V. R. Subsurface stratigraphy tectonic setting and petroleum prospects of the Jaisalmer area, Rajasthan, India. In Proc. IV Symposium of Development in Petroleum Resources of Asia and the Far East, Camberra, Australia, Series, Vol. 41, 366–371 (1972).
Pandey, D. K., Fursich, F. T. & Baron-Szabo, R. Jurassic corals from the Jaisalmer Basin, western Rajasthan, India. In Zitteliana 13–37 (2009).
Zadan, K. & Arbab, K. A. A review on Lithostratigraphy and Biostratigraphy of Jaisalmer basin, western Rajasthan, India. Int. Res. J. Earth Sci. 3(8), 37–45 (2015).
Google Scholar
Pandey, R. & Maurya, A. S. Hydrocarbon uncertainty based on facies analysis: Middle Jurassic sequence (Jaisalmer Formation), Jaisalmer Basin, Rajasthan. J. Geol. Soc. India 95, 301–307. https://doi.org/10.1007/s12594-020-1429-9 (2020).
Article CAS Google Scholar
Wandrey, C. J., Law, B. E. & Shah H. A. Sembar Goru/Ghaij total petroleum system Indus Sulaiman–Kirthar geologic province, Palistan and India. In USGC, 2208C 1-23 (2004).
Oldham, R. P. Preliminary note on the geology of northern Jaisalmer. Rec. Geol. Surv. India 19, 157–160 (1886).
Google Scholar
Kachhara, R. P. & Jodhawat, R. L. Bivalve biostratigraphy of the Jaisalmer formation, western Rajasthan, India. In Geological Evolution of Northwestern India (ed. Paliwai, B. S.) 109–117 (Scientific Publishers, 1999).
Google Scholar
Das Gupta, S. K. A revision of the mesozoic-tertiary stratigraphy of the Jaisalmer Basin, Rajasthan, India. Indian J. Earth Sci. 2(1), 77–94 (1975).
Google Scholar
Narayanan, K., Subrahmanyan, M. & Srinivasan, N. Geology of Jaisalmer. Unpublished Report ONGC (1961).
Sharma, J. K. & Pandey, D. K. Taxonomy of late Bathonian–Oxfordian ammonites from the Jaisalmer Basin: Implications for intrabasinal litho-and biostratigraphic correlations. J. Palaeontol. Soc. India 61(2), 249–266 (2016).
Google Scholar
Kapur, J. N., Sahoo, P. K. & Wong, A. K. C. A new method for gray-level picture thresholding using the entropy of histogram. Comput. Vis. Graph. Image Process. 29(3), 273–285. https://doi.org/10.1016/0734-189X(85)90125-2 (1985).
Article Google Scholar
Otsu, N. A threshold selection method from grey-level histograms. IEEE Trans. Syst. Man Cybern. 9, 62–66. https://doi.org/10.1109/tsmc.1979.4310076 (1979).
Article Google Scholar
Huang, L. K. & Wang, M. J. J. Image thresholding by minimizing the measures of fuzziness. Pattern Recogn. 28, 41–51 (1995).
Article ADS Google Scholar
Yen, J. C., Chang, F. J. & Chang, S. A new criterion for automatic multilevel thresholding. IEEE Trans. Image Process. 4, 370–378 (1995).
Article ADS CAS PubMed Google Scholar
Gosticks, J. T. Versatile and efficient pore network extraction method using marker-based watershed segmentation. Phys. Rev. E 96, 023307. https://doi.org/10.1103/PhysrevE.96.023307 (2017).
Article ADS Google Scholar
Baldwin, C. A., Sederman, A. J., Mantle, M. D., Alexander, P. & Gladden, L. F. Determination and characterization of the structure of a pore space from 3D volume images. J. Colloid Interface Sci. 181(1), 79–92 (1996).
Article ADS CAS Google Scholar
Schindelin, J. et al. Fiji: An open-source platform for biological-image analysis. Nat. Methods 9(7), 676–682. https://doi.org/10.1038/nmeth.2019 (2012).
Article CAS PubMed Google Scholar
MATLAB Version: 9.13.0, R2022b. https://www.mathworks.com (The MathWorks Inc.).
Sonka, M., Hlavac, V. & Boyle, R. Image Processing, Analysis, and Machine Vision 4th edn, 170–232 (Cengage Learning, 2014).
Google Scholar
Gonzales, R. C., Woods, R. E. & Eddins, S. L. Digital Image Processing Using MATLAB (Pearson Prentice Hall, 2004).
Google Scholar
Rabbani, A., Jamshidi, S. & Salehi, S. An automated simple algorithm for realistic pore network extraction from micro-tomography images. J. Pet. Sci. Eng. 123, 164–171. https://doi.org/10.1016/j.petrol.2014.08.020 (2014).
Article CAS Google Scholar
Rabbani, A., Ayatollahi, S., Kharrat, R. & Dashti, N. Estimation of 3D pore network coordination number of rocks from watershed segmentation of a single 2-D image. Adv. Water Resour. 94, 264–277. https://doi.org/10.1016/j.advwaters.2016.05.020 (2016).
Article ADS Google Scholar
Srisutthiyakorn, N. & Mavko, G. Computation of grain size distribution in 2-D and 3-D binary images. Comput. Geosci. 126, 21–30. https://doi.org/10.1016/j.cageo.2019.01.019 (2019).
Article ADS Google Scholar
Pan, C., Hilpert, M. & Miller, C. T. Lattice-Boltzmann simulation of two-phase flow in porous media: LBM simulation of two-phase flow. Water Resour. Res. 40(1), 2120. https://doi.org/10.1029/2003WR002120 (2004).
Article Google Scholar
He, X. & Luo, L. S. Lattice Boltzmann for the incompressible Naiver–Stokes equation. J. Stat. Phys. 88, 927–944. https://doi.org/10.1023/B:JOSS.0000015179.12689.e4 (1997).
Article ADS Google Scholar
Bhatnagar, P. L., Gross, E. P. & Krook, M. A model for collision process in gases. I. Small amplitude processes in charged and neutral one-component systems. Phys. Rev. 94(3), 511–525. https://doi.org/10.1103/PhysRev.94.511 (1954).
Article ADS CAS Google Scholar
Haslam, I. W., Crouch, R. S. & Seaid, M. Coupled finite element-lattice Boltzmann analysis. Comput. Methods Appl. Mech. Eng. 197(51–52), 4505–4511. https://doi.org/10.1016/j.cma.2008.04.002 (2008).
Article ADS Google Scholar
Mortensen, N. A., Cockles, F. & Bruus, H. Re-examination of Hagen–Poiseuille flow: Shape dependence of the hydraulic resistance in microchannel. Phys. Rev. E 71(5), 7301. https://doi.org/10.1103/PhysRevE.71.057301 (2005).
Article ADS CAS Google Scholar
Pazdniakou, A. & Adler, P. M. Dynamic permeability of porous media by the lattice Boltzmann method. Adv. Water Resour. 62, 292–302 (2013).
Article ADS Google Scholar
Tixier, M. P. Evaluation of permeability from electric-log resistivity gradients. Oil Gas J. 8, 75–90 (1949).
Google Scholar
Timur, A. An investigation of permeability, porosity, and residual water saturation relationships for sandstone reservoirs. Log Anal. 9(4), 3–5 (1968).
Google Scholar
Coats, G. R. & Dumanoir, J. L. A new approach to improved log-derived permeability. Log Anal. 15, 17–31 (1974).
Google Scholar
Buckles, R. S. Correlating and averaging connate water saturation data. J. Can. Pet. Technol. 4, 42–52. https://doi.org/10.2118/65-01-07 (1965).
Article CAS Google Scholar
Holmes, M., Holmes, A. & Holmes, D. Relationship between porosity and water saturation: Methodology to distinguish mobile from capillary bound water. In Adapted from Oral Presentation at AAPG Annual Convention, Denver, Colorado, 7–10 June 2009 AAPG National Meeting (2009).
Kalkan, E., Akbulut, S., Tortum, A. & Celik, S. Prediction of the unconfined compressive strength of compacted granular soils by using inference systems. Environ. Geol. 58(7), 1429–1440. https://doi.org/10.1007/s00254-008-1645-x (2009).
Article ADS CAS Google Scholar
Adhikary, B. B. & Mutsuyoshi, H. Prediction of shear strength of steel fiber RC beams using neural networks. Constr. Build. Mater. 20(9), 801–811. https://doi.org/10.1016/j.conbuildmat2005.01.047 (2006).
Article Google Scholar
Ham, F. & Kostanic, I. Fundamental neurocomputing concepts. Princ. Neurocomput. Sci. Eng. (2001).
Gordan, B., Armaghani, D. J., Hajihassani, M. & Monjezi, M. Prediction of seismic slope stability through combination of particle swarm optimization and neural network. Eng. Comput. 32(1), 85–97. https://doi.org/10.1007/s00366-015-0400-7 (2016).
Article Google Scholar
Basheer, I. A. Selection of methodology for neural network modelling of constitutive hystereses behavior of soils. Comput. Aided. Civil Infrastruct. Eng. 15(6), 445–463. https://doi.org/10.1111/0885-9507.00206 (2000).
Article Google Scholar
Vapnik, V. N. The Nature of Statistical Learning Theory. Information Science and Statistics Series (Springer, 1995).
Google Scholar
Onifade, M., Lawal, A. I., Aladejare, A. E., Bada, S. & Idris, M. A. Prediction of gross calorific value of solid fuels from their proximate analysis using soft computing and regression analysis. Int. J. Coal Prep. Util. 42(4), 1170–1184. https://doi.org/10.1016/j.advwaters.2013.06.001 (2019).
Article Google Scholar
Xing, J., Luo, K., Wang, H., Gao, Z. & Fan, J. A comprehensive study on estimating heating value of biomass proximate and ultimate analysis with machine learning approaches. Energy 188, 116077 (2019).
Article Google Scholar
Gauthier, T. D. Detecting trends using Spearman’s rank correlation coefficient. Environ. Forens. 2(4), 359–362 (2001).
Article CAS Google Scholar

Download references

Acknowledgements

The authors express our sincere gratitude to Dr. Abhayanand Singh Maurya, Associate Professor in the Department of Earth Science at IIT Roorkee, India, for providing the carbonate samples data in image form. They also thankful for his continuous encouragement and support throughout this research. Additionally, they extend our appreciation to the Exploration Seismic and Simulation lab in the Department of Applied Geophysics, IIT(ISM) Dhanbad for providing the necessary facilities and support to conduct this study. Lastly, they would like to acknowledge all individuals who were directly or indirectly involved in this research.

Author information

Authors and Affiliations

Department of Applied Geophysics, Indian Institute of Technology (Indian School of Mines), Dhanbad, 826004, India
Pydiraju Yalamanchi & Saurabh Datta Gupta

Authors

Pydiraju Yalamanchi
View author publications
You can also search for this author in PubMed Google Scholar
Saurabh Datta Gupta
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.Y. Framed experiment, conducted study, execution of code and writing of manuscript. S.D.G. Framing scope of the work with novelty, defining workflow, validating results and writing manuscript.

Corresponding author

Correspondence to Saurabh Datta Gupta.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yalamanchi, P., Datta Gupta, S. Estimation of pore structure and permeability in tight carbonate reservoir based on machine learning (ML) algorithm using SEM images of Jaisalmer sub-basin, India. Sci Rep 14, 930 (2024). https://doi.org/10.1038/s41598-024-51479-9

Download citation

Received: 13 July 2023
Accepted: 05 January 2024
Published: 09 January 2024
DOI: https://doi.org/10.1038/s41598-024-51479-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

A case study of petrophysical rock typing and permeability prediction using machine learning in a heterogenous carbonate reservoir in Iran

Stacked ensemble machine learning for porosity and absolute permeability prediction of carbonate rock plugs

Utilizing machine learning for flow zone indicators prediction and hydraulic flow unit classification

Introduction

Geology of the study area

Materials and methods

Data preparation

Method

Image processing

Threshold-based segmentation algorithms

Determination of pore network parameters

Permeability estimation based on LBM simulation

Permeability determination from well log data

Machine learning algorithms

Artificial Neural Network (ANN)

Support vector machine (SVM)

Relative influences of the input variables on LBM permeability

Model evaluation metrics

Results and discussion

Segmentation of carbonate samples

Determination of pore network parameters

Permeability calculation using LBM

Permeability prediction using machine learning (ML)

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links