Modeling and scale-bridging using machine learning: nanoconfinement effects in porous media

Lubbers, Nicholas; Agarwal, Animesh; Chen, Yu; Son, Soyoun; Mehana, Mohamed; Kang, Qinjun; Karra, Satish; Junghans, Christoph; Germann, Timothy C.; Viswanathan, Hari S.

doi:10.1038/s41598-020-69661-0

Download PDF

Article
Open access
Published: 07 August 2020

Modeling and scale-bridging using machine learning: nanoconfinement effects in porous media

Nicholas Lubbers¹,
Animesh Agarwal²,
Yu Chen³,
Soyoun Son^4,5,
Mohamed Mehana³,
Qinjun Kang³,
Satish Karra³,
Christoph Junghans⁶,
Timothy C. Germann⁷ &
…
Hari S. Viswanathan³

Scientific Reports volume 10, Article number: 13312 (2020) Cite this article

4795 Accesses
24 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Fine-scale models that represent first-principles physics are challenging to represent at larger scales of interest in many application areas. In nanoporous media such as tight-shale formations, where the typical pore size is less than 50 nm, confinement effects play a significant role in how fluids behave. At these scales, fluids are under confinement, affecting key properties such as density, viscosity, adsorption, etc. Pore-scale Lattice Boltzmann Methods (LBM) can simulate flow in complex pore structures relevant to predicting hydrocarbon production, but must be corrected to account for confinement effects. Molecular dynamics (MD) can model confinement effects but is computationally expensive in comparison. The hurdle to bridging MD with LBM is the computational expense of MD simulations needed to perform this correction. Here, we build a Machine Learning (ML) surrogate model that captures adsorption effects across a wide range of parameter space and bridges the MD and LBM scales using a relatively small number of MD calculations. The model computes upscaled adsorption parameters across varying density, temperature, and pore width. The ML model is 7 orders of magnitude faster than brute force MD. This workflow is agnostic to the physical system and could be generalized to further scale-bridging applications.

Predictive scale-bridging simulations through active learning

Article Open access 27 September 2023

Machine learning to predict effective reaction rates in 3D porous media from pore structural features

Article Open access 31 March 2022

A Dataset of 3D Structural and Simulated Transport Properties of Complex Porous Media

Article Open access 03 October 2022

Introduction

Multi-scale physics problems are found in all scientific disciplines. Prominent examples can be found in material science^1,2,3, biology⁴, chemistry ^5,6,7,8,9, and geosciences^10,11,12. Typically, information from computationally intensive fine-scale models have to be translated or upscaled into faster coarse-scale models to solve the problem at the scale of interest. A problem of great scientific and economic interest is the flow of hydrocarbon in nanoporous shale. Traditional porous media approaches such as the LBM allow for complex pore geometries but need to be provided with effective properties that account for nanoconfinement effects in order to accurately simulate mass transport at the continuum scale¹³. Atomistic simulations such as Molecular Dynamics (MD) capture nanoconfinement effects accurately, but are limited to a few pores as they are computationally intractable to simulate for mesoscopic pore geometries. There is a need for approaches that efficiently bridge these two scales without compromising accuracy.

Recently, Machine Learning (ML) has shown great promise in accelerating physics-based models that makes it feasible to build a scale-bridging framework^14,15,16. The applications include fracture propagation in brittle materials¹⁷, computational fluid dynamics¹⁸ and molecular dynamics¹⁹. On another dimension, Machine Learning (ML) techniques have found their way into petroleum engineering and studies of porous media. For instance, Fulford et al. ²⁰ used ML to tackle the challenges for predicting well performance in shale reservoirs, and similarly, Li et al. ²¹ used an ensemble of ML techniques to construct the expensive-to-acquire logs which provided a reliable way to estimate the in-situ geomechanical properties of shale reservoirs. Kamrava et al. ²² used ML to generate synthetic 3D micropore structures in shale. Additionally, Kamrava et al. ²³ estimated permeability of structures. Combining both accelerated computation and porous media, Santos et al. ²⁴ modeled complex fluid flow through 3D porous media geometries. In this work, we describe and implement an ML framework to bridge the molecular and continuum scales in order to accurately simulate hydrocarbon adsorption in nanoporous media.

The physics of nanoconfined hydrocarbons has come to the forefront due to the recent unconventional boom. Hydrocarbon-rich tight formations (low-permeable shale, sandstone, and carbonate rock formations) have very small pores ranging in size from a few to a few hundred nanometers. Hydrocarbons are either stored as free oil/gas in the pore space or adsorbed on the pore walls. Because of the small pores and the resulting low permeability ($10^{-16}$ to $10^{-20}\ \hbox {m}^2$)²⁵, the enclosed hydrocarbon resources are very difficult to access. Hydraulic fracturing and horizontal drilling allow access to the free oil/gas in the fracture network and adjacent damaged zones but the hydrocarbons in the matrix are still untapped^{26, 27}. If the mass transport from the matrix can be accelerated and later-stage production can therefore be improved, then the petroleum industry may find it profitable to continue production from existing wells before drilling new ones, leading to enhanced recovery while minimizing the environmental impact.

However, since hydrocarbon in shale matrix is under nanoconfinement, due to the small pore sizes (e.g., less than 50 nm), it prevents traditional reservoir simulators from accurately predicting mass transport from the shale matrix into the fractures. Specifically, properties such as density, viscosity, phase transition, and adsorption deviate from macroscopic behavior under nanoconfinement due to the increased importance of boundary layer effects, greatly affecting mass transport rates^{28,29,30,31,32,33,34,35,36,37,38,39}.

Adsorption, the accumulation of hydrocarbon molecules onto the pore walls, is one of the most critical nanoconfinement processes that affects the extraction of hydrocarbons out of nanopores⁴⁰. It is estimated that a large portion (20–80%) of the total shale gas in a reservoir is in the absorbed form^{40, 41}. Methane adsorption under nanoconfinement and its effect on transport in shale matrix have been investigated through off-line MD and LBM³⁵. In that study, equilibrium MD simulations are conducted to study methane adsorption on the organic and inorganic walls of nanopores in shale matrix with different pore sizes and pressures. Density and pressure distributions within the adsorbed layer, as well as the pressure-dependent thickness of adsorbed layer, are obtained from the MD simulations. This information is then implemented in the LBM simulations, through which the effect of adsorption on transport is considered. However, in this approach, it is implicitly assumed that there is a clear separation of scales. This is not a generally valid assumption, since confinement effects in the pore affect flow through the nanoporous medium and vice-versa.

We demonstrate our ML-based scale-bridging framework to capture adsorption under nanoconfinement where there is no clear separation of scales. We incorporate atomistic adsorption effects that occur within a nanoconfined pore as simulated accurately by MD into a continuum LBM that is capable of simulating larger scales. We show that our ML framework is accurate and much more efficient than direct MD, allowing up to 7 orders of magnitude speedup making it ideal for a robust scale-bridging framework. Moreover, the workflow is not dependent on the physical characteristics of adsorption phenomena; the workflow is agnostic to the physical system at hand and could be generalized to further surrogate modeling and scale-bridging applications.

Results

In order to bridge the scales between MD and LBM, we take advantage of the recent advances in ML, where Neural Network based emulators can be used to replace physics-based models. Our goal is for LBM to accurately model adsorption under nanoconfinement, requiring that it be informed by MD. To achieve this, we aim at building an upscaler that maps MD inputs to LBM inputs. Both models require pore width, overall density and temperature as inputs to capture adsorption behavior. However, since LBM is a continuum model, an additional adsorption coefficient is needed to capture the fluid-wall interaction in the LBM method. This parameter cannot be directly measured, rather, it is a model parameter that must be calibrated. Since both MD and LBM can be computationally expensive to span the entire input parameter space, in order to build the upscaler, first we build emulators that mimic MD and LBM behavior. Specifically, we utilize Deep Neural Networks (DNNs) as our emulators. We train the emulators using apriori MD and LBM simulations over their respective input parameter spaces (Fig. 1A,B). Our emulators also ensure conservation of mass to prevent unphysical density profiles, and ensure that our emulators produce smooth, symmetric profiles. This is done through engineering the architecture of the NN via constraining the activation functions of the neurons. The spatial density profile is used as the output for both MD and LBM and is representative of the adsorption of the fluid to the pore wall. Having trained two emulators, we train a DNN-based upscaler (Fig. 1C) that maps MD inputs into LBM inputs. The goal of the upscaler is to match the spatial density profiles between the MD and LBM emulators. This is done by finding the optimal density, temperature and the adsorption coefficient that minimize the error between the spatial density profiles from the two emulators (Fig. 1D).

The choice of DNN models is an important one, motivated by the availability of fast Automatic Differentiation algorithms for computing gradients ⁴² that are already implemented in DNN libraries ⁴³. Training the upscaler requires gradient information about changes in LBM profiles with respect to LBM inputs. These gradients are easy to compute from the DNN-LBM emulator, and allow us to use the emulated profile behavior to train an upscaler with less computational effort and human time than would be necessary otherwise. In contrast, it would be far more expensive to calculate numerical finite-difference gradients from discrete LBM simulations, which requires many LBM simulations for any given state point to determine the sensitivity of the profile with respect to the input parameters. Similarly, using automatic gradients is far less laborious than deriving, coding, and testing the gradient of LBM profiles with respect to LBM input combinations within an existing LBM code. In addition, the LBM emulator is about 4 orders of magnitude cheaper than direct LBM (see “Computational costs”), making it more tractable for training the upscaler.

Next we summarize the gaps with traditional scale-bridging methods and the advantages to our approach:

(1)
Sequentially approaching adsorption one input parameter at a time will require a separate set of MD calculations for each parameter to consider, which is computationally laborious. Ours is a holistic method over the entire multivariable parameter space and requires few MD simulations to perform upscaling.
(2)
Because the framework operates over a multivariable parameter space rather than optimizing upscaled parameters on a point-by-point basis, it is suited for the simultaneous modeling of multiple upscaled parameters, because it incorporates the properties of many simulations simultaneously.
(3)
Upscaled functions are often assumed to obey some sort of relationship such as a linear or exponential dependence in order to simplify the procedure. Our approach makes no assumptions about the form of upscaling functions from fine-scale (MD) inputs to coarse-scale (LBM) parameters, except that these functions can be generated by a neural network.
(4)
Our framework is extensible to other forms of upscaling such as flow conditions, nanoscale modifications to equation of state, and phase transition characteristics.
(5)
In this work we focus on 1D pores; however, the overall workflow does not depend on the dimensionality of the system, and can be generalized to two and three dimensional porous geometries.
(6)
The method enables extreme throughput for upscaled parameters. Once trained, calls to the upscaler are very cheap, and can easily be evaluated millions of times at negligible computational cost. This is highly advantageous in comparison to direct scale-bridging methods, which require continual calls to fine-scale simulations to advance the coarse-scale simulation.

In the 1D pore scenario studied here, the inputs to an MD simulation are width, overall methane density, and temperature, denoted w, $\rho _o$, T. LBM simulations use the same width w, and take density $\rho _0'$ and temperature $T'$, as well as the additional adsorption parameter, denoted $a'$. The role of the upscaler is to take any set of inputs w, $\rho _0$, and T, and determine matching effective parameters $\rho _0'$, $T'$, and $a'$ such that adsorption effects in the LBM physics match those in MD.

We use datasets of 1,010 simulations for MD and 8,074 for LBM (for details, see “Methods” below). The parameter ranges for MD are: 3–300 kg/m³ for density, 300–400 K for temperature and 2.4–22 nm for pore width. For LBM, the parameter ranges are (in lattice units, l.u.): 0.25–1.5 l.u. for density, 0.8–2 l.u. for temperature, 3–13 nm for pore width, and 1–10 for the (dimensionless) adsorption parameter. The MD and LBM emulators were then trained on these simulations. The training for the MD and LBM emulators shows good overall profile density, and the upscaler network produces good models of the bulk density, as shown in Fig. 2, which compares the true and predicted values on held-out test fractions of the datasets. We note that the MD plot compares the collected (True) and emulated (Predicted) profiles across each point and shows very little scatter. The LBM plots the same quantity for LBM simulations. While there are a few outlier points, the vast majority of the predicted points fall very close to the to true points. The upscaler is trained to construct LBM inputs that generate the correct bulk density, and so the resulting plot only has one point per profile. For the upscaler, the fit visually near perfect.

Extracted physical quantities

The trained emulators and upscaler can be used to extract physical information about the pore. Figure 3 shows our approach in comparing profiles to a uniform profile with the same bulk density, where bulk density is defined as the density in the center of the pore. Blue regions show excess density compared to the bulk. The excess density represents the nanoconfinement adsorption effect from MD that needs to be captured in LBM through the adsorption parameter. The excess density is calculated as follows. The purple region shows the intersection of the MD profile with the adsorption-free fluid profile, the blue regions are the excess compared to the adsorption-free fluid profile, and the red regions are deficits compared to the bulk (central) density. The excess density is then given as the total area of the MD profile (purple plus blue) minus the total area of the bulk profile (purple plus red). Interestingly, as the methane density increases, the width of the emulated adsorption layer decreases. Although multiple adsorption layers form in MD, as the density is increased further, a deficit forms (red region in Fig. 3) between the first and second layers, which, overall, mitigates the excess density in the second adsorption layer. This signals an increase of structure in the adsorbed particles. This phenomenon naturally arises from the microscopic nature of MD which is captured in our workflow without apriori conceptualization.

Figure 4 compares the excess density across the input space, and indicates that measurements from the MD and upscaled LBM emulators are in good agreement. As seen in Fig. 4A, for low densities, the excess density is approximately linear with the total density. When the density is near 150 kg/m³, a turning point is approached, and excess density begins to decrease with increasing total density. This corresponds to a saturation of the adsorbed particles near the wall; any new particle added to the system is more likely to equilibrate into the bulk than to the adsorbed layer. At extremely low densities below $\approx 5$ kg/m³, there is a small mismatch between the MD emulator and the LBM upscaler—we will remark on this further at the end of this section. The multiple curves for various temperatures indicate that this effect is present for all temperatures, but the saturation effect occurs at larger densities for higher temperatures. Figure 4A also shows that excess density is inversely related to temperature. This demonstrates that as temperature increases, methane preferentially desorbs from the wall into the bulk. This effect is captured in the upscaling scheme.

Figure 4B shows the fraction of excess methane density compared to total density. We found that the nano-confinement allows the nanopores to pack more methane compared to the bulk-based estimates. Quantitatively, one can expect at least 15% more methane in 4 nm pores compared to the bulk-based estimates, which is consistent with prior suggestions⁴⁴. This better packing is more pronounced at lower densities and smaller pores. Note that what we report as the fraction of the excess density is different than the fraction of the adsorbed phase. The excess density reports the ratio of the excess mass, the total mass minus the uniform bulk mass, to the total mass, a quantity which can easily be obtained from an MD profile. The total adsorbed density represents the ratio between the mass of the adsorbed phase to the total mass in the pore. Performing such a measurement requires separating the methane density into separate adsorbed and bulk phase components, and such a distinction cannot be physically made from an MD density profile. Because of the difference in the nature of these two quantities, our reports of excess density differ significantly from the total adsorbed mass reported in the literature ⁴⁵. We note that for extremely low total densities below $\approx 5$ kg/m³, the fraction of excess density exhibits observable disagreement between MD and LBM emulators.

Figure 5 shows the adsorption coefficient predicted by the upscaler, that is used by LBM to model adsorption under confinement, for a wide range of temperatures and total densities, at pore widths of 4 nm and 12 nm. As seen in the figure, the relationship between the adsorption coefficient, total density, and temperature is smooth but complex. This demonstrates that scale-bridging is necessary for informing the LBM adsorption coefficient under confinement. The adsorption coefficient is strongly increasing as the total density decreases, but varies less for large densities. The adsorption coefficient shows a weaker monotonically increasing dependence on temperature. For the 4 nm pore the density dependence is monotonic—this may be a region where nano-confinement induces a supercritical phase⁴⁶. On the other hand, in a 12 nm pore, the non-monotonic trend of adsorption with density may be indicative of a sub-critical phase. We reiterate that this functional form for the LBM adsorption coefficient was not selected a priori, but was generated by training the upscaler through the fusion of information from MD and LBM simulations.

Finally, we discuss the performance of the MD and upscaled LBM emulators for very low density systems (below $\approx 5$ kg/m³), where there is some disagreement between the two systems. This discrepancy is small in absolute terms (Fig. 4A) and noticeable in relative terms (Fig. 4B). There are several possible sources of this discrepancy, each with accompanying remedies:

(1)
The mismatch may stem from the differences between MD and LBM formulations. The discrete nature of MD allows the accumulations of molecules near the surface with no presence of molecules in the center of the pore, that is, the system consistents entirely of adsorbed particles, with essentially zero in the bulk. On the other hand, in LBM simulations, there must be a continuum between the bulk fluid and the the adsorbed fluid; the LBM formulation may not be able to treat such low densities. If this is the case, upscaling cannot be performed for extremely low densities simply because the physics of the fine scale cannot be represented at the coarse scale.
(2)
It may arise because the cost function for the upscaler has units of density (see “Methods”). This explains why the mismatch is not large in absolute terms but magnified in relative terms. The solution in this case is to specify the cost function in terms of the physical observables of interest; if the variable of interest is fractional pore adsorption, then the accompanying cost function should be dimensionless.
(3)
It may be due to the large adsorption coefficients computed by the upscaler for low densities. The LBM dataset was limited to adsorption coefficients less than 10, but for low densities the upscaler drives the emulator out of this range (Fig. 5). One might address this case by extending the range of LBM simulations to larger adsorption values.

We expect that some mixture of these effects accounts for the situation. It is worth noting that the common reservoir conditions in shale formations are unlikely to probe such low densities; as such we expect it to be largely irrelevant to applications. Despite this, we present this discussion as it entails several considerations for future research into ML-based upscaling: One is to keep in mind that it is important to consider the range of physics accessible at the coarse scale. Second, one must carefully consider which properties are intended to be probed and how they give rise to a natural cost function to optimize; small mismatches in one metric may imply larger mismatches under another metric. The last is to consider the range of data sampled: While ML is designed to generalize to new data that is similar to training data, no purely data-driven method could be expected to extrapolate to conditions far outside of the range of data sampled.

Computational costs

The aim of scale-bridging techniques is to alleviate computational costs associated with brute-force fine-scale simulation while retaining the important aspects of the fine-scale physics, and so in this section we review the computational cost of our methods. MD and LBM calculations were performed on various commericially available HPC hardware with CPU-based implementations. Neural networks were treated with a 12-core Intel i9 CPU on a commercially available laptop.

First, we cover the data-collection and model-building phases. Collecting the 1,010 MD simulations required approximately 800 core-h of computation ($\approx$ 1 core-h/MD-call). It is worth noting that this is comparable to the number of fine-scale calls needed in a single time-step in scale-bridging applications ^{1,2,3, 6}. The matching 8,074 LBM simulations are far simpler, taking $\approx$20 core-h total, or $\approx$5 core-s/LBM-call. Training the MD emulator, LBM emulator, and upscaler took approximately 24 core-h each, resulting in $\approx$72 core-h of training time. The dominant cost is thus MD simulation.

Having trained our models, the neural networks can be evaluated very quickly. The emulator must treat the density across the entire pore, and on average takes $\approx$1 core-ms/emulator-call. We remark that generating Fig. 4 requires assessing the profile of $\approx 3200$ sets of densities and temperatures for a fixed-width pore; this is considerably larger than the number of MD calculations performed in total over all pores. The emulators can generate this data in seconds on a single machine. The upscaler does not need to be evaluated for each point in the profile, and so it can be evaluated in $\approx$200 core-$\upmu$s/upscaler-call in serial evaluation. If batched to a sufficient number of combined calls, this can be lowered to $\approx$10 core-$\upmu$s/upscaler-call. If we take the serial mode as a conservative estimate, calling to the upscaler is $10^7$ times cheaper than a single MD simulation.

There are many practical considerations necessary to contexturalize the $10^7$ speedup factor, which can be regarded as an asymptotic speedup possible when other effects do not dominate. First, we mention potential limitations. One is that acheiving this speedup factor requires an application where the number of upscaling calls required is $10^7$ times larger than the number of training MD simulations. While this is large, current supercomputers exceed $10^5$ total cores ^{47, 48}, and atomistic applications have utilized computation at this scale ^49,50,51,52. Another factor is the expense of coarse-scale operations; the speedup factor of our method compared to direct MD is limited by the ratio of computation spent in the coarse model compared to MD. Lastly, implementation details may give rise to overheads associated with communication of data, etc. that affect final performance.

We also consider the practical advantages of the workflow. For one, the training MD simulations that constitute the bulk of the cost can be massively parallelized to take advantage of large HPC environments; in direct scale-bridging, parallelization is limited to the number of MD calls needed in a given time-step. Second, with such a large asymptotic speedup factor, larger coarse-scale simulations can be designed to take advantage of the throughput available with the ML-based upscaler. Third, given that 800 core-h for MD with a united-atom force field can be performed on a small cluster, far more accurate potentials could be afforded. In this case, the speedup factor would be proportionately larger; changing the nature of the MD simulation will not affect the cost of ML. Lastly, all three core algorithms (MD, LBM, and DNN) are good applications to accelerate via GPGPU computation ⁵³.

Conclusions and future prospects

In this work, we have described and demonstrated an ML framework to bridge the molecular and continuum scales in order to accurately simulate hydrocarbon properties in nanoporous media. We incorporate atomistic adsorption effects that occur within a nanoconfined pore as simulated by MD into a continuum LBM that can simulate larger scales with more complex geoemetries. We demonstrated that our ML framework is accurate and much more efficient than direct MD allowing up to 7 orders of magnitude speedup making it ideal for a robust scale-bridging framework that could be deployed even on modest computer clusters. Our approach is not dependent on the physical characteristics of adsorption phenomena. Therefore we expect to be able to extend it to other nanoconfinement effects such as slip and phase transition effects that also affect hydrocarbon flow, but have thus far been neglected in traditional continuum reservoir simulators. Our framework is flexible and extensible to other forms of upscaling. In addition, we expect the flexibility of the upscaling form to provide additional physical insights into how discrete molecular effects manifest themselves in complex 3D pore structures at the continuum scale. If an unexpected upscaling function emerges during our analysis, it could illuminate new physics within a nanoconfined pore. For example, as methane density increases, the adsorption thickness actually increases due to the formation of multiple adsorption layers. This was not explicitly parameterized in our framework but was discovered in the data analyis.

The advantages of our approach were demonstrated for an adsorption application. Our method simultaneously captures adsorption for a wide range of temperatures, pore widths and densities. Prior approaches require re-fitting of upscaling parameter; for example, a change in temperature or pore width implied the need to re-establish the adsorption as a function of total density. In contrast, the ML approach captures the data in one-shot across the full parameter space. While prior approaches modeled adsorption purely as a relationship between bulk and total density, our approach models the entire density profile and calculates the optimal adsorption parameter in the upscaler. This enables validation by ensuring that the estimated profiles are physically realistic. We also constructed our emulators such that they ensure symmetry and conservation of mass to prevent unphysical density profiles. This is done through engineering the architecture of the emulators via engineering the activation functions of the neurons. DNNs prove useful in this regard because their engineerable architectures lend themselves to applying physics informed constraints to the model.

Analyzing the resulting models shows that it captures many physical effects present in MD by dynamically adjusting the parameters of LBM. The performance degrades slightly near the edge of the input space when density becomes very low. Computationally, calls to the trained upscaler are essentially free, and the dominant cost is in producing MD training data. While the cost is not large, it is dominant in the workflow, and would increase when studying systems that require more complex potentials, such as biomolecules or metals with long-range forces. Minimizing the computational cost of acquring data is a good candidate for future work. It is a complex endeavor, involving both continuum and molecular simulation methods, model hyperparameters, and the choices of learning objective. One method to tackle challenge would be Active Learning ⁵⁴, in which an algorithm is applied to automatically select new training data in order to improve the performance of a model; in a computational context such an approach is quite attractive because data could be generated without human intervention, and could even lead to better models with fewer fine-scale computations.

We expect this method to be fully extensible to 2D or 3D geometries, but this would need to be verified. This would require methods such as Convolutional Neural Networks that work well for 2D and 3D systems. In addition, more data management will be required to assemble the many fine-scale and coarse-scale simulations as a training database for the ML approach and to apply the ML model within an LBM simulation. Exploring these tasks is left to future research.

To conclude, we have demonstrated that an ML upscaler can be used to account for discrete molecular effects in larger scale continuum models. This is accomplished by training ML emulators for the fine- and coarse-scale models, and training the ML upscaler to find a mapping between the input spaces of the fine- and coarse-scale models such that their output spaces agree. While we have demonstrated this concept on a simple nanoconfinement example, it is a general framework for scale-bridging that takes advantage of recent advances in ML.

Methods

Molecular dynamics simulations

We performed Molecular Dynamics (MD) simulations of methane in a channel pore under equilibrium conditions. We simulated different conditions to construct a robust training data set for our machine learning framework. Given that our main objective is to develop a scale-bridging machine-learning-based workflow, we assumed that pore walls are composed of frozen methane molecules for the sake of simplicity. The simulation domain comprises bulk methane molecules bounded in the z-direction by pore walls as shown in Fig. 1, left side. The x and y dimensions of the simulation box are 42.5 nm and 4.25 nm, similar to the previous work by Li. ⁵⁵. We generated 1000 simulations varying the pore width from 3 to 13 nm, the bulk density from 3 to $250$ kg/m³ and the temperature from 298 to 398 K; randomly choosing the values for pore width, bulk density, and temperature for each simulation.

We performed all the simulations using LAMMPS molecular dynamics package ⁵⁶. We use the “fix setforce” command in LAMMPS so that the wall particles do not move during the course of the simulation. Our simulation started with an energy minimization stage to avoid the overlap of molecules, then an equilibration for 0.5 ns and sampling phase for 0.5 ns, both under under NVT (canonical) ensemble. Periodic boundary conditions are employed in the x and y directions. We use a time step of 1 femto-second and a Nosé-Hoover thermostat⁵⁷, with a temperature damping parameter of 10 time units. During the sampling phase, we collected the density profile along the z-direction using a bin size of 0.1 nm. We used a pairwise-additive potential, TraPPE-UA⁵⁸, force field to describe the bulk density from interactions between the methane molecules. The TraPPE force field was developed to describe the bulk properties of the adsorbed gases and has a high degree of accuracy in the prediction of properties at different state points⁵⁹.

Lattice Boltzmann simulations

A brief introduction of the Lattice Boltzmann method

The lattice Boltzmann method^{60, 61} (LBM) is among the most popular direct numerical simulation methods to study complex flow in porous media, thanks to its ability to efficiently implement boundary conditions for complex geometries and to account for interfacial dynamics between different fluids. Furthermore, the LBM is well suited for modern manycore processors/co-processors, such as GPUs (Graphics Processing Units), which greatly boost the computing power but also require a higher degree of explicit parallelism. A highly-optimized LBM code is critical to efficiently provide large number of LBM simulation data to ML training process and also to simulate real rock sample with representative elementary volume (REV). In this work, we implemented the LBM adsorption model on a existing in-house developed high-performance LBM code, ‘MF-LBM’, which is able to simulate complex flow in large 3D complex geometries using manycore processors/co-processors^{62, 63}.

The primitive variables in the LBM are the particle distribution functions (PDFs), $f_{i}$, where i represents the ith lattice direction e. The popular D3Q19 lattice model⁶⁴ is employed in this work. The evolution equations of the PDFs are as follow,

$$f_{i} (\user2{x} + \user2{e}_{i} \delta t,t + \delta t) = f_{i} (\user2{x},t) - \frac{{f_{i} (\user2{x},t) - f_{i}^{{eq}} (\user2{x},t)}}{\tau } + F_{i} \delta t,$$

(1)

where the LBM single-relaxation-time (SRT) collision model^{64, 65} is employed. Here $f_{i}^{eq}$ are the equilibrium PDFs, $F_i$ represents a general forcing term and the relaxation rate $\tau$ is related to fluid viscosity. For more detailed description of the general LBM, readers may refer to Chen and Doolean⁶⁰.

The Shan–Chen type Lattice Boltzmann model

To model methane adsorption in nanopores in LBM simulations, the Shan-Chen single-component-multiphase LBM⁶⁶ is utilized, which generates non-local interactions between fluid-fluid particles and fluid-solid particles. The interactive force between fluid particles can be written as

$$\user2{F}(\user2{x},t) = - g\psi (\user2{x},t)\sum\limits_{{i = 1}}^{{18}} {\Omega _{i} } \psi (\user2{x} + \user2{e}_{i} \delta t,t)\user2{e}_{i},$$

(2)

where g is a parameter that controls the interaction strength, $\psi$ is a function of the local fluid density ρ(x) and $\Omega _i$ is the weight factor of e_i direction in the D3Q19 lattice. The fluid-solid interactive force can be formed in the same way as shown in Eq. (2), by using the fictitious density method. We assign fictitious values of fluid density on the solid nodes, meaning that the values of $\psi$ on the solid nodes are known. Equation (2) is then applied to all the fluid nodes, including the fluid boundary nodes. Thus, F(x,t) on the fluid boundary nodes can be obtained and show repulsive or attractive depending on the fictitious density assigned on the neighboring solid nodes. This fictitious density method was originally designed to control wettability⁶⁷ on the solid surface, but can also be used to control the adsorption in the present work.

The interactive force obtained in Eq. (2) is incorporated into Eq. (1) using Guo’s forcing scheme⁶⁸, which reduces the discrete lattice effects⁶⁹. The form of $\psi$ in Eq. (2) determines the equation of state⁷⁰. The Peng-Robinson (P-R) equation of state (EOS) has been introduced into the LBM via the following form of $\psi$⁷⁰,

$$\begin{aligned} \psi =\sqrt{\frac{2(\frac{\rho RT}{1-b\rho } - \frac{a\alpha (T)\rho ^2}{1+2b\rho -b^2\rho ^2}-c^2_s\rho )}{c^2_s g}}, \end{aligned}$$

(3)

where $a=0.45724R^2T^2_c /p_c$; $b=0.00778RT_c /p_c$; R is the universal gas constant; $c_s$ is the speed of sound, $T_c$ and $p_c$ are the critical temperature and critical pressure of the gas, respectively; $\alpha (T)=[ 1 + (0.37464+1.54226\omega -0.26992\omega ^2) \times (1-\sqrt{T/T_c})) ]^2$; $\omega =0.011$ is the acentric factor for methane; T is the temperature. Following the work of Yuan and Schaefer⁷⁰, we set $a=2/49$, $b=2/21$ and $R=1$ in the simulations. The Shan-Chen multiphase model incorporated with P-R EOS shows significant improvement over the original model in terms of spurious currents, temperature ranges, and density ratio⁷⁰.

From Eq. (3) we can see that g, the parameter to control the interaction strength in the original Shan-Chen model, is canceled out. Therefore, the interaction strength is now controlled by T and $\rho$.

Modeling adsorption in Lattice Boltzmann simulations

As shown in Eq. (2), there are net forces on the fluid boundary nodes if the term $\sum\nolimits_{{i = 1}}^{{18}} {\Omega _{i} \psi (\user2{x} + \user2{e}_{i} \delta t,t){\user2{e}}_{i} }$, which approximates the gradient of $\psi$, does not equal 0 on fluid boundary nodes. Therefore, by tuning the fictitious density on the solid boundary nodes, one can alter the gradient of $\psi$ on the fluid boundary nodes and therefore alter the net interactive forces on the fluid boundary nodes. We define an adsorption parameter,

$$\begin{aligned} a'=\frac{\rho _s}{\rho _{total}}, \end{aligned}$$

(4)

where $\rho _s$ is the fictitious density on the solid nodes and $\rho _{total}$ is the total density of the fluid. In such case, there are net interactive forces towards the solid nodes on the fluid boundary nodes when $a'>1$, which attract fluid particles to the solid surfaces. This models adsoprtion in the LBM simulation.

Matching the density profile in the adsorption layer

The gas density profiles near the solid surface are affected by the adsorption coefficient, the interaction strength in Eq. (2), the choice of EOS, and the forcing scheme that incorporates the interactive forces into the LBM equations. Once the EOS and forcing scheme are chosen, the density profile is determined by the adsorption coefficient $a'$ and the interaction strength which is controlled by T and $\rho$ when using the P-R EOS. However, to the best of our knowledge, there are no physics-based LBM that are able to describe the complex fluid-solid interactions inside the adsorption layer.

As described in section “Molecular dynamics simulations”, MD simulations can provide the gas density profile inside the adsorption layer. Therefore, one can tune the parameters, $a'$, T and $\rho$, in the LBM simulation to form desired density profile inside the adsorption layer that matches best with the MD result. For convenience, we replace the tuning parameters T and $\rho$ with $T_r$ and ${\rho _r}_{total}$ respectively, where $T_r=T/T_c$ is the reduced temperature and ${\rho _r}_{total}={\rho }_{total}/\rho _c$ is the reduced total density. Here $T_r$ and ${\rho _r}_{total}$ are both numerical tuning parameters which are decoupled from the physical temperature and density, and the EOS incorporated in the LBM is only used to provide fluid-solid interactions near solid surface. This is justified by the following two facts:

(1)
The temperature variation in the simulation length scale of this work (nanoscale) is negligible. Thus, the flow can be considered as isothermal flow. Tuning $T_r$ in the simulation mostly affects the density profiles inside the adsorption layer.
(2)
The inertial effects in the simulation length scale of this work (nanoscale) is negligible. Thus, one can re-scale the mass unit conversion between the lattice space and physical space.

The above method is in contrast to a previous work³⁵, where the physical $T_r$ and ${\rho _r}_{total}$ are used in the LBM simulation. We found that density profiles inside the adsorption layer of the LBM simulations will be also affected by grid resolution and the forcing scheme used in the Shan-Chen model. Only tuning the adsorption parameter $a'$ is not sufficient to obtain good match for a variety of channel widths and physical conditions. Thus, we decoupled $T_r$ and ${\rho _r}_{total}$ in the simulations from the physical ones and used them as tuning parameters. With fast and accurate LBM emulator trained from MD data, as shown in Fig. 4, tuning $a'$, $T_r$, and ${\rho _r}_{total}$ in the LBM simulation simultaneously for different channel widths under certain constraints to match the MD density profiles becomes possible.

Machine learning workflow

Our machine learning workflow proceeds in three steps: (1) Train an MD-emulator model that maps pore properties to reproduce profiles from MD. (2) Train an LBM-emulator to reproduce profiles from LBM, using an extended parameter set to cover possible adsorption conditions, and (3) Train an upscaler model which maps the MD inputs to effective values of LBM inputs which produce the same profile as estimated from the MD-emulator. We implemented the following methods using PyTorch ⁴³.

The notion of two profiles being the same is evaluated using a cost function that compares the mean-squared differences between profiles, scaled by the overall density. We denote distances between an emulated profile $\rho _A(x)$ and a target density $\rho _B(x)$ using a cost function $\mathcal {L}_{\text {emulator}}$:

$$\begin{aligned} \mathcal {L}_{\text {emulator}}(\rho _A(x),\rho _B(x)) = \frac{1}{n_i}\sum _i \left( \rho _A(x_i) - \rho _B(x_i)\right) ^2, \end{aligned}$$

(5)

where $n_i$ is the number of samples in the profile, and $x_i$ are the sample points.

Emulators

The MD-emulator is based on a fully-connected network conditioned on the position within the profile, that is, the predicted profile $\hat{\rho }(x)$ is computed using inputs x, w, $\rho _0$, and T as a function $\hat{\rho }(x) = \rho _0 f(x,w,\rho _0,T)$, where f is a neural network. The network output across the profile is normalized to produce a profile density; the output density $\hat{\rho }(x)$ is constrained to obey $\sum _i dx \hat{\rho }(x_i) = \rho _0$. The input positions x are transformed via $x \rightarrow \tilde{x} = (2x/w - 1)^2$; this ensures that $\tilde{x}$ is of order 1 (normalizing the feature), as well as ensuring that emulated profiles are symmetric. Put another way, the profile as modeled is a partially applied function $\hat{\rho }(x)|_{w,\rho _0,T}$ that is mathematically equivalent to a Convolutional Neural Network using $1\times 1$ kernels with a single system-wide layer that normalizes the profile to ensure mass conservation; in that sense, our model utilizes parameter-sharing across the pore geometry in the same way that a CNN does.

Our choice to rely on point-wise networks rather than Convolutional Neural Networks is due to the simplifactions that arise for any 1D constant pore geometry (such as the slab analyzed here, or a cylindrical pore geometry): the profile can be treated as a function of 4 variables, rather than a function of $3+w$ variables associated with 3 thermodynamic variables and w profile bins. In a CNN-based formulation, the raw data associated with a 1D representation of the pore geometry would be nearly trivial, as it is constant across the pore and constant in the walls. This plays against the advantages of CNNs, which are designed to process local correlations, and would need to either (1) generate the spatial characteristics of the profile using a very large total receptive field size, or (2) explicitly take as input long-range spatial information such as $\tilde{x}$. However, to treat 2D or 3D geometries with more complex spatial structure of the pore walls, a CNN could be employed to represent the geometry of the pore, which is advantageous over parameterizing 2D or 3D functions using purely local variables; said parameterization could easily be accomplished for specific geometries (e.g. ellipses or rectangular cross-sections), but generic irregular shapes would not admit a low-dimensional parameterization.

Futhermore, implementing our model as a point-wise full-connected model is also computationally advantageous over a CNN formulation because it avoids some of the difficulties associated with treating variable-sized inputs. Treating variable-sized inputs requires either (1) padding all data to the size of the largest sample in the batch, resulted in wasted computation associated with the profile inside a wall, which vanishes, or (2), calling the network only on batch sizes of 1, which limits the computational throughput of training by restricting parallelism. In the case of a 2D or 3D geometry, these concerns would be mitigated as a result of the larger number of per-example input and output bins.

Networks consist of $n_{layers}=3$ (i.e. two hidden layers and one output layer) of $n_{neurons}=30$ each using the softplus activation function $\text {softplus}(x) = \log (1+e^x)$. The use of this smooth activation function ensures that emulated profiles are themselves smooth. Each network contains approximately 1,100 parameters in total. Networks are trained over $n_{epochs}=1{,}000$ using the Adam optimizer⁷¹. $10\%$ of the data is held out for testing, and another $10\%$ is held out for early stopping and validation. The emulator networks are trained in batches of size $n_{batch}=20$ under the cost function $\mathcal {L}_{\text {emulator}}(\hat{\rho }_{\text {MD}}(x),\rho _{\text {MD}}(x))$.

The LBM dataset of 8,074 calculations was obtained by filtering data obtained from a set of 10,000 calculations. 8,245 of the calculations converged within the number of steps prescribed. The data was filtered to remove simulations for which a unique profile is ill-defined due to the appearance of complete condensation, usually at low temperatures and/or large adsorption coefficients. The filtering process removed simulations for which the observed profile was asymmetric as well as simulations for which the bulk density was greater than the density near the wall. The LBM-emulator is trained in the same fashion, except that it has inputs $w,\rho _0',T',a'$. The densities $\rho '$ and temperatures $T'$ are expressed in lattice units. The adsorption $a'$ is dimensionless.

Upscaler

The goal of the upscaling network is to match emulated MD profiles to emulated LBM profiles by learning a mapping between the MD input space and the LBM input space, producing LBM predictions $\hat{\rho _0}', \hat{T'}, \hat{a'}$ from MD variables $w,\rho _0,T$. To constrain the upscaler as a density-conserving predictor, the upscaled density $\hat{\rho }'$ is learned using a linear factor $\hat{\rho }_0' = \beta _\rho \rho _0$ for a single scalar parameter $\beta _\rho$; the upscaler fixes the density scale matching between MD and LBM; this can likewise be rephrased as a global modification to the LBM critical pressure. The other parameters, $\hat{T}'$ and $\hat{a}'$, are generated by a fully connected multitask neural network. The archtecture is same as the emulators, except that the predictions are made using a linear layer (no activation function), the position is not an input to the upscaler, and there is no profile-wise normalization layer. Like the emulators, the upscaler contains approximately 1100 parameters.

Initial explorations used a similar cost function to the emulators, attempting to match the full density profile. However, it was soon discovered that this task is not possible within the framework of LBM examined; it cannot reproduce, for example, the multi-layer structure observed in the MD profiles. As such, the cost function focuses on the primary observable effect of adsorption: The excess density $\rho _{\text {excess}}$, with a small regularization term encouraging the upscaler to keep the MD and upscaled profiles close:

$$\begin{aligned} \mathcal {L}_{\text {upscaler}} = \left( (\rho _{\text {excess,upscaled}}-\rho _{\text {excess,MD}} )^2+ 0.01 * \mathcal {L}_{\text {emulator}}(\hat{\rho }_{\text {upscaled}},\hat{\rho }_{\text {MD}}) \right) / \rho _{0}. \end{aligned}$$

(6)

One advantage of this approach is that the set of input parameters for MD needs not be explicitly matched to LBM inputs; the upscaler itself solves this problem implicitly, as both emulators are defined over the space of inputs in the data collected. As such, we train the upscaler over a large dataset of emulated MD profiles that cover the space of both the MD and LBM simulations. This consists of 10,000 calculations with widths between 2.5 and 12.5 nm, densitites between 2.5 kg/m³ and 250 kg/m³, and temperatures between 300 and 400 K. The upscaler is trained for 100 epochs.

References

Rouet-Leduc, B. et al. Spatial adaptive sampling in multiscale simulation. Comput. Phys. Commun. 185(7), 1857–1864 (2014).
Article ADS MathSciNet MATH CAS Google Scholar
Roehm, D. et al. Distributed database kriging for adaptive sampling (D$^2$KAS). Comput. Phys. Commun. 192, 138–147 (2015).
Article ADS CAS Google Scholar
Leiter, K. W., Barnes, B. C., Becker, R. & Knap, J. Accelerated scale-bridging through adaptive surrogate model evaluation. J. Comput. Sci. 27, 91–106 (2018).
Article Google Scholar
Wehner, J. et al. Electronic excitations in complex molecular environments: Many-body greens functions theory in votca-xtp. J. Chem. Theory Comput. 14(12), 6253–6268 (2018).
Article PubMed PubMed Central CAS Google Scholar
De Fabritiis, G., Delgado-Buscalioni, R. & Coveney, P. V. Multiscale modeling of liquids with molecular specificity. Phys. Rev. Lett. 97, 134501 (2006).
Article ADS PubMed CAS Google Scholar
Murashima, T., Yasuda, S., Taniguchi, T. & Yamamoto, R. Multiscale modeling for polymeric flow: Particle-fluid bridging scale methods. J. Phys. Soc. Jpn. 82(1), 012001 (2013).
Article ADS CAS Google Scholar
Yasuda, S. & Yamamoto, R. Synchronized molecular-dynamics simulation via macroscopic heat and momentum transfer: An application to polymer lubrication. Phys. Rev. X 4, 041011 (2014).
CAS Google Scholar
Sato, T. & Taniguchi, T. Multiscale simulations for entangled polymer melt spinning process. J. Nonnewton. Fluid Mech. 241, 34–42 (2017).
Article MathSciNet CAS Google Scholar
Krekeler, C., Agarwal, A., Junghans, C., Praprotnik, M. & Delle Site, L. Adaptive resolution molecular dynamics technique: down to the essential. J. Chem. Phys. 149(2), 024104 (2018).
Article ADS PubMed CAS Google Scholar
Karra, S., Omalley, D., Hyman, J., Viswanathan, H. S. & Srinivasan, G. Modeling flow and transport in fracture networks using graphs. Phys. Rev. E 97(3), 33304 (2018).
Article ADS CAS Google Scholar
Viswanathan, H. S. et al. Advancing graph-based algorithms for predicting flow and transport in fractured rock. Water Resour. Res. 54(9), 6085–6099 (2018).
Article ADS Google Scholar
Srinivasan, G. et al. Quantifying topological uncertainty in fractured systems using graph theory and machine learning. Sci. Rep. 8(1), 11665 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Liu, X. & Zhang, D. A review of phase behavior simulation of hydrocarbons in confined space: implications for shale oil and shale gas. J. Nat. Gas Sci. Eng. 68, 102901 (2019).
Article Google Scholar
Valera, M. et al. Machine learning for graph-based representations of three-dimensional discrete fracture networks. Comput. Geosci. 1, 1–16 (2018).
Article MathSciNet MATH Google Scholar
Y. Gong, M. Mehana, I. El-Monier, F. Xu, F. Xiong, Machine learning for estimating rock mechanical properties beyond traditional considerations. in Unconventional Resources Technology Conference, Denver, Colorado, 22–24 July 2019, pp. 466–480, 2019.
Smith, J. S., Isayev, O. & Roitberg, A. E. Ani-1: an extensible neural network potential with dft accuracy at force field computational cost. Chem. Sci. 8(4), 3192–3203 (2017).
Article PubMed PubMed Central CAS Google Scholar
Schwarzer, M. et al. Learning to fail: Predicting fracture evolution in brittle material models using recurrent graph convolutional neural networks. Comput. Mater. Sci. 162, 322–332 (2019).
Article Google Scholar
Wang, J.-X., Wu, J.-L. & Xiao, H. Physics-informed machine learning approach for reconstructing reynolds stress modeling discrepancies based on dns data. Phys. Rev. Fluids 2(3), 034603 (2017).
Article ADS Google Scholar
Botu, V. & Ramprasad, R. Adaptive machine learning framework to accelerate ab initio molecular dynamics. Int. J. Quantum Chem. 115(16), 1074–1083 (2015).
Article CAS Google Scholar
Fulford, D. S. et al. Machine learning as a reliable technology for evaluating time/rate performance of unconventional wells. SPE Econ. Manag. 8(01), 23–39 (2016).
Article Google Scholar
Li, H., He, J., & Misra, S. Data-driven in-situ geomechanical characterization in shale reservoirs. in SPE Annual Technical Conference and Exhibition, 24–26 September, Dallas, Texas, USA, no. SPE-191400-MS, Society of Petroleum Engineers, (2018).
Kamrava, S., Tahmasebi, P. & Sahimi, M. Enhancing images of shale formations by a hybrid stochastic and deep learning algorithm. Neural Netw. 118, 310–320 (2019).
Article PubMed Google Scholar
Kamrava, S., Tahmasebi, P. & Sahimi, M. Linking morphology of porous media to their macroscopic permeability by deep learning. Transp. Porous Media 131(2), 427–448 (2020).
Article MathSciNet Google Scholar
Santos, J. E. et al. Poreflow-net: A 3d convolutional neural network to predict fluid flow through porous media. Adv. Water Resour. 138, 103539 (2020).
Article Google Scholar
Curtis, M. E., Ambrose, R. J. & Sondergeld, C. H. Structural characterization of gas shales on the micro-and nano-scales. in Canadian Unconventional Resources and International Petroleum Conference, 19–21 October 2010. Society of Petroleum Engineers, Alberta, Canada. https://doi.org/10.2118/137693-MS (2010).
Hyman, J. et al. Understanding hydraulic fracturing: a multi-scale problem. Philos. Trans. R. Soc. A: Math. Phys. Eng. Sci. 374(2078), 20150426 (2016).
Article ADS CAS Google Scholar
Middleton, R. S. et al. Shale gas and non-aqueous fracturing fluids: opportunities and challenges for supercritical $\text{ CO}_2$. Appl. Energy 147, 500–509 (2015).
Article CAS Google Scholar
Sanhai, W. R., Sakamoto, J. H., Canady, R. & Ferrari, M. Seven challenges for nanomedicine. Nat. Nanotechnol. 3(5), 242 (2008).
Article ADS PubMed CAS Google Scholar
Siria, A. et al. Giant osmotic energy conversion measured in a single transmembrane boron nitride nanotube. Nature 494(7438), 455 (2013).
Article ADS PubMed CAS Google Scholar
Shannon, M.A., Bohn, P.W., Elimelech, M., Georgiadis, J.G., Marinas, B.J., & Mayes, A.M., Science and technology for water purification in the coming decades, in Nanoscience and technology: a collection of reviews from nature Journals, pp. 337–346, World Scientific (2010).
Warner, N. R. et al. Geochemical evidence for possible natural migration of marcellus formation brine to shallow aquifers in pennsylvania. Proc. Nat. Acad. Sci. 109(30), 11961–11966 (2012).
Article ADS PubMed PubMed Central Google Scholar
Mirsaidov, U. M., Zheng, H., Bhattacharya, D., Casana, Y. & Matsudaira, P. Direct observation of stick-slip movements of water nanodroplets induced by an electron beam. Proc. Nat. Acad. Sci. 109(19), 7187–7190 (2012).
Article ADS PubMed PubMed Central Google Scholar
Majumder, M., Chopra, N., Andrews, R. & Hinds, B. J. Nanoscale hydrodynamics: enhanced flow in carbon nanotubes. Nature 438(7064), 44 (2005).
Article ADS PubMed CAS Google Scholar
Holt, J. K. et al. Fast mass transport through sub-2-nanometer carbon nanotubes. Science 312(5776), 1034–1037 (2006).
Article ADS PubMed CAS Google Scholar
Zhao, J., Yao, J., Zhang, L., Sui, H. & Zhang, M. Pore-scale simulation of shale gas production considering the adsorption effect. Int. J. Heat Mass Transf. 103, 1098–1107 (2016).
Article CAS Google Scholar
Gelb, L. D., Gubbins, K., Radhakrishnan, R. & Sliwinska-Bartkowiak, M. Phase separation in confined systems. Rep. Prog. Phys. 62(12), 1573 (1999).
Article ADS CAS Google Scholar
Sapmanee, K. Effects of pore proximity on behavior and production prediction of gas/condensate. PhD thesis, University of Oklahoma (2011).
Qiao, Y., Liu, L. & Chen, X. Pressurized liquid in nanopores: a modified Laplace-Young equation. Nano Lett. 9(3), 984–988 (2009).
Article ADS PubMed CAS Google Scholar
Van Honschoten, J. W., Brunets, N. & Tas, N. R. Capillarity at the nanoscale. Chem. Soc. Rev. 39(3), 1096–1114 (2010).
Article PubMed CAS Google Scholar
Charoensuppanimit, P., Mohammad, S. A. & Gasem, K. A. Measurements and modeling of gas adsorption on shales. Energy Fuels 30(3), 2309–2319 (2016).
Article CAS Google Scholar
Wang, S., Feng, Q., Javadpour, F., Xia, T. & Li, Z. Oil adsorption in shale nanopores and its effect on recoverable oil-in-place. Int. J. Coal Geol. 147, 9–24 (2015).
Article CAS Google Scholar
Baydin, A. G., Pearlmutter, B. A., Radul, A. A. & Siskind, J. M. Automatic differentiation in machine learning: A survey. J. Mach. Learn. Res. 18, 5595–5637 (2017).
MathSciNet MATH Google Scholar
Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Kopf, A., Yang, E., DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S., Steiner, B., Fang, L., Bai, J., & Chintala, S. “Pytorch: An imperative style, high-performance deep learning library,” in Advances in Neural Information Processing Systems 32 (H. Wallach, H. Larochelle, A. Beygelzimer, F. d‘ Alché-Buc, E. Fox, and R. Garnett, eds.), pp. 8024–8035, Curran Associates, Inc., Ny (2019).
Ambrose, R. J., Hartman, R. C., Campos, M. Diaz, Akkutlu, I. Y., & Sondergeld, C. et al. New pore-scale considerations for shale gas in place calculations, in SPE unconventional gas conference, Society of Petroleum Engineers. https://doi.org/10.2118/131772-MS (2010).
Hill, D. G. & Nelson, C. Gas productive fractured shales: an overview and update. Gas Tips 6(3), 4–13 (2000).
Google Scholar
Luo, S., Lutkenhaus, J. L. & Nasrabadi, H. Confinement-induced supercriticality and phase equilibria of hydrocarbons in nanopores. Langmuir 32(44), 11506–11513 (2016).
Article PubMed CAS Google Scholar
Lujan, J., Vigil, M., Kenyon, G., Sanbonmatsu, K., & Albright, B. Trinity supercomputer now fully operational, United States. https://www.osti.gov/servlets/purl/1487137 (2017).
Wells, J., Bland, B., Nichols, J., Hack, J., Foertter, F., Hagen, G., Maier, T., Ashfaq, M., Messer, B., & Parete-Koon, S. Announcing supercomputer summit, United States. https://www.osti.gov/servlets/purl/1259664 (2016).
Perilla, J. R. & Schulten, K. Physical properties of the HIV-1 capsid from all-atom molecular dynamics simulations. Nat. Commun. 8(1), 15959 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Acun, B. et al. Scalable molecular dynamics with namd on the summit system. IBM J. Res. Dev. 62(6), 4:1-4:9 (2018).
Article Google Scholar
Smith, J. S. et al. Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning. Nat. Commun. 10(1), 2903 (2019).
Article PubMed PubMed Central CAS Google Scholar
Durrant, J. D. et al. Mesoscale all-atom influenza virus simulations suggest new substrate binding mechanism. ACS Central Sci. 6(2), 189–196 (2020).
Article CAS Google Scholar
Owens, J. D. et al. GPU computing. Proc. IEEE 96(5), 879–899 (2008).
Article Google Scholar
Settles, B. Active learning literature survey. University of Wisconsin-Madison, Department of Computer Sciences, Technical Report 1648. http://digital.library.wisc.edu/1793/60660 (2009).
Li, Z.-Z., Min, T., Kang, Q., He, Y.-L. & Tao, W.-Q. Investigation of methane adsorption and its effect on gas transport in shale matrix through microscale and mesoscale simulations. Int. J. Heat Mass Transf. 98, 675–686 (2016).
Article Google Scholar
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117(1), 1–19 (1995).
Article ADS MATH CAS Google Scholar
Martyna, G. J., Klein, M. L. & Tuckerman, M. Nosé-hoover chains: The canonical ensemble via continuous dynamics. J. Chem. Phys. 97(4), 2635–2643 (1992).
Article ADS Google Scholar
Martin, M. G. & Siepmann, J. I. Transferable Potentials for Phase Equilibria. 1. United-atom Description of N-alkanes. J. Phys. Chem. B 102(14), 2569–2577 (1998).
Article CAS Google Scholar
Eggimann, B. L., Sunnarborg, A. J., Stern, H. D., Bliss, A. P. & Siepmann, J. I. An online parameter and property database for the TraPPE force field. Mol. Simul. 40(1–3), 101–105 (2014).
Article CAS Google Scholar
Chen, S. & Doolen, G. D. Lattice Boltzmann method for fluid flows. Annu. Rev. Fluid Mech. 30(1), 329–364 (1998).
Article ADS MathSciNet MATH Google Scholar
Aidun, C. K. & Clausen, J. R. Lattice-Boltzmann method for complex flows. Annu. Rev. Fluid Mech. 42, 439–472 (2010).
Article ADS MathSciNet MATH Google Scholar
Chen, Y., Li, Y., Valocchi, A. J. & Christensen, K. T. Lattice Boltzmann simulations of liquid CO₂ displacing water in a 2D heterogeneous micromodel at reservoir pressure conditions. J. Contam. Hydrol. 212, 14–27 (2018).
Article ADS PubMed CAS Google Scholar
Zhao, B., MacMinn, C. W., Primkulov, B. K., Chen, Y., Valocchi, A. J., Zhao, J., Kang, Q., Bruning, K., McClure, J. E., Miller, C. T. et al., “Comprehensive comparison of pore-scale models for multiphase flow in porous media,” Proceedings of the National Academy of Sciences, p. 201901619, (2019).
Qian, Y., Dhumières, D. & Lallemand, P. Lattice BGK models for Navier–Stokes equation. Europhys. Lett. 17(6), 479 (1992).
Article ADS MATH Google Scholar
Chen, H., Chen, S. & Matthaeus, W. H. Recovery of the Navier-Stokes equations using a lattice-gas Boltzmann method. Phys. Rev. A 45(8), R5339 (1992).
Article ADS PubMed CAS Google Scholar
Shan, X. & Chen, H. Lattice Boltzmann model for simulating flows with multiple phases and components. Phys. Rev. E 47(3), 1815 (1993).
Article ADS MathSciNet CAS Google Scholar
Martys, N. S. & Chen, H. Simulation of multicomponent fluids in complex three-dimensional geometries by the lattice Boltzmann method. Phys. Rev. E 53(1), 743 (1996).
Article ADS CAS Google Scholar
Guo, Z., Zheng, C. & Shi, B. Discrete lattice effects on the forcing term in the lattice Boltzmann method. Phys. Rev. E 65(4), 046308 (2002).
Article ADS MATH CAS Google Scholar
Guo, Z. & Shu, C. Lattice Boltzmann method and its applications in engineering Vol. 3 (World Scientific, Singapore, 2013).
Book MATH Google Scholar
Yuan, P. & Schaefer, L. Equations of state in a lattice Boltzmann model. Phys. Fluids 18(4), 042101 (2006).
Article ADS MathSciNet MATH CAS Google Scholar
Kingma, D. P., Ba, J. Adam: A method for stochastic optimization, ArXiv preprint. arXiv:1412.6980 (2014).

Download references

Acknowledgements

This work was supported by the US Department of Energy through the Los Alamos National Laboratory (LANL). LANL is operated by Triad National Security, LLC, for the National Nuclear Security Administration of U.S. Department of Energy (Contract No. 89233218CNA000001). This work was supported by the Laboratory Directed Research and Development program at LANL under project number 20190005DR. We also thank the LANL Institutional Computing Program and the CCS-7 Darwin cluster for computational resources. We thank David Rosenberger, Kipton Barros, and Ying Wai Li for useful discussions, and Vinay Ramakrishnaiah for feedback on the manuscript.

Author information

Authors and Affiliations

Information Sciences Group, Computer, Computational and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
Nicholas Lubbers
Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
Animesh Agarwal
Computational Earth Science Group, Earth and Environmental Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
Yu Chen, Mohamed Mehana, Qinjun Kang, Satish Karra & Hari S. Viswanathan
Institut des Sciences de la Terre, Université Grenoble Alpes, Grenoble, France
Soyoun Son
Geophysics Group, Earth and Environmental Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
Soyoun Son
Applied Computer Science Group, Computer, Computational and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
Christoph Junghans
Physics and Chemistry of Materials Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
Timothy C. Germann

Authors

Nicholas Lubbers
View author publications
You can also search for this author in PubMed Google Scholar
Animesh Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Yu Chen
View author publications
You can also search for this author in PubMed Google Scholar
Soyoun Son
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed Mehana
View author publications
You can also search for this author in PubMed Google Scholar
Qinjun Kang
View author publications
You can also search for this author in PubMed Google Scholar
Satish Karra
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Junghans
View author publications
You can also search for this author in PubMed Google Scholar
Timothy C. Germann
View author publications
You can also search for this author in PubMed Google Scholar
Hari S. Viswanathan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Formulation and oversight of the work were performed by N.L., Q.K., S.K., C.J., T.G., and H.V. LBM code and simulations were performed by Y.C., S.S., S.K., and Q.K. MD simulations were constructed and performed by A.A., S.K., and C.J. Machine Learning was performed by N.L. and A.A. Figures were constructed by N.L., M.M., and S.K. with feedback from the other authors. Writing of the manuscript was performed by N.L., Q.K., M.M., S.K., and H.V. All authors contributed to the editing of the manuscript.

Corresponding authors

Correspondence to Nicholas Lubbers or Hari S. Viswanathan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lubbers, N., Agarwal, A., Chen, Y. et al. Modeling and scale-bridging using machine learning: nanoconfinement effects in porous media. Sci Rep 10, 13312 (2020). https://doi.org/10.1038/s41598-020-69661-0

Download citation

Received: 20 December 2019
Accepted: 14 July 2020
Published: 07 August 2020
DOI: https://doi.org/10.1038/s41598-020-69661-0

This article is cited by

Multi-reward reinforcement learning based development of inter-atomic potential models for silica
- Aditya Koneru
- Henry Chan
- Subramanian K. R. S. Sankaranarayanan
npj Computational Materials (2023)
Predictive scale-bridging simulations through active learning
- Satish Karra
- Mohamed Mehana
- Hari S. Viswanathan
Scientific Reports (2023)
Insight of Discrete Scale and Multiscale Methods for Characterization of Composite and Nanocomposite Materials
- Taha Sheikh
- Kamran Behdinan
Archives of Computational Methods in Engineering (2023)
A Dataset of 3D Structural and Simulated Transport Properties of Complex Porous Media
- Javier E. Santos
- Bernard Chang
- Hari Viswanathan
Scientific Data (2022)
Advances in Computational Intelligence of Polymer Composite Materials: Machine Learning Assisted Modeling, Analysis and Design
- A. Sharma
- T. Mukhopadhyay
- V. Kushvaha
Archives of Computational Methods in Engineering (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.