Introduction

Relating fabric tensors to the stiffness tensor is a long-standing issue in geomechanics1. A fabric tensor is, broadly speaking, a convolution of moments of probability density functions of microstructure descriptors. It can be a scalar, a vector, a matrix, or a tensor of higher order. Perhaps the most widely used fabric tensor in rock mechanics is the crack density tensor, initially defined by Oda2, who established a linear correlation between the first invariant of that fabric tensor and uniaxial compression strength in rock. In granular media, the principle of virtual work was invoked to relate the branch density tensor to the expression of the macroscopic stress tensor3,4. Joint invariants, defined as invariants of fabric tensors that are highly correlated to the stress invariants, were used to replace the stress invariants in the Drucker-Prager yield function, under the assumption of axial symmetry5,6. In those studies, the stress tensor was highly correlated to the fabric tensor that represents the statistical distribution of particle orientations. Zysset and Curnier7 derived an analytical expression of the elasticity tensor as a function of a general fabric tensor that represents the orientation distribution of direction-dependent microstructure properties for isotropic, transversely isotropic and orthotropic materials. Later studies established correlations between the mechanical properties of salt rock and tensors that capture the magnitude and orientation of solidity, coordination, local solid volume fraction, and crack volume8.

X-ray Computed Tomography (XCT) scanning allows one to obtain stacks of 2D images that represent sections of a 3D material. XCT images are routinely used to reconstruct microstructures in 3D and to calculate statistical geometric features that can be used to define fabric tensors9,10. Kuo et al.11 calculated 2D fabric tensors from 2D binary images obtained in three orthogonal planes during XCT scanning and established a methodology to calculate 3D fabric tensors from those 2D fabric tensors. They assumed that the 3D fabric tensors were axially symmetric and that the principal values of the 3D fabric tensors were proportional to those of the 2D fabric tensors. To date, identifying 3D fabric descriptors from XCT images remains a challenge10,11.

Descriptors such as fabric tensors encapsulate a clear physical meaning, but they are chosen based on experience in a particular field of study12,13. Alternatively, the morphology and heterogeneity of microstructures can be quantified by means of correlation functions14,15. The N-point correlation function can accurately capture the information contained in a dual-phase microstructure. Deep learning has enabled major advances in pattern detection, recognition and selection, and now has many applications, for example in medical imaging. ResNet and the Visual Geometry Group (VGG) networks emerged as very powerful architectures16, and transfer learning has shown a clear improvement in convergence time and accuracy17,18. Although most problems treated with ResNet and the VGG pertain to image classification, such models are easily adapted to regression and yield good results19. Thanks to the advancement of deep learning, 2-point correlation functions and descriptors have been used as input for microstructure reconstruction and can serve as a proxy to measure the quality of a reconstruction in a generative model. To overcome the bias in the choice of descriptors, statistical methods were created to enable reconstruction from style data only17,20,21.

In the present study, we propose a deep learning approach to optimize the number of 2D slices in a 3D volume to achieve a targeted accuracy of 3D fabric tensor estimates in biphase media made of cemented aggregates. In “Data generation” section, we explain how we constructed virtual 3D microstructures, calculated associated 3D fabric descriptors that served as ground truth, and extracted 2D slice images that served as input data. In “Deep learning approach” section, we present two custom VGG models that take inputs of different formats, and we explain the protocol for training and testing. Our implementation is coded in Python and Matlab22. We used PyTorch as our main machine learning library, and PyTorch Lightning as a training framework. Our results are described and interpreted in “Results” section. The advantages, limitations and possible improvements to the models are discussed in “Discussion” section. Lessons learned and perspectives for future work are summarized in “Conclusions” section.

Data generation

Numerical construction of the virtual specimens

Synthetic three-dimensional biphase microstructures were constructed to represent coarse aggregates embedded in a homogeneous matrix. The process to construct the numerical specimens is illustrated in Fig. 1. Aggregates were scanned and the resulting point clouds were transformed into solid alpha-shapes with Matlab (step 1). In total, 87 alpha-shapes were obtained from point clouds and stored in a database. Loose assemblies of aggregates were created with a Random Sequential Adsorption (RSA) algorithm that sequentially and randomly picked alpha-shapes from the database and fitted them in a cubic space (step 2). All shapes were equally probable, and were scaled to follow the size distribution shown in Fig. 2a. The RSA algorithm takes the target volume fraction and a measure of exclusion distance as input. The RSA places objects randomly in a finite volume and rejects an object that is within the exclusion distance of another object previously fitted in that volume. The maximum volume fraction that can be reached iteratively with an RSA algorithm does not exceed 20%. Despite attempts to alter the exclusion distance criteria23,24 to improve the efficiency of the RSA algorithm, it remains difficult to generate high-density specimens. For that reason, we used the RSA algorithm to generate six cubes filled with loose aggregate assemblies, placed them along the sides of the target cubic domain, and used the Finite Element Method (FEM) to simulate the packing of the six loose assemblies into the target domain (step 3: dynamic gathering). These explicit dynamic simulations were conducted with Abaqus25. The loose assemblies were pushed by six rigid walls that were subjected to a controlled displacement (approximately 100 mm). The aggregates were modeled as rigid bodies with a mass density of \(\rho \) = 2800 kg/m\(^3\), and a non-penetration condition was used at the contact between the aggregates. After dynamic gathering, the aggregate volume fraction was between 0.6 and 0.8. Target volume fractions of \(V_f\in \{0.1,0.15,0.2,0.25,0.3,0.35,0.4\}\) were obtained by randomly removing aggregates from the aggregate assemblies obtained in step 3 (step 4). As a reference, the aggregate volume fraction in concrete used in construction is around 0.4. In total, 600 cubes filled with aggregate alpha-shapes were created, with the aggregate volume fractions shown in Fig. 2a. The 600 aggregate assemblies were imported into Abaqus, which was used to mesh the space between the aggregates, called the matrix (step 5). Abaqus then automatically meshed the aggregates to match the mesh of the matrix. Lastly, the aggregate meshes were saved as .stl files, which can easily be opened in Matlab to calculate statistical microstructure descriptors (step 6).
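To make the rejection logic of step 2 concrete, the sketch below is a simplified illustration (not the actual implementation): it approximates each aggregate by a bounding sphere whose radius is drawn from the size distribution, whereas the real RSA operates on scaled alpha-shapes with the same exclusion-distance test.

```python
import numpy as np

def rsa_pack(target_vf, exclusion_distance, radii, box_size=1.0, max_trials=100000):
    """Random Sequential Adsorption sketch: place spheres of given radii in a cubic box,
    rejecting any candidate that lies within `exclusion_distance` of a previously placed
    object. Aggregates are approximated by bounding spheres for illustration only."""
    placed = []               # list of (center, radius) for accepted objects
    volume_placed = 0.0
    box_volume = box_size ** 3
    rng = np.random.default_rng()
    for r in radii:
        for _ in range(max_trials):
            c = rng.uniform(r, box_size - r, size=3)          # candidate center
            accepted = all(np.linalg.norm(c - c0) >= r + r0 + exclusion_distance
                           for c0, r0 in placed)
            if accepted:
                placed.append((c, r))
                volume_placed += 4.0 / 3.0 * np.pi * r ** 3
                break
        if volume_placed / box_volume >= target_vf:
            break
    return placed, volume_placed / box_volume
```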

Figure 1

Method employed to generate three-dimensional biphase microstructures.

Figure 2

Controlled parameters of the numerically generated specimens.

Generation of 2D images

Regularly spaced two-dimensional images were extracted from the generated 3D microstructures in planes orthogonal to the x, y and z directions (Fig. 3). A 2D grid of squared elements was first created in the plane of interest (for example, at \(y=y_0\) for a plane orthogonal to the y direction). Each cell of the 2D grid was turned into a black pixel if the square was in an aggregate, and into a white pixel otherwise. The number of squares in the 2D grid thus equaled the number of pixels in the binary image. The position of the top-left node of a square was the determining criterion to decide whether the square was in the aggregate phase or the matrix phase, since a square could lie at the interface between both phases. The Matlab module inpolyhedron26 was used to check whether the nodes of the 2D grid were located inside an aggregate or not. This algorithm has a high computational complexity: \({\mathcal {O}}(p \times N\times w \times h)\) with p the total number of images per direction, N the number of aggregates inside the virtual specimen, w and h the width and height of the 2D grid.
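The slicing step can be summarized by the following sketch. It is a minimal illustration, assuming each aggregate is available as a watertight triangulated surface handled by the trimesh package, whose point-containment check plays the role of the Matlab inpolyhedron module; the loop over aggregates combined with the per-pixel test is what produces the stated complexity.

```python
import numpy as np
import trimesh  # assumed dependency; each aggregate is a watertight trimesh.Trimesh

def binarize_slice(aggregate_meshes, y0, w=64, h=64, box=1.0):
    """Build a w-by-h binary image of the plane y = y0: a pixel is black (0) when its
    top-left node lies inside an aggregate, and white (1) otherwise."""
    xs = np.linspace(0.0, box, w, endpoint=False)     # top-left node x-coordinates
    zs = np.linspace(0.0, box, h, endpoint=False)     # top-left node z-coordinates
    X, Z = np.meshgrid(xs, zs)
    nodes = np.column_stack([X.ravel(), np.full(X.size, y0), Z.ravel()])
    inside = np.zeros(nodes.shape[0], dtype=bool)
    for mesh in aggregate_meshes:                     # O(N) loop over aggregates
        inside |= mesh.contains(nodes)                # point-in-polyhedron test
    return np.where(inside, 0, 1).reshape(h, w).astype(np.uint8)
```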

Figure 3

From a 3D microstructure to 2D grids and from 2D grids to binary images.

Calculation of 3D fabric tensors (ground truth)

Here, the ground truth is a set of fabric tensors (scalars, vectors and second-order tensors) that describe the composition (e.g., aggregate volume fraction), dispersion (e.g., aggregate distance to nearest neighbor) and geometry (e.g., aggregate size, aspect ratios) of the features of the microstructure. By contrast with microstructure characterization approaches based on correlation functions or the Gaussian Random Field (GRF) method27,28, here, the fabric tensors are pre-defined and assigned a physical meaning, as explained below.

Principal component analysis (PCA)

By construction, each aggregate in the virtual specimen is a cloud of points (i.e., voxels). A principal component analysis (PCA) was performed on the vectors that connect each point of an aggregate to its barycenter. Each aggregate in the 3D microstructure is represented by a matrix \(\underline{\underline{\textbf{P}}}^{k}\in \mathbb {R}^{J_k\times D}\) that stores the positions of its points relative to the barycenter. Here, k is an index that refers to the aggregate number in the specimen (\(k\in \{1,2,3,\ldots ,N\}\)), \(J_k\) is the number of points detected in the k\(^{th}\) aggregate, and D is the space dimension (here, \(D=3\)). Noting \(\underline{\underline{\textbf{C}}}_k\in \mathbb {R}^{D\times D}\) the covariance matrix of \(\underline{\underline{\textbf{P}}}^{k}\), we obtain the eigenvalues \(\lambda ^{k}_{i}\in \mathbb {R}\) and eigenvectors \(\underline{\textbf{v}}^{k}_{i}\in \mathbb {R}^{D}\) of \(\underline{\underline{\textbf{C}}}_k\) by solving the following equation:

$$\begin{aligned} \underline{\underline{\textbf{C}}}_k\cdot \underline{\textbf{v}}^{k}_{i}=\lambda ^{k}_{i}\,\underline{\textbf{v}}^{k}_{i} \end{aligned}$$
(1)

with \(i\in \{1,\ldots ,D\}\). The eigenvector \(\underline{\textbf{v}}^{k}_{i}\) associated with the greatest/smallest/intermediate eigenvalue \(\lambda ^{k}_{i}\) defines the direction of the major/minor/intermediate axis of the k\(^{th}\) aggregate. The major/minor/intermediate semi-axis lengths are obtained by projecting the data points on the major/minor/intermediate axes, respectively, as follows:

$$\begin{aligned} \left\{ \begin{array}{c} a_k=\text {max}_j(\underline{\underline{\textbf{P}}}^{k}[j,:]\cdot \underline{\textbf{v}}^{k}_{1}) \\ b_k=\text {max}_j(\underline{\underline{\textbf{P}}}^{k}[j,:]\cdot \underline{\textbf{v}}^{k}_{2}) \\ c_k=\text {max}_j(\underline{\underline{\textbf{P}}}^{k}[j,:]\cdot \underline{\textbf{v}}^{k}_{3}) \\ \end{array} \right. \quad \forall \, k \in \{1,2,\ldots ,N\},\quad \forall \, j \in \{1,2,\ldots ,J_k\} \end{aligned}$$
(2)

where \(a_k\), \(b_k\) and \(c_k\) are respectively the semi-axis lengths of the major, intermediate and minor axes of the k\(^{th}\) aggregate, and where the eigenvalues are sorted in descending order: \(\lambda ^{k}_{1}\ge \lambda ^{k}_{2}\ge \lambda ^{k}_{3}\).
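A minimal sketch of this PCA step for a single aggregate, assuming the point cloud is available as a NumPy array, is given below; it returns the semi-axis lengths of Eq. (2) and the sorted principal directions.

```python
import numpy as np

def pca_semi_axes(points):
    """PCA of one aggregate: `points` is the (J_k x 3) array of voxel positions.
    Returns the semi-axis lengths (a_k >= b_k >= c_k) and the corresponding unit
    eigenvectors (major, intermediate, minor) of the covariance matrix."""
    P = points - points.mean(axis=0)           # positions relative to the barycenter
    C = np.cov(P, rowvar=False)                # 3x3 covariance matrix
    eigvals, eigvecs = np.linalg.eigh(C)       # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]          # sort descending: lambda_1 >= lambda_2 >= lambda_3
    eigvecs = eigvecs[:, order]
    # semi-axis length = maximum projection of the centered points on each axis (Eq. 2)
    semi_axes = [np.max(P @ eigvecs[:, i]) for i in range(3)]
    return semi_axes, eigvecs
```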

Definition of the fabric descriptors

Volume fraction

One scalar descriptor denoted \(v_f\) encodes the aggregate volume fraction (also called density in the following), calculated as the ratio of the total volume of the aggregates to the volume of the cubic specimen.

Size

The size of an aggregate is defined as twice the length of the major semi-axis found by PCA. For the k\(^{th}\) aggregate: \(G_k\,=\,2\,a_k\), where \(a_k\) is defined in Eq. (2). At the scale of the specimen, we define two descriptors: the mean and standard deviation of the distribution \((G_k)_{k=1...N}\).

Aspect Ratio

We define two aspect ratios per aggregate: \(b_k / a_k\) and \(c_k / a_k\), where \(a_k\), \(b_k\) and \(c_k\) are the lengths of the major, intermediate and minor semi-axes of aggregate k, found by PCA (see Eq. 2). At the scale of the 3D virtual specimen, we define four descriptors: the means and standard deviations of the distributions \((b_k/a_k)_{k=1...N}\) and \((c_k/a_k)_{k=1...N}\).

Roundness

The roundness descriptor encodes the elongation of the aggregates. The roundness \(R_k\) of the k\(^{th}\) aggregate is defined as the ratio of the aggregate volume to the volume of its circumscribed sphere, of diameter \(2\,a_k\) (see Eq. 2). At the scale of the specimen, we define two descriptors: the mean and standard deviation of the aggregate roundness \((R_k)_{k=1...N}\).

Solidity

The solidity \(S_k\) of the k\(^{th}\) aggregate is defined as the ratio between the volume of the k\(^{th}\) aggregate and the volume of its convex hull. At the scale of the specimen, we define two descriptors: the mean and standard deviation of the aggregate solidity \((S_k)_{k=1...N}\).
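The roundness and solidity of one aggregate can be computed as sketched below, assuming the aggregate volume is known (e.g., from the voxel count times the voxel volume) and using scipy's ConvexHull for the convex hull volume.

```python
import numpy as np
from scipy.spatial import ConvexHull

def roundness_and_solidity(points, aggregate_volume, a_k):
    """Roundness and solidity of one aggregate: `points` is the (J_k x 3) voxel cloud,
    `aggregate_volume` its volume, and `a_k` the major semi-axis length from PCA (Eq. 2)."""
    sphere_volume = 4.0 / 3.0 * np.pi * a_k ** 3     # circumscribed sphere of diameter 2 a_k
    roundness = aggregate_volume / sphere_volume
    solidity = aggregate_volume / ConvexHull(points).volume
    return roundness, solidity
```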

Orientation

The unit eigenvector associated with the largest eigenvalue of the k\(^{th}\) aggregate point cloud (calculated by PCA in “Principal component analysis (PCA)” section) is denoted \(\underline{\textbf{m}}_\textbf{k} = [m_{1, k},\ m_{2, k},\ m_{3, k}]\) in the global coordinate system (\(\mathbf {e_1}\), \(\mathbf {e_2}\), \(\mathbf {e_3}\)). The local orientation matrix of the k\(^{th}\) aggregate is defined as:

$$\begin{aligned} \begin{bmatrix} F_{k} \end{bmatrix} = \begin{bmatrix} m_{1, k}m_{1, k} & m_{1, k}m_{2, k} & m_{1, k}m_{3, k}\\ m_{2, k}m_{1, k} & m_{2, k}m_{2, k} & m_{2, k}m_{3, k}\\ m_{3, k}m_{1, k} & m_{3, k}m_{2, k} & m_{3, k}m_{3, k}\\ \end{bmatrix} \end{aligned}$$
(3)

The matrix \([F_{k}]\) is symmetric and can be encoded by six coefficients only: \(F_{k, 11}, F_{k, 22}, F_{k, 33}, F_{k, 23}, F_{k, 13}, F_{k, 12}\). At the scale of the specimen, we define 12 descriptors: the means and standard deviations of each of the coefficients \(F_{k, 11}, F_{k, 22}, F_{k, 33}, F_{k, 23}, F_{k, 13}, F_{k, 12}\) over the distribution of aggregates (\(1 \le k \le N\)). In order to encode frame-invariant information about the orientation of the aggregates, we also encode the second and third invariants (\(I_2\) and \(I_3\)) of the average orientation matrix [F], as follows:

$$\begin{aligned} \begin{bmatrix} F \end{bmatrix} = \frac{1}{N}\sum _{k=1}^N [F_k] \end{aligned}$$
(4)
$$\begin{aligned} \left\{ \begin{array}{l} I_2 = (F_{11}F_{22} - F_{12}F_{12}) + (F_{22}F_{33} - F_{23}F_{23}) + (F_{11}F_{33} - F_{13}F_{13}) \\ I_3 = F_{11}F_{22}F_{33} + 2F_{12}F_{23}F_{13} - F_{22}F_{13}F_{13} - F_{11}F_{23}F_{23} - F_{33}F_{12}F_{12} \\ \end{array} \right. \end{aligned}$$
(5)
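A minimal sketch of the computation of the per-aggregate orientation matrices, the specimen average \([F]\) and its invariants (Eqs. 3-5), assuming the major-axis unit eigenvectors are stored row-wise in a NumPy array, is given below.

```python
import numpy as np

def orientation_descriptors(major_axes):
    """`major_axes` is an (N x 3) array whose k-th row is the unit eigenvector m_k of the
    k-th aggregate. Returns the per-aggregate orientation matrices F_k = m_k (x) m_k,
    the specimen average [F] (Eq. 4) and its second and third invariants (Eq. 5)."""
    F_k = np.einsum('ki,kj->kij', major_axes, major_axes)    # outer products m_k (x) m_k
    F = F_k.mean(axis=0)                                      # average orientation matrix
    # for a symmetric matrix, Eq. (5) reduces to the sum of principal 2x2 minors (I2)
    # and to the determinant (I3)
    I2 = 0.5 * (np.trace(F) ** 2 - np.trace(F @ F))
    I3 = np.linalg.det(F)
    return F_k, F, I2, I3
```

The means and standard deviations of the six coefficients of \(F_k\) over the aggregates are then obtained directly from the first output.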

Distance to nearest neighbor

We compute the barycenter-to-barycenter distance between each aggregate k and its nearest neighbor, nd\(_k\). At the scale of the specimen, we define two descriptors: the mean and standard deviation of the distances to aggregate nearest neighbor (nd\(_k\))\(_{k=1...N}\).
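A minimal sketch of this computation, using scipy's cKDTree on the array of aggregate barycenters, is given below.

```python
import numpy as np
from scipy.spatial import cKDTree

def nearest_neighbor_distances(barycenters):
    """Barycenter-to-barycenter distance from each aggregate to its nearest neighbor.
    `barycenters` is an (N x 3) array; returns the N distances nd_k."""
    tree = cKDTree(barycenters)
    # query the two closest points: the first hit is the aggregate itself (distance 0)
    dists, _ = tree.query(barycenters, k=2)
    return dists[:, 1]
```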

Correlated descriptors

Figure 4 shows the correlations between the 27 descriptors defined in “Principal component analysis (PCA)” section. The geometric aggregate descriptors (mainly size, aspect ratio, roundness and solidity) exhibit a high degree of correlation (close to 1). As expected, the aggregate volume fraction is negatively correlated to the average distance between an aggregate and its nearest neighbor. Of note, the non-diagonal coefficients of the average aggregate orientation tensors (\(F_{12}\), \(F_{23}\) and \(F_{31}\)) are not correlated to any other descriptor, likely because the non-diagonal terms of the aggregate orientation tensors are close to zero. The values of \(F_{12}\), \(F_{23}\) and \(F_{31}\) exhibit a low magnitude and a low variance, which suggests that the non-diagonal terms of the orientation tensors will be difficult to estimate with a deep learning algorithm. We will test this hypothesis in the performance assessment presented in “Results” section.

Figure 4

Correlation between descriptors on the whole dataset (600 virtual specimens).

Deep learning approach

We developed two deep learning strategies to estimate all or part of the 27 fabric descriptors defined in “Principal component analysis (PCA)” section from sets of 2D images extracted from the 3D microstructure in orthogonal planes. We used Convolutional Neural Networks (CNNs) because they are particularly suitable for image datasets, and we worked with VGG models, because VGG networks are pre-trained to find relevant patterns in \(224\times 224\) RGB images. Pre-training allows better and faster convergence by transfer learning. The VGG architecture appeared in 2014 and was used in classification tasks, notably in the ImageNet Large-Scale Visual Recognition Challenge, where it achieved results competitive with state-of-the-art models like GoogLeNet29. VGG models have also been used extensively for image style transfer and 2D microstructure analysis30,31,32. The first algorithm that we tested takes three 3D images (i.e., three stacks of 2D images) as input, whereas the second algorithm takes three channels of 2D images (i.e., three concatenated 2D images) as input. We assessed the performance of the deep neural networks when 1, 3, 5 or 10 images are extracted along each spatial direction. In the following, we denote by p the number of 2D images per direction.

Model 1: three stacks of 2D images as input

Structure of the CNN

Model 1 is based on the pretrained VGG19 network29,30,31 and it is designed to calculate the 27 fabric descriptors defined in “Principal component analysis (PCA)” section, except: the invariants \(I_2\) and \(I_3\); the mean and standard deviation of the distribution of distances to nearest neighbor. In total, Model 1 was thus trained to estimate 23 descriptors, which were concatenated into a vector of dimensions (\(1 \times 23\)). The original VGG19 model is composed of 16 convolutional layers and 3 fully connected (FC) layers. We only kept the convolutional layers before the third max pooling layer, which are critical for 2D microstructure image characterization31. The convolutional layers of the original VGG19 model comprise 20,024,384 trainable parameters, while the convolutional layers of the pruned VGG19 model comprise only 1,145,408 trainable parameters, which reduces the training time significantly and allows running the calculations on the open access platform Kaggle (see Table 1).

Table 1 Computational constraints of Kaggle platform (as of Fall 2022).

CNNs take 2D images as input. Axial-Coronal-Sagittal (ACS) convolutions are applied to the convolutional layers of the pruned VGG19 model in order to use 3D images as input. We used the ACSConv package, which was initially developed to handle 3D medical data sets. The ACS conversion makes it possible for a 2D CNN model to process a 3D data set without increasing the number of trainable parameters of the convolutional layers33. Changing the number of images (p) extracted in each direction of the 3D microstructure only affects the number of parameters of the FC layers (see Table 2).
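A minimal construction sketch is given below. It assumes the ACSConverter interface of the ACSConv package and assumes that the retained backbone corresponds to the VGG19 convolutional layers up to conv3_2 followed by a max pooling layer, which is consistent with the 1,145,408 trainable convolutional parameters quoted above; the exact truncation point is otherwise an implementation detail.

```python
import torch.nn as nn
from torchvision.models import vgg19
from acsconv.converters import ACSConverter  # ACSConv package (assumed interface)

# Pruned 2D backbone: VGG19 convolutional layers up to conv3_2, closed by a max pooling
# layer (three conv-relu-conv-relu-pool blocks, 1,145,408 trainable parameters).
pruned_2d = nn.Sequential(*list(vgg19(pretrained=True).features.children())[:14],
                          nn.MaxPool2d(kernel_size=2, stride=2))

# ACS conversion: the pre-trained 2D kernels are reused on 3D inputs of shape
# (batch, 3, p, 224, 224) without adding trainable parameters to the convolutional layers.
pruned_acs = ACSConverter(pruned_2d)
```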

Table 2 Number of trainable parameters in the pruned ACS-VGG19 model, as a function of the number of images p extracted in each direction of the 3D microstructure.

The structure of the pruned ACS-VGG19 model adopted here is illustrated in Fig. 5. The network is made of 20 layers, including:

  • Fifteen convolutional layers, organized in three convolutional blocks that each contain a first convolutional layer followed by a ReLU layer, a second convolutional layer followed by a ReLU layer, and a max pooling layer;

  • A FC block that contains three linear layers separated by ReLU activation layers (two ReLU activation layers total).

Figure 5

The structure of the pruned ACS-VGG19 model (Model 1).

Format of the input data

Figure 6 illustrates how 2D images extracted in planes orthogonal to the x, y and z directions are assembled into a triplet of stacks of depth p, and how these triplets are then concatenated to form a single input tensor of dimensions (\(3 \times p \times w \times h\)). The input tensor stores p stacked images in 3 orthogonal directions of space, and each image has a width w of 224 pixels and a height h of 224 pixels.
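A minimal sketch of this assembly step (the function name is illustrative) is given below.

```python
import numpy as np
import torch

def assemble_input_model1(slices_x, slices_y, slices_z):
    """Model 1 input format: each argument is a list of p binary images of shape
    (224, 224) extracted in planes orthogonal to x, y and z. The three stacks are
    concatenated into a single tensor of shape (3, p, 224, 224)."""
    stacks = [np.stack(s, axis=0) for s in (slices_x, slices_y, slices_z)]  # each (p, 224, 224)
    return torch.from_numpy(np.stack(stacks, axis=0)).float()               # (3, p, 224, 224)
```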

Figure 6

Format of the input data for Model 1: concatenated p-stacked arrangement of 2D images.

Data splitting

Each of the 600 virtual specimens provides two data sets: 23 ground truth fabric descriptors and 2D images extracted in three orthogonal directions of space. We use 60% of the specimen data for training, 20% for validation and 20% for testing. The training data set is used to update the parameters of the model to minimize the training loss at each iteration. The validation data set is used during training to calculate the loss for estimating unseen data and adjust hyperparameters so as to optimize the learning curve. The testing data set is used after training to assess the performance of the model in predicting unseen data. The assessment is based on a comparison between the fabric descriptors estimated during testing and the ground truth descriptors of the testing data set.

Pre-processing, measure of error and hyperparameters

The loss functions measure the distance between estimated and ground-truth fabric descriptors. In order to use the same weight for each fabric descriptor, we applied a minimum-maximum normalization to each fabric descriptor. For a descriptor X, the normalized descriptor \(\overline{X}\) is \((X-X_{min})/(X_{max}-X_{min})\).

We assessed the performance of Model 1 with three different loss functions: the Mean Square Error (MSE), the Root-Mean-Square Error (RMSE) and the Mean Absolute Error (MAE), which are defined as follows:

$$\begin{aligned} \text {MSE}\,&=\,\frac{1}{N_{tot}}\sum _{nd}(y_{nd}-\hat{y}_{nd})^2 \end{aligned}$$
(6)
$$\begin{aligned} \text {RMSE}\,&=\,\sqrt{\text {MSE}} \end{aligned}$$
(7)
$$\begin{aligned} \text {MAE}\,&=\,\frac{1}{N_{tot}}\sum _{nd}|y_{nd}-\hat{y}_{nd}| \end{aligned}$$
(8)

in which \(y_{nd}\) and \(\hat{y}_{nd}\) are the ground-truth and estimated values of the d\(^{th}\) fabric descriptor (one of the 23 fabric descriptors under study) in the n\(^{th}\) estimation, and \(N_{tot}\) is the total number of terms in the sum.

A stochastic gradient descent (SGD) algorithm is employed. While the gradient descent algorithm updates model parameters after calculating the loss for the whole training set, the SGD algorithm updates the model parameters based on the loss of a single data point, picked randomly in the training set. The stochasticity of the SGD algorithm accelerates convergence and helps avoid overfitting. The main equations of the SGD algorithm are:

$$\begin{aligned}&\underline{\underline{{\textbf{w}}}}^{k+1}=\underline{\underline{{\textbf{w}}}}^k-\eta \left( \nabla L^k+2\lambda \,\underline{\underline{{\textbf{w}}}}^k\right) \end{aligned}$$
(9)
$$\begin{aligned}&\left\{ \begin{array}{cc} \underline{\underline{{\textbf{v}}}}^k=\gamma \underline{\underline{{\textbf{v}}}}^{k-1}+\eta \nabla L^k \\ \underline{\underline{{\textbf{w}}}}^{k+1}=\underline{\underline{{\textbf{w}}}}^k -\underline{\underline{{\textbf{v}}}}^k \end{array} \right. \end{aligned}$$
(10)

The matrix \(\underline{\underline{{\textbf{w}}}}\) is the weight matrix to be updated, and \(\underline{\underline{{\textbf{v}}}}\) is the velocity term. The learning rate \(\eta \) controls the step size. The weight decay \(\lambda \) is used to avoid overfitting. The momentum \(\gamma \) helps prevent the solution from being trapped in a local optimum. The values of the hyperparameters are adjusted by trial and error to improve performance. Table 3 summarizes the values of the hyperparameters used in this study.
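The sketch below shows how the quantities of Eqs. (9) and (10) map to the arguments of PyTorch's SGD optimizer; the numerical values are placeholders, not the values of Table 3.

```python
import torch
import torch.nn as nn

# Stand-in module; in practice `model` is the pruned ACS-VGG19 of Fig. 5.
model = nn.Linear(10, 23)

# Placeholder hyperparameter values (not those of Table 3). Note that torch.optim.SGD
# applies weight decay as an additive lambda * w term in the gradient, so the factor 2
# of Eq. (9) is absorbed into the chosen value of lambda.
optimizer = torch.optim.SGD(model.parameters(),
                            lr=1e-3,            # learning rate eta (placeholder)
                            momentum=0.9,       # momentum gamma (placeholder)
                            weight_decay=1e-4)  # weight decay lambda (placeholder)
```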

Table 3 Hyper-parameters used in Model 1.

Since the convolutional layers of the VGG19 model are pre-trained, we compared two strategies: (i) Only the parameters of the FC layers of Model 1 are trained with the input data set of this study. The convolutional layers are fixed, i.e., all the parameters of the convolutional layers are kept at the values obtained during pre-training. (ii) The parameters of all the layers of Model 1 are trained with the input data set of this study. The convolutional layers are trainable, i.e., the parameters learned during pre-training are updated during training.
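The two strategies can be implemented as sketched below; placeholder modules stand in for the pruned ACS-VGG19 backbone and its FC head.

```python
import torch
import torch.nn as nn

# Placeholder backbone + FC head; in this study the backbone is the pruned ACS-VGG19
# and the head is the FC block of Fig. 5.
backbone = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(), nn.Flatten())
fc_head = nn.Linear(8 * 224 * 224, 23)

# Strategy (i): fixed convolutional layers. The pre-trained backbone parameters are
# frozen and only the FC parameters are optimized.
for param in backbone.parameters():
    param.requires_grad = False
optimizer = torch.optim.SGD(fc_head.parameters(), lr=1e-3, momentum=0.9)  # placeholders

# Strategy (ii): trainable convolutional layers. Leave requires_grad at its default
# (True) and pass all parameters (backbone + head) to the optimizer.
```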

Model 2: Three channels of concatenated 2D images as input

Structure of the CNN

Model 2 is based on the pretrained VGG16 network34 and it is designed to estimate all 27 fabric descriptors defined in “Principal component analysis (PCA)” section. VGG16 is composed of five convolutional blocks and three fully connected layers:

  • The first two convolutional blocks are each composed of two 2D convolutional layers each followed by a ReLU activation layer, and a 2D max pooling layer that divides the width and height of the output by 2.

  • The following three convolutional blocks are each composed of three 2D convolutional layers each followed by a ReLU activation layer, and a 2D max pooling layer.

  • The first two fully connected layers have 512 neurons.

  • The last fully connected layer has 27 neurons.

VGG16 was originally used for classification tasks. Its output is a vector of length 1,000, each entry representing the probability that the input image belongs to a certain class. In our study, the goal is to conduct a regression to predict 27 continuous values. Thus, we replaced the original fully connected layers of VGG16 with three fully connected layers: the first two layers have 512 neurons while the third layer has 27 neurons. No further activation was applied. Two dropout layers were applied between the three fully connected layers to avoid overfitting. The structure of the VGG model adopted here is illustrated in Fig. 7. The convolutional layers of VGG16 use filters with a very small receptive field: a \(3\times 3\) kernel with a stride of 1 and a padding of 1. The number of filters increases up to 512. We applied a batch normalization layer35 between each 2D convolutional layer and its activation function to accelerate the training process.
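A minimal sketch of Model 2 is given below. It assumes the torchvision vgg16_bn backbone (which already places a batch normalization layer between each convolution and its activation), replaces the adaptive average pooling layer with an identity so that a (3, 64, 192) input yields a 512 x 2 x 6 feature map, and uses a dropout rate of 0.5; these choices are assumptions where the text above does not specify the details.

```python
import torch
import torch.nn as nn
from torchvision.models import vgg16_bn

model2 = vgg16_bn(pretrained=True)       # five convolutional blocks with batch norm
model2.avgpool = nn.Identity()           # keep the 512 x 2 x 6 feature map as is (assumption)
model2.classifier = nn.Sequential(
    nn.Linear(512 * 2 * 6, 512), nn.Dropout(0.5),   # dropout rate is a placeholder
    nn.Linear(512, 512), nn.Dropout(0.5),
    nn.Linear(512, 27),                  # 27 fabric descriptors, no output activation
)

y = model2(torch.rand(1, 3, 64, 192))    # output shape: (batch, 27)
```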

Figure 7
figure 7

The structure of the custom VGG16 model (Model 2).

Format of the input data

The input 2D images are arranged in stacks of p-concatenated images, as shown in Fig. 8. All the images extracted along the same axis are concatenated along the width, resulting in three images of shape \((h, p\times w)\), which represent three channels of input data. The three channels are stacked along a third dimension, the depth, such that the input image has dimensions \((h, p\times w, 3)\). Each channel represents multiple images taken along the same axis and has a function similar to that of an RGB color channel. The p-concatenated arrangement has the advantage of being interpretable by any convolutional network. Figure 9 shows examples of concatenated arrangements stacked in 3 channels for \(p=1\) and \(p=3\).
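A minimal sketch of this arrangement (the function name is illustrative) is given below.

```python
import numpy as np

def assemble_input_model2(slices_x, slices_y, slices_z):
    """Model 2 input format: each argument is a list of p binary images of shape (h, w).
    Images taken along the same axis are concatenated along the width, and the three
    resulting (h, p*w) channels are stacked along the depth."""
    channels = [np.concatenate(s, axis=1) for s in (slices_x, slices_y, slices_z)]  # (h, p*w)
    return np.stack(channels, axis=-1)                                              # (h, p*w, 3)
```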

Figure 8

Format of the input data for Model 2: stack of p-concatenated arrangement of 2D images.

Figure 9

Examples of p-concatenated arrangements stacked in 3 channels.

Data splitting

Here, we adopt a 70-30\(\%\) train-test split, which we obtained with the scikit-learn function train_test_split. The 27 fabric descriptors defined in “Principal component analysis (PCA)” section are calculated separately for the 420 microstructures in the training dataset and for the 180 microstructures in the testing dataset. We checked that the test and train distributions were statistically indistinguishable by using a Kolmogorov-Smirnov test.
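A minimal sketch of the split and of the distribution check, on stand-in arrays, is given below.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from scipy.stats import ks_2samp

# Stand-in data: 600 microstructures, 27 descriptors each, and the matching image inputs.
descriptors = np.random.rand(600, 27)
images = np.random.rand(600, 64, 192, 3)

img_train, img_test, y_train, y_test = train_test_split(
    images, descriptors, test_size=0.3, random_state=0)

# Kolmogorov-Smirnov test on each descriptor: a large p-value means the train and test
# distributions cannot be statistically distinguished.
for d in range(y_train.shape[1]):
    statistic, p_value = ks_2samp(y_train[:, d], y_test[:, d])
```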

Pre-processing, measure of error and hyper-parameters

Before training, we instantiated a pre-trained VGG16 model with 3-concatenated images as inputs. The cross-sectional binary images extracted from the 3D microstructures have dimensions (64, 64), so the input images had dimensions (64, 192, 3). Although the original input size of VGG16 is (224, 224, 3), we can still apply it to images of size (64, 192, 3), by using smaller-sized fully connected layers. In each of the three stacked channels, the mean pixel value is subtracted from the image, and a Gaussian blur is applied before training the model. The 27 target descriptors are all normalized so that they all fit in [0, 1]36.

The Mean Absolute Error (MAE) is used as the loss. We compute and minimize the MAE on every batch of (image, descriptors) pairs in the training data. The Mean Absolute Percentage Error (MAPE) is used as a complementary performance metric. We compute the MAPE on the whole test data set. The MAE and MAPE are defined as follows:

$$\begin{aligned} \text {MAE} = \frac{1}{N_{tot}} \sum _{i,d} |y_{i, d} - \hat{y}_{i, d} |,\qquad \text {MAPE} = \frac{100}{N_{tot}}\sum _{i,d} \left| \frac{y_{i, d} - \hat{y}_{i, d}}{y_{i, d}} \right| \end{aligned}$$
(11)

in which \(y_{i, d}\) and \(\hat{y}_{i, d}\) are the d-th target and predicted descriptor of the i-th microstructure, and \(N_{tot}\) is the size of the test dataset. We implemented a naive model, referred to as the mean algorithm, which simply computes the mean descriptors in the training dataset and outputs those mean descriptors for every microstructure in the test dataset. The predictions of the mean algorithm provide a baseline against which the performance of Model 2 can be compared.
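A minimal sketch of the mean-algorithm baseline and of the corresponding errors (up to the normalization convention of Eq. 11) is given below.

```python
import numpy as np

def mean_baseline_errors(y_train, y_test):
    """Mean algorithm baseline: predict, for every test microstructure, the per-descriptor
    mean of the training set, then report the resulting MAE and MAPE."""
    y_pred = np.tile(y_train.mean(axis=0), (y_test.shape[0], 1))   # constant prediction
    mae = np.mean(np.abs(y_test - y_pred))
    mape = 100.0 * np.mean(np.abs((y_test - y_pred) / y_test))
    return mae, mape
```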

We used the Adam optimizer with a learning rate of \(1\times 10^{-4}\), trained for 700 epochs, with 64 samples per batch. The model was trained using the NVIDIA K80 GPU provided by Kaggle.

Results

Model 1: three stacks of 2D images as input

Performance based on computational cost

The number of trainable parameters in the convolutional layers is one to two orders of magnitude smaller than the number of trainable parameters in the fully connected layers (Table 2). One may therefore wonder whether fixing the convolutional layer parameters to their pre-trained values reduces the training time by a comparable amount. Table 4 shows that fixing the convolutional layers divides the required training time by two, regardless of the number of input images. This observation indicates that, compared to the FC layer parameters, the convolutional layer parameters are more expensive to train. In the following, we evaluate whether there is a performance cost associated with the computational savings, by comparing the loss of Model 1 with trainable and fixed convolutional layers.

Table 4 Training time for ACS-VGG layers at termination epoch = 75, unit: seconds.

Performance based on loss (error)

The values of the loss functions for all the training settings are summarized in Table 5. First, we note that the variation of the loss with p is not monotonic in any of the training settings. Fixing the convolutional layers does not significantly degrade the loss: in most settings, the loss increases by 5% to 12% when the convolutional layers are fixed, except for the MSE with \(p=10\), the MAE with \(p=3\) and the MAE with \(p=10\). From Table 4, training the model with fixed convolutional layers takes half the time required for training the fully trainable model. The trade-off is an increase in loss that, in most configurations, does not exceed 7%. We conclude that the training setting with fixed convolutional layers is the most advantageous for the purpose of this study. For fixed convolutional layers, the best performance is achieved for \(p=10\). For example, the MAE decreases by 6% when changing the input from \(p=1\) to \(p=10\) with fixed convolutional layers. However, the MSE and RMSE do not decrease monotonically as p increases. Since the computational time increases roughly linearly with p (see Table 4), we conclude that the best trade-off between minimization of loss and minimization of computational time is obtained when the model is trained with fixed (pre-trained) convolutional layers and one image taken in each plane (\(p=1\)).

Table 5 Model 1 loss values for the testing data set.

The value of each microstructure descriptor predicted with Model 1 during testing is plotted against its ground truth value for all the numbers of slices considered in Fig. 10. Figure 11 shows the distributions of the target and predicted descriptors for trainable convolutional layers and different p values. The features have been converted back to their actual values (instead of their normalized values) for a better physical understanding. The overlap between the target and predicted descriptors varies widely across descriptors. To better quantify this discrepancy in predictability across features, we define a new metric of accuracy in “Performance based on accuracy” section.

Figure 10

Microstructure descriptors predicted with Model 1 during testing versus ground truth.

Figure 11

Distributions of the target and predicted descriptors in the test dataset after training Model 1 with different p values. Microstructure descriptors are not normalized in these plots.

Performance based on accuracy

Similar to the MAPE used as a complementary performance metric for Model 2, we define a complementary measure of accuracy for Model 1. The accuracy of Model 1 for the i\(^{th}\) fabric descriptor is defined as the ratio between the area shared by the ground-truth and estimated distribution curves of the i\(^{th}\) feature and the area under the ground-truth distribution curve of that feature, as illustrated in Fig. 12.
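A minimal sketch of this accuracy measure for one descriptor, using normalized histograms as a stand-in for the distribution curves of Fig. 12, is given below.

```python
import numpy as np

def distribution_overlap_accuracy(y_true, y_pred, bins=30):
    """Accuracy for one fabric descriptor: ratio of the area shared by the ground-truth
    and estimated distribution curves to the area under the ground-truth curve."""
    lo = min(y_true.min(), y_pred.min())
    hi = max(y_true.max(), y_pred.max())
    edges = np.linspace(lo, hi, bins + 1)
    width = edges[1] - edges[0]
    p_true, _ = np.histogram(y_true, bins=edges, density=True)
    p_pred, _ = np.histogram(y_pred, bins=edges, density=True)
    intersect = np.sum(np.minimum(p_true, p_pred)) * width   # shared area
    reference = np.sum(p_true) * width                       # area under ground truth
    return intersect / reference
```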

Figure 12

Accuracy of Model 1 for estimating \(F_{11}\) from the training data set (MSE loss function, fixed convolutional layers, \(p=1\)).

The accuracy of Model 1 averaged over all 23 features is presented in Table 6 for all the configurations tested: 1, 3, 5 or 10 images per axis; MSE, RMSE or MAE loss function; trainable or fixed convolutional layers. The MAE is the loss function that gives the most consistent predictions in that setting, since it decreases as p increases. However, overall, increasing the number of images per axis (p) increases the computational cost (Table 4) but does not improve the mean accuracy of the model significantly (Table 6). The highest mean accuracy that the model can reach during testing is \(68.65\%\). It is obtained with the MSE loss function and for 1 image per axis. This somewhat surprising result may be attributed to an insufficiently large training data set, to the complexity of the prediction task (estimating 23 features simultaneously), and to the low values of the second-order moments of probability (i.e., standard deviations) of the fabric descriptors, which are difficult to predict because they are similar to noise.

Table 6 Average accuracy of Model 1 (\(a\%\)).

Figure 13 provides the accuracy of the 23 estimated fabric descriptors for the two models that yield the highest mean accuracy: MSE loss with \(p=1\) for trainable convolutional layers, and MSE loss with \(p=3\) for fixed convolutional layers. Features estimated with the highest and lowest accuracy are listed in Table 7 with the details of the models with which they were obtained. Figure 14 shows the MAE and MAPE of the features predicted from the testing data set, calculated by using Equation 11 for the model with the highest mean accuracy (\(68.65\%\)). That model, which has trainable convolutional layers, uses the MSE loss function and takes 1 image per axis as input, yields a MAPE of 13.8% for the aggregate volume fraction, 5.25% for the mean aggregate size, less than 2% for the mean aspect ratios b/a and c/a and for the mean solidity, and 9.75% for the mean roundness. Most MAPEs for the standard deviations of those descriptors are in the range 5%-20%. The means of the distributions of the components of the orientation matrix are predicted with a MAPE of the order of 100% or above, while the MAPEs of the standard deviations are mostly distributed between 20% and 50%. We conclude that Model 1 cannot be used to predict orientation, but gives satisfactory results for predicting mean geometric descriptors. Standard deviations are harder to predict because their low values make them similar to noise. The aggregate volume fraction is the lowest order descriptor, and yet, the MAPE associated with that descriptor is higher than that associated with higher order descriptors such as size, aspect ratio, roundness and solidity. The somewhat low performance of Model 1 for estimating the aggregate volume fraction may be due to the loss that is used, which gives equal importance to all descriptors. Highly correlated descriptors have a better chance of being well predicted, hence the higher performance for correlated geometric descriptors such as aspect ratio and roundness over aggregate volume fraction (see Fig. 4).

Figure 13

Accuracy of Model 1 with trainable and fixed convolutional layers. Results displayed by feature, for the two models that yield the highest mean accuracy across the features.

Figure 14

Model 1 performance to predict fabric descriptors from the test data set (MSE loss function, one 2D image per axis, trainable convolutional layers). Results are displayed in actual value (and not normalized value). The MAE is represented in colored bars, with its \(95\%\) confidence interval. Each descriptor is also annotated with its MAPE.

Table 7 Features which have highest/lowest accuracy with Model 1.

Model 2: three channels of concatenated 2D images as input

Performance of the model trained and tested with 3 concatenated images per channel

Model 2 (based on VGG16) was first trained and tested only with inputs made of 3 concatenated images per channel. The MAE was 0.0276 on the test dataset. The value of each microstructure descriptor predicted with Model 2 during testing with three concatenated images per channel is plotted against its ground truth value in Fig. 15. Figure 16 shows the distributions of the target and predicted descriptors for the test dataset, in which the features have been converted back to their actual values (instead of their normalized values) for a better physical understanding. The large overlap between the two suggests a high level of accuracy in the predictions. This high performance is confirmed by Fig. 17, which shows the prediction errors made for each of the 27 fabric descriptors under study. The prediction error made by the model lies between 1.5 and \(7\%\) for most descriptors. In comparison, the prediction errors made by the mean algorithm are between 16 and \(50\%\). Model 2 accurately estimates the means of descriptors like solidity (MAPE: \(0.654\%\)) and aggregate size (MAPE: \(1.91\%\)). But the model does not perform as well when estimating the average values of the off-diagonal coefficients of the orientation tensor (MAPE: \(96.9\%\)-\(249\%\)). We attribute this lower performance to the fact that the coefficients \(F_{ij,\, i\ne j}\) have a low mean value and a low variance, which makes them difficult to estimate. We also notice that, except for the orientation tensor components, our model estimates means better than standard deviations in terms of MAPE. This is due to the fact that the standard deviations of aggregate size, aspect ratio, roundness and solidity are often close to 0, resulting in higher MAPEs. Mathematically speaking, it is harder to predict second-order moments (standard deviations) than first-order moments (means). Overall, the invariants of the orientation tensor (\(I_2\) and \(I_3\)) are predicted with excellent or good accuracy (MAPE of 2.8% and 8.9%, respectively), which suggests that the model can recognize geometric features regardless of the orientation of the input images.

Figure 15

Microstructure descriptors predicted with Model 2 during testing versus ground truth (\(p=3\)).

Figure 16

Distributions of the target and predicted descriptors in the test dataset after training Model 2 with inputs made of 3 concatenated images per channel. Microstructure descriptors are not normalized in these plots.

Figure 17

Model 2 performance to predict fabric descriptors from the test data set after training Model 2 with inputs made of 3 concatenated images per channel. Results are displayed in actual value (and not normalized value). The MAE is represented in colored bars, with its \(95\%\) confidence interval. Each descriptor is also annotated with its MAPE.

Effect of the number of input images on the model performance

We retrained the model over 700 epochs with inputs made of 1 image per channel only, 5 images per channel, and 10 images per channel. The performance of Model 2 on the test datasets is summarized in Table 8. As expected, the MAE and MAPE decrease when the number of 2D images extracted in each direction of space increases. The increased performance comes with a higher computational cost. For example, it takes about 13 hours to train the model with 10 images per channel as input. On average, the training time increases linearly with the number of images taken in each direction of space (p). The MAE decreases only marginally when p is increased beyond 3.

Table 8 Performance of Model 2 trained and tested with different input sizes.

Discussion

On the prediction of orientation descriptors

Model 1 contains 2 to 28 times more trainable parameters than Model 2 (depending on the number of slices used as input). Model 1 likely requires a larger dataset than Model 2 to be properly trained. Switching to Model 2 significantly improved the performance of the Visual Geometry Group (VGG) algorithm. Hence, the primary reason why orientation descriptors were not predicted with high accuracy with Model 1 was the lack of data available to train the deep neural network. Even with fewer parameters to train, Model 2 cannot predict the off-diagonal coefficients of the fabric tensor, for which the MAPE varies between 200% and 1,000%. The absolute mean values of \(F_{11}\), \(F_{22}\), \(F_{33}\) are about 10 times higher than those of \(F_{23}\), \(F_{13}\) and \(F_{12}\). Additionally, the off-diagonal coefficients of the fabric tensor exhibit low variance. Distributions with low mean values and low variance require a high-precision model to be accurately predicted, which may explain why even Model 2 does not achieve a good performance for \(F_{23}\), \(F_{13}\) and \(F_{12}\). Model 2 performs well in predicting the diagonal coefficients of the fabric tensor, since the MAPE for the means of \(F_{11}\), \(F_{22}\) and \(F_{33}\) is on the order of 4-6%. However, the MAPE obtained for the descriptors of shape, such as aspect ratio, roundness and solidity, is 2-10 times lower. It may be possible to prune Model 2 further and achieve better performance. Another option is to train Model 2 several times to predict different sets of microstructure descriptors. For example, a set of VGG16 parameters could be used to predict shape descriptors only, and another set of VGG16 parameters could be used to predict orientation descriptors only. Besides the ratio of the number of parameters to the number of predicted outputs, a possible explanation for the lower performance in predicting orientation descriptors is the choice of the loss function.

The Mean Squared Error (MSE) is widely used in vector regression for its simplicity, efficacy and versatility. But the MSE heavily penalizes larger errors, potentially leading to a model that is overfitted to the most common errors, at the expense of accurately predicting rarer or more complex cases. Experiments conducted with the Mean Absolute Error (MAE) generally yielded inferior outcomes. The Huber loss combines the benefits of the MSE and MAE, demonstrating reduced sensitivity to outliers compared to the MSE. It could be interesting to fine-tune our models with this loss. The literature extensively investigates the impact of loss functions, particularly in regression contexts, and can guide the choice among different losses37. For instance, the logCosh loss, akin to the Huber loss, often emerges as a viable compromise, though no loss function guarantees superior performance across all scenarios. Employing a custom loss function could enhance the prediction of poorly predicted descriptors by focusing on specific aspects of the data. For example, for orientation descriptors, a loss function accounting for the cyclic nature of the orientation distribution could diminish prediction errors. Loss functions grounded in geometric distances or angular measures, such as the cosine loss, may more effectively capture the relationships between grain orientations in a biphasic structure. These approaches could offer an increased sensitivity to subtle orientation variations that are inadequately represented by the MSE.

Regardless, the second and third invariants of the fabric tensor are predicted with a MAPE of 2.8% and 8.9%, respectively, which indicates that despite a lower performance on orientation descriptors, Model 2 predicts the relative orientation of microstructure features well, regardless of the orientation of the input image of the microstructure.

On the interpretability of the models

The proposed models are biased towards highly correlated descriptors. This is because the VGG algorithms presented here are trained to minimize a loss function that is the sum of the errors made on all the fabric descriptors. All the errors in the sum are assigned the same weight, such that two highly correlated descriptors are twice as likely to be predicted accurately as a descriptor that is not correlated to any other. According to Fig. 4, volume fraction and grain size are highly correlated. So are roundness, solidity and the c/a aspect ratio. The diagonal coefficients of the fabric tensor and the fabric invariants are also highly correlated (positively or negatively). As expected, the least correlated descriptors, mainly the off-diagonal coefficients of the fabric tensor, are predicted with less accuracy than the other descriptors. Descriptors that are correlated are predicted with similar accuracy. For example, with Model 2, the range of MAPE is 0.5%-3% for all shape descriptors. However, the fact that some descriptors are highly correlated does not imply high model performance. For example, as noted earlier, the performance of Models 1 and 2 is lower for \(F_{11}\), \(F_{22}\), \(F_{33}\), \(I_2\) and \(I_3\) than for shape descriptors. Mathematically, it may be beneficial to weight the importance of the predicted microstructure features as a function of their correlations, to avoid bias. However, in practical applications, the off-diagonal coefficients of the fabric tensor are less important than the combination of the fabric diagonal coefficients and invariants, which suffice to predict the orientation of features independently from the orientation of the input images. As a result, a bias towards correlated descriptors that are of interest to the user may be an advantage.

On the generalization performance of the models

Network pruning generally yields a good generalization performance for pre-trained networks like the VGG networks38 and allows reducing the number of parameters of the model. Most network pruning techniques consist of removing redundant neurons or connections39,40,41. For CNNs, network pruning either aims to remove redundant connections or to delete channels. Channel pruning reduces the feature map width, which may significantly change the format of the input to the next layer and make it challenging to achieve the desired accuracy. Additionally, the training time for any type of pruned network can be as high as, or higher than, that of the original model38. Lastly, fine-tuning a pruned network remains a challenge because there is no theory to dictate which neuron or connection to prune. That is why knowledge interpretability is an active research area38. Network quantization is another network compression technique, in which the number of bits used to represent each weight is reduced42,43. Network quantization is known to yield good accuracy, except for very large CNNs38. Filters (also called structural matrices) can be applied to network layers to reduce the number of model parameters44. The transformations are similar to non-linear projections. In practice, structural matrices are difficult to find and may introduce bias in the model, hence lowering its performance38. Low-rank factorization is used to transform network layers into products of low-rank filters, which compresses the network and accelerates training and inference45. High-dimensional DCT (discrete cosine transform) and wavelet systems using tensor products have been successfully employed in deep learning. However, factorization requires extensive model retraining to achieve convergence. Additionally, this method relies on decomposition operations that have a high computational cost38. It is also possible to compress CNN models by applying transformations to a set of layers, a process called transferred convolutional filters. This method is known to achieve competitive performance for wide/flat architectures such as the VGG nets, but requires imposing prior human knowledge on the model, which may affect the stability of the models38. Lastly, parameter reduction can be achieved by knowledge distillation, which consists of training a deep learning model with a thinner architecture to mimic the function learned by the original model with a wide architecture46. The main drawback of knowledge distillation is that it is limited to networks with a softmax loss function. Additionally, the performance gain is lower than with other approaches such as pruning, quantization or factorization38.

Regardless of parameter reduction, generalization may be improved by training the deep learning model with smaller batches of data, because large-batch methods tend to converge to sharp minimizers of the training and testing functions47. Using mini-batches mobilizes more parallel computing resources and improves convergence and accuracy48. It is also recommended to use an optimizer that partially adapts the learning rate as a function of a long history of the gradient of the loss function. In this study, the Adaptive momentum estimation method (Adam) was employed. Adam has demonstrated strong generalization capabilities. Once all the strategies above have been tested, generalization may be further enhanced by using the Partially adaptive momentum estimation method (Padam)49, which maintains a fast convergence rate, similar to the fully adaptive gradient method Adam, while achieving a generalization performance similar to that of the stochastic gradient descent (SGD) algorithm.

Conclusions

In this study, we compared the performance of two custom neural networks from the Visual Geometry Group (VGG) to predict 3D fabric descriptors from a set of 2D images that represent slices of a 3D biphase microstructure in three orthogonal planes. The data set used for training and testing is a set of 600 3D microstructures of cemented aggregate assemblies that are created numerically. Each of these 600 3D microstructures is used to calculate a ground-truth vector of fabric descriptors such as the mean and standard deviation of aggregate size, aspect ratio and orientation. The input data are 2D images obtained by slicing the 3D microstructures in each direction of space at regular intervals.

The first model is a pruned version of VGG19 to which Axial Coronal Sagittal (ACS) convolutions are applied to allow formatting the input data as a single 3D image, i.e., as a concatenation of three objects. Each of those objects is the stack of 2D slices taken in the plane orthogonal to one direction of space, such that the input data set has a dimension \(3\times p \times w \times h\), where p is the number of images taken in each of the planes of normal x, y and z, and w and h are the number of pixels of the 2D slices in the width and height directions. The custom ACS-VGG19 model contains fifteen convolutional layers organized in three blocks, and five fully connected layers. The second model is a customized version of the VGG16 model that contains 31 convolutional layers organized in five blocks, and three fully connected layers. The input to the VGG16 model is a stack of concatenated 2D images of dimensions \(h \times (p \times w) \times 3\).

Overall, both models exhibit lower performance for orientation descriptors than for shape descriptors like aggregate size, aspect ratio, roundness and solidity. Both models perform better at predicting the means than the standard deviations of shape descriptors. The ACS-VGG19 model that provides the best average accuracy across descriptors takes one image per direction as input, uses the MSE as a loss, and has fully trainable convolutional layers. The computational cost to train the custom ACS-VGG19 model increases linearly with p (the number of images extracted in each direction of space), and increasing p does not improve the performance of the model, or only does so marginally. For \(p=1\), using the MSE loss and trainable convolutional layers, the custom ACS-VGG19 model provides a MAPE of 2 to 5% for the means of aggregate size, aspect ratios b/a and c/a, and solidity. Surprisingly, the lowest order descriptor, the aggregate volume fraction, is estimated with a MAPE of 13.8%, which is attributed to the bias given by the loss function towards highly-correlated descriptors. While the pruned ACS-VGG19 model cannot be used for estimating orientations, the custom VGG16 provides satisfactory estimates for all descriptors across the board, except for the off-diagonal components of the average orientation matrix, which exhibit low mean values and low variance and thus require a high-precision model to be accurately predicted. As an example, the custom VGG16 yields a MAPE of 2% or less for the means of aggregate size, distance to nearest neighbor, aspect ratios and solidity. The MAPE is less than 3% for the mean roundness, and in the range of 5-7% for the aggregate volume fraction and the mean diagonal components of the orientation matrix. It is interesting to note that the custom VGG16 model predicts the second and third invariants of the orientation matrix with a MAPE of 2.8% and 8.9%, respectively, which suggests that the model can predict orientation descriptors regardless of the orientation of the input images. The performance of Model 2 increases with the number of images taken as input in each direction of space. However, increasing p is not cost-effective beyond 3 images per axis because of the computational costs and the marginal loss reduction.

The custom VGG16 model (Model 2) performs better than the pruned ACS-VGG19 model (Model 1), likely because it contains fewer parameters than the ACS-VGG19 model. For example, for p = 1, Model 2 contains 16,048,729 parameters, compared to 52,561,751 parameters for Model 1. For p = 10, Model 2 contains 28 times fewer trainable parameters than Model 1. Model 1 contains heavy fully connected layers, which represent more than 98% of the trainable parameters. In comparison, the majority of the trainable parameters in Model 2 are found in the convolutional layers. Models with fewer parameters are able to converge better and faster, with less data. Intuitively, this is because finding the minimum of a loss function is easier when the loss depends on fewer parameters.

The independence of Model 2 from the orientation of the input images implies that the proposed deep neural network could help optimize the number of slices acquired by computed tomography imaging for the characterization of anisotropic materials. The study reported in the manuscript indicates that using 3 slices per direction is optimal. The result may be extended to biphase materials with microstructures similar to those used in our dataset, mainly cemented aggregates. However, for microstructures that exhibit very different geometric features, such as non-convex inclusions or fiber-like inclusions, it is necessary to re-train the models to optimize the number of input slices. The MAPE and the measure of accuracy proposed in the manuscript are useful tools to determine the optimum. We expect that the best trade-off between the quantity of input data (number of slices) and the number of microstructure descriptors predicted (model output) will partially depend on the quality of the input images. As a result, automating the selection of the number of input slices may not be feasible. A natural extension of this work is the adaptation of the proposed models to input data of variable format, which will allow training deep neural networks from inputs that contain various numbers of images per direction, different numbers of images in different directions, and unequally spaced slices.