Introduction

Solid oxide fuel cells (SOFCs) are characterized by high efficiency as well as fuel flexibility, but their drawback is the degradation due to high operating temperature in the range of 600–900 oC. The harsh operating conditions results in the microstructural evolution during operation. In particular, nickel (Ni) agglomeration and migration1,2, Ni oxidation and reduction3,4, ceramic support cracking5,6, carbon deposition7, phase segregation8,9 and poisoning of impurities such as sulfur10,11 are observed in the fuel electrodes. Several attempts have been undertaken to predict those changes at the microstructural level. Wang et al.12 and Xiang et al.13 attempted to model nickel depletion and coarsening with the phase field model. Jiao and Shikazono14 proposed a method to simulate nickel oxide reduction process in the composite structure incorporating the volume change followed by the phase field simulation of particle sintering. Hara et at.15 simulated Ni sintering with a Monte Carlo method. Even though successful results were obtained, such methods require a priori assumptions and simplifications in the modeling, which cause difficulties in predicting complex and combined degradation phenomena.

The sizes of particles in SOFC electrodes are in the sub-micron range11,16,17,18,19,20. Recently, even finer microstructures with nano-particles are achieved by techniques such as infiltration and exsolution5,21,22,23, etc. Typical observation techniques of SOFC electrode microstructures are scanning electron microscopy (SEM), transmission electron microscopy (TEM) and 3-D characterization by focused ion beam—scanning electron tomography (FIB-SEM)24,25. Measuring time-series changes of an identical sample using these techniques is challenging due to the destruction of the samples. Several samples from the same fabrication batch are usually observed, and their microstructural statistics are compared in the post-mortem characterization. Obtaining paired data of an identical sample before and after the processes is challenging, since it requires non-destructive imagining techniques. Nano-XCT is the only choice26,27 to meet the resolution required for the electrode microstructure analyses. In addition, several attempts were taken to observe the SOFC electrodes using operando techniques. Patterned electrodes under SOFC and SOEC operations were observed by a laser confocal scanning microscope by Komatsu et al. and Ouyang et al.28,29. Jeangros et al. developed measurement setup for a single-chamber SOFC operated inside an environmental TEM (E-TEM)30. However, those studies require simplifications in geometries and operation procedures.

Machine learning methods are widely used in processing and extracting information directly from large experimental datasets. The convolutional-based neural network (CNN) are well adjusted to process 2-D image and 3-D volume data31,32,33,34. However, they usually require high quality paired data for training. UNsupervised Image-to-image Translation (UNIT) network has become popular in many image processing applications such as style transfer, image colorization and object transfiguration35, which can correlate two image domains without explicitly providing paired training examples. However, applications of UNIT to analyze electrode microstructures are scarce. A similar technique for unsupervised image translation, cycle-consistent generative adversarial network (CycleGAN) was successfully used for enhancing the quality of SEM images36, correction of out-of-focus microscopic images37 and transformation of the optical microscopy images38.

Machine learning applications for fuel cells and porous materials have been rapidly evolving in recent years39. For SOFC applications, neural networks and fuzzy logic models were initially applied to optimize and predict SOFC performance40. Artificial neural networks (ANN) with a perceptron architecture with a limited number of input and outputs were incorporated for material screening41, predictions of SOFC performance depending on operating conditions42,43,44, flow field structure45, electrode microstructural parameters46, and SOFC stack electrochemical impedance spectra (EIS)47. Moreover, coupling ANN with algorithms such as swarm optimization46,48 and genetic algorithm45,46,49,50 provided new strategy for system optimization. Recently, more complex neural networks are applied for processing time series data and for analyzing SOFC microstructures. Recurrent neural networks (RNN) and long-short time memory networks (LSTM) utilize sequential information as the input data, which were applied for SOFC fault diagnosis51, prediction of remaining lifetime52 and prediction of EIS and DRT degradations53. Applications of convolutional neural networks (CNN) and generative adversarial networks (GAN) for SOFC and porous materials microstructures, e.g. image segmentation, super-resolution, parameter prediction and artificial structures generation, have been receiving great attention. The semantic segmentation algorithms based on various CNN architectures were proposed for identifying the phases in raw SEM and FIB-SEM images54,55,56. Super resolution CNN based algorithm was proposed to improve the resolution of optical microscope images to SEM-alike quality57. Sequential FIB-SEM images were super-resolved in the stacking direction58. Furthermore, CNNs have been used in material engineering to extract microstructural properties such as effective diffusivity34,59, ionic conductivity60, elastic deformations61, and electron back-scatter diffraction patterns62, etc. The CNNs successfully predicted parameters of anodes63 and cathodes64 from their microstructures. The generative machine learning was utilized for fabricating artificial SOFC microstructures. Gayon-Lombardo et al.33 and Hsu et al.65 demonstrated successful applications of GANs to synthesize 3-D SOFC microstructures. Kishimoto et al.66 synthesized microstructures with predefined properties by conditional GAN, and Sciazko et al.31 reconstructed 3D microstructures from 2D cross-sectional images by GAN. Machine learning was also successfully utilized for detecting materials degradation and failure occurrence67. However, applications of machine learning tools to predict microstructural changes of electrodes are still limited.

In this work, a framework to predict microstructural changes by the physically constrained UNIT network is proposed. The validity of the proposed approach is tested for the NiO reduction process in the Ni-based SOFC fuel electrodes. The volume of NiO decreases by 41% during the reduction process68, and the reduced Ni 3D microstructures depend on the reduction conditions such as temperature69,70 and gas composition69,71. Moreover, the effects from the interactions between supporting ceramic material and NiO particles cannot be excluded72,73. The reduction process greatly influences the electrode morphological parameters such as triple phase boundary density (TPB), Ni network connectivity and pore tortuosity, which determines electrode’s electrochemical performance. In addition, it was reported that initial electrode microstructure after the reduction process influences long term degradation70. Understanding the reduction mechanism is highly relevant for predicting electrode degradation during reduction/oxidation (redox) cycles occurring under fuel-rich and fuel-lean cycling conditions73,74.

Here, microstructure degradation due to the NiO reduction process is treated as the computer vision problem for the image-to-image translation. As the paired training images of an identical sample before and after the NiO reduction are very difficult to obtain, it is necessary to apply unsupervised training approach. In the unsupervised training, only available information is two independent sets of image data in which one consists of images in an arbitrary domain and the other consists of images in another domain. However, the paired examples demonstrating dependance between images in both domains do not exist. In the present study, we propose a method using UNIT with an additional physical constraint incorporated into the UNIT generator loss function enforcing the partly known physical phenomena. The network is trained with un-paired SEM images from the nickel oxide (NiO)-yttria stabilized zirconia (YSZ) and NiO-gadolinium doped ceria (GDC) electrodes. In addition, conditional UNIT (C-UNIT) is trained to capture the influence of reduction temperature as well as predicting the microstructure dependance on reduction time.

Results and discussion

Validation

The validity of the present method was confirmed with the toy-problem tests equivalent to the simplified reduction process as shown in Figs. 13. Three types of toy-problems were carried out for the following purposes: (I) determining interdomain translation operator for predicting structural changes in the same class of microstructures, (II) predicting evolution in new microstructures and (III) extrapolating the prediction for the new process depending on the external conditions. Two data domains were defined, where domain A consists of spherical particles with a diameter of 15-pixels (pointed by green color) and 4-pixels core (red color). Domain B consists of analogical 8-pixels diameter particles with 4-pixels core. The red core particle is assumed to be immobile during the translation process, and this constraint is introduced in the generator loss function. Training data for the UNIT network have been prepared by randomly placing the particles in space independently for both Domains A and B. While the statistical characteristics of the particles in both domains are known, an image translation operator between the domains is not available, as there are no explicit examples demonstrating how images from Domain A are translated into Domain B.

Fig. 1: Validation toy problems with single and multiple particles.
figure 1

a, c Training data; b, d input and output of UNIT and ground truth.

Fig. 2: Validation toy problem for samples with various porosities.
figure 2

a Training data (11–20 and 191–200 particles in the domain), b input and output of UNIT and ground truth for samples with high, low and middle porosities, c global pixel-wise accuracy and d predicted phase fractions.

Fig. 3: Validation toy problem depending on external condition.
figure 3

a Training data with conditional output. Input and output of C-UNIT and ground truth for conditions of b 7 and c 15 pixels. d Predicted phase fractions. e Microstructures with various porosities and predicted phase fractions.

In the first toy problem, a single particle was placed in each domain (Fig. 1a). The UNIT network can discover the translation pattern directly from the unpaired training data. The network outputs a particle with identical spatial position but different diameter of the green particle (Fig. 1b). The comparison between UNIT predictions and ground truth data for 200 samples gives the pixel-wise accuracy of over 99.6%. In the next validation attempt, 10 particles are randomly placed in each domain allowing overlaps. The UNIT reproduces this complicated translation with an accuracy over 98% (Fig. 1c, d).

The ability of the UNIT to predict changes in new microstructures which were not used in the training dataset was tested in the second toy problem. Samples with various porosities controlled by the numbers of particles were used for training. The training data were samples with 11–20 and 191–200 randomly placed particles, representing high and low porosity structures, respectively. Testing was conducted for the samples with full spectra of porosities, i.e. 1–200 particles randomly placed in the domain. The original size of the prepared data was 256 × 256 pixels, while the training was conducted with the randomly extracted patches of 64 × 64 pixels as shown in Fig. 2a. Prediction of the trained UNIT network are shown in Fig. 2b. The UNIT provides excellent prediction not only for the low and high porosity structures, but it works well for all samples with pixel-wise accuracy over 98% (Fig. 2c). From Fig. 2d, it is seen that phase fractions are almost ideally reproduced.

The third validation toy problem starts with an assumption that the simplified reduction process depends on the external condition which determines the diameter of the green particles in Domain B. The diameter of the green particles in Domain A was always kept as 20 pixels, while the diameter of particles in Domain B was varied between 4 – 19 pixels. This can be understood as the dependence of microstructure on hydrogen exposure time. The UNIT network was modified, in which additional information of the condition was included as the input to both generator and discriminators. Examples for five different conditions, i.e. green particle diameters of 5, 8, 11, 14 and 17 pixels, were used for network training as shown in Fig. 3a. The prediction results for the trained C-UNIT network are shown in Fig. 3b, c for the new conditions not used in the training process. As shown in Fig. 3d, C-UNIT can predict well the dependance not only for data from the training dataset, but it also successfully predicts for the new conditions. In addition, as shown in Fig. 3e, the generalization capability of the C-UNIT network was tested for four different microstructures with various porosities. The test samples with 20, 40, 80 and 150 particles in Domain A have porosities of 70, 54, 29 and 14%, respectively. Global pixel-wise accuracy over 95% is achieved for samples with high to middle porosity structures. The C-UNIT prediction for the sample with initial porosity of 14% has discrepancies compared with the grand truth structure even though the overall pixel-wise accuracy is over 85%. This is due to the significant structural difference from the training data, in which the network was trained with samples with porosity of approximately 50%. The particles highly overlap with each other in the dense samples, and this type of training examples have not been provided during the training. In order to improve the results for dense microstructures, it is desired to provide similar examples for network training. In addition, the prediction accuracy of large green particles with diameter of 19 pixels is slightly lower than other conditions. The training data contained particles only up to 17 pixels. Thus, the 19 pixel particles are beyond the training range. As illustrated by these facts, extrapolation using a neural network trained on a specific data range can potentially lead to errors.

Microstructure change during NiO-YSZ reduction

Firstly, the UNIT network was applied to predict NiO reduction process at 800 oC. The real as-sintered NiO-YSZ and as-reduced Ni-YSZ microstructures are shown in Fig. 4a. The microstructures were taken from two different samples in the same fabrication batch. Artificially reduced Ni-YSZ structure is the output of the trained UNIT network. Prediction is conducted for the same NiO-YSZ input data, as it was used for training UNIT. Even though the training was conducted with smaller data patches, the UNIT network can predict changes in the new microstructure of any size. The YSZ phase is not changed as it is assumed that YSZ backbone is immobile during the reduction process. The theoretical composition in Fig. 4b is calculated from the theoretical volume change of NiO into Ni assuming full reduction of NiO using the as-sintered sample before reduction. Good agreement is observed between the volume fractions of real and artificially reduced samples. Increase in porosity represents the phase change from NiO to Ni. The UNIT reduced Ni-YSZ has not only a good visual agreement with real Ni-YSZ, but also shows good agreement in statistical characteristics as shown in Fig. 4b–e. UNIT can reproduce well the volume fractions, surface and TPB densities, which are crucial for electrochemical performance.

Fig. 4: Prediction of microstructure change during NiO-YSZ reduction.
figure 4

a Real NiO-YSZ, artificially reduced Ni-YSZ and real Ni-YSZ microstructures (real data are taken from the training dataset). Comparison between sintered, real reduced and artificially reduced (UNIT) microstructures: b volume fraction, c surface area density, d phase boundary density, and e TPB density.

Additional validation is carried out with new NiO-YSZ microstructure images that were not part of the UNIT training dataset, as shown in Fig. 5. The predictions with new data are comparable to the prediction with training images in terms of both visual and statistical parameter accuracy. The slight difference in the YSZ surface area of artificially reduced sample S2 is due to the quality of the original input data. The TPB density is slightly underestimated by approximately 6%.

Fig. 5: Validation with new NiO-YSZ microstructures.
figure 5

a Artificially reduced Ni-YSZ from real NiO-YSZ samples S1 and S2. Comparison between sintered, real reduced and artificially reduced (UNIT) microstructures: b volume fraction, c surface area density, d phase boundary density, and e TPB density.

Prediction of new microstructures

In this subsection, UNIT network was trained using samples Ni50GDC50 (Ni : GDC = 50: 50 vol%) and Ni60GDC40-IP (Ni: GDC = 60: 40 vol% with low porosity), where IP denotes a sample made by isostatic pressing. These samples have different Ni and pore volume fractions as shown in Fig. 6a. Volume fraction and surface area density were selected as the representative microstructural parameters for the evaluation. The trained UNIT can predict microstructural change during reduction very well for samples used in the training dataset. Moreover, it can also generate microstructures with new properties as shown for the Ni60GDC40 sample which is not used for training. This implies that the network can be trained with limited number of real experimental data, and can be utilized for predicting variable parameters.

Fig. 6: Prediction of NiO reduction in NiO-GDC microstructures with varied properties.
figure 6

a Real NiO-GDC, artificially reduced Ni-GDC and real Ni-GDC microstructures with various compositions. Comparison between sintered, real reduced and artificially reduced (UNIT) microstructures: b volume fraction and c surface area density.

Reduction at various temperatures

It is well known that the reduction temperature strongly influences the microstructures of Ni-based composites. Reduction of bulk NiO crystals by hydrogen has been studied at different temperatures in the range of 270 and 1320 °C75, where complex porous network appeared at reduction temperature below 900 °C. In the present study, NiO-YSZ was reduced at 500, 600, 800 and 1000 °C as shown in Fig. 7a, which follows the abovementioned behavior. Although the volume fractions of all samples after reduction are very close to the theoretical, the microstructures strongly depend on the reduction temperature. The samples reduced at lower temperature are characterized by large number of small pores inside the Ni particles, while higher reduction temperature results in larger pores.

Fig. 7: NiO-YSZ samples reduced at various temperatures.
figure 7

a Real and b artificial microstructures, c volume fractions and d surface area density.

The C-UNIT network was trained with Ni-YSZ samples reduced at various temperatures as shown in Fig. 7b. As can be seen in Fig. 7c, d, C-UNIT can predict well the dependence of microstructure on temperature, not only for the temperatures used for training, but also for temperatures which were not used for training. The prediction accuracies are similar for the data in both the training dataset and the testing set. However, there is a difference in the shape of the nano-pores between real and artificially reduced samples at 500 °C. This may be due to the structure of the UNIT network, which relies solely on convolutional layers. In general, U-net configuration is superior to the encoder-decoder architectures in preserving small image details. Modifying the structures of UNIT network with skipped connections may also help to overcome this issue. In addition, the discrepancy may be due to the variation of YSZ phase fraction between different samples in the training data, which might be caused by the size of training image or by the accuracy of segmentation.

Predictions for temperatures in the range of 400–1100 °C are shown in the Supplementary Video 1. It is clear that the reduction initially progresses within the NiO grains by forming complicated inner pore network. The changes in shape and size from the original NiO grains are minimal. At higher temperature, the inner pores disappear and Ni particles coarsen, resulting in increased size of pores. The different final structures are the result of competition between chemical reaction and Ni sintering, which are both enhanced at higher reduction temperature.

Time-dependent reduction process

Shimura et al. 73 observed dynamic changes of Ni-YSZ microstructures during oxidation and reduction. The samples were reduced at 800 oC in pure H2, and the microstructures are observed at different times. The pore structure formation inside the reduced Ni was observed in the initial period. The reduction is almost completed within 30 s of exposure to H2 gas. After the reduction reaction, sintering of Ni phase progresses and the pores inside Ni disappears.

Here, an attempt is taken to model the time dependent reduction process by the C-GAN network. NiO-YSZ samples were reduced for 5, 15, 30 and 3600 s, as shown in Fig. 8a. The pores inside NiO grains in the initial NiO-YSZ structure are due to the redox treatment prior to the reduction test. The details of experimentation can be found in our previous work73. Ni and NiO phases are not distinguished in this study, as it was not possible to identify them from the collected SEM images

Fig. 8: Reduced NiO-YSZ samples at various times.
figure 8

a Segmented SEM images and b training data for C-UNITsintering and C-UNITreduction networks.

Two independent C-UNIT networks, i.e. C-UNITreduction and C-UNITsintering, were trained to predict initial reduction and following Ni sintering processes. Two data points, 30 and 3600 s, were used for C-UNITsintering, and microstructures at 0, 5, 15 and 30 s were used to train C-UNITreduction. The conditional UNIT architecture was chosen for C-UNITsintering instead of conventional UNIT to simulate intermediate time points, even though only two training data are available. The details of the training data in each domain are given in Fig. 8b.

The results of combined C-UNITsintering and C-UNITreduction networks are shown in Fig. 9 and in the Supplementary Video 2. As the generalization ability of UNIT network was already demonstrated in previous subsections, only the prediction with training data is shown. The predicted volumes fractions and surface areas are in reasonable agreement with the real structures. The error in the predicted values is shown in Supplementary Fig. 1. The largest error is observed in the predictions of pore and YSZ, which falls within the range of 10%. This resulted from the errors in the initial phase segmentation of training data. NiO grains have large inner pores, which are difficult to segment as resin could not be infiltrated during FIB-SEM measurement. Moreover, the variation between the samples for training is quite large, i.e. YSZ fraction varies significantly in the training data which should be the same for all the samples. As YSZ immobility constraint is imposed, this variation is compensated by the shapes of other phases. Decrease of NiO and increase of pore represent the reduction process. The most significant volume change occurs during initial 5 s of the reduction process, i.e. the size of NiO particles decreases from the circumference and the size of the original large pores increase. At the same time, inner pores appear inside the NiO particles. From 5 to 30 s, reduction progresses and the size and number of inner pores increase. The size of large pores does not change, which implies that the reduction process proceeds mostly at the core of NiO particles. This is consistent with the complex inner pore structures reported in the literature. Reduction of NiO is followed by sintering of Ni, which is well reproduced by the C-UNIT. Large Ni particles grow on the expense of small Ni structures. In addition, the C-UNIT network can predict stable Ni-YSZ interfaces.

Fig. 9: Results of training network for time-dependent prediction.
figure 9

a NiO-YSZ reduced structures predicted by C-UNITsintering and C-UNITreduction, comparisons of b volume fraction and c surface area density.

Future perspectives

A framework for predicting microstructure changes in SOFC electrodes using a conditional unsupervised image-to-image translation network with physical constraints is proposed. UNIT and C-UNIT networks were used to predict the reduction process of Ni-based SOFC electrodes from unpaired training data. The networks were trained with image patches of NiO-YSZ and Ni-YSZ microstructures before and after reduction. A physical constraint was included in the generator loss function to constrain YSZ and GDC backbones, which are assumed to be immobile during the reduction process.

The reduced microstructures predicted by the UNIT show excellent visual and quantitative agreements with the real samples. Furthermore, UNIT can predict microstructural changes in samples that were not used for training.

C-UNIT which is conditioned by temperature can predict the microstructure changes at various reduction temperatures. In addition, the C-UNIT can predict time-dependent microstructure evolution process. The proposed method has a potential to predict various microstructure evolution processes not only NiO reduction, but also many processes which are important for degradation studies.

This study investigated the applicability of physically constrained UNIT and C-UNIT in 2D problems. However, in order to fully understand the SOFC degradation processes related to their microstructural changes, it is necessary to analyze 3D data. The future work will focus on the extension of the present 2D UNIT to 3D space. This will provide more accurate analysis and potential for precise 3D electrochemical calculations. However, there are significant challenges associated with the extension to 3D, primarily related to data collection and network training. Measuring 3D data requires more complex experimental techniques such as FIB-SEM, which takes 12 to 36 hours to measure a single microstructure. This problem can be potentially solved by applying machine learning to generate artificial 3D models from 2D cross-sectional images31. In addition, it is anticipated that the proposed data patching and augmentation techniques will be helpful in reducing the amount and volume of the raw sample set required. Another challenge which must be considered is the considerable time and memory resources required for the training the 3D UNIT network.

This study aims to demonstrate the usability of UNIT and C-UNIT to predict complex shape change phenomena in porous microstructures. Further improvements of the network structures, e.g. hyperparameters and learning strategy are necessary. Moreover, extension of the methodology may include combination with the LSTM networks. Additionally, UNIT and C-UNIT networks can be very useful for generating training data for physics informed neural networks (PINN) which requires paired data76. Utilizing PINN networks for the inverse problems can identify the physical parameters describing degradation processes.

We demonstrated that UNIT and C-UNIT can successfully predict NiO reduction process. It is expected that this technique can be further utilized for other microstructural changes, e.g. Ni migration during SOEC operation, Ni coarsening under high current density and humidity, carbon deposition, segregation of impurities, ceramic support cracking, etc. Moreover, present approach can be applied to other porous multiphase materials such as polymer electrolyte fuel cells, batteries, photovoltaic cells, catalytic reactors and membranes, etc.

Methods

Physically constrained UNIT network

A schematic illustration of the UNIT network is presented in Fig. 10. The UNIT network is a type of generative adversarial neural network which consists of single generator and two discriminators. Two images represent source and target domains, i.e. NiO-YSZ (x1) and Ni-YSZ (x2) microstructures, are used as the UNIT generator input. The UNIT network generates four output images, among which two are source-to-target (x11→2) and target-to-source (x22→1) translated images. In this context, the source-to-target translation represents an artificial reduction from NiO-YSZ to Ni-YSZ (Fig. 10d), and the target-to-source translation can be understood as the artificial “un-reduction” of Ni-YSZ to NiO-YSZ. The “un-reduction” is not the oxidation process, but rather can be understood as the inverse translation operator, which shows how the sample used to look like before reduction. The other two UNIT generator outputs are self-reconstructed images, i.e. source-to-source (x11→1) and target-to-target (x12→2) translated images. The generator network consists of three subnetworks in both encoding and generative parts to follow variational autoencoder (VAE) architecture. An additional input condition C, representing the reduction temperature or reduction time, is included as the input for C-UNIT network. The specifications of the generator and discriminator network architectures are given in Tables 1, 2, respectively.

Fig. 10: Structure of UNIT network.
figure 10

a UNIT generator network, b sintering discriminator and c reducing discriminator. Image patches processed by UNIT for d artificial reduction and e artificial un-reduction (the YSZ phase is assumed to be immobile).

Table 1 Generator architecture
Table 2 Discriminators architecture

The multifactor loss function is incorporated to jointly solve learning problems of VAE and GAN for image reconstruction, translation and cycle-reconstruction streams, as well as to fulfill physical constraint. Generator loss function \({L}_{{\rm{gen}}}\) is calculated as:

$$\begin{array}{ll}{L}_{{\rm{gen}}}={{w}_{1}\bullet L}_{{\rm{self}}}+{{w}_{2}\bullet L}_{{\rm{cycle}}}+{{w}_{3}\bullet L}_{{\rm{KL}}}+{{w}_{4}\bullet L}_{{\rm{KLcycle}}}\\\qquad\quad+\,{{w}_{5}\bullet L}_{{\rm{adv}}}+{{w}_{6}\bullet L}_{{\rm{physic}}},\end{array}$$
(1)

where individual losses are defined as:

$${\rm{self}}-{\rm{reconstruction\; loss}}:{L}_{{\rm{self}}}={{||}{X}_{{\rm{real}}}-{X}_{{\rm{self}}-{\rm{reconstructed}}}{||}}_{1},$$
(2)
$${\rm{cycle\; consistency\; loss}}:{L}_{{\rm{cycle}}}={{||}{X}_{{\rm{real}}}-{X}_{{\rm{cycle}}-{\rm{reconstructed}}}{||}}_{1},$$
(3)
$${\rm{hidden\; KL\; loss}}:{L}_{{\rm{KL}}}={\left({X}_{{\rm{esa|self}}-{\rm{reconstructed}}}\right)}^{2},$$
(4)
$${\rm{cycle\; hidden\; KL\; loss}}:{L}_{{\rm{KLcycle}}}={\left({X}_{{\rm{esa|cycle}}-{\rm{reconstructed}}}\right)}^{2},$$
(5)
$${\rm{and\; adversarial\; loss}}:{L}_{{\rm{adv}}}={\left(1-{\bar{X}}_{{\rm{trans}}}\right)}^{2}.$$
(6)

Additional physics loss \({L}_{{\rm{physic}}}\) is included in the generator loss to enforce the physical constraint, which is the immobility of ceramic backbone during the reduction process:

$${L}_{{\rm{physic}}}={{||}{X}_{{\rm{real|YSZ}}}-{X}_{{\rm{trans|YSZ}}}{||}}_{1},$$
(7)

where \({{||}\bullet {||}}_{1}\) represent the L1 distance between the input images and the translated images.\(\,{X}_{{\rm{real}}}\), \({X}_{{\rm{self}}-{\rm{reconstructed}}}\) and \({X}_{{\rm{cycle}}-{\rm{reconstructed}}}\) are real, self-reconstructed and cycle reconstructed images, respectively. \({X}_{{\rm{esa|self}}-{\rm{reconstructed}}}\) and \({X}_{{\rm{esa|cycle}}-{\rm{reconstructed}}}\) are activations of the encoder shared block in the self-reconstruction and cycle reconstruction streams, respectively. \({\bar{X}}_{{\rm{trans}}}\) represents the predictions of the discriminator on generated images. \({X}_{{\rm{real|YSZ}}}\) and \({X}_{{\rm{trans|YSZ}}}\) represent the YSZ phase in the real and respective translated images.

The discriminators’ loss function \({L}_{{\rm{dis}}}\) are defined as:

$${L}_{{\rm{dis}}}={\left(1-{\bar{X}}_{{\rm{real}}}\right)}^{2}+{\left(0-{\bar{X}}_{{\rm{trans}}}\right)}^{2},$$
(8)

where \({\bar{X}}_{{\rm{real}}}\) and \({\bar{X}}_{{\rm{trans}}}\) are the predictions of the discriminator on real and generated images, respectively. The discriminator loss function is calculated separately for as-sintered (NiO-YSZ) and as-reduced images (Ni-YSZ).

During the training step, patches of real microstructures are randomly extracted from full-sized SEM image to increase the training dataset. In addition, geometrical augmentation is applied by random rotation and flipping of the images as shown in Supplementary Fig. 2. The training is conducted with the phase-segmented images and the network uses “one-hot” encoding method proposed by Gayon-Lombardo et al.33. The image patches have 64 × 64 and 256 × 256 pixels for toy validation and real microstructures, respectively. The numbers of incorporated training images and their size are shown for each of the analyzed cases in Tables 3, 4 for toy-problems and real NiO-based electrodes, respectively. For the toy problems, training and validation data were prepared by randomly placing the particles in the domain. The particles were independently placed at different positions for oxidized and reduced domains for the training data. On the other hand, for the validation data, the positions of the particles were identical in order to check the accuracy of the UNIT network. For the reduction prediction of the real structure, segmented SEM images were used as training and validation data. Raw SEM images of NiO-YSZ and Ni-YSZ anodes are shown in Supplementary Figs. 35 in the supplementary materials. Only the homogonous regions of functional layers were used for training, and the size of training images was 700–1300 µm2. The validation using real microstructures is challenging, as it is not possible from SEM to obtain an identical sample pair before and after reduction. Therefore, statistical parameters are compared between real and artificially reduced samples. Two types of input data were used for NiO-YSZ electrodes, i.e. images used for training and different microstructure taken from the same sample. In the case of NiO-GDC electrodes, two samples were used for training and another sample is used for validation. For time-dependent NiO-YSZ reduction, only one initial microstructural image was available. Adaptive moment estimation optimizer (Adam) was used for training using GPUs (NVIDIA A100), whose details are given in Table 5. Single training epoch took 1.2 s, where the networks were trained for 1000 – 5000 epochs. Utilizing the trained network for processing a new microstructure took 0.5 s for a single image. After training, the generator can process larger size of microstructures than the patch size incorporated in the training process. The trained UNIT generator can artificially produce reduced Ni structures from any new sintered sample image. The source-to-target translation path for the artificial reduction is represented by the red arrows in Fig. 9a.

Table 3 Number and size of training images for toy problems
Table 4 Number and size of training images for reduction of Ni-based anodes
Table 5 Details of the training procedure

Reduction experiment

Microstructure evolutions in the reduction process were investigated using two SOFC anode cermets as shown in Fig. 11. The electrolyte supported SOFCs with Ni-YSZ (Kusaka, Japan)77 and Ni-GDC (AGC Seimi, Japan and ShinEtsu, Japan)78 fuel electrodes were fabricated by screen-printing slurries on commercial YSZ pellets (Tosoh, Japan). The Ni-GDC electrode composition was controlled by mixing the powders at ratios of Ni : GDC = 50 : 50 vol.% (Ni50GDC50) and Ni : GDC = 60 : 40 vol.% (Ni60GDC40). Another sample Ni60GDC40-IP with a composition of Ni : GDC = 60 : 40 vol.% was prepared by isostatically pressing dried electrode green body at 200 MPa for 30 min to reduce porosity78. Pure NiO current collection layer was screen-printed on the top of the fuel electrodes. Anodes and current collector layers were sintered in air at 1350 oC for 3 h. The counter electrode was fabricated with GDC barrier layer and LSCF (Fuel Cell Materials, USA) cathode on the other side of the electrolyte. The anode active area was 0.785 cm2 and the thickness of the NiO-YSZ (Ni-GDC) electrode and NiO current collector layer were approximately 18-20 and 5 µm, respectively.

Fig. 11
figure 11

Experimental workflow.

The electrodes were reduced with two steps in the SOFC testing rig (Bel-SOFC, MicrotracBel, Japan). Firstly, 50 sccm of H2 : N2 = 5 : 95% gas mixture was supplied for 30 min, and then pure hydrogen was provided. The time of pure hydrogen exposure was adjusted to 6, 4, 0.5, 0.5 h depending on the reduction temperatures conducted at 500, 600, 800, and 1000 oC, respectively. The open circuit voltage was monitored to ensure complete NiO reduction. Additionally, full reduction of NiO was confirmed by conducting EDX measurement.

An additional experiment was conducted to investigate the influence of reduction time for a given temperature. The cell was operated for 20 h prior to the oxidation-reduction experiment at 800 oC. Then, the anode was oxidized by supplying pure oxygen, flushed with nitrogen, and finally exposed to pure hydrogen. The hydrogen reduction time was set as 5 s, 15 s, 30 s, 20 mins and 1 h. The samples were quenched to room temperature with supplying nitrogen. More details can be found in our previous work73.

As-sintered and as-reduced samples were embedded in epoxy resin (Epofix, Struers), polished (Cross section polisher, JEOL, SM-09020CP), and observed with the scanning electron microscope (SEM, JSM-7001F, JEOL). The cross-sectional images were segmented by patch convolutional neural network in the encode-decoder configuration54,79. The microstructure parameters were calculated from the 2D cross-sections by TauFactor Matlab library80. The 3D value of TPB density was estimated from 2D images by the stereological formula as described in ref. 81.