Three-dimensional coherent X-ray diffraction imaging via deep convolutional neural networks

Wu, Longlong; Yoo, Shinjae; Suzana, Ana F.; Assefa, Tadesse A.; Diao, Jiecheng; Harder, Ross J.; Cha, Wonsuk; Robinson, Ian K.

doi:10.1038/s41524-021-00644-z


Article
Open access
Published: 28 October 2021


npj Computational Materials volume 7, Article number: 175 (2021)


Abstract

As a critical component of coherent X-ray diffraction imaging (CDI), phase retrieval has been extensively applied in X-ray structural science to recover the 3D morphological information inside measured particles. Despite meeting all the oversampling requirements of Sayre and Shannon, current phase retrieval approaches still have trouble achieving a unique inversion of experimental data in the presence of noise. Here, we propose to overcome this limitation by incorporating a 3D Machine Learning (ML) model combining (optional) supervised learning with transfer learning. The trained ML model can rapidly provide an immediate result with high accuracy which could benefit real-time experiments, and the predicted result can be further refined with transfer learning. More significantly, the proposed ML model can be used without any prior training to learn the missing phases of an image based on minimization of an appropriate ‘loss function’ alone. We demonstrate significantly improved performance with experimental Bragg CDI data over traditional iterative phase retrieval algorithms.


Introduction

Coherent X-ray diffraction imaging (CDI) has been widely utilized to characterize the internal three-dimensional (3D) structure of single particles [1,2,3,4]. In particular, Bragg CDI has emerged as a promising technique for 3D strain imaging of crystalline particles [5,6,7,8,9,10,11]. As modern X-ray sources, such as diffraction-limited storage rings and fourth-generation X-ray free-electron lasers, are being developed worldwide to provide higher coherent flux densities, time-resolved and in-situ CDI experiments for single-particle imaging are becoming increasingly capable of exploring the dynamical phenomena of small particles, such as driven melting, thermal fluctuation, driven phase transitions, catalysis, and high-pressure phenomena [12,13,14,15,16,17]. Because the phase information is lost in measured coherent X-ray diffraction signals, phase retrieval [18,19,20,21] is a key component of CDI, needed to reconstruct real-space 3D images with morphological details from the measured signals.

Until now, the most extensively used approaches for CDI phase retrieval have been iterative methods, such as the hybrid input-output (HIO) method by Fienup [19], the difference map (DM) by Elser [22], and the relaxed averaged alternating reflection (RAAR) method by Luke [23]. In general, these iterative phase retrieval methods can be expressed as successive projections [22,24]. Theoretically, for a finite object, when the modulus of its Fourier Transform is well oversampled, a unique solution is guaranteed for these methods [19,20]. However, for experimental data with inherent noise, these projection-based methods are found to struggle with local minima, which leads to an ambiguous, rather than unique, solution [25]. Thus, when inverting coherent X-ray diffraction patterns, conventional iterative methods typically need thousands of iterations and switching between algorithms to converge confidently to a reproducible solution, and they require tuning of many algorithmic parameters and expert strategies [24,26,27]. Because these methods are based on projections, the calculated error is usually used only to monitor convergence and rarely as feedback to adjust the related algorithmic parameters, which makes these methods sensitive to their initialization conditions.

For phase retrieval, deep-neural-network-based ML methods have recently shown a significant advantage in providing rapid reconstruction results in a CDI experiment [28,29,30]. There has been rapid recent progress in 2-dimensional (2D) phase retrieval using convolutional neural networks (CNN) [28,30]. Meanwhile, an adaptive ML-based approach for 3D phase retrieval has been demonstrated by using spherical harmonics [29]. However, so far, most of the proposed neural networks employ a supervised learning approach, matching input diffraction patterns in reciprocal space to output particle morphological information in real space, which usually needs large training datasets to train the neural network so that it can represent a universal approximation function. When a deep neural network is trained with limited data, its ability to generalize as a universal function is reduced, as seen in the accuracy of the reconstructed results, and a subsequent refinement procedure is needed to follow the supervised learning [28,29]. Furthermore, in practice, it is difficult to obtain enough ground-truth experimental coherent X-ray diffraction data for training. When applied to experimental data, an ML model trained with less data than ideal may also perform poorly and may need very lengthy experiment-specific retraining.

Here, we demonstrate a comprehensive 3D CNN-based approach to reconstruct the interior complex morphological information of a range of nanoparticles from their measured coherent X-ray diffraction patterns. When trained in a supervised learning approach, this CNN model can be applied to real-time 3D single-particle imaging experiments, for example, using an X-ray Free-electron Laser (XFEL) [31]. Further, while seeking to improve the accuracy of the result, we find that the trained CNN model can also be used in an unsupervised transfer learning mode. We demonstrate significantly improved accuracy with both simulated data as well as experimental data. Additionally, when recovering the 3D structure of a particle with the unsupervised learning approach, we find no distinguishable difference between the quality of the obtained results whether the pre-trained ML model is used (i.e., transfer learning) or the randomly initialized model is used, except for their convergence speed. This is important in situations where the acquisition of training data is challenging. With the unsupervised learning approach, the flexibility of the self-defined loss function in the CNN model makes this method more robust to coherent X-ray diffraction data of lower quality than the traditional approach.

Results and discussion

ML model and data sets

Generally, for coherent X-ray diffraction imaging experiments, whether forward-scattering CDI or Bragg CDI [32,33], the measured X-ray diffraction intensity $I\left( {{{\mathbf{Q}}}} \right)$ is proportional to the modulus squared of the Fourier Transform of a complex field $\rho \left( {{{\mathbf{r}}}} \right)$:

$$I\left( {{{\mathbf{Q}}}} \right) = \left| {{\int} {\rho \left( {{{\mathbf{r}}}} \right)e^{i{{{\mathbf{Q}}}} \cdot {{{\mathbf{r}}}}}d{{{\mathbf{r}}}}} } \right|^2,$$

(1)

where ${{{\mathbf{Q}}}} = {{{\mathbf{q}}}} - {{{\mathbf{h}}}}$, and ${{{\mathbf{q}}}} = {{{\mathbf{k}}}}_{{{\mathrm{f}}}} - {{{\mathbf{k}}}}_{{{\mathrm{i}}}}$ is the momentum transfer defined by the incident and diffracted X-ray wavevectors ${{{\mathbf{k}}}}_{{{\mathrm{i}}}}$ and ${{{\mathbf{k}}}}_{{{\mathrm{f}}}}$. Here, ${{{\mathbf{h}}}}$ is zero for a forward CDI experiment, and ${{{\mathbf{h}}}}$ is a reciprocal lattice vector of the crystal in a Bragg CDI experiment. In Eq. (1), the complex field $\rho \left( {{{\mathbf{r}}}} \right)$ is related to the local complex refractive index of a particle in a forward CDI experiment, whereas in Bragg diffraction geometry it mainly represents the local crystal lattice strain inside a particle [7,13]. In all cases, this complex-valued structure information inside the particle can be expressed as $\rho \left( {{{\mathbf{r}}}} \right) = s\left( {{{\mathbf{r}}}} \right){{{\mathrm{e}}}}^{{{{\mathrm{i}}}}\phi \left( {{{\mathbf{r}}}} \right)}$, where $s\left( {{{\mathbf{r}}}} \right)$ and $\phi \left( {{{\mathbf{r}}}} \right)$ are the amplitude and phase distributions of the measured particle, respectively.
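
As an illustration of Eq. (1), the forward model can be sketched numerically with a discrete Fourier transform. This is a minimal NumPy sketch, not the authors' code: the function name `diffraction_intensity`, the toy spherical particle, the linear phase ramp, and the zero-padding factor of 2 are all assumptions chosen for the example. NumPy's FFT uses the opposite sign in the exponent to Eq. (1), which only mirrors the pattern in ${{{\mathbf{Q}}}}$.

```python
import numpy as np

def diffraction_intensity(rho, pad_factor=2):
    """Return |FT(rho)|^2 on a grid pad_factor times larger than rho per axis."""
    padded_shape = tuple(pad_factor * n for n in rho.shape)
    # fftn zero-pads the input up to `s`, which oversamples the diffraction pattern
    field = np.fft.fftn(rho, s=padded_shape)
    # Move the zero-frequency voxel to the array centre, as on a detector
    return np.abs(np.fft.fftshift(field)) ** 2

# Toy complex particle rho = s * exp(i * phi) on a 16^3 grid (hypothetical values)
n = 16
z, y, x = np.indices((n, n, n)) - n // 2
s = (x**2 + y**2 + z**2 <= 5**2).astype(float)   # amplitude: a small sphere
phi = 0.3 * x / n                                # phase: a weak linear ramp
rho = s * np.exp(1j * phi)

intensity = diffraction_intensity(rho)
print(intensity.shape)
```

Only `np.sqrt(intensity)` would be passed to the network; the phase of the Fourier transform is what has to be recovered.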

The goal of a CDI experiment is to numerically obtain this complex particle density function uniquely in real space, such that the modulus squared of its Fourier Transform best matches the experimental coherent X-ray diffraction intensity distribution of the measured particle [20]. As shown in Fig. 1, our deep neural network for 3D coherent X-ray diffraction imaging adopts the typical ‘encoder-decoder’ architecture. It takes the amplitude of the 3D coherent X-ray diffraction pattern in reciprocal space as input and outputs the real-space amplitude and phase images. The model is composed entirely of 3D convolutional, max-pooling, and upsampling layers and has three main parts: a 3D convolutional encoder that encodes the input X-ray diffraction data through a series of convolutional blocks, followed by two decoders that use the encoded result to generate the real-space amplitude and phase information of the measured particles. In this 3D CNN, the leaky rectified linear unit (LRLU) [34] is used for all activation functions except for the final 3D convolutional layer, where the rectified linear unit (RLU) activation function is used. The modules used in Fig. 1 to connect the input from the previous layer to the next layer’s output are convolution blocks (3 × 3 × 3 convolution + LRLU + BN, where BN refers to batch normalization), followed by convolution blocks (3 × 1 × 1 convolution + 1 × 3 × 1 convolution + 1 × 1 × 3 convolution + LRLU + BN). It should also be mentioned that the size of the output particle image arrays (i.e., amplitude and phase) in each dimension is half the size of the input diffraction data, to keep the problem overdetermined.

**Fig. 1: Overall scheme of 3D deep neural network for single-particle coherent diffraction imaging inversion.**

Supervised learning approach

In the deep neural network supervised learning method, the quantity and diversity of the training dataset directly affect the network’s performance when unknown data are presented. In the real world, the complex structure $\rho \left( {{{\mathbf{r}}}} \right) = s\left( {{{\mathbf{r}}}} \right){{{\mathrm{e}}}}^{{{{\mathrm{i}}}}\phi \left( {{{\mathbf{r}}}} \right)}$ for a particle varies a lot from particle to particle. For demonstration purposes, a shape known as a superellipsoid is used to describe the particle shape $s({{{\mathbf{r}}}})$ and a 3D Gaussian correlated profile is used to describe the corresponding phase $\phi ( {{{\mathbf{r}}}})$ distribution (see Methods for details). Then, after the generated particle is randomly rotated in real space, a 3D coherent diffraction pattern is obtained by Fourier Transformation. Only the amplitude information of the diffraction pattern is kept for training, and the phase information is discarded.

By applying this method with a wide range of random parameters, we simulated 30,000 3D diffraction patterns and used them to train the CNN model. With the corresponding particles known a priori, the proposed 3D CNN model was trained in a supervised learning approach, by solving

$$l_{{{\mathrm{s}}}} = {{{\mathrm{arg}}}}\mathop {{\min }}\limits_{\rho _{{{\mathrm{p}}}}} \left[ {\rho _{{{\mathrm{p}}}}\left( {{{\mathbf{r}}}} \right) - \rho _{{{\mathrm{g}}}}\left( {{{\mathbf{r}}}} \right)} \right],$$

(2)

where, $\rho _{{{\mathrm{p}}}}$ is the output from the CNN model, and $\rho _{{{\mathrm{g}}}}$ is the corresponding ground truth for the complex particle. For this loss function $l_{{{\mathrm{s}}}}$, which was minimized during the training, we used a combination of the relative root mean square error $\chi$, and the modified Pearson correlation coefficient $r_{{{\mathrm{p}}}}$ (see Methods for details) to measure the agreement between the output amplitude and phase images of the predicted particles with their ground truth both in real and reciprocal space. This is appropriate for diffraction data with a large dynamic range. The $\chi$ is dominated by the strong central part of the diffraction pattern. The $r_{{{\mathrm{p}}}}$ is a statistical metric that measures the linear similarity between two variables [35]. When training the 3D CNN model, the prepared training data were divided into two disjoint sets, where 95% of them were used to train the model, and the rest of them were used for validation.

Supplementary Fig. 1 shows the training and validation loss as a function of the training epoch for the 3D CNN model trained with the supervised learning approach. The validation loss decreases almost continuously. After 100 training epochs, the validation loss reaches 0.031, which illustrates that the proposed 3D CNN model can already provide a highly accurate estimation of the reconstruction. The nearly identical losses for training and validation result from an early stop placed on the training to avoid overfitting. To demonstrate the performance of this trained CNN model, Fig. 2 shows four representative predictions from test diffraction patterns that were not used for training the CNN model. The predicted amplitude and phase of the particles show excellent agreement with their ground truth. This CNN model is an ML method of phase retrieval, which provides a very fast inversion of a diffraction pattern (~9 ms computation on our hardware). Unlike an iterative phase-retrieval method, this could be very useful in a real-time 3D CDI experiment, for example to capture movies of a moving or evolving object.

**Fig. 2: Performance of trained 3D CNN model in testing data.**

When the CNN model learns to match input coherent diffraction data to output particle data, it not only learns to solve the data fitting problem but also incorporates comprehensive prior information (such as the support size or phase range of a particle) in a data-driven manner [36]. Perhaps the greatest strength of the ML-based phase retrieval method is that the model can learn far more complicated prior information. The ability to ultimately learn both the best possible inverse solver and the specific prior information makes the model very powerful.

Unsupervised learning approach

Since the supervised ML-based approach is data-driven, the predicted results might sometimes miss subtle features in the data that were not captured by the training. To improve the quality of the obtained reconstruction, we have developed a refinement procedure using an unsupervised transfer learning approach. This refinement improves the reconstruction of a single diffraction pattern at a time. The problem of phase retrieval for coherent X-ray imaging experiments can also be considered to be an optimization problem [22,37], expressed as

$$l_{{{\mathrm{u}}}} = {{{\mathrm{arg}}}}\mathop {{\min }}\limits_{\rho _{{{\mathrm{p}}}}} \left[ {\left| {FT\rho _{{{\mathrm{p}}}}\left( {{{\mathbf{r}}}} \right)} \right|^2 - I_{{{\mathrm{m}}}}\left( {{{\mathbf{Q}}}} \right)} \right],$$

(3)

where, $l_{{{\mathrm{u}}}}$ is the loss function for unsupervised learning, which describes the difference between the numerically obtained particle $\rho _{{{\mathrm{p}}}}({{{\mathbf{r}}}})$ and the measured coherent X-ray diffraction intensity $I_{{{\mathrm{m}}}}({{{\mathbf{Q}}}})$. $FT$ represents the Fourier Transform operation.

In Fig. 3, we demonstrate that the proposed unsupervised transfer learning approach can further improve the reconstruction quality and reach a high accuracy, comparable with the best iterative algorithms. As shown in Fig. 3a, we demonstrate this approach with a 3D diffraction pattern generated with parameters different from those used to generate the training data. The corresponding real-space particle is given in Fig. 3b. As can be seen from Fig. 3a, b, while the particle shape is symmetrical, the broken symmetry of the diffraction pattern results from the internal asymmetry of the real-space 3D phase distribution, which is common in Bragg CDI from particles with strain distributions. The trained 3D CNN model yields the reconstructed amplitude and phase structure shown in Fig. 3c, with a corresponding estimated error of 0.13. Compared with its ground truth in Fig. 3b, the trained CNN model gives a relatively poor prediction, indicating that a refinement is necessary because features of the input diffraction are not fully captured by the training. Here, the pre-trained CNN model from the supervised learning was then used in the unsupervised transfer learning to further refine the reconstructed result using the loss function defined in Eq. (3) (see Methods for details). Figure 3e shows the result of this unsupervised transfer learning approach, and Fig. 3f shows the trend of the corresponding loss (or error metric) with training epoch. After this refinement approach on a noise-free diffraction pattern of a test particle, significant improvement was achieved, where the error decreased from 0.13 to 2 × 10⁻⁶. The time cost for the transfer learning is ~28.67 ms per epoch (i.e., ~3.19 h for 4 × 10⁴ epochs) on our hardware.

**Fig. 3: Representative results using 3D CNN model in unsupervised learning mode.**

To explore the importance of pre-training for this CNN model in the unsupervised learning approach, we further tested the model on the same coherent X-ray diffraction data using randomly initialized bias and weight parameters (i.e., without transfer). Figure 3g, h shows the diffraction pattern and particle obtained with this method, respectively, with the corresponding loss given in Fig. 3f. Comparing Fig. 3g, h with Fig. 3d, e, it can be seen that there is no obvious difference between the final reconstructed results. This is a significant discovery: the neural network is able to retrieve phases directly without pre-training. Fig. 3f shows that the loss converges faster for the pre-trained CNN model than for the model with random initialization; however, there is no significant difference between the final reconstructed results. To further quantify this effect on the final results, Fig. 3i presents the Fourier Spectral Weight (FSW) of the reconstructed results from the two different methods, which is obtained by integrating the reconstructed diffraction amplitude of the predicted particle over shells of constant $\left| {{{\mathbf{Q}}}} \right|$. As shown in Fig. 3i, there is no noticeable difference between the FSWs from the two predicted results, which indicates that the two methods agree quite well at all spatial frequencies.
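
The shell integration behind the FSW can be sketched as follows. This is a hedged NumPy illustration only: the function name `fourier_spectral_weight`, the use of pixel units for $\left| {{{\mathbf{Q}}}} \right|$, the number of shells, and the assumption that the diffraction amplitude is centred in the array are all choices made for this example and are not specified in the text.

```python
import numpy as np

def fourier_spectral_weight(amplitude, n_shells=16):
    """Sum a centred 3D diffraction amplitude over shells of constant |Q| (pixel units)."""
    centre = [n // 2 for n in amplitude.shape]
    grids = np.indices(amplitude.shape)
    # Distance of every voxel from the centre of the pattern
    q = np.sqrt(sum((g - c) ** 2 for g, c in zip(grids, centre)))
    edges = np.linspace(0.0, q.max(), n_shells + 1)
    # Shell index of every voxel; clip so that q.max() lands in the last shell
    shell = np.clip(np.digitize(q, edges) - 1, 0, n_shells - 1)
    fsw = np.bincount(shell.ravel(), weights=amplitude.ravel(), minlength=n_shells)
    return edges, fsw

# Stand-in for a reconstructed diffraction amplitude (random values, for illustration)
rng = np.random.default_rng(0)
amp_a = rng.random((16, 16, 16))
edges, fsw_a = fourier_spectral_weight(amp_a)
print(fsw_a[:4])
```

Calling the function on the diffraction amplitudes of two reconstructions and plotting both `fsw` curves against the shell edges gives a comparison of the kind described for Fig. 3i.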

Performance of 3D CNN model on experimental data

Since the internal structure of a crystalline particle is usually unknown in CDI experiments, it is vital that our proposed ML approach gives a credible reconstruction result for phase retrieval in the presence of unavoidable noise. With CDI experiments, there is little prior knowledge of the structure available for building a training dataset. Figure 4a–d shows isosurface renditions of four very different experimental Bragg coherent X-ray diffraction patterns of individual SrTiO₃, BaTiO₃, Pd, and Au nanocrystals (see Methods for details), which were measured at beamline 34-ID-C of the Advanced Photon Source using methods reported by Robinson & Harder [7]. As seen in Fig. 4a–d, these four 3D Bragg coherent X-ray diffraction patterns have different diffraction fringe spacings and directions, indicating their distinct sizes and facets in real space.

**Fig. 4: Performance of 3D CNN model on experimental coherent X-ray diffraction data.**

By using our 3D CNN model with the proposed unsupervised transfer learning approach, the corresponding predicted results are shown in Fig. 4e–l. Figure 4e–h shows the corresponding calculated X-ray diffraction intensities, obtained as the modulus squared of the Fourier Transform of the predicted CNN model structures shown in Fig. 4i–l. There is excellent agreement between the experimental and calculated X-ray diffraction patterns, confirming that the CNN model can reach a high reconstruction accuracy. This result also extends to the randomly initialized CNN model with experimental data, for which Supplementary Fig. 2 shows the corresponding reconstructed results with the same four sets of experimental Bragg CDI data. Comparing Fig. 4 and Supplementary Fig. 2, it can be seen that there are no obvious differences between the final reconstructed particles, which endorses the capability of the untrained CNN model in the presence of experimental noise.

Given the tendency of the conventional iterative algorithms to show imperfect convergence and non-degenerate solutions with real (noisy) experimental data, as mentioned in the introduction, we investigated the reproducibility of the solutions obtained with the untrained CNN model by seeding the calculation with different random numbers. Using the coherent X-ray diffraction pattern in Fig. 4a, we repeated the reconstruction 100 times with two different methods: the CNN model with random parameters (i.e., untrained model) in the unsupervised learning approach and the conventional iterative method with random initialization (see Methods for details and Supplementary Fig. 3 for the same comparison using simulated data). As shown in Fig. 5, both methods were found to give multiple solutions, with roughly the same χ² = 0.0241 ± 0.0005 (standard deviation). The $r_{{{\mathrm{p}}}}$ value of 0.9922 for the ML method was better than the 0.9915 obtained with the iterative method. The $r_{{{\mathrm{p}}}}$ is a statistical metric that measures the similarity between two variables, whereas $\chi ^2$ is usually dominated by the strong central part of a diffraction pattern. Thus, this difference is probably because the calculated loss (or error) explicitly optimized the $r_{{{\mathrm{p}}}}$ and χ² together. It is reassuring that the conventional iterative method performs so well since it is based only on projection, although it shows a slightly wider distribution of solutions. We also noticed qualitatively that the solutions from the ML method appeared sharper and had flatter, better-distinguished facets than those from the iterative algorithms, as can be seen in Fig. 6 and Supplementary Fig. 4.

**Fig. 5: Comparison of error metrics for two different methods.**

**Fig. 6: Comparison of experimental results for unsupervised learning and conventional iterative methods.**

Based on the above results, it can be concluded that our 3D CNN approach has great potential to be applied to asymmetric data, previously untested owing to the need to solve for a complex density function, whether or not there is a large enough training dataset to obtain a well-optimized ML model. When there are enough suitable data to train the CNN model, it can be used in a real-time experiment, such as in a single-shot XFEL setup, to provide a rapid estimation of the reconstruction. When needed, a subsequent unsupervised transfer learning refinement can then bring the final predicted results to the maximum possible accuracy.

Furthermore, the unsupervised learning approach makes it possible to use an untrained ML model for ab-initio phase retrieval of the structure of sub-micron-sized particles in 3D. This will be valuable when building a suitable training dataset for an ML model is challenging. Additionally, in the unsupervised learning approach, the self-defined loss function, used as feedback to optimize the prediction, makes the ML model more powerful than the conventional iterative methods, where the calculated error during iteration is only used to monitor the convergence. In this work, we used a combination of the correlation coefficient and the $\chi ^2$ error to define the loss function for the unsupervised learning, which gave more sensitivity to the weaker features in the data than the $\chi ^2$ error alone. However, one is not limited to this form and could further extend it for different phase-retrieval problems, for example by using a likelihood function to account for the statistical error.

In conclusion, we have demonstrated a comprehensive ML approach for the 3D reconstruction of single-particle structures in real space from their experimental coherent X-ray diffraction intensities in reciprocal space. The trained CNN model can provide immediate high-accuracy results, which will benefit a real-time CDI experiment. More importantly, we found that the unsupervised learning approach was able to reach a high reconstruction accuracy, comparable with traditional methods, starting either from a pre-trained model (i.e., transfer learning) or from a purely random configuration. The flexibility of the self-defined loss function in the ML model should make the CNN model more robust to experimental coherent diffraction data when used in unsupervised learning. The quality of the images obtained in the four examples shown here is better than can be achieved with current state-of-the-art iterative algorithms. We believe our results will see very broad applications in coherent X-ray diffraction imaging and related research fields. This will also have a significant effect on neural network design, where the combination of supervised and unsupervised learning can be generalized to solve other phase retrieval problems.

Methods

3D training dataset

The 3D diffraction intensities were generated by taking the Fourier Transform of the simulated complex-valued 3D particles $\rho \left( {{{\mathbf{r}}}} \right) = s\left( {{{\mathbf{r}}}} \right){{{\mathrm{e}}}}^{{{{\mathrm{i}}}}\phi \left( {{{\mathbf{r}}}} \right)}$, created from the particle’s amplitudes $s\left( {{{\mathbf{r}}}} \right)$ and phases $\phi \left( {{{\mathbf{r}}}} \right)$. The simulated particles $\rho \left( {{{\mathbf{r}}}} \right)$ were then randomly rotated in 3D. Only the amplitude of the computed 3D diffraction intensities was retained for both training and testing of the CNN model. We used a superellipsoid shape for the amplitude $s\left( {{{\mathbf{r}}}} \right)$, whose implicit form is

$$\left( {\left| {\frac{x}{a}} \right|^{2/e} + \,\left| {\frac{y}{b}} \right|^{2/e}} \right)^{e/n} + \,\left| {\frac{z}{c}} \right|^{2/n} = 1,$$

(4)

where the exponent parameters n and e control the roundedness of the particle, and a, b, and c are the upper bounds of the particle size along the x, y, and z directions, respectively. All of these values were selected from random distributions to create a diverse set of shapes. For the phase distribution $\phi \left( {{{\mathbf{r}}}} \right)$ of the particles, a 3D Gaussian correlated profile [28,38] was used, which is given as

$$\phi \left( {{{\mathbf{r}}}} \right) = \frac{{L_{{{\mathrm{x}}}}^{1/2}L_{{{\mathrm{y}}}}^{1/2}L_{{{\mathrm{z}}}}^{1/2}}}{{\pi ^{3/4}}}{\int} {{\int} {{\int} {e^{ - \frac{{\left( {x - x{^\prime} } \right)^2}}{{2L_{{{\mathrm{x}}}}^2}} - \frac{{\left( {y - y{^\prime} } \right)^2}}{{2L_{{{\mathrm{y}}}}^2}} - \frac{{\left( {z - z{^\prime} } \right)^2}}{{2L_{{{\mathrm{z}}}}^2}}}} } } z_{{{\mathrm{u}}}}\left( {x{^\prime} ,y{^\prime} ,z{^\prime} } \right)dx{^\prime} dy{^\prime} dz{^\prime} ,$$

(5)

where $z_{{{\mathrm{u}}}}\left( {x{^\prime} ,y{^\prime} ,z{^\prime} } \right)$ obeys an uncorrelated Gaussian random distribution, and $L_{{{\mathrm{x}}}}$, $L_{{{\mathrm{y}}}}$ and $L_{{{\mathrm{z}}}}$ are the correlation lengths along the x, y and z directions, respectively. During the simulation of the 3D diffraction patterns, the phase of the simulated particle was scaled and shifted to [0, 1], and outside the particle the phase was set to zero. The generated training dataset contains a wide variety of amplitude and phase states.
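
The particle generation described by Eqs. (4) and (5) can be sketched in NumPy as below. This is an illustrative sketch under stated assumptions, not the authors' generator: the function names, grid size, and parameter values are hypothetical; the Gaussian convolution is done with an FFT and is therefore periodic; the phase is rescaled using its extremes inside the particle; and the random 3D rotation is omitted.

```python
import numpy as np

def superellipsoid(shape, a, b, c, e, n):
    """Amplitude s(r): 1 inside the superellipsoid of Eq. (4), 0 outside."""
    z, y, x = [g - (m - 1) / 2 for g, m in zip(np.indices(shape), shape)]
    f = (np.abs(x / a) ** (2 / e) + np.abs(y / b) ** (2 / e)) ** (e / n) \
        + np.abs(z / c) ** (2 / n)
    return (f <= 1.0).astype(float)

def correlated_phase(shape, lengths, rng):
    """White Gaussian noise convolved with a Gaussian kernel (periodic, via FFT)."""
    noise = rng.standard_normal(shape)
    kernel_ft = np.ones(shape)
    for axis, (m, length) in enumerate(zip(shape, lengths)):
        k = 2 * np.pi * np.fft.fftfreq(m)            # angular frequency per pixel
        g = np.exp(-0.5 * (length * k) ** 2)         # FT of exp(-x^2 / (2 L^2)), up to scale
        bshape = [1] * len(shape)
        bshape[axis] = m
        kernel_ft = kernel_ft * g.reshape(bshape)
    return np.real(np.fft.ifftn(np.fft.fftn(noise) * kernel_ft))

rng = np.random.default_rng(0)
shape = (32, 32, 32)                                  # axis order (z, y, x)
s = superellipsoid(shape, a=10, b=8, c=6, e=0.7, n=0.9)
phi = correlated_phase(shape, lengths=(4, 4, 4), rng=rng)

inside = s > 0
# Scale and shift the phase to [0, 1] inside the particle, zero outside
phi = (phi - phi[inside].min()) / (phi[inside].max() - phi[inside].min())
phi = phi * s
rho = s * np.exp(1j * phi)
print(int(s.sum()), phi.min(), phi.max())
```

The overall prefactor of Eq. (5) is dropped here because the subsequent rescaling to [0, 1] removes it.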

Supervised learning approach

The 3D CNN model was trained in a supervised approach on pairs of real-space objects and their reciprocal-space diffraction patterns. We used a loss function $l_s$ to constrain the real and reciprocal space data at the same time:

$$l_{{{\mathrm{s}}}} = \frac{1}{{\alpha _1 + \alpha _2 + \alpha _3}}\left[ {\alpha _1L_1\left( {A_{{{\mathrm{p}}}},A_{{{\mathrm{g}}}}} \right) + \alpha _2L_2\left( {\phi _{{{\mathrm{p}}}},\phi _{{{\mathrm{g}}}}} \right) + \alpha _3L_3\left( {\sqrt {I_{{{\mathrm{p}}}}} ,\sqrt {I_{{{\mathrm{g}}}}} } \right)} \right],$$

(6)

where $L_1\left( {x_{{{\mathrm{p}}}},x_{{{\mathrm{g}}}}} \right) = L_2\left( {x_{{{\mathrm{p}}}},x_{{{\mathrm{g}}}}} \right) = \frac{{\sqrt {\mathop {\sum}\nolimits_n {\left( {x_{{{\mathrm{p}}}} - x_{{{\mathrm{g}}}}} \right)} ^2} }}{{\sqrt {\mathop {\sum}\nolimits_n {x_{{{\mathrm{g}}}}^2} } }}$ and $L_3\left( {x_{{{\mathrm{p}}}},x_{{{\mathrm{g}}}}} \right) = 1 - \frac{{\mathop {\sum}\nolimits_n {\left| {x_{{{\mathrm{p}}}} - \bar x_{{{\mathrm{p}}}}} \right| \cdot \left| {x_{{{\mathrm{g}}}} - \bar x_{{{\mathrm{g}}}}} \right|} }}{{\sqrt {\left[ {\mathop {\sum}\nolimits_n {\left( {x_{{{\mathrm{p}}}} - \bar x_{{{\mathrm{p}}}}} \right)} ^2} \right]\left[ {\mathop {\sum}\nolimits_n {\left( {x_{{{\mathrm{g}}}} - \bar x_{{{\mathrm{g}}}}} \right)} ^2} \right]} }}$. In Eq. (6), $L_1$ and $L_2$ are the loss functions for the amplitude and phase of the particle in real space, respectively. $L_3$ is the loss function for the X-ray diffraction intensity in reciprocal space, which is used to constrain the relation between the predicted amplitude and phase from the ML model in reciprocal space. Here, the subscript p denotes the predicted result from the ML model, and the subscript g denotes the corresponding ground truth. $L_3$ is based on the Pearson correlation coefficient. For the training, we used $\alpha _1 = 1$, $\alpha _2 = 1$ and $\alpha _3 = 1$. The proposed CNN model was implemented on the PyTorch platform using Python [39]. When training the CNN model, we adopted two optimizers, Adaptive Moment Estimation (ADAM) and Stochastic Gradient Descent (SGD), to optimize the weights and biases of the CNN model [40,41]. During the training, the two optimizers were switched every 25 epochs for a total of 100 epochs. The initial learning rate for both optimizers was 0.01, and after every 25 epochs, the learning rate was reduced by a factor of 0.95. In our study, the size of the input 3D coherent X-ray diffraction pattern was 64 × 64 × 64 pixels. The training was completed on a computer with 256 GB of RAM and two NVIDIA Quadro V100 GPUs.
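
To make the terms of Eq. (6) concrete, the three loss components can be written as plain array functions. This is a NumPy sketch for illustration only (the paper's model was implemented in PyTorch, where the same expressions would be written with differentiable tensor operations); the function names and the random test arrays are assumptions made for this example.

```python
import numpy as np

def relative_rmse(x_p, x_g):
    """L1 and L2 of Eq. (6): root summed squared error over the ground-truth norm."""
    return np.sqrt(np.sum((x_p - x_g) ** 2)) / np.sqrt(np.sum(x_g ** 2))

def modified_pearson_loss(x_p, x_g):
    """L3: one minus the modified Pearson coefficient (absolute deviations in the numerator)."""
    d_p = x_p - x_p.mean()
    d_g = x_g - x_g.mean()
    r_p = np.sum(np.abs(d_p) * np.abs(d_g)) / np.sqrt(np.sum(d_p ** 2) * np.sum(d_g ** 2))
    return 1.0 - r_p

def supervised_loss(A_p, A_g, phi_p, phi_g, sqrtI_p, sqrtI_g, alphas=(1.0, 1.0, 1.0)):
    """Weighted combination of Eq. (6); alphas default to the values quoted in the text."""
    a1, a2, a3 = alphas
    total = (a1 * relative_rmse(A_p, A_g)
             + a2 * relative_rmse(phi_p, phi_g)
             + a3 * modified_pearson_loss(sqrtI_p, sqrtI_g))
    return total / (a1 + a2 + a3)

# Hypothetical arrays standing in for ground truth and predictions
rng = np.random.default_rng(0)
A_g = rng.random((8, 8, 8))
phi_g = rng.random((8, 8, 8))
sqrtI_g = rng.random((16, 16, 16))

perfect = supervised_loss(A_g, A_g, phi_g, phi_g, sqrtI_g, sqrtI_g)
offset = supervised_loss(A_g + 0.1, A_g, phi_g, phi_g, sqrtI_g, sqrtI_g)
print(perfect, offset)
```

A perfect prediction gives a loss of zero up to rounding, and any amplitude error raises it through the $L_1$ term.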

Unsupervised learning approach

When the 3D CNN model was used in the unsupervised learning approach, only the 3D coherent X-ray diffraction pattern was available as input. During the optimization, the loss function $l_{{{\mathrm{u}}}}$ was defined as:

$$l_{{{\mathrm{u}}}} = \frac{1}{{\beta _1 + \beta _2}}\left[ {\beta _1L_3\left( {\sqrt {I_{{{\mathrm{p}}}}} ,\sqrt {I_{{{\mathrm{m}}}}} } \right) + \beta _2L_4\left( {\sqrt {I_{{{\mathrm{p}}}}} ,\sqrt {I_{{{\mathrm{m}}}}} } \right)} \right],$$

(7)

where $I_{{{\mathrm{m}}}}$ is the measured or validation 3D coherent X-ray diffraction intensity, and $I_{{{\mathrm{p}}}} = \left| {FT\rho _{{{\mathrm{p}}}}\left( {{{\mathbf{r}}}} \right)} \right|^2$ is the calculated 3D diffraction intensity. $\rho _{{{\mathrm{p}}}}\left( {{{\mathbf{r}}}} \right)$ is the complex particle density predicted by the CNN model after zero-padding to the same size as the input diffraction data $I_{{{\mathrm{m}}}}$. $L_4$ is the conventional $\chi ^2$ error function defined as $L_4 = \frac{{\mathop {\sum}\nolimits_n {\left( {\sqrt {I_{{{\mathrm{p}}}}} - \sqrt {I_{{{\mathrm{m}}}}} } \right)} ^2}}{{\mathop {\sum}\nolimits_n {I_{{{\mathrm{m}}}}} }}$. We used two different ways to initialize the CNN model’s weight and bias parameters: either from our pre-trained CNN model (i.e., transfer learning) or by using random numbers. In Eq. (7), the weighting coefficient $\beta _1$ followed a modified Weibull distribution:

$$\beta _1 = a_0\frac{k}{\lambda }\left( {\frac{n}{\lambda }} \right)^{k - 1}{{{\mathrm{e}}}}^{ - \left( {n/\lambda } \right)^k} + a_1,$$

(8)

where $k = 1$ and $\lambda = 0.5$, and $n$ is the training epoch. $a_0$ and $a_1$ are scale factors that let $\beta _1$ gradually change from 10⁴ to 1 during the training, as shown in Supplementary Figure 5, while $\beta _2$ remained equal to 1. Two optimizers, ADAM and SGD, were used to optimize the results, switching every 200 epochs. The learning rate for both optimizers started at 0.006, and after every 200 epochs it was reduced by a factor of 0.95. When the CNN model was applied to the experimental Bragg diffraction data, all the predicted results were converted from detector to laboratory coordinates after zero-padding to the same size as the input diffraction data, because of the shear distortion effects [42,43,44] in Bragg CDI.
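
The unsupervised loss of Eqs. (7) and (8) can be sketched in NumPy as follows. This is an illustrative reading, not the authors' implementation: the function names are hypothetical, the zero-padding is assumed to be centred, and the values of $a_0$ and $a_1$ are assumptions chosen so that $\beta_1$ equals 10⁴ at $n = 0$ and tends to 1 for large $n$ when $k = 1$ and $\lambda = 0.5$. The text does not say whether the epoch index $n$ is rescaled before entering Eq. (8), so it is passed through unchanged here.

```python
import numpy as np

# Assumed scale factors: with k = 1 the Weibull term at n = 0 equals a0 / lam,
# so a0 = lam * (1e4 - 1) and a1 = 1 give beta1(0) = 1e4 and beta1 -> 1.
A0 = 0.5 * (1e4 - 1.0)
A1 = 1.0


def beta1(n, a0=A0, a1=A1, k=1.0, lam=0.5):
    """Weighting coefficient of Eq. (8): scaled Weibull density plus an offset."""
    return a0 * (k / lam) * (n / lam) ** (k - 1.0) * np.exp(-((n / lam) ** k)) + a1


def diffraction_amplitude(rho, shape):
    """Zero-pad the complex density to `shape` (centred) and return sqrt(I_p) = |FT rho|."""
    rho = np.asarray(rho)
    padded = np.zeros(shape, dtype=complex)
    window = tuple(slice((s - r) // 2, (s - r) // 2 + r) for s, r in zip(shape, rho.shape))
    padded[window] = rho
    return np.abs(np.fft.fftshift(np.fft.fftn(np.fft.ifftshift(padded))))


def pearson_loss(x_p, x_g):
    """L3 as written in the text (absolute deviations in the numerator)."""
    d_p = x_p - x_p.mean()
    d_g = x_g - x_g.mean()
    return 1.0 - np.sum(np.abs(d_p) * np.abs(d_g)) / np.sqrt(np.sum(d_p ** 2) * np.sum(d_g ** 2))


def chi2_loss(amp_p, amp_m):
    """L4: squared amplitude residual normalised by the total measured intensity."""
    return np.sum((amp_p - amp_m) ** 2) / np.sum(amp_m ** 2)


def unsupervised_loss(rho, amp_m, n, beta2=1.0):
    """Eq. (7): weighted mean of L3 and L4 between predicted and measured amplitudes."""
    amp_p = diffraction_amplitude(rho, amp_m.shape)
    b1 = beta1(n)
    return (b1 * pearson_loss(amp_p, amp_m) + beta2 * chi2_loss(amp_p, amp_m)) / (b1 + beta2)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    rho_true = rng.normal(size=(8, 8, 8)) + 1j * rng.normal(size=(8, 8, 8))
    measured = diffraction_amplitude(rho_true, (16, 16, 16))
    rho_guess = rho_true + 0.2 * rng.normal(size=(8, 8, 8))
    print(unsupervised_loss(rho_guess, measured, n=0.0))
```

Early in training the large $\beta_1$ makes the scale-insensitive $L_3$ term dominate; as $\beta_1$ decays towards 1, the $\chi^2$ term $L_4$, which is sensitive to the overall amplitude scale, carries equal weight.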

Bragg CDI experiments

The Bragg CDI experiments were performed at beamline 34-ID-C of the Advanced Photon Source (APS), Argonne National Laboratory, USA. A front-end horizontal slit of 100 μm was used to improve the source coherence, and a double crystal monochromator was used to set the energy of the incident X-rays to 9 keV. A coherent beam of 30 × 70 μm² was selected and focused to ~630 × 470 nm² by Kirkpatrick–Baez (KB) mirrors before impinging on the samples. The four samples were chemically synthesized as nanocrystals by different methods and attached to silicon wafer substrates for handling. The corresponding 3D coherent diffraction intensities were obtained from a rocking curve of the target Bragg peak of each sample, {101} for BaTiO₃ and SrTiO₃ and {111} for Au and Pd, with a series of 2D coherent diffraction patterns recorded by a Medipix detector with 55 × 55 μm² pixels. In the figures, all experimental Bragg coherent X-ray diffraction patterns are presented in laboratory coordinates [42,43,44].

Conventional iterative phase-retrieval method

For the conventional iterative phase retrieval, each measured 3D Bragg diffraction pattern in detector coordinates was used as input to the iterative phase-retrieval scheme described by Robinson & Harder [7] to reconstruct the corresponding real-space particle. During the reconstruction, the initial particle was obtained by inverse Fourier transformation of the amplitude of the input diffraction pattern combined with a random phase distribution in the range [−π, π]. The initial support size of the particle in real space was half the size of the input diffraction pattern array in each dimension. The algorithm started with 50 steps of error reduction and then switched between hybrid input-output with β = 0.9 and error reduction every 50 iterations. After 100 iterations, the shrink-wrap method [45] was applied in real space to dynamically update the support every ten iterations. At the end, 100 steps of error reduction were used to ensure convergence. The total number of iterations was 2000. After reconstruction, all the reconstructed results were converted from detector to laboratory coordinates [42,43,44]. All the isosurfaces shown in the paper were plotted using the open-source software ParaView [46].
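
The iteration schedule above can be sketched in NumPy as follows. This is a simplified illustration and not the code of ref. [7]: the function names are hypothetical, the measured amplitude is assumed to be stored in unshifted FFT order, hybrid input-output is assumed to be the first algorithm after the initial error-reduction block, and the shrink-wrap blur width and threshold (`sigma`, `threshold`) are assumed values that the text does not specify.

```python
import numpy as np


def algorithm_at(i, total=2000, block=50, final_er=100):
    """Return 'ER' or 'HIO' for 0-based iteration i, following the schedule in the text."""
    if i < block or i >= total - final_er:
        return "ER"  # initial 50 and final 100 steps are error reduction
    return "HIO" if (i // block) % 2 == 1 else "ER"  # alternate every 50 iterations


def fourier_projection(rho, amp_m):
    """Keep the Fourier phase of rho and replace its modulus with the measured amplitude."""
    phase = np.angle(np.fft.fftn(rho))
    return np.fft.ifftn(amp_m * np.exp(1j * phase))


def er_step(rho, amp_m, support):
    """Error reduction: set the density to zero outside the support."""
    return fourier_projection(rho, amp_m) * support


def hio_step(rho, amp_m, support, beta=0.9):
    """Hybrid input-output: apply negative feedback outside the support."""
    rho_f = fourier_projection(rho, amp_m)
    return np.where(support, rho_f, rho - beta * rho_f)


def shrink_wrap(rho, sigma=1.0, threshold=0.1):
    """Update the support by thresholding a Gaussian-blurred |rho| (parameters assumed)."""
    freqs = np.meshgrid(*[np.fft.fftfreq(s) for s in rho.shape], indexing="ij")
    q2 = sum(f ** 2 for f in freqs)
    kernel = np.exp(-2.0 * np.pi ** 2 * sigma ** 2 * q2)  # FT of a unit-area Gaussian
    blurred = np.fft.ifftn(np.fft.fftn(np.abs(rho)) * kernel).real
    return blurred > threshold * blurred.max()


def reconstruct(amp_m, total=2000, seed=0):
    """Run the ER/HIO schedule with shrink-wrap from a random-phase start."""
    rng = np.random.default_rng(seed)
    phase = rng.uniform(-np.pi, np.pi, amp_m.shape)
    rho = np.fft.ifftn(amp_m * np.exp(1j * phase))
    # Initial support: centred box, half the array size in each dimension.
    support = np.zeros(amp_m.shape, dtype=bool)
    support[tuple(slice(s // 4, s // 4 + s // 2) for s in amp_m.shape)] = True
    for i in range(total):
        step = er_step if algorithm_at(i, total) == "ER" else hio_step
        rho = step(rho, amp_m, support)
        if i >= 100 and i % 10 == 0:
            support = shrink_wrap(rho)
    return rho, support


if __name__ == "__main__":
    obj = np.zeros((16, 16, 16))
    obj[5:11, 5:11, 5:11] = 1.0
    rho, support = reconstruct(np.abs(np.fft.fftn(obj)), total=300)
    print(support.sum(), np.abs(rho).max())
```

The sketch omits the detector-to-laboratory coordinate conversion, which requires the experimental geometry described in refs. [42,43,44].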

Data availability

The source data that support the findings of this study are available upon request from the corresponding author.

Code availability

The Python code used in this study will be made available to readers upon request to the corresponding authors.

References

1. Chapman, H. N. & Nugent, K. A. Coherent lensless X-ray imaging. Nat. Photonics 4, 833–9 (2010).
2. Walmsley, I. A. Quantum optics: science and technology in a new light. Science 348, 525–30 (2015).
3. Xiong, G., Moutanabbir, O., Reiche, M., Harder, R. & Robinson, I. Coherent X-ray diffraction imaging and characterization of strain in silicon-on-insulator nanostructures. Adv. Mater. 26, 7747–63 (2014).
4. Bielecki, J., Maia, F. & Mancuso, A. P. Perspectives on single particle imaging with x rays at the advent of high repetition rate x-ray free electron laser sources. Struct. Dyn. 7, 040901 (2020).
5. Williams, G. J., Pfeifer, M. A., Vartanyants, I. A. & Robinson, I. K. Three-dimensional imaging of microstructure in Au nanocrystals. Phys. Rev. Lett. 90, 175501 (2003).
6. Pfeifer, M. A., Williams, G. J., Vartanyants, I. A., Harder, R. & Robinson, I. K. Three-dimensional mapping of a deformation field inside a nanocrystal. Nature 442, 63–66 (2006).
7. Robinson, I. & Harder, R. Coherent X-ray diffraction imaging of strain at the nanoscale. Nat. Mater. 8, 291–8 (2009).
8. Yau, A., Cha, W., Kanan, M. W., Stephenson, G. B. & Ulvestad, A. Bragg coherent diffractive imaging of single-grain defect dynamics in polycrystalline films. Science 356, 739–42 (2017).
9. Wen, H., Cherukara, M. J. & Holt, M. V. Time-resolved X-ray microscopy for materials science. Annu. Rev. Mater. Res. 49, 389–415 (2019).
10. Decking, W., Abeghyan, S., Abramian, P., Abramsky, A., Aguirre, A. & Albrecht, C. et al. A MHz-repetition-rate hard X-ray free-electron laser driven by a superconducting linear accelerator. Nat. Photonics 14, 391–7 (2020).
11. Huijts, J., Fernandez, S., Gauthier, D., Kholodtsova, M., Maghraoui, A. & Medjoubi, K. et al. Broadband coherent diffractive imaging. Nat. Photonics 14, 618–22 (2020).
12. Clark, J. N., Beitra, L., Xiong, G., Higginbotham, A., Fritz, D. M. & Lemke, H. T. et al. Ultrafast three-dimensional imaging of lattice dynamics in individual gold nanocrystals. Science 341, 56–59 (2013).
13. Newton, M. C., Leake, S. J., Harder, R. & Robinson, I. K. Three-dimensional imaging of strain in a single ZnO nanorod. Nat. Mater. 9, 120–4 (2010).
14. Ulvestad, A., Welland, M. J., Cha, W., Liu, Y., Kim, J. W. & Harder, R. et al. Three-dimensional imaging of dislocation dynamics during the hydriding phase transformation. Nat. Mater. 16, 565–71 (2017).
15. Rose, M., Bobkov, S., Ayyer, K., Kurta, R. P., Dzhigaev, D. & Kim, Y. Y. et al. Single-particle imaging without symmetry constraints at an X-ray free-electron laser. IUCrJ 5, 727–36 (2018).
16. Passos, A. R., Rochet, A., Manente, L. M., Suzana, A. F., Harder, R. & Cha, W. et al. Three-dimensional strain dynamics govern the hysteresis in heterogeneous catalysis. Nat. Commun. 11, 4733 (2020).
17. Barty, A., Boutet, S., Bogan, M. J., Hau-Riege, S., Marchesini, S. & Sokolowski-Tinten, K. et al. Ultrafast single-shot diffraction imaging of nanoscale dynamics. Nat. Photonics 2, 415–9 (2008).
18. Gerchberg, R. W. A practical algorithm for the determination of phase from image and diffraction plane pictures. Optik 35, 237–46 (1972).
19. Fienup, J. R. Phase retrieval algorithms: a comparison. Appl. Opt. 21, 2758–69 (1982).
20. Bates, R. H. T. Uniqueness of solutions to two-dimensional Fourier phase problems for localized and positive images. Comput. Vis. Graph. Image Process. 25, 205–17 (1984).
21. Robinson, I. K., Vartanyants, I. A., Williams, G. J., Pfeifer, M. A. & Pitney, J. A. Reconstruction of the shapes of gold nanocrystals using coherent x-ray diffraction. Phys. Rev. Lett. 87, 195505 (2001).
22. Elser, V. Phase retrieval by iterated projections. J. Opt. Soc. Am. A Opt. Image Sci. Vis. 20, 40–55 (2003).
23. Luke, D. R. Relaxed averaged alternating reflections for diffraction imaging. Inverse Probl. 21, 37–50 (2005).
24. Marchesini, S. Invited article: a [corrected] unified evaluation of iterative projection algorithms for phase retrieval. Rev. Sci. Instrum. 78, 011301 (2007).
25. Thibault, P. & Guizar-Sicairos, M. Maximum-likelihood refinement for coherent diffractive imaging. New J. Phys. 14, 063004 (2012).
26. Gao, Y., Huang, X., Yan, H. & Williams, G. J. Bragg coherent diffraction imaging by simultaneous reconstruction of multiple diffraction peaks. Phys. Rev. B 103, 014102 (2021).
27. Shechtman, Y., Eldar, Y. C., Cohen, O., Chapman, H. N., Miao, J. & Segev, M. Phase retrieval with application to optical imaging: a contemporary overview. IEEE Signal Process. Mag. 32, 87–109 (2015).
28. Wu, L., Juhas, P., Yoo, S. & Robinson, I. Complex imaging of phase domains by deep neural networks. IUCrJ 8, 12–21 (2021).
29. Scheinker, A. & Pokharel, R. Adaptive 3D convolutional neural network-based reconstruction method for 3D coherent diffraction imaging. J. Appl. Phys. 128, 184901 (2020).
30. Cherukara, M. J., Nashed, Y. S. G. & Harder, R. J. Real-time coherent diffraction inversion using deep generative networks. Sci. Rep. 8, 16520 (2018).
31. Harder, R. Deep neural networks in real-time coherent diffraction imaging. IUCrJ 8, 1–3 (2021).
32. Chapman, H. N., Barty, A., Bogan, M. J., Boutet, S., Frank, M. & Hau-Riege, S. P. et al. Femtosecond diffractive imaging with a soft-X-ray free-electron laser. Nat. Phys. 2, 839–43 (2006).
33. Harder, R., Pfeifer, M. A., Williams, G. J., Vartaniants, I. A. & Robinson, I. K. Orientation variation of surface strain. Phys. Rev. B 76, 115425 (2007).
34. Guan, Z. & Tsai, E. H. PtychoNet: Fast and High Quality Phase Retrieval for Ptychography. Tech. Rep., https://doi.org/10.2172/1599580 (2019).
35. Duda, R. O. & Hart, P. E. Pattern classification and scene analysis. Vol. 3 (Wiley New York, 1973).
36. Wang, G., Ye, J. C. & De Man, B. Deep learning for tomographic image reconstruction. Nat. Mach. Intell. 2, 737–48 (2020).
37. Bauschke, H. H., Combettes, P. L. & Luke, D. R. Phase retrieval, error reduction algorithm, and Fienup variants: a view from convex optimization. J. Opt. Soc. Am. A Opt. Image Sci. Vis. 19, 1334–45 (2002).
38. Garcia, N. & Stoll, E. Monte Carlo calculation for electromagnetic-wave scattering from random rough surfaces. Phys. Rev. Lett. 52, 1798–801 (1984).
39. Paszke, A. et al. PyTorch: an imperative style, high-performance deep learning library. Preprint at http://arxiv.org/abs/1912.01703 (2019).
40. Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. Preprint at http://arxiv.org/abs/1412.6980 (2014).
41. Saad, D. On-line learning in neural networks. (Cambridge University Press, 2009).
42. Pfeifer, M. A. Structural studies of lead nanocrystals using coherent X-ray diffraction. (University of Illinois at Urbana-Champaign, 2005).
43. Li, P., Maddali, S., Pateras, A., Calvo-Almazan, I., Hruszkewycz, S. O. & Cha, W. et al. General approaches for shear-correcting coordinate transformations in Bragg coherent diffraction imaging. Part II. J. Appl. Crystallogr. 53, 404–18 (2020).
44. Maddali, S., Li, P., Pateras, A., Timbie, D., Delegan, N. & Crook, A. L. et al. General approaches for shear-correcting coordinate transformations in Bragg coherent diffraction imaging. Part I. J. Appl. Crystallogr. 53, 393–403 (2020).
45. Marchesini, S., He, H., Chapman, H. N., Hau-Riege, S. P., Noy, A. & Howells, M. R. et al. X-ray image reconstruction from a diffraction pattern alone. Phys. Rev. B 68, 140101 (2003).
46. Ayachit, U. The ParaView guide: a parallel visualization application. (Kitware, Inc., 2015).

Acknowledgements

Work at Brookhaven National Laboratory was supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, under Contract No. DE-SC0012704. J.D. received funding from the China Scholarship Council (CSC). Work at UCL was funded by EPSRC. Measurements were carried out at the Advanced Photon Source (APS) beamline 34-ID-C, which was supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, under Contract No. DE-AC02-06CH11357. The beamline 34-ID-C was built with U.S. National Science Foundation grant DMR-9724294.

Author information

Authors and Affiliations

Computational Science Initiative, Brookhaven National Laboratory, Upton, NY, USA
Longlong Wu & Shinjae Yoo
Condensed Matter Physics and Materials Science Department, Brookhaven National Laboratory, Upton, NY, USA
Longlong Wu, Ana F. Suzana, Tadesse A. Assefa & Ian K. Robinson
Stanford Institute for Materials and Energy Sciences, SLAC National Accelerator Laboratory, California, USA
Tadesse A. Assefa
London Centre for Nanotechnology, University College London, London, UK
Jiecheng Diao & Ian K. Robinson
Advanced Photon Source, Argonne, IL, USA
Ross J. Harder & Wonsuk Cha

Authors

Longlong Wu
Shinjae Yoo
Ana F. Suzana
Tadesse A. Assefa
Jiecheng Diao
Ross J. Harder
Wonsuk Cha
Ian K. Robinson

Contributions

L.W., S.Y., and I.K.R. developed the convolutional neural networks and performed the experimental data analysis. I.K.R., L.W., A.F.S., T.A.A., J.D., R.H., and W.C. carried out the BCDI experiments at beamline 34-ID-C of the Advanced Photon Source (APS), Argonne National Laboratory, USA. L.W. and I.K.R. wrote the manuscript, and all the authors contributed to discussion of the manuscript.

Corresponding authors

Correspondence to Longlong Wu or Ian K. Robinson.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wu, L., Yoo, S., Suzana, A.F. et al. Three-dimensional coherent X-ray diffraction imaging via deep convolutional neural networks. npj Comput Mater 7, 175 (2021). https://doi.org/10.1038/s41524-021-00644-z

Received: 26 July 2021
Accepted: 01 October 2021
Published: 28 October 2021
DOI: https://doi.org/10.1038/s41524-021-00644-z
