Causal analysis of competing atomistic mechanisms in ferroelectric materials from high-resolution scanning transmission electron microscopy data

Ziatdinov, Maxim; Nelson, Christopher T.; Zhang, Xiaohang; Vasudevan, Rama K.; Eliseev, Eugene; Morozovska, Anna N.; Takeuchi, Ichiro; Kalinin, Sergei V.

doi:10.1038/s41524-020-00396-2

Open access
Published: 21 August 2020

npj Computational Materials volume 6, Article number: 127 (2020)

Abstract

Machine learning has emerged as a powerful tool for the analysis of mesoscopic and atomically resolved images and spectroscopy in electron and scanning probe microscopy, with applications ranging from feature extraction to information compression and elucidation of relevant order parameters to inversion of imaging data to reconstruct structural models. However, the fundamental limitation of machine learning methods is their correlative nature, leading to extreme susceptibility to confounding factors. Here, we implement a workflow for causal analysis of structural scanning transmission electron microscopy (STEM) data and explore the interplay between physical and chemical effects in a ferroelectric perovskite across the ferroelectric–antiferroelectric phase transitions. A combinatorial library of Sm-doped BiFeO₃ (BFO) is grown to cover the composition range from pure ferroelectric BFO to orthorhombic 20% Sm-doped BFO. Atomically resolved STEM images are acquired for selected compositions and are used to create a set of local compositional, structural, and polarization field descriptors. Information-geometric causal inference (IGCI) and additive noise model (ANM) analysis are used to establish the pairwise causal directions between the descriptors, ordering the data set in the causal direction. The causal chains obtained from IGCI and ANM are compared across the composition series and suggest the presence of common causal mechanisms. Ultimately, we believe that causal analysis of multimodal data will allow exploring the causal links between the multiple competing mechanisms that control the emergence of unique functionalities of morphotropic materials and ferroelectric relaxors.

Introduction

The functionality of material systems such as morphotropic phase boundary systems^1,2,3,4, ferroelectric relaxors^5,6,7,8, spin and cluster glasses^9,10,11,12, and charge-ordered manganites^13,14,15,16,17 is determined by the complex interplay between structural, orbital, chemical, spin, and other degrees of freedom^18,19. Traditionally, these materials systems have been explored via a combination of macroscopic physical property measurements and scattering techniques, with the theoretical counterpart provided by a combination of analytical and numerical methods. Given that the physics of these materials is ultimately linked to the emergence of frustrated degenerate ground states driven by competing interactions, analyses based on macroscopically averaged descriptors such as concentrations, order parameter fields, etc. provide only limited insight into the generative and especially causal physics of these materials systems.

Progress in high-resolution imaging techniques has allowed visualization of these materials systems down to the atomic level. Techniques such as scanning tunneling microscopy have provided insight into the electronic structure of surfaces and into superconductive and magnetic order parameters^20,21. Scanning transmission electron microscopy (STEM) has enabled studies of chemical composition down to the single-atom level^22,23,24 and, via quantitative mapping of structural distortions, visualization of order parameter fields such as polarization^25,26,27,28, tilts^29,30,31, and mechanical^25,32,33,34,35 and chemical^35,36,37 strains. However, this abundance of data brings the challenge of analyzing systems with multiple spatially distributed degrees of freedom, including determination of both the functional laws connecting functionality and structure and the causal links that define the cause-and-effect relationships in nonstationary and non-ergodic systems.

Recently, machine learning (ML) has emerged as a powerful tool for the analysis of mesoscopic and atomically resolved images and spectroscopy in electron and scanning probe microscopy^38,39,40,41. Applications ranging from feature extraction⁴² to information compression and elucidation of relevant order parameters⁴³ to inversion of imaging data to reconstruct structural models have been demonstrated. However, the fundamental limitation of the vast majority of machine learning methods is their correlative nature, leading to extreme susceptibility to confounding factors and observational biases^44,45. While classical statistics offers established methodologies for addressing phenomena induced by confounders or selection bias, such as Simpson's paradox⁴⁶, the complex and often nontransparent nature of modern machine learning tools such as deep neural networks renders them extremely prone to misinterpretation. We posit that correlative machine learning provides a reliable and powerful tool in cases when the causal links are well established, as in atom finding in SPM and STEM and in the analysis of 4D STEM data when this condition is satisfied. Notably, ML applications in theory generally fall under this category since the causal links are postulated. Alternatively, ML methods work well when the confounding factors are effectively frozen by the narrowness of the experimental conditions or the experimental system. However, both these conditions are violated in experimental studies, where causal relationships are known only partially (and are in fact often the target of study) and confounding and observational bias factors (composition uncertainty, microscope tuning, contamination) are abundant.

One approach to discovering generative physical models from microscopic data is to fit the data to the relevant mesoscopic or atomistic models. On the mesoscopic level, directly matching the solutions of the Ginzburg–Landau equations to the order parameter fields determined from atomically resolved data can be used to determine the interface terms via the corresponding boundary conditions⁴⁷, as well as the nature of the coupling and gradient terms³⁹. Statistical distance minimization can be used to directly match the discrete data to a lattice model, e.g., to reconstruct the interaction parameters^48,49,50,51. However, even when the functional laws describing the system are known, this level of description is not sufficient to establish the causal mechanisms active in the system. As a simple example, knowledge of the ideal gas law does not establish whether pressure is a cause or an effect of the volume change unless the character of the process is established.

In many cases, it can be argued that the causal effects can be estimated based on the energy scales of the corresponding phenomena, e.g., magnetic properties driven by relatively weak energy scales are unlikely to affect atomic structure. However, this is not the case when the energy scales are comparable, or when depolarization and global effects become significant. For example, while the magnetization energy density per volume can be small, the concentration of magnetically induced mechanical stresses can lead to stress corrosion at the domain walls. Specifically, for ferroelectric materials it is generally assumed that cationic order is frozen at the stage of material formation, and that the polarization field then evolves to accommodate the average polarization instability and local pinning. However, it is known that ions can redistribute to compensate polarization, with examples including segregation at the domain walls, memory effects, etc.^2,52. Hence, for nonequilibrium and non-ergodic materials the question of cause and effect becomes paramount. For example, does polarization align to the cationic disorder, or does polarization instability at the morphotropic phase boundaries drive the cationic disorder?

More generally, the ability to answer causal questions is required both for meaningful applications of machine learning techniques and for inferring the materials physics, since causal knowledge allows one to avoid correlative but incorrect conclusions and to explore counterfactuals and interventions⁵³, i.e., realistic strategies for materials development. Correspondingly, we argue that analyzing causal relationships from the observations of atomically resolved degrees of freedom is key for understanding the physics of non-ergodic systems.

Here, we implement a workflow for causal analysis of STEM data and explore the interplay between physical and chemical effects in a ferroelectric perovskite across the ferroelectric–antiferroelectric phase transition. A combinatorial library of Sm-doped BiFeO₃ (BFO) is grown to cover the composition range from pure ferroelectric BFO to orthorhombic 20% Sm-doped BFO^54,55,56,57. Atomically resolved STEM images are acquired for selected compositions and are used to create a set of local compositional, structural, and polarization field descriptors. Information-geometric causal inference (IGCI) and the additive noise model (ANM) are used to establish the pairwise causal directions between the descriptors, ordering the data set in the causal direction. The causal chains obtained from IGCI and ANM are compared across the compositions, suggesting similar causal mechanisms across the Sm-BFO composition series.

Results and discussion

Combinatorial library

The combinatorial library of Sm_xBi_{1−x}FeO₃ (0 ≤ x ≤ 0.2) was fabricated on a SrTiO₃ (001) substrate. The chemical compositions at different positions across the substrate were characterized by wavelength dispersive spectroscopy measurements. X-ray diffraction results (Fig. 1) indicate that the (002) peak of the Sm_xBi_{1−x}FeO₃ layer gradually moves towards higher angles as the Sm doping concentration (x) increases. As shown in Fig. 1b–d, representative piezoresponse force microscopy images indicate that the domain structure in the combinatorial library changes from stripe domains on the pure BiFeO₃ side to mosaic-like domains at an intermediate doping level (x ≈ 0.08), and eventually no domain structure can be identified at the highest doping level (x ≈ 0.2).

**Fig. 1: Experimental characterization of Sm_xBi_{1−x}FeO₃ films.**

The TEM samples were prepared for three sites along the gradient composition sample with nominal compositions of 0%, 7%, and 20% Sm doping (see “Methods”). STEM data was collected from the [100] pseudocubic zone axis using High-Angle Annular Dark Field (HAADF) detector imaging as shown in Fig. 2, thereby providing the projected atomic structure as well as compositional information by the atomic column intensity (which scales by ~Z^1.7). As the concentration of Sm increases through the sample series there is an observed phase transition from the prototypical rhombohedral ferroelectric phase⁵⁸ of BiFeO₃ (Fig. 2a) to an antiferrodistortive orthorhombic phase⁵⁹ at 20% Sm (Fig. 2c). The transition is readily observable in maps of the polar atomic displacement between the A-site and B-site cation sublattices, P, which is shown for the three compositions in Fig. 2.
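The Z-contrast scaling quoted above can be made concrete with a small numerical sketch. It assumes, purely for illustration, that the column intensity is proportional to Z^1.7 and that a mixed Bi/Sm column can be represented by a composition-weighted average of the two single-element intensities. Thickness, channeling, and detector effects are ignored, and all function names are hypothetical.

```python
# Atomic numbers of the cations discussed in the text.
Z = {"Bi": 83, "Sm": 62, "Fe": 26, "Sr": 38, "Ti": 22}

EXPONENT = 1.7  # approximate HAADF scaling quoted in the text


def column_intensity(element):
    """Relative HAADF intensity of a single-element column (arbitrary units)."""
    return Z[element] ** EXPONENT


def a_site_intensity(x_sm):
    """Assumed linear mixing of Sm and Bi intensities on the A-site."""
    return x_sm * column_intensity("Sm") + (1.0 - x_sm) * column_intensity("Bi")


def a_to_b_ratio(a_intensity, b_element):
    """Ratio of A-site to B-site column intensity."""
    return a_intensity / column_intensity(b_element)


# Nominal compositions of the three TEM sites, plus the SrTiO3 substrate.
for x in (0.0, 0.07, 0.20):
    ratio = a_to_b_ratio(a_site_intensity(x), "Fe")
    print(f"x = {x:.2f}: A/B intensity ratio = {ratio:.2f}")
substrate_ratio = a_to_b_ratio(column_intensity("Sr"), "Ti")
print(f"SrTiO3: A/B intensity ratio = {substrate_ratio:.2f}")
```

Under these assumptions the A-site intensity decreases as Sm replaces Bi, and the A-to-B ratio of the film remains well above that of SrTiO₃, which is the kind of contrast a sublattice intensity ratio can exploit.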

**Fig. 2: Atomic resolution HAADF STEM imaging of Sm_xBi_{1−x}FeO₃.**

For the pure BiFeO₃ phase, P is a proxy for the electrical dipole moment, and the distribution in Fig. 2a illustrates the polydomain structure characteristic of an r-phase ferroelectric, including 109° (vertical) and 180° (inclined) domain walls. The distribution of P in the 20% Sm composition shows the large oscillation of P_y (see inset) corresponding to the antiferrodistortive orthorhombic structure (Fig. 2c). The intermediate 7% Sm composition exhibits a mixed structure, with small domains attributable to both structures appearing in the near-interface region (Fig. 2b).

To describe the local material behavior, we parametrize the data choosing the perovskite unit cell as the basis. We introduce a set of descriptors based on the properties and distribution of the five cation atomic columns in each unit cell. These are outlined in Table 1, along with the corresponding calculations. Unit-cell parameters are defined from the local neighborhood of atomic columns in the HAADF STEM data corresponding to a five-cation perovskite-type cell: corner A-sites A₁, A₂, A₃, A₄, and central B-site B₁ (labels in Fig. 3a). The parameters include structural descriptors from positional data regarding unit cell size and shape (a, b, a/b, θ, Vol), compositional information from atomic HAADF intensity and its distribution (I₁, I₂, I₃, I₄, I₅), and electrical polarization information from the non-centrosymmetric displacement of the A- and B-site sublattices (P). Examples of several of these descriptors for a HAADF STEM unit cell are illustrated in Fig. 3a. Here, a and b are the two lattice vectors of the unit cell connecting A-site corner positions, θ is their internal angle, a/b is their magnitude ratio, and Vol is the total cell volume. I₁ corresponds to the mean atomic column intensity and scales with the sample mass-thickness. I₂–I₅ correspond to internal asymmetries. Notably, I₅ corresponds to the intensity ratio between cation sublattices and thus readily distinguishes the Sm_xBi_{1−x}FeO₃ film from the SrTiO₃ substrate. Our choice of basis also includes several internal gradient terms, including the A-site intensity asymmetries in I₂–I₄ and the gradient of the a and b vectors between opposed edges of the unit cell (denoted as abΔ). Additional gradient terms can be derived using a larger basis or calculated across multiple neighboring cells.
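A minimal sketch of how a few of these descriptors could be computed for a single cell is given below. The formulas of Table 1 are not reproduced in this text, so every definition here is an assumption chosen to match the verbal descriptions: the corner ordering, the use of the projected cell area as Vol, the sign convention of P, and the form of I₁ and I₅ are all illustrative, and the function name is hypothetical.

```python
import numpy as np


def unit_cell_descriptors(A, B, I_A, I_B):
    """Descriptors for one projected perovskite cell (assumed definitions).

    A   : (4, 2) corner A-site positions, ordered as
          A1 (origin), A2 (origin + a), A3 (origin + a + b), A4 (origin + b).
    B   : (2,) position of the central B-site column.
    I_A : (4,) HAADF intensities of the A-site columns.
    I_B : scalar HAADF intensity of the B-site column.
    """
    A = np.asarray(A, dtype=float)
    B = np.asarray(B, dtype=float)
    I_A = np.asarray(I_A, dtype=float)

    # Average the two opposite edges to get one a and one b vector per cell.
    a_vec = 0.5 * ((A[1] - A[0]) + (A[2] - A[3]))
    b_vec = 0.5 * ((A[3] - A[0]) + (A[2] - A[1]))
    a, b = np.linalg.norm(a_vec), np.linalg.norm(b_vec)

    # The 2D cross product gives the projected cell area.
    cross = a_vec[0] * b_vec[1] - a_vec[1] * b_vec[0]
    theta = np.degrees(np.arctan2(abs(cross), a_vec @ b_vec))

    return {
        "a": a,
        "b": b,
        "a/b": a / b,
        "theta": theta,
        "Vol": abs(cross),               # projected area as a stand-in for volume
        "P": B - A.mean(axis=0),         # B-site offset from the A-site centroid
        "I1": (I_A.sum() + I_B) / 5.0,   # mean column intensity
        "I5": I_A.mean() / I_B,          # A-to-B sublattice intensity ratio
    }
```

For a square cell of unit edge with the B-site shifted by 0.05 along a, this returns θ = 90°, Vol = 1, and P = (0.05, 0); applying the function to every cell of the fitted lattice would yield descriptor maps on the unit-cell grid.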

Table 1 Unit cell descriptors.

Real-space distributions of selected local physical and chemical descriptors of materials structure and functionality are shown in Fig. 3 for the three compositions. The plots are depicted in unit-cell space, with each data point corresponding to the local unit cell descriptor on an (a, b)-addressed grid. The selected descriptor maps are categorized as compositional parameters (top), which are associated with the unit cell intensity and represent a convolution of the (slowly changing) film thickness and local composition; structural parameters (middle), including lattice parameters, unit cell volume, and internal angle; and polarization components (bottom), describing the ferroelectric functionality. In the unit-cell basis, the alternating P_y component seen in the atom-level mapping (Fig. 2c) is not observed in Fig. 3d, but is instead captured by the structural descriptors a and θ. Internal descriptors and cross-unit-cell gradient terms are not depicted; distribution maps for the internal gradient descriptors can be found in Supplementary Figs. 1–3.

General theory of ferroelectrics

To provide the physical context for causal analysis of the observables in the STEM experiment, we note that the thermodynamics of ferroelectric materials can generally be described via Landau–Ginzburg–Devonshire theory, in which the energy of the material is represented as the free energy functional

$$G = \int d^3x \left( \Delta G_{AFD} + \Delta G_{FE} + \Delta G_{AFE} + \Delta G_{BQC} + \Delta G_{ST} + \Delta G_{EL} \right). \tag{1}$$

Here, the terms $\Delta G_{AFD}$, $\Delta G_{FE}$, and $\Delta G_{AFE}$ describe the antiferrodistortive (AFD), ferroelectric (FE), and antiferroelectric (AFE) long-range orders. AFD order is described by an axial vector, Φ_i, that is perpendicular to the rotation plane of the oxygen octahedral tilts. The FE and AFE long-range orders, which interact with the AFD order and transform into one another depending on the Sm content, are described by the FE and AFE order parameters, $P_i = \frac{1}{2}\left( P_i^a + P_i^b \right)$ and $A_i = \frac{1}{2}\left( P_i^a - P_i^b \right)$, where $P_i^a$ and $P_i^b$ are the polarization components of two (or more) equivalent sublattices “a” and “b”. Antiferromagnetic order is not included in Eq. (1), since its impact on the AFD, FE, and AFE orders is, as a rule, negligibly small.

The individual AFD, FE, and AFE contributions are generally representable as expansions in powers (2–4 for second-order, or 2–4–6 for first-order phase transitions) of the corresponding order parameters, gradient terms defining the spatial behavior of the order parameter fields, coupling terms with the conjugate fields (electric, strain, strain gradient), and biquadratic coupling terms describing the interactions between order parameters. An important aspect of Ginzburg–Landau theory is that the free energy of the material is generally nonlocal, since the order parameter and depolarization fields can be found only from the solution of the boundary value problem. Therefore, in the most general description, the individual observables are linked through integral transforms, representing extremely complex forms of parameter coupling.
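As an illustration of the structure described above, a generic one-component form of the ferroelectric and biquadratic coupling contributions can be written as follows. This is a textbook-style sketch rather than the expansion used for Sm-doped BiFeO₃: the coefficients α, β, γ, g, and ξ are placeholder symbols that absorb all numerical prefactors, E denotes the conjugate electric field, and identifying ΔG_BQC in Eq. (1) with the biquadratic coupling term is an assumption based on the surrounding text.

$$\Delta G_{FE} \approx \alpha P^2 + \beta P^4 + \gamma P^6 + g\left( \nabla P \right)^2 - PE, \qquad \Delta G_{BQC} \approx \xi P^2 \Phi^2.$$

Truncating at the $P^4$ term with β > 0 corresponds to the 2–4 expansion for a second-order transition, while retaining the $P^6$ term with β < 0 and γ > 0 corresponds to the 2–4–6 expansion for a first-order transition.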

First, we note that under some general conditions, including macroscopic uniformity, this relationship can be simplified to yield a local nonlinear relationship between state variables, with the nonlocal effects represented via unknown mean local fields. These nonlinear and nonlocal partial differential equations can be linearized around a specific ground state to give a linear relationship between the observed and non-observable parameters. Second, we note that in the presence of strong composition fluctuations and nanodomains, the local fields will be a superposition of slowly varying (on the atomic level) depolarization fields and disorder-related fields. Here, we aim to explore the causal relationships from the observed descriptor fields.

Statistical analysis

Initial insight into the statistical properties of this material system can be obtained from the analysis of joint pairwise distributions, as shown in Fig. 4. Figure 4a shows the pairwise distributions for the set of parameters P_x, P_y, Vol, and I₅ for the rhombohedral ferroelectric phase. Note that the full data set and analyses are contained in the accompanying Jupyter notebook. Here, the diagonal elements contain the distribution functions for individual parameters. Notably, both the P_x and P_y distributions have four peaks, suggesting a more complex polarization distribution than can be expected in a system of ferroelectric domains (positive and negative) and substrate (0). In particular, the P_x component shows pronounced peak splitting for the nonzero polarization orientation. The distribution functions for the molar volume Vol and the chemical variability I₅ show two clear peaks, corresponding to the ferroelectric material and the substrate, respectively. Note that the I₅ distribution is much broader, reflecting the higher variability or noise level in the data.

The pairwise distributions between the descriptors are shown in the upper and lower triangles of the matrix. Here, the full data in the upper triangle provide general insight into the outliers. The kernel density estimates in the lower triangle provide insight into the statistically significant parts of the distributions. The pairwise distribution between P_x and P_y has three peaks, clearly corresponding to the two dominant domain orientations and the substrate. Similar structures are visible for the P_x–Vol and P_y–Vol distributions, clearly showing similar molar volumes for the ferroelectric phase and a dissimilar molar volume for the substrate. Finally, the joint distributions of I₅ with P_x, P_y, and Vol are complex and multimodal. It should be noted that for the ferroelectric phase, observations within a single ferroelectric domain will impose an observational bias on the data; hence, ideally the imaged volume should contain multiple domains or, alternatively, exceed the correlation length for the observed variables.

A similar analysis for the 7% Sm mixed phase is shown in Fig. 4b. In this case, the peaks of the distribution functions are clearly non-Gaussian, reflecting the complex nature of the mixed phases. Similarly, the pair distribution functions clearly show asymmetry in the peak shapes. Similar behavior is observed for the 20% orthorhombic phase.

Even a cursory examination of the distributions in Fig. 4 (or the full versions available from the notebook) illustrates that these generally do not factorize, i.e., the joint distributions between the parameters cannot be represented as the product of the marginal distribution functions. This in turn suggests the presence of a functional or causal link between the parameters. However, while some of these links can be speculated about (e.g., it can be argued that chemical fluctuations control order parameter distributions), multiple counter-examples, such as cation redistribution during the aging of ferroelectric materials, suggest that these “natural” explanations are not necessarily correct. Hence, we aim to analyze the causal relationships from the observational data.
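The statement that a joint distribution is not the product of its marginals can be checked numerically. The sketch below is one possible way to do so and is not taken from the accompanying notebook: it estimates the mutual information from a 2D histogram and subtracts a shuffled baseline to offset the positive bias of the histogram estimator. The bin count, the number of shuffles, and the function names are illustrative assumptions.

```python
import numpy as np


def mutual_information(x, y, bins=32):
    """Histogram estimate of the mutual information I(X; Y) in nats."""
    counts, _, _ = np.histogram2d(x, y, bins=bins)
    p_xy = counts / counts.sum()
    p_x = p_xy.sum(axis=1, keepdims=True)   # marginal of x, shape (bins, 1)
    p_y = p_xy.sum(axis=0, keepdims=True)   # marginal of y, shape (1, bins)
    product = p_x * p_y                     # factorized model p(x) p(y)
    filled = p_xy > 0                       # empty bins contribute zero
    return float(np.sum(p_xy[filled] * np.log(p_xy[filled] / product[filled])))


def excess_information(x, y, bins=32, n_shuffles=20, seed=0):
    """Mutual information minus its mean over shuffled (independent) pairings."""
    rng = np.random.default_rng(seed)
    baseline = np.mean(
        [mutual_information(x, rng.permutation(y), bins) for _ in range(n_shuffles)]
    )
    return mutual_information(x, y, bins) - baseline
```

An excess close to zero is consistent with a factorizable joint distribution, whereas a clearly positive value indicates statistical dependence. Such dependence says nothing about its direction, which is the question addressed by the causal analysis.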

Causal analysis

Generally, the analysis of causal relationships is one of the most complex problems in ML. For two observed variables, the number of possible causal relationships is limited, and methodologies to establish the directionality of the causal link and the presence of possible confounders are available. For more complex cases, the analysis of directed acyclic causal graphs has been explored by Pearl and co-workers^44,46. However, the analysis of cause-and-effect relationships in the presence of cycles and feedbacks represents a significantly more complex problem, and numerical schemes to address it have been reported only recently by Mooij and others⁶⁰.

Here, we explore a two-step approach for the analysis of the possible causal relationships between the STEM observables. First, the causal directions are analyzed for all pairs of variables to yield pairwise causal relationships, which are represented as a “causal sieve” matrix. By construction, the matrix is antisymmetric with elements equal to 1 and −1. Second, the properties of the graph whose adjacency matrix is given by the “causal sieve” are explored.
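The first step can be sketched as a loop over all descriptor pairs. The function below is an illustrative assumption about how such a matrix might be assembled, not the notebook implementation: `score_fn` is a hypothetical callable that returns a negative number when its first argument is inferred to cause its second, and the toy score used in the example is deliberately not a causal criterion.

```python
import numpy as np


def causal_sieve(columns, score_fn):
    """Build the antisymmetric sieve matrix from pairwise direction scores.

    columns  : dict mapping descriptor name -> 1D array of per-cell values.
    score_fn : callable(x, y) -> float, negative when x is inferred to cause y.
    Entry [row, col] is +1 when the column variable is the inferred cause of
    the row variable and -1 for the opposite direction; the diagonal is 0.
    """
    names = list(columns)
    n = len(names)
    sieve = np.zeros((n, n), dtype=int)
    for col in range(n):
        for row in range(col + 1, n):
            score = score_fn(columns[names[col]], columns[names[row]])
            sieve[row, col] = 1 if score < 0 else -1
            sieve[col, row] = -sieve[row, col]
    return names, sieve


# Toy usage with a stand-in score (NOT a causal criterion): it simply orders
# variables by their spread so that the example runs end to end.
rng = np.random.default_rng(0)
toy = {
    "u": rng.normal(0.0, 3.0, 100),
    "v": rng.normal(0.0, 2.0, 100),
    "w": rng.normal(0.0, 1.0, 100),
}
names, sieve = causal_sieve(toy, lambda x, y: np.std(y) - np.std(x))
```

In practice the stand-in score would be replaced by a pairwise estimator such as IGCI or ANM, evaluated once per pair so that the matrix is antisymmetric by construction.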

To determine the causal direction for two variables, we use and compare the IGCI⁶¹ and the ANM⁶². The IGCI method is based on the assumption of independence between the “cause” distribution and the conditional distribution of the “effect” given the cause^61,63,64. Using an empirical slope-based estimator⁶⁵, X is inferred to cause Y if

$$\sum_{j=1}^{N-1} \log\frac{\left| y_{j+1} - y_j \right|}{\left| x_{j+1} - x_j \right|} - \sum_{j=1}^{N-1} \log\frac{\left| x_{j+1} - x_j \right|}{\left| y_{j+1} - y_j \right|} < 0, \tag{2}$$

and vice versa, where the (x_j, y_j) pairs are sorted in ascending order of x in the first term and of y in the second term. IGCI was first assumed to be applicable only to noise-free observations, where Y = f(X) and X = f ⁻¹(Y), but was later shown (empirically) to work on noisy data as well⁶⁴. Equation (2) was used for all the IGCI-based analyses of the cause–effect pairs in the current paper.
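A minimal NumPy sketch of the slope-based criterion in Eq. (2) is shown below. Two choices are assumptions rather than details taken from the paper: both variables are min-max scaled to [0, 1] before the slopes are computed, and the sums are replaced by means over the retained pairs, which leaves the sign unchanged when no pairs are dropped. Pairs with a zero difference are skipped to avoid taking the logarithm of zero.

```python
import numpy as np


def _rescale(v):
    """Min-max scale to [0, 1] (an assumed normalization choice)."""
    v = np.asarray(v, dtype=float)
    return (v - v.min()) / (v.max() - v.min())


def _mean_log_slope(x, y):
    """Mean of log|dy/dx| over consecutive pairs sorted by x."""
    order = np.argsort(x)
    dx = np.diff(x[order])
    dy = np.diff(y[order])
    keep = (dx != 0) & (dy != 0)  # ties would give log(0) or a division by zero
    return np.mean(np.log(np.abs(dy[keep]) / np.abs(dx[keep])))


def igci_score(x, y):
    """Negative value: X -> Y is inferred; positive value: Y -> X."""
    x, y = _rescale(x), _rescale(y)
    return _mean_log_slope(x, y) - _mean_log_slope(y, x)
```

For a noise-free test case with X drawn uniformly from [0, 1] and Y = X³, the score is negative, so X is identified as the cause; swapping the arguments flips the sign.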

As a second pairwise causality check, we use the ANM estimator to find the causal direction from the observed data. The idea behind the ANM method is that the effect is a function of its cause plus a noise term independent of the cause⁶⁶. In the ANM, one performs nonlinear regression, first for X on Y and then for Y on X, and calculates the difference between the test scores for the independence of the residuals in the two cases. A negative difference implies that X causes Y, while a positive difference implies that Y causes X.

For our analysis, we used a Gaussian process (GP) regressor with the squared exponential kernel. Because exact inference of the GP regressor parameters is intractable for datasets with ~10⁴ points, we used the inducing-points-based sparse GP approximation⁶⁷ with the variational free energy (VFE) inference method, as implemented in the Pyro probabilistic programming language⁶⁸. The inducing points were selected uniformly from the observation data points with a step of ~20. In addition to the GP regressor, we also added an option for choosing a two-layer neural network as the regressor (see the accompanying Jupyter notebook). The test for independence of the residuals was performed by calculating the Hilbert–Schmidt Independence Criterion⁶⁹ with a Gaussian kernel whose width was set to the median distance between points in input space.
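The ANM procedure can be sketched in a few lines of NumPy. To keep the example dependency-free, the sparse GP regressor described above is swapped for a simple polynomial fit, which is a stand-in and not the method used in the paper. The independence score is the biased empirical HSIC with Gaussian kernels and the median-distance width. The function names, the polynomial degree, and the standardization step are illustrative assumptions.

```python
import numpy as np


def _standardize(v):
    v = np.asarray(v, dtype=float)
    return (v - v.mean()) / v.std()


def _gram(v):
    """Gaussian kernel matrix with the median pairwise distance as width."""
    dist = np.abs(v[:, None] - v[None, :])
    width = np.median(dist[dist > 0])
    return np.exp(-(dist ** 2) / (2.0 * width ** 2))


def hsic(u, v):
    """Biased empirical Hilbert-Schmidt Independence Criterion."""
    n = len(u)
    centering = np.eye(n) - np.ones((n, n)) / n
    return float(np.trace(_gram(u) @ centering @ _gram(v) @ centering)) / n ** 2


def _residuals(cause, effect, degree=5):
    """Residuals of a polynomial fit (stand-in for the GP regressor)."""
    coeffs = np.polyfit(cause, effect, degree)
    return effect - np.polyval(coeffs, cause)


def anm_score(x, y, degree=5):
    """HSIC(x, residual of y given x) - HSIC(y, residual of x given y).

    Negative values favour X -> Y, positive values favour Y -> X.
    """
    x, y = _standardize(x), _standardize(y)
    forward = hsic(x, _residuals(x, y, degree))
    backward = hsic(y, _residuals(y, x, degree))
    return forward - backward
```

The kernel matrices are dense, so this version is only practical for a few thousand points; for datasets of the size discussed here one would subsample or use an approximate regressor, as the sparse GP does.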

The IGCI typically takes advantage of some specific features of the dataset, whereas the ANM tends to yield good results as long as the additive assumption holds⁷⁰. Before being applied to the experimental observations, both the IGCI and ANM methods were first tested on the publicly available database of labeled cause–effect pairs⁵³, and the resulting prediction accuracy (~64% and ~66%, respectively) was comparable to the results reported for the same database in the machine learning literature⁵³. We then calculated a matrix of pairwise dependencies for the list of descriptors derived from the experimental observations.

The causal sieve matrices for the three explored compositions are shown in Fig. 5 for a selected subset of variables for ease of visualization. The analysis of the full set of structural, chemical, and polarization parameters is available in the accompanying notebook. Here, a value of +1 means that the column variable is identified as the cause and the row variable as the effect (column → row). Interestingly, the structures of the IGCI causal matrices in the studied cases are always such that, for an N × N matrix, one row contains N − 1 positive entries, another row contains N − 2 positive entries, etc. This implies that the descriptors can be formally ranked in order of causal importance. Note that this does not imply that the system can be represented by a linear causal chain. Rather, here we treat this behavior as an observation.
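The ranking described above can be read off a sieve matrix by counting entries. The sketch below assumes the convention stated in the text, in which a +1 entry means that the column variable is the cause of the row variable. The function names are hypothetical, and the 4 × 4 matrix is written by hand for illustration only and does not correspond to any result in Fig. 5.

```python
import numpy as np


def rank_by_causal_importance(names, sieve):
    """Order descriptors by how many other descriptors they are inferred to cause.

    sieve[row, col] == 1 means the column variable is the inferred cause of
    the row variable, so the number of +1 entries in a column counts its effects.
    """
    sieve = np.asarray(sieve)
    n_effects = (sieve == 1).sum(axis=0)
    order = np.argsort(-n_effects, kind="stable")
    return [names[i] for i in order], n_effects[order]


def is_strict_ordering(sieve):
    """True when the per-column effect counts are exactly 0, 1, ..., N-1.

    For a complete set of pairwise directions this holds only if the
    directions contain no cycles, so a full ranking exists.
    """
    n_effects = np.sort((np.asarray(sieve) == 1).sum(axis=0))
    return bool(np.array_equal(n_effects, np.arange(len(n_effects))))


# Hypothetical 4-variable sieve, written by hand for illustration only.
names = ["q1", "q2", "q3", "q4"]
sieve = np.array([
    [0, -1, -1, -1],
    [1, 0, -1, -1],
    [1, 1, 0, -1],
    [1, 1, 1, 0],
])
ranking, counts = rank_by_causal_importance(names, sieve)
```

A matrix that fails the `is_strict_ordering` check contains at least one cycle among the pairwise directions, in which case a simple ranked list would not be a faithful summary.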

Remarkably, the IGCI results are very similar across the three compositions. For the rhombohedral and orthorhombic phases, the chemical composition parametrized as I₁ is identified as a cause variable affecting all other parameters but not affected by them. Second in importance is the molar volume Vol. For the intermediate phase, the Vol variable is higher in “causal importance” than I₁. In all three cases, the (I₁, Vol) variables are followed by the polarization components (P_x, P_y), the differential chemical composition I₅, and finally the tetragonality a/b.

A more direct way to visualize and compare the causal relationships is to arrange the observables as causal chains, as shown in Fig. 6 for all three compositions. As seen in Fig. 6, the dependence chains overlap significantly between the different compositions and between the IGCI and ANM analyses. For IGCI, the chemical variables such as local composition and molar volume are clearly higher in the causal chain. In addition, the polarization components are arranged as (P_y, P_x) → I₅ for all three compositions. This observation suggests that I₅ (the differential contrast between A- and B-site cation intensity) is related to the physical distortion rather than to the chemical composition. Finally, tetragonality is the weakest variable for all three phases and is ranked below the chemical and polarization components. While we are hesitant to draw definitive conclusions from this analysis in the absence of a large body of comparative studies, we note that this behavior generally comports with that expected from the physics of the material (except for the I₅ variable).

For the ANM model, the analysis is less straightforward. Here, for the intermediate and orthorhombic phases, the chemical composition is identified as a more causally significant variable. This agrees with the IGCI results. On the other hand, for the ferroelectric phase, one of the polarization components has the higher rank, affecting molar volume, tetragonality, etc. We note that this behavior is likely due to the nature of the ANM criterion, which relies on the regression between the variable pairs. The fundamental difference between the ferroelectric and non-ferroelectric phases (the presence of domains) affects the regression results and necessitates a transition to Bayesian estimators.

To summarize, we have implemented pairwise causal analysis of the atomic-scale structural, chemical, and polarization phenomena in Sm-doped BiFeO₃ using scanning transmission electron microscopy data as descriptors. The causal sieve approach is implemented using the IGCI and ANM to establish the pairwise causal relationships between observables. The results can be represented as an ordered array of causal importance. For the Sm-BFO composition series studied here, it is generally found via IGCI that the chemical effects, including local composition and molar volume, are higher on the causal chain and are not affected by polarization. The polarization effects are secondary, and differential chemical contrast and tetragonality are the weakest. The ANM results are more difficult to interpret; here we argue that the functional relationships between the variables are fundamentally different in dissimilar phases and therefore the GP interpolation approach produces fundamentally different responses. This behavior will be explored in the future.

Overall, we note that the optimization and discovery of new materials, as well as the understanding of fundamental physical mechanisms, can be significantly accelerated if the causality of the corresponding mechanisms can be understood, allowing exploration of counterfactuals and interventions and avoiding correlative but incorrect conclusions. Fundamental physics offers a large body of knowledge on the functional relationships between materials parameters; however, real material systems are often characterized by only partially known physics or by the presence of non-equilibrated, non-ergodic processes. We expect that in these cases causal analysis can provide the knowledge of cause-and-effect relationships necessary for materials optimization, design, and especially discovery.

Methods

Materials

The combinatorial library of Sm-doped BiFeO₃ and the SrRuO₃ layer were both fabricated by pulsed laser deposition. Specifically, after the deposition chamber reached its base pressure (~2.0 × 10⁻⁸ Torr), the SrTiO₃ (001) substrate was heated to 600 °C, and an oxygen flow was introduced into the chamber to maintain the desired deposition pressure (~100 mTorr). A laser energy density of ~0.8 J/cm² and an ablation frequency of 20 Hz were used for the deposition of the films. During the deposition of the Bi_{1−x}Sm_xFeO₃ layer, a BiFeO₃ target and a SmFeO₃ target were alternately ablated, and a shadow mask was moved accordingly to obtain a uniform composition gradient across the substrate.

Sample characterization

TEM samples were prepared by FIB liftout and local low-energy Ar ion milling, down to 0.5 keV, in a Fischione NanoMill. STEM was performed at 200 kV on a NION UltraSTEM. The three compositions were imaged consecutively to help maintain consistent imaging/microscope conditions. A correction algorithm was applied to correct for slow-scan axis scanning aberrations by reconstruction from two orthogonal source images, as described previously⁷¹. All three datasets were defined with the [100] pseudocubic a-vector along the thin film in-plane axis and the b-vector along the film growth axis. The atomic column positions (A_{1,xy}, A_{2,xy}, A_{3,xy}, A_{4,xy}, and B_{1,xy}; inputs for the polar displacement maps in Fig. 2 and for the descriptors a, b, aΔ, bΔ, θ, Vol, and P) were determined by simultaneous 2D Gaussian fits of local 5-atom perovskite unit cells. The atomic column HAADF intensity (A₁, A₂, A₃, A₄, and B₁; inputs for the I₁–I₅ descriptors) was measured as the local Gaussian-weighted 9-pixel intensity centered at the atom fit position. The source datasets were deliberately misaligned by several degrees from the scan axes to aid the identification of residual scanning artifacts. Display images (Fig. 2) and vector coordinates (a, b, aΔ, bΔ, and P) were subsequently rotated to align the mean a-vector with the horizontal axis. The descriptors were calculated according to Table 1. Unit cell grid maps for selected descriptors are shown in Fig. 3, and the full set is given in Supplementary Figs. 1–3 along with the corresponding plotting information.

Data availability

Data is available via the accompanying notebook (see code availability statement).

Code availability

The code and data can be accessed via an executable Google Colab notebook at https://colab.research.google.com/github/ziatdinovmax/Notebooks-for-papers/blob/master/ferroics-causal-analysis.ipynb.

References

Grinberg, I., Suchomel, M. R., Davies, P. K. & Rappe, A. M. Predicting morphotropic phase boundary locations and transition temperatures in Pb- and Bi-based perovskite solid solutions from crystal chemical data and first-principles calculations. J. Appl. Phys. 98, 094111 (2005).
Damjanovic, D. Ferroelectric, dielectric and piezoelectric properties of ferroelectric thin films and ceramics. Rep. Prog. Phys. 61, 1267–1324 (1998).
Woodward, D. I., Knudsen, J. & Reaney, I. M. Review of crystal and domain structures in the PbZrₓTi₁₋ₓO₃ solid solution. Phys. Rev. B 72, 104110 (2005).
Zeches, R. J. et al. A strain-driven morphotropic phase boundary in BiFeO₃. Science 326, 977–980 (2009).
Glinchuk, M. D. & Stephanovich, V. A. Dynamic properties of relaxor ferroelectrics. J. Appl. Phys. 85, 1722–1726 (1999).
Tagantsev, A. K. & Glazounov, A. E. Does freezing in PbMg1/3Nb2/3O3 relaxor manifest itself in nonlinear dielectric susceptibility? Appl. Phys. Lett. 74, 1910–1912 (1999).
Glinchuk, M. D. & Stephanovich, V. A. Theory of the nonlinear susceptibility of relaxor ferroelectrics. J. Phys. Condens. Matter 10, 11081–11094 (1998).
Glazounov, A. E. & Tagantsev, A. K. Direct evidence for Vogel-Fulcher freezing in relaxor ferroelectrics. Appl. Phys. Lett. 73, 856–858 (1998).
Katzgraber, H. G., Gary, F. B. & Zimanyi, G. T. Fingerprinting hysteresis. Physica B 343, 10–14 (2004).
Vugmeister, B. E. & Rabitz, H. Coexistence of the critical slowing down and glassy freezing in relaxor ferroelectrics. Phys. Rev. B 61, 14448–14453 (2000).
Binder, K. & Reger, J. D. Theory of orientational glasses models, concepts, simulations. Adv. Phys. 41, 547–627 (1992).
Binder, K. & Young, A. P. Spin-glasses—experimental facts, theoretical concepts, and open questions. Rev. Mod. Phys. 58, 801–976 (1986).
Tokura, Y. & Nagaosa, N. Orbital physics in transition-metal oxides. Science 288, 462–468 (2000).
Imada, M., Fujimori, A. & Tokura, Y. Metal-insulator transitions. Rev. Mod. Phys. 70, 1039–1263 (1998).
Fiebig, M., Miyano, K., Tomioka, Y. & Tokura, Y. Visualization of the local insulator-metal transition in Pr0.7Ca0.3MnO3. Science 280, 1925–1928 (1998).
Urushibara, A. et al. Insulator-metal transition and giant magnetoresistance in La₁₋ₓSrₓMnO₃. Phys. Rev. B 51, 14103–14109 (1995).
Tokura, Y. Critical features of colossal magnetoresistive manganites. Rep. Prog. Phys. 69, 797–851 (2006).
Dagotto, E. Complexity in strongly correlated electronic systems. Science 309, 257–262 (2005).
Dagotto, E., Hotta, T. & Moreo, A. Colossal magnetoresistant materials: the key role of phase separation. Phys. Rep. 344, 1–153 (2001).
Wang, Y. et al. Observing atomic collapse resonances in artificial nuclei on graphene. Science 340, 734 (2013).
Allan, M. P. et al. Identifying the ‘fingerprint’ of antiferromagnetic spin fluctuations in iron pnictide superconductors. Nat. Phys. 11, 177–182 (2015).
Muller, D. A. et al. Atomic-scale chemical imaging of composition and bonding by aberration-corrected microscopy. Science 319, 1073 (2008).
Browning, N. D., Chisholm, M. F. & Pennycook, S. J. Atomic-resolution chemical analysis using a scanning transmission electron microscope. Nature 366, 143–146 (1993).
Batson, P. E. Simultaneous STEM imaging and electron energy-loss spectroscopy with atomic-column sensitivity. Nature 366, 727–728 (1993).
Catalan, G. et al. Flexoelectric rotation of polarization in ferroelectric thin films. Nat. Mater. 10, 963–967 (2011).
Nelson, C. T. et al. Spontaneous vortex nanodomain arrays at ferroelectric heterointerfaces. Nano Lett. 11, 828–834 (2011).
Jia, C.-L. et al. Unit-cell scale mapping of ferroelectricity and tetragonality in epitaxial ultrathin ferroelectric films. Nat. Mater. 6, 64–69 (2007).
Sun, Y. et al. Subunit cell–level measurement of polarization in an individual polar vortex. Sci. Adv. 5, eaav4355 (2019).
Borisevich, A. et al. Mapping octahedral tilts and polarization across a domain wall in BiFeO₃ from Z-contrast scanning transmission electron microscopy image atomic column shape analysis. ACS Nano 4, 6071–6079 (2010).
Kan, D. et al. Tuning magnetic anisotropy by interfacially engineering the oxygen coordination environment in a transition metal oxide. Nat. Mater. 15, 432–437 (2016).
Borisevich, A. Y. et al. Suppression of octahedral tilts and associated changes in electronic properties at epitaxial oxide heterostructure interfaces. Phys. Rev. Lett. 105, 087204 (2010).
Sun, C. et al. Atomic and electronic structure of Lomer dislocations at CdTe bicrystal interface. Sci. Rep. 6, 27009 (2016).
Tang, Y. L., Zhu, Y. L. & Ma, X. L. On the benefit of aberration-corrected HAADF-STEM for strain determination and its application to tailoring ferroelectric domain patterns. Ultramicroscopy 160, 57–63 (2016).
Fitting, L., Thiel, S., Schmehl, A., Mannhart, J. & Muller, D. A. Subtleties in ADF imaging and spatially resolved EELS: a case study of low-angle twist boundaries in SrTiO₃. Ultramicroscopy 106, 1053–1061 (2006).
Arredondo, M. et al. Direct evidence for cation non-stoichiometry and cottrell atmospheres around dislocation cores in functional oxide interfaces. Adv. Mater. 22, 2430–2434 (2010).
Grieb, T. et al. Determination of the chemical composition of GaNAs using STEM HAADF imaging and STEM strain state analysis. Ultramicroscopy 117, 15–23 (2012).
Muller, D. A., Nakagawa, N., Ohtomo, A., Grazul, J. L. & Hwang, H. Y. Atomic-scale imaging of nanoengineered oxygen vacancy profiles in SrTiO₃. Nature 430, 657–661 (2004).
Rashidi, M. & Wolkow, R. A. Autonomous scanning probe microscopy in situ tip conditioning through machine learning. ACS Nano 12, 5185–5189 (2018).
Li, Q. et al. Quantification of flexoelectricity in PbTiO₃/SrTiO₃ superlattice polar vortices using machine learning and phase-field modeling. Nat. Commun. 8, 1468 (2017).
Ziatdinov, M., Maksov, A. & Kalinin, S. V. Learning surface molecular structures via machine vision. npj Comput. Mater. 3, 31 (2017).
Ziatdinov, M. et al. Building and exploring libraries of atomic defects in graphene: scanning transmission electron and scanning tunneling microscopy study. Sci. Adv. 5, eaaw8989 (2019).
Ziatdinov, M. et al. Atomic-scale observation of structural and electronic orders in the layered compound alpha-RuCl₃. Nat. Commun. 7, 13774 (2016).
Ziatdinov, M., Nelson, C., Vasudevan, R. K., Chen, D. Y. & Kalinin, S. V. Building ferroelectric from the bottom up: the machine learning analysis of the atomic-scale ferroelectric distortions. Appl. Phys. Lett. 115, 052902 (2019).
Bareinboim, E. & Pearl, J. Causal inference and the data-fusion problem. Proc. Natl Acad. Sci. USA 113, 7345–7352 (2016).
Shpitser, I. & Pearl, J. Complete identification methods for the causal hierarchy. J. Mach. Learn. Res. 9, 1941–1979 (2008).
Pearl, J. Causality: Models, Reasoning and Inference. (Cambridge University Press, 2009).
Borisevich, A. Y. et al. Exploring mesoscopic physics of vacancy-ordered systems through atomic scale observations of topological defects. Phys. Rev. Lett. 109, 065702 (2012).
Vlcek, L. et al. Learning from imperfections: predicting structure and thermodynamics from atomic imaging of fluctuations. ACS Nano 13, 718–727 (2019).
Vlcek, L., Maksov, A., Pan, M. H., Vasudevan, R. K. & Kalinin, S. V. Knowledge extraction from atomically resolved images. ACS Nano 11, 10313–10320 (2017).
Vlcek, L., Sun, W. W. & Kent, P. R. C. Combining configurational energies and forces for molecular force field optimization. J. Chem. Phys. 147, 161713 (2017).
Vlcek, L., Vasudevan, R. K., Jesse, S. & Kalinin, S. V. Consistent integration of experimental and ab initio data into effective physical models. J. Chem. Theory Comput. 13, 5179–5194, https://doi.org/10.1021/acs.jctc.7b00114 (2017).
Tagantsev, A. K., Stolichnov, I., Colla, E. L. & Setter, N. Polarization fatigue in ferroelectric films: basic experimental findings, phenomenological scenarios, and microscopic features. J. Appl. Phys. 90, 1387–1402 (2001).
Mooij, J. M., Peters, J., Janzing, D., Zscheischler, J. & Scholkopf, B. Distinguishing cause from effect using observational data: methods and benchmarks. J. Mach. Learn. Res. 17, 102 (2016).
Troyanchuk, I. O. et al. Phase transitions, magnetic and piezoelectric properties of rare-earth-substituted BiFeO₃ ceramics. J. Am. Ceram. Soc. 94, 4502–4506 (2011).
Borisevich, A. Y. et al. Atomic-scale evolution of modulated phases at the ferroelectric-antiferroelectric morphotropic phase boundary controlled by flexoelectric interaction. Nat. Commun. 3, 775 (2012).
Maran, R. et al. Interface control of a morphotropic phase boundary in epitaxial samarium modified bismuth ferrite superlattices. Phys. Rev. B 90, 245131 (2014).
Maran, R. et al. Enhancement of dielectric properties in epitaxial bismuth ferrite-bismuth samarium ferrite superlattices. Adv. Electron. Mater. 2, 1600170 (2016).
Kubel, F. & Schmid, H. Structure of a ferroelectric and ferroelastic monodomain crystal of the perovskite BiFeO₃. Acta Crystallogr. Sect. B-Struct. Commun. 46, 698–702 (1990).
Tao, H., Lv, J., Zhang, R., Xiang, R. & Wu, J. Lead-free rare earth-modified BiFeO₃ ceramics: phase structure and electrical properties. Mater. Des. 120, 83–89 (2017).
Rubenstein, P.K., Bongers, S., Scholkopf, B. & Mooij, J. From Deterministic ODEs to Dynamic Structural Causal Models. (Auai Press, 2018).
Janzing, D. et al. Information-geometric approach to inferring causal directions. Artif. Intell. 182, 1–31 (2012).
Peters, J., Mooij, J. M., Janzing, D. & Scholkopf, B. Causal discovery with continuous additive noise models. J. Mach. Learn. Res. 15, 2009–2053 (2014).
Daniusis, P. et al. In Proc. 26th Conference on Uncertainty in Artificial Intelligence (UAI) 07:01-08 (Catalina Island, California, 2010).
Janzing, D., Steudel, B., Shajarisales, N. & Schölkopf, B. Justifying Information-Geometric Causal Inference. In Measures of Complexity, 253–265 (Springer, 2015).
Peters, J., Janzing, D. & Schölkopf, B. Elements of Causal Inference: Foundations and Learning Algorithms. (MIT Press, 2017).
Hoyer, P.O., Janzing, D., Mooij, J.M., Peters, J. & Schölkopf, B. Nonlinear causal discovery with additive noise models. In Advances in Neural Information Processing Systems, 689–696 (2009).
Quiñonero-Candela, J. & Rasmussen, C. E. A unifying view of sparse approximate Gaussian process regression. J. Mach. Learn. Res. 6, 1939–1959 (2005).
Bingham, E. et al. Pyro: deep universal probabilistic programming. J. Mach. Learn. Res. 20, 973–978 (2019).
Gretton, A. et al. A Kernel Statistical Test of Independence. In Advances in Neural Information Processing Systems, 585–592 (2007).
Goudet, O. et al. Learning Functional Causal Models with Generative Neural Networks. In Explainable and Interpretable Models in Computer Vision and Machine Learning, 39–80 (Springer, 2018).
Ophus, C., Nelson, C. T. & Ciston, J. Correcting nonlinear drift distortion of scanning probe and scanning transmission electron microscopies from image pairs with orthogonal scan directions. Ultramicroscopy 162, 1–9 (2016).


Acknowledgements

This effort (electron microscopy, feature extraction) is based upon work supported by the U.S. Department of Energy (DOE), Office of Science, Basic Energy Sciences (BES), Materials Sciences and Engineering Division (S.V.K., C.N.) and was performed and partially supported (M.Z., R.K.V.) at the Oak Ridge National Laboratory’s Center for Nanophase Materials Sciences (CNMS), a U.S. Department of Energy, Office of Science User Facility. The work at the University of Maryland was supported in part by the National Institute of Standards and Technology Cooperative Agreement 70NANB17H301 and the Center for Spintronic Materials in Advanced infoRmation Technologies (SMART), one of the centers in nCORE, a Semiconductor Research Corporation (SRC) program sponsored by NSF and NIST. The work of A.N.M. was partially supported by the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No 778070. The authors express their deepest gratitude to Prof. Judea Pearl (UCLA) and Dr. Vint Cerf (Google) for the introduction to the field of causal machine learning and for productive discussions.

Author information

Authors and Affiliations

The Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
Maxim Ziatdinov, Christopher T. Nelson, Rama K. Vasudevan & Sergei V. Kalinin
Computational Sciences and Engineering Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
Maxim Ziatdinov
Department of Materials Science and Engineering, University of Maryland, College Park, MD, 20742, USA
Xiaohang Zhang & Ichiro Takeuchi
Institute for Problems of Materials Science, National Academy of Sciences of Ukraine, Krjijanovskogo 3, Kyiv, 03142, Ukraine
Eugene Eliseev
Institute of Physics, National Academy of Sciences of Ukraine, 46, pr. Nauky, Kyiv, 03028, Ukraine
Anna N. Morozovska

Contributions

S.V.K. proposed the concept and led the paper writing. M.Z. performed all the statistical and causal analysis of the data and co-wrote the paper. C.N. performed the STEM experiments and analyzed the experimental data to extract descriptors. X.Z. and I.T. prepared the samples. R.K.V. aided in the analysis and paper writing. E.E. and A.N.M. aided in providing the physical context for the causal analysis.

Corresponding author

Correspondence to Sergei V. Kalinin.

Ethics declarations

The authors declare no competing financial or nonfinancial interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ziatdinov, M., Nelson, C.T., Zhang, X. et al. Causal analysis of competing atomistic mechanisms in ferroelectric materials from high-resolution scanning transmission electron microscopy data. npj Comput Mater 6, 127 (2020). https://doi.org/10.1038/s41524-020-00396-2

Received: 30 March 2020
Accepted: 24 July 2020
Published: 21 August 2020
DOI: https://doi.org/10.1038/s41524-020-00396-2
