Automated stopping criterion for spectral measurements with active learning

Ueno, Tetsuro; Ishibashi, Hideaki; Hino, Hideitsu; Ono, Kanta

doi:10.1038/s41524-021-00606-5

Article
Open access
Published: 25 August 2021

npj Computational Materials volume 7, Article number: 139 (2021)

Subjects

Characterization and analytical techniques

Abstract

The automated stopping of a spectral measurement with active learning is proposed. The optimal stopping of the measurement is realised with a stopping criterion based on the upper bound of the posterior average of the generalisation error of the Gaussian process regression. It is revealed that the automated stopping criterion of the spectral measurement gives an approximated X-ray absorption spectrum with sufficient accuracy and reduced data size. The proposed method is not only a proof-of-concept of the optimal stopping problem in active learning but also the key to enhancing the efficiency of spectral measurements for high-throughput experiments in the era of materials informatics.

Introduction

Machine learning and artificial intelligence (AI)-related techniques have been rapidly implemented in daily lives and industries, such as electronic commerce, manufacturing, and automated driving. This situation is not an exception for various scientific fields; in particular, materials science combined with information science is referred to as ‘materials informatics’^{1,2,3,4,5,6,7}. Materials informatics has been successful in predicting and finding new functional materials^{8,9,10,11,12,13} or optimising devices and material microstructures^{14,15}.

In general, machine learning techniques, such as deep learning, require substantial data to learn. Therefore, in materials informatics, theoretical calculations, such as first-principles calculations or molecular dynamics simulations, are useful methods for accumulating materials property data to construct a materials database^{16,17,18}. However, many important problems in materials science, e.g., the coercivity in permanent magnets, catalytic reactions, and the charge and discharge of batteries, are difficult to describe completely with theoretical calculations. These physical and chemical phenomena have complexities that span multiple temporal and spatial scales; they are difficult to model and require a tremendous computational cost. Therefore, an experimental approach is essential for understanding these phenomena. However, data acquisition by experiments incurs various costs related to real specimen preparation, human-operated measurement equipment, trial-and-error analyses, etc. There is a strong demand to promote the efficiency of experimental methods to accelerate materials science beyond traditional processes. In addition to materials informatics, ‘measurement informatics’, a research area that integrates measurements and machine learning, has been explored to date. With this approach, the measurement^{19,20,21} and data analysis efficiency can be enhanced^{22,23,24,25,26}.

Active learning (AL) is a machine learning scheme used to obtain predictive models with high precision at a limited cost through the sophisticated selection of samples for labelling²⁷. Similar to other machine learning methods, AL is utilised in materials informatics^{28,29,30,31,32,33,34} and measurement informatics^{19,20}. Ueno et al. developed a method for X-ray magnetic circular dichroism (XMCD) spectral measurement with AL¹⁹. XMCD spectroscopy is a variation of X-ray absorption spectroscopy (XAS) using circularly polarised X-rays to investigate the element-specific magnetic properties of materials³⁵. In particular, magneto-optical sum rules relate the XMCD spectrum to the spin and orbital magnetic moments, which are the fundamental physical parameters of elements^{36,37,38}.

Measurements of XAS and XMCD spectra are usually performed in a step-by-step manner, i.e., the measurement of the X-ray absorption intensity and the tuning of the X-ray energy are repeated over the energy range of the spectrum (Fig. 1b). In conventional XAS and XMCD measurements, the numbers of energy points and energy steps are predetermined by the experimenter, and usually, several hundred points are measured. These measurement conditions are set based on the experimenter’s experience and intuition. This type of conventional design of experiment (DoE) can potentially be optimised by a machine learning technique. The XMCD spectral measurement with AL uses Gaussian process regression (GPR)³⁹ to predict the whole spectral shape from the measured data (Fig. 1a). The standard deviation of the prediction is used to determine the optimal energy point to measure next. The measurement stops when the convergence criterion is satisfied. Compared to conventional measurements, this method succeeded in reducing the number of measurement energy points.

**Fig. 1: Concept of the spectral measurement with active learning.**

However, the XMCD spectral measurement with AL depends on the magnetic moment evaluated from the predicted spectrum as the stopping criterion. This condition means that the method is only applicable when the relation between the spectrum and the physical parameter is known. In addition, the physical parameter must be evaluated quickly from the spectrum to fit in the AL cycle of the measurement and GPR. In the common situation of a spectral measurement, the evaluation of physical parameters from the spectra is not straightforward; e.g., a comparison between the experimental and theoretical spectra is often needed. Therefore, a stopping criterion without the physical parameter is required to apply the spectral measurement with AL to general types of spectra.

The optimal stopping problem is a long-standing problem in AL²⁷. Without an appropriate stopping criterion, the active learner would either require many unnecessary samples to be labelled or use too few samples, resulting in poor predictive performance. Despite its importance, there have been few studies on stopping timing for AL^{40,41,42}, mainly because of its problem dependency. From the viewpoint of materials science, an optimal stopping rule is highly desired to avoid useless and costly experiments. In the typical problem setting of AL in materials and measurement informatics, the experiment is stopped when the budget is exhausted or the experimenter is satisfied with the results²⁸. The former assumes an experimenter who needs the best result in a limited time. In such a situation, the experimenter will set the maximum number of experimental iterations within the limited time⁴³. The latter supposes that the experiment is iterated until the experimenter is satisfied with the target property, e.g. a search for the material with the highest melting point⁴⁴ or the convergence of the magnetic moments within a predetermined threshold¹⁹.

In this paper, we applied a universal stopping criterion for XAS spectral measurement with AL. The stopping criterion is based on the stability of the expected generalisation errors⁴⁵. This stopping criterion is completely evaluated on a mathematical basis; therefore, it is applicable to general spectral measurements whose relation to the physical parameter is unknown. We applied the method to the spectral measurement with AL of several types of simulated and experimental XAS spectra. It is revealed that the automated stopping criterion of the spectral measurement gives an approximated XAS spectrum with sufficient accuracy. The proposed method can be applied not only to spectral measurements but also to other types of measurements and improves the efficiency of high-throughput experiments in the era of materials informatics.

Results

Spectral measurements with AL

Figure 1c shows the flowchart of a spectral measurement with AL. Spectral measurement with AL includes the following steps: (1) First, initial spectral data Y₀(X₀) are sampled. Here, Y₀ = (y_i, …, y_j) represents the spectral intensity at energy points X₀ = (x_i, …, x_j). (2) Subsequently, GPR is applied to the initial data, and we obtain a mean μ_n and standard deviation σ_n for each energy point. n = 0, 1, … represents the number of samplings (n = 0 for the initial sampling). The mean μ_n is regarded as a predicted spectrum. The stopping criterion is evaluated with the result of the GPR fitting. (3) If the stopping criterion is not satisfied, then the next sampling point x_next is automatically determined based on an acquisition function. (4) Subsequently, the new energy point is sampled and added to the measured data Y_n = (y_i, …, y_next, …, y_j). GPR is then applied to Y_n(X_n) again, as in step (2). Steps (2) to (4) are repeated until the stopping criterion is satisfied.
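Steps (1) to (4) can be sketched as a generic loop. The following is only an illustrative skeleton, not the implementation used in this work: the function names, the `should_stop` callback and the toy `nearest_neighbour_fit` (a stand-in for GPR whose "standard deviation" is simply the distance to the nearest measured point) are all assumptions introduced here, and the next point is chosen by the largest standard deviation rather than by the acquisition function of Eq. (11).

```python
from typing import List, Tuple

Prediction = Tuple[List[float], List[float]]  # (mean, std) on the pool grid


def nearest_neighbour_fit(xs, ys, pool) -> Prediction:
    """Toy stand-in for GPR (assumption): mean is the nearest measured
    intensity, std is the distance to the nearest measured energy."""
    mean, std = [], []
    for x in pool:
        j = min(range(len(xs)), key=lambda i: abs(xs[i] - x))
        mean.append(ys[j])
        std.append(abs(xs[j] - x))
    return mean, std


def run_active_measurement(measure, pool, initial, fit, should_stop, max_points):
    # Step (1): initial sampling.
    xs = list(initial)
    ys = [measure(x) for x in xs]
    while True:
        # Step (2): fit the model and evaluate the stopping criterion.
        mean, std = fit(xs, ys, pool)
        if should_stop((mean, std), len(xs)) or len(xs) >= max_points:
            return xs, ys
        # Step (3): pick the unmeasured pool point with the largest std.
        candidates = [i for i, x in enumerate(pool) if x not in xs]
        if not candidates:
            return xs, ys
        best = max(candidates, key=lambda i: std[i])
        # Step (4): measure the new point and add it to the data.
        xs.append(pool[best])
        ys.append(measure(pool[best]))


# Hypothetical usage: an 11-point energy grid and a synthetic "spectrum".
pool = [float(e) for e in range(11)]
xs, ys = run_active_measurement(
    measure=lambda x: x * x,
    pool=pool,
    initial=[0.0, 10.0],
    fit=nearest_neighbour_fit,
    should_stop=lambda pred, n: max(pred[1]) <= 1.0,
    max_points=len(pool),
)
```

Passing the fit and the stopping rule as callables keeps the loop itself independent of the particular regression model and criterion, which mirrors the separation of steps in the flowchart.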

In our previous work¹⁹, a physical parameter of interest was evaluated by comparing the fitted spectrum and the reference spectrum of a known material. When the physical parameter converged to within a predetermined accuracy range, the measurement was terminated. However, this method has some limitations for general use, i.e., it is only applicable to materials whose physical parameter can be evaluated with the similarity measure and for which a reference value of the physical parameter is available. Moreover, different results are generated for different similarity measures, and the threshold must be properly determined. Therefore, we adopt a criterion that depends only on the fitted spectra to broaden the application of spectral measurement with AL.

The choice of a covariance function (kernel) $K({{{x}}},{{{x}}}^{\prime} )$ is essential in GPR. We adopt the standard power exponential correlation function defined by

$$K({{{x}}},{{{x}}}^{\prime} )={\theta }_{1}\exp \left(-{\left(\frac{| {{{x}}}-{{{x}}}^{\prime} | }{{\theta }_{2}}\right)}^{2}\right)$$

(1)

with the amplitude parameter θ₁ and bandwidth parameter θ₂. The power exponential or Gaussian correlation function performs better than the Matérn correlation function for the fitting of the XAS spectra¹⁹.

Stopping criterion

Approximating the spectrum by the GPR is considered a problem of supervised learning. In this setting, the goodness of approximation is evaluated by the average prediction error of the intensity at an unseen energy point, which is called the generalisation error defined by

$${{{\mathcal{L}}}}(f)=\int {\mathrm{d}}y\int {\mathrm{d}}x{(y-f(x))}^{2}p(x,y),$$

(2)

where p(x, y) is a joint density function of (x, y). Note that f(x) is a stochastic predictor sampled from the fitted Gaussian process; hence, ${{{\mathcal{L}}}}(f)$ is a random variable. In a problem setting of the spectral measurement, x, y and f(x) are the energy, the ground truth (unknown) spectrum and the fitted curve by the GPR model, respectively.

Let p_t(f) be the posterior distribution of the predictive model f(x) obtained by fitting a Gaussian process to the observation up to time t. Then, the posterior average of the generalisation error is defined by

$${{{{\mathcal{L}}}}}_{t}=\int {\mathrm{d}}f\,{p}_{t}(f){{{\mathcal{L}}}}(f).$$

(3)

If the gap $| {{{{\mathcal{L}}}}}_{t}-{{{{\mathcal{L}}}}}_{t+1}|$ of the posterior average of generalisation errors at t and t + 1 is small enough, then there is only a small gain from performing an additional observation, and the experiment should be stopped. It is not possible to directly calculate the generalisation error because we do not have access to the distribution p(x, y). A method to estimate the upper bound of ${{{{\mathcal{L}}}}}_{t}-{{{{\mathcal{L}}}}}_{t+1}$ is proposed in ref. ⁴⁶, and a stopping criterion of AL based on the convergence of the gap $| {{{{\mathcal{L}}}}}_{t}-{{{{\mathcal{L}}}}}_{t+1}|$ is developed in ref. ⁴⁵. Suppose ${{{\mathcal{L}}}}(f)\in [a,b]$, then

$$| {{{{\mathcal{L}}}}}_{t}-{{{{\mathcal{L}}}}}_{t+1}| \le (b-a)\{r({p}_{t},{p}_{t+1})+r({p}_{t+1},{p}_{t})\}$$

(4)

holds where

$$r(p,q)=\exp \left\{{W}_{0}\left(\frac{{{{\rm{KL}}}}(p| | q)-1}{{{{\rm{e}}}}}\right)+1\right\}-1.$$

(5)

In the above equation, W₀ is the principal branch of the Lambert W-function⁴⁷, ${{{\rm{KL}}}}(p| | q)=\int p(x){{\mathrm{log}}}\,\frac{p(x)}{q(x)}{\mathrm{d}}x$ is the Kullback–Leibler divergence⁴⁸, and e is the base of the natural logarithm. The Kullback–Leibler divergence between the posterior distributions p_t and p_{t+1} has been shown to be exactly computable from the observed points ${\{({x}_{i},{y}_{i})\}}_{i = 1}^{t}$; hence, the upper bound of $| {{{{\mathcal{L}}}}}_{t}-{{{{\mathcal{L}}}}}_{t+1}|$ is computable from the data at hand. To align the scale of r(p_t, p_{t+1}) + r(p_{t+1}, p_t) and remove the constant term b − a, we consider the ratio ${\lambda }_{t}=\frac{r({p}_{t},{p}_{t+1})+r({p}_{t+1},{p}_{t})}{r({p}_{1},{p}_{2})+r({p}_{2},{p}_{1})}$. When this ratio is smaller than a certain threshold λ ∈ [0, 1], the experiment is stopped. Figure 2 shows the schematic of the stopping criterion with the error ratio. Intuitively, the threshold λ is regarded as the expected rate of improvement of the fitting by one additional observation. The stopping time is not very sensitive to the value of λ when it is set to be small enough, and it is possible to determine this parameter by a simulation study in advance. We take this strategy and report the experimental results in the following section.
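As a rough numerical illustration of Eqs. (4) and (5) and of the ratio λ_t, the sketch below evaluates r from a given KL divergence and finds the first step at which the ratio falls below a threshold. Two points are assumptions of this sketch rather than part of the method description: the KL divergences between successive posteriors are taken as given inputs (their computation for Gaussian process posteriors is not reproduced here), and r is obtained by bisection on the equation (1 + r) ln(1 + r) − r = KL, of which Eq. (5) is the closed-form solution, instead of calling a Lambert W routine. The function names are hypothetical.

```python
import math
from typing import List, Optional, Sequence, Tuple


def r_bound(kl: float) -> float:
    """Solve (1 + r) * ln(1 + r) - r = kl for r >= 0 by bisection.

    Substituting r = exp(W0((kl - 1) / e) + 1) - 1 shows that this is
    the same quantity as Eq. (5)."""
    if kl < 0.0:
        raise ValueError("KL divergence must be non-negative")
    if kl == 0.0:
        return 0.0

    def g(r: float) -> float:
        return (1.0 + r) * math.log1p(r) - r

    lo, hi = 0.0, 1.0
    while g(hi) < kl:  # g is increasing for r >= 0, so expand until bracketed
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) < kl:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def error_ratios(kl_pairs: Sequence[Tuple[float, float]]) -> List[float]:
    """kl_pairs[t-1] = (KL(p_t || p_{t+1}), KL(p_{t+1} || p_t)), t = 1, 2, ..."""
    sums = [r_bound(a) + r_bound(b) for a, b in kl_pairs]
    if sums[0] == 0.0:
        raise ValueError("the first step must change the posterior")
    return [s / sums[0] for s in sums]


def stopping_step(kl_pairs, threshold: float) -> Optional[int]:
    """First step t (1-based) whose error ratio is below the threshold."""
    for t, lam in enumerate(error_ratios(kl_pairs), start=1):
        if lam < threshold:
            return t
    return None
```

For KL = 1, Eq. (5) gives r = exp(W₀(0) + 1) − 1 = e − 1, which offers a simple check of the bisection against the closed form.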

**Fig. 2: Schematic of the stopping criterion with the error ratio.**

Application to the simulated spectrum

First, we applied the method to the simulated noise-free Ni L_2,3 XAS of divalent nickel ions (Ni²⁺) to verify the effectiveness of the overall strategy. Details of the simulation are described in the Methods section. All spectra used in the study are presented in Supplementary Fig. 1. Figure 3a–f shows snapshots of the GPR fitting of the simulated L_2,3 XAS of Ni²⁺. In Fig. 3a, randomly selected initial data points (data size = 10) and the GPR fitting are shown. The L₃ peak happens to be measured in the initial sampling. The GPR prediction exhibits a large standard deviation between the sampled data points, and the next sampling point is chosen from these x values with a large value of the acquisition function (Eq. (11)). The GPR fitting after 10 samplings (data size = 20) is shown in Fig. 3b. The whole spectral shape appears, but the standard deviation between the sampled data points is still large, approximately one-third of the intensity at the L₃ peak. Figure 3c shows the GPR fitting after 50 samplings (data size = 60). Satellite peaks around the L₃ main peak and the multiplet structure around the L₂ peak appear at this sampling density. Figure 3d–f shows the results of the GPR fitting for the different stopping timings, i.e., different thresholds. The standard deviation is small compared to the intensity of the L₃ and L₂ peaks over the whole energy range. The intensity relations of the multiplet structure around the L₂ peak are correctly approximated. In Fig. 3d, e, the difference between the GPR fittings for different thresholds only appears in the standard deviations around non-peak regions. In Fig. 3f, the GPR fitting for a data size = 178 is almost the same as the GPR fitting for a data size = 116, but an increase in sampling points around the peak region is visible.

**Fig. 3: Spectral measurement of simulated Ni L_2,3 XAS for Ni²⁺ with active learning.**

To visualise the progress of the spectral measurement and the stopping timing, the error ratio and the test error versus data size are shown in Fig. 3g, h, respectively. In Fig. 3g, the error ratio λ_t is plotted as a function of the data size, i.e., the number of measurement points x. The error ratio decreases with increasing data size and converges to an almost constant value. Figure 3h shows the test error, i.e., the posterior average of the generalisation errors (Eq. (3)), versus the data size. In the initial stage of the measurement, the test error decreases more steeply with increasing data size than the error ratio does, and it also converges to a constant value. The spikes in the error ratio in Fig. 3g arise because the error ratio bounds the difference between the test errors at successive steps in Fig. 3h: the spikes in the error ratio coincide with the spikes in the difference of the test error between the n-th and (n − 1)-th sampling. The spikes are inevitable because discontinuous improvements of the test error occasionally arise in AL. For this reason, the minimum value of the error ratio reached so far is also plotted in Fig. 3g. This minimum value gradually decreases with increasing data size; therefore, premature stopping of a measurement can be avoided. The vertical dashed lines in Fig. 3g, h indicate the stopping timings for different thresholds that correspond to the GPR fittings shown in Fig. 3d–f. These results indicate that the automated stopping of the XAS measurement based on the generalisation error gives a GPR fitting with small errors from the ground truth spectrum and a reduced data size.

To give an overall picture of the spectral measurement with AL, the predicted mean μ and standard deviation σ versus data size are visualised as heat maps in Fig. 3i, j, respectively. The stopping timings for different thresholds are represented as horizontal dashed lines. In Fig. 3i, the peak structures around the L₃ and L₂ regions are shown as thick coloured vertical lines. The peak structures become clearer with increasing data size. On the other hand, the standard deviation decreases with increasing data size, as shown in Fig. 3j. The sampled energy points, indicated by orange markers in Fig. 3i, j, are nearly uniformly distributed over the whole spectral energy range. We confirmed the nearly uniform distribution by plotting a histogram of the sampled energy points, as shown in Supplementary Fig. 6a. Thus, it seems that biased sampling, e.g., intensive sampling around the peak region, does not occur in the spectral measurement with AL when the acquisition function of Eq. (11) is used.

Application to the experimental spectrum

Next, we applied the method to the experimental data to demonstrate its applicability to actual spectral measurements. Experimental data inherently include measurement noise; therefore, it is essential to ascertain the noise tolerance of the present method. Moreover, the experimental XAS spectrum of Ni metal has a finite background that comes from a continuum state approximated by the double step-like function. Figure 4a–f shows snapshots of the GPR fitting of the experimental Ni L_2,3 XAS of nickel metal. In Fig. 4a, randomly selected initial data points and the GPR fitting are shown. The standard deviation σ is large in sparsely sampled energy regions, as is the usual behaviour of the GPR. The overall spectral shape appears after 50 samplings, as shown in Fig. 4c. In addition to the L₃ and L₂ main peaks, the so-called 6 eV satellite⁴⁹ can be seen at ~ 858 eV at this sampling density. Figure 4d–f shows the results of the GPR fitting for the different stopping timings with a specific threshold. Similar to the results for the simulated Ni²⁺ XAS spectrum, the L₃ main peak is properly approximated by the GPR fitting with a stopping timing of λ = 0.1. The standard deviation around the L₂ peak region and other non-peak energy regions decreases with increasing measurements, as shown in Fig. 4e, f.

**Fig. 4: Spectral measurement of experimental Ni L_2,3 XAS with active learning.**

Figure 4g, h shows the error ratio and the test error versus the data size. As shown in Fig. 4g, the error ratio decreases with increasing data size and converges to a constant value. The occasional spikes have the same origin as those in the results for Ni²⁺ shown in Fig. 3g. Figure 4h shows the data size dependence of the test error. The test error decreases with increasing data size and converges to a constant value, as in the case of Ni²⁺. The stopping timings (vertical dashed lines in Fig. 4g, h) appear at data sizes from 69 to 84. The GPR fittings shown in Fig. 4d–f indicate that a relevant stopping timing gives accurate spectral shapes, including main peaks and multiplet structures. These results indicate that the present method also works for the experimental XAS spectrum with noise. The animation of the GPR fittings and the evolution of the error ratio and the test error presented above are shown in the Supplementary Movie.

Heat maps of the predicted mean μ and standard deviation σ versus the data size are shown in Fig. 4i, j. In Fig. 4i, peak structures around the L₃ and L₂ regions appear as thick coloured vertical lines and become explicit with increasing data size as with the case of the simulated Ni²⁺ XAS. The sampling tendency is also similar to the case of the simulated Ni²⁺ XAS measurement, and sampling energy points are uniformly distributed in the whole spectral energy range (see also a histogram in Supplementary Fig. 6d).

Time-cost evaluation

It is essential to estimate the time cost, i.e. the total measurement time, of the spectral measurement with AL for realistic application to measurements. The time cost for the spectral measurement with AL, t_AL, is defined as

$${t}_{{{{\rm{AL}}}}}=\mathop{\sum}\limits_{n}\left({t}_{{{{\rm{ene}}}}}\times {{\Delta }}{E}_{n}^{{{{\rm{Init}}}}}+{t}_{{{{\rm{meas}}}}}+{t}_{{{{\rm{GP}}}}}\right)+\mathop{\sum}\limits_{n}\left({t}_{{{{\rm{ene}}}}}\times {{\Delta }}{E}_{n}^{{{{\rm{AL}}}}}+{t}_{{{{\rm{meas}}}}}+{t}_{{{{\rm{GP}}}}}+{t}_{{{{\rm{SC}}}}}\right),$$

(6)

where t_ene, t_meas, t_GP and t_SC are the time to change unit energy, e.g. 1 eV, time to measure single spectral intensity, time to compute the GPR and time to evaluate the stopping criterion, respectively. ${{\Delta }}{E}_{n}^{{{{\rm{Init}}}}}$ is a distance between energies in the initial sampling and ${{\Delta }}{E}_{n}^{{{{\rm{AL}}}}}$ is a distance between energies of the n-th and the (n − 1)-th sampling. Thus, t_AL consists of time costs for the initial sampling and the sampling in AL. On the other hand, the time cost for the conventional DoE t_CDoE is defined as

$${t}_{{{{\rm{CDoE}}}}}=\mathop{\sum}\limits_{n}\left({t}_{{{{\rm{ene}}}}}\times {{\Delta }}{E}^{{{{\rm{CDoE}}}}}+{t}_{{{{\rm{meas}}}}}\right).$$

(7)

For simplicity, let ΔE^CDoE be a constant by assuming that a constant energy step is used over the whole spectral energy range.
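Eqs. (6) and (7) can be evaluated directly once the visited energies are known. The sketch below follows the two equations as written above; the function names are hypothetical, and one detail is an assumption of this sketch: the scan is taken to start at the first initial energy, so that the first ΔE is zero, and the first AL step is measured from the last initial point.

```python
from typing import Sequence


def time_cost_al(
    init_energies: Sequence[float],
    al_energies: Sequence[float],
    t_ene: float,
    t_meas: float,
    t_gp: float,
    t_sc: float,
) -> float:
    """Time cost of Eq. (6): initial sampling followed by AL sampling."""
    total = 0.0
    previous = init_energies[0]  # assumption: the scan starts here
    for energy in init_energies:
        # First sum of Eq. (6): energy change, measurement, GPR.
        total += t_ene * abs(energy - previous) + t_meas + t_gp
        previous = energy
    for energy in al_energies:
        # Second sum of Eq. (6): additionally the stopping criterion.
        total += t_ene * abs(energy - previous) + t_meas + t_gp + t_sc
        previous = energy
    return total


def time_cost_cdoe(n_points: int, delta_e: float, t_ene: float, t_meas: float) -> float:
    """Time cost of Eq. (7) with a constant energy step delta_e."""
    return n_points * (t_ene * delta_e + t_meas)


# Hypothetical example in arbitrary time units.
t_al = time_cost_al([850.0, 860.0, 855.0], [870.0, 852.0],
                    t_ene=1.0, t_meas=1.0, t_gp=0.1, t_sc=0.0)
t_cdoe = time_cost_cdoe(900, 0.05, t_ene=1.0, t_meas=1.0)
```

Because the AL energies jump back and forth, the energy-change term dominates when t_meas/t_ene is small, whereas the number of points dominates when the ratio is large.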

Figure 5 shows the time cost evaluated for the Ni²⁺ L_2,3 XAS spectral measurement with various t_meas/t_ene ratios, which vary with the experimental conditions. In this evaluation, we assume that the signal-to-noise ratio (SNR) of the XAS spectrum does not depend on t_meas, i.e. we ignore the effect of the SNR on the spectral measurement with AL, which will be explored in a further study. Meanwhile, we set t_GP/t_ene = 0.1 for all evaluations because the computational time for GPR t_GP is much shorter than t_ene and t_meas when the number of measurements is of the order of <1000. We note that the computational cost of evaluating the stopping criterion is negligible. The time cost in Fig. 5 is given in arbitrary units; the vertical axis can be read in seconds if, for example, t_meas = t_ene = 1 s and t_GP = 0.1 s in the case of Fig. 5a. For both the spectral measurement with AL and the conventional DoE, the time cost increases monotonically with data size. The slope of the time cost of the conventional DoE is constant because ΔE^CDoE is constant, as mentioned above. On the other hand, the slope of the time cost of the spectral measurement with AL changes because ${{\Delta }}{E}_{n}^{{{{\rm{AL}}}}}$ changes at each sampling. At the stopping timing of the spectral measurement with AL, an experimenter can observe the whole spectral shape, as shown in Fig. 3d–f. In the conventional DoE, the whole spectral shape appears only after the measurement is finished. The spectral measurement with AL outperforms the conventional DoE, i.e. a lower time cost is achieved in most cases, except for the case of t_meas/t_ene = 1 with the threshold λ = 0.025. A similar tendency is observed for the measurement of Co²⁺ L_2,3 XAS; however, in other cases, the spectral measurement with AL always realises a lower time cost than that of the conventional DoE, as shown in Supplementary Figs. 7–11.

**Fig. 5: Time cost estimation of the Ni L_2,3 XAS spectral measurement for simulated Ni²⁺.**

Discussion

In this paper, we proposed the application of an automated stopping criterion for spectral measurement with AL to enhance the efficiency of spectral measurements. The method was applied to the simulated and experimental Ni L_2,3 XAS spectra. Predicted spectra with GPR fitting demonstrate satisfactory accuracy at different stopping timings with several thresholds. The method was also applied to XAS spectra other than Ni. The GPR fittings and the data size dependency of the error ratio and the test error for simulated and experimental Mn and Co L_2,3 XAS are shown in Supplementary Figs. 2–5. These results show a similar tendency to the simulated and experimental Ni L_2,3 XAS results. Time costs for various XAS spectral measurements with t_meas/t_ene = 1 are summarised in Fig. 6. At the automated stopping timing, the GPR fittings give a reasonable approximation of the XAS spectra with a number of measurements that is dramatically reduced compared to the conventional experimental design. These results show that the automated stopping criterion works well for the spectral measurement with AL in general.

**Fig. 6: Time costs at stopping timings for various XAS spectral measurements.**

Based on the estimation of the time cost, the spectral measurement with AL outperforms that with the conventional DoE in many cases, as shown in Figs. 5 and 6. In particular, the advantage of the spectral measurement with AL is pronounced for large t_meas/t_ene ratios. Therefore, the spectral measurement with AL is especially effective in measurements with a long measurement time per energy or other scanning parameter, such as XAS measurement of a very dilute system, e.g. a single molecule on a surface, or inelastic neutron scattering, generally known as a neutron-hungry experiment. It is also effective for experiments such as scanning transmission X-ray microscopy (STXM). In an STXM experiment, a two-dimensional spatial scan is performed at each X-ray energy, so the measurement time per energy becomes much longer than the time needed to change the X-ray energy. Thus, the advantage of reducing the number of measurement points by the spectral measurement with AL becomes prominent for experiments with a long measurement time per scanning parameter. Note that the so-called ‘on-the-fly’ scan is used as a very quick measurement technique, in particular for a single XAS spectral measurement. To improve the efficiency of the measurement, it is important to choose appropriately between such techniques and the spectral measurement with AL depending on the experimenter’s purpose.

Here, we discuss why the automated stopping criterion for the spectral measurement with AL works. In Fig. 1a, the measurement energy points in the spectral measurement with AL jump backward and forward over the whole spectral energy range. Thus, GPR fitting can approximate the rough shape of the spectrum in the early stage of the measurement, and the generalisation error becomes small. Fine spectral features, such as satellite peaks, appear as the measurement progresses; however, these measurements are not very relevant to the improvement in the test error between different thresholds. In other words, the method works with a balance between ‘exploration’ and ‘exploitation’. Exploratory sampling is more important than exploitative sampling for reducing the generalisation error and stopping the experiment with a minimum number of measurements in the spectral measurement with AL. In contrast, exploitative sampling becomes effective when one wants to measure detailed spectral features. This type of sampling becomes possible by using an acquisition function proposed in the literature²⁰. Prior knowledge regarding spectra could also be used to design an acquisition function; however, this subject is a topic for future research.

In conclusion, we applied the stopping criterion based on the stability of the expected generalisation errors to the XAS spectral measurement with AL. This stopping criterion can be evaluated from the self-contained information of the GPR fitting. The automated stopping criterion of the spectral measurement gives an approximated XAS spectrum with sufficient accuracy. This work applies the state-of-the-art theory of the optimal stopping problem in AL to actual measurements. The proposed method can be applied not only to spectral measurements but also to other types of measurements. An enhancement of the spectral measurement efficiency enables the high-throughput characterisation of materials for the construction of an experimental materials database in the era of materials informatics.

Methods

Simulation and measurement of X-ray absorption spectra

The simulation of XAS spectra was performed using CTM4XAS software⁵⁰.

Ni, Mn and Co L_2,3 XAS spectra were calculated for Ni²⁺, Mn²⁺, and Co²⁺ ions. Crystal field parameters were set to O_h symmetry with 10Dq = 1.0. The calculated multiplets were broadened with Lorentzian and Gaussian functions of 0.2 eV half-width at half-maximum each.

The XAS experiment was performed at the BL-19B at the Photon Factory, Institute of Materials Structure Science, High Energy Accelerator Research Organization, Japan⁵¹. Three types of samples, manganese dioxide (MnO₂) powder and pieces of bulk cobalt (Co) and nickel (Ni), were mounted at the sample manipulator in the vacuum chamber. Mn, Co and Ni L_2,3 XAS were obtained at room temperature by the total electron yield method, which measures the sample drain current. XAS spectra were obtained by dividing the sample current I by the mirror current I₀ to negate the intensity variation in the incident X-ray. All simulated and experimental spectra used in this study are shown in Supplementary Fig. 1.

Gaussian process regression

The fundamental idea of spectral measurement with AL is to identify the spectral measurement problem with supervised curve fitting, i.e. a regression problem. The energy point x is considered the explanatory variable, and the corresponding intensity y is the response variable in terms of regression analysis. In this section, we briefly explain GPR, which was adopted in the present study to realise AL. The details of GPR are thoroughly described in ref. ³⁹.

For energy points x_i, we assume that the intensity y_i at x_i is modelled as y_i = f(x_i) + ε_i, where ${\varepsilon }_{i} \sim {{{\mathcal{N}}}}(0,{\xi }^{2})$ is the observation noise. In Gaussian process modelling, function ${f}{{(x)}}$ itself is assumed to be a random variable under a Gaussian distribution with mean ${{\mu_0}{(x)}}$ and covariance function ${{{\bf{K}}}}({{{x}}},{{{x}}}^{\prime} )$; hence, y is also a realisation of the Gaussian random variable with mean ${{\mu_0}{(x)}}$ and variance ${{\sigma}^2(x)}={\bf{K}}{(x,x)}+\xi^2$. Given a collection of observations y corresponding to the collection of energy points X_n = (x₁, …, x_n), the mean function and the variance function of the Bayesian posterior distribution are denoted by $\hat{{{{\mu}}}}({{{x}}})$ and ${\hat{{{{\sigma}}}}}^{2}({{{x}}})$, respectively. To obtain the mean and variance values at a newly observed point x^*, we consider the joint distribution of y and f(x^*), which is expressed as

$$\left(\begin{array}{l}{{{\bf{y}}}}\\ f({x}^{* })\end{array}\right) \sim {{{\mathcal{N}}}}\left(\left(\begin{array}{l}{{{{\boldsymbol{\mu }}}}}_{0}({{{{\bf{X}}}}}_{n})\\ {\mu }_{0}({x}^{* })\end{array}\right),\left(\begin{array}{ll}{{{{\bf{K}}}}}_{n,n}+{\xi }^{2}{{{{\bf{I}}}}}_{n}&{{{{\bf{K}}}}}_{n,* }\\ {{{{\bf{K}}}}}_{n,* }^{\top }&{{{\bf{K}}}}({x}^{* },{x}^{* })\end{array}\right)\right)$$

(8)

where ${\bf{K}}_{n,n} = {\bf{K}}({\bf{X}}_n, {\bf{X}}_n) \in \mathbb{R}^{n \times n}$ and ${\bf{K}}_{n,*} = {\bf{K}}({\bf{X}}_n, x^*) \in \mathbb{R}^{n}$. The mean function value of the posterior distribution of f(x^*) is obtained as

$$\hat{\mu }({x}^{* })={\mu }_{0}({x}^{* })+{{{{\bf{k}}}}}_{n}{({x}^{* })}^{\top }{({{{{\bf{K}}}}}_{n,n}+{\xi }^{2}{{{{\bf{I}}}}}_{n})}^{-1}({{{\bf{y}}}}-{{{{\boldsymbol{\mu }}}}}_{0}({{{{\bf{X}}}}}_{n}))$$

(9)

where ${\bf{k}}_n(x^*) = ({\bf{K}}(x_1, x^*), \ldots, {\bf{K}}(x_n, x^*))^{\top}$, i.e. the vector ${\bf{K}}_{n,*}$ in Eq. (8). Moreover, the posterior variance at the new energy point x^* is obtained as

$${\hat{\sigma }}^{2}({x}^{* })={{{\bf{K}}}}({x}^{* },{x}^{* })-{{{{\bf{k}}}}}_{n}{({x}^{* })}^{\top }{({{{{\bf{K}}}}}_{n,n}+{\xi }^{2}{{{{\bf{I}}}}}_{n})}^{-1}{{{{\bf{k}}}}}_{n}({x}^{* }).$$

(10)
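Equations (9) and (10) can be evaluated with a few lines of linear algebra. The following is a minimal NumPy sketch rather than the authors' code: the squared-exponential kernel `sq_exp_kernel`, its parameterisation, the constant prior mean and all function and argument names are assumptions made for illustration (the covariance function actually used is the one given by Eq. (1)).

```python
import numpy as np


def sq_exp_kernel(xa, xb, amplitude=1.0, bandwidth=1.0):
    """Squared-exponential covariance K(x, x') (assumed form, for illustration only)."""
    xa = np.asarray(xa, dtype=float)
    xb = np.asarray(xb, dtype=float)
    diff = xa[:, None] - xb[None, :]
    return amplitude * np.exp(-0.5 * (diff / bandwidth) ** 2)


def gp_posterior(x_obs, y_obs, x_new, noise_var,
                 amplitude=1.0, bandwidth=1.0, prior_mean=0.0):
    """Posterior mean (Eq. 9) and variance (Eq. 10) at the points x_new."""
    x_obs = np.asarray(x_obs, dtype=float)
    y_obs = np.asarray(y_obs, dtype=float)
    x_new = np.atleast_1d(np.asarray(x_new, dtype=float))

    # K_{n,n} + xi^2 I_n
    gram = sq_exp_kernel(x_obs, x_obs, amplitude, bandwidth)
    gram = gram + noise_var * np.eye(x_obs.size)
    # k_n(x*) for every new point, stacked as columns: shape (n, m)
    k_n = sq_exp_kernel(x_obs, x_new, amplitude, bandwidth)

    # Solve linear systems instead of forming the explicit inverse
    weights = np.linalg.solve(gram, y_obs - prior_mean)
    mean = prior_mean + k_n.T @ weights

    solved = np.linalg.solve(gram, k_n)
    prior_var = amplitude  # K(x*, x*) for this particular kernel
    var = prior_var - np.sum(k_n * solved, axis=0)
    return mean, np.maximum(var, 0.0)  # clip tiny negative round-off


if __name__ == "__main__":
    mean, var = gp_posterior([0.0, 1.0, 2.0], [0.0, 1.0, 0.0],
                             [1.0, 100.0], noise_var=1e-8)
    print(mean, var)
```

In this toy call, the first query point coincides with an observed point, so the posterior mean stays close to the observed value and the variance is close to the noise level; the second query point is far from all observations, so the posterior reverts to the prior mean and prior variance.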

The acquisition function is defined as follows:

$$a\left(\hat{\sigma },\hat{\mu },t\right)=\frac{\hat{\sigma }({x}^{* })}{{\sigma }_{\max }}+\frac{1}{t}\sqrt{\frac{\hat{\mu }({x}^{* })}{{\mu }_{\max }}},$$

(11)

where $\sigma_{\max}$ and $\mu_{\max}$ are the maximum standard deviation and the maximum mean, respectively, over the consecutive measurements at times 1, …, t.
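As an illustration of Eq. (11), the sketch below scores a set of candidate energy points and picks the one with the largest acquisition value. It is a hedged example, not the authors' implementation: the function names, the clipping of negative predicted means and the argmax selection rule are assumptions, and `sigma_max` and `mu_max` are simply passed in by the caller.

```python
import numpy as np


def acquisition(post_std, post_mean, t, sigma_max, mu_max):
    """Eq. (11): sigma/sigma_max + (1/t) * sqrt(mu/mu_max), per candidate point."""
    post_std = np.asarray(post_std, dtype=float)
    post_mean = np.asarray(post_mean, dtype=float)
    # Assumption: negative predicted intensities are clipped to zero so that
    # the square root stays real; the text does not say how this is handled.
    ratio = np.clip(post_mean / mu_max, 0.0, None)
    return post_std / sigma_max + np.sqrt(ratio) / t


def next_index(post_std, post_mean, t, sigma_max, mu_max):
    """Index of the candidate with the largest acquisition value (assumed rule)."""
    scores = acquisition(post_std, post_mean, t, sigma_max, mu_max)
    return int(np.argmax(scores))


if __name__ == "__main__":
    std = [0.1, 0.5, 0.2]
    mean = [1.0, 0.0, 4.0]
    for t in (1, 10):
        print(t, next_index(std, mean, t, sigma_max=0.5, mu_max=4.0))
```

Because the second term is weighted by 1/t, high predicted intensity matters most in the early steps, whereas the uncertainty term dominates as t grows.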

The amplitude θ₁ and the bandwidth θ₂ of the covariance function ${\bf{K}}(x, x^{\prime})$ (Eq. (1)) and the noise variance $\xi^2$ are determined in advance by maximising the marginal likelihood of the Gaussian process model for a similar dataset measured in the past with the same device.
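The marginal-likelihood step can be sketched as follows. This is an illustrative example under stated assumptions: the squared-exponential kernel stands in for Eq. (1), the prior mean is a constant, and a coarse grid search replaces whatever optimiser the authors used; all names are hypothetical.

```python
import itertools

import numpy as np


def sq_exp_kernel(xa, xb, amplitude, bandwidth):
    """Squared-exponential covariance (assumed stand-in for Eq. (1))."""
    xa = np.asarray(xa, dtype=float)
    xb = np.asarray(xb, dtype=float)
    diff = xa[:, None] - xb[None, :]
    return amplitude * np.exp(-0.5 * (diff / bandwidth) ** 2)


def log_marginal_likelihood(x, y, amplitude, bandwidth, noise_var, prior_mean=0.0):
    """log p(y | X, theta) for a Gaussian process with Gaussian noise."""
    x = np.asarray(x, dtype=float)
    resid = np.asarray(y, dtype=float) - prior_mean
    gram = sq_exp_kernel(x, x, amplitude, bandwidth) + noise_var * np.eye(x.size)
    sign, logdet = np.linalg.slogdet(gram)
    if sign <= 0:
        return -np.inf  # covariance is not positive definite
    quad = resid @ np.linalg.solve(gram, resid)
    return -0.5 * (quad + logdet + x.size * np.log(2.0 * np.pi))


def fit_hyperparameters(x, y, amplitudes, bandwidths, noise_vars):
    """Return the (amplitude, bandwidth, noise_var) grid point with the highest likelihood."""
    grid = itertools.product(amplitudes, bandwidths, noise_vars)
    return max(grid, key=lambda p: log_marginal_likelihood(x, y, *p))


if __name__ == "__main__":
    # Synthetic stand-in for a previously measured spectrum
    x_old = np.linspace(0.0, 1.0, 20)
    y_old = np.sin(2.0 * np.pi * x_old)
    print(fit_hyperparameters(x_old, y_old, [0.5, 1.0], [0.01, 0.2], [1e-4, 1.0]))
```

A gradient-based optimiser would normally replace the grid search; the grid keeps the sketch dependent on NumPy only.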

Both the simulated and experimental XAS spectra used in the study have 2000 data points in total. In the present implementation of AL, the data points were divided into three parts: the initial sampling, the pool data and the test data for evaluating the generalisation error, with sizes of 10, 900 and 1090, respectively. Therefore, we assumed that the total number of energy points measured in the conventional DoE is the same as the size of the pool data (N = 900).
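The partition into 10 initial, 900 pool and 1090 test points can be reproduced schematically as below. The random assignment, the seed and the function name are assumptions for illustration; the text does not state how the points were allocated to the three sets.

```python
import numpy as np


def split_indices(n_total=2000, n_init=10, n_pool=900, seed=0):
    """Partition point indices into initial, pool and test sets (random split assumed)."""
    if n_init + n_pool > n_total:
        raise ValueError("initial and pool sizes exceed the number of data points")
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n_total)
    init = np.sort(perm[:n_init])
    pool = np.sort(perm[n_init:n_init + n_pool])
    test = np.sort(perm[n_init + n_pool:])  # remaining points evaluate the error
    return init, pool, test


if __name__ == "__main__":
    init, pool, test = split_indices()
    print(init.size, pool.size, test.size)
```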

Data availability

The data obtained in this study are available from the authors upon reasonable request.

Code availability

The computer codes developed in this study are available from the authors upon reasonable request.

References

Rajan, K. Materials informatics. Mater. Today 8, 38–45 (2005).
Mueller, T., Kusne, A. G. & Ramprasad, R. In Reviews in Computational Chemistry Vol. 29 (eds Parrill, A. L. & Lipkowitz, K. B.) (Wiley, 2016).
Lookman, T., Alexander, F. J. & Rajan, K. Information Science for Materials Discovery and Design (Springer, 2016).
Agrawal, A. & Choudhary, A. N. Perspective: Materials informatics and big data: realisation of the ‘fourth paradigm’ of science in materials science. APL Mater. 4, 053208 (2016).
Schleder, G. R., Padilha, A. C. M., Acosta, C. M., Costa, M. & Fazzio, A. From DFT to machine learning: recent approaches to materials science–a review. J. Phys. Mater. 2, 032001 (2019).
Agrawal, A. & Choudhary, A. Deep materials informatics: applications of deep learning in materials science. MRS Commun. 9, 779–792 (2019).
Schmidt, J., Marques, M. R. G., Botti, S. & Marques, M. A. L. Recent advances and applications of machine learning in solid state materials science. npj Comput. Mater. 5, 83 (2019).
Yamawaki, M., Ohnishi, M., Ju, S. & Shiomi, J. Multifunctional structural design of graphene thermoelectrics by Bayesian optimisation. Sci. Adv. 4, eaar4192 (2018).
Iwasaki, Y. et al. Machine-learning guided discovery of a new thermoelectric material. Sci. Rep. 9, 2751 (2019).
Iwasaki, Y. et al. Identification of advanced spin-driven thermoelectric materials via interpretable machine learning. npj Comput. Mater. 5, 103 (2019).
Chen, L. et al. Frequency-dependent dielectric constant prediction of polymers using machine learning. npj Comput. Mater. 6, 61 (2020).
Zhong, M. et al. Accelerated discovery of CO₂ electrocatalysts using active machine learning. Nature 581, 178–183 (2020).
Kusne, A. G. et al. On-the-fly closed-loop materials discovery via Bayesian active learning. Nat. Commun. 11, 5966 (2020).
Liu, P. et al. Machine learning assisted design of γ’-strengthened Co-base superalloys with multi-performance optimization. npj Comput. Mater. 6, 62 (2020).
Mao, Y., He, Q. & Zhao, X. Designing complex architectured materials with generative adversarial networks. Sci. Adv. 6, eaaz4169 (2020).
Curtarolo, S. et al. AFLOWLIB.ORG: a distributed materials property repository from high-throughput ab initio calculations. Comput. Mater. Sci. 58, 227–235 (2012).
Jain, A. et al. Commentary: The Materials Project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Saal, J. E., Kirklin, S., Aykol, M., Meredig, B. & Wolverton, C. Materials design and discovery with high-throughput density functional theory: The Open Quantum Materials Database (OQMD). JOM 65, 1501–1509 (2013).
Ueno, T. et al. Adaptive design of an X-ray magnetic circular dichroism spectroscopy experiment with Gaussian process modelling. npj Comput. Mater. 4, 4 (2018).
Wakabayashi, Y. K. et al. Improved adaptive sampling method utilizing Gaussian process regression for prediction of spectral peak structures. Appl. Phys. Express 11, 112401 (2018).
Saito, K. et al. Accelerating small-angle scattering experiments on anisotropic samples using kernel density estimation. Sci. Rep. 9, 1526 (2019).
Suzuki, Y., Kotsugi, M., Hino, H. & Ono, K. Automated estimation of materials parameter from X-ray absorption and electron energy-loss spectra with similarity measures. npj Comput. Mater. 5, 39 (2019).
Matsumura, T., Nagamura, N., Akaho, S., Nagata, K. & Ando, Y. Spectrum adapted expectation-maximization algorithm for high-throughput peak shift analysis. Sci. Technol. Adv. Mater. 20, 733–745 (2019).
Shinotsuka, H. et al. Development of spectral decomposition based on Bayesian information criterion with estimation of confidence interval. Sci. Technol. Adv. Mater. 21, 402–419 (2020).
Ozaki, Y. et al. Automated crystal structure analysis based on blackbox optimisation. npj Comput. Mater. 6, 75 (2020).
Suzuki, Y. et al. Symmetry prediction and knowledge discovery from X-ray diffraction patterns using an interpretable machine learning approach. Sci. Rep. 10, 21790 (2020).
Hino, H. Active Learning: problem settings and recent developments. Preprint at https://arxiv.org/abs/2012.04225 (2020).
Lookman, T. et al. Active learning in materials science with emphasis on adaptive sampling using uncertainties for targeted design. npj Comput. Mater. 5, 21 (2019).
Terayama, K. et al. Efficient construction method for phase diagrams using uncertainty sampling. Phys. Rev. Mater. 3, 033802 (2019).
Tian, Y. et al. Role of uncertainty estimation in accelerating materials development via active learning. J. Appl. Phys. 128, 014103 (2020).
del Rosario, Z. et al. Assessing the frontier: Active learning, model accuracy, and multi-objective candidate discovery and optimization. J. Chem. Phys. 153, 024112 (2020).
Pestourie, R. et al. Active learning of deep surrogates for PDEs: application to metasurface design. npj Comput. Mater. 6, 164 (2020).
Tian, Y. et al. Efficient estimation of material property curves and surfaces via active learning. Phys. Rev. Mater. 5, 013802 (2021).
Xie, Y. et al. Bayesian force fields from active learning for simulation of inter-dimensional transformation of stanene. npj Comput. Mater. 7, 40 (2021).
van der Laan, G. & Figueroa, A. I. X-ray magnetic circular dichroism – A versatile tool to study magnetism. Coord. Chem. Rev. 277–278, 95–129 (2014).
Thole, B. T., Carra, P., Sette, F. & van der Laan, G. X-Ray circular dichroism as a probe of orbital magnetism. Phys. Rev. Lett. 68, 1943–1946 (1992).
Carra, P., Thole, B. T., Altarelli, M. & Wang, X. X-ray circular dichroism and local magnetic fields. Phys. Rev. Lett. 70, 694–697 (1993).
Chen, C. T. et al. Experimental confirmation of the X-ray magnetic circular dichroism sum rules for iron and cobalt. Phys. Rev. Lett. 75, 152–155 (1995).
Rasmussen, C. E. & Williams, C. K. I. Gaussian Processes for Machine Learning (MIT Press, 2006).
Schohn, G. & Cohn, D. Less is more: Active Learning with support vector machines. In Proc. of the Seventeenth International Conference on Machine Learning (ICML2000) (ed. Langley, P.) (Morgan Kaufmann Publishers Inc, 2000).
Krause, A. & Guestrin, C. Nonmyopic Active Learning of Gaussian processes: an exploration-exploitation approach. In Proc. of the 24th International Conference on Machine Learning (ICML2007) (Oregon State University, Corvallis, 2007).
Altschuler, M. & Bloodgood, M. Stopping Active Learning Based on Predicted Change of F Measure for Text Classification. IEEE 13th International Conference on Semantic Computing (ICSC2019) (Newport Beach Marriot Bayview, Newport Beach, CA, 2019).
Balachandran, P. V. et al. Materials Discovery and Design (Springer, 2018).
Seko, A., Maekawa, T., Tsuda, K. & Tanaka, I. Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single- and binary-component solids. Phys. Rev. B 89, 054303 (2014).
Ishibashi, H. & Hino, H. Stopping criterion for active learning based on error stability. Preprint at https://arxiv.org/abs/2104.01836 (2021).
Ishibashi, H. & Hino, H. Stopping criterion for active learning based on deterministic generalization bounds. In Proc. of the Artificial Intelligence and Statistics (AISTATS2020) (AISTATS 2020, 2020).
Corless, M. R., Gonnet, G. H., Hare, D. E. G., Jeffrey, D. J. & Knuth, D. E. On the Lambert W function. Adv. Comput. Math. 5, 329–359 (1996).
Kullback, S. & Leibler, R. A. On information and sufficiency. Ann. Math. Stat. 22, 79–86 (1951).
Jo, T. & Sawatzky, G. A. Ground state of ferromagnetic nickel and magnetic circular dichroism in Ni 2p core x-ray-absorption spectroscopy. Phys. Rev. B 43, 8771–8774 (1991).
Stavitski, E. & de Groot, F. M. F. The CTM4XAS program for EELS and XAS spectral shape analysis of transition metal L edges. Micron 41, 687–694 (2010).
Fukushi, K. et al. Photon Factory BL-19: a new STXM beamline with wide energy range for aquaplanetology. In Asteroid Science in the Age of Hayabusa2 and OSIRIS-REx (Tucson, 2019).


Acknowledgements

This work was supported by JST-Mirai Program Grant Numbers JPMJMI19G1 and JPMJMI21G2. T.U. acknowledges the support of JSPS KAKENHI Grant Number JP18K13984 and QST President’s Strategic Grant (Exploratory Research). H.H. acknowledges the support of NEDO Grant Number JPNP18002 and JST CREST Grant Number JPMJCR1761. This work was carried out under the ISM Cooperative Research Program (H30-J-4302 and 2019-ISMCRP-4206). The XAS experiment was performed under the approval of the Photon Factory Program Advisory Committee (Proposal No. 2018MP001). The authors thank Dr. Yasuo Takeichi for the support of the experiments at the Photon Factory.

Author information

Authors and Affiliations

Synchrotron Radiation Research Center, Kansai Photon Science Institute, Quantum Beam Science Research Directorate, National Institutes for Quantum and Radiological Science and Technology, Sayo, Hyogo, Japan
Tetsuro Ueno
Department of Human Intelligence Systems, Graduate School of Life Science and Systems Engineering, Kyushu Institute of Technology, Kitakyushu, Fukuoka, Japan
Hideaki Ishibashi
The Institute of Statistical Mathematics, Research Organization of Information and Systems, Tachikawa, Tokyo, Japan
Hideitsu Hino
Department of Applied Physics, Graduate School of Engineering, Osaka University, Suita, Osaka, Japan
Kanta Ono
Center for Integrative Quantum Beam Science, Institute of Materials Structure Science, High Energy Accelerator Research Organization, Tsukuba, Ibaraki, Japan
Kanta Ono

Contributions

T.U., H.H. and K.O. conceived the project. T.U., H.I. and H.H. wrote the manuscript. H.I. and H.H. performed the computation of the AL. H.I. and H.H. wrote the computer codes. T.U. performed the XAS experiment. All authors discussed the results and reviewed the manuscript.

Corresponding author

Correspondence to Tetsuro Ueno.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Movie 1

Supplementary Movie 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Ueno, T., Ishibashi, H., Hino, H. et al. Automated stopping criterion for spectral measurements with active learning. npj Comput Mater 7, 139 (2021). https://doi.org/10.1038/s41524-021-00606-5


Received: 07 April 2021
Accepted: 05 August 2021
Published: 25 August 2021
DOI: https://doi.org/10.1038/s41524-021-00606-5
