Modelling atomic and nanoscale structure in the silicon–oxygen system through active machine learning

Erhard, Linus C.; Rohrer, Jochen; Albe, Karsten; Deringer, Volker L.

doi:10.1038/s41467-024-45840-9

Article
Open access
Published: 02 March 2024

Nature Communications volume 15, Article number: 1927 (2024) Cite this article

Abstract

Silicon–oxygen compounds are among the most important ones in the natural sciences, occurring as building blocks in minerals and being used in semiconductors and catalysis. Beyond the well-known silicon dioxide, there are phases with different stoichiometric composition and nanostructured composites. One of the key challenges in understanding the Si–O system is therefore to accurately account for its nanoscale heterogeneity beyond the length scale of individual atoms. Here we show that a unified computational description of the full Si–O system is indeed possible, based on atomistic machine learning coupled to an active-learning workflow. We showcase applications to very-high-pressure silica, to surfaces and aerogels, and to the structure of amorphous silicon monoxide. In a wider context, our work illustrates how structural complexity in functional materials beyond the atomic and few-nanometre length scales can be captured with active machine learning.

Introduction

Elemental silicon and its oxide, silica (SiO₂), are widely studied building blocks of the world around us¹: from minerals in geology to silicon-based computing architectures; thin-film solar cells in which amorphous silicon is the active material²; or zeolite catalysts based on the SiO₂ parent composition³. Some of these materials have a single phase and are precisely defined on the atomic scale, whereas others show longer-ranging, hierarchical structures and varying degrees of disorder. For example, silica aerogels contain pores with sizes of 5–100 nm, leading to very low thermal conductivity and making aerogels promising candidates for thermal insulation⁴. Under pressure, SiO₂ shows amorphous–amorphous transitions to structures exceeding sixfold coordination⁵, crystallisation from the amorphous phase under shock compression⁶, and conversely the formation of complex disordered phases from crystalline SiO₂⁷. Beyond fundamental studies, there is much technological importance in silicon–oxygen phases with nanoscale structure—the interface between Si and SiO₂ is essential in silicon metal-oxide semiconductors, and defects at this interface have been investigated for decades^8,9,10.

A material in the binary silicon–oxygen system which is in fact dominated by such interfaces is the so-called silicon monoxide (SiO). The structure of SiO was debated for a long time^11,12; today, it is understood to be a nanoscopic mixture of amorphous Si and SiO₂^13,14,15. Initial applications of SiO have been in protective layers for mirrors¹⁶ or dielectrics for thin-film capacitors¹⁷; more recently, the same material has emerged as a promising anode material for lithium-ion batteries^18,19. However, to be able to fully exploit SiO in next-generation energy-storage solutions, it would be valuable to understand the features of the nanoscopic structure on an atomistic level.

To develop atomic-scale models of complex materials such as SiO, molecular-dynamics (MD) computer simulations have become a central research tool. While there are now plenty of interatomic potentials for silicon^20,21,22 and silica^23,24,25, the number of potentials for the mixed (i.e., full binary) system is limited due to its chemical complexity^26,27,28,29. Alongside established, empirically fitted potentials based on physical models, alternatives based on large datasets and machine learning (ML) have emerged in recent years. These models have been fitted for silicon³⁰ as well as silica³¹ and also for the more complex silica–water system³². ML potentials promise the accuracy of first-principles methods such as density-functional theory (DFT) for a small fraction of the cost. ML potentials are now firmly established in the field of computational materials science and their application to homogeneous phases has been well documented.

In the present work, we describe a unified computational model for the Si–O system that we have obtained with the help of an active-learning scheme for local environments. We extract representative atomic environments from large-scale simulations and embed them in a melt-quenched amorphous matrix, allowing us to sample representative environments for the fitting of accurate ML potentials. Our final model shows high accuracy across a wide configurational space including high-pressure silica, silica surfaces, and mixtures of silica and silicon. We showcase the usefulness of the method by creating fully atomistically resolved, 10-nm-scale structure models of amorphous and partially crystalline SiO.

Results

Active learning for nanoscale structure

We have developed a comprehensive dataset of atomistic structures and quantum-mechanical reference data for the binary Si–O system, as well as an interatomic potential fitted to that database in the atomic cluster expansion (ACE) framework^33,34,35. We initialised the protocol with two existing datasets for silicon (Bartók et al.³⁰) and silica (Erhard et al.³¹), respectively, and we then gradually explored the relevant configurational space using the active-learning workflow illustrated in Fig. 1. Quantum-mechanical reference (“training”) data for energies and forces were obtained with the strongly constrained and appropriately normed (SCAN)³⁶ exchange–correlation functional for DFT, which shows good performance for elemental silicon³⁷ and the various silica polymorphs³¹.

**Fig. 1: An active-learning workflow for complex atomistic structures.**

Our active-learning workflow follows three main tracks: high-pressure bulk silica, silica surfaces, and non-stoichiometric SiO_x systems (Fig. 1a). The individual tracks are kept separate during initial training, i.e., they do not share their newly generated training data; however, in the end, all structures are merged into one comprehensive database.

The individual tracks are further divided into stages. In the first stage, we added initial structures, e.g., for crystalline high-pressure polymorphs or surface models. In the next stage, we fitted moment tensor potential (MTP) models³⁸ to the database and used these MTPs to explore configurational space in MD and to identify new structures by active learning³⁹. Energies and forces for new structures were computed with DFT and added to the database. This process was iterated until the extrapolation threshold (Supplementary Note 1B) was no longer exceeded during the MD trajectories.

The third stage, highlighted in red in Fig. 1a, is the most important part of our workflow, and is based on large-scale simulations in each track. We used 2–4 MTPs trained on the same database to estimate a per-atom committee error, as is commonly done for neural-network potentials⁴⁰. For atoms with high uncertainty (Supplementary Note 1C), we extracted the environments into smaller, “DFT-sized” cells by an approach that we call amorphous matrix embedding (Fig. 1b). After identifying an atom with high uncertainty, we cut out a cube containing the corresponding environment of the atom. This cube has a size which is feasible for performing DFT computations; it is generally chosen larger than twice the cut-off of the potential. After extracting the cube, the atoms within the cut-off radius of the atom with high uncertainty are kept fixed. The remaining structure is melted in an ML-MD simulation to create an amorphous matrix and smooth boundaries. Details of the procedure can be found in Supplementary Note 1C. We note very recent, related approaches to isolating fragments for active learning based on minimising the uncertainty for boundary atoms^41,42.
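The two steps described above, flagging atoms by committee disagreement and cutting out a cube around a flagged atom, can be sketched as follows. This is a minimal illustration, not the workflow's actual implementation: the uncertainty measure (spread of the force predictions across committee members), the orthorhombic cell, and the function names are assumptions made here; the definitions actually used are given in Supplementary Note 1C. The subsequent melting of the non-fixed atoms is not shown.

```python
import numpy as np

def committee_force_uncertainty(forces):
    """Per-atom disagreement of a committee of models.

    forces: array of shape (n_models, n_atoms, 3).
    Returns an array of shape (n_atoms,) holding the norm of the
    per-component standard deviation across committee members
    (one possible uncertainty measure; an assumption of this sketch).
    """
    forces = np.asarray(forces, dtype=float)
    std = forces.std(axis=0)             # shape (n_atoms, 3)
    return np.linalg.norm(std, axis=1)   # shape (n_atoms,)

def extract_cube(positions, cell_lengths, centre_index, cube_length, r_cut):
    """Cut a cube around one atom out of an orthorhombic periodic cell.

    Returns (indices, new_positions, fixed_mask):
      indices       - indices of the atoms inside the cube
      new_positions - their coordinates in the cube frame (0 .. cube_length)
      fixed_mask    - True for atoms within r_cut of the central atom,
                      i.e. the atoms that would be kept fixed
    cube_length would typically exceed 2 * r_cut.
    """
    positions = np.asarray(positions, dtype=float)
    cell_lengths = np.asarray(cell_lengths, dtype=float)
    # Minimum-image displacement of every atom from the central atom
    delta = positions - positions[centre_index]
    delta -= cell_lengths * np.round(delta / cell_lengths)
    in_cube = np.all(np.abs(delta) <= cube_length / 2.0, axis=1)
    indices = np.nonzero(in_cube)[0]
    fixed_mask = np.linalg.norm(delta[indices], axis=1) <= r_cut
    new_positions = delta[indices] + cube_length / 2.0
    return indices, new_positions, fixed_mask
```

Atoms whose uncertainty exceeds a chosen threshold would be passed as `centre_index`; the threshold itself is not specified here.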

The final database was obtained by merging the data of all tracks together, including some additional samples such as clusters and vacancies. This database contains 11,428 structures with a total of ≈1.3 million atoms (Supplementary Table 1). For validation, we held out 5% of these structures from training, selected at random.

Performance

The final potential is a complex non-linear ACE model, obtained by summation of one linear and seven non-linear ACE terms (Methods). This approach allows a more flexible description than just a linear or Finnis–Sinclair-like embedding, at only moderately higher computational expense. The resulting potential has a test-set root mean square error (RMSE) of 16.7 meV atom⁻¹ for energies and 306 meV Å⁻¹ for forces. These errors are averaged over the full dataset, however, and so they are not in themselves sufficient to characterise the quality of the potential. For example, they refer to a highly heterogeneous set of structures, with target energy values spanning more than 8 eV atom⁻¹, and a range of forces of 40 eV Å⁻¹ covered by the database. Furthermore, the numerical accuracy of the potential in certain parts of configurational space (e.g., crystalline polymorphs) is far more important than in others (e.g., liquid and amorphous structures).
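For reference, the two error measures quoted above can be computed as in the following minimal sketch. The function names and the input layout (one total energy per structure, one force array per structure) are assumptions made for illustration.

```python
import numpy as np

def energy_rmse_per_atom(e_pred, e_ref, n_atoms):
    """RMSE of total energies, normalised per atom (same unit as the input)."""
    diff = (np.asarray(e_pred, dtype=float) - np.asarray(e_ref, dtype=float))
    diff = diff / np.asarray(n_atoms, dtype=float)
    return float(np.sqrt(np.mean(diff ** 2)))

def force_rmse(f_pred, f_ref):
    """RMSE over all Cartesian force components of all structures.

    f_pred, f_ref: sequences of arrays with shape (n_atoms_i, 3).
    """
    diff = np.concatenate(
        [np.ravel(np.asarray(p, dtype=float) - np.asarray(r, dtype=float))
         for p, r in zip(f_pred, f_ref)]
    )
    return float(np.sqrt(np.mean(diff ** 2)))
```

Evaluating these functions separately on subsets of the test set (for example, crystalline versus amorphous structures) gives per-category errors rather than a single average over a heterogeneous dataset.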

In Table 1, we therefore show the performance of our model on different separate test sets. The complex non-linear ACE is compared to our previous silica GAP model described in ref. ³¹ (“SiO₂-GAP-22” in the following), and also to simpler ACE models fitted to the new database using linear and Finnis–Sinclair-like embeddings, respectively. Indeed, the complex non-linear ACE potential is the only one among the three which achieves comparable errors to SiO₂-GAP-22 for amorphous and crystalline silica structures. In contrast, for amorphous elemental silicon, mixed-stoichiometry, and high-pressure phases, the complex non-linear ACE is significantly more accurate than SiO₂-GAP-22, since these structures are not part of the GAP database. This table therefore indicates the main challenge – and its solution – in the present model compared to the previous GAP: both are highly accurate for crystalline (≈1 meV atom⁻¹) and bulk amorphous (≈5 meV atom⁻¹) SiO₂, but our ACE model caters to a much wider range of scenarios outside of the 1:2 stoichiometric composition.

Table 1 ML model performance

Figure 2 shows the phase diagram of SiO₂ calculated by thermodynamic integration^43,44 using the ACE potential compared to a CALPHAD phase diagram from the literature⁴⁵. The ACE and CALPHAD predictions agree well throughout, and for the boundary between quartz and coesite we observe almost quantitative agreement. In contrast, the cristobalite and tridymite phases seem to be over-stabilised. At 0 GPa, the melting point is notably overestimated (about 2400 K, compared to ≈2000 K experimentally⁴⁵); moreover, the phase stability regions of both phases are more extended than in the reference. To illustrate the sensitivity of the analysis to small errors in predicted energies, we added a fictitious energy penalty of 5 meV atom⁻¹ for cristobalite and tridymite (Supplementary Fig. 1a); in this case, the transition lines agree much better with the CALPHAD reference than before. Further numerical tests showed that the tridymite–cristobalite transition line, in particular, is strongly affected by small shifts in energy (Supplementary Fig. 1b–f). We thus conclude that the quantitative deviation seen in Fig. 2 is due to the inaccuracy of the underlying exchange–correlation functional, rather than indicating a shortcoming of the ACE approach. This is an example of the more general problem that any issues with the ground-truth method (e.g., numerical instabilities) will translate into the ML model.
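The sensitivity of a transition line to a small energy offset can be illustrated with a simple estimate. The following is a sketch under the assumption, made here for illustration only, that the enthalpy and entropy differences between two phases, ΔH and ΔS, are approximately independent of temperature near the transition at fixed pressure:

$$\Delta G(T)=\Delta H-T\,\Delta S,\qquad T_{\mathrm{t}}=\frac{\Delta H}{\Delta S},\qquad \delta T_{\mathrm{t}}=\frac{\delta E}{\Delta S},$$

where δE is an energy offset added to one of the two phases and δT_t is the resulting shift of the transition temperature. The shift scales inversely with ΔS, so a transition between two phases with a small entropy difference reacts strongly even to offsets of a few meV atom⁻¹.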

**Fig. 2: Temperature–pressure phase diagram of SiO₂.**

High-pressure structural transitions of SiO₂

Figure 3 characterises high-pressure properties of silica. In Fig. 3a, we show energy–volume curves of α-quartz, coesite, stishovite, α-PbO₂-type, and pyrite-type silica as predicted by our ACE model and compared with DFT data, with which they agree well. In addition, we tested the behaviour of the model for rosiaite-type silica, which was recently observed in experiment⁴⁶ and predicted theoretically⁴⁷ for direct compression of α-quartz. In contrast to the structures mentioned before, this particular polymorph is not part of the training database. Nevertheless, the ACE model reproduces DFT data for this structure similarly well as for the other polymorphs.

**Fig. 3: Silica at megabar pressures.**

Figure 3b shows an enthalpy–pressure diagram at 0 K. For lower pressures, there is a transition from α-quartz to coesite between 2.5 and 3.0 GPa, consistent with the predicted phase diagram (Fig. 2), followed by a transition to stishovite at 5.5–6.0 GPa. At higher pressures of ≈110 GPa, we observe the transition from stishovite to α-PbO₂-type silica. Experimentally, rather than stishovite (rutile type), the structurally closely related CaCl₂ (distorted rutile) type polymorph of silica is stable. The transition from CaCl₂- to α-PbO₂-type silica was observed at 120 GPa and 2400 K⁴⁸. Given that our enthalpy data correspond to a temperature of 0 K, both values agree well with each other. For the transition of α-PbO₂- to pyrite-type silica, our ACE model predicts a pressure of ≈246 GPa, in good agreement with the experimentally determined transition pressure of ≈260 GPa at 1800 K⁴⁹. Finally, rosiaite-type silica⁴⁶ is correctly identified as metastable over the pressure range studied.
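A transition pressure in an enthalpy–pressure diagram at 0 K is the pressure at which the enthalpies H = E + PV of two phases cross. The following minimal sketch locates such a crossing by linear interpolation on a pressure grid; the function names are hypothetical, and the energies and volumes would in practice come from the relaxed structures at each pressure.

```python
import numpy as np

EV_PER_A3_IN_GPA = 160.2176634  # 1 eV/Å^3 expressed in GPa

def enthalpy(energy_ev, volume_a3, pressure_gpa):
    """H = E + P V per atom, with E in eV, V in Å^3 and P in GPa."""
    return energy_ev + (pressure_gpa / EV_PER_A3_IN_GPA) * volume_a3

def transition_pressure(pressures_gpa, h_a, h_b):
    """First pressure at which H_a - H_b changes sign (linear interpolation).

    Returns None if the two enthalpy curves do not cross on the grid.
    """
    p = np.asarray(pressures_gpa, dtype=float)
    d = np.asarray(h_a, dtype=float) - np.asarray(h_b, dtype=float)
    crossings = np.nonzero(np.sign(d[:-1]) * np.sign(d[1:]) <= 0)[0]
    if crossings.size == 0:
        return None
    i = crossings[0]
    if d[i] == d[i + 1]:          # both differences are exactly zero
        return float(p[i])
    return float(p[i] - d[i] * (p[i + 1] - p[i]) / (d[i + 1] - d[i]))
```

With two idealised, incompressible toy phases (placeholder numbers, not data from this work), the crossing lies at (E_B − E_A)/(V_A − V_B):

```python
p = np.linspace(0.0, 10.0, 21)
h_a = enthalpy(0.00, 12.0, p)   # low-pressure toy phase
h_b = enthalpy(0.05, 10.0, p)   # denser toy phase, higher in energy
print(transition_pressure(p, h_a, h_b))
```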

Figure 3c shows the pressure evolution of the average coordination number (CN) of silicon atoms in amorphous silica, extracted from an MD simulation at room temperature and under isostatic pressure. The ACE results agree well with experiment up to about 50 GPa^5,50, and with ab initio MD⁵¹ results over the whole pressure range. The good agreement with experiment is particularly pronounced for the data from ref. ⁵⁰. Above 50 GPa, our model underestimates the average CN: at 175 GPa the experimental estimate is about 7; the ACE simulation predicts it to be 6. Importantly, this does not mean that there are no 7-fold coordinated environments, but there remain some 5-fold coordinated atoms as well, lowering the average (Fig. 3d). A possible reason for the good agreement with the ab initio result, but the deviation from experiment, might be the limited time scales in our simulations, which hinder a complete transition into higher-coordinated environments. Moreover, we note that computed X-ray-Raman spectra of the ab initio structures from ref. ⁵¹ are in good agreement with experiment indicating a lower CN. Other MD simulations also showed slightly lower CNs than the experimental values⁵². Figure 3e shows three different 7-fold coordinated environments extracted from the simulations.
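The average coordination number discussed above is obtained by counting, for each silicon atom, the oxygen atoms within a cut-off distance. A minimal sketch for an orthorhombic periodic cell is given below; the cut-off value is left as a parameter because the one used for Fig. 3c is not stated in this section, and the brute-force distance matrix is only suitable for small cells.

```python
import numpy as np

def coordination_numbers(pos_si, pos_o, cell_lengths, r_cut):
    """Number of O atoms within r_cut of each Si atom.

    Uses the minimum-image convention for an orthorhombic cell, which is
    valid as long as r_cut does not exceed half the shortest cell length.
    """
    pos_si = np.asarray(pos_si, dtype=float)
    pos_o = np.asarray(pos_o, dtype=float)
    cell_lengths = np.asarray(cell_lengths, dtype=float)
    delta = pos_o[None, :, :] - pos_si[:, None, :]          # (n_si, n_o, 3)
    delta -= cell_lengths * np.round(delta / cell_lengths)  # minimum image
    dist = np.linalg.norm(delta, axis=2)
    return np.sum(dist <= r_cut, axis=1)

def mean_coordination(pos_si, pos_o, cell_lengths, r_cut):
    """Average Si coordination number of one configuration."""
    return float(np.mean(coordination_numbers(pos_si, pos_o, cell_lengths, r_cut)))
```

A histogram of the per-atom values, rather than only the mean, shows how 5-, 6- and 7-fold environments coexist at a given pressure.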

A CN of 7 in amorphous silica might be surprising, since silicon is sixfold-coordinated in all crystalline silica polymorphs that are stable in this pressure range. However, the pyrite-type phase, which becomes thermodynamically stable at ≈240–260 GPa, contains silicon atoms with a 6+2-fold environment. A recent study found certain, but limited, similarities between these 7-fold environments in glassy and pyrite-type silica⁵².

In Supplementary Fig. 2, we show two additional structural fingerprints which have been commonly analysed in experiment: the position of the first sharp diffraction peak and the Si–O bond length. For both cases, our simulations show good agreement with experiment.

SiO₂ surfaces and aerogels

Figure 4 tests the ability of the potential to accurately predict surface energies. We begin with validation for different α-quartz surfaces: we created surface slab models, relaxed them with the ACE potential, and evaluated the energetics, and therefore the surface energy per area, with DFT single-point computations (Fig. 4a). The ACE results agree well with DFT, especially considering that the training database does not contain all the surface terminations shown. The ACE model is also able to predict the stability of the reconstructed α-quartz (001) surface (Supplementary Fig. 3a, b). Whilst these surface energies can be computed with DFT, realistic amorphous surface energies are much more difficult to obtain, due to the required system sizes. Therefore, Fig. 4b validates the potential on 125 small-scale surface structures of amorphous SiO₂, each containing 192 atoms. The surface models are created based on bulk structures from ref. ³¹; the latter had been generated in melt–quench simulations with different interatomic potentials and therefore span a range of energies. Regardless of the starting structure, the ACE model captures the surface energy for all slab models very well: the total RMSE is about 0.01 eV/Å², and only a slight underestimation compared to DFT is seen. Moreover, there are no clear outliers although the various surface energies indicate a large diversity of the surface structures.

The amorphous surfaces shown are already very complex, but in reality they are often not flat as here. They have curvature, for example when occurring inside pores, and such complex structures can no longer be directly validated with DFT. Figure 4c therefore shows how well atomic environments in various porous amorphous structures are covered by the dataset. These structures were prepared by straining amorphous structures at elevated temperatures to the desired density. To validate the performance of the potential on this model, we show the linear extrapolation grade according to the maxvol selection^41,53. An extrapolation grade above 1 corresponds to atomic environments that have not been covered in the training database. This does not mean that the potential is no longer reliable, as there is a certain range of more or less reliable extrapolation, but as the extrapolation grade increases, non-physical behaviour and failure of the potential become more and more likely³⁹. For all porous structures, regardless of density, we find that the maximum extrapolation value is less than 1. Thus, we observe no pronounced extrapolation in any of the cases considered, indicating an accurate description by the potential for a variety of curved surfaces. In Supplementary Fig. 3e, f, we demonstrate the effectiveness of the ACE potential in reconstructing artificial amorphous surfaces. To this end, we cut out a spherical cavity from an amorphous bulk structure. Upon heating, the number of incorrectly arranged atoms near the surface (<3 Å) decreases over time, indicating that the surface atoms are undergoing significant reconstructions.
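The idea behind a linear extrapolation grade can be sketched as follows, assuming the usual maxvol-type definition for linear models: the descriptor vector b of a new environment is written as a linear combination of the rows of a square active-set matrix A, and the grade is the largest absolute coefficient. The function name and the toy matrices are illustrative; how the active set is selected is not shown.

```python
import numpy as np

def extrapolation_grade(b, active_set):
    """Extrapolation grade gamma = max_j |c_j|, where b = c @ A.

    b:          descriptor vector of the new environment, shape (m,)
    active_set: square matrix A of shape (m, m); each row is the descriptor
                vector of one environment selected from the training data
    """
    a = np.asarray(active_set, dtype=float)
    b = np.asarray(b, dtype=float)
    c = np.linalg.solve(a.T, b)   # solves A^T c = b, i.e. c @ A = b
    return float(np.max(np.abs(c)))
```

An environment that coincides with a row of A has a grade of exactly 1, environments spanned with smaller coefficients are interpolated, and coefficients larger than 1 indicate extrapolation:

```python
A = np.array([[2.0, 0.0], [0.0, 1.0]])
print(extrapolation_grade([2.0, 0.0], A))   # a row of A
print(extrapolation_grade([1.0, 0.5], A))   # inside the covered region
print(extrapolation_grade([4.0, 0.0], A))   # outside the covered region
```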

Elemental silicon

Whilst our ACE model is designed for the binary Si–O system, we show in Table 2 the performance for diamond-like elemental silicon compared to both DFT and experiment. The bulk modulus is very well reproduced, whereas the vacancy formation energy is underestimated by about 30%. The experimental surface energies are well recovered by the ACE model, but this may be partly due to serendipity, because the SCAN ground-truth data show poorer quality (Table 2). Moreover, the potential captures the reconstruction of the diamond-type silicon (100) surface (Supplementary Fig. 3c, d). We also computed the linear thermal expansion coefficient of diamond-type silicon in the quasiharmonic approximation, finding almost perfect agreement between the ACE prediction and experiment; in particular, the unusual negative expansion coefficient below 130 K is reproduced (Supplementary Fig. 4). Finally, melt–quench simulations were performed to generate a-Si structures (Supplementary Fig. 5). The agreement with the experimental structure factor is as good as that for a GAP-18-generated structure from ref. ⁵⁴. In addition, we are able to achieve lower quenching rates with the ACE than with the GAP, and for quenching rates as low as 10¹⁰ K s⁻¹, we observed crystallisation.

Table 2 Properties of diamond-type silicon

Compared to Si-GAP-18³⁰, we observe higher errors with respect to the reference data (Table 2). This is not a fundamental shortcoming of ACE compared to GAP: it was already shown that it is possible to fit an ACE potential with similar numerical accuracy as Si-GAP-18 to the same training database³⁴. Instead, the lower accuracy might be caused by the extension of the database to a second element, compared to the Si-GAP-18 one, and its strong focus on the SiO₂ part of the configurational space. We assume that this causes, in turn, a less accurate description of the configurational space of the elemental species. Larger training databases might help to overcome this issue in the future. Due to the lower accuracy in reproducing the SCAN data, our potential has some shortcomings for higher-pressure structures: the bc8 phase is erroneously predicted to be stable at elevated pressure (Supplementary Fig. 6), and upon compressing a-Si we do not observe the eventual crystallisation that is described by Si-GAP-18 (Supplementary Fig. 7)⁵⁵. We emphasise that very-high-pressure silicon phases were not within the scope of the present work – instead, we focus on the accurate description of ambient-pressure silicon as a constituent part of mixed binary phases and nanostructures.

SiO and mixed silicon–silica systems

Whilst the results so far have served to demonstrate the usefulness of the approach – both in terms of development of datasets and the fitting within the ACE framework – we are now able to study an actual application problem. To this end, Fig. 5a shows structural models of SiO. Experimentally, amorphous SiO is obtained by deposition of SiO from the gas phase⁵⁶. In contrast, we created our models by melt–quench simulations. SiO phases are known to be metastable with respect to Si and SiO₂. For example, a recent DFT-based crystal-structure prediction study explored possible ordered phases of homogeneous SiO, and found that these are metastable compared to a mixture of crystalline Si and SiO₂⁵⁷. We verified that our ACE potential similarly reproduces the metastability of the ambient-pressure phases (Supplementary Fig. 9), and that it accurately predicts relevant Si–SiO₂ interface energies (Supplementary Fig. 10). In good agreement with these results, our melt-quenched structures show a clear segregation between a-Si-like (blue) and a-SiO₂-like regions (red). With decreasing quench rate, the number of silicon grains decreases while their size increases. Figure 5b shows the structure factor, S(q), for the structure quenched with 5 × 10¹² K s⁻¹ (data for the other structures are shown in Supplementary Fig. 11), which agrees best with the experimental data from ref. ¹⁵. Figure 5c shows the ratio of the volume of the silicon grains to the interface area. In the approximation of spherical particles, the grain diameter is d = 6 ⋅ V_Si,grains/A_interface. From this we can estimate average grain diameters between 24 and 54 Å for the tested quench rates. These grain diameters agree very well with transmission electron microscopy measurements, which indicated diameters of 30–40 Å¹⁴.
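The factor of 6 in the grain-diameter estimate follows directly from the geometry of spheres. Assuming, for illustration, N identical spherical grains of diameter d:

$$V_{\mathrm{Si,grains}}=N\,\frac{\pi d^{3}}{6},\qquad A_{\mathrm{interface}}=N\,\pi d^{2}\quad\Longrightarrow\quad \frac{V_{\mathrm{Si,grains}}}{A_{\mathrm{interface}}}=\frac{d}{6},\qquad d=\frac{6\,V_{\mathrm{Si,grains}}}{A_{\mathrm{interface}}}.$$

The number of grains cancels, so only the total grain volume and the total interface area are needed.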

**Fig. 5: Nanoscale segregation in amorphous silicon monoxide.**

Figure 5d shows the excess energies of the structures referenced to α-quartz and diamond-type silicon. The SiO structures were relaxed by optimisation of the cell size as well as the atomic positions at 0 K. As experimental reference, we show the standard enthalpy of formation of SiO⁵⁸. The structures generated using quench rates of 5 × 10¹² and 2 × 10¹² K s⁻¹ have energies comparable to experiment. Indeed, we can even create structures that are energetically more favourable than in experiment, noting again that our procedure to produce the structures deviates significantly from the experimental one.

But is this really an improvement compared to existing, empirically fitted interatomic potentials? We tested the Munetoh potential⁵⁹ and a charge-optimised many-body (COMB) potential²⁹ for the same procedure to generate structural models of SiO. The Munetoh potential yielded a homogeneous structure without observable segregation into silicon and SiO₂, and the resulting structure factor (Supplementary Fig. 11c) deviates strongly from experiment. For the COMB potential, we observed pore formation at elevated temperatures, finally resulting in a strongly increased simulation-cell size. Therefore, we only equilibrated our best-matching structure at room temperature and analysed the change in structure factor (Supplementary Fig. 11f): again, we observed a strong deviation from experiment, indicating that the structure is very different from the ACE model prediction.

Crystallisation of silicon in amorphous SiO

Silicon in silicon monoxide is experimentally known to crystallise above 850 °C⁶⁰. Figure 6 illustrates simulations of such crystallisation processes. The SiO structures shown in Fig. 5 were heated to 1400 K, causing the silicon-rich regions to melt while the silica matrix remained solid. The structures were then quenched to 1200 K within 20 ns (Fig. 6a). Through this cooling process, we noticed crystallisation in the silicon-rich regions of the SiO structure. Details are shown in Supplementary Fig. 12, indicating how crystallisation starts from two seeds, appearing shortly after each other and propagating throughout the structure. Before thermal treatment, all structures show nearly no sign of crystallinity, whereas afterwards, the structures with larger silicon grains do (Fig. 6c–f). This also affects the structure factor: those systems with no or only a small amount of crystallinity (quench rates: 1 × 10¹³ and 5 × 10¹² K s⁻¹) show structure factors that are still similar to the experimental structure factor of SiO (Fig. 6g), whereas those with larger amounts of crystalline silicon (1 × 10¹² and 2 × 10¹² K s⁻¹) show distinct S(q) peaks that indicate crystallinity (Fig. 6h). This comparison allows us to exclude the occurrence of large amounts of crystalline silicon in silicon monoxide samples, given the experimental S(q) (cf. Fig. 6g and ref. ¹⁵).

**Fig. 6: Crystallisation of silicon in silicon monoxide.**

Figure 6i shows that all structures are energetically more favourable after the thermal treatment. However, even though one might expect that the more strongly crystallised structures gain more energy, we observe no direct connection between the energy gain and the level of crystallinity after heat treatment. The reason is likely that for the fast-quenched structure (1 × 10¹³ K s⁻¹), the silicon-rich regions might not be perfectly arranged. Thermal treatment thus lowers the interface energy and the internal energy of the amorphous silicon region. For the crystallised structures there is, on one hand, an energy gain due to crystallisation but, on the other hand, an energy loss due to the higher interface energy between crystalline silicon and the silica matrix compared to that of a-Si and the silica matrix (cf. Supplementary Fig. 13). Based on these interface energies, we constructed a simple physically based model (Methods) to calculate the energy gain of a spherical silicon inclusion in an amorphous silica matrix by crystallisation. This energy gain is shown in Fig. 6j for interface energies taken from four pairs of manually constructed interfaces. The difference between the models is rather small, indicating that the minimum radius of a spherical inclusion has to be around 9 Å to be energetically favourable in a crystalline state. We note that the manually constructed models do not have the same interface orientations as in the SiO structures and are also not perfectly relaxed, causing an overestimation of the interface energies – as seen, for example, in the difference between the interface energies of the SiO structures and of the manually constructed a-Si–a-SiO₂ interface in Supplementary Fig. 13d. However, since for our model the difference between the interface energy between a-Si–a-SiO₂ and c-Si–a-SiO₂ is the only relevant quantity, we assume that these effects partially cancel.
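The competition between bulk energy gain and interface energy cost can be written down in a generic form. The following is a sketch under assumptions made here, not necessarily the exact expression used in Methods: Δe_V > 0 denotes the energy released per unit volume when amorphous silicon crystallises, and Δγ > 0 the interface-energy penalty of the c-Si–a-SiO₂ interface relative to the a-Si–a-SiO₂ interface. For a spherical inclusion of radius r, the energy change upon crystallisation is then

$$\Delta E(r)=-\frac{4}{3}\pi r^{3}\,\Delta e_{V}+4\pi r^{2}\,\Delta\gamma,\qquad \Delta E(r)<0\;\Longleftrightarrow\; r>r_{\min}=\frac{3\,\Delta\gamma}{\Delta e_{V}}.$$

Only the difference of the two interface energies enters, which is why a systematic overestimation of both interface energies would partially cancel.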

Discussion

Understanding the microscopic nature of interfaces and nanostructured matter is essential to advancing materials research. Here, we have presented an active-learning scheme that we term “amorphous matrix embedding” that can realistically represent environments from large-scale simulations in DFT-accessible cells, enabling fast and accurate atomistic modelling of heterogeneous materials. We used the approach to develop a general-purpose interatomic potential for binary Si–O phases with varied compositions that is able to describe the trifecta of modelling challenges in this material system: very-high-pressure phases (relevant to geology), surfaces (relevant to catalysis), and mixed stoichiometric compositions with nanoscale heterogeneity (relevant to battery systems).

Using the ACE approach, we observe a speed-up of about two orders of magnitude compared to the more established GAP framework. This makes it possible to access long time scales and large length scales with DFT-like accuracy. Of course, there are still some shortcomings of this potential, e.g., the lower accuracy for pure silicon compared to the state of the art – but this use-case is not the focus of our work, as there are already competitive GAP and ACE models available^30,34. In our case, the quality of the underlying meta-GGA data might lead to better performance than that of earlier ML potentials fitted with more economical GGA labels. The potential also underestimates the height of the first sharp diffraction peak (FSDP) of amorphous SiO₂ (Supplementary Fig. 14), as already observed for our earlier GAP model³¹. Further research is required to identify the origin of this underestimation.

We hope that our work, and the dataset and resources developed therein, will advance the modelling of porous silica nanostructures as well as of high-pressure silica. For the Si–SiO₂ interface, alternative interatomic potential models are scarce and the higher-quality potentials come with an expensive charge-equilibration term. Our tests showed that the ACE potential describes silicon monoxide in much closer agreement with experiment than existing empirical models. Additionally, we are able to generate crystallites in the SiO matrix as observed experimentally, showing that the ACE potential can be used for a wide range of applications in the Si–O system, including both ordered and disordered structures.

We view the present database and ML potential model as a starting point for wider-ranging studies in this important material system. In the future, higher accuracy for the mixed system might be achieved by using charge-equilibration schemes coupled with ML potentials⁶¹. However, this would come with much longer computing times as well as worse scaling for larger systems. Moreover, in the future, we will include lithium in the potential to investigate the battery performance of SiO on the atomistic scale.

Methods

Machine-learning potential fitting

We used two frameworks for fitting ML potential models. While constructing the reference database, we used Moment Tensor Potentials³⁸ with active learning⁵³ as implemented in the MLIP package³⁹. For the final potential fit, we used the nonlinear Atomic Cluster Expansion (ACE)^33,34 as implemented in PACEMAKER³⁵. For ACE, we tested a range of combinations of embeddings, and found the following to be suitable:

$$E_{i}=\phi+\sqrt{\phi}+\sum_{i}\phi^{f_{i}}, \tag{1}$$

with ϕ_i being atomic properties, which are expanded by the ACE basis functions (for details see ref. ³³). The exponents of the embeddings include fractional exponents and higher integer powers, ${f}_{i}\in \left\{1/8,\, 1/4,\, 3/8,\, 3/4,\, 7/8,\, 2\right\}$. We found that especially fractions between 0 and 1 improved the behaviour of the potential. This approach goes beyond the previously suggested linear embedding (only the first term) and Finnis–Sinclair (the first two terms) type embedding³⁴, and is referred to as “complex” embedding in Table 1. For the expansion of the atomic properties ϕ_i we used 600 basis functions with 5700 parameters. As radial basis we employed Bessel functions. A κ value of 0.01, which sets the ratio between the force and energy weights, was used during fitting. For optimisation we used the BFGS algorithm for 2000 steps.
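To make the structure of Eq. (1) concrete, the following minimal sketch evaluates the embedding literally for a single, non-negative scalar ϕ. This is an illustration only: how the fitting code treats negative values, and whether each term carries its own atomic property, is not specified here and is left to the actual implementation (refs. 33–35).

```python
import math

# Exponents of the additional non-linear terms, as listed in the text
EXTRA_EXPONENTS = (1 / 8, 1 / 4, 3 / 8, 3 / 4, 7 / 8, 2)

def embedding_energy(phi):
    """E = phi + sqrt(phi) + sum_i phi**f_i for a scalar phi >= 0.

    The first term is the linear embedding; the first two terms together
    correspond to the Finnis-Sinclair-like form; the remaining six terms
    complete the "complex" embedding (one linear plus seven non-linear terms).
    """
    if phi < 0:
        raise ValueError("this literal sketch only handles phi >= 0")
    return phi + math.sqrt(phi) + sum(phi ** f for f in EXTRA_EXPONENTS)
```

For ϕ = 1 every term equals 1, so the function returns 8, which reflects the eight terms of the embedding.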

DFT computations

All DFT computations were performed using VASP^62,63 and the projector augmented-wave method^64,65. We used the SCAN functional³⁶ with an energy cutoff of 900 eV and a k-spacing of 0.23 Å⁻¹. Surface calculations were performed with dipole corrections. We note that these convergence parameters are optimised for silica; however, we found them to be also well converged for mixed phases and for pure silicon structures. Only for very-high-pressure silicon allotropes would a denser k-point sampling provide a relevant advantage; however, since these are not in the scope of the present work, we neglect these inaccuracies.

MD workflows

Simulation protocols were implemented using the atomic simulation environment (ASE)⁶⁶ and the OVITO Python interface⁶⁷. While optimisation and small-cell MD were partially performed with ASE, large-scale MD and statics simulations were carried out using LAMMPS⁶⁸. The time step was 1 fs. For NVT simulations, we used a Nosé–Hoover thermostat with a temperature damping constant of 100 fs; for NPT simulations, we added a Nosé–Hoover barostat with a pressure damping constant of 1000 fs.

For quench simulations we used the same protocol as in ref. ³¹. This protocol starts with a randomisation part at 6000 K for 10 ps (NVT). Afterwards the temperature is immediately reduced to 4000 K and held there for 100 ps (NPT, zero external pressure). From there the melt is quenched with different quench rates to 300 K (NPT, zero external pressure). At this temperature the structure is equilibrated for another 10 ps. In the case of ‘hybrid’ simulations, these quenches were performed using the CHIK potential²⁵, and afterwards the structure was equilibrated for another 20 ps with the ACE potential.
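To make the durations of this protocol concrete, the short Python sketch below converts each stage into a number of MD steps for the 1 fs time step stated above. It is only a bookkeeping aid under the assumption of a linear temperature ramp; the function and key names are hypothetical and it is not the workflow script used for the simulations.

```python
TIME_STEP_FS = 1.0  # MD time step stated in the text


def quench_steps(t_start_k: float, t_end_k: float, rate_k_per_s: float) -> int:
    """Number of MD steps for a linear quench at the given rate (K/s)."""
    duration_s = (t_start_k - t_end_k) / rate_k_per_s
    duration_fs = duration_s * 1e15
    return round(duration_fs / TIME_STEP_FS)


def protocol_steps(rate_k_per_s: float) -> dict:
    """Steps per stage of the melt-quench protocol described above."""
    steps_per_ps = 1000.0 / TIME_STEP_FS
    return {
        "randomise_6000K_NVT": round(10 * steps_per_ps),
        "hold_4000K_NPT": round(100 * steps_per_ps),
        "quench_4000K_to_300K_NPT": quench_steps(4000.0, 300.0, rate_k_per_s),
        "equilibrate_300K": round(10 * steps_per_ps),
    }
```

For a quench rate of 10¹³ K s⁻¹, the quench stage spans 3700 K and therefore takes 370 ps, that is, 370,000 steps.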

As input for the compression simulations we generated amorphous structures using this quenching protocol with a quench rate of 10¹³ K s⁻¹ and only the ACE potential. The compression was performed under isostatic conditions. In each step, the pressure was initially increased by 1 GPa within 2.5 ps of simulation time, followed by equilibration over 2.5 ps at the new pressure. This procedure was iteratively repeated. Coordination numbers were determined after equilibration.
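The stepwise compression can be written down as a simple schedule. The Python sketch below generates the alternating ramp and equilibration stages; it assumes that the compression starts at 0 GPa, and the maximum pressure passed in by the caller is a free parameter of the sketch rather than a value taken from the text.

```python
def compression_schedule(p_max_gpa: int, ramp_ps: float = 2.5, hold_ps: float = 2.5):
    """Yield (t_start_ps, t_end_ps, p_start_gpa, p_end_gpa, stage) tuples.

    Each cycle raises the pressure by 1 GPa over `ramp_ps` and then
    equilibrates at the new pressure for `hold_ps`.
    """
    t = 0.0
    for p in range(p_max_gpa):
        yield (t, t + ramp_ps, p, p + 1, "ramp")
        t += ramp_ps
        yield (t, t + hold_ps, p + 1, p + 1, "equilibrate")
        t += hold_ps
```

Each 1 GPa increment therefore costs 5 ps of simulation time, and coordination numbers would be evaluated at the end of every "equilibrate" stage.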

The aerogel structures were created by a protocol similar to that in ref. ³¹. An initial structure was randomised at 6000 K for 10 ps, instantly cooled to 4000 K and kept there for 100 ps. From this temperature, the liquid was cooled to 300 K with a quench rate of 10¹³ K s⁻¹. During the equilibration at 4000 K and up to half of the quenching process, the cells were additionally extended to the desired density.

The mixed structures were created using the same protocol as in ref. ³¹ for producing amorphous structures. The volume of the silicon grains was determined within OVITO by creating bonds between silicon and oxygen atoms (cut-off: 2 Å) and deleting all atoms which have more than one such bond. This deletes the whole silica matrix. The interface area and volume are then determined by using the ConstructSurfaceMesh modifier (Gaussian density method, resolution: 50, radius scaling: 100%, isovalue: 0.1) on the remaining atoms.
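The bond-counting criterion used to strip the silica matrix can be illustrated without OVITO. The plain-Python sketch below is not the OVITO pipeline itself: it ignores periodic boundary conditions, uses a brute-force pair loop, and the function name is hypothetical. It only shows the selection rule, namely that atoms with more than one Si–O bond within the 2 Å cut-off are removed.

```python
import math


def silicon_grain_atoms(species, positions, cutoff=2.0):
    """Return indices of atoms kept after removing the silica matrix.

    An atom is deleted if it has more than one Si-O bond, where a bond is
    a Si-O pair closer than `cutoff` (in angstrom). Periodic images are
    ignored in this sketch.
    """
    n_bonds = [0] * len(species)
    for i in range(len(species)):
        for j in range(i + 1, len(species)):
            if {species[i], species[j]} == {"Si", "O"}:
                if math.dist(positions[i], positions[j]) < cutoff:
                    n_bonds[i] += 1
                    n_bonds[j] += 1
    return [i for i, n in enumerate(n_bonds) if n <= 1]
```

The surface mesh around the remaining atoms, and from it the grain volume and interface area, would then be constructed in a separate step.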

Phase diagram calculations

Thermodynamic integration was carried out as implemented in calphy^43,44. We used 50,000 equilibration steps, 800,000 switching steps for the switching to the Einstein crystal, as well as 300 steps/K for the thermodynamic integration to calculate the temperature dependence. Due to numerical issues, we fixed the spring constants of the Einstein crystal to 2 eV Å⁻² for oxygen and 4 eV Å⁻² for silicon. We carefully checked the influence of this constraint on the final results and found it to be negligible. More details on the phase diagram calculations can be found in Supplementary Note 2.

Structure factors

Faber–Ziman structure factors were obtained by summation of the Fourier transformations of the partial radial distribution functions calculated with OVITO. The corresponding partial structure factors were weighted by atomic form factors taken from ref. ⁶⁹. For the high-pressure structures, we used a cut-off radius of 20 Å for the radial distribution function, and analysed a single snapshot (without time averaging). For the SiO structures, we used a cut-off of 80 Å and an average over 10 snapshots.
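As a sketch of the weighting step, the Python function below combines partial structure factors at a single q value into a total structure factor using the common Faber–Ziman convention, S(q) = 1 + Σ_ab c_a c_b f_a f_b [S_ab(q) − 1] / (Σ_a c_a f_a)². This normalisation is an assumption of the sketch, since the text states only that the partials were weighted by atomic form factors; the Fourier transformation of the radial distribution functions is not shown, and the function name is hypothetical.

```python
def faber_ziman_total(conc, form, partial_s):
    """Combine partial structure factors S_ab(q) at one q value.

    conc:      {"Si": c_Si, "O": c_O} atomic fractions (summing to 1)
    form:      {"Si": f_Si(q), "O": f_O(q)} form factors at this q
    partial_s: {("Si", "Si"): S, ("Si", "O"): S, ("O", "O"): S}
    """
    mean_f = sum(conc[a] * form[a] for a in conc)
    total = 0.0
    for a in conc:
        for b in conc:
            # Partials are symmetric, so accept either key order.
            s_ab = partial_s.get((a, b), partial_s.get((b, a)))
            total += conc[a] * conc[b] * form[a] * form[b] * (s_ab - 1.0)
    return 1.0 + total / mean_f ** 2
```

In practice the function would be called once per q value, with q-dependent form factors.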

Surface energies

In these calculations we consider only stoichiometric slabs. The surface energies, γ, were calculated as

$$\gamma=\frac{E_{\rm{slab}}-N\cdot E_{\rm{ref}}}{A},$$

(2)

where A is the total surface area (at the top and bottom of the slab combined), N is the number of particles in the slab, E_ref is the bulk reference energy and E_slab is the potential energy of the slab. The slab energies for Table 2, Fig. 4a (relaxed), and Supplementary Fig. 3 have been calculated for DFT- and ACE-relaxed structures. For Fig. 4b, we used the ACE-relaxed structures also to determine the DFT single-point surface energy. The reference energy for the α-quartz surface is the energy of the optimised α-quartz unit cell per atom, and the reference energy for the diamond surface is obtained for the optimised diamond-type unit cell. The reference energy of the amorphous sample is the bulk energy of the same relaxed amorphous structure without surfaces.
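Eq. (2) translates directly into a few lines of Python. The sketch below assumes energies in eV and areas in Å² and converts the result to J m⁻²; the choice of output unit and the function name are assumptions made for illustration.

```python
EV_PER_A2_TO_J_PER_M2 = 16.02176634  # 1 eV/A^2 expressed in J/m^2


def surface_energy(e_slab_ev, n_atoms, e_ref_ev_per_atom, total_area_a2):
    """Eq. (2): gamma = (E_slab - N * E_ref) / A, returned in J/m^2.

    `total_area_a2` is the combined area of the top and bottom surfaces.
    """
    gamma_ev_per_a2 = (e_slab_ev - n_atoms * e_ref_ev_per_atom) / total_area_a2
    return gamma_ev_per_a2 * EV_PER_A2_TO_J_PER_M2
```

Because A already counts both surfaces, no additional factor of 2 appears in the denominator.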

Enthalpy and structural analysis at high pressure

The enthalpy H is given by

$$H(p)=E(V)+p(V)\cdot V,$$

(3)

where E is the internal energy, p is the pressure, and V is the volume. The volume dependence of the energy was determined by a Birch–Murnaghan fit to the energy–volume curve of each polymorph. p(V) was given by the corresponding derivative. The energy–volume curves were calculated by varying the volume by ±20% for α-quartz and coesite, by ±25% for stishovite and ±30% for all other phases. The corresponding structures were structurally optimised, allowing changes of the positions as well as of the box shape, while keeping the volume fixed. For the analysis of the compression MD simulations, coordination numbers were determined by integrating over the first peak of the partial Si–O radial distribution function. The Si–O bond distances are given by the first peak position of the partial Si–O radial distribution function.
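The evaluation of Eq. (3) from a fitted equation of state can be sketched as follows. The code assumes the third-order Birch–Murnaghan form (the order of the fit is not stated in the text) and takes already fitted parameters E₀, V₀, B₀ and B₀′ as inputs, with B₀ in eV Å⁻³; the function names are hypothetical.

```python
def bm3_energy(v, e0, v0, b0, b0p):
    """Third-order Birch-Murnaghan energy E(V)."""
    x = (v0 / v) ** (2.0 / 3.0)
    return e0 + 9.0 * v0 * b0 / 16.0 * (
        (x - 1.0) ** 3 * b0p + (x - 1.0) ** 2 * (6.0 - 4.0 * x)
    )


def bm3_pressure(v, v0, b0, b0p):
    """p(V) = -dE/dV for the same equation of state."""
    r = v0 / v
    return (
        1.5 * b0 * (r ** (7.0 / 3.0) - r ** (5.0 / 3.0))
        * (1.0 + 0.75 * (b0p - 4.0) * (r ** (2.0 / 3.0) - 1.0))
    )


def enthalpy(v, e0, v0, b0, b0p):
    """Eq. (3): H = E(V) + p(V) * V, with p in eV/A^3."""
    return bm3_energy(v, e0, v0, b0, b0p) + bm3_pressure(v, v0, b0, b0p) * v
```

Evaluating `bm3_pressure` and `enthalpy` on a grid of volumes gives H as a function of p for each polymorph, from which enthalpy differences at a common pressure can be read off.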

Interface energy model for the SiO crystallisation

We build an interface-energy-based model for a spherical inclusion of silicon with radius r within an amorphous SiO₂ matrix. The energy difference between the crystallised silicon and the amorphous silicon is given by

$$\Delta E=4\pi r^{2}\left(\gamma_{{\rm{c}}-{\rm{Si}}-{\rm{a}}-{\rm{SiO}}_{2}}-\gamma_{{\rm{a}}-{\rm{Si}}-{\rm{a}}-{\rm{SiO}}_{2}}\right)-\frac{4\pi r^{3}\left(E_{{\rm{a}}-{\rm{Si}}}-E_{{\rm{c}}-{\rm{Si}}}\right)}{3V_{\rm{atom}}}.$$

(4)

Here, $\gamma_{{\rm{a}}-{\rm{Si}}-{\rm{a}}-{\rm{SiO}}_{2}}$ is the interface energy between amorphous silicon and amorphous silica, $\gamma_{{\rm{c}}-{\rm{Si}}-{\rm{a}}-{\rm{SiO}}_{2}}$ is the interface energy between crystalline silicon and amorphous silica, E_a−Si is the energy of the corresponding amorphous silicon structure (cf. Supplementary Table IV), E_c−Si is the energy of crystalline silicon, and V_atom is the volume per atom. Details about the interface energies we used can be found in Supplementary Note 2.
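Eq. (4) has a non-trivial root at r* = 3 V_atom (γ_c−Si−a−SiO₂ − γ_a−Si−a−SiO₂) / (E_a−Si − E_c−Si), above which crystallisation of the inclusion lowers the energy, provided that both differences are positive. The Python sketch below evaluates ΔE and this crossover radius; the function names are hypothetical, and any numerical inputs must be supplied by the user, as none are taken from the paper here.

```python
import math


def delta_e(r, gamma_c, gamma_a, e_a, e_c, v_atom):
    """Eq. (4): energy change on crystallising a Si sphere of radius r.

    gamma_c, gamma_a: c-Si/a-SiO2 and a-Si/a-SiO2 interface energies
    e_a, e_c:         energies per atom of amorphous and crystalline Si
    v_atom:           volume per atom
    """
    surface_term = 4.0 * math.pi * r ** 2 * (gamma_c - gamma_a)
    bulk_term = 4.0 * math.pi * r ** 3 * (e_a - e_c) / (3.0 * v_atom)
    return surface_term - bulk_term


def crossover_radius(gamma_c, gamma_a, e_a, e_c, v_atom):
    """Radius at which delta_e changes sign (non-trivial root of Eq. 4)."""
    return 3.0 * v_atom * (gamma_c - gamma_a) / (e_a - e_c)
```

Consistent units are required, for example interface energies in eV Å⁻², energies in eV per atom and volumes in Å³, which gives r* in Å.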

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The potential parameter files, the reference data with SCAN labels, and additional supporting data (including LAMMPS scripts and input configurations) generated in this study are openly available in the Zenodo repository at https://doi.org/10.5281/zenodo.10419194⁷⁰. Source data are provided with this paper.

Code availability

The codes for potential fitting and evaluation are publicly available and were used as provided by their respective authors, without modification. Custom-written Python scripts for data analysis are provided together with the corresponding data in the Zenodo repository.

References

Heaney, P. J., Prewitt, C. T. & Gibbs, G. V. (eds.) Silica: Physical Behavior, Geochemistry, and Materials Applications (De Gruyter, Berlin, Boston, 1994).
Yoshikawa, K. et al. Silicon heterojunction solar cell with interdigitated back contacts for a photoconversion efficiency over 26%. Nat. Energy 2, 1–8 (2017).
Li, Y. & Yu, J. Emerging applications of zeolites in catalysis, separation and host–guest assembly. Nat. Rev. Mater. 6, 1156–1174 (2021).
Soleimani Dorcheh, A. & Abbasi, M. Silica aerogel; synthesis, properties and characterization. J. Mater. Process. Technol. 199, 10–26 (2008).
Prescher, C. et al. Beyond sixfold coordinated Si in SiO₂ glass at ultrahigh pressures. Proc. Natl Acad. Sci. 114, 10041–10046 (2017).
Tracy, S. J., Turneaure, S. J. & Duffy, T. S. In situ X-Ray Diffraction of Shock-Compressed Fused Silica. Phys. Rev. Lett. 120, 135702 (2018).
Tracy, S. J., Turneaure, S. J. & Duffy, T. S. Structural response of α-quartz under plate-impact shock compression. Sci. Adv. 6, eabb3913 (2020).
Terman, L. M. An investigation of surface states at a silicon/silicon oxide interface employing metal-oxide-silicon diodes. Solid State Electron. 5, 285–299 (1962).
Card, H. C. Si–SiO₂ interface state spectroscopy using MOS tunneling structures. Solid State Electron. 22, 809–817 (1979).
Pantelides, S. T. et al. Si/SiO2 and SiC/SiO2 Interfaces for MOSFETs – Challenges and Advances. Mater. Sci. Forum 527–529, 935–948 (2006).
Potter, H. N. Silicon Monoxide. Transcr. Electrochem. Soc. 12, 191–214 (1907).
Brady, G. W. A Study of Amorphous SiO. J. Phys. Chem. 63, 1119–1120 (1959).
Greaves, G. N. EXAFS and the structure of glass. J. Non Cryst. Solids 71, 203–217 (1985).
Schulmeister, K. & Mader, W. TEM investigation on the structure of amorphous silicon monoxide. J. Non Cryst. Solids 320, 143–150 (2003).
Hirata, A. et al. Atomic-scale disproportionation in amorphous silicon monoxide. Nat. Commun. 7, 11591 (2016).
Hass, G. Preparation, Structure, and Applications of Thin Films of Silicon Monoxide and Titanium Dioxide. J. Am. Ceramic Soc. 33, 353–360 (1950).
Poat, D. Properties of pulse-deposited thin-film silicon monoxide capacitors. Thin Solid Films 4, 123–136 (1969).
Yang, J. et al. SiO_x-based anodes for secondary lithium batteries. Solid State Ionics 152–153, 125–129 (2002).
Liu, Z. et al. Silicon oxides: A promising family of anode materials for lithium-ion batteries. Chem. Soc. Rev. 48, 285–309 (2019).
Stillinger, F. H. & Weber, T. A. Computer simulation of local order in condensed phases of silicon. Phys. Rev. B 31, 5262–5271 (1985).
Tersoff, J. New empirical approach for the structure and energy of covalent systems. Phys. Rev. B 37, 6991–7000 (1988).
Lee, B.-J. A modified embedded atom method interatomic potential for silicon. Calphad 31, 95–104 (2007).
van Beest, B. W. H., Kramer, G. J. & van Santen, R. A. Force fields for silicas and aluminophosphates based on ab initio calculations. Phys. Rev. Lett. 64, 1955–1958 (1990).
Vashishta, P., Kalia, R. K., Rino, J. P. & Ebbsjö, I. Interaction potential for SiO₂: A molecular-dynamics study of structural correlations. Phys. Rev. B 41, 12197–12209 (1990).
Carré, A., Horbach, J., Ispas, S. & Kob, W. New fitting scheme to obtain effective potential from Car-Parrinello molecular-dynamics simulations: Application to silica. EPL Europhys. Lett. 82, 17001 (2008).
Yasukawa, A. Using An Extended Tersoff Interatomic Potential to Analyze The Static-Fatigue Strength of SiO2 under Atmospheric Influence. JSME Int. J. Ser. A Mech. Mater. Eng. 39, 313–320 (1996).
van Duin, A. C. T. et al. ReaxFFSiO Reactive Force Field for Silicon and Silicon Oxide Systems. J. Phys. Chem. A 107, 3803–3811 (2003).
Yu, J., Sinnott, S. B. & Phillpot, S. R. Charge optimized many-body potential for the Si/SiO₂ system. Phys. Rev. B 75, 085311 (2007).
Shan, T.-R. et al. Second-generation charge-optimized many-body potential for Si/SiO₂ and amorphous silica. Phys. Rev. B 82, 235302 (2010).
Bartók, A. P., Kermode, J., Bernstein, N. & Csányi, G. Machine Learning a General-Purpose Interatomic Potential for Silicon. Phys. Rev. X 8, 041048 (2018).
Erhard, L. C., Rohrer, J., Albe, K. & Deringer, V. L. A machine-learned interatomic potential for silica and its relation to empirical models. npj Comput. Mater. 8, 1–12 (2022).
Roy, S., Dürholt, J. P., Asche, T. S., Zipoli, F. & Gómez-Bombarelli, R. Learning a reactive potential for silica-water through uncertainty attribution. Preprint at https://arxiv.org/abs/2307.01705 (2023).
Drautz, R. Atomic cluster expansion for accurate and transferable interatomic potentials. Phys. Rev. B 99, 014104 (2019).
Lysogorskiy, Y. et al. Performant implementation of the atomic cluster expansion (PACE) and application to copper and silicon. npj Comput. Mater. 7, 1–12 (2021).
Bochkarev, A. et al. Efficient parametrization of the atomic cluster expansion. Phys. Rev. Mater. 6, 013804 (2022).
Sun, J., Ruzsinszky, A. & Perdew, J. P. Strongly Constrained and Appropriately Normed Semilocal Density Functional. Phys. Rev. Lett. 115, 036402 (2015).
Bonati, L. & Parrinello, M. Silicon liquid structure and crystal nucleation from ab initio deep metadynamics. Phys. Rev. Lett. 121, 265701 (2018).
Shapeev, A. V. Moment Tensor Potentials: A Class of Systematically Improvable Interatomic Potentials. Multiscale Model. Simul. 14, 1153–1173 (2016).
Novikov, I. S., Gubaev, K., Podryabinkin, E. V. & Shapeev, A. V. The MLIP package: Moment tensor potentials with MPI and active learning. Mach. Learn. Sci. Technol. 2, 025002 (2020).
Artrith, N. & Behler, J. High-dimensional neural network potentials for metal surfaces: A prototype study for copper. Phys. Rev. B 85, 045439 (2012).
Lysogorskiy, Y., Bochkarev, A., Mrovec, M. & Drautz, R. Active learning strategies for atomic cluster expansion models. Phys. Rev. Mater. 7, 043801 (2023).
Kong, L. et al. Overcoming the size limit of first principles molecular dynamics simulations with an in-distribution substructure embedding active learner. Preprint at https://arxiv.org/abs/2311.08177 (2023).
Menon, S., Lysogorskiy, Y., Rogal, J. & Drautz, R. Automated free-energy calculation from atomistic simulations. Phys. Rev. Mater. 5, 103801 (2021).
de Koning, M., Antonelli, A. & Yip, S. Optimized Free-Energy Evaluation Using a Single Reversible-Scaling Simulation. Phys. Rev. Lett. 83, 3973–3977 (1999).
Swamy, V., Saxena, S. K., Sundman, B. & Zhang, J. A thermodynamic assessment of silica phase diagram. J. Geophys. Res. Solid Earth 99, 11787–11794 (1994).
Otzen, C., Liermann, H.-P. & Langenhorst, F. Evidence for a rosiaite-structured high-pressure silica phase and its relation to lamellar amorphization in quartz. Nat. Commun. 14, 606 (2023).
Tsuchiya, T. & Nakagawa, S. A new high-pressure structure of SiO₂ directly converted from α-quartz under nonhydrostatic compression. J. Phys. Condensed Matter 34, 304003 (2022).
Murakami, M., Hirose, K., Ono, S. & Ohishi, Y. Stability of CaCl₂-type and α-PbO₂-type SiO₂ at high pressure and temperature determined by in-situ X-ray measurements. Geophys. Res. Lett. 30, 1207 (2003).
Kuwayama, Y., Hirose, K., Sata, N. & Ohishi, Y. The Pyrite-Type High-Pressure Form of Silica. Science 309, 923–925 (2005).
Kono, Y., Shu, Y., Kenney-Benson, C., Wang, Y. & Shen, G. Structural Evolution of SiO₂ Glass with Si Coordination Number Greater than 6. Phys. Rev. Lett. 125, 205701 (2020).
Petitgirard, S. et al. Magma properties at deep Earth’s conditions from electronic structure of silica. Geochem. Perspect. Lett. 9, 32–37 (2019).
Murakami, M. et al. Ultrahigh-pressure form of SiO₂ glass with dense pyrite-type crystalline homology. Phys. Rev. B 99, 045153 (2019).
Podryabinkin, E. V. & Shapeev, A. V. Active learning of linearly parametrized interatomic potentials. Comput. Mater. Sci. 140, 171–180 (2017).
Deringer, V. L. et al. Realistic Atomistic Structure of Amorphous Silicon from Machine-Learning-Driven Molecular Dynamics. J. Phys. Chem. Lett. 9, 2879–2885 (2018).
Deringer, V. L. et al. Origins of structural and electronic transitions in disordered silicon. Nature 589, 59–64 (2021).
Ferguson, F. T. & Nuth, J. A. Vapor Pressure of Silicon Monoxide. J. Chem. Eng. Data 53, 2824–2832 (2008).
AlKaabi, K., Prasad, D. L. V. K., Kroll, P., Ashcroft, N. W. & Hoffmann, R. Silicon Monoxide at 1 atm and Elevated Pressures: Crystalline or Amorphous? J. Am. Chem. Soc. 136, 3410–3423 (2014).
Nagamori, M., Boivin, J. A. & Claveau, A. Gibbs free energies of formation of amorphous Si₂O₃, SiO and Si₂O. J. Non Cryst. Solids 189, 270–276 (1995).
Munetoh, S., Motooka, T., Moriguchi, K. & Shintani, A. Interatomic potential for Si–O systems using Tersoff parameterization. Comput. Mater. Sci. 39, 334–339 (2007).
Mamiya, M., Takei, H., Kikuchi, M. & Uyeda, C. Preparation of fine silicon particles from amorphous silicon monoxide by the disproportionation reaction. J. Crystal Growth 229, 457–461 (2001).
Ko, T. W., Finkler, J. A., Goedecker, S. & Behler, J. A fourth-generation high-dimensional neural network potential with accurate electrostatics including non-local charge transfer. Nat. Commun. 12, 398 (2021).
Kresse, G. & Furthmüller, J. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci. 6, 15–50 (1996).
Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169–11186 (1996).
Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953–17979 (1994).
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59, 1758–1775 (1999).
Larsen, A. H. et al. The atomic simulation environment—a Python library for working with atoms. J. Phys. Condens. Matter 29, 273002 (2017).
Stukowski, A. Visualization and analysis of atomistic simulation data with OVITO–the Open Visualization Tool. Modell. Simul. Mater. Sci. Eng. 18, 015012 (2010).
Thompson, A. P. et al. LAMMPS - a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales. Comput. Phys. Commun. 271, 108171 (2022).
Prince, E. (ed.) International Tables for Crystallography. C: Mathematical, Physical and Chemical Tables, 3rd edn (Kluwer Academic, Dordrecht, 2004).
Erhard, L. C., Rohrer, J., Albe, K. & Deringer, V. L. Research data for “Modelling atomic and nanoscale structure in the silicon–oxygen system through active machine learning”. Zenodo, https://doi.org/10.5281/zenodo.10419194 (2024).
Hall, J. J. Electronic Effects in the Elastic Constants of n-Type Silicon. Phys. Rev. 161, 756–761 (1967).
Fukata, N., Kasuya, A. & Suezawa, M. Vacancy Formation Energy of Silicon Determined by a New Quenching Method. Japanese J. Appl. Phys. 40, L854 (2001).
Jaccodine, R. J. Surface Energy of Germanium and Silicon. J. Electrochem. Soc. 110, 524 (1963).

Acknowledgements

L.C.E. thanks Niklas Leimeroth for useful discussions. L.C.E. acknowledges support from the German Academic Exchange Service (Forschungsstipendien für Doktorandinnen und Doktoranden) and the Erasmus+ programme for support of two research stays at the University of Oxford. The research was supported by the Bundesministerium für Bildung und Forschung (BMBF) within the project FESTBATT under Grant No. 03XP0174A. J.R. and K.A. acknowledge support by the Deutsche Forschungsgemeinschaft (DFG, Grant no. 405621137, 405621160, and 521536863). L.C.E. acknowledges helpful discussion within the DFG GRK-2561 MatCom-ComMat. The authors gratefully acknowledge the computing time provided to them at the NHR Center NHR4CES at TU Darmstadt (project number 01539 and p0020142). This is funded by the Federal Ministry of Education and Research, and the state governments participating on the basis of the resolutions of the GWK for national high performance computing at universities (www.nhr-verein.de/unsere-partner).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institute of Materials Science, Technische Universität Darmstadt, Otto-Berndt-Strasse 3, D-64287, Darmstadt, Germany
Linus C. Erhard, Jochen Rohrer & Karsten Albe
Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford, OX1 3QR, United Kingdom
Volker L. Deringer

Authors

Linus C. Erhard
Jochen Rohrer
Karsten Albe
Volker L. Deringer

Contributions

L.C.E. performed all computations and analysis, with guidance from J.R., K.A., and V.L.D. All authors contributed substantially to the design of the research and to the interpretation of the results. L.C.E. and V.L.D. wrote the paper with input from all authors.

Corresponding authors

Correspondence to Jochen Rohrer, Karsten Albe or Volker L. Deringer.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Thomas Reichenbach, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Erhard, L.C., Rohrer, J., Albe, K. et al. Modelling atomic and nanoscale structure in the silicon–oxygen system through active machine learning. Nat Commun 15, 1927 (2024). https://doi.org/10.1038/s41467-024-45840-9

Received: 06 July 2023
Accepted: 02 February 2024
Published: 02 March 2024
DOI: https://doi.org/10.1038/s41467-024-45840-9
