Abstract
Silicon–oxygen compounds are among the most important ones in the natural sciences, occurring as building blocks in minerals and being used in semiconductors and catalysis. Beyond the well-known silicon dioxide, there are phases with different stoichiometric composition and nanostructured composites. One of the key challenges in understanding the Si–O system is therefore to accurately account for its nanoscale heterogeneity beyond the length scale of individual atoms. Here we show that a unified computational description of the full Si–O system is indeed possible, based on atomistic machine learning coupled to an active-learning workflow. We showcase applications to very-high-pressure silica, to surfaces and aerogels, and to the structure of amorphous silicon monoxide. In a wider context, our work illustrates how structural complexity in functional materials beyond the atomic and few-nanometre length scales can be captured with active machine learning.
Similar content being viewed by others
Introduction
Elemental silicon and its oxide, silica (SiO2), are widely studied building blocks of the world around us1: from minerals in geology to silicon-based computing architectures; thin-film solar cells in which amorphous silicon is the active material2; or zeolite catalysts based on the SiO2 parent composition3. Some of these materials have a single phase and are precisely defined on the atomic scale, whereas others show longer-ranging, hierarchical structures and varying degrees of disorder. For example, silica aerogels contain pores with sizes of 5–100 nm, leading to very low thermal conductivity and making aerogels promising candidates for thermal insulation4. Under pressure, SiO2 shows amorphous–amorphous transitions to structures exceeding sixfold coordination5, crystallisation from the amorphous phase under shock compression6, and conversely the formation of complex disordered phases from crystalline SiO27. Beyond fundamental studies, there is much technological importance in silicon–oxygen phases with nanoscale structure—the interface between Si and SiO2 is essential in silicon metal-oxide semiconductors, and defects at this interface have been investigated for decades8,9,10.
A material in the binary silicon–oxygen system which is in fact dominated by such interfaces is the so-called silicon monoxide (SiO). The structure of SiO was controversially discussed for long11,12; today, it is known as a nanoscopic mixture of amorphous Si and SiO213,14,15. Initial applications of SiO have been in protective layers for mirrors16 or dielectrics for thin-film capacitors17; more recently, the same material has emerged as a promising anode material for lithium-ion batteries18,19. However, to be able to fully exploit SiO in next-generation energy-storage solutions, it would be valuable to understand the features of the nanoscopic structure on an atomistic level.
To develop atomic-scale models of complex materials such as SiO, molecular-dynamics (MD) computer simulations have become a central research tool. While there are now plenty of interatomic potentials for silicon20,21,22 and silica23,24,25, the number of potentials for the mixed (i.e., full binary) system is limited due to its chemical complexity26,27,28,29. Alongside established, empirically fitted potentials based on physical models, alternatives based on large datasets and machine learning (ML) have emerged in recent years. These models have been fitted for silicon30 as well as silica31 and also for the more complex silica–water system32. ML potentials promise the accuracy of first-principles methods such as density-functional theory (DFT) for a small fraction of the cost. ML potentials are now firmly established in the field of computational materials science and their application to homogeneous phases has been well documented.
In the present work, we describe a unified computational model for the Si–O system that we have obtained with the help of an active-learning scheme for local environments. We extract representative atomic environments from large-scale simulations and embed them in a melt-quenched amorphous matrix, allowing us to sample representative environments for the fitting of accurate ML potentials. Our final model shows high accuracy across a wide configurational space including high-pressure silica, silica surfaces, and mixtures of silica and silicon. We showcase the usefulness of the method by creating fully atomistically resolved, 10-nm-scale structure models of amorphous and partially crystalline SiO.
Results
Active learning for nanoscale structure
We have developed a comprehensive dataset of atomistic structures and quantum-mechanical reference data for the binary Si–O system, as well as an interatomic potential fitted to that database in the atomic cluster expansion (ACE) framework33,34,35. We initialised the protocol with two existing datasets for silicon (Bartók et al. 30) and silica (Erhard et al. 31) respectively, and we then gradually explored the relevant configurational space using the active-learning workflow illustrated in Fig. 1. Quantum-mechanical reference ("training”) data for energies and forces were obtained with the strongly constrained and appropriately normed (SCAN)36 exchange–correlation functional for DFT, which shows good performance for elemental silicon37 and the various silica polymorphs31.
Our active-learning workflow follows three main tracks: high-pressure bulk silica, silica surfaces, and non-stoichiometric SiOx systems (Fig. 1a). The individual tracks are kept separate during initial training, i.e., they do not share their newly generated training data; however, in the end, all structures are merged into one comprehensive database.
The single subtracks are further divided into stages. In the first stage, we added initial structures, e.g., for crystalline high-pressure polymorphs or surface models. In the next stage, we fitted moment tensor potential (MTP) models38 to the database and used these MTPs to explore configurational space in MD and to identify new structures by active learning39. Energies and forces for new structures were computed with DFT and added to the database. This process was iterated until the extrapolation threshold (Supplementary Note 1B) was not exceeded during the MD trajectories anymore.
The third stage, highlighted in red in Fig. 1a, is the most important part of our workflow, and is based on large-scale simulations in each track. We used 2–4 MTPs trained on the same database to estimate a per-atom committee error, as is commonly done for neural-network potentials40. For atoms with high uncertainty (Supplementary Note 1C), we extracted the environments into smaller, “DFT-sized” cells by an approach that we call amorphous matrix embedding (Fig. 1b). After identifying an atom with high uncertainty, we cut out a cube containing the corresponding environment of the atom. This cube has a size which is feasible for performing DFT computations; it is generally chosen larger than twice the cut-off of the potential. After extracting the cube, the atoms within the cut-off radius of the atom with high uncertainty are kept fixed. The remaining structure is melted in an ML-MD simulation to create an amorphous matrix and smooth boundaries. Details of the procedure can be found in Supplementary Note 1C. We note very recent, related approaches to isolating fragments for active learning based on minimising the uncertainty for boundary atoms41,42.
The final database was obtained by merging the data of all tracks together, including some additional samples such as clusters and vacancies. This database contains 11,428 structures with a total of ≈1.3 million atoms (Supplementary Table 1). For validation, we held out 5% of these structures from training, selected at random.
Performance
The final potential is a complex non-linear ACE model, obtained by summation of one linear and seven non-linear ACE terms (Methods). This approach allows a more flexible description than just a linear or Finnis–Sinclair-like embedding, at only moderately higher computational expense. The resulting potential has a test-set root mean square error (RMSE) of 16.7 meV atom−1 for energies and 306 meV Å−1 for forces. These errors are averaged over the full dataset, however, and so they are not in themselves sufficient to characterise the quality of the potential. For example, they refer to a highly heterogeneous set of structures, with target energy values spanning more than 8 eV atom−1, and a range of forces of 40 eV Å−1 covered by the database. Furthermore, the numerical accuracy of the potential in certain parts of configurational space (e.g., crystalline polymorphs) is far more important than in others (e.g., liquid and amorphous structures).
In Table 1, we therefore show the performance of our model on different separate test sets. The complex non-linear ACE is compared to our previous silica GAP model described in ref. 31 ("SiO2-GAP-22” in the following), and also to simpler ACE models fitted to the new database using linear and Finnis–Sinclair-like embeddings, respectively. Indeed, the complex non-linear ACE potential is the only one among the three which achieves comparable errors to SiO2-GAP-22 for amorphous and crystalline silica structures. In contrast, for amorphous elemental silicon, mixed-stoichiometry as well as high-pressure phases the complex non-linear ACE is significantly more accurate than SiO2-GAP-22, since these structures are not part of the GAP database. This table therefore indicates the main challenge – and its solution – in the present model compared to the previous GAP: both are highly accurate for crystalline (≈1 meV atom−1) and bulk amorphous (≈5 meV atom−1) SiO2, but our ACE model caters to a much wider range of scenarios outside of the 1:2 stoichiometric composition.
Figure 2 shows the phase diagram of SiO2 calculated by thermodynamic integration43,44 using the ACE potential compared to a CALPHAD phase diagram from the literature45. The ACE and CALPHAD predictions agree well throughout, and for the boundary between quartz and coesite we observe almost quantitative agreement. In contrast, the cristobalite and tridymite phases seem to be over-stabilised. At 0 GPa, the melting point is notably overestimated (about 2400 K, compared to ≈2000 K experimentally45); moreover, the phase stability regions of both phases are more extended than in the reference. To illustrate the sensitivity of the analysis to small errors in predicted energies, we added a fictitious energy penalty of 5 meV atom−1 for cristobalite and tridymite (Supplementary Fig. 1a); in this case, the transition lines agree much better with the CALPHAD reference than before. Further numerical tests showed that the tridymite–cristobalite transition line, in particular, is strongly affected by small shifts in energy (Supplementary Fig. 1b–f). We thus conclude that the quantitative deviation seen in Fig. 2 is due to the inaccuracy of the underlying exchange–correlation functional, rather than indicating a shortcoming of the ACE approach. This is an example of the more general problem that any issues with the ground-truth method (e.g., numerical instabilities) will translate into the ML model.
High-pressure structural transitions of SiO2
Figure 3 characterises high-pressure properties of silica. In Fig. 3a, we show energy–volume curves of α-quartz, coesite, stishovite, α-PbO2-type, and pyrite-type silica as predicted by our ACE model and compared with DFT data, with which they agree well. In addition, we tested the behaviour of the model for rosiaite-type silica, which was recently observed in experiment46 and predicted theoretically47 for direct compression of α-quartz. In contrast to the structures mentioned before, this particular polymorph is not part of the training database. Nevertheless, the ACE model reproduces DFT data for this structure similarly well as for the other polymorphs.
Figure 3b shows an enthalpy–pressure diagram at 0 K. For lower pressures, there is a transition from α-quartz to coesite between 2.5 and 3.0 GPa, consistent with the predicted phase diagram (Fig. 2), followed by a transition to stishovite at 5.5–6.0 GPa. At higher pressures of ≈110 GPa, we observe the transition from stishovite to α-PbO2-type silica. Experimentally, rather than stishovite (rutile type), the structurally closely related CaCl2 (distorted rutile) type polymorph of silica is stable. The transition from CaCl2- to α-PbO2-type silica was observed at 120 GPa and 2400 K48. Given that our enthalpy data correspond to a temperature of 0 K, both values agree well with each other. For the transition of α-PbO2- to pyrite-type silica, our ACE model predicts a pressure of ≈246 GPa, in good agreement with the experimentally determined transition pressure of ≈260 GPa at 1800 K49. Finally, rosiaite-type silica46 is correctly identified as metastable over the pressure range studied.
Figure 3c shows the pressure evolution of the average coordination number (CN) of silicon atoms in amorphous silica, extracted from an MD simulation at room temperature and under isostatic pressure. The ACE results agree well with experiment up to about 50 GPa5,50, and with ab initio MD51 results over the whole pressure range. The good agreement with experiment is particularly pronounced for the data from ref. 50. Above 50 GPa, our model underestimates the average CN: at 175 GPa the experimental estimate is about 7; the ACE simulation predicts it to be 6. Importantly, this does not mean that there are no 7-fold coordinated environments, but there remain some 5-fold coordinated atoms as well, lowering the average (Fig. 3d). A possible reason for the good agreement with the ab initio result, but the deviation from experiment, might be the limited time scales in our simulations, which hinder a complete transition into higher-coordinated environments. Moreover, we note that computed X-ray-Raman spectra of the ab initio structures from ref. 51 are in good agreement with experiment indicating a lower CN. Other MD simulations also showed slightly lower CNs than the experimental values52. Figure 3e shows three different 7-fold coordinated environments extracted from the simulations.
A CN of 7 in amorphous silica might be surprising, since silicon is sixfold-coordinated in all crystalline silica polymorphs that are stable in this pressure range. However, the pyrite-type phase, which becomes thermodynamically stable at ≈240–260 GPa, contains silicon atoms with a 6+2-fold environment. A recent study found certain, but limited, similarities between these 7-fold environments in glassy and pyrite-type silica52.
In Supplementary Fig. 2, we show two additional structural fingerprints which have been commonly analysed in experiment: the position of the first sharp diffraction peak and the Si–O bond length. For both cases, our simulations show good agreement with experiment.
SiO2 surfaces and aerogels
Figure 4 tests the ability of the potential to accurately predict surface energies. We begin with validation for different α-quartz surfaces: we created surface slab models, relaxed them with the ACE potential, and evaluated the energetics, and therefore the surface energy per area, with DFT single-point computations (Fig. 4a). The ACE results agree well with DFT, especially considering that the training database does not contain all the surface terminations shown. The ACE model is also able to predict the stability of the reconstructed α-quartz (001) surface (Supplementary Fig. 3a, b). Whilst these surface energies can be computed with DFT, realistic amorphous surface energies are much more difficult to obtain, due to the required system sizes. Therefore, Fig. 4b validates the potential on 125 small-scale surface structures of amorphous SiO2, each containing 192 atoms. The surface models are created based on bulk structures from ref. 31; the latter had been generated in melt–quench simulations with different interatomic potentials and therefore span a range of energies. Regardless of the starting structure, the ACE model captures the surface energy for all slab models very well: the total RMSE is about 0.01 eV/Å2, and only a slight underestimation compared to DFT is seen. Moreover, there are no clear outliers although the various surface energies indicate a large diversity of the surface structures.
The amorphous surfaces shown are already very complex, but in reality they are often not flat as here. They have curvature, for example when occurring inside pores, and such complex structures can no longer be directly validated with DFT. Figure 4c therefore shows how well atomic environments in various porous amorphous structures are covered by the dataset. These structures were prepared by straining amorphous structures at elevated temperatures to the desired density. To validate the performance of the potential on this model, we show the linear extrapolation grade according to the maxvol selection41,53. An extrapolation grade above 1 corresponds to atomic environments that have not been covered in the training database. This does not mean that the potential is no longer reliable, as there is a certain range of more or less reliable extrapolation, but as the extrapolation grade increases, non-physical behaviour and failure of the potential becomes more and more likely39. For all porous structures, regardless of density, we find that the maximum extrapolation value is less than 1. Thus, we observe no pronounced extrapolation in any of the cases considered, indicating an accurate description of the potential for a variety of curved surfaces. In Supplementary Fig. 3e, f, we demonstrate the effectiveness of the ACE potential in reconstructing artifical amorphous surfaces. To this end, we cut out a spherical cavity from an amorphous bulk structure. Upon heating, the amount of incorrectly arranged atoms near the surface (<3 Å) decreases over time, indicating that the surface atoms are undergoing significant reconstructions.
Elemental silicon
Whilst our ACE model is designed for the binary Si–O system, we show in Table 2 the performance for diamond-like elemental silicon compared to both DFT and experiment. The bulk modulus is very well reproduced, whereas the vacancy formation energy is underestimated by about 30%. The experimental surface energies are well recovered by the ACE model, but this may be partly due to serendipity, because the SCAN ground-truth data show poorer quality (Table 2). Moreover, the potential captures the reconstruction of the diamond-type silicon (100) surface (Supplementary Fig. 3c, d). We also computed the linear thermal expansion coefficient of diamond-type silicon in the quasiharmonic approximation, finding almost perfect agreement between the ACE prediction and experiment; in particular, the unusual negative expansion coefficient below 130 K is reproduced (Supplementary Fig. 4). Finally, melt–quench simulations were performed to generate a-Si structures (Supplementary Fig. 5). The agreement with the experimental structure factor is as good as that for a GAP-18-generated structure from ref. 54. In addition, we are able to achieve lower quenching rates with the ACE than with the GAP, and for quenching rates as low as 1010 K s−1, we observed crystallisation.
Compared to Si-GAP-1830, we observe higher errors with respect to the reference data (Table 2). This is not a principal shortcoming of ACE compared to GAP: it was already shown that it is possible to fit an ACE potential with similar numerical accuracy as Si-GAP-18 to the same training database34. Instead, the lower accuracy might be caused by the extension of the database to a second element, compared to the Si-GAP-18 one, and its strong focus on the SiO2 part of the configurational space. We assume that this causes, in turn, a less accurate description of the configurational space of the elemental species. Larger training databases might help to overcome this issue in the future. Due to the lower accuracy in reproducing the SCAN data, our potential has some shortcomings for higher-pressure structures: the bc8 phase is erroneously predicted to be stable at elevated pressure (Supplementary Fig. 6), and upon compressing a-Si we do not observe the eventual crystallisation that is described by Si-GAP-18 (Supplementary Fig. 7)55. We emphasise that very-high-pressure silicon phases were not the scope of the present work – instead, we focus on the accurate description of ambient-pressure silicon as a constituent part of mixed binary phases and nanostructures.
SiO and mixed silicon–silica systems
Whilst the results so far have served to demonstrate the usefulness of the approach – both in terms of development of datasets and the fitting within the ACE framework – we are now able to study an actual application problem. To this end, Fig. 5a shows structural models of SiO. Experimentally, amorphous SiO is obtained by deposition of SiO from the gas phase56. In contrast, we created our models by melt–quench simulations. SiO phases are known to be metastable with respect to Si and SiO2. For example, a recent DFT-based crystal-structure prediction study explored possible ordered phases of homogeneous SiO, and found that these are metastable compared to a mixture of crystalline Si and SiO257. We verified that our ACE potential similarly reproduces the metastability of the ambient-pressure phases (Supplementary Fig. 9), and that it accurately predicts relevant Si–SiO2 interface energies (Supplementary Fig. 10). In good agreement with these results, our melt-quenched structures show a clear segregation between a-Si-like (blue) and a-SiO2-like regions (red). With decreasing quench rate, the number of silicon grains decreases while their size increases. Figure 5b shows the structure factor, S(q), for the structure quenched with 5 × 1012 K s−1 (data for the other structures are shown in Supplementary Fig. 11), which agrees best with the experimental data from ref. 15. Figure 5c shows the ratio between the volume of the silicon grains divided by the interface area. In the approximation of spherical particles, the grain diameter is d = 6 ⋅ VSi,grains/Ainterface. From this we can estimate average grain diameters between 24 and 54 Å for the tested quench rates. These grain diameters agree very well with transmission electron microscopy measurements, which indicated diameters of 30–40 Å14.
Figure 5d shows the excess energies of the structures referenced to to α-quartz and diamond-type silicon. The SiO structures were relaxed by optimisation of the cell size as well as the atomic positions at 0 K. As experimental reference, we show the standard enthalpy of formation of SiO58. The structures generated using quench rates of 5 × 1012 and 2 × 1012 K s−1 have energies comparable to experiment. Indeed, we can even create structures that are energetically more favourable than in experiment, noting again that our procedure to produce the structures deviates significantly from the experimental one.
But is this really an improvement compared to existing, empirically fitted interatomic potentials? We tested the Munetoh potential59 and a charge-optimised many-body (COMB) potential29 for the same procedure to generate structural models of SiO. The Munetoh potential yielded a homogeneous structure without observable segregation into silicon and SiO2, and the resulting structure factor (Supplementary Fig. 11c) deviates strongly from experiment. For the COMB potential, we observed pore formation at elevated temperatures, finally resulting in a strongly increased simulation-cell size. Therefore, we only equilibrated our best-matching structure at room temperature and analysed the change in structure factor (Supplementary Fig. 11f): again, we observed a strong deviation from experiment, indicating that the structure is very different from the ACE model prediction.
Crystallisation of silicon in amorphous SiO
Silicon in silicon monoxide is experimentally known to crystallise above 850 °C60. Figure 6 illustrates simulations of such crystallisation processes. The SiO structures shown in Fig. 5 were heated to 1400 K, causing the silicon-rich regions to melt while the silica matrix remained solid. The structures were then quenched to 1200 K within 20 ns (Fig. 6a). Through this cooling process, we noticed crystallisation in the silicon-rich regions of the SiO structure. Details are shown in Supplementary Fig. 12, indicating how crystallisation starts from two seeds, appearing shortly after each other and propagating throughout the structure. Before thermal treatment, all structures show nearly no sign of crystallinity, whereas afterwards, the structures with larger silicon grains do (Fig. 6c–f). This also affects the structure factor: those systems with no or only a small amount of crystallinity (quench rates: 1 × 1013 and 5 × 1012 K s−1) show structure factors that are still similar to the experimental structure factor of SiO (Fig. 6g), whereas those with larger amounts of crystalline silicon (1 × 1012 and 2 × 1012 K s−1) show distinct S(q) peaks that indicate crystallinity (Fig. 6h). This comparison allows us to exclude the occurrence of large amounts of crystalline silicon in silicon monoxide samples, given the experimental S(q) (cf. Fig. 6g and ref. 15).
Figure 6i shows that all structures are energetically more favourable after the thermal treatment. However, even though one might expect that the stronger crystallised structures gain more energy, we observe no direct connection between the energy gain and the level of crystallinity after heat treatment. The reason is likely that for the fast-quenched structure (1 × 1013 K s−1), the silicon-rich regions might be not perfectly arranged. Thermal treatment thus lowers the interface energy and the internal energy of the amorphous silicon region. For the crystallised structures there is, on one hand, an energy gain due to crystallisation but, on the other hand, an energy loss due to the higher interface energy between crystalline silicon and the silica matrix compared to that of a-Si and the silica matrix (cf. Supplementary Fig. 13). Based on these interface energies, we constructed a simple physically-based model (Methods) to calculate the energy gain of a spherical silicon inclusion in an amorphous silica matrix by crystallisation. This energy gain is shown in Fig. 6j for interface energies taken from four pairs of manually constructed interfaces. The difference between the models is rather small, indicating that the minimum radius of a spherical inclusion has to be around 9 Å to be energetically favourable in a crystalline state. We note that the manually constructed models do not have the same interface orientations as in the SiO structures and are also not perfectly relaxed, causing an overestimation of the interface energies – as seen, for example, in the difference between the interface energies of the SiO structures and of the manually constructed a-Si–a-SiO2 interface in Supplementary Fig. 13d. However, since for our model the difference between the interface energy between a-Si–a-SiO2 and c-Si–a-SiO2 is the only relevant quantity, we assume that these effects partially cancel.
Discussion
Understanding the microscopic nature of interfaces and nanostructured matter is essential to advancing materials research. Here, we have presented an active-learning scheme that we term “amorphous matrix embedding” that can realistically represent environments from large-scale simulations in DFT-accessible cells, enabling fast and accurate atomistic modelling of heterogeneous materials. We used the approach to develop a general-purpose interatomic potential for binary Si–O phases with varied compositions that is able to describe the trifecta of modelling challenges in this material system: very-high-pressure phases (relevant to geology), surfaces (relevant to catalysis), and mixed stoichiometric compositions with nanoscale heterogeneity (relevant to battery systems).
Using the ACE approach, we observe a speed-up of about two orders of magnitude compared to the more established GAP framework. This makes it possible to access long time scales and large length scales with DFT-like accuracy. Of course, there are still some shortcomings of this potential, e.g., the lower accuracy for pure silicon compared to the state of the art – but this use-case is not the focus of our work, as there are already competitive GAP and ACE models available30,34. In our case, the quality of the underlying meta-GGA data might cause an outperformance compared to earlier ML-potentials fitted with more economical GGA labels. The potential also underestimates the FSDP height of amorphous SiO2 (Supplementary Fig. 14), as already observed for our earlier GAP model31. Further research is required to identify the origin of this underestimation.
We hope that our work, and the dataset and resources developed therein, will advance the modelling of porous silica nanostructures as well as of high-pressure silica. For the Si–SiO2 interface, alternative interatomic potential models are scarce and the higher-quality potentials come with an expensive charge-equilibration term. Our tests showed that the ACE potential describes silicon monoxide in much closer agreement with experiment than existing empirical models. Additionally, we are able to generate crystallites in the SiO matrix as observed experimentally, showing that the ACE potential can be used for a wide range of applications in the Si–O system, including both ordered and disordered structures.
We view the present database and ML potential model as a starting point for wider-ranging studies in this important material system. In the future, higher accuracy for the mixed system might be achieved by using charge-equilibration schemes coupled with ML potentials61. However, this would come with much longer computing times as well as worse scaling for larger systems. Moreover, in the future, we will include lithium in the potential to investigate the battery performance of SiO on the atomistic scale.
Methods
Machine-learning potential fitting
We used two frameworks for fitting ML potential models. While constructing the reference database, we used Moment Tensor Potentials38 with active learning53 as implemented in the MLIP package39. For the final potential fit, we used the nonlinear Atomic Cluster Expansion (ACE)33,34 as implemented in PACEMAKER35. For ACE, we tested a range of combinations of embeddings, and found the following to be suitable:
with ϕi being atomic properties, which are expanded by the ACE basis functions (for details see ref. 33). The exponents of the embeddings include fractional exponents and higher integer powers of \({f}_{i}\in \left\{1/8,\, 1/4,\, 3/8,\, 3/4,\, 7/8,\, 2\right\}\). We found that especially fractions between 0 and 1 improved the behaviour of the potential. This approach goes beyond the previously suggested linear embedding (only the first term) and Finnis–Sinclair (the first two terms) type embedding34, and is referred to as “complex” embedding in Table 1. For the expansion of the atomic properties ϕi we used 600 basis functions with 5700 parameters. As radial basis we employed Bessel functions. A κ value of 0.01, which gives the ratio between force and energy weights value, was used during fitting. For optimisation we used the BFGS algorithm for 2000 steps.
DFT computations
All DFT computations were performed using VASP62,63 and the projector augmented-wave method64,65. We used the SCAN functional36 with an energy cutoff of 900 eV and a k-spacing of 0.23 Å−1. Surface were performed with dipole corrections. We note that these convergence parameters are optimised for silica; however, we found them to be also well converged for mixed phases and for pure silicon structures. Only for very-high-pressure silicon allotropes, a higher k-spacing would provide a relevant advantage; however, since these are not in the scope of the present work, we neglect these inaccuracies.
MD workflows
Simulation protocols were implemented using the atomic simulation environment (ASE)66 and the OVITO Python interface67. While optimisation and small-cell MD were partially performed with ASE, large-scale MD and statics simulations were carried out using LAMMPS68. The time step was 1 fs. For NVT simulations, we used a Nosé–Hoover thermostat with temperature damping constant of 100 fs; for NPT simulations, we added a Nosé–Hoover barostat with a pressure damping constant of 1000 fs.
For quench simulations we used the same protocol as in ref. 31. This protocol starts with a randomisation part at 6000 K for 10 ps (NVT). Afterwards the temperature is immediately reduced to 4000 K and held there for 100 ps (NPT, zero external pressure). From there the melt is quenched with different quench rates to 300 K (NPT, zero external pressure). At this temperature the structure is equilibrated for another 10 ps. In case of ‘hybrid’ simulations these quenches have been performed using the CHIK potential25 and afterwards the structure has been equilibrated for another 20 ps with the ACE potential.
As input for the compression simulations we generated amorphous structures using this quenching protocol with a quench rate of 1013 K s−1 and only the ACE potential. The compression was performed under isostatic conditions. In each step, the pressure was initially increased by 1 GPa within 2.5 ps of simulation time, followed by equlibration over 2.5 ps at the new pressure. This procedure was iteratively repeated. Coordination numbers were determined after equilibration.
The aerogel structures were created by a similar protocol as in ref. 31. An initial structure was randomised at 6000 K for 10 ps, instantly cooled to 4000 K and kept there for 100 ps. From this temperature, the liquid was cooled to 300 K with a quench rate of 1013 K s−1. During the equilibration at 4000 K and up to half of the quenching process, the cells were additionally extended to the desired density.
The mixed structures were created using the same protocol as in ref. 31 for producing amorphous structures. The volume of the silicon grains was determined within OVITO by creating bonds between silicon and oxygen atoms (cut-off: 2 Å) and deleting all atoms which have more than one such bond. This deletes the whole silica matrix. The interface area and volume is then determined by using the ConstructSurfaceMesh modifier (Gaussian density method, resolution: 50, radius scaling: 100%, isovalue: 0.1) on the remaining atoms.
Phase diagram calculations
Thermodynamic integration was carried out as implemented in calphy43,44. We used 50,000 equilibration steps, 800,000 switching steps for the switching to the Einstein crystal, as well as 300 steps/K for the thermodynamic integration to calculate the temperature dependence. Due to numerical issues, we fixed the spring constants of the Einstein crystal to 2 eV Å−2 for oxygen and 4 eV Å−2 for silicon. We carefully checked the influence of this constraint on the final results and found it to be negligible. More details on the phase diagram calculations can be found in Supplementary Note 2.
Structure factors
Faber–Ziman structure factors were obtained by summation of the Fourier transformations of the partial radial distribution functions calculated with OVITO. The corresponding partial structure factors were weighted by atomic form factors taken from ref. 69. For the high-pressure structures, we used a cut-off radius of 20 Å for the radial distribution function, and analysed a single snapshot (without time averaging). For the SiO structures, we used a cut-off of 80 Å and an average over 10 snapshots.
Surface energies
In these calculations we consider only stockiometric slabs. The surface energies, γ, were calculated as
where A is the total surface area (at the top and bottom of the slab combined), N is the number of particles in the slab, Eref is the bulk reference energy and Eslab is the potential energy of the slab. The slab energies for Table 2, Fig. 4a (relaxed), and Supplementary Fig. 3 have been calculated for DFT- and ACE-relaxed structures. For Fig. 4b, we used the ACE-relaxed structures also to determine the DFT single-point surface energy. The reference energy for the α-quartz surface is the energy of the optimised α-quartz unit cell per atom, and the reference energy for the diamond surface is obtained for the optimised diamond-type unit cell. The reference energy of the amorphous sample is the bulk energy of the same relaxed amorphous structure without surfaces.
Enthalpy and structural analysis at high pressure
The enthalpy H is given by
where E is the internal energy, p is the pressure, and V is the volume. The volume dependence of the energy was determined by a Birch–Murnaghan fit to the energy–volume curve of each polymorph. p(V) was given by the corresponding derivative. The energy-volume curves were calculated by varying the volume by ±20% for α-quartz and coesite, by ±25% for stishovite and ±30% for all other phases. The corresponding structures were structurally optimised, allowing changes of the positions as well of the box shape, however keeping the volume fixed. For the analysis of the compression MD simulations, coordination numbers were determined by integrating over the first peak of the partial Si–O radial distribution function. The Si–O bond distances are given by the first peak position of the partial Si–O radial-distribution function.
Interface energy model for the SiO crystallisation
We build a interface energy based model for a spherical inclusion of silicon with radius r within an amorphous SiO2 matrix. The energy difference between the crystallised silicon and the amorphous silicon is given by,
Here, \({\gamma }_{{{{{{{{\rm{a}}}}}}}}-{{{{{{{\rm{Si}}}}}}}}-{{{{{{{\rm{a}}}}}}}}-{{{{{{{{\rm{SiO}}}}}}}}}_{2}}\) is the interface energy between amorphous silicon and amorphous silica, \({\gamma }_{{{{{{{{\rm{c}}}}}}}}-{{{{{{{\rm{Si}}}}}}}}-{{{{{{{\rm{a}}}}}}}}-{{{{{{{{\rm{SiO}}}}}}}}}_{2}}\) is the interface energy between crystalline silicon and amorphous silica, Ea−Si is the energy of the corresponding amorphous silicon structure (cf. Supplementary Table IV), Ec−Si is the energy of crystalline silicon, and Vatom is the volume per atom. Details about the interface energies we used can be found in Supplementary Note 2.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The potential parameter files, the reference data with SCAN labels, and additional supporting data (including LAMMPS scripts and input configurations) generated in thus study are openly available in the Zenodo repository at https://doi.org/10.5281/zenodo.1041919470. Source data are provided with this paper.
Code availability
The codes for potential fitting and evaluation are publicly available and were used as provided by their respective authors, without modification. Custom-written Python scripts for data analysis are provided together with the corresponding data in the Zenodo repository.
References
Heaney, P. J., Prewitt, C. T. & Gibbs, G. V. (eds.) Silica: Physical Behavior, Geochemistry, and Materials Applications (De Gruyter, Berlin, Boston, 1994).
Yoshikawa, K. et al. Silicon heterojunction solar cell with interdigitated back contacts for a photoconversion efficiency over 26%. Nat. Energy 2, 1–8 (2017).
Li, Y. & Yu, J. Emerging applications of zeolites in catalysis, separation and host–guest assembly. Nat. Rev. Mater. 6, 1156–1174 (2021).
Soleimani Dorcheh, A. & Abbasi, M. Silica aerogel; synthesis, properties and characterization. J. Mater. Process. Technol. 199, 10–26 (2008).
Prescher, C. et al. Beyond sixfold coordinated Si in SiO2 glass at ultrahigh pressures. Proc. Natl Acad. Sci. 114, 10041–10046 (2017).
Tracy, S. J., Turneaure, S. J. & Duffy, T. S. In situ X-Ray Diffraction of Shock-Compressed Fused Silica. Phys. Rev. Lett. 120, 135702 (2018).
Tracy, S. J., Turneaure, S. J. & Duffy, T. S. Structural response of α-quartz under plate-impact shock compression. Sci. Adv. 6, eabb3913 (2020).
Terman, L. M. An investigation of surface states at a silicon/silicon oxide interface employing metal-oxide-silicon diodes. Solid State Electron. 5, 285–299 (1962).
Card, H. C. Si–SiO2 interface state spectroscopy using MOS tunneling structures. Solid State Electron. 22, 809–817 (1979).
Pantelides, S. T. et al. Si/SiO2 and SiC/SiO2 Interfaces for MOSFETs – Challenges and Advances. Mater. Sci. Forum 527–529, 935–948 (2006).
Potter, H. N. Silicon Monoxide. Transcr. Electrochem. Soc. 12, 191–214 (1907).
Brady, G. W. A Study of Amorphous SiO. J. Phys. Chem. 63, 1119–1120 (1959).
Greaves, G. N. EXAFS and the structure of glass. J. Non Cryst. Solids 71, 203–217 (1985).
Schulmeister, K. & Mader, W. TEM investigation on the structure of amorphous silicon monoxide. J. Non Cryst. Solids 320, 143–150 (2003).
Hirata, A. et al. Atomic-scale disproportionation in amorphous silicon monoxide. Nat. Commun. 7, 11591 (2016).
Hass, G. Preparation, Structure, and Applications of Thin Films of Silicon Monoxide and Titanium Dioxide. J. Am. Ceramic Soc. 33, 353–360 (1950).
Poat, D. Properties of pulse-deposited thin-film silicon monoxide capacitors. Thin Solid Films 4, 123–136 (1969).
Yang, J. et al. SiOx-based anodes for secondary lithium batteries. Solid State Ionics 152–153, 125–129 (2002).
Liu, Z. et al. Silicon oxides: A promising family of anode materials for lithium-ion batteries. Chem. Soc. Rev. 48, 285–309 (2019).
Stillinger, F. H. & Weber, T. A. Computer simulation of local order in condensed phases of silicon. Phys. Rev. B 31, 5262–5271 (1985).
Tersoff, J. New empirical approach for the structure and energy of covalent systems. Phys. Rev. B 37, 6991–7000 (1988).
Lee, B.-J. A modified embedded atom method interatomic potential for silicon. Calphad 31, 95–104 (2007).
van Beest, B. W. H., Kramer, G. J. & van Santen, R. A. Force fields for silicas and aluminophosphates based on ab initio calculations. Phys. Rev. Lett. 64, 1955–1958 (1990).
Vashishta, P., Kalia, R. K., Rino, J. P. & Ebbsjö, I. Interaction potential for SiO 2 : A molecular-dynamics study of structural correlations. Phys. Rev. B 41, 12197–12209 (1990).
Carré, A., Horbach, J., Ispas, S. & Kob, W. New fitting scheme to obtain effective potential from Car-Parrinello molecular-dynamics simulations: Application to silica. EPL Europhys. Lett. 82, 17001 (2008).
Yasukawa, A. Using An Extended Tersoff Interatomic Potential to Analyze The Static-Fatigue Strength of SiO2 under Atmospheric Influence. JSME Int. J. Ser. A Mech. Mater. Eng. 39, 313–320 (1996).
van Duin, A. C. T. et al. ReaxFFSiO Reactive Force Field for Silicon and Silicon Oxide Systems. J. Phys. Chem. A 107, 3803–3811 (2003).
Yu, J., Sinnott, S. B. & Phillpot, S. R. Charge optimized many-body potential for the Si / SiO2 system. Phys. Rev. B 75, 085311 (2007).
Shan, T.-R. et al. Second-generation charge-optimized many-body potential for Si / SiO2 and amorphous silica. Phys. Rev. B 82, 235302 (2010).
Bartók, A. P., Kermode, J., Bernstein, N. & Csányi, G. Machine Learning a General-Purpose Interatomic Potential for Silicon. Phys. Rev. X 8, 041048 (2018).
Erhard, L. C., Rohrer, J., Albe, K. & Deringer, V. L. A machine-learned interatomic potential for silica and its relation to empirical models. npj Comput. Mater. 8, 1–12 (2022).
Roy, S., Dürholt, J. P., Asche, T. S., Zipoli, F. & Gómez-Bombarelli, R. Learning a reactive potential for silica-water through uncertainty attribution. Preprint at https://arxiv.org/abs/2307.01705 (2023).
Drautz, R. Atomic cluster expansion for accurate and transferable interatomic potentials. Phys. Rev. B 99, 014104 (2019).
Lysogorskiy, Y. et al. Performant implementation of the atomic cluster expansion (PACE) and application to copper and silicon. npj Comput. Mater. 7, 1–12 (2021).
Bochkarev, A. et al. Efficient parametrization of the atomic cluster expansion. Phys. Rev. Mater. 6, 013804 (2022).
Sun, J., Ruzsinszky, A. & Perdew, J. P. Strongly Constrained and Appropriately Normed Semilocal Density Functional. Phys. Rev. Lett. 115, 036402 (2015).
Bonati, L. & Parrinello, M. Silicon liquid structure and crystal nucleation from ab initio deep metadynamics. Phys. Rev. Lett. 121, 265701 (2018).
Shapeev, A. V. Moment Tensor Potentials: A Class of Systematically Improvable Interatomic Potentials. Multiscale Model. Simul. 14, 1153–1173 (2016).
Novikov, I. S., Gubaev, K., Podryabinkin, E. V. & Shapeev, A. V. The MLIP package: Moment tensor potentials with MPI and active learning. Mach. Learn. Sci. Technol. 2, 025002 (2020).
Artrith, N. & Behler, J. High-dimensional neural network potentials for metal surfaces: A prototype study for copper. Phys. Rev. B 85, 045439 (2012).
Lysogorskiy, Y., Bochkarev, A., Mrovec, M. & Drautz, R. Active learning strategies for atomic cluster expansion models. Phys. Rev. Mater. 7, 043801 (2023).
Kong, L. et al. Overcoming the size limit of first principles molecular dynamics simulations with an in-distribution substructure embedding active learner. Preprint at https://arxiv.org/abs/2311.08177 (2023).
Menon, S., Lysogorskiy, Y., Rogal, J. & Drautz, R. Automated free-energy calculation from atomistic simulations. Phys. Rev. Mater. 5, 103801 (2021).
de Koning, M., Antonelli, A. & Yip, S. Optimized Free-Energy Evaluation Using a Single Reversible-Scaling Simulation. Phys. Rev. Lett. 83, 3973–3977 (1999).
Swamy, V., Saxena, S. K., Sundman, B. & Zhang, J. A thermodynamic assessment of silica phase diagram. Journal of Geophys. Res. Solid Earth 99, 11787–11794 (1994).
Otzen, C., Liermann, H.-P. & Langenhorst, F. Evidence for a rosiaite-structured high-pressure silica phase and its relation to lamellar amorphization in quartz. Nat. Commun. 14, 606 (2023).
Tsuchiya, T. & Nakagawa, S. A new high-pressure structure of SiO2 directly converted from α-quartz under nonhydrostatic compression. J. Phys. Condensed Matter 34, 304003 (2022).
Murakami, M., Hirose, K., Ono, S. & Ohishi, Y. Stability of CaCl2-type and α-PbO2-type SiO2 at high pressure and temperature determined by in-situ X-ray measurements. Geophys. Res. Lett. 30, 1207 (2003).
Kuwayama, Y., Hirose, K., Sata, N. & Ohishi, Y. The Pyrite-Type High-Pressure Form of Silica. Science 309, 923–925 (2005).
Kono, Y., Shu, Y., Kenney-Benson, C., Wang, Y. & Shen, G. Structural Evolution of SiO2 Glass with Si Coordination Number Greater than 6. Phys. Rev. Lett. 125, 205701 (2020).
Petitgirard, S. et al. Magma properties at deep Earth’s conditions from electronic structure of silica. Geochem. Perspect. Lett. 9, 32–37 (2019).
Murakami, M. et al. Ultrahigh-pressure form of SiO2 glass with dense pyrite-type crystalline homology. Phys. Rev. B 99, 045153 (2019).
Podryabinkin, E. V. & Shapeev, A. V. Active learning of linearly parametrized interatomic potentials. Comput. Mater. Sci. 140, 171–180 (2017).
Deringer, V. L. et al. Realistic Atomistic Structure of Amorphous Silicon from Machine-Learning-Driven Molecular Dynamics. J. Phys. Chem. Lett. 9, 2879–2885 (2018).
Deringer, V. L. et al. Origins of structural and electronic transitions in disordered silicon. Nature 589, 59–64 (2021).
Ferguson, F. T. & Nuth, J. A. Vapor Pressure of Silicon Monoxide. J. Chem. Eng. Data 53, 2824–2832 (2008).
AlKaabi, K., Prasad, D. L. V. K., Kroll, P., Ashcroft, N. W. & Hoffmann, R. Silicon Monoxide at 1 atm and Elevated Pressures: Crystalline or Amorphous? J. Am. Chem. Soc. 136, 3410–3423 (2014).
Nagamori, M., Boivin, J. A. & Claveau, A. Gibbs free energies of formation of amorphous Si2O3, SiO and Si2O. J. Non Cryst. Solids 189, 270–276 (1995).
Munetoh, S., Motooka, T., Moriguchi, K. & Shintani, A. Interatomic potential for Si–O systems using Tersoff parameterization. Comput. Mater. Sci. 39, 334–339 (2007).
Mamiya, M., Takei, H., Kikuchi, M. & Uyeda, C. Preparation of fine silicon particles from amorphous silicon monoxide by the disproportionation reaction. J. Crystal Growth 229, 457–461 (2001).
Ko, T. W., Finkler, J. A., Goedecker, S. & Behler, J. A fourth-generation high-dimensional neural network potential with accurate electrostatics including non-local charge transfer. Nat. Commun. 12, 398 (2021).
Kresse, G. & Furthmüller, J. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci. 6, 15–50 (1996).
Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169–11186 (1996).
Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953–17979 (1994).
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59, 1758–1775 (1999).
Larsen, A. H. et al. The atomic simulation environment—a Python library for working with atoms. J. Phys. Condens. Matter 29, 273002 (2017).
Stukowski, A. Visualization and analysis of atomistic simulation data with OVITO–the Open Visualization Tool. Modell. Simul. Mater. Sci. Eng. 18, 015012 (2010).
Thompson, A. P. et al. LAMMPS - a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales. Comput. Phys. Commun. 271, 108171 (2022).
Prince, E. (ed.) International Tables for Crystallography. C: Mathematical, Physical and Chemical Tables, 3rd edn (Kluwer Academic, Dordrecht, 2004).
Erhard, L. C., Rohrer, J., Albe, K. & Deringer, V. L. Research data for “Modelling atomic and nanoscale structure in the silicon–oxygen system through active machine learning”. Zenodo, https://doi.org/10.5281/zenodo.10419194 (2024).
Hall, J. J. Electronic Effects in the Elastic Constants of n -Type Silicon. Phys. Rev. 161, 756–761 (1967).
Fukata, N., Kasuya, A. & Suezawa, M. Vacancy Formation Energy of Silicon Determined by a New Quenching Method. Japanese J. Appl. Phys. 40, L854 (2001).
Jaccodine, R. J. Surface Energy of Germanium and Silicon. J. Electrochem. Soc. 110, 524 (1963).
Acknowledgements
L.C.E. thanks Niklas Leimeroth for useful discussions. L.C.E. acknowledges support from the German Academic Exchange Service (Forschungsstipendien für Doktorandinnen und Doktoranden) and the Erasmus+ programme for support of two research stays at the University of Oxford. The research was supported by the Bundesministerium für Bildung und Forschung (BMBF) within the project FESTBATT under Grant No. 03XP0174A. J.R. and K.A. acknowledge support by the Deutsche Forschungsgemeinschaft (DFG, Grant no. 405621137, 405621160, and 521536863). L.C.E. acknowledges helpful discussion within the DFG GRK-2561 MatCom-ComMat. The authors gratefully acknowledge the computing time provided to them at the NHR Center NHR4CES at TU Darmstadt (project number 01539 and p0020142). This is funded by the Federal Ministry of Education and Research, and the state governments participating on the basis of the resolutions of the GWK for national high performance computing at universities (www.nhr-verein.de/unsere-partner).
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
L.C.E. performed all computations and analysis, with guidance from J.R., K.A., and V.L.D. All authors contributed substantially to the design of the research and to the interpretation of the results. L.C.E. and V.L.D. wrote the paper with input from all authors.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Thomas Reichenbach, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Erhard, L.C., Rohrer, J., Albe, K. et al. Modelling atomic and nanoscale structure in the silicon–oxygen system through active machine learning. Nat Commun 15, 1927 (2024). https://doi.org/10.1038/s41467-024-45840-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-45840-9
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.