Abstract
Polaron defects are ubiquitous in materials and play an important role in many processes involving carrier mobility, charge transfer and surface reactivity. Determining small polarons’ spatial distributions is essential to understand materials properties and functionalities. However, the required exploration of the configurational space is computationally demanding when using first principles methods. Here, we propose a machinelearning (ML) accelerated search that determines the ground state polaronic configuration. The ML model is trained on databases of polaron configurations generated by density functional theory (DFT) via molecular dynamics or random sampling. To establish a mapping between configurations and their stability, we designed descriptors modelling the interactions among polarons and charged point defects. We used the DFT+ML protocol to explore the polaron configurational space for two surfacesystems, reduced rutile TiO_{2}(110) and Nbdoped SrTiO_{3}(001). The MLaided search proposes additional polaronic configurations and can be utilized to determine optimal polaron distributions at any charge concentration.
Introduction
Polarons are quasiparticles that can form in polarizable materials by entanglement between charge carriers and lattice distortions^{1,2}. An unbound electron or hole injected in a material can interact with the lattice vibrations (phonon). If the electronphonon coupling is strong enough, the excess carrier induces local structural deformations associated with a potential well in which the carrier is selftrapped^{3,4}. Polarons represent a longstanding yet still exciting field of research with profound impact in different disciplines ranging from physics to chemistry and material science. These charged quasiparticles play a pivotal role in several phenomena of practical importance such as carrier mobility^{5,6,7,8,9,10,11}, surface reactivity^{12,13,14,15,16} and optical excitations^{17,18,19,20}, as well as representing a testbed for the development of many theoretical models and numerical methods^{1}. Small polarons, whose wave function is spatially confined within a few Ångström (Å) around their trapping site, are subjected to thermallyactivated hopping processes, which enable polaron mobility. As a consequence, small polarons can travel through the material forming different spatial distributions (polaron configurations) that have a strong impact on the properties and functionality of the material^{21}. Predicting favorable polaron configurations is key to correctly interpret experimental measurements and predict material’s behaviour, but it is a challenging task. Polaron groundstate distributions result from the balance between contrasting interactions, primarily polaronpolaron repulsion and the attraction between the negativelycharged polarons and the positivelycharged donor defects (e.g., oxygen vacancies and/or dopants)^{22}. Considering that polaron formation is favored on surface or nearsurface sites, dimensionality effects and surface reconstruction also play a crucial role in defining the optimal polaron configuration, complicating the scenario even further^{14,21}.
Materialspecific properties of small polarons are typically computed by using first principles approaches in the framework of DFT and appropriate extensions^{22,23,24,25}. The DFT modelling of defectsinduced polarons is complicated by the need to adopt large supercells in order to attenuate artificial interactions between periodic images of the polaron, which hampers an efficient exploration of the huge configurational space and makes the calculations computationally very demanding^{26,27,28}.
The routine approach to explore the polaron configurational space is based on a combination of manual selection and molecular dynamics (MD)^{22,29,30}. This protocol involves a threestep procedure: (i) Generating a pool of initial structures, where the polaron trapping sites are manually selected (using different types of sitecontrolled strategies^{22,30,31}); (ii) subsequent MD runs at temperatures high enough to activate small polaron hopping, thus allowing for a guided exploration of energetically favorable configurations; (iii) finally, a set of groundstate static DFT calculations for all inequivalent configurations found in the MD runs will determine the energetically favorable solutions at low temperature. This DFTMD scheme can be easily automatized in a workflow, but the prohibitively long time required to execute MD runs on large supercells and the sporadic thermallyactivated polaron hopping events, prevent an efficient exploration and restricts de facto the search to a limited subset of the full configurational space.
An alternative approach to bypass the extensive MDsimulations (step ii) would be to create a larger pool of manuallyselected polaronic configurations in step i and directly determine their energy stability as in step iii. However, this strategy relies on human intuition that could bias the selection process of trapping sites, thus excluding possible favorable configurations from the investigated pool. Exhaustive searches including all possible symmetrically inequivalent configurations can be performed for the simplest cases, i.e., the dilute limit, where the amount of polarons is relatively low^{30}. At higher defect concentrations, however, the combinatorial growth of configurations is excessive (i.e., dense limit). Therefore, random sampling approaches have to be used, which cannot guarantee the determination of the proper ground state. For this reason, alternative sampling methods are necessary to correctly describe the properties of materials hosting small polarons.
In this work we propose a datadriven strategy for an accelerated exploration of the polaron configurational space in order to predict the optimal trappingsite patterns for polarons. In recent years, machine learning (ML) has been used extensively in problems involving many combinatorial possibilities^{32}, as in the cases of chemical compound space^{33}, materials design^{34} and the calculation of potential energy surfaces and force fields^{35,36,37}. ML approaches accounting for the electronic charge, spin, and oxidation states in materials have been recently designed using artificial neural networks^{38,39,40}, however, their application to multipolaron system has not been addressed yet. Here, we propose a simple kernel regression scheme^{41,42}, with descriptors that embody the polaronpolaron and polarondefects interactions, trained on minimal DFT datasets to assess the relative stability of the initial (few hundreds) polaron configurations. The trained ML algorithm can then be used to extend the exploration of the configurational space by systematically analyzing millions of configurations, going far beyond the limits of first principles MD or any alternative sampling approaches based entirely on DFT.
The proposed DFT+ML strategy is applied here to two prototypical polaronic materials considering different types of doping: the oxygendefective rutile TiO_{2}(110) surface^{12,15,22,29,43,44,45,46} and the Nbdoped perovskite SrTiO_{3}(001) surface^{47,48,49}. We show that the MLaided search correctly identifies the groundstate polaron configuration for arbitrary carrier density (from the dilute to the dense limit), as confirmed by benchmark DFT tests on selected MLpredicted configurations. Our results show that our proposed method can be employed to efficiently determine the optimal polaron patterns in diverse polaronic materials, and can be trained to account for the interaction of polarons with different pointlike defects (oxygen vacancies and dopants, but also interstitials and adatoms for example).
Results
Machine learning approach
We start with a brief description of the MLaided algorithm designed to predict the stability of multipolaron configurations (more details can be found in the Methods 4). The schematic protocol, shown in Fig. 1, involves a mapping between a general surface structure containing chargedonor defects (e.g., dopants or oxygen vacancies, Fig. 1(a)) and possible polaron hosting sites (typically transition metal ions) with a kernelregression scheme. The connection is established by means of a simple descriptor (D) of local interactions (Fig. 1b, details provided in Descriptors section) which encode the spatial distance between the reference polaron to other polarons and point defects included within a cutoff sphere. This representation is used in a kernel regression scheme (Fig. 1(c)) to map a given manypolaron configuration to the corresponding polaron formation energy (E_{pol}) as calculated by DFT (see Fig. 1(d)). In DFT the polaron formation energy \({E}_{{{{\rm{pol}}}}}^{{{{\rm{DFT}}}}}\) is defined as the total energy difference between the polaronic solution (\({E}_{{{{\rm{loc}}}}}^{{{{\rm{dist}}}}}\), with the excess charges localized in a locally distorted lattice sites) and the delocalized solution (\({E}_{{{{\rm{deloc}}}}}^{{{{\rm{undist}}}}}\), with all excess electrons uniformly delocalized over the lattice), \({E}_{{{{\rm{pol}}}}}^{{{{\rm{DFT}}}}}={E}_{{{{\rm{loc}}}}}^{{{{\rm{dist}}}}}{E}_{{{{\rm{deloc}}}}}^{{{{\rm{undist}}}}}\)^{22}. In our approach, the kernel regressor assigns a virtual singlepolaron energy to every polaron in the configuration; the predicted single polaron energies are summed up to return the total polaronic energy of the test configuration, which can be compared to the E_{pol} calculated by DFT.
This scheme is used to first train the kernel regressor on an energy dataset of different polaronic configurations calculated by DFT. Then, the trained ML algorithm can be applied to polaron configurations not included in the DFT training database, in order to drastically expand the exploration of the configuration space. To ensure the quality of the predictions, the optimal polaron patterns identified by the ML algorithm can be ultimately tested and refined by few final DFT calculations.
In the following, we assess the quality and general efficiency of this computational strategy on rutile TiO_{2}(110) and perovskite SrTiO_{3}(001) surfaces by studying the relative stability of millions of possible polaron configurations with varying polaron concentration (i.e., various polaron densities), from the dilute to dense polaron limits.
Rutile TiO_{2}(110)
The structural unit of rutile TiO_{2}(110) is sketched in Fig. 2(a). The surface layer S0 consists of alternating rows of five and sixfoldcoordinated Ti atoms, referred to as Ti^{A} and Ti^{B}, respectively. Subsurface and subsubsurface layers are indicated with the label S1 and S2, respectively. Ti atoms coordinated below the surface Ti\({}_{{{{\rm{S0}}}}}^{{{{\rm{A}}}}}\)/ Ti\({}_{{{{\rm{S0}}}}}^{{{{\rm{B}}}}}\) sites are labelled as A/Bsites, respectively, with an additional subscriptlindicating the layer, e.g., Ti\({}_{{{{\rm{S1}}}}}^{{{{\rm{A}}}}}\) refers to a Ti^{A} site located in the subsurface layer S1, whereas Ti\({}_{{{{\rm{S2}}}}}^{{{{\rm{B}}}}}\) indicates a Btype Ti located in S2. The surface terminates with two parallel rows of oxygen atoms oriented along the [001] direction, which are easily removable to form surface oxygen vacancies (V_{O}). Each V_{O} effectively donates two electrons that can be trapped in titanium sites converting two pristine Ti^{4+} ions into Ti^{3+} ions. Increasing the oxygen vacancy concentration \({c}_{{V}_{{{{\rm{O}}}}}}\) leads to higher densities of polarons. Due to the combinatorial growth of accessible polaron configurations with progressively larger amounts of polarons, finding the most favorable distribution of polarons is a very challenging task.
Here we employ a 5 layers thick slab with a large 9 × 2 twodimensional (2D) unit cell containing 36 Ti sites per layer^{14,21}. Since the bottom two layers are fixed to bulk form (polaron inactive since no structural relaxation are allowed), this setup results in 108 potential trapping sites. We have inspected nine different defect concentrations starting from 1 V_{O} per 2D unit cell (corresponding to \({c}_{{V}_{{{{\rm{O}}}}}}\) = 5.5%) up to 9 V_{O} (\({c}_{{V}_{{{{\rm{O}}}}}}\) = 50%). Since polaron trapping at Ti sites follows a binomial distribution, the number of possible configurations ranges from ≈5 × 10^{3} (2 polarons, \({c}_{{V}_{{{{\rm{O}}}}}}\) = 5.5%) to ≈10^{20} (18 polarons, \({c}_{{V}_{{{{\rm{O}}}}}}\) = 50%), without considering symmetries (see Supplementary Notes for a more detailed discussion).
To tackle this formidable problem, we rely on a previously generated MD polaron energy dataset^{14,21}. It consists of 492 symmetricallyinequivalent polaron configurations, obtained by running MD simulations at nine different V_{O} concentration levels (ranging from \({c}_{{V}_{{{{\rm{O}}}}}}\) = 5.5% up to \({c}_{{V}_{{{{\rm{O}}}}}}\) = 50%). Details on the database are provided in Supplementary Table 1. The MDbased dataset suggest that at low \({c}_{{V}_{{{{\rm{O}}}}}}\), polarons preferably localize at Ti\({}_{{{{\rm{S1}}}}}^{{{{\rm{A}}}}}\) sites, while at higher defect and polaron concentration the S0 sites become comparatively more populated^{14,21}.
To begin with, this DFTMD database was split in train, validation, and test datasets (70%, 20%, 10%, respectively). The ML scatter plots shown in Fig. 2b, c show that the model can well reproduce the \({\overline{E}}_{{{{\rm{pol}}}}}^{{{{\rm{DFT}}}}}\) with a mean square error MSE ≈ 1.2 ⋅ 10^{−4} for training, and MSE ≈ 1.3 ⋅ 10^{−4} for both validation and test (see Supplementary Table 5). We note that configurations at very shallow \({\overline{E}}_{{{{\rm{pol}}}}}^{{{{\rm{DFT}}}}}\) (i.e., values higher than −0.2 eV) show larger errors, which we attribute to the lack of samples in the training dataset, owing to rare occurrence of these unfavorable polaronic solutions in the MD. In fact, these polaron energies correspond to configurations where a polaron is localized on the unstable Ti\({}_{{{{\rm{S1}}}}}^{{{{\rm{B}}}}}\) site. Since we aim to identify the optimal polaron configuration, this issue is not of great concern, but it highlights the need of a well representative sampling of the polaron configurational space. As we show in SrTiO3(001) section, a random sampling can overcome these limits of an MDgenerated database.
To assess the validity of our method to predict configurations that strongly differ from the available training data, we have performed an additional test on unexplored polaron V_{O} concentrations, by adopting the ’omitted defect concentration’ strategy described below. We constructed nine distinct test cases, one for each V_{O} concentration level available in the MD database. In each of these cases, we limited the training dataset to samples belonging to only 8 of the 9 available defect concentrations. Finally, we tested the model on the remaining concentration, not used in the training. We analyzed qualitative and quantitative aspects of this procedure by conducting a direct comparison between MLderived and target DFT energies for specific defect concentrations. We also compared the most stable configurations found via ML and via DFTMD, by means of a convex hull diagram (reporting the most stable configuration at every defect concentration), to inspect whether ML is capable to deliver a qualitatively consistent energy landscape. This is particularly important for the case under study, since experimental measurements indicate that, for \({c}_{{V}_{{{{\rm{O}}}}}}\) ≈ 20%, highly defective TiO_{2}(110) undergoes a 1 × 1 to 1 × 2 surface reconstruction, which can be connected to a lowering of the polaron energy with increasing \({c}_{{V}_{{{{\rm{O}}}}}}\)beyond a certain critical polaron concentration^{14,50}.
The results are collected in Fig. 2(d) and in the Supplementary Fig. 5 and Supplementary Table 5. Even though we notice an apparent loss of precision for configurations that are different from those included in the training data, leading to larger systematic errors (the resulting MSE of \({\overline{E}}_{{{{\rm{pol}}}}}\) is of the order of 10^{−3} for all test concentrations), the ML model is still able to correctly identify the optimal arrangements of polarons from the MD dataset and find the correct minimum at \({c}_{{V}_{{{{\rm{O}}}}}}\) = 22%. This is shown by the dot sizes in Fig. 2(d), indicating the MLpredicted favorability of a configuration relative to other predicted configurations in the respective defect concentration. This result demonstrates that the DFT+ML scheme, trained with the available DFT dataset, is capable to predict with good accuracy the relative stability of solutions corresponding to different concentrations.
From these results, based on a MLprocessing of the DFTMD input database, we can conclude that a largescale MLbased systematic search of new configurations (i.e., configurations not included in the original DFT database) should be feasible, provided that the final results are validated by a subsequent comparative DFT simulation. In this way, an accelerated ML exploration of the full configurational space would be possible, and only a small fraction of first principles calculations would be necessary, limited to the most favorable configurations found by ML. Different methods could be used to perform this exhaustive search in the configuration space, and find the optimal polaron arrangement for each concentration: Our designed ’exhaustive search’ strategy is described in the following.
Systematic search of configurations via ML
An efficient search in the entire configuration space needs to address the problem of the combinatorial growth of polaron configurations with increasing polaron density (increasing \({c}_{{V}_{{{{\rm{O}}}}}}\)). In our DFT calculations, the unit cell contains 108 possible Ti trapping site, leading to \(\left({108}\atop{18}\right)\approx 1{0}^{21}\) possible arrangements of polarons in the most defective case (\({c}_{{V}_{{{{\rm{O}}}}}}\) = 50%, 9 V_{O} defects in our cell with 18 polarons). To cope with this huge combinatorial space, and avoid an inefficient blind search, we have developed a bottomup searching strategy based on the concept of polaron building units. We have first trained our ML model on the entire DFTMD database (492 inequivalent configurations). We have then addressed the problem of two polarons (lowest considered concentration, \({c}_{{V}_{{{{\rm{O}}}}}}\) = 5.5%) and predicted all \(\left({108}\atop{2}\right)=5778\) possible variants using the ML model, thus extending considerably the available DFTMD database at that concentration. From these 5778 configurations we selected the 100 with the most favorable polaron energy \({E}_{{{{\rm{pol}}}}}^{{{{\rm{ML}}}}}\) as calculated by ML, and used them as polaron building units to construct the polaron configurations at the next concentration level (\({c}_{{V}_{{{{\rm{O}}}}}}\) = 11.1%, 4 polarons). To this aim, we followed the same additive strategy, namely adding two additional polarons to each of the 100 twopolarons configurations, scrutinizing with the ML model all 106 not yet occupied hosting sites. Similarly to the previous concentration, also in this case only a subset of 100 distinct, energetically most favorable configurations has been used to build the next configurations (\({c}_{{V}_{{{{\rm{O}}}}}}\) = 16.7%, 6 polarons). The same protocol has been adopted for all higher concentrations. Finally, at each concentration the three best MLpredicted polaron configurations have been verified by performing a DFT run using MLpredicted pattern as input with a selective initialization of the occupation matrix (see Density Functional Theory section and Supplementary Methods). We refer to these final set of energies as exhaustive search MLDFT energies.
The scheme described above allows for a quick calculation of roughly 4 ⋅ 10^{6} configurational energies, excluding a larger number of highly unfavorable configurations, and keeping the computational cost constant among each defect concentration. The results are displayed in Fig. 2(d), where we show the comparison between the reference MDDFT convex hull (input database, dashed line) and the best obtained MLDFT energies at each concentration (full lines). In Supplementary Figs. 6 and 8a, b most stable configurations and the distribution of polarons to specific sites, respectively, are shown. Supplementary Table 7 and Supplementary Fig. 9 displays the energy distribution of the most relevant polaron configurations at \({c}_{{V}_{{{{\rm{O}}}}}}\)= 11.1%. The overall outcome is very satisfactory with a few positive aspects to note. First, the MLaided DFT procedure finds a lowest absolute minimum not present in the MD database, clearly demonstrating the remarkable efficiency of the proposed ML scheme in exploring the configurational space. The absolute minimum is shifted from \({c}_{{V}_{{{{\rm{O}}}}}}\)= 22.2% to 27.8% resulting in a much smoother shape of the MLderived convex hull. At defect concentrations lower than 22% the two curves essentially overlap, with the ML configurations all slightly lower than the best MD configurations (in this case with a modest improvement of few meV). More subtle are cases at \({c}_{{V}_{{{{\rm{O}}}}}}\) > 33%, where polarons start populating Ti\({}_{{{{\rm{S0}}}}}^{{{{\rm{B}}}}}\)sites. This is due to a poor sampling of Ti\({}_{{{{\rm{S0}}}}}^{{{{\rm{B}}}}}\)site in the MD simulations, and to a tendency at high concentration towards charge delocalization which impedes a sitespecific assignment of the excess charge, pushing the system out of the polaron regime. This problem has been particularly obvious at the highest concentration, where we were not able to reproduce the lowenergy configurations predicted by the MLmodel in our DFT calculations (see missing data point of MLDFT curve at \({c}_{{V}_{{{{\rm{O}}}}}}\) = 50%). As already mentioned, to avoid an unrealistic denselypacked arrangement of polarons or occupation of highly unfavorable trapping sites (unlikely to be observed in experiment^{51}), TiO_{2}(110) samples undergo a polarondriven (1 × 1) → (1 × 2) structural reconstruction at \({c}_{{V}_{{{{\rm{O}}}}}}\) ≈ 20%^{14}. Therefore, the apparent inefficiency of the ML search above the critical concentration is not an intrinsic deficiency of the designed strategy, rather it should be traced back to physical arguments that hamper the construction of a suitable database for an unrealistic situation. Forcing the structure to maintain the (1 × 1) symmetry leads to formation of Ti\({}_{{{{\rm{S0}}}}}^{{{{\rm{B}}}}}\) polarons, which are energetically not favorable, and tend towards charge delocalization and to polaron diffusion to the bulk (trapping at S_{2} sites)^{52}. In this sense the present ML data provide further support and validation to the DFTbased description of the polarondriven surface reconstruction discussed in ref. ^{14}.
SrTiO_{3}(001)
To assess the degree of transferability and generality of the proposed methodology we applied a similar scheme to Nbdoped SrTiO_{3}(001), a different structure (atomically flat perovskite surface, see Fig. 3(a)) with different source of excess charge (chemical doping instead of V_{O}), resulting in different type of interaction categories (see Descriptors section). To further generalize our DFT+ML procedure we decided to follow a different strategy to build the necessary DFT database. Instead of MD runs, which require long execution time and could result in an inefficient exploration of the configuration ground state, we have adopted a randomly generated polaronic database using the occupation matrix approach (see Density Functional Theory section, Supplementary Table 2 and Supplementary Methods). This procedure has few advantages as compared to the MDsampling methodology. First of all, obtained samples are less correlated than in the MDruns. Secondly, the bias towards low energy configurations, which are disproportionately more often visited in an MDsimulation, is removed. This should result in a more general model that has higher accuracy across all possible polaron patterns. Lastly, and most importantly, it fully bypasses the cost of the MDsimulations and only the structural relaxations for each distinct polaronic pattern has to be performed (see Supplementary Discussion). Following the same protocol as discussed in the previous section, we again performed a randomized split for training the model on the entire database, consisting of 379 polaronic configurations. Following this we assessed the extrapolation capabilities by testing the model on defect concentrations not present in the training data. Lastly, we performed an exhaustive bottomup search for all defect concentrations, were we then evaluated most favorable predictions at the DFTlevel.
Figure 3b, c show the results from the randomized datasplit. The model is converged to a similar extent as in the case of TiO_{2} (\({\overline{E}}_{{{{\rm{pol}}}}}\) MSE of 5.7 ⋅ 10^{−5}, 9.3 ⋅ 10^{−5} for training and test data, respectively), and, noticeable and unlike TiO_{2}, the model delivers consistent accuracy at virtually all energies. This increased performance most likely originates from the less biased energy database obtained by the random sampling. In the case of omitted defect concentrations (relative favorability prediction shown in Fig. 3d), the model correctly extrapolates the low energy configurations for the omitted concentration based on the energies of the 4 included concentrations, and suffers to a smaller extent from systematic errors as compared to TiO_{2} (compare TiO_{2} and SrTiO_{3} cases in Supplementary Tables 5 and 6 as well as Supplementary Fig. 5).
Finally, we tested the efficiency of the exhaustive ML search, by exploring 2.25 ⋅ 10^{6} nonequivalent configurations. The outcome (see dashed and full line in Fig. 3(d)), clearly shows that this MLaugmented scheme outperforms the standard randomdatabase approach, as it finds multiple configurations lower in energy than the minima identified in the randomized search, at any polaron concentration. We confirmed the five lowest energies \({E}_{{{{\rm{pol}}}}}^{{{{\rm{ML}}}}}\) predicted by the ML exhaustive search at each Nbconcentration by a comparative static DFTcalculation and each of the MLpredicted polaron pattern was found more stable than the optimal pattern included in the training database. The most stable configuration predicted by ML (see Supplementary Fig. 7 for a collection of the most optimal polaron configurations) typically improved the mean polaronic energy \({\overline{E}}_{{{{\rm{pol}}}}}\) by 30 to 50 meV compared to the reference data. Interestingly, the results of the exhaustive ML search suggest a rationale for the most stable configuration based on a few simple rules: the energetically most stable configurations host polarons in the surface and subsurface layer, usually placed as close as possible to the Nbdopants (preferably below or above a dopant rather than in the same atomic layer (see site distribution of favorable configurations collected in Supplementary Fig. 8(c)).
Discussion
In this paper we presented an MLaided procedure to enhance and accelerate the identification of small polaron ground state configurations in multidefect systems with varying defect concentration, by employing simple and general descriptors based on distancedependent interaction categories and a standard kernel regression fed by a DFT energy database. We tested and discussed a few alternative protocols:

(i)
A conventional train/validation/test ML protocol.

(ii)
Omitteddefect concentration model, based on extrapolating the polaron energy for a given defect concentration from the DFT energies obtained at other concentrations.

(iii)
Exhaustive ML search. An exploration of the polaron configurational space based on a guided bottomup selection of the most favorable configuration from all possible nonequivalent configurations at each given defect concentration.
We assessed the generality of the procedure by applying it to two different materials (TiO_{2}(110) and SrTiO_{3}(001)) with different types of defects (V_{O} and Nb dopants) and adopting different strategies to construct the DFT database (MD and random sampling). Our data indicates that a randomized sampling approach is superior to MDgenerated database which suffers from undesirable correlation between the MDgenerated configurations and excessive computational cost. Importantly, the combination of random sampling and exhaustive ML search results in a robust algorithm that delivers very good results, as demonstrated for SrTiO_{3}(001) where this procedure leads to a systematic improvement at all explored concentration: the exhaustive ML search finds configurations with lower 0K DFT energy as compared to those included in the input database.
While our model has been applied to the identification of polaron configurations with static dopant/vacancy patterns, it can be further extended to consider optimized configurations with mobile point defects considering other type of defects (e.g. hydrogen adatom or Ti interstitials) and other materials^{9}. In fact, the descriptor only relies on identifying polaron hosting sites with different local coordination and lattice symmetry and their relative position with respect to the surface, to structure a list of distances between polarons and defects. From this information, the descriptor structure can easily be attained for any material and only few parameters (i.e., number of included distances in each interaction category and cutoff radius) need to be determined to optimize the performance.
A final positive aspect offered by the proposed method is the arbitrary scalability with respect to the supercell size, enabling access to large scale simulations, where defect arrangements could be precisely aligned with experimental data, to determine likely polaronic configurations observed in surface imaging techniques. We note, however, that the unit cells used to train the ML model must be sufficiently large (e.g., at least as large as those used in this study, see Methods Section) in order to prevent spurious effects arising from the interaction of the charged pointdefects with their periodic image in the DFT calculations. Also, qualitative extrapolations and interpolations to defect concentrations where no data is available and predictions seem plausible in the presented test cases and could be further developed.
Methods
Density functional theory
Densityfunctional theory (DFT) and firstprinciples molecular dynamics (MD) calculations were performed using the Vienna Abinitio Simulation Package (VASP)^{53,54}. For our DFT+U calculations we adopted the generalized gradient approximation with Perdew, Burke, and Ernzerhof parametrization (PBE)^{55}, including an onsite effective U = 3.9 eV enacted on the dorbitals of Ti atoms in the case of rutile TiO_{2} (previously determined by constrained randomphase approximation (cRPA)^{44}) and U = 4.5 eV for SrTiO_{3}, here enacted on the dorbitals of Ti and Nb, in line with the cRPA value determined in previous works.^{56,57,58} Standard projectoraugmented wave (PAW) potentials^{59,60} were used for Ti, Sr and Nb atoms, while oxygen was modeled adopting softer potentials to reduce convergence requirements on the energy cutoff. We used the Γ point only for the integration in the reciprocal space, and standard convergence criteria with a planewave energy cutoff of 250 eV^{21} for rutile TiO_{2}, and 350 eV for SrTiO_{3}.
The rutile TiO_{2}(110) and SrTiO_{3}(001) surfaces were modeled by super cells, containing five stochiometric layers in large twodimensional 9 × 2 and 6 × 4 unit cells, respectively. The large lateral dimensions of the unit cells allow for a high number of possible polaron patterns, and reduce to a negligible extent the spurious interaction of charged pointdefects with their image in the calculations with periodic boundary conditions^{21}. The three surface layers and the corresponding labeling are shown in Figs. 2(a), 3(a). In both cases the bottom two stochiometric layers were kept fixed at bulk positions in order to mimic and retrieve the electronic and structural properties of the bulk. Oxygen vacancies (V_{O}) on the twofold coordinated O sites on the TiO_{2} surface, and Nb dopants replacing Ti atoms on the surface and subsurface SrTiO_{3} layers, were modeled at nine and five different concentration levels, \({c}_{{V}_{{{{\rm{O}}}}}}\) and c_{Nb} respectively. The defect positions were chosen such that interdefect distances are maximized, and the concentrations are given in percentage with respect to the number of twofold surface oxygen sites in TiO_{2} and the total number of Ti sites in SrTiO_{3}. SrTiO_{3} is known to exhibit a wide variety of surface reconstructions, and it has recently been shown that flat bulkterminated (001)surface with surface defects can be stabilized using novel cleaving procedure^{61,62}.
The polaronic localization sites were identified by inspecting the size of the local magnetic moments on Ti ions (larger than 0.5μ_{B}) and relaxations to 0 K have been performed from each distinct polaron localization pattern. In the case of rutile TiO_{2} nonequivalent polaron configurations were generated via MD at high temperature (700 K). Instead, to build a database of polaronic configurations in SrTiO_{3}, for each concentration, an appropriate number of Ti sites within the top three layers were randomly chosen to host the polarons. For localizing the excess charge in the selected Ti hosting site we employed the occupation matrix control tool^{30}. This tool allows us to constrain the electron density matrix of atoms in the cell directly, such that a polaron can be placed explicitly at the desired site, and in the desired orbital. Polaron configurations suggested by the exhaustive MLaided search were always initialized via occupation matrix control, starting from initial occupation matrices taken from the training databases. Examples of occupation matrices are collected in the Supplementary Material in Section S2.
A final note on the calculation of the DFT polaron energy \({E}_{{{{\rm{pol}}}}}^{{{{\rm{DFT}}}}}\): DFT calculations do not grant access to formation energies of individual polarons, but only to the total energy of all polarons in the unit cell. In order to compare polaron formation energies for configurations with different polaron concentration, we defined a mean polaronic energy \({\overline{E}}_{{{{\rm{pol}}}}}^{{{{\rm{DFT}}}}}={E}_{{{{\rm{pol}}}}}^{{{{\rm{DFT}}}}}/{N}_{{{{\rm{pol}}}}}\), obtained by averaging over the number of polarons N_{pol} in a given configuration. This operation allows us to compare the polaron energies obtained for different defect concentrations.
Descriptors
To design a suitable descriptor, we have exploited the fact that polaron energies are affected by two main interactions^{21}:

(i)
Electrostatic interaction between charged defects: attractive interaction between negatively charged polarons and positively charged defects and polaronpolaron repulsion, both rapidly decreases with increasing spatial separation.

(ii)
Polaron orbital symmetry, determined by the local coordination and symmetry of the hosting site as well as by the distance from the surface layer (For example, in rutile TiO_{2}(110) the distancedependence of the polaronpolaron Ti\({}_{{{{\rm{S1}}}}}^{{{{\rm{A}}}}}\)Ti\({}_{{{{\rm{S1}}}}}^{{{{\rm{A}}}}}\) interaction is different from Ti\({}_{{{{\rm{S1}}}}}^{{{{\rm{A}}}}}\)Ti\({}_{{{{\rm{S0}}}}}^{{{{\rm{A}}}}}\)^{21}).
Based on these principles, we designed a descriptor composed of a list of pairwise polaronpolaron and polarondefect interactions, defined by the corresponding interpolaron and polarondefect distances within a cutoffsphere R_{c} around each polaron (defect: V_{O} and Nb, for TiO_{2} and SrTiO_{3}, respectively). In this way, fixed portions of the descriptor vector can be assigned to specific interaction categories. An interaction category always depends on the local coordination of the hosting site (e.g., Ti\({}_{{{{\rm{S1}}}}}^{{{{\rm{A}}}}}\)) and the local coordination of its interacting partner (e.g., Ti\({}_{{{{\rm{S0}}}}}^{{{{\rm{A}}}}}\)). Within each interaction category, a fixed number of preassigned slots n can be used, containing rescaled distances according to the expression:
The rescaling allows that the vector is filled with zeros, were no interaction is present. For an exemplary description of the evaluation of a polaron descriptor see the Supplementary Methods. Mind that each descriptor corresponds to the environment of a single polaron. Therefore, the number of descriptors from the databases are 4368 and 2257 for TiO_{2} and SrTiO_{3}, respectively.
We complete this section by providing a brief description of the resulting interaction categories for TiO_{2} and SrTiO_{3}. More details can be found in Section S3 in the Supplementary Materials.
Interaction categories in TiO_{2}(110)
In rutile TiO_{2} the three topmost layers contain hosting sites with two different local orientations, for a total of 36 interaction categories (Ti\({}_{{{{\rm{Si}}}}}^{{{{\rm{X}}}}}\)Ti\({}_{{{{\rm{Sj}}}}}^{{{{\rm{Y}}}}}\), with X,Y ∈ {A, B} and i,j ∈ {0, 1, 2}). Due to the employed 9 × 2 supercell and the slightly different distancedependence of stacked and nonstacked hosting sites of same local coordination^{21}, we distinguish between stacked (A or Bsites) and non stacked (\({{{\rm{A}}}}^{\prime}\) or \({{{\rm{B}}}}^{\prime}\)sites). This adds 18 additional interaction categories. Lastly, a category for the interaction with the V_{O} is added for each differently coordinated sites (Ti\({}_{{{{\rm{Si}}}}}^{{{{\rm{X}}}}}\)V_{O}, with X ∈ {A, B} and i ∈ {0, 1, 2}), resulting in 60 possible interaction categories. Three shortest distances per interaction category and a cutoff radius of 15 Å resulted in optimal model predictions for this descriptor. The full list of interaction categories for TiO_{2} is given in the Supplementary Materials in Supplementary Table 3 and examples of interaction categories are collected in Supplementary Figs. 1, 2.
Interaction categories SrTiO_{3}(001)
Cubic and atomically flat SrTiO_{3} has a more symmetrical structure, which results in a simpler descriptor. The three topmost layers contain identically coordinated hosting sites, leading to nine different interaction categories (Ti_{Si}Ti_{Sj}, with i,j ∈ {0, 1, 2}). However, a more fine grained distinction of polarondopant interactions is necessary, since dopants can in principle lay in any layer (unlike V_{O} in rutile TiO_{2}(110)). Therefore, we consider six additional interaction categories (Ti_{Si}Nb_{Sj}, with i ∈ {0, 1, 2} and j ∈ {0, 1}), resulting in a total of 15 different interaction categories with an optimal number of four included distances per category and a R_{c} = 13 Å. A full list of interaction categories for SrTiO_{3} is given in the Supplementary Materials in Supplementary Table 4 and examples of interaction categories are collected in Supplementary Fig. 1.
Machine learning model
We have used instancebased learning in form of kernel regression^{41}, where a kernelfunction k( ⋅ , ⋅ ) (or similarity measure) of the descriptor of interest \({\textbf{x}}^{\prime}\) and all descriptors x_{i} of the training set is calculated and scalar multiplied with the optimized regression parameters α, giving a weighted mean of the target quantity based on similarity to training instances:
We determine optimal regression parameters of each kernel regressor, corresponding to a specific type of polaron, via backpropagation and gradientdescent performed on the sum of each kernel regressors prediction. We found that training the regressors with a stochastic gradient descent variant results in better extrapolation capabilities than performing an exact fit. To optimize regression parameters of the different kernel regressors on the training data, the predicted energy \({E}_{{{{\rm{pol}}}}}^{{{{\rm{ML}}}}}\) is used to calculate the loss function with respect to the target polaron energy \({E}_{{{{\rm{pol}}}}}^{{{{\rm{DFT}}}}}\) of the configuration from the training dataset . We employed an adapted mean squared error loss function J, where the error of each training sample is normalized to the number of polarons in the configuration (N_{pol,i}).
Without a normalization of the error to the number of polarons, the model tended to systematically underpredict energies at low defect concentrations, which can likely be attributed to a greater total number of polarons at high defect concentrations. More polarons result in a cumulative higher error and the model shifts towards providing a better fit for configurations with many polarons at high defect concentrations – compromising accuracy for configurations with fewer polarons. For the optimization of the loss function, an Adam optimizer^{63} has been used with a learning rate set to 0.0001, which resulted in slow yet consistent convergence of regression parameters (see Supplementary Fig. 3, also showing error convergence in dependence of training samples). It has been found that an initialization of regression parameters set to 0 resulted in a faster convergence than a random initialization. A batch size of 64 randomly chosen configurations per epoch, and a weight decay of 0.1 to enforce small regression parameters, also benefited convergence of the model to consistent performance throughout many tests. We used the Laplacian kernel
for all models and found that a hyperparameter γ between 0.1 and 0.5 leads to optimal results (see Supplementary Fig. 4). The algorithm has been implementd using NumPy^{64} and ScikitLearn^{65} for preprocessing, and PyTorch^{66} to perform the backpropagation and optimization of parameters.
Data availability
All relevant data (polaronic structures, sitespecific magnetic moments, energies, delocalized energies and cellsizes of the polaron configurations for all materials) are available within the supplementary data or at https://github.com/QuantumMaterialsModelling/PolConfML.
Code availability
Sample code for application to the proposed systems is available at the github repository https://github.com/QuantumMaterialsModelling/PolConfML.
References
Franchini, C., Reticcioli, M., Setvin, M. & Diebold, U. Polarons in materials. Nat. Rev. Mater. 560–586 (2021).
Alexandrov, A. S. & Devreese, J. T. Advances in Polaron Physics (Springer, 2010).
Landau, L. D. Über die Bewegung der Elektronen im Kristallgitter. Phys. Z. Sowjet. 664, 644–645 (1933).
Fröhlich, H., Pelzer, H. & Zienau, S. Properties of slow electrons in polar materials. Philos. Mag. 41, 221 (1950).
Coropceanu, V. et al. Charge transport in organic semiconductors. Chemical Reviews 107, 926–952 (2007).
Moser, S. et al. Tunable polaronic conduction in anatase TiO_{2}. Phys. Rev. Lett. 110, 196403 (2013).
Fratini, S. & Ciuchi, S. Dynamical meanfield theory of transport of small polarons. Phys. Rev. Lett. 91, 256403 (2003).
Deskins, N. A. & Dupuis, M. Intrinsic Hole Migration Rates in TiO_{2} from Density Functional Theory. J. Phys. Chem. C 113, 346–358 (2009).
Zhang, D., Han, Z.K., Murgida, G. E., GandugliaPirovano, M. V. & Gao, Y. Oxygenvacancy dynamics and entanglement with polaron hopping at the reduced CeO_{2}(111) surface. Phys. Rev. Lett. 122, 096101 (2019).
Mishchenko, A. S., Nagaosa, N., De Filippis, G., de Candia, A. & Cataudella, V. Mobility of holstein polaron at finite temperature: An unbiased approach. Phys. Rev. Lett. 114, 146401 (2015).
Mishchenko, A. S. et al. Polaron mobility in the “beyond quasiparticles” regime. Phys. Rev. Lett. 123, 076601 (2019).
Papageorgiou, A. C. et al. Electron traps and their effect on the surface chemistry of TiO_{2}(110). Proc. Natl. Acad. Sci. 107, 2391–2396 (2010).
Reticcioli, M. et al. Interplay between adsorbates and polarons: CO on rutile TiO_{2}(110). Phys. Rev. Lett. 122, 016805 (2019).
Reticcioli, M. et al. Polarondriven surface reconstructions. Phys. Rev. X 7, 031053 (2017).
Rousseau, R., Glezakou, V.A. & Selloni, A. Theoretical insights into the surface physics and chemistry of redoxactive oxides. Nat. Rev. Mater. https://doi.org/10.1038/s4157802001989 (2020).
Sokolović, I. et al. Resolving the adsorption of molecular O_{2} on the rutile TiO_{2}(110) surface by noncontact atomic force microscopy. Proc. Natl. Acad. Sci. 117, 14827–14837 (2020).
Mechelen, J. L. M. et al. Electronphonon interaction and charge carrier mass enhancement in SrTiO_{3}. Phys. Rev. Lett. 100, 226403 (2008).
Yoon, S. et al. Raman and optical spectroscopic studies of smalltolarge polaron crossover in the perovskite manganese oxides. Phys. Rev. B 58, 2795–2801 (1998).
Klimin, S., Tempere, J., Devreese, J. T., Franchini, C. & Kresse, G. Optical response of an interacting polaron gas in strongly polar crystals. Appl. Sci. 10, 2059 (2020).
Srimath Kandada, A. R. & Silva, C. Exciton polarons in twodimensional hybrid metalhalide perovskites. Journal Phys. Chem. Lett. 11, 3173–3184 (2020).
Reticcioli, M., Setvin, M., Schmid, M., Diebold, U. & Franchini, C. Formation and dynamics of small polarons on the rutile TiO_{2}(110) surface. Phys. Rev. B 98, 045306 (2018).
Reticcioli, M., Diebold, U., Kresse, G. & Franchini, C. Small Polarons in Transition Metal Oxides, 1–39 (Springer International Publishing, Cham, 2019).
Shluger, A. L. & Stoneham, A. M. Small polarons in real crystals: concepts and problems. J. Phys.: Condensed Matt. 5, 3049–3086 (1993).
Sio, W. H., Verdi, C., Poncé, S. & Giustino, F. Polarons from first principles, without supercells. Phys. Rev. Lett. 122, 246403 (2019).
Sio, W. H., Verdi, C., Poncé, S. & Giustino, F. Ab initio theory of polarons: Formalism and applications. Phys. Rev. B 99, 235139 (2019).
Goyal, A., Gorai, P., Peng, H., Lany, S. & Stevanović, V. A computational framework for automation of point defect calculations. Computational Mater. Sci. 130, 1–9 (2017).
Makov, G. & Payne, M. C. Periodic boundary conditions in ab initio calculations. Phys. Rev. B 51, 4014–4022 (1995).
Freysoldt, C., Neugebauer, J. & Van de Walle, C. G. Fully ab initio finitesize corrections for chargeddefect supercell calculations. Phys. Rev. Lett. 102, 016402 (2009).
Kowalski, P. M., Camellone, M. F., Nair, N. N., Meyer, B. & Marx, D. Charge localization dynamics induced by oxygen vacancies on the TiO_{2}(110) surface. Phys. Rev. Lett. 105, 146405 (2010).
Allen, J. P. & Watson, G. W. Occupation matrix control of d and felectron localisations using DFT+U. Phys. Chem. Chem. Phys. 16, 21016–21031 (2014).
Pham, T. D. & Deskins, N. A. Efficient method for modeling polarons using electronic structure methods. J. Chem. Theory Comput. 16, 5264–5278 (2020).
Schmidt, J., Marques, M. R. G., Botti, S. & Marques, M. A. L. Recent advances and applications of machine learning in solidstate materials science. npj Comput. Mater. 5, 1–36 (2019).
von Lilienfeld, O. A. & Burke, K. Retrospective on a decade of machine learning for chemical discovery. Nat. Commun. 11, 4895 (2020).
Ramprasad, R., Batra, R., Pilania, G., MannodiKanakkithodi, A. & Kim, C. Machine learning in materials informatics: recent applications and prospects. npj Comput. Mater. 3, 1–13 (2017).
Behler, J., Lorenz, S. & Reuter, K. Representing moleculesurface interactions with symmetryadapted neural networks. J. Chem. Phys. 127, 014705 (2007).
Jinnouchi, R., Karsai, F. & Kresse, G. Onthefly machine learning force field generation: Application to melting points. Physical Rev. B 100, 014105 (2019).
Bartók, A. P., Payne, M. C., Kondor, R. & Csányi, G. Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons. Physical Rev. Lett. 104, 136403 (2010).
Ko, T. W., Finkler, J. A., Goedecker, S. & Behler, J. A fourthgeneration highdimensional neural network potential with accurate electrostatics including nonlocal charge transfer. Nat. Commun. 12, 398 (2021).
Unke, O. T. et al. SpookyNet: Learning force fields with electronic degrees of freedom and nonlocal effects. Nat. Commun. 12, 7273 (2021).
Eckhoff, M., Lausch, K. N., Blöchl, P. E. & Behler, J. Predicting oxidation and spin states by highdimensional neural networks: Applications to lithium manganese oxide spinels. The Journal of Chemical Physics 153, 164107 (2020).
Bishop, C. M. Pattern Recognition and Machine Learning. Information Science and Statistics (SpringerVerlag, New York, 2006).
Schütt, K. T. et al. (eds.) Machine Learning Meets Quantum Physics. Lecture Notes in Physics (Springer International Publishing, 2020).
Diebold, U. The surface science of titanium dioxide. Surface Science Reports 48, 53–229 (2003).
Setvin, M. et al. Direct view at excess electrons in TiO_{2} rutile and anatase. Phys. Rev. Lett. 113, 086402 (2014).
Deskins, N. A. & Dupuis, M. Electron transport via polaron hopping in bulk TiO_{2}: A density functional theory characterization. Phys. Rev. B 75, 195212 (2007).
Moses, P. G., Janotti, A., Franchini, C., Kresse, G. & Van De Walle, C. G. Donor defects and small polarons on the TiO_{2}(110) surface. J. Appl. Phys. 119, 181503 (2016).
Klyukin, K. & Alexandrov, V. Effect of intrinsic point defects on ferroelectric polarization behavior of SrTiO_{3}. Phys. Rev. B 95, 035301 (2017).
Janotti, A., Varley, J. B., Choi, M. & Van de Walle, C. G. Vacancies and small polarons in SrTiO_{3}. Phys. Rev. B 90, 085202 (2014).
Eglitis, R. I. Ab initio calculations of SrTiO_{3}, BaTiO_{3}, PbTiO_{3}, CaTiO_{3}, SrZrO_{3}, PbZrO_{3} and BaZrO_{3} (001), (011) and (111) surfaces as well as F centers, polarons, KTN solid solutions and Nb impurities therein. Int. J. Modern Phys. B 28, 1430009 (2014).
Onishi, H. & Iwasawa, Y. Reconstruction of TiO_{2}(110) surface: STM study with atomicscale resolution. Surface Sci. 313, L783–L789 (1994).
Krüger, P. et al. Defect States at the TiO_{2}(110) Surface probed by resonant photoelectron diffraction. Phys. Rev. Lett. 100, 055501 (2008).
Shibuya, T., Yasuoka, K., Mirbt, S. & Sanyal, B. Subsurface polaron concentration as a factor in the chemistry of reduced TiO_{2} (110) surfaces. J. Phys. Chem. C 121, 11325–11334 (2017).
Kresse, G. & Furthmüller, J. Efficiency of abinitio total energy calculations for metals and semiconductors using a planewave basis set. Comput. Mater. Sci. 6, 15–50 (1996).
Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio totalenergy calculations using a planewave basis set. Phys. Rev. B 54, 11169–11186 (1996).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
Hao, X., Wang, Z., Schmid, M., Diebold, U. & Franchini, C. Coexistence of trapped and free excess electrons in SrTiO_{3}. Phys. Rev. B 91, 085204 (2015).
Hou, Z. & Terakura, K. Defect states induced by oxygen vacancies in cubic SrTiO_{3}: Firstprinciples calculations. J. Phys. Soc. Japan 79, 114704 (2010).
Choi, M., Oba, F., Kumagai, Y. & Tanaka, I. Antiferrodistortivelike oxygenoctahedron rotation induced by the oxygen vacancy in cubic SrTiO_{3}. Adv. Mater. 25, 86–90 (2013).
Blöchl, P. E. Projector augmentedwave method. Phys. Rev. B 50, 17953–17979 (1994).
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmentedwave method. Phys. Rev. B 59, 1758–1775 (1999).
Sokolović, I. et al. Quest for a pristine unreconstructed SrTiO_{3}(001) surface: An atomically resolved study via noncontact atomic force microscopy. Phys. Rev. B 103, L241406 (2021).
Sokolović, I., Schmid, M., Diebold, U. & Setvin, M. Incipient ferroelectricity: A route towards bulkterminated SrTiO_{3}. Phys. Rev. Mater. 3, 034407 (2019).
Kingma, D. P. & Ba, J. Adam: A Method for Stochastic Optimization. (2014).
van der Walt, S., Colbert, S. C. & Varoquaux, G. The NumPy Array: A structure for efficient numerical computation. Comput. Sci. Eng. 13, 22–30 (2011).
Pedregosa, F. et al. Scikitlearn: Machine Learning in Python. J. Machine Learning Res. 12, 2825–2830 (2011).
Paszke, A. et al. Pytorch: An imperative style, highperformance deep learning library. In Wallach, H. et al. (eds.) Advances in Neural Information Processing Systems 32, 8024–8035 (Curran Associates, Inc., 2019).
Acknowledgements
This work was supported by the Austrian Science Fund (FWF) project POLOX (Grant No. I 2460N36), project Super (Grant No. P 32148N36) and the SFBF81 project TACO. The computational results have been achieved using the Vienna Scientific Cluster (VSC).
Author information
Authors and Affiliations
Contributions
C.F. conceptualized the research. V.B. has constructed and extended the model and executed all calculations on TiO_{2} and F.E. performed all calculations on SrTiO_{3}, supervised by M.R. & C.F. C.F. & V.B. wrote the first draft. All authors contributed to a substantial discussion of the results and a critical reading of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Birschitzky, V.C., Ellinger, F., Diebold, U. et al. Machine learning for exploring small polaron configurational space. npj Comput Mater 8, 125 (2022). https://doi.org/10.1038/s41524022008058
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41524022008058