Abstract
The selection of the elements to combine delimits the possible outcomes of synthetic chemistry because it determines the range of compositions and structures, and thus properties, that can arise. For example, in the solid state, the elemental components of a phase field will determine the likelihood of finding a new crystalline material. Researchers make these choices based on their understanding of chemical structure and bonding. Extensive data are available on those element combinations that produce synthetically isolable materials, but it is difficult to assimilate the scale of this information to guide selection from the diversity of potential new chemistries. Here, we show that unsupervised machine learning captures the complex patterns of similarity between element combinations that afford reported crystalline inorganic materials. This model guides prioritisation of quaternary phase fields containing two anions for synthetic exploration to identify lithium solid electrolytes in a collaborative workflow that leads to the discovery of Li3.3SnS3.3Cl0.7. The interstitial site occupancy combination in this defect stuffed wurtzite enables a low-barrier ion transport pathway in hexagonal close-packing.
Similar content being viewed by others
Introduction
Researchers select new chemistries to investigate based on hypotheses generated from their understanding, for example, in choosing specific combinations of elements (referred to as phase fields hereafter) to explore to synthesise new materials. Here it is the choice of the chemical elements defining the phase field that is decisive in determining the outcomes of the study, as this delimits the attainable compositions and structures: clearly, subsequent decisions are critically important but the bounds are set by the choice of a phase field through selection of a subset of elements from those available in the periodic table. In solid-state materials chemistry, information on stable crystalline compounds is available at scale (>200,000 entries in the Inorganic Crystal Structure Database, ICSD)1. The factors underlying the stability, whether kinetic or thermodynamic, of these compounds are many and complex, reflecting the diverse forms of bonding interaction between their constituent elements. It is, however; difficult for researchers to hold thousands of such prior examples in mind whilst deciding which of the large numbers of unexplored phase fields to study2. Synthetic exploration of phase fields yields new structures and materials compositions that drive condensed matter science3,4,5,6. We aggregate the information in ICSD on reported phases to define those phase fields that contain synthetically isolated compounds, rather than individual materials compositions, and thus guide element selection for synthesis.
There has been a surge of machine learning (ML) studies that aim to extract the underlying patterns of chemistry available from ICSD7,8,9,10, based on the knowledge of the composition and structure at the level of an individual material. These statistical methods represent materials as multidimensional vectors (feature vectors). Significant progress has been achieved in deriving the required features1,11. Both the supervised and unsupervised learning strategies have been successfully applied to a wide variety of structure-property relationships in chemical sciences. Supervised learning8,12,13,14 infers relationships between materials’ features and materials’ properties, and requires large hand-labelled datasets for training. Unsupervised ML, on the other hand, infers underlying patterns of chemical knowledge in the absence of human-labelled data15,16. We report a collaborative ML-human expert workflow, in which unsupervised learning addresses the combinatorial problem of the discovery of new materials in the high-level formulation required by synthetic chemists to recognize the patterns at the level of element combinations that define those phase fields known to contain synthetically isolated crystalline compounds. This goes beyond the traditional focus at the level of individual materials to support the decisions made in identifying new chemistries to explore. We present a neural network model that tackles this distinct prediction task to allow prioritisation of new phase fields for investigation according to the extent to which their particular element combinations reflect those chemistries that lead to phase stability in reported materials. We use this model to rank unexplored quaternary two anion phase fields for experimental investigation as lithium-ion conductor candidates, noting the relative under-representation of multiple anion compounds as a class of crystalline materials17,18 and their importance, notwithstanding their overall rarity, as solid electrolytes that arises from a range of properties19,20. Machine learning-assisted researcher assessment of the candidate element combinations then identified the Li-Sn-S-Cl field. Probe structure computation21,22 targeted a region within this field for synthetic exploration that afforded the defect stuffed wurtzite Li3.3SnS3.3Cl0.7. Structural and dynamical analysis of this phase demonstrated a new pathway for lithium transport in hexagonal close-packing (hcp).
Results and discussion
Machine learning (ML) models are powerful tools to study multivariate correlations that exist within large datasets but are hard for humans to identify16,23. Our aim is to build a model that captures the chemical interactions between the element combinations that afford reported crystalline inorganic materials, noting that the aim of such models is efficacy rather than interpretability, and that as such they can be complementary guides to human experts. The model should assist expert prioritization between the promising element combinations by ranking them quantitatively. Researchers have practically understood how to identify new chemistries based on element combinations for phase-field exploration, but not at significant scale. However, the prioritization of these attractive knowledge-based choices for experimental and computational investigation is critical as it determines substantial resource commitment. The collaborative ML workflow24,25 developed here includes a ML tool trained across all available data at a scale beyond that, which humans can assimilate simultaneously to provide numerical ranking of the likelihood of identifying new phases in the selected chemistries. We illustrate the predictive power of ML in this workflow in the discovery of a new solid-state Li-ion conductor from unexplored quaternary phase fields with two anions. To train a model to assist prioritization of these candidate phase fields, we extracted 2021 MxM′yAzA′t phases reported in ICSD (Fig. 1, Step 1), and associated each phase with the phase fields M-M′-A-A′ where M, M′ span all cations, A, A′ are anions {N3−, P3−, As3−, O2−, S2−, Se2−, Te2−, F−, Cl−, Br−, and I−} and x, y, z, t denote concentrations (Fig. 1, Step 2). Data were augmented by 24-fold elemental permutations to enhance learning and prevent overfitting (Supplementary Fig. 2).
ML models rely on using appropriate features (often called descriptors)26 to describe the data presented, so feature selection is critical to the quality of the model. The challenge of selecting the best set of features among the multitude available for the chemical elements (e.g., atomic weight, valence, ionic radius, etc.)26 lies in balancing competing considerations: a small number of features usually makes learning more robust, while limiting the predictive power of resulting models, large numbers of features tend to make models more descriptive and discriminating while increasing the risk of overfitting. We evaluated 40 individual features26,27 (Supplementary Fig. 4, 5) that have reported values for all elements and identify a set of 37 elemental features that best balance these considerations. We thus describe each phase field of four elements as a vector in a 148-dimensional feature space (37 features × 4 elements = 148 dimensions).
To infer relationships between entries in such a high-dimensional feature space in which the training data are necessarily sparsely distributed28, we employ the variational autoencoder (VAE), an unsupervised neural network-based dimensionality reduction method (Fig. 1, Step 3), which quantifies nonlinear similarities in high-dimensional unlabelled data29 and, in addition to the conventional autoencoder, pays close attention to the distribution of the data features in multidimensional space. A VAE is a two-part neural network, where one part is used to compress (encode) the input vectors into a lower-dimensional (latent) space, and the other to decode vectors in latent space back into the original high-dimensional space. Here we choose to encode the 148-dimensional input feature space into a four-dimensional latent feature space (Supplementary Methods). The VAE model is trained to minimize the Euclidean distances (reconstruction errors) between the original and decoded vectors in the 148-dimensional feature space (Fig. 1, Step 4). In addition, VAEs avoid overfitting by putting additional constraints arising from changes in feature distribution in the latent space to make it better organized, e.g., by favoring similar decoding for nearby vectors in the latent space. In this way, VAEs ensure that the lower-dimensional vectors in latent space capture the necessary information to properly describe and discriminate between entries in the training set. Because the training set in our case includes only phase fields where quaternary compounds are experimentally reported in the ICSD, we expect our model to learn the encoding biased to those phase fields that contain synthetically accessible quaternary compounds. The reconstruction error then captures the degree to which data deviate from the learned model, and therefore can be used to rank new input data entries by their similarity to general trends in the training data30,31; in our case, between the candidate phase fields and those phase fields that contain compounds reported in ICSD. Similarly to one-class classifiers, the VAE is trained on positives, hence it ranks based on the likelihood of positive outcomes, rather than the absence of negative outcomes. This is appropriate for the materials discovery problem, as we can only make positive statements with certainty about observed compounds, while the absence of reported compounds in a particular phase field might simply reflect the synthetic pathways explored to date: there are only strong positive examples available. Validation of the VAE thus combines statistical evaluation with a qualitative assessment of the tool based on its role in supporting the successful exploration of new chemistries.
In the second stage (Stage II, Fig. 1), the VAE model built in Stage I is used to quantify the likelihood that selected unexplored quaternary two anion phase fields contain stable compounds. Focusing the search on new Li-ion conductors, the choice of anions and cations can be narrowed down by their physical characteristics. For example, we avoid transition metals, which may introduce undesired redox properties, as well as toxic or particularly scarce elements. Hence, we select the candidate phase fields Li-M-A-A′, where M = {B, Mg, Al, Si, P, K, Ca, Zn, Sr, Y, Zr, Sn, Ba, Ta, La} represent cations, A and A′ represent anions, {N, O, S, F, Cl, Br, I}, forming 303 unexplored phase fields in total. This constitutes a set of candidate chemistries that are attractive to researchers because they respect the criteria used to select them but which require prioritization for investigation. The VAE is trained to rank these candidates by their reconstruction error (RE - the key metric for autoencoders across a variety of applications31,32,33,34) and thus provides quantitative guidance for consideration by the researcher in the choice between chemistries that are qualitatively of interest. A full list of the ranked candidate phase fields is given in Supplementary Table 3. This list suggests the most promising candidates with a high likelihood of forming isolated compounds, whose detailed theoretical and experimental future study should be prioritized.
The distribution of normalized reconstruction errors (Fig. 2a) illustrates that the model is able to reconstruct efficiently most of the reported phase fields, as 79.5% have a normalized RE below 0.5. To validate the model, we hold out five different validation datasets (each consisting of 20% of the available data from ICSD), while using the remaining 80% of data to train five different VAE models. We then assess how the validation datasets are reconstructed using the corresponding VAEs. Using this method, on average, 79.8% of the phase fields in the validation datasets have normalized reconstruction errors below 0.5. In addition to conventional validation of unsupervised methods, we also ascertain that the distributions of RE persist across the validation sets, ICSD training data, and the testing data of unexplored phase fields (Supplementary Fig. 3).
The resulting ranking of unexplored elemental combinations allows us to narrow down the search to the most promising phase fields, justifying subsequent application of computationally- and experimentally expensive research techniques. To evaluate the model, we focus on the cluster of data with the smallest reconstruction errors in the test dataset (highlighted with dark orange in Fig. 2a). Among these promising phase fields, a new composition was discovered recently (not included in our training data) in the Li-P-S-O phase field35 (ranked #8). Three further examples of the realization of highly ranked chemistries are recently available36,37 (Supplementary Fig. 18-20). Among the top five unexplored phase fields, the highest Li-ion conductivity for ternary phases underlying the candidate quaternaries was reported for Li0.8Sn0.8S238 (Supplementary Table 4). This numerical ranking guided our choice of the Li-Sn-S-Cl field (ranked #5, Supplementary Table 3) for experimental investigation.
The VAE quantifies understanding of which element combinations are likely to lead to new isolated compounds, and reflects that understanding well enough to build researcher trust in the resulting ranking through the viability of the highly scored element combinations. This trust is essential to allow confident use of the tool in choosing within the attractive candidate set of 303 phase fields. The numerical ranking of chemistry by the VAE can be used with other numerical criteria (in this case the Li-ion conductivity of ternary phases underlying the candidate quaternaries) to support researcher decisions. The selection of the Li-Sn-S-Cl field from multiple attractive, viable candidate chemistries is a typical decision required in synthetic materials chemistry.
We identified the optimal regions within the selected quaternary for synthetic exploration with high-throughput probe structure prediction21,22. This computes energies for structural models with cell sizes up to 13 × 13 × 13 Å3 aiming to capture the accessible bonding motifs at the chosen composition, thus approximating the lowest accessible energy at that composition and highlighting regions of chemical space where previously unreported phases are likely to be found. We employ Crystal Structure Prediction (CSP) for 244 different compositions in the Li-Sn-S-Cl phase field (cyan tripods in Fig. 2b) with the complementary algorithms of basin-hopping ChemDASH22,39, which explores the energy landscape, with starting ionic configurations based on hcp and rhombohedral lattices and evolutionary XtalOpt40 exploring random and mutated (hybrid) ionic configurations drawn from a population of structures, spanning in total up to 1200 structures for each composition to maximize coverage of possible ionic arrangements. Both methods are coupled with VASP41 for energy computation at the DFT level of accuracy (cf. Methods). We use these energies and those of reported compounds to calculate the convex hull8, which defines thermodynamic stability at 0 K (Fig. 2b). Regions close (≤50 meV per atom) to the convex hull suggest a range of potentially stable compositions for experimental synthesis. This includes a composition with an energy on the convex hull, Li3SnS3Cl (Fig. 2b, Composition #2), i.e., predicted independently by both CSP methods to be thermodynamically stable at 0 K. Its composition, and its predicted wurtzite-related structure, with one of the two tetrahedral sites selectively occupied and anion order around the distinct cations (Fig. 2c), are closely related to the experimentally discovered phase that is discussed herein.
We sample the experimental points from across the phase field, while favoring compositions with a predicted low formation energy, aligned with the motivation of the workflow to focus and thus minimize synthetic work needed to identify new phases. Maximizing the coverage with a minimum number of samples, we choose the lowest energy point for synthesis from each sector of the phase field on a 3 × 3 grid (Fig. 2b), discarding compositions with low lithium content (<10 mol%). This leads to the selection of six samples (#1-6, circled tripods in Figure 2b), which we synthesized via a high-temperature solid-state route in evacuated quartz ampoules and identified the resulting phases with X-ray diffraction (Supplementary Table 5).
For two out of the six samples (magenta circles on Fig. 2b), there is a set of Bragg reflections that does not correspond to any reported phase (Supplementary Fig. 6a, b), which we associate with a new phase denoted Phase A. From the Rietveld refinements of these two samples, we derive the relative weight fractions that are used to determine a subsequent composition close to that of Phase A (Supplementary Methods). Graphically, this corresponds to the intersection of the two yellow lines in Fig. 2b—a composition close to Li3.6SnS3.6Cl0.4 (Sample #7). After the reaction, this sample revealed the presence of Phase A with a single impurity of orthorhombic Li4SnS4 (Supplementary Fig. 6c) indicating that Phase A lies on the line: Li4-xSnS4-xClx, 0.4 < x < 1 (magenta line, Fig. 2b). Samples were made along the line using the same synthetic procedure (Supplementary Methods). For x = 0.7, no impurities were detected by laboratory XRD, therefore, the composition of Phase A was identified as Li3.3SnS3.3Cl0.7 (magenta cross, Fig. 2b). It is important to note that, as neither element ratios nor cell sizes are bounded within a phase field, it is not possible to target the exact composition for synthesis. This reinforces the importance, as the first step of the workflow, of the quantitative assessment of the selection of the phase field itself as the overall determinant of whether materials are subsequently isolated, and thus the aggregation of individual phases at the level of phase fields for model construction: identification of element combinations for study at the level of the periodic table is thus the problem addressed by the VAE model. We determine the structure of Phase A through combined Rietveld refinement of powder Synchrotron X-Ray Diffraction (SXRD) and Neutron Powder Diffraction (NPD) data (Fig. 3a–c). The reflections corresponding to Phase A were indexed to a hexagonal unit cell with lattice parameters a = 3.9723(3) Å and c = 6.3524(6) Å. Systematic absences gave the extinction symbol P – – c with the refinement of both the datasets demonstrating site occupancies within the hcp anion array consistent only with the P63mc space group (Supplementary Discussion).
The diffraction-derived average structure is a defect stuffed wurtzite, where Cl and S atoms are randomly distributed on the fully occupied anionic site and the Sn and Li atoms on the fully occupied tetrahedral T+ interstices, with the T− sites empty, while the remaining 0.3 lithium atoms per formula unit are located on the octahedral site, O (Fig. 3e). Refinement of the site occupancy factors (sof) reveals full occupancy of the anionic and the tetrahedral sites with S/Cl and Sn/Li atomic ratios of 0.823(8)/0.175(8) and 0.242(3)/0.760(3), respectively, in agreement with the expected composition. Room temperature 6Li Magic Angle Spinning (MAS) Nuclear Magnetic Resonance (NMR) reveals the presence of one main deshielded signal centered at 1.4 ppm (Fig. 3d), in agreement with the majority of Li occupying the tetrahedral interstices (Fig. 3f - octahedral sites are located near 0 ppm)42, as well as a small shoulder at 1.8 ppm attributed to a small amount (∼10 mol%) of orthorhombic Li4SnS443. Interestingly, the computed probe structure for the closely related composition Li3SnS3Cl, which lies on the convex hull and thus reflects potential thermodynamic stability, also shows a wurtzite-type structure, with S2− and Cl− anions packed in an hcp manner, Li+ and Sn4+ occupying T+ interstices only, and both anions and cations ordering over these sites in a 3:1 arrangement (Fig. 2c). Here Sn4+ is purely co-ordinated by sulfide while Li+ forms both LiS2Cl2 (one third) and LiS3Cl tetrahedra (two-thirds).
In the experimental average structure, there is no cation or anion site ordering over diffraction length scales, but there is evidence that local anion order influences the cation coordination environments, because different average positions for Li1 and Sn on the single T+ site are refined from the combined diffraction data. The Li1 position is slightly displaced towards the base of the tetrahedron compared to Sn, which presents a much more regular coordination environment (Fig. 3f, Supplementary Table 7, 8) that can be associated with a predominant S-only single anion first coordination sphere driven by the higher formal cation charge and consistent with the anion ordering at the distinct cation sites in the probe structure of Li3SnS3Cl (Fig. 2c). The less regular geometry around Li1 would then be associated with the presence of both S and Cl anions in its first coordination sphere, as seen in the computed probe structure, driven by the lower formal charges of Li and Cl.
The Fourier difference map (Supplementary Fig. 7) enabled us to identify the second Li site (Li2), within the O interstice, and the refinement of the sof showed a non-negligible occupancy of 0.092(8) (Fig. 3g). This model led to an excellent fit to both datasets (Fig. 3a–c) and accounted for the remaining Li atoms, with a refined phase composition of Li3.41(4)SnS3.29(3)Cl0.70(3) (Supplementary Discussion), close to the measured bulk composition Li3.305(14)Sn1.000(9)S3.317(44)Cl0.6269(8) as determined by ICP-AES and supported by WDX analysis (Supplementary Discussion, Supplementary Fig. 8). The measured bulk composition is consistent with the presence of only S and Cl as anion-forming elements, determined by ICP-AES and WDX, and with the nominal composition Li3.3SnS3.3Cl0.7, which we use throughout the paper on the basis of these bulk and phase analyses.
Li3.3SnS3.3Cl0.7 is compositionally related to the known material Li4SnS4, which has been reported to exhibit crystalline dimorphism, with both the structures consisting of hcp-type packing of the S2− anions43,44 but it has a quite different structure. The thermodynamically stable Li4SnS4 phase crystallizes in an orthorhombic space group with a γ-Li3PO4 structure type43, while the low-temperature phase (prepared by mechanosynthesis or wet chemistry) is metastable and transforms irreversibly into the orthorhombic phase when heated above 350 °C44,45. Low-temperature Li4SnS4 has regular hcp S2− packing in space group P63/mmc but is not a wurtzite, as the Sn cations partially occupy both the T+ and the T− sites in the hcp anion array i.e., it is quite unlike Li3.3SnS3.3Cl0.7, where only one of the two sets of tetrahedral sites are occupied, lowering the average structure symmetry to P63mc. Cationic substitution of As and Sb for Sn in stable Li4SnS4 were reported previously, and shown to maintain the orthorhombic structure44,45. Comparing these quaternary chemistries that emerge from the Li-Sn-S ternary, Cl is a more sustainable element than As or Sb46, which represent cation substitution into a single anion material, and the resulting two anion chemistry leads to a well-defined original structure. The distinction between the single and multiple anion chemistry is demonstrated by the different structures accessed. The new two anion structure represents a platform for subsequent cation substitution.
In Li3.3SnS3.3Cl0.7, the introduction of the second anion, Cl−, stabilized the high symmetry hexagonal sulfide packing to permit high synthesis temperatures that afforded sufficient crystallinity (coherent scattering domain of 200(10) nm, Supplementary Methods) to locate and refine Li positions and occupancies, producing the first full structural model of a Li-ion conducting defect stuffed wurtzite.
Here, the use of the VAE and CSP tools in the collaborative workflow enabled targeting the exploration of an entire phase field that resulted in the discovery of a new phase. The isolation of such a phase of narrow compositional width with a specific and high content of the second anion would not be facilitated by low-level doping of the known Li4SnS4 structure, which is the most common approach. Although it is not possible to prove exhaustively that other cations would not also lead to the formation of this new phase, because the VAE is trained only on positive examples, the successful outcome supports the practical use of the VAE in assisting selection of element combinations for synthetic exploration, and demonstrates the consequential nature of the choice of the Li-Sn-S-Cl phase field from the other candidates in the test set that also offer attractive chemistries.
We measure lithium-ion conductivity on a pressed pellet by AC-impedance spectroscopy (ACIS). The Nyquist plot shows two overlapping semicircles in the high-frequency region (Fig. 4a) and a Warburg tail in the low-frequency region (inset, Fig. 4a), indicating that the material is a typical ion conductor. The bulk and grain boundary conductivities are extracted from fitting Nyquist plots with an equivalent circuit (Fig. 4a, Supplementary Discussion). The electronic conductivity is determined by DC polarization measurement (Supplementary Discussion, Supplementary Fig. 10). At room temperature, the total, bulk, and electronic conductivity are σtot = 2.4(1) × 10−6 S cm−1, σbulk = 3.2(3) × 10−5 S cm−1, and σe = 1.3(6) × 10−8 S cm−1, respectively. The electronic contribution to the conductivity is negligible, and accounts for only around 1% of the total conductivity. The temperature dependence of σbulk followed an Arrhenius behavior (Fig. 4d, blue curve), with an activation energy, Ea, of 0.38(3) eV. The bulk conductivity of Li3.3SnS3.3Cl0.7 is slightly lower than the reported total conductivities of orthorhombic Li4SnS4 (σtot = 7.0 × 10−5 S cm−1)43 and hexagonal Li4SnS4 (σtot = 1.1 × 10−4 S cm−1)44. Moreover, galvanostatic Li plating/stripping over 400 h in a Li | Li3.3SnS3.3Cl0.7 | Li symmetric cell and the ex situ structural characterization confirm the back-and-forth Li+ transport through the Li3.3SnS3.3Cl0.7 solid electrolyte, as well as across the Li3.3SnS3.3Cl0.7 | Li0 interface (Fig. 4e, Supplementary Discussion, Supplementary Fig. 12), and good chemical compatibility between the Li3.3SnS3.3Cl0.7 solid electrolyte and Li metal (Supplementary Fig. 13, 14). An initial slight increase in plating/stripping overpotentials in the first 25 cycles (50 h) suggests the evolution of a growing interphase layer (Supplementary Discussion, Supplementary Fig. 12). Thereafter, no further overpotential increase is observed (even at higher temperatures), indicating a suppression of further interfacial reactions. Ex situ XRD patterns (Supplementary Fig. 13) and Raman spectra (Supplementary Fig. 14) of the cycled bulk pellet show the starting structure of Li3.3SnS3.3Cl0.7 is well maintained after 400 h cycling. This result is in contrast with the chemical incompatibility between Sb- and As-doped Li4SnS4 single anion materials and Li metal46,47. The new phase highlights an effective route to enhance the interfacial stability of a solid electrolyte against Li by tailoring the chemical composition and structure with the use of two anions48. In general, the bulk stability of the quaternary ionic conductor as studied here against lithium metal depends on the formation energy of its corresponding decomposition products49. An unfavorable decomposition energy is thus expected for the current Li3.3SnS3.3Cl0.7, which is enabled by the introduction of the second anion. The interfacial stability against lithium may arise from the formation of some initial decomposition species (Supplementary Fig. 14) that are thermodynamically stable with lithium metal (such as LiCl) and electronically insulating, which kinetically inhibit a further decomposition of the solid electrolyte.
We record the 7Li NMR spectra and spin-lattice relaxation rates (SLR) of Li3.3SnS3.3Cl0.7 to provide further insight into the dynamics on the MHz and kHz timescale (Methods, Supplementary Discussion). The temperature dependence of the full width at half maximum of the 7Li static NMR spectra (Fig. 4b) reveals line narrowing starting at around 200 K upon heating, and hence an increase in Li motion at this temperature. The inflection point of this temperature-dependent line width defines the Li+ ion jump rate, τ−1, which is of the order of the line width in the rigid lattice regime (∼5 kHz), yielding a value of ∼3.3(2) × 104 s−1 at T ~250 K. The SLR rates are purely induced by diffusion processes here and their increase and decrease in the rotating frame (T1ρ−1) with increasing temperature are characteristic of the slow and fast motion regime, respectively49. Maxima in Fig. 4c are observed when Li+ jump rates, τ−1, are on the order of the probe spin-lock frequency ω1 and obey the relationship 2ω1 ≈ τc−1 (where τc−1 is the correlation rate of the Li motion and is essentially the average τ−1)50 giving jump rate values of the order of 1.8 × 105 − 1 × 106 s−1 in the 330–355 K temperature range.
We then estimate the NMR conductivity σNMR from combined Nernst-Einstein and Einstein-Smoluchowski equations (Supplementary Equation 5) and extract an NMR activation barrier of 0.23(5) eV for Li-ion diffusion (Fig. 4d). This value is lower than that determined by ACIS as NMR spectroscopy determines the barrier of diffusion of Li to its neighboring site, whereas ACIS probes longer-range translational diffusion. The extrapolated conductivity value at 303 K is 2.6(7) × 10−5 S cm−1, in good agreement with the bulk conductivity value of 3.2(3) × 10−5 S cm−1 obtained from ACIS.
The frequency dependence of the NMR SLR rates (Fig. 4c) also provides insight into the dimensionality of the Li+ diffusion processes. The T1ρ−1 values obtained are dependent on the probe frequency ω1 and hence eliminate the possibility of three-dimensional (3D) diffusion51,52. Furthermore, plots of T1ρ−1 against (τ/ω1)0.5 and τln(1/ω1τ) that are characteristic (Supplementary Discussion) of one- (1D) and two-dimensional (2D) diffusion processes, respectively51,52, are shown in Fig. 5a for data obtained at 425 K (Supplementary Fig. 17 shows data at other temperatures) and indicate 1D or 2D Li+ diffusion.
Analysis of the periodic distribution of the scattering density provides further important experimental evidence for possible diffusion pathways and their dimensionality50,53. Figure 5b shows the nuclear density obtained by the maximum entropy method (MEM). Three potential diffusion pathways are highlighted: i) Path 1 (O–O): 1D diffusion from one Li2 octahedron to another through their common face along the c axis (green arrows, Fig. 5b, c), ii) Path 2 (T+ –T−–T+): 3D diffusion from the Li1 (T+) tetrahedra to one of the vacant T− tetrahedra of the same layer via their common edge (3 possibilities spanning the whole (ab) plane, grey arrows on Fig. 5b), followed by diffusion to the T+ site in the layer above through their common face (along c, grey arrows Fig. 5b, c) and iii) Path 3 (T+-O-T+): 2D diffusion in the sulfide slab from one Li1 tetrahedron to another by passing through the edge of the T+ base, via the octahedral interstice and again through the edge of the adjacent Li1 tetrahedron (blue arrows, Fig. 5b,e).
The nuclear density along each diffusion pathway is linked to an energy scale (the One Particle Potential, OPP) via the Boltzmann distribution54,55, ρ ∼ρ0 × e(−OPP/kT) (Fig. 5d, f). In this approximation, each ion is treated as individual Einstein oscillator, considered non-deformable by bonding, and any additional defect formation energy is neglected. OPP values are not quantitative and may not be related to other techniques, but qualitative and relative comparison within the same phase is valid and has been demonstrated56. Path 3 shows much higher energy barriers compared to Path 1 and 2 indicating that 2D diffusion is not dominant. As NMR also dismisses possible 3D conductivity, Path 2 with its three-dimensional nature can also be discarded. This is further reinforced by the absence of vacancies on the Li1 site preventing direct Li1-Li1 jumps as described in Paths 2 and 3. By combining NMR and MEM data, we demonstrate 1D diffusion in the defect stuffed mixed anion wurtzite, involving Li2-Li2 jumps between the partially occupied O sites along the c axis described by Path 1. O–O was indeed identified as the lower energy barrier pathway in a model sulfide hcp lattice57. However, near the wurtzite structure, as O is a nonstable site, it is often left vacant when composition allows, and the limiting mechanism is defined by diffusion through T–O shared faces with higher activation energy, therefore hindering conductivity. By using two anions to stabilize Li cation excess with both O and T sites occupied, we show that defect stuffed wurtzites should be considered as promising Li ionic conductor candidates by locating the Li cations and resolving their conduction pathways. The cation stuffing onto the O site opens the lower energy barrier O–O diffusion pathway as the main limiting transport mechanism, in contrast to tetrahedral-only materials. This is consistent with the lower activation energy of Li3.3SnS3.3Cl0.7 (Ea = 0.38(3) eV) compared to non-stuffed wurtzite Li(BH4)1-xBrx (Ea = 0.52–0.64 eV, σtot, 313K = 10−6 S cm−1)58. 1D diffusion channels are favorable for accessing liquid-like ionic conductivity behavior, as in Li10GeP2S1259, and further tuning of the composition to increase octahedral occupancy in stuffed wurtzites would increase carrier density. More generally, O–O pathways along the c axis, involving face sharing of octahedra, have not yet been experimentally reported in other hcp sulfide materials (Supplementary Table 10).
Selection of the elements to combine is a foundational choice in synthetic chemistry, which is difficult because of the diversity of possible bonding and structure types, and the scale of the available options, and consequential because it determines considerable subsequent investment of experimental and computational effort. By learning the interplay between the chemical characteristics of the elements themselves in those combinations that afford synthetically accessible crystalline inorganic solids, we can support human decisions to identify and prioritize chemistries under consideration for experimental exploration. Despite the absence of strong negative data in materials synthesis, the unsupervised approach of the VAE allowed training on positive examples only. We connect the latent space of the VAE to chemical space and thus select a two anion quaternary phase field that affords a new lithium-ion conductor, because the resulting representation captures complex patterns of similarity between the known and candidate phase fields that allows these candidates to be effectively ranked. Targeting of a region within this quaternary by probe structure prediction focuses experimental investigation and enables the discovery of a new mixed anion stuffed wurtzite phase of composition Li3.3SnS3.3Cl0.7 that displays a Li+ ion diffusion pathway previously unseen experimentally in hcp anion arrays, and inaccessible to date with monoanionic chemistry.
Here the collaborative workflow supports the choice to explore the more rarely investigated two anion chemistry. The distinctive structure, resulting understanding of cation transport in close-packed anion arrays and enhanced interfacial stability to lithium metal over the cation-substituted single anion systems then reveals the impact of this change to multiple anion materials. The specific choice of chemistry to study versus other attractive, viable two anion systems was supported quantitatively by the VAE, because it provides a ranking of candidate chemistries based on data at a scale that is complementary to knowledge of human researchers: that individual highly ranked chemistries align with human understanding is a point in favour of the VAE, which reinforces and quantifies this understanding, and additionally numerically ranks such chemistries. This support came not as a standalone tool but as part of a workflow with both deterministic and probabilistic components, where the final decisions are made by researchers. The workflow could be extended to include other approaches, for example, in focussing high-throughput experimental methods on promising chemistries. The new structure and property outcomes in the five successful highly ranked chemistries to date build trust in the collaborative ML approach and emerge from detailed serial experimentation and analysis using multiple techniques.
The outcome highlights the potential automated evaluation of chemical space that starts with element combination selections made at the level of the periodic table and finishes with the prioritization of specific compositional ranges. This may identify chemistries, that, while close in the latent space of the VAE and thus highly accessible synthetically, appear to the human expert quite distinct from reported materials chemistry, accelerating exploration and discovery.
Methods
Variational autoencoder
The VAE model27 consists of two parts—an encoder, with 4 hidden layers with 148, 74, 37, 18 nodes, respectively, and a decoder, with 4 hidden layers with 18, 37, 74, 148 nodes. We add 0.5 dropout60 after each hidden layer. Nodes are activated with ReLU61. The data were processed in batches of 24 entries, and weights and biases of the model were optimized during training with the ‘Adam’ method62.
Crystal structure prediction (CSP)
In CSP with ChemDASH39, for each composition, the structure was initialized with anions (S2−, Cl1−) located on a 2 × 2 × 2 or 3 × 2 × 2 grid in a close-packed arrangement and cations (Li1+, Sn4+) occupying the interstitial sites. Up to 600 Li-Sn-vacancy and S-Cl atomic swaps were performed from initial structures and optimized geometrically with VASP to produce candidate structures for each composition.
In CSP with XtalOpt40, each composition was initialized with a random structure and up to nine evolutional generations were considered with 50 mutated structures in each. The generations were created by mutations of a structure as well as by combining two-parent structures into a new structure. Mutations are direct transformations of the crystal structures—crossover, strain, nonlinear “ripple”, exchange (atomic swaps), and their combinations.
VASP calculations
Calculations for structure prediction were based on energy calculations after geometry optimization for reference and probe structures that were performed in VASP-5.4.441 with PAW pseudopotentials63, a 700 eV kinetic energy cutoff for plane waves, and 5 × 5 × 5 k-points sampling. 1e–10 eV threshold for total energy convergence in self-consistent runs, and 0.001 eV/Å threshold for convergence of forces were used for all computations.
Synthesis
A sample of Li3.3SnS3.3Cl0.7 was synthesized by solid-state annealing in evacuated sealed quartz tubes using stoichiometric amounts of Li2S (Merck, 99.98 %), SnS (Alfa Aesar, 99 + %), LiCl (Merck, 99.99 %), S (Merck, 99.98 %). Precursors were weighed in order to yield a total mass of powder of approximately 300 mg. Powders were combined and mixed thoroughly with an agate pestle and mortar for 15 min, transferred into an alumina crucible, and then sealed in a quartz tube under a pressure of 10−4 mbar. The ampoule containing the sample was heated to 700 °C at a ramp rate of 5 °C min−1, held at 700 °C for 12 hours, and then cooled to room temperature at a ramp rate of 5 °C min−1. The resulting powder was then manually ground in order to obtain a fine powder. Precursors and resulting powders were handled in an Ar-filled glovebox (O2 < 1 ppm). Neutron powder diffraction (NPD) experiments were conducted on a 7Li-enriched sample of 7Li3.3SnS3.3Cl0.7, using 7Li2S as precursor material, which was synthesized according to the method described by Leube et al.64, starting from 7Li2CO3 (Merck, 99 % 7Li).
Diffraction
Synchrotron X-ray diffraction (SXRD) was performed at the I11 beamline at Diamond Light Source (Oxfordshire, UK), with an incident wavelength of 0.82637(1) Å using a wide-angle position sensitive detector, and samples sealed in D = 0.3 mm glass capillaries to prevent air exposure. Time-of-flight (ToF) neutron powder diffraction (NPD) data were collected at room temperature using the Polaris instrument at ISIS neutron source (Oxfordshire, U.K.). Samples were loaded in D = 8 mm thin-walled vanadium cylindrical cans and sealed using an In gasket in an Argon-filled glovebox.
AC-impedance spectroscopy
A pellet of the Li3.3SnS3.3Cl0.7 powder was made by uniaxial pressing 30 mg of powder in a 5 mm diameter cylindrical steel die at a pressure of 125 MPa. A relative density of 80% was obtained by this method. Au electrodes were subsequently sputter coated onto both the faces of the pellet using a Q150R Plus—Rotary Pumped Coater. A.C. impedance measurements were performed using a custom-built sample holder, in the temperature range 25–125 °C, using an impedance analyser (Keysight Technologies E4990A) in the frequency range from 12 MHz to 20 Hz (with an amplitude of 50 mV). All procedures and measurements were carried out in an Ar-filled glovebox to avoid sample decomposition in the air. For measurements at a lower frequency, AC impedance was collected from 3 MHz to 1 mHz with a voltage amplitude of 100 mV using a Biologic VSP-300 potentiostat/galvanostat on a second pellet. This second measurement was made to visualize the blocking electrode response with only access to the total conductivity, while the first measurement was used to determine and dissociate bulk and grain boundary conductivities in the function of temperature (Supplementary Discussion). The impedance spectra were fitted with an equivalent circuit using the ZView2 programme65. Both the measurements give the same total conductivity within error.
Maximum entropy method (MEM) analysis
Maximum entropy calculation was performed with the programme Dysnomia66 using an input file containing observed structure factors from the NPD data of Bank 4 of Polaris and generated by FullProf67. Visualization of nuclear densities and extraction of 2D displays was then performed in the programme Vesta68.
Nuclear Magnetic Resonance (NMR)
All NMR data were recorded at 9.4 T on a Bruker AVIII HD spectrometer. 6Li Magic Angle Spinning (MAS) NMR experiments were obtained with a 4 mm HXY MAS probe (in double resonance mode) with the X channel tuned to 7Li at ω0/2π (6Li) = 58.9 MHz and under MAS at a rate of ωr/2π = 10 kHz. Spectra were obtained with a pulse length of 3 µs at a radiofrequency (rf) field amplitude of ω1/2π = 83 kHz. The sample was packed into a 4 mm MAS rotor under an Ar-atmosphere. Variable temperature 7Li static NMR experiments were recorded on a 4 mm HXY MAS probe (in double resonance mode) at and below room temperature and on a 4 mm HX High-Temperature MAS Probe above room temperature with the X channel tuned to 7Li at ω0/2π (7Li) = 156 MHz. The 7Li spectra were recorded with a pulse length of 1.5 µs at an rf field amplitude of ω1/2π = 83 kHz. The sample was flame sealed in glass inserts under an Argon atmosphere of 10−3 mbar. All 6,7Li spectra were referenced to 10 M LiCl in D2O at 0 ppm.
Additional experimental methods are available in the Supplementary Methods.
Data availability
The raw ICSD-2017 data used in this study is available at https://www.github.com/lrcfmd/PhaseFieldRanking. The distribution of the phase fields’ rankings, computed phase field’s energy profile (convex hull) and experimental data generated in this study are available via University of Liverpool data repository at http://datacat.liverpool.ac.uk/id/eprint/1157. The X-ray crystallographic coordinates for the structure reported in this study have been deposited at the Cambridge Crystallographic Data Centre (CCDC), under deposition number 2023329. These data can be obtained free of charge from The Cambridge Crystallographic Data Centre via www.ccdc.cam.ac.uk/data_request/cif.
Code availability
The code (implementation of VAE and Phase Field Ranking method) developed for this work is available at https://www.github.com/lrcfmd/PhaseFieldRanking. https://doi.org/10.5281/zenodo.5113851
References
Zagorac, D., Müller, H., Ruehl, S., Zagorac, J. & Rehme, S. Recent developments in the inorganic crystal Structure database: theoretical crystal structure data and related features. J. Appl. Cryst. 52, 918–925 (2019).
Davies, D. W. et al. Computational screening of all stoichiometric inorganic materials. Chem 1, 617–627 (2016).
Haynes, A. S., Stoumpos, C. C., Chen, H., Chica, D. & Kanatzidis, M. G. Panoramic synthesis as an effective materials discovery tool: the system Cs/Sn/P/Se as a test case. J. Am. Chem. Soc. 139, 10814–10821 (2017).
Canfield, P. C. New materials physics. Rep. Prog. Phys. 83, 016501 (2019).
Xia, Z. & Poeppelmeier, K. R. Chemistry-inspired adaptable framework structures. Acc. Chem. Res. 50, 1222–1230 (2017).
Wong-Ng, W., Roth, R. S., Vanderah, T. A. & McMurdie, H. F. Phase equilibria and crystallography of ceramic oxides. J. Res. Natl Inst. Stand. Technol. 106, 1097–1134 (2001).
Ward, L., Agrawal, A., Choudhary, A. & Wolverton, C. A general-purpose machine learning framework for predicting properties of inorganic materials. Npj Comput. Mater. 2, 16028 (2016).
Schmidt, J. et al. Predicting the thermodynamic stability of solids combining density functional theory and machine learning. Chem. Mater. 29, 5090–5103 (2017).
Liu, Y., Zhao, T., Ju, W. & Shi, S. Materials discovery and design using machine learning. J. Materiomics 3, 159–177 (2017).
Butler, K. T., Davies, D. W., Cartwright, H., Isayev, O. & Walsh, A. Machine learning for molecular and materials science. Nature. 559, 547–555 (2018).
Oliynyk, A. O., Adutwum, L. A., Harynuk, J. J. & Mar, A. Classifying crystal structures of binary compounds AB through cluster resolution feature selection and support vector machine analysis. Chem. Mater. 28, 6672–6681 (2016).
Sendek, A. D. et al. Machine learning-assisted discovery of solid Li-ion conducting materials. Chem. Mater. 31, 342–352 (2019).
Oliynyk, A. O. et al. Disentangling structural confusion through machine learning: structure prediction and polymorphism of equiatomic ternary phases ABC. J. Am. Chem. Soc. 139, 17870–17881 (2017).
Zhuo, Y., Mansouri Tehrani, A. & Brgoch, J. Predicting the band gaps of inorganic solids by machine learning. J. Phys. Chem. Lett. 9, 1668–1673 (2018).
Zhang, Y. et al. Unsupervised discovery of solid-state lithium ion conductors. Nat. Commun. 10, 1–7 (2019).
Tshitoyan, V. et al. Unsupervised word embeddings capture latent knowledge from materials science literature. Nature. 571, 95–98 (2019).
Harada, J. K., Charles, N., Poeppelmeier, K. R. & Rondinelli, J. M. Heteroanionic materials by design: progress toward targeted properties. Adv. Mater. 31, 1805295 (2019).
Kageyama, H. et al. Expanding frontiers in materials chemistry and physics with multiple anions. Nat. Commun. 9, 772 (2018).
Kraft, M. A. et al. Influence of lattice polarizability on the ionic conductivity in the lithium superionic argyrodites Li6PS5X (X = Cl, Br, I). J. Am. Chem. Soc. 139, 10909–10918 (2017).
Bates, J. B. et al. Electrical properties of amorphous lithium electrolyte thin films. Solid State Ion. 53–56, 647–654 (1992).
Collins, C. et al. Accelerated discovery of two crystal structure types in a complex inorganic phase field. Nature. 546, 280–284 (2017).
Gamon, J. et al. Computationally guided discovery of the sulfide Li3AlS3 in the Li–Al–S phase field: structure and lithium conductivity. Chem. Mater. 31, 9699–9714 (2019).
Canabarro, A., Fanchini, F. F., Malvezzi, A. L., Pereira, R. & Chaves, R. Unveiling phase transitions with machine learning. Phys. Rev. B. 100, 045129 (2019).
Tschandl, P. et al. Human–computer collaboration for skin cancer recognition. Nat. Med. 26, 1229–1234 (2020).
More Than Machines. Nat. Mach. Intell. 1, 1–1 (2019).
Jha, D. et al. ElemNet: deep learning the chemistry of materials from only elemental composition. Sci. Rep. 8, 1–13 (2018).
Glawe, H., Sanna, A., Gross, E. K. U. & Marques, M. A. L. The optimal one dimensional periodic table: a modified pettifor chemical scale from data mining. N. J. Phys. 18, 093011 (2016).
Pavlenko, T. On feature selection, curse-of-dimensionality and error probability in discriminant analysis. J. Stat. Plan. Inference. 115, 565–584 (2003).
Kingma, D. P. & Welling, M. Auto-encoding variational bayes. arXiv:1312.6114 [cs, stat] (2014).
Gong, D. et al. Memorizing Normality to Detect Anomaly: Memory-Augmented Deep Autoencoder for Unsupervised Anomaly Detection. in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 1705–1714 (2019). https://doi.org/10.1109/ICCV.2019.00179
Amarbayasgalan, T., Jargalsaikhan, B. & Ryu, K. H. Unsupervised novelty detection using deep autoencoders with density based clustering. Appl. Sci. 8, 1468 (2018).
Zimek, A., Schubert, E. & Kriegel, H.-P. A survey on unsupervised outlier detection in high-dimensional numerical data. Stat. Anal. Data Min.: Data Sci. J. 5, 363–387 (2012).
Lin, E., Mukherjee, S. & Kannan, S. A deep adversarial variational autoencoder model for dimensionality reduction in single-cell RNA sequencing. Anal. Bioinform. 21, 64 (2020).
Walker, J., Doersch, C., Gupta, A. & Hebert, M. An Uncertain Future: Forecasting from Static Images Using Variational Autoencoders. in Computer Vision – ECCV 2016 (eds. Leibe, B., Matas, J., Sebe, N. & Welling, M.) 835–851 (Springer International Publishing, 2016). https://doi.org/10.1007/978-3-319-46478-7_51
Suzuki, K. et al. Synthesis, structure, and electrochemical properties of crystalline Li-P-S-O solid electrolytes: novel lithium-conducting oxysulfides of Li10GeP2S12 family. Solid State Ion. 288, 229–234 (2016).
Gamon, J. et al. Li4.3AlS3.3Cl0.7: a sulfide-chloride lithium solid electrolyte with highly disordered structure and increased conductivity. ChemRxiv: https://doi.org/10.26434/chemrxiv.14454627.v1 (2021).
Morscher, A. et al. Li6SiO4Cl2: a hexagonal argyrodite based on antiperovskite layer stacking. Chem. Mater. 33, 2206–2217 (2021).
Holzmann, T. et al. Li0.6[Li0.2Sn0.8S2] – a layered lithium superionic conductor. Energy Environ. Sci. 9, 2578–2585 (2016).
Sharp, P. M., Dyer, M. S., Darling, G. R., Claridge, J. B. & Rosseinsky, M. J. Chemically directed structure evolution for crystal structure prediction. Phys. Chem. Chem. Phys. 22, 18205–18218 (2020).
Lonie, D. C. & Zurek, E. XtalOpt: an open-source evolutionary algorithm for crystal structure prediction. Comput. Phys. Commun. 182, 372–387 (2011).
Kresse, G. & Hafner, J. Ab initio molecular dynamics for liquid metals. Phys. Rev. B 47, 558–561 (1993).
MacKenzie, K. J. D. & Smith, M. E. Multinuclear solid-state nuclear magnetic resonance of inorganic materials. (Elsevier, 2002).
Kaib, T. et al. New lithium chalcogenidotetrelates, LiChT: synthesis and characterization of the Li+-conducting tetralithium ortho-sulfidostannate Li4SnS4. Chem. Mater. 24, 2211–2219 (2012).
Kanazawa, K. et al. Mechanochemical synthesis and characterization of metastable hexagonal Li4SnS4 solid electrolyte. Inorg. Chem. 57, 9925–9930 (2018).
Choi, Y. E. et al. Coatable Li4SnS4 solid electrolytes prepared from aqueous solutions for all-solid-state lithium-ion batteries. ChemSusChem. 10, 2605–2611 (2017).
Kwak, H. et al. Li+ Conduction in air-stable Sb-substituted Li4SnS4 for all-solid-state Li-ion batteries. J. Power Sources 446, 227338 (2020).
Sahu, G. et al. Air-stable, high-conduction solid electrolytes of arsenic-substituted Li4SnS4. Energy Environ. Sci. 7, 1053–1058 (2014).
Krohns, S. et al. The route to resource-efficient novel materials. Nat. Mater. 10, 899–901 (2011).
Banerjee, A. et al. Interfaces and interphases in all-solid-state batteries with inorganic solid electrolytes. Chem. Rev. 120, 6878–6933 (2020).
Zhang, Z. et al. Li4-xSbxSn1-xS4 solid solutions for air-stable solid electrolytes. J. Energy Chem. 41, 171–176 (2020).
Asano, T. et al. Solid halide electrolytes with high lithium-ion conductivity for application in 4 V class bulk-type all-solid-state batteries. Adv. Mater. 30, 1803075 (2018).
Nishimura, S. et al. Experimental visualization of lithium diffusion in LixFePO4. Nat. Mater. 7, 707–711 (2008).
Sholl, C. A. Nuclear spin relaxation by translational diffusion in liquids and solids: high- and low-frequency limits. J. Phys. C: Solid State Phys. 14, 447–464 (1981).
Kuhn, A. et al. Li ion diffusion in the anode material Li12Si7: ultrafast quasi-1D diffusion and two distinct fast 3D jump processes separately revealed by 7Li NMR relaxometry. J. Am. Chem. Soc. 133, 11018–11021 (2011).
Weber, D. A. et al. Structural insights and 3D diffusion pathways within the lithium superionic conductor Li10GeP2S12. Chem. Mater. 28, 5905–5915 (2016).
Boysen, H. The determination of anharmonic probability densities from static and dynamic disorder by neutron powder diffraction. Z. Kristallogr. Cryst. Mater. 218, 123–131 (2003).
Wang, Y. et al. Design principles for solid-state lithium superionic conductors. Nat. Mater. 14, 1026–1031 (2015).
Cascallana-Matias, I., Keen, D. A., Cussen, E. J. & Gregory, D. H. Phase behavior in the LiBH4–LiBr system and structure of the anion-stabilized fast ionic, high temperature phase. Chem. Mater. 27, 7780–7787 (2015).
Kamaya, N. et al. A lithium superionic conductor. Nat. Mater. 10, 682–686 (2011).
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014).
Agarap, A. F. Deep learning using rectified linear units (ReLU). arXiv:1803.08375 [cs, stat] (2019).
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. arXiv:1412.6980 [cs] (2017).
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59, 1758–1775 (1999).
Leube, B. T. et al. Lithium transport in Li4.4M0.4M′0.6S4 (M = Al3+, Ga3+, and M′ = Ge4+, Sn4+): combined crystallographic, conductivity, solid state NMR, and computational studies. Chem. Mater. 30, 7183–7200 (2018).
Johnson, D. ZView: a software program for IES analysis 3.5d. (Scribner Associates Inc., 2007).
Momma, K., Ikeda, T., Belik, A. A. & Izumi, F. Dysnomia, a computer program for maximum-entropy method (MEM) analysis and its performance in the MEM-based pattern fitting. Powder Diffr. 28, 184–193 (2013).
FullProf Suite. Crystallographic tool for rietveld, profile matching & integrated intensity refinements of X-ray and/or neutron data. http://www.ill.eu/sites/fullprof/ (2006).
Momma, K. & Izumi, F. VESTA 3 for three-dimensional visualization of crystal, volumetric and morphology data. J. Appl. Cryst. 44, 1272–1276 (2011).
Izenman, A. J. Recent developments in nonparametric density estimation. J. Am. Stat. Assoc. 86, 205–224 (1991).
Sears, V. F. Neutron scattering lengths and cross sections. Neutron N. 3, 26–37 (1992).
Acknowledgements
We thank the UK Engineering and Physical Sciences Research Council (EPSRC) for funding through grant number EP/N004884. We are grateful for computational support from the UK Materials and Molecular Modelling Hub, which is partially funded by EPSRC (EP/P020194). We acknowledge the ISCF Faraday Challenge project: “SOLBAT The Solid-State (Li or Na) Metal-Anode Battery” [grant number FIRG007], including partial support of a studentship to B.B.D., also supported by the University of Liverpool. We thank Diamond Light Source for access to beamline I11 (proposal CY23666) and Prof. Chiu Tang and Dr. Claire Murray for assistance on the beamline. We thank STFC for access to Polaris at ISIS Spallation Neutron Source (Xpress proposal 1990189) and Dr. Ron Smith for running the measurement. M.W.G. and V.G. thank the Leverhulme Trust for funding via the Leverhulme Research Centre for Functional Materials Design. M.J.R. thanks the Royal Society for a Research Professorship.
Author information
Authors and Affiliations
Contributions
A.V. identified, developed, and implemented the VAE model in discussion with V.G., M.W.G., and M.S.D. and performed all the CSP. J.G. performed all synthetic work on the Li-Sn-S-Cl phases, solved and analysed the structure in collaboration with L.M.D. and J.B.C. and performed the impedance characterization, with assistance from A.M. B.B.D. and F.B. designed, performed, and analysed all the NMR experiments. R.C., A.R.N., and L.J.H. designed and performed the symmetrical cell and interface evaluation experiments. M.Z. performed all the electron microscopy. J.F.S. performed all the experimental work on the Li-Mg-S-Cl phases, and evaluated all the data in collaboration with L.M.D. and J.B.C.; P.M.S. and M.S.D. performed all the computational work on these phases. A.V., J.G., and M.J.R. wrote the first draft, all authors contributed to the completion of the manuscript. A.V., J.G., M.W.G., V.G., L.M.D., M.S.D. identified the project idea. M.J.R. directed the project.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Vasylenko, A., Gamon, J., Duff, B.B. et al. Element selection for crystalline inorganic solid discovery guided by unsupervised machine learning of experimentally explored chemistry. Nat Commun 12, 5561 (2021). https://doi.org/10.1038/s41467-021-25343-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-021-25343-7
This article is cited by
-
Bayesian optimization acquisition functions for accelerated search of cluster expansion convex hull of multi-component alloys
npj Computational Materials (2024)
-
Machine Learning Discovers Invariants of Braids and Flat Braids
Advances in Applied Clifford Algebras (2024)
-
Methods and applications of machine learning in computational design of optoelectronic semiconductors
Science China Materials (2024)
-
Unleashing the Potential of NASICON Materials for Solid-State Batteries
JOM (2024)
-
Accelerating the prediction of stable materials with machine learning
Nature Computational Science (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.