Element selection for crystalline inorganic solid discovery guided by unsupervised machine learning of experimentally explored chemistry

Vasylenko, Andrij; Gamon, Jacinthe; Duff, Benjamin B.; Gusev, Vladimir V.; Daniels, Luke M.; Zanella, Marco; Shin, J. Felix; Sharp, Paul M.; Morscher, Alexandra; Chen, Ruiyong; Neale, Alex R.; Hardwick, Laurence J.; Claridge, John B.; Blanc, Frédéric; Gaultois, Michael W.; Dyer, Matthew S.; Rosseinsky, Matthew J.

doi:10.1038/s41467-021-25343-7

Download PDF

Article
Open access
Published: 21 September 2021

Element selection for crystalline inorganic solid discovery guided by unsupervised machine learning of experimentally explored chemistry

Andrij Vasylenko¹,
Jacinthe Gamon¹,
Benjamin B. Duff^1,2,
Vladimir V. Gusev^1,3,
Luke M. Daniels¹,
Marco Zanella¹,
J. Felix Shin¹,
Paul M. Sharp^1,3,
Alexandra Morscher¹,
Ruiyong Chen¹,
Alex R. Neale ORCID: orcid.org/0000-0001-7675-5432^1,2,
Laurence J. Hardwick^1,2,
John B. Claridge^1,3,
Frédéric Blanc ORCID: orcid.org/0000-0001-9171-1454^1,2,3,
Michael W. Gaultois ORCID: orcid.org/0000-0003-2172-2507^1,3,
Matthew S. Dyer^1,3 &
…
Matthew J. Rosseinsky ORCID: orcid.org/0000-0002-1910-2483^1,3

Nature Communications volume 12, Article number: 5561 (2021) Cite this article

16k Accesses
33 Citations
313 Altmetric
Metrics details

Subjects

Abstract

The selection of the elements to combine delimits the possible outcomes of synthetic chemistry because it determines the range of compositions and structures, and thus properties, that can arise. For example, in the solid state, the elemental components of a phase field will determine the likelihood of finding a new crystalline material. Researchers make these choices based on their understanding of chemical structure and bonding. Extensive data are available on those element combinations that produce synthetically isolable materials, but it is difficult to assimilate the scale of this information to guide selection from the diversity of potential new chemistries. Here, we show that unsupervised machine learning captures the complex patterns of similarity between element combinations that afford reported crystalline inorganic materials. This model guides prioritisation of quaternary phase fields containing two anions for synthetic exploration to identify lithium solid electrolytes in a collaborative workflow that leads to the discovery of Li_3.3SnS_3.3Cl_0.7. The interstitial site occupancy combination in this defect stuffed wurtzite enables a low-barrier ion transport pathway in hexagonal close-packing.

Accelerated identification of equilibrium structures of multicomponent inorganic crystals using machine learning potentials

Article Open access 12 May 2022

Unsupervised discovery of solid-state lithium ion conductors

Article Open access 20 November 2019

Towards universal neural network potential for material discovery applicable to arbitrary combination of 45 elements

Article Open access 30 May 2022

Introduction

Researchers select new chemistries to investigate based on hypotheses generated from their understanding, for example, in choosing specific combinations of elements (referred to as phase fields hereafter) to explore to synthesise new materials. Here it is the choice of the chemical elements defining the phase field that is decisive in determining the outcomes of the study, as this delimits the attainable compositions and structures: clearly, subsequent decisions are critically important but the bounds are set by the choice of a phase field through selection of a subset of elements from those available in the periodic table. In solid-state materials chemistry, information on stable crystalline compounds is available at scale (>200,000 entries in the Inorganic Crystal Structure Database, ICSD)¹. The factors underlying the stability, whether kinetic or thermodynamic, of these compounds are many and complex, reflecting the diverse forms of bonding interaction between their constituent elements. It is, however; difficult for researchers to hold thousands of such prior examples in mind whilst deciding which of the large numbers of unexplored phase fields to study². Synthetic exploration of phase fields yields new structures and materials compositions that drive condensed matter science^3,4,5,6. We aggregate the information in ICSD on reported phases to define those phase fields that contain synthetically isolated compounds, rather than individual materials compositions, and thus guide element selection for synthesis.

There has been a surge of machine learning (ML) studies that aim to extract the underlying patterns of chemistry available from ICSD^7,8,9,10, based on the knowledge of the composition and structure at the level of an individual material. These statistical methods represent materials as multidimensional vectors (feature vectors). Significant progress has been achieved in deriving the required features^1,11. Both the supervised and unsupervised learning strategies have been successfully applied to a wide variety of structure-property relationships in chemical sciences. Supervised learning^8,12,13,14 infers relationships between materials’ features and materials’ properties, and requires large hand-labelled datasets for training. Unsupervised ML, on the other hand, infers underlying patterns of chemical knowledge in the absence of human-labelled data^15,16. We report a collaborative ML-human expert workflow, in which unsupervised learning addresses the combinatorial problem of the discovery of new materials in the high-level formulation required by synthetic chemists to recognize the patterns at the level of element combinations that define those phase fields known to contain synthetically isolated crystalline compounds. This goes beyond the traditional focus at the level of individual materials to support the decisions made in identifying new chemistries to explore. We present a neural network model that tackles this distinct prediction task to allow prioritisation of new phase fields for investigation according to the extent to which their particular element combinations reflect those chemistries that lead to phase stability in reported materials. We use this model to rank unexplored quaternary two anion phase fields for experimental investigation as lithium-ion conductor candidates, noting the relative under-representation of multiple anion compounds as a class of crystalline materials^17,18 and their importance, notwithstanding their overall rarity, as solid electrolytes that arises from a range of properties^19,20. Machine learning-assisted researcher assessment of the candidate element combinations then identified the Li-Sn-S-Cl field. Probe structure computation^21,22 targeted a region within this field for synthetic exploration that afforded the defect stuffed wurtzite Li_3.3SnS_3.3Cl_0.7. Structural and dynamical analysis of this phase demonstrated a new pathway for lithium transport in hexagonal close-packing (hcp).

Results and discussion

Machine learning (ML) models are powerful tools to study multivariate correlations that exist within large datasets but are hard for humans to identify^16,23. Our aim is to build a model that captures the chemical interactions between the element combinations that afford reported crystalline inorganic materials, noting that the aim of such models is efficacy rather than interpretability, and that as such they can be complementary guides to human experts. The model should assist expert prioritization between the promising element combinations by ranking them quantitatively. Researchers have practically understood how to identify new chemistries based on element combinations for phase-field exploration, but not at significant scale. However, the prioritization of these attractive knowledge-based choices for experimental and computational investigation is critical as it determines substantial resource commitment. The collaborative ML workflow^24,25 developed here includes a ML tool trained across all available data at a scale beyond that, which humans can assimilate simultaneously to provide numerical ranking of the likelihood of identifying new phases in the selected chemistries. We illustrate the predictive power of ML in this workflow in the discovery of a new solid-state Li-ion conductor from unexplored quaternary phase fields with two anions. To train a model to assist prioritization of these candidate phase fields, we extracted 2021 M_xM′_yA_zA′_t phases reported in ICSD (Fig. 1, Step 1), and associated each phase with the phase fields M-M′-A-A′ where M, M′ span all cations, A, A′ are anions {N³⁻, P³⁻, As³⁻, O²⁻, S²⁻, Se²⁻, Te²⁻, F⁻, Cl⁻, Br⁻, and I⁻} and x, y, z, t denote concentrations (Fig. 1, Step 2). Data were augmented by 24-fold elemental permutations to enhance learning and prevent overfitting (Supplementary Fig. 2).

**Fig. 1: Schematics of the method for phase-field ranking for synthetic exploration.**

ML models rely on using appropriate features (often called descriptors)²⁶ to describe the data presented, so feature selection is critical to the quality of the model. The challenge of selecting the best set of features among the multitude available for the chemical elements (e.g., atomic weight, valence, ionic radius, etc.)²⁶ lies in balancing competing considerations: a small number of features usually makes learning more robust, while limiting the predictive power of resulting models, large numbers of features tend to make models more descriptive and discriminating while increasing the risk of overfitting. We evaluated 40 individual features^26,27 (Supplementary Fig. 4, 5) that have reported values for all elements and identify a set of 37 elemental features that best balance these considerations. We thus describe each phase field of four elements as a vector in a 148-dimensional feature space (37 features × 4 elements = 148 dimensions).

To infer relationships between entries in such a high-dimensional feature space in which the training data are necessarily sparsely distributed²⁸, we employ the variational autoencoder (VAE), an unsupervised neural network-based dimensionality reduction method (Fig. 1, Step 3), which quantifies nonlinear similarities in high-dimensional unlabelled data²⁹ and, in addition to the conventional autoencoder, pays close attention to the distribution of the data features in multidimensional space. A VAE is a two-part neural network, where one part is used to compress (encode) the input vectors into a lower-dimensional (latent) space, and the other to decode vectors in latent space back into the original high-dimensional space. Here we choose to encode the 148-dimensional input feature space into a four-dimensional latent feature space (Supplementary Methods). The VAE model is trained to minimize the Euclidean distances (reconstruction errors) between the original and decoded vectors in the 148-dimensional feature space (Fig. 1, Step 4). In addition, VAEs avoid overfitting by putting additional constraints arising from changes in feature distribution in the latent space to make it better organized, e.g., by favoring similar decoding for nearby vectors in the latent space. In this way, VAEs ensure that the lower-dimensional vectors in latent space capture the necessary information to properly describe and discriminate between entries in the training set. Because the training set in our case includes only phase fields where quaternary compounds are experimentally reported in the ICSD, we expect our model to learn the encoding biased to those phase fields that contain synthetically accessible quaternary compounds. The reconstruction error then captures the degree to which data deviate from the learned model, and therefore can be used to rank new input data entries by their similarity to general trends in the training data^30,31; in our case, between the candidate phase fields and those phase fields that contain compounds reported in ICSD. Similarly to one-class classifiers, the VAE is trained on positives, hence it ranks based on the likelihood of positive outcomes, rather than the absence of negative outcomes. This is appropriate for the materials discovery problem, as we can only make positive statements with certainty about observed compounds, while the absence of reported compounds in a particular phase field might simply reflect the synthetic pathways explored to date: there are only strong positive examples available. Validation of the VAE thus combines statistical evaluation with a qualitative assessment of the tool based on its role in supporting the successful exploration of new chemistries.

In the second stage (Stage II, Fig. 1), the VAE model built in Stage I is used to quantify the likelihood that selected unexplored quaternary two anion phase fields contain stable compounds. Focusing the search on new Li-ion conductors, the choice of anions and cations can be narrowed down by their physical characteristics. For example, we avoid transition metals, which may introduce undesired redox properties, as well as toxic or particularly scarce elements. Hence, we select the candidate phase fields Li-M-A-A′, where M = {B, Mg, Al, Si, P, K, Ca, Zn, Sr, Y, Zr, Sn, Ba, Ta, La} represent cations, A and A′ represent anions, {N, O, S, F, Cl, Br, I}, forming 303 unexplored phase fields in total. This constitutes a set of candidate chemistries that are attractive to researchers because they respect the criteria used to select them but which require prioritization for investigation. The VAE is trained to rank these candidates by their reconstruction error (RE - the key metric for autoencoders across a variety of applications^31,32,33,34) and thus provides quantitative guidance for consideration by the researcher in the choice between chemistries that are qualitatively of interest. A full list of the ranked candidate phase fields is given in Supplementary Table 3. This list suggests the most promising candidates with a high likelihood of forming isolated compounds, whose detailed theoretical and experimental future study should be prioritized.

The distribution of normalized reconstruction errors (Fig. 2a) illustrates that the model is able to reconstruct efficiently most of the reported phase fields, as 79.5% have a normalized RE below 0.5. To validate the model, we hold out five different validation datasets (each consisting of 20% of the available data from ICSD), while using the remaining 80% of data to train five different VAE models. We then assess how the validation datasets are reconstructed using the corresponding VAEs. Using this method, on average, 79.8% of the phase fields in the validation datasets have normalized reconstruction errors below 0.5. In addition to conventional validation of unsupervised methods, we also ascertain that the distributions of RE persist across the validation sets, ICSD training data, and the testing data of unexplored phase fields (Supplementary Fig. 3).

**Fig. 2: Outcome of the phase-field ranking method: from selection of elements to composition.**

The resulting ranking of unexplored elemental combinations allows us to narrow down the search to the most promising phase fields, justifying subsequent application of computationally- and experimentally expensive research techniques. To evaluate the model, we focus on the cluster of data with the smallest reconstruction errors in the test dataset (highlighted with dark orange in Fig. 2a). Among these promising phase fields, a new composition was discovered recently (not included in our training data) in the Li-P-S-O phase field³⁵ (ranked #8). Three further examples of the realization of highly ranked chemistries are recently available^36,37 (Supplementary Fig. 18-20). Among the top five unexplored phase fields, the highest Li-ion conductivity for ternary phases underlying the candidate quaternaries was reported for Li_0.8Sn_0.8S₂³⁸ (Supplementary Table 4). This numerical ranking guided our choice of the Li-Sn-S-Cl field (ranked #5, Supplementary Table 3) for experimental investigation.

The VAE quantifies understanding of which element combinations are likely to lead to new isolated compounds, and reflects that understanding well enough to build researcher trust in the resulting ranking through the viability of the highly scored element combinations. This trust is essential to allow confident use of the tool in choosing within the attractive candidate set of 303 phase fields. The numerical ranking of chemistry by the VAE can be used with other numerical criteria (in this case the Li-ion conductivity of ternary phases underlying the candidate quaternaries) to support researcher decisions. The selection of the Li-Sn-S-Cl field from multiple attractive, viable candidate chemistries is a typical decision required in synthetic materials chemistry.

We identified the optimal regions within the selected quaternary for synthetic exploration with high-throughput probe structure prediction^21,22. This computes energies for structural models with cell sizes up to 13 × 13 × 13 Å³ aiming to capture the accessible bonding motifs at the chosen composition, thus approximating the lowest accessible energy at that composition and highlighting regions of chemical space where previously unreported phases are likely to be found. We employ Crystal Structure Prediction (CSP) for 244 different compositions in the Li-Sn-S-Cl phase field (cyan tripods in Fig. 2b) with the complementary algorithms of basin-hopping ChemDASH^22,39, which explores the energy landscape, with starting ionic configurations based on hcp and rhombohedral lattices and evolutionary XtalOpt⁴⁰ exploring random and mutated (hybrid) ionic configurations drawn from a population of structures, spanning in total up to 1200 structures for each composition to maximize coverage of possible ionic arrangements. Both methods are coupled with VASP⁴¹ for energy computation at the DFT level of accuracy (cf. Methods). We use these energies and those of reported compounds to calculate the convex hull⁸, which defines thermodynamic stability at 0 K (Fig. 2b). Regions close (≤50 meV per atom) to the convex hull suggest a range of potentially stable compositions for experimental synthesis. This includes a composition with an energy on the convex hull, Li₃SnS₃Cl (Fig. 2b, Composition #2), i.e., predicted independently by both CSP methods to be thermodynamically stable at 0 K. Its composition, and its predicted wurtzite-related structure, with one of the two tetrahedral sites selectively occupied and anion order around the distinct cations (Fig. 2c), are closely related to the experimentally discovered phase that is discussed herein.

We sample the experimental points from across the phase field, while favoring compositions with a predicted low formation energy, aligned with the motivation of the workflow to focus and thus minimize synthetic work needed to identify new phases. Maximizing the coverage with a minimum number of samples, we choose the lowest energy point for synthesis from each sector of the phase field on a 3 × 3 grid (Fig. 2b), discarding compositions with low lithium content (<10 mol%). This leads to the selection of six samples (#1-6, circled tripods in Figure 2b), which we synthesized via a high-temperature solid-state route in evacuated quartz ampoules and identified the resulting phases with X-ray diffraction (Supplementary Table 5).

For two out of the six samples (magenta circles on Fig. 2b), there is a set of Bragg reflections that does not correspond to any reported phase (Supplementary Fig. 6a, b), which we associate with a new phase denoted Phase A. From the Rietveld refinements of these two samples, we derive the relative weight fractions that are used to determine a subsequent composition close to that of Phase A (Supplementary Methods). Graphically, this corresponds to the intersection of the two yellow lines in Fig. 2b—a composition close to Li_3.6SnS_3.6Cl_0.4 (Sample #7). After the reaction, this sample revealed the presence of Phase A with a single impurity of orthorhombic Li₄SnS₄ (Supplementary Fig. 6c) indicating that Phase A lies on the line: Li_4-xSnS_4-xCl_x, 0.4 < x < 1 (magenta line, Fig. 2b). Samples were made along the line using the same synthetic procedure (Supplementary Methods). For x = 0.7, no impurities were detected by laboratory XRD, therefore, the composition of Phase A was identified as Li_3.3SnS_3.3Cl_0.7 (magenta cross, Fig. 2b). It is important to note that, as neither element ratios nor cell sizes are bounded within a phase field, it is not possible to target the exact composition for synthesis. This reinforces the importance, as the first step of the workflow, of the quantitative assessment of the selection of the phase field itself as the overall determinant of whether materials are subsequently isolated, and thus the aggregation of individual phases at the level of phase fields for model construction: identification of element combinations for study at the level of the periodic table is thus the problem addressed by the VAE model. We determine the structure of Phase A through combined Rietveld refinement of powder Synchrotron X-Ray Diffraction (SXRD) and Neutron Powder Diffraction (NPD) data (Fig. 3a–c). The reflections corresponding to Phase A were indexed to a hexagonal unit cell with lattice parameters a = 3.9723(3) Å and c = 6.3524(6) Å. Systematic absences gave the extinction symbol P – – c with the refinement of both the datasets demonstrating site occupancies within the hcp anion array consistent only with the P6₃mc space group (Supplementary Discussion).

**Fig. 3: Structure determination of Li_3.3SnS_3.3Cl_0.7.**

The diffraction-derived average structure is a defect stuffed wurtzite, where Cl and S atoms are randomly distributed on the fully occupied anionic site and the Sn and Li atoms on the fully occupied tetrahedral T⁺ interstices, with the T⁻ sites empty, while the remaining 0.3 lithium atoms per formula unit are located on the octahedral site, O (Fig. 3e). Refinement of the site occupancy factors (sof) reveals full occupancy of the anionic and the tetrahedral sites with S/Cl and Sn/Li atomic ratios of 0.823(8)/0.175(8) and 0.242(3)/0.760(3), respectively, in agreement with the expected composition. Room temperature ⁶Li Magic Angle Spinning (MAS) Nuclear Magnetic Resonance (NMR) reveals the presence of one main deshielded signal centered at 1.4 ppm (Fig. 3d), in agreement with the majority of Li occupying the tetrahedral interstices (Fig. 3f - octahedral sites are located near 0 ppm)⁴², as well as a small shoulder at 1.8 ppm attributed to a small amount (∼10 mol%) of orthorhombic Li₄SnS₄⁴³. Interestingly, the computed probe structure for the closely related composition Li₃SnS₃Cl, which lies on the convex hull and thus reflects potential thermodynamic stability, also shows a wurtzite-type structure, with S²⁻ and Cl⁻ anions packed in an hcp manner, Li⁺ and Sn⁴⁺ occupying T⁺ interstices only, and both anions and cations ordering over these sites in a 3:1 arrangement (Fig. 2c). Here Sn⁴⁺ is purely co-ordinated by sulfide while Li⁺ forms both LiS₂Cl₂ (one third) and LiS₃Cl tetrahedra (two-thirds).

In the experimental average structure, there is no cation or anion site ordering over diffraction length scales, but there is evidence that local anion order influences the cation coordination environments, because different average positions for Li1 and Sn on the single T⁺ site are refined from the combined diffraction data. The Li1 position is slightly displaced towards the base of the tetrahedron compared to Sn, which presents a much more regular coordination environment (Fig. 3f, Supplementary Table 7, 8) that can be associated with a predominant S-only single anion first coordination sphere driven by the higher formal cation charge and consistent with the anion ordering at the distinct cation sites in the probe structure of Li₃SnS₃Cl (Fig. 2c). The less regular geometry around Li1 would then be associated with the presence of both S and Cl anions in its first coordination sphere, as seen in the computed probe structure, driven by the lower formal charges of Li and Cl.

The Fourier difference map (Supplementary Fig. 7) enabled us to identify the second Li site (Li2), within the O interstice, and the refinement of the sof showed a non-negligible occupancy of 0.092(8) (Fig. 3g). This model led to an excellent fit to both datasets (Fig. 3a–c) and accounted for the remaining Li atoms, with a refined phase composition of Li_3.41(4)SnS_3.29(3)Cl_0.70(3) (Supplementary Discussion), close to the measured bulk composition Li_3.305(14)Sn_1.000(9)S_3.317(44)Cl_0.6269(8) as determined by ICP-AES and supported by WDX analysis (Supplementary Discussion, Supplementary Fig. 8). The measured bulk composition is consistent with the presence of only S and Cl as anion-forming elements, determined by ICP-AES and WDX, and with the nominal composition Li_3.3SnS_3.3Cl_0.7, which we use throughout the paper on the basis of these bulk and phase analyses.

Li_3.3SnS_3.3Cl_0.7 is compositionally related to the known material Li₄SnS₄, which has been reported to exhibit crystalline dimorphism, with both the structures consisting of hcp-type packing of the S²⁻ anions^43,44 but it has a quite different structure. The thermodynamically stable Li₄SnS₄ phase crystallizes in an orthorhombic space group with a γ-Li₃PO₄ structure type⁴³, while the low-temperature phase (prepared by mechanosynthesis or wet chemistry) is metastable and transforms irreversibly into the orthorhombic phase when heated above 350 °C^44,45. Low-temperature Li₄SnS₄ has regular hcp S²⁻ packing in space group P6₃/mmc but is not a wurtzite, as the Sn cations partially occupy both the T⁺ and the T⁻ sites in the hcp anion array i.e., it is quite unlike Li_3.3SnS_3.3Cl_0.7, where only one of the two sets of tetrahedral sites are occupied, lowering the average structure symmetry to P6₃mc. Cationic substitution of As and Sb for Sn in stable Li₄SnS₄ were reported previously, and shown to maintain the orthorhombic structure^44,45. Comparing these quaternary chemistries that emerge from the Li-Sn-S ternary, Cl is a more sustainable element than As or Sb⁴⁶, which represent cation substitution into a single anion material, and the resulting two anion chemistry leads to a well-defined original structure. The distinction between the single and multiple anion chemistry is demonstrated by the different structures accessed. The new two anion structure represents a platform for subsequent cation substitution.

In Li_3.3SnS_3.3Cl_0.7, the introduction of the second anion, Cl⁻, stabilized the high symmetry hexagonal sulfide packing to permit high synthesis temperatures that afforded sufficient crystallinity (coherent scattering domain of 200(10) nm, Supplementary Methods) to locate and refine Li positions and occupancies, producing the first full structural model of a Li-ion conducting defect stuffed wurtzite.

Here, the use of the VAE and CSP tools in the collaborative workflow enabled targeting the exploration of an entire phase field that resulted in the discovery of a new phase. The isolation of such a phase of narrow compositional width with a specific and high content of the second anion would not be facilitated by low-level doping of the known Li₄SnS₄ structure, which is the most common approach. Although it is not possible to prove exhaustively that other cations would not also lead to the formation of this new phase, because the VAE is trained only on positive examples, the successful outcome supports the practical use of the VAE in assisting selection of element combinations for synthetic exploration, and demonstrates the consequential nature of the choice of the Li-Sn-S-Cl phase field from the other candidates in the test set that also offer attractive chemistries.

We measure lithium-ion conductivity on a pressed pellet by AC-impedance spectroscopy (ACIS). The Nyquist plot shows two overlapping semicircles in the high-frequency region (Fig. 4a) and a Warburg tail in the low-frequency region (inset, Fig. 4a), indicating that the material is a typical ion conductor. The bulk and grain boundary conductivities are extracted from fitting Nyquist plots with an equivalent circuit (Fig. 4a, Supplementary Discussion). The electronic conductivity is determined by DC polarization measurement (Supplementary Discussion, Supplementary Fig. 10). At room temperature, the total, bulk, and electronic conductivity are σ_tot= 2.4(1) × 10⁻⁶ S cm⁻¹, σ_bulk = 3.2(3) × 10⁻⁵ S cm⁻¹, and σ_e = 1.3(6) × 10⁻⁸ S cm⁻¹, respectively. The electronic contribution to the conductivity is negligible, and accounts for only around 1% of the total conductivity. The temperature dependence of σ_bulk followed an Arrhenius behavior (Fig. 4d, blue curve), with an activation energy, E_a, of 0.38(3) eV. The bulk conductivity of Li_3.3SnS_3.3Cl_0.7 is slightly lower than the reported total conductivities of orthorhombic Li₄SnS₄ (σ_tot = 7.0 × 10⁻⁵ S cm⁻¹)⁴³ and hexagonal Li₄SnS₄ (σ_tot = 1.1 × 10⁻⁴ S cm⁻¹)⁴⁴. Moreover, galvanostatic Li plating/stripping over 400 h in a Li | Li_3.3SnS_3.3Cl_0.7| Li symmetric cell and the ex situ structural characterization confirm the back-and-forth Li⁺ transport through the Li_3.3SnS_3.3Cl_0.7 solid electrolyte, as well as across the Li_3.3SnS_3.3Cl_0.7 | Li⁰ interface (Fig. 4e, Supplementary Discussion, Supplementary Fig. 12), and good chemical compatibility between the Li_3.3SnS_3.3Cl_0.7 solid electrolyte and Li metal (Supplementary Fig. 13, 14). An initial slight increase in plating/stripping overpotentials in the first 25 cycles (50 h) suggests the evolution of a growing interphase layer (Supplementary Discussion, Supplementary Fig. 12). Thereafter, no further overpotential increase is observed (even at higher temperatures), indicating a suppression of further interfacial reactions. Ex situ XRD patterns (Supplementary Fig. 13) and Raman spectra (Supplementary Fig. 14) of the cycled bulk pellet show the starting structure of Li_3.3SnS_3.3Cl_0.7 is well maintained after 400 h cycling. This result is in contrast with the chemical incompatibility between Sb- and As-doped Li₄SnS₄ single anion materials and Li metal^46,47. The new phase highlights an effective route to enhance the interfacial stability of a solid electrolyte against Li by tailoring the chemical composition and structure with the use of two anions⁴⁸. In general, the bulk stability of the quaternary ionic conductor as studied here against lithium metal depends on the formation energy of its corresponding decomposition products⁴⁹. An unfavorable decomposition energy is thus expected for the current Li_3.3SnS_3.3Cl_0.7, which is enabled by the introduction of the second anion. The interfacial stability against lithium may arise from the formation of some initial decomposition species (Supplementary Fig. 14) that are thermodynamically stable with lithium metal (such as LiCl) and electronically insulating, which kinetically inhibit a further decomposition of the solid electrolyte.

**Fig. 4: Lithium conductivity and dynamics in Li_3.3SnS_3.3Cl_0.7.**

We record the ⁷Li NMR spectra and spin-lattice relaxation rates (SLR) of Li_3.3SnS_3.3Cl_0.7 to provide further insight into the dynamics on the MHz and kHz timescale (Methods, Supplementary Discussion). The temperature dependence of the full width at half maximum of the ⁷Li static NMR spectra (Fig. 4b) reveals line narrowing starting at around 200 K upon heating, and hence an increase in Li motion at this temperature. The inflection point of this temperature-dependent line width defines the Li⁺ ion jump rate, τ⁻¹, which is of the order of the line width in the rigid lattice regime (∼5 kHz), yielding a value of ∼3.3(2) × 10⁴ s⁻¹ at T ~250 K. The SLR rates are purely induced by diffusion processes here and their increase and decrease in the rotating frame (T_1ρ⁻¹) with increasing temperature are characteristic of the slow and fast motion regime, respectively⁴⁹. Maxima in Fig. 4c are observed when Li⁺ jump rates, τ⁻¹, are on the order of the probe spin-lock frequency ω₁ and obey the relationship 2ω₁ ≈ τ_c⁻¹ (where τ_c⁻¹ is the correlation rate of the Li motion and is essentially the average τ⁻¹)⁵⁰ giving jump rate values of the order of 1.8 × 10⁵ − 1 × 10⁶ s⁻¹ in the 330–355 K temperature range.

We then estimate the NMR conductivity σ_NMR from combined Nernst-Einstein and Einstein-Smoluchowski equations (Supplementary Equation 5) and extract an NMR activation barrier of 0.23(5) eV for Li-ion diffusion (Fig. 4d). This value is lower than that determined by ACIS as NMR spectroscopy determines the barrier of diffusion of Li to its neighboring site, whereas ACIS probes longer-range translational diffusion. The extrapolated conductivity value at 303 K is 2.6(7) × 10⁻⁵ S cm⁻¹, in good agreement with the bulk conductivity value of 3.2(3) × 10⁻⁵ S cm⁻¹ obtained from ACIS.

The frequency dependence of the NMR SLR rates (Fig. 4c) also provides insight into the dimensionality of the Li⁺ diffusion processes. The T_1ρ⁻¹ values obtained are dependent on the probe frequency ω₁ and hence eliminate the possibility of three-dimensional (3D) diffusion^51,52. Furthermore, plots of T_1ρ⁻¹ against (τ/ω₁)^0.5 and τln(1/ω₁τ) that are characteristic (Supplementary Discussion) of one- (1D) and two-dimensional (2D) diffusion processes, respectively^51,52, are shown in Fig. 5a for data obtained at 425 K (Supplementary Fig. 17 shows data at other temperatures) and indicate 1D or 2D Li⁺ diffusion.

**Fig. 5: Li diffusion pathways in Li_3.3SnS_3.3Cl_0.7.**

Analysis of the periodic distribution of the scattering density provides further important experimental evidence for possible diffusion pathways and their dimensionality^50,53. Figure 5b shows the nuclear density obtained by the maximum entropy method (MEM). Three potential diffusion pathways are highlighted: i) Path 1 (O–O): 1D diffusion from one Li2 octahedron to another through their common face along the c axis (green arrows, Fig. 5b, c), ii) Path 2 (T⁺ –T⁻–T⁺): 3D diffusion from the Li1 (T⁺) tetrahedra to one of the vacant T⁻ tetrahedra of the same layer via their common edge (3 possibilities spanning the whole (ab) plane, grey arrows on Fig. 5b), followed by diffusion to the T⁺ site in the layer above through their common face (along c, grey arrows Fig. 5b, c) and iii) Path 3 (T⁺-O-T⁺): 2D diffusion in the sulfide slab from one Li1 tetrahedron to another by passing through the edge of the T⁺ base, via the octahedral interstice and again through the edge of the adjacent Li1 tetrahedron (blue arrows, Fig. 5b,e).

The nuclear density along each diffusion pathway is linked to an energy scale (the One Particle Potential, OPP) via the Boltzmann distribution^54,55, ρ ∼ρ₀ × e^(−OPP/kT) (Fig. 5d, f). In this approximation, each ion is treated as individual Einstein oscillator, considered non-deformable by bonding, and any additional defect formation energy is neglected. OPP values are not quantitative and may not be related to other techniques, but qualitative and relative comparison within the same phase is valid and has been demonstrated⁵⁶. Path 3 shows much higher energy barriers compared to Path 1 and 2 indicating that 2D diffusion is not dominant. As NMR also dismisses possible 3D conductivity, Path 2 with its three-dimensional nature can also be discarded. This is further reinforced by the absence of vacancies on the Li1 site preventing direct Li1-Li1 jumps as described in Paths 2 and 3. By combining NMR and MEM data, we demonstrate 1D diffusion in the defect stuffed mixed anion wurtzite, involving Li2-Li2 jumps between the partially occupied O sites along the c axis described by Path 1. O–O was indeed identified as the lower energy barrier pathway in a model sulfide hcp lattice⁵⁷. However, near the wurtzite structure, as O is a nonstable site, it is often left vacant when composition allows, and the limiting mechanism is defined by diffusion through T–O shared faces with higher activation energy, therefore hindering conductivity. By using two anions to stabilize Li cation excess with both O and T sites occupied, we show that defect stuffed wurtzites should be considered as promising Li ionic conductor candidates by locating the Li cations and resolving their conduction pathways. The cation stuffing onto the O site opens the lower energy barrier O–O diffusion pathway as the main limiting transport mechanism, in contrast to tetrahedral-only materials. This is consistent with the lower activation energy of Li_3.3SnS_3.3Cl_0.7 (E_a = 0.38(3) eV) compared to non-stuffed wurtzite Li(BH₄)_1-xBr_x (E_a = 0.52–0.64 eV, σ_{tot, 313K} = 10⁻⁶ S cm⁻¹)⁵⁸. 1D diffusion channels are favorable for accessing liquid-like ionic conductivity behavior, as in Li₁₀GeP₂S₁₂⁵⁹, and further tuning of the composition to increase octahedral occupancy in stuffed wurtzites would increase carrier density. More generally, O–O pathways along the c axis, involving face sharing of octahedra, have not yet been experimentally reported in other hcp sulfide materials (Supplementary Table 10).

Selection of the elements to combine is a foundational choice in synthetic chemistry, which is difficult because of the diversity of possible bonding and structure types, and the scale of the available options, and consequential because it determines considerable subsequent investment of experimental and computational effort. By learning the interplay between the chemical characteristics of the elements themselves in those combinations that afford synthetically accessible crystalline inorganic solids, we can support human decisions to identify and prioritize chemistries under consideration for experimental exploration. Despite the absence of strong negative data in materials synthesis, the unsupervised approach of the VAE allowed training on positive examples only. We connect the latent space of the VAE to chemical space and thus select a two anion quaternary phase field that affords a new lithium-ion conductor, because the resulting representation captures complex patterns of similarity between the known and candidate phase fields that allows these candidates to be effectively ranked. Targeting of a region within this quaternary by probe structure prediction focuses experimental investigation and enables the discovery of a new mixed anion stuffed wurtzite phase of composition Li_3.3SnS_3.3Cl_0.7 that displays a Li⁺ ion diffusion pathway previously unseen experimentally in hcp anion arrays, and inaccessible to date with monoanionic chemistry.

Here the collaborative workflow supports the choice to explore the more rarely investigated two anion chemistry. The distinctive structure, resulting understanding of cation transport in close-packed anion arrays and enhanced interfacial stability to lithium metal over the cation-substituted single anion systems then reveals the impact of this change to multiple anion materials. The specific choice of chemistry to study versus other attractive, viable two anion systems was supported quantitatively by the VAE, because it provides a ranking of candidate chemistries based on data at a scale that is complementary to knowledge of human researchers: that individual highly ranked chemistries align with human understanding is a point in favour of the VAE, which reinforces and quantifies this understanding, and additionally numerically ranks such chemistries. This support came not as a standalone tool but as part of a workflow with both deterministic and probabilistic components, where the final decisions are made by researchers. The workflow could be extended to include other approaches, for example, in focussing high-throughput experimental methods on promising chemistries. The new structure and property outcomes in the five successful highly ranked chemistries to date build trust in the collaborative ML approach and emerge from detailed serial experimentation and analysis using multiple techniques.

The outcome highlights the potential automated evaluation of chemical space that starts with element combination selections made at the level of the periodic table and finishes with the prioritization of specific compositional ranges. This may identify chemistries, that, while close in the latent space of the VAE and thus highly accessible synthetically, appear to the human expert quite distinct from reported materials chemistry, accelerating exploration and discovery.

Methods

Variational autoencoder

The VAE model²⁷ consists of two parts—an encoder, with 4 hidden layers with 148, 74, 37, 18 nodes, respectively, and a decoder, with 4 hidden layers with 18, 37, 74, 148 nodes. We add 0.5 dropout⁶⁰ after each hidden layer. Nodes are activated with ReLU⁶¹. The data were processed in batches of 24 entries, and weights and biases of the model were optimized during training with the ‘Adam’ method⁶².

Crystal structure prediction (CSP)

In CSP with ChemDASH³⁹, for each composition, the structure was initialized with anions (S²⁻, Cl¹⁻) located on a 2 × 2 × 2 or 3 × 2 × 2 grid in a close-packed arrangement and cations (Li¹⁺, Sn⁴⁺) occupying the interstitial sites. Up to 600 Li-Sn-vacancy and S-Cl atomic swaps were performed from initial structures and optimized geometrically with VASP to produce candidate structures for each composition.

In CSP with XtalOpt⁴⁰, each composition was initialized with a random structure and up to nine evolutional generations were considered with 50 mutated structures in each. The generations were created by mutations of a structure as well as by combining two-parent structures into a new structure. Mutations are direct transformations of the crystal structures—crossover, strain, nonlinear “ripple”, exchange (atomic swaps), and their combinations.

VASP calculations

Calculations for structure prediction were based on energy calculations after geometry optimization for reference and probe structures that were performed in VASP-5.4.4⁴¹ with PAW pseudopotentials⁶³, a 700 eV kinetic energy cutoff for plane waves, and 5 × 5 × 5 k-points sampling. 1e–10 eV threshold for total energy convergence in self-consistent runs, and 0.001 eV/Å threshold for convergence of forces were used for all computations.

Synthesis

A sample of Li_3.3SnS_3.3Cl_0.7 was synthesized by solid-state annealing in evacuated sealed quartz tubes using stoichiometric amounts of Li₂S (Merck, 99.98 %), SnS (Alfa Aesar, 99 + %), LiCl (Merck, 99.99 %), S (Merck, 99.98 %). Precursors were weighed in order to yield a total mass of powder of approximately 300 mg. Powders were combined and mixed thoroughly with an agate pestle and mortar for 15 min, transferred into an alumina crucible, and then sealed in a quartz tube under a pressure of 10⁻⁴ mbar. The ampoule containing the sample was heated to 700 °C at a ramp rate of 5 °C min⁻¹, held at 700 °C for 12 hours, and then cooled to room temperature at a ramp rate of 5 °C min⁻¹. The resulting powder was then manually ground in order to obtain a fine powder. Precursors and resulting powders were handled in an Ar-filled glovebox (O₂ < 1 ppm). Neutron powder diffraction (NPD) experiments were conducted on a ⁷Li-enriched sample of ⁷Li_3.3SnS_3.3Cl_0.7, using ⁷Li₂S as precursor material, which was synthesized according to the method described by Leube et al.⁶⁴, starting from ⁷Li₂CO₃ (Merck, 99 % ⁷Li).

Diffraction

Synchrotron X-ray diffraction (SXRD) was performed at the I11 beamline at Diamond Light Source (Oxfordshire, UK), with an incident wavelength of 0.82637(1) Å using a wide-angle position sensitive detector, and samples sealed in D = 0.3 mm glass capillaries to prevent air exposure. Time-of-flight (ToF) neutron powder diffraction (NPD) data were collected at room temperature using the Polaris instrument at ISIS neutron source (Oxfordshire, U.K.). Samples were loaded in D = 8 mm thin-walled vanadium cylindrical cans and sealed using an In gasket in an Argon-filled glovebox.

AC-impedance spectroscopy

A pellet of the Li_3.3SnS_3.3Cl_0.7 powder was made by uniaxial pressing 30 mg of powder in a 5 mm diameter cylindrical steel die at a pressure of 125 MPa. A relative density of 80% was obtained by this method. Au electrodes were subsequently sputter coated onto both the faces of the pellet using a Q150R Plus—Rotary Pumped Coater. A.C. impedance measurements were performed using a custom-built sample holder, in the temperature range 25–125 °C, using an impedance analyser (Keysight Technologies E4990A) in the frequency range from 12 MHz to 20 Hz (with an amplitude of 50 mV). All procedures and measurements were carried out in an Ar-filled glovebox to avoid sample decomposition in the air. For measurements at a lower frequency, AC impedance was collected from 3 MHz to 1 mHz with a voltage amplitude of 100 mV using a Biologic VSP-300 potentiostat/galvanostat on a second pellet. This second measurement was made to visualize the blocking electrode response with only access to the total conductivity, while the first measurement was used to determine and dissociate bulk and grain boundary conductivities in the function of temperature (Supplementary Discussion). The impedance spectra were fitted with an equivalent circuit using the ZView2 programme⁶⁵. Both the measurements give the same total conductivity within error.

Maximum entropy method (MEM) analysis

Maximum entropy calculation was performed with the programme Dysnomia⁶⁶ using an input file containing observed structure factors from the NPD data of Bank 4 of Polaris and generated by FullProf⁶⁷. Visualization of nuclear densities and extraction of 2D displays was then performed in the programme Vesta⁶⁸.

Nuclear Magnetic Resonance (NMR)

All NMR data were recorded at 9.4 T on a Bruker AVIII HD spectrometer. ⁶Li Magic Angle Spinning (MAS) NMR experiments were obtained with a 4 mm HXY MAS probe (in double resonance mode) with the X channel tuned to ⁷Li at ω₀/2π (⁶Li) = 58.9 MHz and under MAS at a rate of ω_r/2π = 10 kHz. Spectra were obtained with a pulse length of 3 µs at a radiofrequency (rf) field amplitude of ω₁/2π = 83 kHz. The sample was packed into a 4 mm MAS rotor under an Ar-atmosphere. Variable temperature ⁷Li static NMR experiments were recorded on a 4 mm HXY MAS probe (in double resonance mode) at and below room temperature and on a 4 mm HX High-Temperature MAS Probe above room temperature with the X channel tuned to ⁷Li at ω₀/2π (⁷Li) = 156 MHz. The ⁷Li spectra were recorded with a pulse length of 1.5 µs at an rf field amplitude of ω₁/2π = 83 kHz. The sample was flame sealed in glass inserts under an Argon atmosphere of 10⁻³ mbar. All ^6,7Li spectra were referenced to 10 M LiCl in D₂O at 0 ppm.

Additional experimental methods are available in the Supplementary Methods.

Data availability

The raw ICSD-2017 data used in this study is available at https://www.github.com/lrcfmd/PhaseFieldRanking. The distribution of the phase fields’ rankings, computed phase field’s energy profile (convex hull) and experimental data generated in this study are available via University of Liverpool data repository at http://datacat.liverpool.ac.uk/id/eprint/1157. The X-ray crystallographic coordinates for the structure reported in this study have been deposited at the Cambridge Crystallographic Data Centre (CCDC), under deposition number 2023329. These data can be obtained free of charge from The Cambridge Crystallographic Data Centre via www.ccdc.cam.ac.uk/data_request/cif.

Code availability

The code (implementation of VAE and Phase Field Ranking method) developed for this work is available at https://www.github.com/lrcfmd/PhaseFieldRanking. https://doi.org/10.5281/zenodo.5113851

References

Zagorac, D., Müller, H., Ruehl, S., Zagorac, J. & Rehme, S. Recent developments in the inorganic crystal Structure database: theoretical crystal structure data and related features. J. Appl. Cryst. 52, 918–925 (2019).
Article CAS Google Scholar
Davies, D. W. et al. Computational screening of all stoichiometric inorganic materials. Chem 1, 617–627 (2016).
Article CAS PubMed PubMed Central Google Scholar
Haynes, A. S., Stoumpos, C. C., Chen, H., Chica, D. & Kanatzidis, M. G. Panoramic synthesis as an effective materials discovery tool: the system Cs/Sn/P/Se as a test case. J. Am. Chem. Soc. 139, 10814–10821 (2017).
Article CAS PubMed Google Scholar
Canfield, P. C. New materials physics. Rep. Prog. Phys. 83, 016501 (2019).
Article ADS PubMed CAS Google Scholar
Xia, Z. & Poeppelmeier, K. R. Chemistry-inspired adaptable framework structures. Acc. Chem. Res. 50, 1222–1230 (2017).
Article CAS PubMed Google Scholar
Wong-Ng, W., Roth, R. S., Vanderah, T. A. & McMurdie, H. F. Phase equilibria and crystallography of ceramic oxides. J. Res. Natl Inst. Stand. Technol. 106, 1097–1134 (2001).
Article CAS PubMed PubMed Central Google Scholar
Ward, L., Agrawal, A., Choudhary, A. & Wolverton, C. A general-purpose machine learning framework for predicting properties of inorganic materials. Npj Comput. Mater. 2, 16028 (2016).
Article Google Scholar
Schmidt, J. et al. Predicting the thermodynamic stability of solids combining density functional theory and machine learning. Chem. Mater. 29, 5090–5103 (2017).
Article CAS Google Scholar
Liu, Y., Zhao, T., Ju, W. & Shi, S. Materials discovery and design using machine learning. J. Materiomics 3, 159–177 (2017).
Article ADS Google Scholar
Butler, K. T., Davies, D. W., Cartwright, H., Isayev, O. & Walsh, A. Machine learning for molecular and materials science. Nature. 559, 547–555 (2018).
Article ADS CAS PubMed Google Scholar
Oliynyk, A. O., Adutwum, L. A., Harynuk, J. J. & Mar, A. Classifying crystal structures of binary compounds AB through cluster resolution feature selection and support vector machine analysis. Chem. Mater. 28, 6672–6681 (2016).
Article CAS Google Scholar
Sendek, A. D. et al. Machine learning-assisted discovery of solid Li-ion conducting materials. Chem. Mater. 31, 342–352 (2019).
Article CAS Google Scholar
Oliynyk, A. O. et al. Disentangling structural confusion through machine learning: structure prediction and polymorphism of equiatomic ternary phases ABC. J. Am. Chem. Soc. 139, 17870–17881 (2017).
Article CAS PubMed Google Scholar
Zhuo, Y., Mansouri Tehrani, A. & Brgoch, J. Predicting the band gaps of inorganic solids by machine learning. J. Phys. Chem. Lett. 9, 1668–1673 (2018).
Article CAS PubMed Google Scholar
Zhang, Y. et al. Unsupervised discovery of solid-state lithium ion conductors. Nat. Commun. 10, 1–7 (2019).
ADS CAS Google Scholar
Tshitoyan, V. et al. Unsupervised word embeddings capture latent knowledge from materials science literature. Nature. 571, 95–98 (2019).
Article ADS CAS PubMed Google Scholar
Harada, J. K., Charles, N., Poeppelmeier, K. R. & Rondinelli, J. M. Heteroanionic materials by design: progress toward targeted properties. Adv. Mater. 31, 1805295 (2019).
Article CAS Google Scholar
Kageyama, H. et al. Expanding frontiers in materials chemistry and physics with multiple anions. Nat. Commun. 9, 772 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Kraft, M. A. et al. Influence of lattice polarizability on the ionic conductivity in the lithium superionic argyrodites Li₆PS₅X (X = Cl, Br, I). J. Am. Chem. Soc. 139, 10909–10918 (2017).
Article CAS PubMed Google Scholar
Bates, J. B. et al. Electrical properties of amorphous lithium electrolyte thin films. Solid State Ion. 53–56, 647–654 (1992).
Article Google Scholar
Collins, C. et al. Accelerated discovery of two crystal structure types in a complex inorganic phase field. Nature. 546, 280–284 (2017).
Article ADS CAS PubMed Google Scholar
Gamon, J. et al. Computationally guided discovery of the sulfide Li₃AlS₃ in the Li–Al–S phase field: structure and lithium conductivity. Chem. Mater. 31, 9699–9714 (2019).
Article CAS PubMed PubMed Central Google Scholar
Canabarro, A., Fanchini, F. F., Malvezzi, A. L., Pereira, R. & Chaves, R. Unveiling phase transitions with machine learning. Phys. Rev. B. 100, 045129 (2019).
Article ADS CAS Google Scholar
Tschandl, P. et al. Human–computer collaboration for skin cancer recognition. Nat. Med. 26, 1229–1234 (2020).
Article CAS PubMed Google Scholar
More Than Machines. Nat. Mach. Intell. 1, 1–1 (2019).
Jha, D. et al. ElemNet: deep learning the chemistry of materials from only elemental composition. Sci. Rep. 8, 1–13 (2018).
Article ADS Google Scholar
Glawe, H., Sanna, A., Gross, E. K. U. & Marques, M. A. L. The optimal one dimensional periodic table: a modified pettifor chemical scale from data mining. N. J. Phys. 18, 093011 (2016).
Article Google Scholar
Pavlenko, T. On feature selection, curse-of-dimensionality and error probability in discriminant analysis. J. Stat. Plan. Inference. 115, 565–584 (2003).
Article MathSciNet MATH Google Scholar
Kingma, D. P. & Welling, M. Auto-encoding variational bayes. arXiv:1312.6114 [cs, stat] (2014).
Gong, D. et al. Memorizing Normality to Detect Anomaly: Memory-Augmented Deep Autoencoder for Unsupervised Anomaly Detection. in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 1705–1714 (2019). https://doi.org/10.1109/ICCV.2019.00179
Amarbayasgalan, T., Jargalsaikhan, B. & Ryu, K. H. Unsupervised novelty detection using deep autoencoders with density based clustering. Appl. Sci. 8, 1468 (2018).
Article Google Scholar
Zimek, A., Schubert, E. & Kriegel, H.-P. A survey on unsupervised outlier detection in high-dimensional numerical data. Stat. Anal. Data Min.: Data Sci. J. 5, 363–387 (2012).
Article MathSciNet MATH Google Scholar
Lin, E., Mukherjee, S. & Kannan, S. A deep adversarial variational autoencoder model for dimensionality reduction in single-cell RNA sequencing. Anal. Bioinform. 21, 64 (2020).
CAS Google Scholar
Walker, J., Doersch, C., Gupta, A. & Hebert, M. An Uncertain Future: Forecasting from Static Images Using Variational Autoencoders. in Computer Vision – ECCV 2016 (eds. Leibe, B., Matas, J., Sebe, N. & Welling, M.) 835–851 (Springer International Publishing, 2016). https://doi.org/10.1007/978-3-319-46478-7_51
Suzuki, K. et al. Synthesis, structure, and electrochemical properties of crystalline Li-P-S-O solid electrolytes: novel lithium-conducting oxysulfides of Li₁₀GeP₂S₁₂ family. Solid State Ion. 288, 229–234 (2016).
Article CAS Google Scholar
Gamon, J. et al. Li_4.3AlS_3.3Cl_0.7: a sulfide-chloride lithium solid electrolyte with highly disordered structure and increased conductivity. ChemRxiv: https://doi.org/10.26434/chemrxiv.14454627.v1 (2021).
Morscher, A. et al. Li₆SiO₄Cl₂: a hexagonal argyrodite based on antiperovskite layer stacking. Chem. Mater. 33, 2206–2217 (2021).
Article CAS PubMed PubMed Central Google Scholar
Holzmann, T. et al. Li_0.6[Li_0.2Sn_0.8S₂] – a layered lithium superionic conductor. Energy Environ. Sci. 9, 2578–2585 (2016).
Article CAS Google Scholar
Sharp, P. M., Dyer, M. S., Darling, G. R., Claridge, J. B. & Rosseinsky, M. J. Chemically directed structure evolution for crystal structure prediction. Phys. Chem. Chem. Phys. 22, 18205–18218 (2020).
Article CAS PubMed Google Scholar
Lonie, D. C. & Zurek, E. XtalOpt: an open-source evolutionary algorithm for crystal structure prediction. Comput. Phys. Commun. 182, 372–387 (2011).
Article ADS CAS MATH Google Scholar
Kresse, G. & Hafner, J. Ab initio molecular dynamics for liquid metals. Phys. Rev. B 47, 558–561 (1993).
Article ADS CAS Google Scholar
MacKenzie, K. J. D. & Smith, M. E. Multinuclear solid-state nuclear magnetic resonance of inorganic materials. (Elsevier, 2002).
Kaib, T. et al. New lithium chalcogenidotetrelates, LiChT: synthesis and characterization of the Li⁺-conducting tetralithium ortho-sulfidostannate Li₄SnS₄. Chem. Mater. 24, 2211–2219 (2012).
Article CAS Google Scholar
Kanazawa, K. et al. Mechanochemical synthesis and characterization of metastable hexagonal Li₄SnS₄ solid electrolyte. Inorg. Chem. 57, 9925–9930 (2018).
Article CAS PubMed Google Scholar
Choi, Y. E. et al. Coatable Li₄SnS₄ solid electrolytes prepared from aqueous solutions for all-solid-state lithium-ion batteries. ChemSusChem. 10, 2605–2611 (2017).
Article CAS PubMed Google Scholar
Kwak, H. et al. Li⁺ Conduction in air-stable Sb-substituted Li₄SnS₄ for all-solid-state Li-ion batteries. J. Power Sources 446, 227338 (2020).
Article CAS Google Scholar
Sahu, G. et al. Air-stable, high-conduction solid electrolytes of arsenic-substituted Li₄SnS₄. Energy Environ. Sci. 7, 1053–1058 (2014).
Article CAS Google Scholar
Krohns, S. et al. The route to resource-efficient novel materials. Nat. Mater. 10, 899–901 (2011).
Article ADS CAS PubMed Google Scholar
Banerjee, A. et al. Interfaces and interphases in all-solid-state batteries with inorganic solid electrolytes. Chem. Rev. 120, 6878–6933 (2020).
Article CAS PubMed Google Scholar
Zhang, Z. et al. Li_4-xSb_xSn_1-xS₄ solid solutions for air-stable solid electrolytes. J. Energy Chem. 41, 171–176 (2020).
Article Google Scholar
Asano, T. et al. Solid halide electrolytes with high lithium-ion conductivity for application in 4 V class bulk-type all-solid-state batteries. Adv. Mater. 30, 1803075 (2018).
Article CAS Google Scholar
Nishimura, S. et al. Experimental visualization of lithium diffusion in Li_xFePO₄. Nat. Mater. 7, 707–711 (2008).
Article ADS CAS PubMed Google Scholar
Sholl, C. A. Nuclear spin relaxation by translational diffusion in liquids and solids: high- and low-frequency limits. J. Phys. C: Solid State Phys. 14, 447–464 (1981).
Article ADS CAS Google Scholar
Kuhn, A. et al. Li ion diffusion in the anode material Li₁₂Si₇: ultrafast quasi-1D diffusion and two distinct fast 3D jump processes separately revealed by ⁷Li NMR relaxometry. J. Am. Chem. Soc. 133, 11018–11021 (2011).
Article CAS PubMed Google Scholar
Weber, D. A. et al. Structural insights and 3D diffusion pathways within the lithium superionic conductor Li₁₀GeP₂S₁₂. Chem. Mater. 28, 5905–5915 (2016).
Article CAS Google Scholar
Boysen, H. The determination of anharmonic probability densities from static and dynamic disorder by neutron powder diffraction. Z. Kristallogr. Cryst. Mater. 218, 123–131 (2003).
Wang, Y. et al. Design principles for solid-state lithium superionic conductors. Nat. Mater. 14, 1026–1031 (2015).
Article ADS CAS PubMed Google Scholar
Cascallana-Matias, I., Keen, D. A., Cussen, E. J. & Gregory, D. H. Phase behavior in the LiBH₄–LiBr system and structure of the anion-stabilized fast ionic, high temperature phase. Chem. Mater. 27, 7780–7787 (2015).
Article CAS Google Scholar
Kamaya, N. et al. A lithium superionic conductor. Nat. Mater. 10, 682–686 (2011).
Article ADS CAS PubMed Google Scholar
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014).
MathSciNet MATH Google Scholar
Agarap, A. F. Deep learning using rectified linear units (ReLU). arXiv:1803.08375 [cs, stat] (2019).
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. arXiv:1412.6980 [cs] (2017).
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59, 1758–1775 (1999).
Article ADS CAS Google Scholar
Leube, B. T. et al. Lithium transport in Li_4.4M_0.4M′_0.6S₄ (M = Al³⁺, Ga³⁺, and M′ = Ge⁴⁺, Sn⁴⁺): combined crystallographic, conductivity, solid state NMR, and computational studies. Chem. Mater. 30, 7183–7200 (2018).
Article CAS Google Scholar
Johnson, D. ZView: a software program for IES analysis 3.5d. (Scribner Associates Inc., 2007).
Momma, K., Ikeda, T., Belik, A. A. & Izumi, F. Dysnomia, a computer program for maximum-entropy method (MEM) analysis and its performance in the MEM-based pattern fitting. Powder Diffr. 28, 184–193 (2013).
Article ADS CAS Google Scholar
FullProf Suite. Crystallographic tool for rietveld, profile matching & integrated intensity refinements of X-ray and/or neutron data. http://www.ill.eu/sites/fullprof/ (2006).
Momma, K. & Izumi, F. VESTA 3 for three-dimensional visualization of crystal, volumetric and morphology data. J. Appl. Cryst. 44, 1272–1276 (2011).
Article CAS Google Scholar
Izenman, A. J. Recent developments in nonparametric density estimation. J. Am. Stat. Assoc. 86, 205–224 (1991).
MathSciNet MATH Google Scholar
Sears, V. F. Neutron scattering lengths and cross sections. Neutron N. 3, 26–37 (1992).
Article Google Scholar

Download references

Acknowledgements

We thank the UK Engineering and Physical Sciences Research Council (EPSRC) for funding through grant number EP/N004884. We are grateful for computational support from the UK Materials and Molecular Modelling Hub, which is partially funded by EPSRC (EP/P020194). We acknowledge the ISCF Faraday Challenge project: “SOLBAT The Solid-State (Li or Na) Metal-Anode Battery” [grant number FIRG007], including partial support of a studentship to B.B.D., also supported by the University of Liverpool. We thank Diamond Light Source for access to beamline I11 (proposal CY23666) and Prof. Chiu Tang and Dr. Claire Murray for assistance on the beamline. We thank STFC for access to Polaris at ISIS Spallation Neutron Source (Xpress proposal 1990189) and Dr. Ron Smith for running the measurement. M.W.G. and V.G. thank the Leverhulme Trust for funding via the Leverhulme Research Centre for Functional Materials Design. M.J.R. thanks the Royal Society for a Research Professorship.

Author information

Authors and Affiliations

Department of Chemistry, University of Liverpool, Liverpool, UK
Andrij Vasylenko, Jacinthe Gamon, Benjamin B. Duff, Vladimir V. Gusev, Luke M. Daniels, Marco Zanella, J. Felix Shin, Paul M. Sharp, Alexandra Morscher, Ruiyong Chen, Alex R. Neale, Laurence J. Hardwick, John B. Claridge, Frédéric Blanc, Michael W. Gaultois, Matthew S. Dyer & Matthew J. Rosseinsky
Stephenson Institute for Renewable Energy, University of Liverpool, Liverpool, UK
Benjamin B. Duff, Alex R. Neale, Laurence J. Hardwick & Frédéric Blanc
Leverhulme Research Centre for Functional Materials Design, Materials Innovation Factory, University of Liverpool, Liverpool, UK
Vladimir V. Gusev, Paul M. Sharp, John B. Claridge, Frédéric Blanc, Michael W. Gaultois, Matthew S. Dyer & Matthew J. Rosseinsky

Authors

Andrij Vasylenko
View author publications
You can also search for this author in PubMed Google Scholar
Jacinthe Gamon
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin B. Duff
View author publications
You can also search for this author in PubMed Google Scholar
Vladimir V. Gusev
View author publications
You can also search for this author in PubMed Google Scholar
Luke M. Daniels
View author publications
You can also search for this author in PubMed Google Scholar
Marco Zanella
View author publications
You can also search for this author in PubMed Google Scholar
J. Felix Shin
View author publications
You can also search for this author in PubMed Google Scholar
Paul M. Sharp
View author publications
You can also search for this author in PubMed Google Scholar
Alexandra Morscher
View author publications
You can also search for this author in PubMed Google Scholar
Ruiyong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Alex R. Neale
View author publications
You can also search for this author in PubMed Google Scholar
Laurence J. Hardwick
View author publications
You can also search for this author in PubMed Google Scholar
John B. Claridge
View author publications
You can also search for this author in PubMed Google Scholar
Frédéric Blanc
View author publications
You can also search for this author in PubMed Google Scholar
Michael W. Gaultois
View author publications
You can also search for this author in PubMed Google Scholar
Matthew S. Dyer
View author publications
You can also search for this author in PubMed Google Scholar
Matthew J. Rosseinsky
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.V. identified, developed, and implemented the VAE model in discussion with V.G., M.W.G., and M.S.D. and performed all the CSP. J.G. performed all synthetic work on the Li-Sn-S-Cl phases, solved and analysed the structure in collaboration with L.M.D. and J.B.C. and performed the impedance characterization, with assistance from A.M. B.B.D. and F.B. designed, performed, and analysed all the NMR experiments. R.C., A.R.N., and L.J.H. designed and performed the symmetrical cell and interface evaluation experiments. M.Z. performed all the electron microscopy. J.F.S. performed all the experimental work on the Li-Mg-S-Cl phases, and evaluated all the data in collaboration with L.M.D. and J.B.C.; P.M.S. and M.S.D. performed all the computational work on these phases. A.V., J.G., and M.J.R. wrote the first draft, all authors contributed to the completion of the manuscript. A.V., J.G., M.W.G., V.G., L.M.D., M.S.D. identified the project idea. M.J.R. directed the project.

Corresponding author

Correspondence to Matthew J. Rosseinsky.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vasylenko, A., Gamon, J., Duff, B.B. et al. Element selection for crystalline inorganic solid discovery guided by unsupervised machine learning of experimentally explored chemistry. Nat Commun 12, 5561 (2021). https://doi.org/10.1038/s41467-021-25343-7

Download citation

Received: 14 June 2021
Accepted: 04 August 2021
Published: 21 September 2021
DOI: https://doi.org/10.1038/s41467-021-25343-7

This article is cited by

Methods and applications of machine learning in computational design of optoelectronic semiconductors
- Xiaoyu Yang
- Kun Zhou
- Lijun Zhang
Science China Materials (2024)
Unleashing the Potential of NASICON Materials for Solid-State Batteries
- Anand Parejiya
- Marm Dixit
- Ilias Belharouak
JOM (2024)
Accelerating the prediction of stable materials with machine learning
- Sean D. Griesemer
- Yi Xia
- Chris Wolverton
Nature Computational Science (2023)
Element selection for functional materials discovery by integrated machine learning of elemental contributions to properties
- Andrij Vasylenko
- Dmytro Antypov
- Matthew J. Rosseinsky
npj Computational Materials (2023)
Material symmetry recognition and property prediction accomplished by crystal capsule representation
- Chao Liang
- Yilimiranmu Rouzhahong
- Huashan Li
Nature Communications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.