An autonomous laboratory for the accelerated synthesis of novel materials

Szymanski, Nathan J.; Rendy, Bernardus; Fei, Yuxing; Kumar, Rishi E.; He, Tanjin; Milsted, David; McDermott, Matthew J.; Gallant, Max; Cubuk, Ekin Dogus; Merchant, Amil; Kim, Haegyeom; Jain, Anubhav; Bartel, Christopher J.; Persson, Kristin; Zeng, Yan; Ceder, Gerbrand

doi:10.1038/s41586-023-06734-w

Article
Open access
Published: 29 November 2023

Nature volume 624, pages 86–91 (2023)

Abstract

To close the gap between the rates of computational screening and experimental realization of novel materials^1,2, we introduce the A-Lab, an autonomous laboratory for the solid-state synthesis of inorganic powders. This platform uses computations, historical data from the literature, machine learning (ML) and active learning to plan and interpret the outcomes of experiments performed using robotics. Over 17 days of continuous operation, the A-Lab realized 41 novel compounds from a set of 58 targets including a variety of oxides and phosphates that were identified using large-scale ab initio phase-stability data from the Materials Project and Google DeepMind. Synthesis recipes were proposed by natural-language models trained on the literature and optimized using an active-learning approach grounded in thermodynamics. Analysis of the failed syntheses provides direct and actionable suggestions to improve current techniques for materials screening and synthesis design. The high success rate demonstrates the effectiveness of artificial-intelligence-driven platforms for autonomous materials discovery and motivates further integration of computations, historical knowledge and robotics.

Main

Although promising new materials can be identified at scale using high-throughput computations, their experimental realization is often challenging and time-consuming. Accelerating the experimental segment of materials discovery requires not only automation but autonomy—the ability of an experimental agent to interpret data and make decisions based on it. Pioneering efforts have demonstrated autonomy in several aspects of materials research, including robotic and Bayesian-driven optimization of carbon nanotube yield^3,4, photovoltaic performance⁵ and photocatalysis activity⁶. In contrast to conventional ML algorithms used for optimization, human researchers benefit from a wealth of background knowledge that informs their decision-making, and it is increasingly recognized^7,8,9 that autonomy will require a fusion of encoded domain knowledge, access to diverse data sources and active learning.

Here we present the A-Lab, an autonomous laboratory that integrates robotics with the use of ab initio databases, ML-driven data interpretation, synthesis heuristics learned from text-mined literature data and active learning to optimize the synthesis of novel inorganic materials in powder form. Although autonomous workflows based on liquid handling have been demonstrated in organic chemistry^10,11,12,13, the A-Lab addresses the unique challenges of handling and characterizing solid inorganic powders. These often require milling to ensure good reactivity between precursors, which can have a wide range of physical properties related to differences in their density, flow behaviour, particle size, hardness and compressibility. The use of solid powders is well suited for manufacturing and technological scaleup, and the approach of the A-Lab to synthesis produces multigram sample quantities that facilitate device-level testing.

Given a set of air-stable target materials (that is, desired synthesis products whose yield we aim to maximize) screened using the Materials Project¹⁴, the A-Lab generates synthesis recipes using ML models trained on historical data from the literature and then performs these recipes with robotics. The synthesis products are characterized by X-ray diffraction (XRD), with two ML models working together to analyse their patterns. When synthesis recipes fail to produce a high target yield, active learning closes the loop by proposing improved follow-up recipes. Over 17 days of operation, the A-Lab successfully synthesized 41 of 58 target materials that span 33 elements and 41 structural prototypes (Supplementary Fig. 1 and Supplementary Table 1). Inspection of the 17 unobtained targets revealed synthetic as well as computational failure modes, several of which could be overcome through minor adjustments to the lab’s decision-making. With its high success rate in validating predicted materials, the A-Lab showcases the collective power of ab initio computations, ML algorithms, accumulated historical knowledge and automation in experimental research.

Autonomous materials-discovery platform

The materials-discovery pipeline followed by the A-Lab is schematically shown in Fig. 1. All target materials considered in this work are new to the lab, that is, not present in the training data for the algorithms it uses to propose synthesis recipes, and 52 of the 58 targets have no previous synthesis reports, to the best of our knowledge (Methods). The experiments reported in this study represent the first attempts by the A-Lab to synthesize any of these targets. Each target is predicted to be on or very near (<10 meV per atom) the convex hull formed by stable phases taken from the Materials Project¹⁴ and cross-referenced with an analogous database from Google DeepMind. Because the A-Lab handles samples in open air, we only considered targets that are predicted not to react with O₂, CO₂ and H₂O (Methods).

**Fig. 1: Autonomous materials discovery with the A-Lab.**

For each compound proposed to the A-Lab, up to five initial synthesis recipes are generated by an ML model that has learned to assess target ‘similarity’ through natural-language processing of a large database of syntheses extracted from the literature¹⁵, mimicking the way a human researcher bases an initial synthesis attempt on analogy to known related materials. A synthesis temperature is then proposed by a second ML model trained on heating data from the literature¹⁶ (Methods). If these literature-inspired recipes fail to produce >50% yield for their desired targets, the A-Lab continues to experiment using Autonomous Reaction Route Optimization with Solid-State Synthesis (ARROWS³), an active-learning algorithm that integrates ab initio computed reaction energies with observed synthesis outcomes to predict solid-state reaction pathways¹⁷. Experiments are performed under the guidance of this algorithm until the target is obtained as the majority phase or all synthesis recipes available to the A-Lab are exhausted.
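The two-stage decision logic described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `run_campaign`, the recipe labels and the `measure_yield` callback are hypothetical and are not part of the A-Lab software, and the sketch stops at the first successful recipe, which simplifies how batches are actually run.

```python
from typing import Callable, Iterable, Optional

YIELD_THRESHOLD = 0.50       # the target must exceed 50% yield
MAX_LITERATURE_RECIPES = 5   # "up to five" initial recipes per target


def run_campaign(
    literature_recipes: Iterable[str],
    active_learning_recipes: Iterable[str],
    measure_yield: Callable[[str], float],
) -> Optional[str]:
    """Return the first recipe whose yield exceeds the threshold, else None.

    All arguments are hypothetical stand-ins: recipes are plain labels and
    measure_yield plays the role of synthesis followed by XRD analysis.
    """
    # Stage 1: literature-inspired recipes (at most five are tried).
    for recipe in list(literature_recipes)[:MAX_LITERATURE_RECIPES]:
        if measure_yield(recipe) > YIELD_THRESHOLD:
            return recipe
    # Stage 2: recipes proposed by active learning, tried until one
    # succeeds or the available recipes are exhausted.
    for recipe in active_learning_recipes:
        if measure_yield(recipe) > YIELD_THRESHOLD:
            return recipe
    return None


# Illustrative yields only; these numbers are not from the paper.
example_yields = {"lit-1": 0.10, "lit-2": 0.30, "al-1": 0.45, "al-2": 0.80}
winner = run_campaign(["lit-1", "lit-2"], ["al-1", "al-2"], example_yields.get)
```

In this toy run neither literature recipe passes the threshold, so the loop falls through to the active-learning recipes and returns the first one with a yield above 50%.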

The A-Lab carries out experiments using three integrated stations for sample preparation, heating and characterization, with robotic arms transferring samples and labware between them (Fig. 1 and Extended Data Figs. 1 and 2). The first station dispenses and mixes precursor powders before transferring them into alumina crucibles. A robotic arm from the second station loads these crucibles into one of four available box furnaces to be heated (Methods). After allowing the samples to cool, another robotic arm transfers them to the third station, where they are ground into a fine powder and measured by XRD. The operations of the lab are controlled through an application programming interface, which enables on-the-fly job submission from human researchers or decision-making agents (Extended Data Fig. 3).

The phase and weight fractions of the synthesis products are extracted from their XRD patterns by probabilistic ML models trained on experimental structures from the Inorganic Crystal Structure Database (ICSD) following the methodology outlined in previous work^18,19. Because the target materials considered in this work have no experimental reports, their diffraction patterns are simulated from computed structures available in the Materials Project and corrected to reduce density functional theory (DFT) errors (Supplementary Note 1). For each sample, the phases identified by ML are confirmed with automated Rietveld refinement (Methods and Supplementary Note 2) and the resulting weight fractions are reported to the management server of the A-Lab to inform subsequent experimental iterations, if necessary, in search of an optimal recipe with high target yield.

Experimental synthesis outcomes

Using the described workflow, the A-Lab synthesized 41 of the 58 target compounds over 17 days of continuous experimentation, representing a 71% success rate. We show in the next section that this success rate could be improved to 74% with only minor modifications to the lab’s decision-making algorithm, and further to 78% if the computational techniques were also improved. The high success rate demonstrates that comprehensive ab initio calculations can be used to effectively identify new, stable and synthesizable materials. The outcome for all 58 compounds is plotted in Fig. 2 against their decomposition energies (on a log scale), a common thermodynamic metric that describes the driving force to form a compound from its neighbours on the phase diagram²⁰ (Supplementary Fig. 2). A negative (positive) decomposition energy indicates that a material is stable (metastable) at 0 K. Of the targets considered in this work, 50 are predicted to be stable, whereas the remaining eight are metastable but lie near the convex hull. Over the range of decomposition energies considered, we do not observe a clear correlation between decomposition energy and whether a material was successfully synthesized.

**Fig. 2: Outcomes from targeted syntheses of DFT-predicted materials.**

In total, 35 of the 41 materials synthesized by the A-Lab were obtained using recipes proposed by ML models trained on synthesis data from the literature (Supplementary Note 3). These literature-inspired recipes were more likely to succeed when the reference materials are highly similar to our targets (Supplementary Fig. 3), confirming that target ‘similarity’ is a useful metric to select effective precursors²¹. At the same time, precursor selection remains a highly nontrivial task, even for thermodynamically stable materials. Despite 71% of targets eventually being obtained, only 37% of the 355 synthesis recipes tested by the A-Lab produced their targets. This finding echoes previous work that has established the strong influence of precursor selection on the synthesis path, ultimately deciding whether it forms the target or becomes trapped in a metastable state^22,23,24,25.

The active-learning cycle of the A-Lab¹⁷ identified synthesis routes with improved yield for nine targets, of which six had zero yield from the initial literature-inspired recipes. Targets optimized with active learning are indicated by the bars containing diagonal lines in Fig. 2. In this framework, improved synthesis routes are designed using two hypotheses: (1) solid-state reactions tend to occur between two phases at a time (that is, pairwise)^26,27,28 and (2) intermediate phases that leave only a small driving force to form the target material should be avoided, as they often require long reaction time and high temperature^22,23,29.

The A-Lab continuously builds a database of pairwise reactions observed in its experiments—88 unique pairwise reactions (Supplementary Table 2) were identified from the synthesis experiments performed in this work. This database allows the products of some recipes to be inferred, precluding their testing; a recipe that yields an observed set of intermediates (already present in the lab’s database) need not be pursued at higher temperatures, as the remaining reaction pathway is already known (Fig. 3a,b). This can reduce the search space of possible synthesis recipes by up to 80% when many precursor sets react to form the same intermediates (Fig. 3e and Supplementary Notes 4 and 5). Furthermore, knowledge of reaction pathways can be used to give priority to intermediates with a large driving force to form the target, computed using formation energies available in the Materials Project (Fig. 3c,d). For example, the synthesis of CaFe₂P₂O₉ was optimized by avoiding the formation of FePO₄ and Ca₃(PO₄)₂, which have a small driving force (8 meV per atom) to form the target. This led to the identification of an alternative synthesis route that forms CaFe₃P₃O₁₃ as an intermediate, from which there remains a much larger driving force (77 meV per atom) to react with CaO and form CaFe₂P₂O₉, causing an approximately 70% increase in the yield of the target (Supplementary Note 6).
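As a rough illustration of the two ideas in this paragraph, the sketch below ranks candidate routes by the driving force that remains after the intermediates form, and flags a recipe as redundant when its intermediates are already in a database of observed pathways. The two driving-force values are the ones quoted above for CaFe₂P₂O₉; the data structures and function names are assumptions made for the example, not the A-Lab implementation.

```python
# Remaining driving force (meV per atom) to form CaFe2P2O9 from each set of
# intermediates. The two values are those quoted in the text.
remaining_driving_force = {
    ("FePO4", "Ca3(PO4)2"): 8,
    ("CaFe3P3O13", "CaO"): 77,
}


def rank_routes(driving_forces):
    """Sort intermediate sets from largest to smallest remaining driving force."""
    return sorted(driving_forces, key=driving_forces.get, reverse=True)


def is_redundant(intermediates, known_pathways):
    """A recipe is redundant if its intermediates were already observed.

    known_pathways is a hypothetical set of frozensets, one per set of
    intermediates whose onward reaction pathway is already recorded.
    """
    return frozenset(intermediates) in known_pathways


best_route = rank_routes(remaining_driving_force)[0]
known = {frozenset(["FePO4", "Ca3(PO4)2"])}
skip_recipe = is_redundant(["Ca3(PO4)2", "FePO4"], known)
```

Using frozensets makes the lookup independent of the order in which the intermediates are listed.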

**Fig. 3: Active learning with pairwise reaction analysis.**

Barriers to synthesis

Seventeen of the 58 targets evaluated by the A-Lab were not obtained even after its active-learning cycle. We identify slow reaction kinetics, precursor volatility, amorphization and computational inaccuracy as four broad categories of ‘failure modes’ that prevented the synthesis of these targets. The prevalence of each failure mode is shown in Fig. 4, accompanied by their affected targets.

**Fig. 4: Barriers to the synthesis of materials predicted to be stable.**

Sluggish reaction kinetics hindered 11 of the 17 failed targets, each containing reaction steps with low driving forces (<50 meV per atom; Supplementary Fig. 4). In principle, these targets can be made accessible by using a higher synthesis temperature, longer heating time, improved precursor mixing or intermittent regrinding—standard procedures that are at present outside the domain of the A-Lab’s active-learning algorithm. As such, we manually reground the original synthesis products generated by the A-Lab and heated them to higher temperatures, which led to the successful formation of two further targets, Y₃Ga₃In₂O₁₂ and Mg₃NiO₄, bringing our total success rate to 74% (Supplementary Note 7). One could also use more reactive precursors to provide a greater driving force to form the target, although our experiments were constrained to air-stable binary precursors that sometimes restricted the A-Lab’s choice of synthesis routes to those forming highly stable intermediates. System modifications to enable multistep heating, intermediate regrinding and expanded precursor selection should improve the ability of the lab to adapt and overcome failed synthesis attempts.

Precursor volatility disrupted all synthesis experiments targeting CaCr₂P₂O₉, causing a change in the net stoichiometry of its samples (Supplementary Note 8). This can be attributed to the use of ammonium phosphate precursors, NH₄H₂PO₄ and (NH₄)₂HPO₄, which proceed through a series of decomposition reactions and ultimately evaporate above 450 °C (ref. ³⁰). Still, recipes based on these precursors can succeed if the ammonium phosphate reacts with another precursor before its evaporation temperature, effectively locking the phosphate ions in the solid state. For example, volatility does not seem to be an issue for the Mn-containing phosphates targeted in this work, as each Mn oxide precursor reacts with the ammonium phosphates at low temperature (<500 °C) to form Mn₂(PO₄)₃ as an intermediate. This precursor behaviour can, in principle, be learned when sufficient pairwise reaction data have been collected, after which the A-Lab may favour the selection of precursors that trap phosphate ions at low temperature and therefore preclude unwanted volatility.

Melting of samples at high temperature inhibited the crystallization of one target, Mo(PO₃)₅, whose synthesis attempts produced amorphous samples (Supplementary Fig. 5). Although the use of a molten flux can sometimes improve reaction kinetics³¹, the formation of an amorphous state that is low in energy may reduce the driving force for crystallization. Indeed, using the workflow outlined in ref. ³², we identified amorphous configurations of Mo(PO₃)₅ with energies as low as 61 meV per atom above the crystalline ground state, a finding that is consistent with the widely reported glass-forming ability of phosphate-rich compounds^33,34.

Some failure modes result from inaccuracies in the computed stability of the target and therefore cannot be addressed by modifications to the experimental procedures. Fundamental electronic-structure challenges probably affect La₅Mn₅O₁₆, as all the attempts to synthesize this phase instead yielded LaMnO₃, which DFT unexpectedly predicts to be highly unstable (120 meV per atom above the hull), even though it is widely reported in the literature to be experimentally accessible³⁵. If the energy of LaMnO₃ were lowered, consistent with its experimental stability, La₅Mn₅O₁₆ would be destabilized (above the hull). Errors in the computed energy of LaMnO₃ may arise from its strong Jahn–Teller activity³⁶, compositional off-stoichiometry³⁷ or the presence of f-states in La, all of which present challenges to conventional DFT. Problems with YbMoO₄ were traced to a poor pseudopotential choice in the Materials Project that destabilizes the well-known oxide, Yb₂O₃, and it is likely that, in more accurate calculations, YbMoO₄ is not stable. A similar lanthanide-related electronic-structure problem may also be responsible for the failure to synthesize BaGdCrFeO₆. These examples demonstrate the ability of the A-Lab to provide important feedback to high-throughput computed datasets. With improved calculations that exclude the computationally problematic compounds in this work, our total success rate would increase to 78% (43/55 targets).
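The three success rates quoted in this article follow directly from the counts given in the text, as the short check below shows. The helper name `success_rate` is introduced only for this example.

```python
def success_rate(obtained: int, attempted: int) -> int:
    """Success rate as a percentage, rounded to the nearest integer."""
    return round(100 * obtained / attempted)


# 41 of 58 targets obtained autonomously.
autonomous = success_rate(41, 58)

# Two further targets obtained after manual regrinding and reheating.
with_regrinding = success_rate(41 + 2, 58)

# Same 43 successes, with the three computationally problematic targets
# (La5Mn5O16, YbMoO4, BaGdCrFeO6) removed from the denominator.
with_improved_calculations = success_rate(43, 58 - 3)
```

The three values evaluate to 71, 74 and 78, matching the percentages reported above.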

Outlook

In 17 days of closed-loop operation, the A-Lab performed 355 experiments and successfully realized 41 of 58 novel inorganic crystalline solids with diverse structures and chemistries. This unexpectedly high success rate (71%) for the synthesis of computationally predicted compounds was achieved by integrating robotics with: (1) DFT-computed data to survey the energetic landscape of precursors, reaction intermediates and final products; (2) heuristic suggestions for synthesis procedures obtained from ML models trained on text-mined synthesis data; (3) ML interpretation of experimental data; and (4) an active-learning algorithm that improves on failed synthesis procedures. The study also revealed several opportunities to enhance the lab’s active-learning algorithm by addressing failures caused by slow reaction kinetics, which would enable an improved success rate of 74% with in-line solutions.

Our paper demonstrates that autonomous research agents can markedly accelerate the pace of materials research. Researchers initialized the A-Lab by proposing 58 target materials, which were successfully realized at a rate of >2 new materials per day with minimal human intervention. Such rapid discovery points to a vast landscape of opportunities in materials synthesis and development. Although this work focused on a limited subset of all possible synthesis targets, many new candidates await evaluation. As the breadth of ab initio computations continues to grow, so will this list of novel materials.

Advances in simulations, ML and robotics have intersected to enable ‘expert systems’ that show autonomy as an emergent quality of the sum of their automated components. The A-Lab demonstrates this by combining modern theory-driven and data-driven ML techniques with a modular workflow that can discover novel materials with minimal human input. Lessons learned from continuing experiments can inform both the system itself and the greater community through systematic data generation and collection. The systematic nature of the A-Lab provides a unique opportunity to answer fundamental questions about the factors that govern the synthesizability of novel materials, serving as an experimental oracle to validate predictions made on the basis of data-rich resources such as the Materials Project. In future iterations of the platform, such an oracle may be expanded to investigate factors beyond synthesizability, including microstructure and device performance. Although our current success rate for the synthesis of novel compounds is high, the remaining discrepancies between current predictions and their experimental outcomes are a crucial signal required to improve our understanding of materials synthesis.

Methods

Materials screening

The 58 targets evaluated by the A-Lab were identified from the Materials Project database (version 2022.10.28). We first obtained all entries from the Materials Project that were marked as ‘theoretical’ (that is, not represented in the ICSD) and predicted to be thermodynamically stable (at 0 K) or very close to the convex hull (<10 meV per atom). We did not consider materials with ≤2 elements nor those containing elements that are radioactive (Ac, Th, Pa, U, Np, Pu, Tc), exceedingly rare (Pd, Pt, Rh, Ir, Au, Ru, Os, Re, Tl, Sc, Tm, Pm, Rb, Cs) or toxic (Hg, As). Owing to concerns with the experimental handling of certain materials systems (for example, sulfides), we constrained our selection to only include the following types of material: oxides, carbonates, bicarbonates, hydroxides, sulfates, sulfites, bisulfates, silicates, fluorides, chlorides, bromides, orthoborates, metaborates, tetraborates, phosphates, phosphites, chlorates, chlorites and hypochlorites. Finally, we removed all compounds predicted to have uncommon and potentially challenging oxidation states (for example, Co⁴⁺), as determined by pymatgen³⁸.
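A minimal sketch of the first screening filters is given below. It assumes each candidate is a plain dictionary with hypothetical keys (`elements`, `theoretical`, `e_above_hull` in eV per atom); it does not call the Materials Project API, and it omits the material-type and oxidation-state filters described above.

```python
RADIOACTIVE = {"Ac", "Th", "Pa", "U", "Np", "Pu", "Tc"}
RARE = {"Pd", "Pt", "Rh", "Ir", "Au", "Ru", "Os", "Re",
        "Tl", "Sc", "Tm", "Pm", "Rb", "Cs"}
TOXIC = {"Hg", "As"}
EXCLUDED = RADIOACTIVE | RARE | TOXIC

MAX_E_ABOVE_HULL = 0.010  # eV per atom, i.e. 10 meV per atom


def passes_screen(entry: dict) -> bool:
    """Apply the 'theoretical', hull-distance, element-count and element filters.

    The dictionary layout is an assumption made for this example.
    """
    elements = set(entry["elements"])
    return bool(
        entry["theoretical"]                         # not represented in the ICSD
        and entry["e_above_hull"] < MAX_E_ABOVE_HULL
        and len(elements) > 2                        # drop materials with <=2 elements
        and not (elements & EXCLUDED)                # drop excluded elements
    )


# Illustrative entries; the energies are placeholders, not database values.
candidates = [
    {"formula": "CaFe2P2O9", "elements": ["Ca", "Fe", "P", "O"],
     "theoretical": True, "e_above_hull": 0.0},
    {"formula": "MgO", "elements": ["Mg", "O"],
     "theoretical": True, "e_above_hull": 0.0},
    {"formula": "CsMnPO4", "elements": ["Cs", "Mn", "P", "O"],
     "theoretical": True, "e_above_hull": 0.0},
    {"formula": "NaFePO4", "elements": ["Na", "Fe", "P", "O"],
     "theoretical": True, "e_above_hull": 0.025},
]
kept = [c["formula"] for c in candidates if passes_screen(c)]
```

Only the first entry survives: the second has too few elements, the third contains an excluded element and the fourth lies too far above the hull.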

The novelty of each candidate material was verified by cross-checking with several experimental sources. We first removed all compositions that appeared in SynTERRA, a text-mined set of experimental synthesis data extracted from more than 24,000 publications³⁹. Furthermore, we removed any materials with compositions appearing in the ‘Handbook of Inorganic Substances’⁴⁰. Although these methods are not exhaustive, they provide an automated and high-throughput approach to screen for materials novelty. For the remaining 432 candidates that were labelled as previously unsynthesized using this workflow, we filtered by thermodynamic stability in air. This was done by calculating the formation energy of each compound in a grand potential with respect to oxygen, assuming standard atmospheric conditions (p_O2 = 21,200 Pa) and temperatures ranging from 600 to 1,100 °C. We further checked for reactivity with CO₂ and H₂O under those same conditions by using the Interface Reactions module in pymatgen^38,41. From the resulting list of 146 new compounds that were stable in air, we selected 58 targets for which precursors were readily available. Later in the process, we found literature evidence for a small number of these compounds, but most (52/58 compounds) are believed to have no previous reports (Supplementary Note 9).

The algorithm we used for identifying potential synthesis targets is available on GitHub (https://github.com/mattmcdermott/novel-materials-screening). It operates autonomously once given the following information: which elements to consider in the target materials, how large an upper limit to impose on each material’s energy above the convex hull, the atmospheric conditions under which the materials will be synthesized and a threshold on the reaction energies that exist between each material and the gaseous species present in the specified atmosphere. The algorithm then scrapes the Materials Project and produces a list of candidate materials that satisfy these criteria. Further filtering may be considered on the basis of the availability and cost of precursors for each target. Although this is done manually in the current version of the algorithm, potential improvements could automate the process by using online data from chemical inventory lists and vendor websites.

Synthesis recipes from text-mined knowledge

We have established a pipeline for recommending synthesis recipes by using a knowledge base of 33,343 solid-state synthesis procedures extracted from 24,304 publications¹⁶. For a given target, the initial recipe is selected on the basis of the most common precursors in the knowledge base. We then transition to a similarity-based strategy for recipe selection. Each target is transformed into a numerical vector by using a synthesis-context-based encoding model¹². The similarity between a given (new) target and each known material in the knowledge base is evaluated using the cosine similarity between their encoded vectors. After identifying the reference material that is most similar to the target, its precursors are included in the new recommendation. When these precursors do not cover all the elements in the target, we use a masked precursor completion model¹² to account for such missing precursors. Subsequent recommendations are implemented by moving down the list of known materials ranked to be most similar to the target.
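The similarity-based ranking can be illustrated as follows. The encoded vectors here are made-up three-component placeholders; in the actual pipeline they come from the synthesis-context-based encoding model, which this sketch does not reproduce. The names `cosine_similarity` and `rank_references` are introduced for the example.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_references(target_vec, knowledge_base):
    """Order known materials from most to least similar to the target."""
    return sorted(
        knowledge_base,
        key=lambda name: cosine_similarity(target_vec, knowledge_base[name]),
        reverse=True,
    )


# Placeholder encodings for three hypothetical reference materials.
knowledge_base = {
    "ref_A": [1.0, 0.0, 1.0],
    "ref_B": [1.0, 1.0, 0.0],
    "ref_C": [0.0, 1.0, 0.0],
}
target = [1.0, 0.0, 1.0]
ranking = rank_references(target, knowledge_base)
```

The precursors of the first material in `ranking` would seed the first similarity-based recommendation, and later recommendations would move down the list.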

For each set of recommended precursors, the most effective synthesis temperature is predicted using an XGBoost regressor trained in previous work¹¹. The target and its associated precursors are transformed into three sets of features: (1) precursor properties including melting points, standard enthalpies of formation and standard Gibbs free energies of formation; (2) target compositional features indicating which elements are present; and (3) the calculated thermodynamic driving force associated with pairwise reaction paths from precursors to target. Because the proposed synthesis temperature is dependent on the precursors, not just the target, it may vary for each recipe. However, to maximize the efficiency with which the A-Lab operates, we chose to use one fixed temperature for each target. This temperature was calculated by averaging the proposed synthesis temperatures for the top five precursor sets recommended for a given target. This allowed all such precursor sets to be batched in a single furnace.
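The averaging step is simple enough to state in code. In the sketch below the per-recipe temperatures are illustrative numbers, not model outputs, and `batch_temperature` is a name chosen for this example.

```python
from statistics import mean


def batch_temperature(proposed_temps_c):
    """Average the proposed temperatures of the top five precursor sets.

    proposed_temps_c is assumed to be ordered by recommendation rank.
    """
    return mean(proposed_temps_c[:5])


# Six hypothetical per-recipe predictions (degrees Celsius); only the
# first five contribute to the shared furnace temperature.
shared_temp = batch_temperature([900, 850, 1000, 950, 800, 1200])
```

Here the shared temperature is 900 °C, so all five precursor sets for the target can be heated together.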

Robotic synthesis and characterization

The A-Lab performs fully automated solid-state synthesis and characterization. It is a bespoke robotic platform that consists of a precursor preparation station with a central robot arm (Mitsubishi) for powder dispensing and mixing (custom-made with Labman Automation Ltd.), a high-temperature heating station with four box furnaces (based on F48055-60, Thermo Scientific, with custom actuators to control its door), a product-handling station developed in-house for powder retrieval and sample loading, a characterization station with a powder X-ray diffractometer (Aeris Minerals, Malvern Panalytical) and two collaborative robot arms (UR5e, Universal Robots) that transfer samples and labware between stations. Further details on the robotic platform are provided in Supplementary Note 10.

The synthesis process starts from the precursor preparation station, where the necessary consumables (plastic vials, ZrO₂ balls and crucibles) and precursor dosing bottles containing between 50 and 100 g of powders are manually loaded before starting a new experimental campaign. Prescribed amounts of the precursor powders are dispensed into a plastic vial by an automatic dispenser balance (Quantos, Mettler Toledo). The precursor powders are then mixed thoroughly with ethanol and ten 5-mm ZrO₂ balls in a dual asymmetric centrifuge (Smart DAC250, Hauschild) for 9 min. To ensure proper slurry viscosity, the ethanol amount is automatically calculated on the basis of the amount and density of each powder comprising the mixture. The resulting slurry is transferred with an automated pipettor (rLine LH-710969, Sartorius) into an alumina crucible, which is then dried at 80 °C in a closed evaporation system. A UR5e robot arm on a linear rail (Olympus Controls) removes the dried samples from the precursor preparation station and loads them into one of four box furnaces. Heating is performed in batches, with each furnace containing up to eight samples on an alumina tray. Each batch is heated to 300 °C with a slow ramping rate of 2 °C min⁻¹ to raise the likelihood that any phosphate precursor has time to react before it becomes volatile at higher temperature. The samples are then heated to the specified synthesis temperature with a nominal ramp rate of 15 °C min⁻¹, followed by a 4-h dwell. After the dwell is complete, the samples are naturally cooled to 100 °C, at which point a UR5e arm removes the samples from the furnace and waits another 10 min to allow the samples to cool to room temperature.
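The heating profile above implies a furnace time that depends only on the synthesis temperature. The following sketch estimates it, assuming the samples start at 25 °C (the text does not state the starting temperature) and ignoring the natural cooling step, whose duration is not given.

```python
def heating_minutes(synthesis_temp_c, start_temp_c=25.0):
    """Minutes spent ramping and dwelling for one furnace batch.

    start_temp_c is an assumption; ramp rates and dwell follow the text.
    """
    slow_ramp = max(0.0, 300.0 - start_temp_c) / 2.0       # 2 C/min up to 300 C
    fast_ramp = max(0.0, synthesis_temp_c - 300.0) / 15.0  # 15 C/min above 300 C
    dwell = 4 * 60                                         # 4-h dwell
    return slow_ramp + fast_ramp + dwell


minutes_at_900 = heating_minutes(900.0)
```

For a 900 °C synthesis this gives 137.5 min for the slow ramp, 40 min for the fast ramp and 240 min of dwell, or 417.5 min in total. Under these assumptions the slow initial ramp takes longer than the entire climb from 300 °C to the synthesis temperature.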

A separate UR5e arm transfers the cooled samples to the next station for powder retrieval and characterization. There, an automatic ball dispenser developed in-house places a 10-mm alumina ball in each crucible, which is then sent to a vertical shaker that grinds the sample into a fine powder. The resulting powders are then poured by the UR5e arm from the crucibles into a clean plastic vial covered with a steel mesh. By inverting the container, the powder is dispensed through the mesh onto an XRD sample holder and subsequently flattened with an acrylic disc. The UR5e arm transfers each flattened sample into the diffractometer for X-ray measurements, which are performed using 8-min scans that range from 2θ = 10° to 100°. The XRD sample holders must be cleaned manually when the lab has depleted its stock. Precursor powders should also be refilled or replaced, when necessary, although this can be performed without stopping the workflow of the lab.

Phase analysis

Given an XRD pattern obtained from an unknown sample, we apply XRD-AutoAnalyzer to identify the constituent phases and estimate their weight fractions¹⁸. This algorithm relies on a convolutional neural network (CNN) consisting of six convolutional layers, with max pooling applied between each, followed by three fully connected layers with ReLU activation. Batch normalization and a dropout rate of 50% are applied between the fully connected layers for regularization. At inference, we apply Monte Carlo dropout to create an ensemble of 100 networks with 50% of their connections randomly excluded. The final prediction is taken as the phase that appears most frequently in the ensemble and its associated confidence is defined as the fraction of models that predict it.
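The voting rule in the last sentence can be written down without any neural-network code. The sketch below takes the per-network predictions as given (here a hand-made list of 100 labels) and returns the majority phase with its confidence. It covers only the single-phase vote, not the CNN or the full multiphase workflow of XRD-AutoAnalyzer, and `ensemble_vote` is a name chosen for this example.

```python
from collections import Counter


def ensemble_vote(predictions):
    """Return (most frequent phase, fraction of ensemble members predicting it)."""
    counts = Counter(predictions)
    phase, votes = counts.most_common(1)[0]
    return phase, votes / len(predictions)


# Hypothetical outputs from 100 dropout-perturbed networks.
mc_predictions = ["LaMnO3"] * 83 + ["La2O3"] * 17
phase, confidence = ensemble_vote(mc_predictions)
```

With 83 of 100 networks agreeing, the predicted phase is reported with a confidence of 0.83.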

A unique model instance is trained on the chemical space defined by each target. Experimental-structure entries with elements shared by the given target are extracted from the ICSD, also including carbonates and hydroxides. For the DFT-calculated target, we apply a machine-learned volume correction to its lattice parameters (Supplementary Note 1) before including it in the training set. From each reference phase, 200 diffraction patterns are simulated with stochastic variations derived from experimental artefacts including lattice strain, crystallographic texture, impurity peaks and poor crystallinity. These augmented patterns are used to train the CNN for 50 epochs, after which the network is ready for the analysis of novel patterns.

To confirm the predictions of the CNN, we use an automated approach to multiphase Rietveld refinement. An agent with two deep neural networks (actor/critic) was trained using reinforcement learning based on a proximal policy optimization algorithm⁴² implemented in a custom gym environment⁴³ that interacts with the GSAS-II software package⁴⁴ through a scripting interface⁴⁵ (Supplementary Note 2). The environment is initialized by refining the background, followed by the scale factor and sample displacement. After initialization is performed on the basis of these parameters, the algorithm freely refines the lattice parameters, phase fractions, isotropic microstrains and particle sizes. For each step in the refinement, our algorithm decides which parameters to refine and/or reset to the initial values, with the objective of minimizing the difference between the calculated and the experimentally observed patterns.

When the automated refinement gave a poor fit, manual analysis was performed. For targets for which we suspected that the poor fit resulted from configurational disorder, we refined the XRD patterns using cation-disordered versions of the target’s structure taken from the Materials Project. The cations allowed to be exchanged (disordered) with one another were selected on the basis of the Hume-Rothery rules, as detailed in previous work¹⁸. Such cases were still considered successful as long as the disordered version of the target retained the same crystal structure and overall composition as the ordered version.

Active-learning algorithm

Active learning is performed using ARROWS³, our recently developed algorithm that learns from previous experimental outcomes to identify improved reaction pathways. Given the products obtained from a set of precursors proposed by our natural-language models at temperature T_NLP, ARROWS³ first suggests that a lower temperature (T_NLP − 300 °C) be tested for the same precursor set. The intent of this approach is to reveal which intermediate phases lead to the formation of each impurity observed at higher temperature. From the low-temperature-synthesis outcome, information is extracted about the pairwise reactions that occurred, including those between the precursors (to form the observed intermediates), as well as those between the intermediates (to form the high-temperature impurities). New synthesis experiments are then proposed on the basis of sets of precursors expected to avoid such reactions, giving priority to those with a maximal thermodynamic driving force to form the target. The driving force is calculated as the free-energy difference between a target and its associated precursors, in which all solid energies (at 0 K) are extracted from the Materials Project and corrected using a machine-learned descriptor that accounts for vibrational-entropy contributions at the specified temperature⁴⁶.
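The prioritization described above can be sketched as a filter followed by a sort. This is a minimal illustration and not the ARROWS³ implementation: the precursor labels and reaction energies are hypothetical placeholders, and the driving force is represented as a reaction free energy in which a more negative value means a larger driving force.

```python
from itertools import combinations


def rank_precursor_sets(candidates, impurity_pairs):
    """Rank precursor sets by driving force, skipping known-bad pairs.

    candidates: dict mapping a tuple of precursor names to the reaction
        free energy to form the target (more negative = larger driving force).
    impurity_pairs: iterable of 2-tuples of phases whose pairwise reaction
        was observed to form an unwanted impurity.
    """
    blocked = {frozenset(pair) for pair in impurity_pairs}
    allowed = []
    for precursors, delta_g in candidates.items():
        pairs = {frozenset(c) for c in combinations(precursors, 2)}
        if pairs & blocked:
            continue  # this set contains a pair known to form an impurity
        allowed.append((precursors, delta_g))
    return sorted(allowed, key=lambda item: item[1])


# Hypothetical candidates and one learned impurity-forming pair.
candidates = {("P1", "P2"): -0.10, ("P1", "P3"): -0.25, ("P2", "P3"): -0.30}
ranking = rank_precursor_sets(candidates, impurity_pairs=[("P3", "P2")])
print(ranking)  # [(('P1', 'P3'), -0.25), (('P1', 'P2'), -0.1)]
```

Although the set ("P2", "P3") has the largest driving force in this example, it is excluded because its two members are recorded as reacting to form an impurity.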

After testing a precursor set at low temperature (T_NLP − 300 °C), iteratively higher temperatures (ΔT = 100 °C) are examined until the target is obtained with a yield exceeding 50% or until the temperature reaches T_NLP. At each step, the algorithm determines which pairwise reactions occurred and records them in a database that is referred to throughout all other experiments performed by the A-Lab. In subsequent iterations, ARROWS³ gives priority to sets of precursors containing pairs of phases that are expected to form the desired target, while avoiding pairs that form unwanted impurities. Moreover, to avoid testing redundant synthesis routes for which different precursors form identical products, the algorithm checks whether the low-temperature (T_NLP − 300 °C) intermediates obtained from a given precursor set differ from those obtained with previous (unsuccessful) recipes. If not, then no further experiments are proposed for that set of precursors. This process is repeated until the target is successfully obtained or until all the available precursor sets have been exhausted. Further details on the active-learning process are provided in Supplementary Notes 4–6 and 11.
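The temperature schedule and the redundancy check can be sketched as follows. This is an illustrative reading of the procedure rather than the ARROWS³ code: `run_experiment` is a hypothetical callback standing in for a synthesis plus XRD analysis, the schedule is assumed to include T_NLP as its final step, and the return values are invented for the example.

```python
def temperature_schedule(t_nlp, offset=300, step=100):
    """Temperatures (degrees C) from t_nlp - offset up to t_nlp inclusive."""
    return list(range(t_nlp - offset, t_nlp + 1, step))


def explore_precursor_set(t_nlp, run_experiment, failed_intermediates,
                          min_yield=0.5):
    """Test one precursor set at increasing temperature.

    run_experiment(temperature) -> (target_yield, phases) is hypothetical.
    failed_intermediates is a set of frozensets holding the low-temperature
    intermediates of earlier unsuccessful recipes; it is updated in place.
    """
    low_temp_phases = None
    for index, temperature in enumerate(temperature_schedule(t_nlp)):
        target_yield, phases = run_experiment(temperature)
        if index == 0:
            low_temp_phases = frozenset(phases)
            if low_temp_phases in failed_intermediates:
                return ("redundant", None)  # same route as a failed recipe
        if target_yield > min_yield:
            return ("success", temperature)
    failed_intermediates.add(low_temp_phases)
    return ("failed", None)


# Hypothetical outcome table for one precursor set with T_NLP = 900.
yields = {600: 0.1, 700: 0.3, 800: 0.6, 900: 0.9}
result = explore_precursor_set(900, lambda t: (yields[t], {"I1", "I2"}), set())
print(result)  # ('success', 800)
```

Recording the low-temperature intermediates only for failed recipes lets a later precursor set that produces the same intermediates be dropped after a single experiment.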

Data availability

All data generated during this study are included in the Supplementary Information. This includes the refined XRD patterns for each successful synthesis outcome, as well as their associated structure files.

Code availability

The screening algorithm we used for identifying potential synthesis targets is available at https://github.com/mattmcdermott/novel-materials-screening. The Python scripts and machine-learning models used to propose literature-inspired synthesis recipes can be found online at https://github.com/CederGroupHub/SynthesisSimilarity and https://github.com/CederGroupHub/s4 for precursor and temperature selection, respectively. The methods for XRD analysis are available at https://github.com/njszym/XRD-AutoAnalyzer. Active learning was performed using a package found at https://github.com/njszym/ARROWS.

References

Jain, A., Shin, Y. & Persson, K. A. Computational predictions of energy materials using density functional theory. Nat. Rev. Mater. 1, 15004 (2016).
Sun, J. et al. Accurate first-principles structures and energies of diversely bonded systems from an efficient density functional. Nat. Chem. 8, 831–836 (2016).
Nikolaev, P. et al. Autonomy in materials research: a case study in carbon nanotube growth. NPJ Comput. Mater. 2, 16031 (2016).
Chang, J. et al. Efficient closed-loop maximization of carbon nanotube growth rate using Bayesian optimization. Sci. Rep. 10, 9040 (2020).
MacLeod, B. P. et al. Self-driving laboratory for accelerated discovery of thin-film materials. Sci. Adv. 6, eaaz8867 (2020).
Burger, B. et al. A mobile robotic chemist. Nature 583, 237–241 (2020).
Ludwig, A. Discovery of new materials using combinatorial synthesis and high-throughput characterization of thin-film materials libraries combined with computational methods. NPJ Comput. Mater. 5, 70 (2019).
Ren, Z. et al. Embedding physics domain knowledge into a Bayesian network enables layer-by-layer process innovation for photovoltaics. NPJ Comput. Mater. 6, 9 (2020).
Sun, S. et al. A data fusion approach to optimize compositional stability of halide perovskites. Matter 4, 1305–1322 (2021).
Li, J. et al. Synthesis of many different types of organic small molecules using one automated process. Science 347, 1221–1226 (2015).
Kitson, P. J. et al. Digitization of multistep organic synthesis in reactionware for on-demand pharmaceuticals. Science 359, 314–319 (2018).
Coley, C. W. et al. A robotic platform for flow synthesis of organic compounds informed by AI planning. Science 365, eaax1566 (2019).
Manzano, J. S. et al. An autonomous portable platform for universal chemical synthesis. Nat. Chem. 14, 1311–1318 (2022).
Jain, A. et al. Commentary: The Materials Project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
He, T. et al. Inorganic synthesis recommendation by machine learning materials similarity from scientific literature. Sci. Adv. 9, eadg8180 (2023).
Huo, H. et al. Machine-learning rationalization and prediction of solid-state synthesis conditions. Chem. Mater. 34, 7323–7336 (2022).
Szymanski, N. J., Nevatia, P., Bartel, C. J., Zeng, Y. & Ceder, G. Autonomous and dynamic precursor selection for solid-state materials synthesis. Nat. Commun. https://doi.org/10.1038/s41467-023-42329-9 (2023).
Szymanski, N. J., Bartel, C. J., Zeng, Y., Tu, Q. & Ceder, G. Probabilistic deep learning approach to automate the interpretation of multi-phase diffraction spectra. Chem. Mater. 33, 4204–4215 (2021).
Szymanski, N. J. et al. Adaptively driven X-ray diffraction guided by machine learning for autonomous phase identification. NPJ Comput. Mater. 9, 31 (2023).
Bartel, C. J. Review of computational approaches to predict the thermodynamic stability of inorganic solids. J. Mater. Sci. 57, 10475–10498 (2022).
He, T. et al. Similarity of precursors in solid-state synthesis as text-mined from scientific literature. Chem. Mater. 32, 7861–7873 (2020).
Miura, A. et al. Selective metathesis synthesis of MgCr₂S₄ by control of thermodynamic driving forces. Mater. Horiz. 7, 1310–1316 (2020).
Bianchini, M. et al. The interplay between thermodynamics and kinetics in the solid-state synthesis of layered oxides. Nat. Mater. 19, 1088–1095 (2020).
Aykol, M., Montoya, J. H. & Hummelshøj, J. Rational solid-state synthesis routes for inorganic materials. J. Am. Chem. Soc. 143, 9244–9259 (2021).
Martinolich, A. J. & Neilson, J. R. Toward reaction-by-design: achieving kinetic control of solid state chemistry with metathesis. Chem. Mater. 29, 479–489 (2017).
Miura, A. et al. Observing and modeling the sequential pairwise reactions that drive solid-state ceramic synthesis. Adv. Mater. 33, 2100312 (2021).
Cordova, D. L. M. & Johnson, D. C. Synthesis of metastable inorganic solids with extended structures. ChemPhysChem 21, 1345–1368 (2020).
Malkowski, T. F. et al. Role of pairwise reactions on the synthesis of Li_0.3La_0.57TiO₃ and the resulting structure–property correlations. Inorg. Chem. 60, 14831–14843 (2021).
Todd, P. K. et al. Selectivity in yttrium manganese oxide synthesis via local chemical potentials in hyperdimensional phase space. J. Am. Chem. Soc. 143, 15185–15194 (2021).
Pardo, A., Romero, J. & Ortiz, E. High-temperature behaviour of ammonium dihydrogen phosphate. J. Phys. Conf. Ser. 935, 012050 (2017).
Gupta, S. K. & Mao, Y. Recent developments on molten salt synthesis of inorganic nanomaterials: a review. J. Phys. Chem. C 125, 6508–6533 (2021).
Aykol, M., Dwaraknath, S. S., Sun, W. & Persson, K. A. Thermodynamic limit for synthesis of metastable inorganic materials. Sci. Adv. 4, eaaq0148 (2018).
Bridge, B. & Patel, N. D. The elastic constants and structure of the vitreous system Mo-P-O. J. Mater. Sci. 21, 1186–1205 (1986).
Muñoz, F. & Sánchez-Muñoz, L. The glass-forming ability explained from local structural differences by NMR between glasses and crystals in alkali metaphosphates. J. Non-Cryst. Solids 503–504, 94–97 (2019).
Norby, P., Krogh Andersen, I. G., Andersen, E. K. & Andersen, N. H. The crystal structure of lanthanum manganate(iii), LaMnO₃, at room temperature and at 1273 K under N₂. J. Solid State Chem. 119, 191–196 (1995).
Kim, Y.-J., Park, H.-S. & Yang, C.-H. Raman imaging of ferroelastically configurable Jahn–Teller domains in LaMnO₃. NPJ Quantum Mater. 6, 62 (2021).
Alonso, J. A. et al. Non-stoichiometry, structural defects and properties of LaMnO_3+δ with high δ values (0.11≤δ≤0.29). J. Mater. Chem. 7, 2139–2144 (1997).
Ong, S. P. et al. Python Materials Genomics (pymatgen): a robust, open-source python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Kononova, O. et al. Text-mined dataset of inorganic materials synthesis recipes. Sci. Data 6, 203 (2019).
Villars, P., Cenzual, K. & Gladyshevskii, R. Handbook of Inorganic Substances (De Gruyter, 2017).
Richards, W. D., Miara, L. J., Wang, Y., Kim, J. C. & Ceder, G. Interface stability in solid-state batteries. Chem. Mater. 28, 266–273 (2016).
Schulman, J., Wolski, F., Dhariwal, P., Radford, A. & Klimov, O. Proximal policy optimization algorithms. Preprint at https://arxiv.org/abs/1707.06347 (2017).
Brockman, G. et al. OpenAI Gym. Preprint at https://arxiv.org/abs/1606.01540 (2016).
Toby, B. H. & Von Dreele, R. B. GSAS-II: the genesis of a modern open-source all purpose crystallography software package. J. Appl. Cryst. 46, 544–549 (2013).
O’Donnell, J. H., Von Dreele, R. B., Chan, M. K. Y. & Toby, B. H. A scripting interface for GSAS-II. J. Appl. Cryst. 51, 1244–1250 (2018).
Bartel, C. J. et al. Physical descriptor for the Gibbs energy of inorganic crystalline solids and temperature-dependent materials chemistry. Nat. Commun. 9, 4168 (2018).

Acknowledgements

This work was primarily financed by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Materials Sciences and Engineering Division under contract no. DE-AC02-05CH11231 (D2S2 programme, KCD2S2) and the Laboratory Directed Research and Development Program of Lawrence Berkeley National Laboratory. Development of the active-learning algorithms, compound-discovery methods and equipment acquisition were supported by the Materials Project programme (KC23MP). Machine-learning techniques for the interpretation of XRD patterns were developed by the Joint Center for Energy Storage Research programme JCESR 2.0 under contract no. DE-AC02-05CH11231. Computations were performed using the National Energy Research Scientific Computing Center (NERSC), a U.S. Department of Energy Office of Science User Facility supported by the Office of Science and the U.S. Department of Energy under contract no. DE-AC02-05CH11231. Work done at UC Berkeley was supported by Umicore Specialty Oxides and Chemicals. N.J.S. was supported in part by the National Science Foundation Graduate Research Fellowship under grant no. 1752814. We thank Labman Automation for their role in the design and construction of hardware for precursor preparation. We also thank M. Sargent at Berkeley Lab for capturing photos of the A-Lab.

Author information

These authors contributed equally: Nathan J. Szymanski, Bernardus Rendy, Yuxing Fei, Rishi E. Kumar

Authors and Affiliations

Department of Materials Science and Engineering, University of California, Berkeley, Berkeley, CA, USA
Nathan J. Szymanski, Bernardus Rendy, Yuxing Fei, Tanjin He, Matthew J. McDermott, Max Gallant, Kristin Persson & Gerbrand Ceder
Materials Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Nathan J. Szymanski, Bernardus Rendy, Yuxing Fei, Tanjin He, David Milsted, Matthew J. McDermott, Max Gallant, Haegyeom Kim, Christopher J. Bartel, Kristin Persson, Yan Zeng & Gerbrand Ceder
Energy Technologies Area, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Rishi E. Kumar & Anubhav Jain
Google DeepMind, London, UK
Ekin Dogus Cubuk & Amil Merchant

Contributions

N.J.S. developed the algorithms for data analysis and decision-making. N.J.S. and B.R. developed the methods used to correct DFT-calculated lattice parameters. B.R. and Y.F. built the lab’s hardware and developed the refinement algorithm. R.E.K. and Y.F. designed the control software and its integration with the hardware. T.H. created the algorithms for literature-inspired synthesis recipe recommendation. D.M. assisted in hardware development. M.J.M. and M.G. built the filtering pipeline for novel-materials identification. E.D.C. and A.M. applied the filtering from Google DeepMind. C.J.B. assisted in planning the A-Lab’s setup, developing the algorithms for analysis and decision-making, and modelling the lab’s throughput. H.K. and A.J. supervised hardware and software development, respectively. K.P. supervised the contributions from the Materials Project. Y.Z. and G.C. conceived and supervised all of the main aspects of the project.

Corresponding authors

Correspondence to Yan Zeng or Gerbrand Ceder.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 A-Lab hardware setup.

Detailed overview of all physical components in the A-Lab, including the stations for precursor preparation, heating and product handling for XRD characterization.

Extended Data Fig. 2 Robotic installations for sample transfer in the A-Lab.

Grippers on the UR5e robotic arms that are used for sample preparation (a), loading/unloading of crucible racks to/from the box furnaces (b) and sample retrieval and characterization (c). d, Linear rail used to increase the working envelope of the robotic arm that loads/unloads crucible racks to/from the furnaces. e, Carousel used to organize and move samples in the sample preparation station.

Extended Data Fig. 3 Communication protocols connecting each module in the A-Lab.

A local area network (LAN) is built to connect all the pieces of the A-Lab with a control computer using an RS-485 interface (or DB25 for the box furnaces). Each module on the RS-485 interface is assigned an Internet Protocol (IP) address to enable communication with the computer. For enhanced cybersecurity, only the control computer has access to the internet, whereas the LAN is isolated from it.

Supplementary information

Supplementary Information

This file contains the Supplementary Notes and Figures, which provide a more detailed overview of the hardware specifications for the A-Lab, its decision-making process, the pairwise reactions learned from this process and the synthesis modifications needed to obtain targets that could not be made by the A-Lab. The file also contains a description of the targets evaluated by the A-Lab.

Peer Review File

Supplementary Data

This file contains the refined X-ray diffraction data from the successful syntheses performed by the A-Lab. The corresponding crystal structures used during refinement are also included in CIF format.

Supplementary Video 1

Robot arm R1 (Mitsubishi) handling powders and slurries in the sample preparation station used to dispense and mix precursors before heating. The video is played at 20× speed.

Supplementary Video 2

Robot arm R2 (UR5e) moving crucibles from the sample preparation station to the box furnaces. The video is played at 20× speed.

Supplementary Video 3

Robot arm R3 (UR5e) retrieving powder samples (post-annealing) and cooperating with an Aeris X-ray diffractometer for their characterization. The video is played at 12× speed.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Szymanski, N.J., Rendy, B., Fei, Y. et al. An autonomous laboratory for the accelerated synthesis of novel materials. Nature 624, 86–91 (2023). https://doi.org/10.1038/s41586-023-06734-w

Received: 16 May 2023
Accepted: 10 October 2023
Published: 29 November 2023
Issue Date: 07 December 2023
DOI: https://doi.org/10.1038/s41586-023-06734-w
