Machine learning-assisted crystal engineering of a zeolite

Li, Xinyu; Han, He; Evangelou, Nikolaos; Wichrowski, Noah J.; Lu, Peng; Xu, Wenqian; Hwang, Son-Jong; Zhao, Wenyang; Song, Chunshan; Guo, Xinwen; Bhan, Aditya; Kevrekidis, Ioannis G.; Tsapatsis, Michael

doi:10.1038/s41467-023-38738-5

Download PDF

Article
Open access
Published: 31 May 2023

Machine learning-assisted crystal engineering of a zeolite

Nature Communications volume 14, Article number: 3152 (2023) Cite this article

5941 Accesses
7 Citations
2 Altmetric
Metrics details

Subjects

Abstract

It is shown that Machine Learning (ML) algorithms can usefully capture the effect of crystallization composition and conditions (inputs) on key microstructural characteristics (outputs) of faujasite type zeolites (structure types FAU, EMT, and their intergrowths), which are widely used zeolite catalysts and adsorbents. The utility of ML (in particular, Geometric Harmonics) toward learning input-output relationships of interest is demonstrated, and a comparison with Neural Networks and Gaussian Process Regression, as alternative approaches, is provided. Through ML, synthesis conditions were identified to enhance the Si/Al ratio of high purity FAU zeolite to the hitherto highest level (i.e., Si/Al = 3.5) achieved via direct (not seeded), and organic structure-directing-agent-free synthesis from sodium aluminosilicate sols. The analysis of the ML algorithms’ results offers the insight that reduced Na₂O content is key to formulating FAU materials with high Si/Al ratio. An acid catalyst prepared by partial ion exchange of the high-Si/Al-ratio FAU (Si/Al = 3.5) exhibits improved proton reactivity (as well as specific activity, per unit mass of catalyst) in propane cracking and dehydrogenation compared to the catalyst prepared from the previously reported highest Si/Al ratio (Si/Al = 2.8).

Synthesis strategies and design principles for nanosized and hierarchical zeolites

Article 27 June 2022

Machine-learning informed prediction of high-entropy solid solution formation: Beyond the Hume-Rothery rules

Article Open access 07 May 2020

Rapidly predicting Kohn–Sham total energy using data-centric AI

Article Open access 24 August 2022

Introduction

Zeolites are crystalline, porous aluminosilicate molecular sieves with uniform pores of molecular dimensions that are widely used in industrial applications such as catalysis, adsorption, membrane separation and ion exchange^1,2,3,4,5,6. Their performance (sorption capacity, catalytic activity, selectivity, stability) depends on a hierarchy of microstructural characteristics. In addition to the framework topology (represented by a three-letter code)⁷, framework composition (i.e., the atomic Si/Al ratio of the tetrahedra in the framework) and extra-framework cation content, numerous other characteristics can be tuned to optimize the performance of a zeolite including crystallographic positions of Si and Al atoms⁸, crystallographic location of extra-framework cations⁹, crystal size and shape^10,11,12, extent of crystallite aggregation comprising a zeolite particle^13,14, the presence of meso-porosity^15,16, the occurrence and frequency of intergrowths with related framework types^17,18, and other types of defects like Si or Al framework vacancies and associated silanol nests¹⁹, and pore blockages by extra-framework matter^20,21.

These microstructural characteristics are the output of a batch crystallization process, whose inputs include the chemical composition of the mixture, the chemicals and sequence of steps used to prepare this mixture, the temperature and time of crystallization, and the extent of mixing during crystallization (e.g., static or rotating autoclaves)^21,22. Additional variations further expand the range of synthesis inputs that can affect the crystallization output. For example, mid-synthesis changes in composition and temperature during crystallization can have a significant effect on crystal size and framework type^11,13. Crystallization mixtures used for zeolite synthesis contain species varying from small ions to colloidal particles and gels, the interconversions and interactions of which cannot be predicted quantitatively²³. Therefore, the ability to determine the effect of crystallization inputs on the microstructural outcome (output) is very limited, and microstructural optimization requires a large number of experiments exploring all possible input combinations^24,25. Here, it is demonstrated that Machine Learning algorithms can be used to quantitatively capture the effect of crystallization inputs on key microstructural characteristics (outputs) of faujasite, which is widely used as a catalyst in fluid catalytic cracking and as an adsorbent for oxygen/nitrogen separation^21,26,27. Comprehensive combinations of crystal morphologies, composition and phase purity are reported, and improved catalytic properties are demonstrated.

Results

The focus is on the synthesis of the zeolite faujasite, and we aim to prepare faujasite crystals with a combination of characteristics (outputs): Si/Al ratio, crystal size, particle size, FAU/EMT ratio, microporosity. Figure 1 summarizes experiments performed initially to outline the region in composition space (details are provided in Supplementary Fig. 1), which results in pure faujasite (i.e., FAU, EMT or FAU/EMT intergrowths). The initial selection of the synthesis region in Fig. 1 is based on our prior works^13,28 and prior work by Rimer et al. ²⁹, which empirically explored and broadened the boundaries of faujasite synthesis conditions. Within this region, we performed 174 synthesis experiments. From these, 86 experiments (indicated by A1-A86 in Fig. 1, Supplementary Fig. 1, and Supplementary Table 2) did not produce pure FAU or FAU/EMT, and these entries are excluded from further analysis. The remaining 88 experiments (indicated by 1-88 in Fig. 1, Supplementary Fig. 1, and Supplementary Table 1) were used for training (81 entries) and testing (7 entries) of the ML algorithm (the latter suggests 4 more entries as prediction points), except when analyzing crystal size, where we have excluded crystal sizes larger than 60 nm and used 46 experiments (42 entries for training, and 4 entries for testing).

**Fig. 1: Explored region in composition space aiming at pure FAU and FAU/EMT zeolite synthesis.**

Our synthesis involves 5 parameters representing the crystallization mixture composition (x, y, z, m, n) x SiO₂: y Al₂O₃: z Na₂O: m H₂O_initial (n H₂O_final). The initial and final water contents indicate the water present during the aging and crystallization steps, respectively. In some synthesis experiments they are the same, i.e., there is no adjustment of the water content, while in others the water content is reduced by freeze drying to set the ratio of H₂O_final/ H₂O_initial equal to ca. 0.47. In all experiments, we use Al₂O₃ content as a basis by setting it equal to 1. We have then four independent parameters to describe the composition of our mixtures, i.e., the relative ratios of Na₂O/Al₂O₃, SiO₂/Al₂O₃, H₂O_final/Na₂O, and H₂O_final/H₂O_initial. Figure 1 shows all experiments in plots of H₂O_final/Na₂O versus Na₂O/Al₂O₃ for different SiO₂/Al₂O₃ ratios (SiO₂/Al₂O₃ = 12, <12, 14, and >12 (other than 14) in Fig. 1(a–d), respectively, and details are provided in Supplementary Fig. 1), and reflects the two H₂O_final/H₂O_initial levels of 1 and 0.47. In addition to the four independent parameters needed to describe the composition of the mixture, we have five other synthesis parameters: the source of silica, the source of alumina, the type of oven used (rotation vs. static autoclave, or oil bath), and the crystallization time and temperature (for all experiments aging was performed at 25 °C for 24 h with stirring).

In total, we have 9 independent parameters (inputs) that describe the synthesis (processing) conditions. For the 88 experiments that gave FAU or FAU/EMT with no other phases, we determined 5 microstructural characteristics (structure): the Si/Al ratio by ICP, the particle size by TEM and/or SEM images, the crystal size from XRD peak broadening (in our analysis, we only considered crystal sizes smaller than 60 nm), the degree of intergrowth represented as FAU/(FAU + EMT) (determined by analysis of XRD data), and the Ar adsorption at p/p₀ = 0.01 as an indication of the microporosity. These five quantities/microstructural characteristics represent the outputs of the crystallization process; we have also considered as a separate, sixth output, the ratio of particle over crystal size (for crystal sizes smaller than 60 nm), as a measure of the level of aggregation. The characterization results for the 88 experiments are presented in Section S2 (Tables S3–S25 and Supplementary Figs. S2–S24).

In many branches of materials science, both experimental and computational (including metal additive manufacturing³⁰, polymer science³¹, and drug design³² and delivery³³), there have been extensive studies of structure-property relations with the help of ML. An important ingredient of these is the knowledge—whether from first principles, experience, or intuition—of the appropriate structural features that correlate the properties of interest. ML holds the promise of turning such correlations from an informed art to a reliable, data-driven, computer assisted process; learning such correlations can then lead to the educated design and optimization of developed materials^33,34,35. The processing/fabrication of materials with desired structure is an equally (if not more challenging) problem; data science and ML have the potential to be transformative in deriving processing-structure relations, leading to breakthroughs in ultimately establishing the ideal “processing-structure-property” pathway to materials design^{30,31,32,33,34,36,37,38,39}.

ML algorithms have proven useful for predicting both quantitative and discrete characteristics of various zeolitic materials. Carr et al.⁴⁰ constructed a classifier based on the topology of zeolites into different mineral types and framework types^41,42. Coudert et al.^43,44 used ML algorithms on data from DFT computations to construct a link between structural properties and mechanical properties. Moliner et al.⁴⁵ discussed the potential of ML in zeolites synthesis (a) in the construction of high throughput platforms, (b) in the prediction of stable structures for zeolites and guidance of the zeolite synthesis involved with different structures, and (c) in automated data extraction. Gurney et al.⁴⁶ presented different ML tools that can be key elements for an ML-based design and discovery of zeolites and other crystalline materials. Ducamp et al.³⁸ used DFT data to construct a structure-property relation between features of the geometry, topology, and porosity of the zeolitic materials, and their thermal properties. Jensen et al.⁴⁷ built a text mining pipeline for extracting zeolite synthesis data from a database of ~70,000 relevant journal articles. They further constructed, through ML, an input-output relationship between synthesis conditions and framework density of zeolitic materials.

Here, we study the synthesis (ingredients, composition, processing conditions, operating protocols) leading to the fabrication of faujasite zeolite; and explore the capabilities and avenues that ML opens toward the optimization of desired microstructural characteristics like the framework Si/Al ratio. Selecting appropriate synthesis conditions leading to a particular set of microstructural characteristics is challenging, since the known crystallization mechanism is not adequate to derive predictive models⁴⁵. ML algorithms can be used to construct experimentally-informed candidate input-output (processing/structure) relationships from data in the absence of closed-form (physics-driven) expressions. In our case, given processing/structure information for zeolite fabrication, we aim to construct a function that maps synthesis conditions to final structure. Positing such a model allows us to estimate, predict, and even optimize structure of a zeolite material given unexplored synthesis conditions, thus guiding further experimentation.

Learning a candidate mapping between inputs and outputs can be attempted through several, in principle comparable, ML approaches, including Neural Networks (NN), Gaussian Process Regression (GPR), and Geometric Harmonics (GH). This paper focuses primarily on predicting via GH, but we also provide comparisons with the other two methods in order to illustrate the qualitative similarity of corresponding results. All methods use input and corresponding output data from a (posited) function of interest to construct a surrogate model (an approximation) of the true function. To the best of our knowledge, Diffusion Maps/Geometric Harmonics have not been previously used in this context. We provide a brief description of each method below and additional details in the SI (Section S2, Supplementary Figs. S25–S27).

Geometric Harmonics (GH) uses the input-output data to numerically construct a hierarchical set of data-driven basis functions (to be exact, basis vectors, that constitute discretized versions of basis functions) in the space of inputs. Any function of the inputs (e.g., a structural characteristic of the resulting material) can be approximated as a linear combination of the leading (data-driven) basis functions, in the same spirit as a function of space can be approximated by a truncated sum of its Fourier components^48,49.

Similarly, a Neural Network (NN) can construct a surrogate function $f$ between inputs (synthesis conditions), ${{{{{\boldsymbol{x}}}}}}$, and outputs (structural characteristics), $y=f\left({{{{{\boldsymbol{x}}}}}};{{{{{\boldsymbol{\theta }}}}}}\right)$, by adapting the values of the parameters ${{{{{\boldsymbol{\theta }}}}}}$ to achieve the best function approximation⁵⁰. The selection of the parameters is achieved via an optimization stage that is called training. During training, the network computes derivatives with respect to its parameters ${{{{{\boldsymbol{\theta }}}}}}$ in order to minimize a loss function across the training data^46,50,51.

Gaussian Process Regression (GPR) models an output function $f$ of the inputs as a collection of jointly normal random variables that describe one’s knowledge about $f({{{{{\boldsymbol{x}}}}}})$ at each point ${{{{{\boldsymbol{x}}}}}}$ in the function’s domain⁵². After the user specifies (via a kernel function) how these random variables are correlated with each other, conditional probability allows one to predict the function value ${y}^{{\prime} }=f\left({{{{{{\boldsymbol{x}}}}}}}^{{\prime} }\right)$ at another input point ${{{{{{\boldsymbol{x}}}}}}}^{{\prime} }$. Such a result is expressed as a Gaussian distribution, from which the mean may serve as the prediction and the variance as a measure of uncertainty in the estimate⁵³.

We characterize the five outputs of each experiment as functions of nine input quantities. Because six of the inputs are numerical and three of the inputs (oven type, silica source, and alumina source) are categorical, the input for each experiment is represented as a vector in 12-dimensional space (see Section S4 for explanation of the number of dimensions). We developed forty-four GH (nine for Si/Al ratio, etc., see Section S4.2), two NN, and five GPR models, which are described in detail in Section S4 of the SI. Comparisons of experimental and predicted microstructural characteristics (outputs) from one of these models (i.e., GH with rescaled inputs and 10-fold cross validation: “rescaled 10-CV”) are shown in Fig. 2 (Details of entry points are provided in Supplementary Fig. 28). For crystal size, we only include experiments and predictions for sizes smaller than 60 nm to ensure that instrumental broadening does not affect the experimental measurements. In addition to the five outputs discussed earlier, we also consider the particle to crystal size ratio as a measure for differentiating single from aggregated/intergrown crystals.

**Fig. 2: Comparison between experimental values and predicted values via the Machine Learning algorithm we label “rescaled 10-CV” (a Geometric Harmonics algorithm).**

Figure 2 shows that ML can approximate output functions of interest (entry numbers for points in Fig. 2(a–f) are labelled in Supplementary Fig. 28). We use a set of 81 experiments as training points (represented as blue points in Fig. 2(a–f)) and 7 experiments as testing points (represented as red points in Fig. 2(a–f)). In addition, the green points are for unseen experiments, which will be discussed later. The blue, red, green color scheme, indicating training, testing, and prediction, respectively, is used in Figs. 1 and 2, Supplementary Table 1, Supplementary Tables 3–25, Supplementary Figs. 28–37.

All our GH, GPR, and NN models were able to learn a surrogate model from the training data, choosing hyperparameter values based on cross-validation, log-likelihood maximization, and ADAM minimization of training mean squared error, respectively. These models also performed well on the test set, which consisted of experiments that had already been performed but were set aside for this purpose. Predictions from the different algorithms are provided in the Supporting Information (Supplementary Figs. 28–37). In order to compare the reliability of these models, we applied error analysis and calculated the R² and MSE values for each model and each output from the training (blue entries) and test (red entries) sets as listed in Supplementary Table 26. Error analysis results (Supplementary Table 26) show that if we select MSE training as a performance metric “rescaled 10-CV” is best for Si/Al (second smallest value for MSE training and smallest value for MSE testing) followed by “rescaled LOOCV (w/o categorical)” (ranking third for MSE training and second for MSE test). By comparison, “rescaled 10-CV (w/o categorical)” has the smallest value for MSE training but ranks fourth for MSE test. Therefore, the algorithm “rescaled 10-CV” is the most reliable one among different CV schemes, and correlations developed by this algorithm are provided in Fig. 2. Correlations for the algorithm “rescaled LOOCV (w/o categorical)” are provided in Supplementary Fig. 36.

An example of training synthesis is Entry 3 with a molar composition of 10 SiO₂: 1 Al₂O₃: 8 Na₂O: 400 H₂O, which is heated at 100 °C for 18 h in a static autoclave. It leads to the synthesis of large pure FAU crystals with Si/Al ratio of 1.6. In our prior work, we used this FAU material to investigate the accessibility and reactivity of protons located within sodalite cages of the FAU framework that become accessible during ion exchange⁵⁴.

An example of a test synthesis is Entry 81, with an initial molar composition of 12 SiO₂: 1 Al₂O₃: 10 Na₂O: 180 H₂O, which after aging at 25 °C for 24 h was subjected to water reduction by freeze drying to 80 H₂O and was then heated at 50 °C for 4 days in a rotating autoclave (6 rpm). The trained model predicts the output of this test synthesis that leads to non-aggregated high FAU content nanocrystals with Si/Al ratio of 1.3. Such low Si/Al ratio nanocrystals could be useful for the fabrication of FAU membranes.

ML was then used to suggest inputs aiming at achieving the desirable output of FAU with Si/Al ratio higher than 3. Exceeding a Si/Al of 3, by direct synthesis (i.e., without dealumination treatments)²¹ in the absence of organic-structure-directing agents (OSDA) in sodium aluminosilicate sols/gels remains elusive despite many decades of effort in developing FAU synthesis methods, e.g., OSDA structure design (bottom-up)^22,26, post-synthesis dealumination (top-down)^21,26, etc. It is a highly desirable outcome, as it may lead to robust catalytic properties²⁹, improved stability²⁶, and lower manufacturing cost¹¹. We test the ability of the best algorithm to predict properties for unseen synthesis conditions by using it as a surrogate model for optimization. Since our input space is 12-dimensional, we use plots that vary two quantities at a time while keeping others fixed, which produces hyperplane “slices” of the complete model in the synthesis conditions space as seen in Fig. 3, which provides projective views of the GH predictions for Si/Al ratio, as a function of different pairs of process parameters (input variables). Entry 20 was reported from the prior work by Rimer et al.²⁹, and this entry holds the reported hitherto highest Si/Al ratio to prepare high-silica faujasite zeolites via template-free routes. The gradients were computed at entry 20 in our training set (Supplementary Table 1) as the base point. Supplementary Fig. 39 includes two similar plots of predictions for a model (Supplementary Fig. 39(a)(b)) that instead uses a Matérn(0.5) kernel (see method description in Section S4) and counterparts developed by the algorithm LOOCV (Supplementary Fig. 39(c)(d)). Therefore, the predicted contours and gradients developed from multiple ML models suggest decreasing Na₂O (or reducing pH) and increasing the crystallization time to achieve Si/Al > 3.

**Fig. 3: Predictions for unseen experiments to exceed Si/Al = 3.**

An inspection of input/output correlations from just plotting the raw data (Supplementary Figs. 40–43), which demonstrate the dependences among these variables and the complexity of the zeolite synthesis itself, also indicates that low Na₂O increases the Si/Al ratio (as shown in entries of Supplementary Table 1, Fig. 4(a), and Supplementary Fig. 41). Indeed entries 4-8 of Supplementary Table 1 show the progressive increase of Si/Al from 2.7 to 2.8 as the Na₂O/Al₂O₃ ratio decreases from 4 to 3.6. Lower Na₂O/Al₂O₃ entries, e.g., A16 to A22 of Supplementary Table 2 with Na₂O/Al₂O₃ ratio of 3.5, have been, however, excluded from the training set because they yield amorphous or impure (e.g., mixtures of FAU + LTA) products. In particular, entries A22 and A20, which were performed at 100 °C (i.e., same temperature as entries 4–8 discussed above) for 3 and 7 days yield products that are either amorphous or amorphous mixed with some FAU, respectively. A potential explanation for this observation is that as Na₂O is being reduced, the associated pH reduction slows down the crystallization kinetics. Therefore, a possible path to FAU with Si/Al > 3 would be to increase the time and/or temperature of crystallization. Since increasing the temperature to 120 °C (entries A16-A19 and A21) also yields FAU with amorphous or LTA impurities, we decided to explore longer crystallization times at 100 °C. Entries 89–92, with crystallization times 9–13 days, yield pure FAU with Si/Al larger than 3.

**Fig. 4: Input/output correlations from raw data and SHAP (Shapley value based) analyses.**

The best performing algorithm (based on MSE training in Supplementary Table 26) “rescaled 10-CV” predicts this outcome (see Fig. 2(a) and Supplementary Fig. 28(a)). From the remaining models, the second best performer “rescaled LOOCV (w/o categorical)” also makes good predictions. The rest, except for two models (normalized 10-CV and rescaled 5-CV), fail to predict Si/Al > 3. The “rescaled 10-CV” model successfully predicts additional outcomes of the synthesis for entries 89–92. Particle Size, FAU/(FAU + EMT) ratio, and Uptake Values are shown in Fig. 2 and Supplementary Fig. 28. We note that since the crystal sizes for entries 89–92, as determined from XRD peak broadening, are larger than 60 nm, they are not included in the plots of Supplementary Fig. 28(c) and 28(f). Once Si/Al ratio exceeds 2, FAU fraction increases to near unity, and the high Si/Al materials are pure FAU products (Fig. 4(b)). Similarly, particle size consistently increases with increasing Si/Al (Fig. 4(c)).

Despite the small number of training and testing data, it can be concluded that the selected best performing Geometric Harmonics algorithm (“rescaled 10-CV”) is successful in predicting the outcome of unseen experiments with different combinations of properties (outputs). These predictions can also steer experimental conditions (inputs) to achieve desirable outcomes. On the contrary, Neural Networks and Gaussian Process Regression were not successful in providing good predictions.

The dominant role of Na₂O is evident by the input/output correlation of Fig. 4(a). It becomes also evident in SHAP (Shapley value based) analyses^55,56,57 (Fig. 4(d) and (e), Supplementary Fig. 44). The Shapely values measure the average contribution of each feature’s (variable’s) value to the prediction and thus provide a sense of how the change of a variable might affect the output^56,57. We applied the model-agnostic exact explainer algorithm⁵⁷ on the model Si/Al trained with LOOCV, rescaling as preprocessing and without categorical inputs (namely the “rescaled LOOCV (w/o) categorical” model). The selection of this model was made based on its performance (second best based on its MSE metrics) and on the fact that, by excluding the categorical inputs, all of its inputs are continuous. The SHAP analysis on the Si/Al model trained with “rescaled 10-CV” is provided in SI (Supplementary Fig. 45). We generate the summary plot (Fig. 4(d)) for all the training points to get a sense of the importance of contribution of each variable (synthesis condition). In the Fig. 4(d), the x-axis is the Shapely value that indicates the contribution of a particular feature to the output. The y-axis reports the variables (synthesis conditions), and the color corresponds to the magnitude of the value for each variable if it is large or small. The variables are sorted in descending order based on their contribution. SHAP suggests that Na₂O contributes the most to the output (Si/Al) and that deceasing relative Na₂O amount (Na₂O/Al₂O₃) will contribute positively to the output (increase the Si/Al). The SHAP analysis for the model “rescaled 10-CV” provides a similar conclusion regarding the role of Na₂O/Al₂O₃, being most prominent and contributing positively to the output when decreasing (Supplementary Fig. 45).

For the four prediction points after optimization (89, 90, 91, and 92) the Shapely values predicted separately. The waterfall plot for the entry 91 is shown in Fig. 4(e) and for the remaining prediction points we included their waterfall plots in Supplementary Fig. 44. In the waterfall plot, the x-axis corresponds to the normalized values of the output variable (Si/Al ratio) $f(x)$. To obtain the true ${f}_{{True}}\left(x\right)$ values for the output, denormalization is needed: ${f}_{{True}}\left(x\right)=f\left(x\right){\sigma }_{{Si}/{Al}}+{\mu }_{{Si}/{Al}},$ where ${\sigma }_{{Si}/{Al}}$ corresponds to the standard deviation of the training set, ${\sigma }_{{Si}/{Al}}=0.67$, and ${\mu }_{{Si}/{Al}}$ corresponds to the mean of the training set, ${\mu }_{{Si}/{Al}}=1.91$. The Shapely value of each feature is given by the length of the bar. If the contribution is positive is colored red and if is negative is colored blue. The (absolute) Shapely values shows how much a single variable affects the prediction. In Fig. 4(e) it appears that the change in Na₂O and Crystallization Temperature positively affects (increases) the output prediction of the model.

Next, we compared this as-synthesized faujasite (Na-FAU3.5) with the previously reported highest Si/Al-ratio faujasite made by direct synthesis (Na-FAU2.8 with a Si/Al ratio of 2.8 prepared from the composition of 12 SiO₂: 1 Al₂O₃: 4 Na₂O: 160 H₂O)²⁹. XRD patterns (Fig. 5(a)) show that these two faujasite materials are pure FAU without EMT intergrowths, or other impurities. According to Ar-adsorption isotherms (Fig. 5(b)), although Na-FAU3.5 has a lower uptake value at P/P₀ = 0.01 than Na-FAU2.8, the corresponding ion exchanged forms, H-FAU3.5 and H-FAU2.8 exhibit similar isotherms (ion exchange for both was performed using 1 M of NH₄NO₃ solution for 1 h, and 0.25 g zeolite powder per 40 cm³ of ammonium solution). As shown in SEM images (Fig. 5(c)(d)), Na-FAU3.5 exhibited a larger particle size than Na-FAU2.8. Solid-state ²⁷Al-NMR (Fig. 5(e)) proved that both Na-FAU materials did not contain octahedral Al species (typically observed at a chemical shift δ of 0 ppm)²⁹ prior to ion exchange, reflecting integrity of the FAU framework and the absence of extra-framework Al. Extra-framework Al species were formed only after ion exchange^16,54. The framework Si/Al ratios (Table 1, columns 4&5, Supplementary Table 27) could be estimated from the ²⁹Si-NMR spectra (Fig. 5(f)) based on “Loewenstein’s rule” (Eq. 1)⁵⁸, which stipulates the absence of Al-O-Al linkages in the zeolite framework.

$$\frac{{{{{{\rm{Si}}}}}}}{{{{{{\rm{Al}}}}}}}=\mathop{\sum }\limits_{x=0}^{4}{I}_{{{{{{\rm{Si}}}}}}{({{{{{\rm{OAl}}}}}})}_{x}}/0.25\mathop{\sum }\limits_{x=0}^{4}x{I}_{{{{{{\rm{Si}}}}}}{({{{{{\rm{OAl}}}}}})}_{x}}$$

(1)

**Fig. 5: Characterization results of FAU3.5 and FAU2.8 materials.**

Table 1 Physical properties of H-FAU3.5 and H-FAU2.8 zeolites

Full size table

These framework Si/Al ratios (Table 1, columns 4&5) over H-FAU materials can be combined with Na/Al ratios via ICP analysis (Table 1, column 3) to calculate chemical formulae (Table 1, column 6), and determine H⁺ site densities over these two H-FAU materials (Table 1, column 7). Infrared spectra recorded upon adsorption of pyridine (Supplementary Fig. 46) show that pyridine molecules only titrate protons (at ~3640 cm⁻¹)^54,59 located within supercages over these two H-FAU materials. Protons with sodalite cages are able to be fully titrated only when the framework collapses partially (e.g., steam treatment of FAU materials to prepare ultra-stable Y)⁶⁰. Both H-FAU3.5 and H-FAU2.8 zeolites still sustain bulk framework stability as evidenced by Infrared spectra recorded upon adsorption of pyridine (Supplementary Fig. 46) and Ar-adsorption isotherms (Fig. 5(b)) after ion exchange with 1 M of NH₄NO₃ solution. Our prior work compared the reactivities and selectivities for protolytic reactions of propane between protons within opened sodalite cages and protons within supercages over high-silica faujasite zeolites⁵⁴. We reported in this prior work that sodalite cages could be fully opened when 0.6 M of NH₄NO₃ solution was used to perform ion exchange by virtue of infrared spectra of H-D exchange with deuterated propane⁵⁴. Thus, upon ion exchange using 1 M of NH₄NO₃ solution, protons within both sodalite cages and supercages could be titrated by propane. Infrared spectra after dehydration at 603 K (Fig. 5(g)) reflect that H-FAU3.5 and H-FAU2.8 zeolites exhibit a similar proton density ratio of H_SOD/H_SUP (Table 1, column 8). We also observed in our prior work that unlike low-silica FAU zeolites, which contain protons on site II and site III within supercages, high-silica FAU zeolites only contain protons on site II within supercages⁶¹. Thus, H-FAU3.5 and H-FAU2.8 zeolites possess similar H_SOD/H_SUP ratios and the same atomic configurations (i.e., protons on site II within supercages, and protons on site I′ within sodalite cages).

Having now established the similarities of the two materials in terms of phase purity, porosity, particle size, extra-framework Al, acid site density and location, we proceed to compare their catalytic performance. We compared reactivities of protons on H-FAU3.5 and H-FAU2.8 zeolites by using molecular dehydrogenation and cracking of propane (Eqs. 2 and 3) as probe reactions.

$${{{{{{\rm{C}}}}}}}_{3}{{{{{{\rm{H}}}}}}}_{8}\mathop{\longrightarrow }\limits^{{k}_{{{{{{\rm{D}}}}}}}}{{{{{{\rm{C}}}}}}}_{3}{{{{{{\rm{H}}}}}}}_{6}+{{{{{{\rm{H}}}}}}}_{2}$$

(2)

$${{{{{{\rm{C}}}}}}}_{3}{{{{{{\rm{H}}}}}}}_{8}\mathop{\longrightarrow }\limits^{{k}_{{{{{{\rm{C}}}}}}}}{{{{{{\rm{C}}}}}}}_{2}{{{{{{\rm{H}}}}}}}_{4}+{{{{{{\rm{CH}}}}}}}_{4}$$

(3)

Gounder et al.⁶² reported that alkane dehydrogenation can be promoted over extrinsic active sites of carbonaceous deposits formed during reaction, and the removal of remnant reactive carbon species should be taken into consideration to precisely assess intrinsic H⁺- catalyzed propylene formation rates. Sample pretreatment in H₂, and H₂ co-feed in the inlet stream were thus incorporated in the experimental protocol to mitigate on-stream deposition of reactive carbon species. Two H-FAU samples were pretreated using H₂/He mixtures (p_H2 = 35 kPa, and H₂/He = 1:2) and co-fed H₂ (H₂/C₃H₈/Ar/He = 3/3/1.5/60, p_H2 = 5.3 kPa,) in the inlet stream at different temperatures (818, 833, 848, 863, 878, and 893 K). Once protons within sodalite cages are rendered accessible by partial framework collapse upon ion exchange at NH₄NO₃ concentrations exceeding 0.6 M, then propane dehydrogenation and cracking occurs both over H⁺ sites in the sodalite cage and in the supercage as we reported previously⁵⁴. Since these two H-FAU samples share similar proton density ratios (H_SOD/H_SUP in Table 1, column 8), we directly normalized rates per overall H⁺ site. By comparison, H-FAU3.5 exhibits higher propane dehydrogenation and cracking rate constants per overall H⁺ site than H-FAU2.8 (Fig. 6). Despite the lower proton density (Table 1, column 7), H-FAU3.5 also exhibits higher rate constants per gram (Supplementary Fig. 47). We surmise that these two samples were partially dealuminated at the harsh ion exchange conditions (1 M NH₄NO₃) used, which could be proved by ²⁷Al-NMR spectra (Fig. 5(e)). Lercher et al.⁶³ have examined solid state reactions that occur in partially-dealuminated zeolites to note IR, NMR, and EXAFS spectroscopic signatures of extra-framework Al in close proximity to Brønsted acid sites in H-ZSM-5 materials. Active centers with adjacent Brønsted acid sites and partially dislodged framework Al species showed higher rates for H₂/D₂ exchange and protolytic butane cracking and it was noted that accessible pore space in the zeolite was adjusted to accommodate alkane cracking transition states better leading to higher entropies of activation. The higher reactivity of H-FAU3.5 for protolytic alkane activation relative to H-FAU2.8 likely arises from similar tunability of pore size and space upon ion exchange.

**Fig. 6: Analysis of rate constants for molecular cracking and dehydrogenation of propane over H-FAU3.5 and H-FAU2.8 zeolites.**

Discussion

Herein, we reported that a ML-based model, created using an in-house set of synthesis data, directed us to explore synthesis routes to enhance the Si/Al ratio of FAU zeolites via OSDA-free direct synthesis. Based on 81 training synthesis inputs and outcomes, the ML algorithm was validated with 7 testing points, and suggested synthesis conditions that elevated Si/Al to hitherto highest level (i.e., Si/Al = 3.5). Compared to a previously reported high-silica FAU zeolite (H-FAU2.8, entry 24) made by direct synthesis, H-FAU3.5 zeolite exhibits 2.5- and 2-fold increments in propane dehydrogenation and cracking rate constants per H⁺ site, respectively, demonstrating the potential of ML-directed synthesis to improve catalytic performance.

Methods

Na-FAU zeolites synthesis

The synthesis mixtures employed in this work were sodium aluminosilicates via template-free routes, and the molar compositions are listed in Supplementary Tables 1 and 2, in which the entries have different Si and Al sources. Specifically, Al sources include aluminum powder (99.9%, MilliporeSigma, abbreviated as Al powder), aluminum foil (99.999%, MilliporeSigma, abbreviated as Al foil), aluminium isopropoxide (98%, MilliporeSigma, abbreviated as Al(O-iPr)₃), and sodium aluminate (Sigma-Aldrich, abbreviated as NaAl). Si sources include LUDOX HS-30 colloidal silica (abbreviated as HS30), LUDOX AS-40 colloidal silica (abbreviated as AS40), and sodium silicate (Sigma-Aldrich, abbreviated as NaSi). Sodium hydroxide solution (50% w/w, Neta Scientific) and deionized water are also used here for synthesis.

Two solutions were prepared during synthesis. Solution A (Si precursor solution) was prepared by adding a given amount of sodium hydroxide solution to a given amount of deionized water, followed by addition of a given amount of LUDOX HS-30 colloidal silica (or other Si precursors) into the prepared solution. Solution A formed a gel initially, and then it was heated in an oven at 343 K for 15–30 min until reaching a clear sol. Solution B (Al precursor solution) was prepared by adding a given amount of sodium hydroxide solution to a given amount of deionized water, followed by dissolving a given amount of aluminum powder (or other Al precursors) into the prepared solution (Note: the reaction is exothermic and produces hydrogen, hence the addition of aluminum powder should be performed with caution and appropriate safety protocols in place). Solutions A and B were cooled to ambient temperature and then solution B was added dropwise into solution A in a Teflon bottle while stirring. A freeze drying step is applied to remove water to a desired level (e.g., H₂O_final/H₂O_initial = 0.47) within a lyophilizer at ambient temperature with a pressure of 20 mTorr. The synthesis mixture was aged with stirring at ambient temperature for 24 h, and then the vessel was heated in a static oven at a given temperature for a given duration (see details in Supplementary Tables 1 and 2). The products were then washed by repetitive centrifugation and redispersion by deionized water until the pH dropped to 9–10, and then they were dried at 343 K overnight.

H-FAU zeolites preparation

Na-FAU zeolites were ion-exchanged with aqueous ammonium nitrate solutions for 1 h at ambient temperature. Each time 0.25 g of Na-FAU material was added to 40 cm³ of ammonium nitrate solution with stirring, and the ammonium concentrations were selected as 1 M. The solid products were thoroughly washed with deionized water at ambient temperature and then dried at 343 K for 6 h. Finally, the solid products were heated under inert helium flowing gas (0.167 cm³ s⁻¹, Matheson) from ambient temperature to 673 K with a ramping rate of 0.033 K s⁻¹ and maintained at 673 K for 6 h. The resulting samples were denoted as H-FAU zeolites.

Synchrotron X-ray diffraction

XRD patterns were collected at Beamline 17-BM at Advanced Photon Source, Argonne National Laboratory. Powder samples were crushed finely with a pestle and mortar and loaded into 0.8–1 mm diameter Kapton capillaries. The X-ray wavelength used was 0.45192–0.45228 Å. 2-D diffraction data were collected in transmission geometry by a PerkinElmer amorphous silicon flat panel detector, and then 2-D diffraction data were processed with software GSAS II⁶⁴ to obtain conventional XRD plots of intensity vs. 2θ. All XRD patterns presented in Supplementary Figs. 1–23 are converted to a wavelength of 1.54059 Å (Cu Kα).

Argon physisorption

Measurements were performed at 87.3 K using an automatic manometric sorption Analyzer (Quantachrome Instruments Autosorb iQ MP). Prior to adsorption measurements, the samples were outgassed at 573 K for 10 h under turbomolecular pump vacuum ( < 0.003 Torr). Cumulative pore volume curves were calculated from the isotherms by applying an advanced NLDFT method, which assumes that argon adsorption at 87 K occurs in spherical siliceous zeolite pores in the micropore range and cylindrical silica pores in the mesopore range⁵⁹.

Scanning electron microscopy (SEM)

SEM images for tested samples were acquired using a JEOL JSM-6500 scanning electron microscope operated at 5 kV. SEM specimens were prepared by suspension of the sample powder in ethanol by ultrasonication for 30 min, and then the solution was dropped onto the surface of a silicon chip and dried at room temperature.

Transmission electron microscopy (TEM)

TEM images were taken using a Tecnai T12 microscope operated at 120 kV with a LaB 6 filament. The specimens were prepared by dispersing the sample powder in ethanol and ultrasonicating for 30 min, and then the solution was dropped onto a Formvar-coated Cu grid and dried at room temperature.

Solid-state magic angle spinning (MAS) nuclear magnetic resonance (NMR) spectroscopy

²⁷Al and ²⁹Si MAS NMR spectra were acquired with a Bruker DSX-500 spectrometer (11.7 T magnet) and a 4 mm Bruker MAS probe. The spectral frequencies were 78.2 MHz and 99.4 MHz for the ²⁷Al and ²⁹Si nucleus, respectively.

Infrared spectroscopy

Infrared (IR) spectra for pyridine adsorption were collected for H-FAU samples on a Nicolet™ iS50 Fourier transform infrared spectrometer with a Hg-Cd-Te (MCT, cooled to 77 K by liquid N₂) detector by averaging 128 scans at 2 cm⁻¹ resolution in the 600–4000 cm⁻¹ range and were taken relative to an empty cell background reference collected under dynamic vacuum (~0.01 Torr) at 498 K. Self-supporting wafers (0.01–0.03 g cm⁻², with a diameter of 13 mm) were sealed within an IR transmission cell with ZnSe windows (High Temperature Transmission Cell, Harrick Scientific Products Inc.). Wafer temperatures were measured by K-type thermocouples (Omega) attached to the sample holder. The IR cell was connected to a glass vacuum manifold, which was used for sample exposure to controlled amounts of gaseous pyridine. The temperature program followed for these measurements is described herein: sample dehydration was performed initially, the temperature of the cell was initially raised from ambient temperature to 673 K at a ramping rate of 0.033 Ks⁻¹ followed by holding temperature at 673 K for 6 h; then the temperature was cooled down to 498 K and pyridine was introduced until saturation of the adsorbate was noted with invariance among successive spectra recorded.

Catalytic tests

Proton-catalyzed monomolecular propane reactions were performed in a tubular glass-lined stainless steel reactor (6.35 mm O.D. and 4 mm I.D., SGE Analytical Science) equipped with a thermocouple to monitor the reaction temperature. The catalyst sample was heated in helium flow (0.083 cm³ s⁻¹, Matheson) from ambient temperature to the reaction temperature at atmospheric pressure. Prior to data acquisition, we pretreated samples using H₂/He mixtures (p_H2 = 35 kPa, H₂/He = 1:2, and the total flow rate = 0.5 cm³ s⁻¹) for 20 min to remove any remnant reactive carbon species. Molar ratios of feed gas mixtures were fixed as H₂/C₃H₈/Ar/He = 3/3/1.5/60 with Ar serving as an internal standard, and space velocity of 3600 cm³_C3H8·g_cat⁻¹ h⁻¹ with a total flow rate of 1.125 cm³ s⁻¹. H₂ is present in the inlet stream to mitigate on-stream deposition of organic species. Reactor effluent was vented to atmospheric pressure, system pressure varied from 101 kPa to 120 kPa as measured by a PX209-300G5V pressure transducer. Reactor temperature varied from 818 to 893 K with an interval of 15 K. Propane conversions were <1% and considered differential for assessment of catalytic rates. The composition of the reactor effluent was analyzed by an online Agilent 7890 A gas chromatograph (GC) using a flame ionization detector (FID) and a thermal conductivity detector (TCD). Eluent separation was achieved in parallel using a dimethylpolysiloxane J&W HP-1 column (50 m long, 320 μm diameter, 0.52 μm film thickness) connected to the FID and a GS-GasPro (60 m long, 320 μm diameter) preceding the TCD. Ar was quantified using the TCD, and all hydrocarbon species were quantified using the FID.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data that support the findings of this study are provided in Supplementary Information and Source Data file. Source Data are provided as a Source Data file and enclosed with this paper. Details listed in the Supplementary Information consist of the synthesis procedures, characterization results (XRD patterns, Ar-adsorption isotherms, SEM/TEM images, ²⁹Si solid-state NMR), reactivity analysis, and machine learning methods and results. Source data are provided with this paper.

Code availability

The codes used to train the Machine learning models can be accessed in the public Gitlab repository (https://gitlab.com/nicolasevangelou/zeolites_ml.git) and the Figshare Dataset.

References

Breck, D. W. & Breck, D. W. Zeolite molecular sieves: structure, chemistry, and use; (John Wiley & Sons, 1973).
Davis, M. E. Ordered porous materials for emerging applications. Nature 417, 813–821 (2002).
Article ADS CAS PubMed Google Scholar
Cundy, C. S. & Cox, P. A. The hydrothermal synthesis of zeolites: history and development from the earliest days to the present time. Chem. Rev. 103, 663–702 (2003).
Article CAS PubMed Google Scholar
Davis, M. E. & Lobo, R. F. Zeolite and molecular sieve synthesis. Chem. Mater. 4, 756–768 (1992).
Article CAS Google Scholar
Tosheva, L. & Valtchev, V. P. Nanozeolites: synthesis, crystallization mechanism, and applications. Chem. Mater. 17, 2494–2513 (2005).
Article CAS Google Scholar
Corma, A. State of the art and future challenges of zeolites as catalysts. J. Catal. 216, 298–312 (2003).
Article CAS Google Scholar
Baerlocher, C. Database of zeolite structures. http://www.iza-structure.org/databases/ (2008).
Di Iorio, J. R. et al. Cooperative and competitive occlusion of organic and inorganic structure-directing agents within chabazite zeolites influences their aluminum arrangement. J. Am. Chem. Soc. 142, 4807–4819 (2020).
Article PubMed Google Scholar
Lim, K. H. & Grey, C. P. Characterization of Extra-Framework Cation Positions in Zeolites NaX and NaY with Very Fast 23Na MAS and Multiple Quantum MAS NMR Spectroscopy. J. Am. Chem. Soc. 122, 9768–9780 (2000).
Article CAS Google Scholar
Ng, E. P. et al. Capturing Ultrasmall EMT Zeolite from Template-Free Systems. Science 335, 70–73 (2012).
Article ADS CAS PubMed Google Scholar
Awala, H. et al. Template-free nanosized faujasite-type zeolites. Nat. Mater. 14, 447–451 (2015).
Article ADS CAS PubMed Google Scholar
Mintova, S., Jaber, M. & Valtchev, V. Nanosized microporous crystals: emerging applications. Chem. Soc. Rev. 44, 7207–7233 (2015).
Article CAS PubMed Google Scholar
Khaleel, M., Wagner, A. J., Mkhoyan, K. A. & Tsapatsis, M. On the Rotational Intergrowth of Hierarchical FAU/EMT Zeolites. Angew. Chem. Int. Ed. 53, 9456–9461 (2014).
Article CAS Google Scholar
Inayat, A., Knoke, I., Spiecker, E. & Schwieger, W. Assemblies of Mesoporous FAU-Type Zeolite Nanosheets. Angew. Chem. Int. Ed. 51, 1962–1965 (2012).
Article CAS Google Scholar
Verboekend, D., Vilé, G. & Pérez-Ramírez, J. Hierarchical Y and USY Zeolites Designed by Post-Synthetic Strategies. Adv. Funct. Mater. 22, 916–928 (2012).
Article CAS Google Scholar
Verboekend, D., Keller, T. C., Mitchell, S. & Pérez-Ramírez, J. Hierarchical FAU- and LTA-Type Zeolites by Post-Synthetic Design: A New Generation of Highly Efficient Base Catalysts. Adv. Funct. Mater. 23, 1923–1934 (2013).
Article CAS Google Scholar
Zhang, X. et al. Synthesis of Self-Pillared Zeolite Nanosheets by Repetitive Branching. Science 336, 1684–1687 (2012).
Article ADS CAS PubMed Google Scholar
Kumar, P. et al. One-dimensional intergrowths in two-dimensional zeolite nanosheets and their effect on ultra-selective transport. Nat. Mater. 19, 443–449 (2020).
Article ADS CAS PubMed Google Scholar
Qi, L. et al. Ethanol Conversion to Butadiene over Isolated Zinc and Yttrium Sites Grafted onto Dealuminated Beta Zeolite. J. Am. Chem. Soc. 142, 14674–14687 (2020).
Article CAS PubMed Google Scholar
Pérez-Ramírez, J. et al. Hierarchical zeolites: enhanced utilisation of microporous crystals in catalysis by advances in materials design. Chem. Soc. Rev. 37, 2530–2542 (2008).
Article PubMed Google Scholar
Verboekend, D. et al. Synthesis, characterisation, and catalytic evaluation of hierarchical faujasite zeolites: milestones, challenges, and future directions. Chem. Soc. Rev. 45, 3331–3352 (2016).
Article CAS PubMed Google Scholar
Li, J., Corma, A. & Yu, J. Synthesis of new zeolite structures. Chem. Soc. Rev. 44, 7112–7127 (2015).
Article CAS PubMed Google Scholar
Van Tendeloo, L. et al. Alkaline cations directing the transformation of FAU zeolites into five different framework types. Chem. Commun. 49, 11737–11739 (2013).
Article Google Scholar
Schwalbe-Koda, D. et al. A priori control of zeolite phase competition and intergrowth with high-throughput simulations. Science 374, 308–315 (2021).
Article ADS CAS PubMed Google Scholar
Jensen, Z. et al. Discovering Relationships between OSDAs and Zeolites through Data Mining and Generative Neural Networks. ACS Cent. Sci. 7, 858–867 (2021).
Article CAS PubMed PubMed Central Google Scholar
Vogt, E. T. C. & Weckhuysen, B. M. Fluid catalytic cracking: recent developments on the grand old lady of zeolite catalysis. Chem. Soc. Rev. 44, 7342–7370 (2015).
Article CAS PubMed PubMed Central Google Scholar
Muraoka, K. et al. Linking synthesis and structure descriptors from a large collection of synthetic records of zeolite materials. Nat. Commun. 10, 4459 (2019).
Article ADS PubMed PubMed Central Google Scholar
Khaleel, M., Xu, W., Lesch, D. A. & Tsapatsis, M. Combining Pre- and Post-Nucleation Trajectories for the Synthesis of High FAU-Content Faujasite Nanocrystals from Organic-Free Sols. Chem. Mater. 28, 4204–4213 (2016).
Article CAS Google Scholar
Oleksiak, M. D. et al. Organic‐Free Synthesis of a Highly Siliceous Faujasite Zeolite with Spatially Biased Q4(nAl) Si Speciation. Angew. Chem. Int. Ed. 56, 13366–13371 (2017).
Article CAS Google Scholar
Kouraytem, N. et al. Modeling process–structure–property relationships in metal additive manufacturing: a review on physics-driven versus data-driven approaches. J. Phys. Mater. 4, 032002 (2021).
Article CAS Google Scholar
Pilania, G. et al. Accelerating materials property predictions using machine learning. Sci. Rep. 3, 2810 (2013).
Article PubMed PubMed Central Google Scholar
Wei, J. et al. Machine learning in materials science. InfoMat 1, 338–358 (2019).
Article CAS Google Scholar
Wu, W. et al. Quantitative Structure-Property Relationship (QSPR) Modeling of Drug-Loaded Polymeric Micelles via Genetic Function Approximation. PLOS ONE 10, e0119575 (2015).
Article PubMed PubMed Central Google Scholar
Morgan, D. & Jacobs, R. Opportunities and Challenges for Machine Learning in Materials Science. Annu. Rev. Mater. Res. 50, 71–103 (2020).
Article ADS CAS Google Scholar
Gómez-Bombarelli, R. et al. Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules. ACS Cent. Sci. 4, 268–276 (2018).
Article PubMed PubMed Central Google Scholar
Lansford, J. L. & Vlachos, D. G. Infrared spectroscopy data- and physics-driven machine learning for characterizing surface microstructure of complex materials. Nat. Commun. 11, 1513 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Safder, U. et al. Quantitative structure-property relationship (QSPR) models for predicting the physicochemical properties of polychlorinated biphenyls (PCBs) using deep belief network. Ecotoxicol. Environ. Saf. 162, 17–28 (2018).
Article CAS PubMed Google Scholar
Ducamp, M. & Coudert, F.-X. Prediction of Thermal Properties of Zeolites through Machine Learning. J. Phys. Chem. C. 126, 1651–1660 (2022).
Article CAS Google Scholar
Pattanaik, L. & Coley, C. W. Molecular Representation: Going Long on Fingerprints. Chem 6, 1204–1207 (2020).
Article CAS Google Scholar
Carr, D. A. et al. Machine learning approach for structure-based zeolite classification. Microporous Mesoporous Mater. 117, 339–349 (2009).
Article CAS Google Scholar
Kapko, V., Dawson, C., Treacy, M. M. J. & Thorpe, M. F. Flexibility of ideal zeolite frameworks. Phys. Chem. Chem. Phys. 12, 8531–8541 (2010).
Article CAS PubMed Google Scholar
Soler-Illia, G. Jd. A. A., Sanchez, C., Lebeau, B. & Patarin, J. Chemical Strategies To Design Textured Materials: from Microporous and Mesoporous Oxides to Nanonetworks and Hierarchical Structures. Chem. Rev. 102, 4093–4138 (2002).
Article PubMed Google Scholar
Evans, J. D. & Coudert, F.-X. Predicting the Mechanical Properties of Zeolite Frameworks by Machine Learning. Chem. Mater. 29, 7833–7839 (2017).
Article CAS Google Scholar
Gaillac, R., Chibani, S. & Coudert, F.-X. Speeding Up Discovery of Auxetic Zeolite Frameworks by Machine Learning. Chem. Mater. 32, 2653–2663 (2020).
Article CAS Google Scholar
Moliner, M., Román-Leshkov, Y. & Corma, A. Machine Learning Applied to Zeolite Synthesis: The Missing Link for Realizing High-Throughput Discovery. Acc. Chem. Res. 52, 2971–2980 (2019).
Article CAS PubMed Google Scholar
Gurney, K. An introduction to neural networks; (CRC press, 2018).
Jensen, Z. et al. A Machine Learning Approach to Zeolite Synthesis Enabled by Automatic Literature Data Extraction. ACS Cent. Sci. 5, 892–899 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lafon, S. S. Diffusion maps and geometric harmonics; (Yale University, 2004).
Coifman, R. R. & Lafon, S. Geometric harmonics: a novel tool for multiscale out-of-sample extension of empirical functions. Appl. Comput. Harmon. Anal. 21, 31–52 (2006).
Article MathSciNet MATH Google Scholar
Goodfellow, I., Bengio, Y. & Courville, A. Deep learning; (MIT press, 2016).
Lapedes, A. & Farber, R. How neural nets work. Neural information processing systems. 442–456 (1987).
Rasmussen, C. E. & Williams, C. Gaussian processes for machine learning, vol. 2, (MIT Press, Cambridge, MA, USA, 2006).
Boyle, P. Gaussian processes for regression and optimization. (PhD thesis, Victoria Univ. Wellington, 2007).
Li, X. et al. Enhanced Reactivity of Accessible Protons in Sodalite Cages of Faujasite Zeolite. Angew. Chem. Int. Ed. 61, e20211180 (2022).
Google Scholar
Shapley, L. S. A value for n-person games. Contributions to the Theory of Games II, Annals of Mathematical Studies. 28, 307–317 (1953).
Lundberg, S. M. et al. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2, 56–67 (2020).
Article PubMed PubMed Central Google Scholar
Lundberg, S. M. & Lee, S.-I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst. 30, 4768–4777 (2017).
Fyfe, C. A. et al. One- and two-dimensional high-resolution solid-state NMR studies of zeolite lattice structures. Chem. Rev. 91, 1525–1543 (1991).
Article CAS Google Scholar
Qin, Z. et al. Opening the Cages of Faujasite-Type Zeolite. J. Am. Chem. Soc. 139, 17273–17276 (2017).
Article CAS PubMed Google Scholar
Batool, S. R., Sushkevich, V. L. & van Bokhoven, J. A. Correlating Lewis acid activity to extra-framework aluminum species in zeolite Y introduced by Ion-exchange. J. Catal. 408, 24–35 (2022).
Article CAS Google Scholar
Li, X. et al. Acid Catalysis over Low-Silica Faujasite Zeolites. J. Am. Chem. Soc., https://doi.org/10.1021/jacs.1022c01022 (2022).
Kester, P. M., Iglesia, E. & Gounder, R. Parallel Alkane Dehydrogenation Routes on Brønsted Acid and Reaction-Derived Carbonaceous Active Sites in Zeolites. J. Phys. Chem. C. 124, 15839–15855 (2020).
Article CAS Google Scholar
Xue, N. et al. Hydrolysis of zeolite framework aluminum and its impact on acid catalyzed alkane reactions. J. Catal. 365, 359–366 (2018).
Article CAS Google Scholar
Toby, B. H. & Von Dreele, R. B. GSAS-II: the genesis of a modern open-source all purpose crystallography software package. J. Appl. Crystallogr. 46, 544–549 (2013).
Article CAS Google Scholar
Rouquerol, J., Llewellyn, P. & Sing, K. In Adsorption by Powders and Porous Solids (Second Edition); (eds Rouquerol, F., Rouquerol, J., Sing, K. S. W., Llewellyn, P. & Maurin, G.) (Academic Press, Oxford, 2014).
Dalconi, M. C. et al. Ni²⁺ ion sites in hydrated and dehydrated forms of Ni-exchanged zeolite ferrierite. Microporous Mesoporous Mater. 39, 423–430 (2000).
Article CAS Google Scholar
Thibault-Starzyk, F. et al. In situ thermogravimetry in an infrared spectrometer: an answer to quantitative spectroscopy of adsorbed species on heterogeneous catalysts. Microporous Mesoporous Mater. 67, 107–112 (2004).
Article CAS Google Scholar

Download references

Acknowledgements

We are indebted to Prof. Constantine Frangakis of the Biostatistics Department of Johns Hopkins University for several useful conversations about the statistical analysis of our data. We acknowledge partial support from the Catalysis Center for Energy Innovation, an Energy Frontier Research Center funded by the U.S. Department of Energy, Office of Science, and Office of Basic Energy Sciences under Award No. DE-SC0001004 and support from the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Division of Chemical Sciences, Geosciences and Biosciences under Award No. DE-SC0023403 (Separation Science Program). Partial support was also provided by the U.S. Department of Energy, Office of Basic Energy Sciences, Division of Chemical Sciences, Geosciences and Biosciences (Award DE-FG02-12ER16362), and by the U.S. Department of Energy, Office of Basic Energy Science, Catalysis Science Program (Award DE-SC00019028). Parts of this work were carried out in the Characterization Facility, University of Minnesota, which receives partial NSF support through the MRSEC and NNIN programs (DMR-1420013). Solid-state MAS NMR measurements were provided by the NMR facility at Caltech. The synchrotron XRD data were collected through the mail-in program at Beamline 17-BM of the Advanced Photon Source, a U.S. Department of Energy (DOE) Office of Science User Facility, operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE-AC02-06CH11357.

Author information

These authors contributed equally: Xinyu Li, He Han, Nikolaos Evangelou, Noah J. Wichrowski.

Authors and Affiliations

Department of Chemical Engineering and Materials Science, University of Minnesota, 421 Washington Avenue SE, Minneapolis, MN, 55455, USA
Xinyu Li, He Han, Wenyang Zhao, Aditya Bhan & Michael Tsapatsis
State Key Laboratory of Fine Chemicals, PSU-DUT Joint Center for Energy Research, School of Chemical Engineering, Dalian University of Technology, Dalian, 116024, Liaoning Province, China
He Han, Chunshan Song & Xinwen Guo
Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
Nikolaos Evangelou, Peng Lu, Ioannis G. Kevrekidis & Michael Tsapatsis
Department of Applied Mathematics and Statistics, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
Noah J. Wichrowski & Ioannis G. Kevrekidis
X-ray Science Division, Advanced Photon Source, Argonne National Laboratory, Lemont, IL, 60439, USA
Wenqian Xu
Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
Son-Jong Hwang
Applied Physics Laboratory, Johns Hopkins University, 11100 Johns Hopkins Road, Laurel, MD, 20723, USA
Michael Tsapatsis
Institute for NanoBioTechnology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
Michael Tsapatsis

Authors

Xinyu Li
View author publications
You can also search for this author in PubMed Google Scholar
He Han
View author publications
You can also search for this author in PubMed Google Scholar
Nikolaos Evangelou
View author publications
You can also search for this author in PubMed Google Scholar
Noah J. Wichrowski
View author publications
You can also search for this author in PubMed Google Scholar
Peng Lu
View author publications
You can also search for this author in PubMed Google Scholar
Wenqian Xu
View author publications
You can also search for this author in PubMed Google Scholar
Son-Jong Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Wenyang Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Chunshan Song
View author publications
You can also search for this author in PubMed Google Scholar
Xinwen Guo
View author publications
You can also search for this author in PubMed Google Scholar
Aditya Bhan
View author publications
You can also search for this author in PubMed Google Scholar
Ioannis G. Kevrekidis
View author publications
You can also search for this author in PubMed Google Scholar
Michael Tsapatsis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.T. and I.G.K. conceived the project. M.T., I.G.K., and A.B. co-supervised the work with emphasis on synthesis and characterization, Machine Learning, and catalysis, respectively. X.L. performed most of the synthesis, characterization, and catalytic tests. Early synthesis experiments, mostly described in Supplementary Table 2, were performed by H.H., co-supervised by C.S., X.G., and M.T. W.X. performed synchrotron XRD experiments and contributed to data analysis. P.L. provided some of the synthesis experiments in Supplementary Table 1. S.-J.H. performed NMR and contributed to analysis of data. W.Z. collected TEM/SEM images for part of samples listed in Supplementary Table 1 and contributed to analysis of data. N.E. and N.J.W. performed all ML analysis and predictions, supervised by I.G.K. X.L., N.E., N.J.W., M.T., I.G.K., A.B. wrote the paper with contributions from all co-authors.

Corresponding authors

Correspondence to Aditya Bhan, Ioannis G. Kevrekidis or Michael Tsapatsis.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Source data

Source Data File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, X., Han, H., Evangelou, N. et al. Machine learning-assisted crystal engineering of a zeolite. Nat Commun 14, 3152 (2023). https://doi.org/10.1038/s41467-023-38738-5

Download citation

Received: 11 July 2022
Accepted: 10 May 2023
Published: 31 May 2023
DOI: https://doi.org/10.1038/s41467-023-38738-5

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.