In silico prediction and screening of modular crystal structures via a high-throughput genomic approach

Li, Yi; Li, Xu; Liu, Jiancong; Duan, Fangzheng; Yu, Jihong

doi:10.1038/ncomms9328

Download PDF

Article
Open access
Published: 23 September 2015

In silico prediction and screening of modular crystal structures via a high-throughput genomic approach

Yi Li ORCID: orcid.org/0000-0002-5222-3674¹,
Xu Li¹,
Jiancong Liu¹,
Fangzheng Duan¹ &
…
Jihong Yu¹

Nature Communications volume 6, Article number: 8328 (2015) Cite this article

6715 Accesses
62 Citations
8 Altmetric
Metrics details

Subjects

Abstract

High-throughput computational methods capable of predicting, evaluating and identifying promising synthetic candidates with desired properties are highly appealing to today’s scientists. Despite some successes, in silico design of crystalline materials with complex three-dimensionally extended structures remains challenging. Here we demonstrate the application of a new genomic approach to ABC-6 zeolites, a family of industrially important catalysts whose structures are built from the stacking of modular six-ring layers. The sequences of layer stacking, which we deem the genes of this family, determine the structures and the properties of ABC-6 zeolites. By enumerating these gene-like stacking sequences, we have identified 1,127 most realizable new ABC-6 structures out of 78 groups of 84,292 theoretical ones, and experimentally realized 2 of them. Our genomic approach can extract crucial structural information directly from these gene-like stacking sequences, enabling high-throughput identification of synthetic targets with desired properties among a large number of candidate structures.

Synthesis of goldene comprising single-atom layer gold

Article Open access 16 April 2024

De novo design of protein structure and function with RFdiffusion

Article Open access 11 July 2023

Elucidation of genes enhancing natural product biosynthesis through co-evolution analysis

Article 12 April 2024

Introduction

Discovering new advanced materials, which is one of the most important tasks for materials scientists and chemists, still relies primarily on scientific intuition and trial-and-error experimentation¹. In 2011, the US White House launched the Materials Genome Initiative aiming to develop high-throughput computer methods and data-sharing systems to complement and fully leverage existing experimental research on advanced materials. The incorporation of new computer and informatics tools has the potential to accelerate materials innovation in: (1) predicting a large number of unknown candidate compounds^{2,3,4,5,6,7,8,9,10,11,12,13,14,15}; (2) evaluating the predicted compounds and removing the unrealizable ones^{16,17,18,19,20,21}; and (3) screening the predicted compounds and identifying synthetic candidates with desired properties^{22,23,24,25,26,27,28,29,30,31}. Despite all these successes, in silico materials innovation is still facing many challenges. Unlike the genes of organisms, encoding and decoding the structural information of many important crystalline materials remains very complicated. Meanwhile, the explicit structure-property relationships for many materials are not yet clear, so high-throughput identification of synthetic targets with desired properties among a large number of candidate structures is still challenging.

Fortunately, the structures of many crystalline materials can topologically be decomposed into a set of smaller and simpler building modules. In particular, many materials are built of well-defined parallel-stacked modular layers^14,32,33. If each unique layer is assigned a predefined symbol, then the stacking of these layers can be expressed as a sequence of predefined symbols, just like the genes of organisms. Since each stacking sequence uniquely identifies a specific three-dimensional structure, we deem it the gene of the corresponding structure. Such gene-like one-dimensional stacking sequences can be easily processed by computers, so high-throughput enumeration, evaluation, and identification of theoretical structures with desired properties will be accessible. In this contribution, for the first time, we demonstrate the application of a new genomic approach to ABC-6 zeolites, a family of industrially important catalysts constructed from the stacking of modular 6-ring layers.

To date, over 150 types of ABC-6 zeolites with 28 distinct framework topologies have been discovered, among which cancrinite, sodalite and chabazite are the best-known representatives (Supplementary Table 1). The frameworks of all ABC-6 zeolites can be decomposed into parallel six-ring layers stacked along the c-direction in hexagonal unit cells, and the vertices of each 6-ring are corner-sharing TO₄ tetrahedra (T=Si, Al, or P and so on). An ABC-6 structure may consist of three types of six-ring layers, which are centred at the (0,0,z), (1/3,2/3,z), and (2/3,1/3,z) axes, respectively. If we denote these three types of layers by letters A, B, and C, then the stacking sequences for cancrinite, sodalite and chabazite will be (AB), (ABC), and (AABBCC), respectively (Fig. 1). Meanwhile, the stacking of six-rings gives rise to various types of well-defined polyhedral cages in molecular dimensions, which are the most important structural features for ABC-6 zeolites (to avoid confusion, the stacking sequences for these polyhedral cages are given in lower case throughout this paper). These featured cages may hold various types of extraframework cations, anion groups and/or water molecules, which can be exchanged or removed, providing void space suitable for the adsorption, diffusion and reaction of many types of guest species^{34,35,36,37,38,39,40}. For instance, chabazite and its synthetic counterparts are able to trap CO₂ in their featured cages, showing the highly desired capability for carbon capture from the atmosphere^36,37; meanwhile, these zeolites are currently among the best industrial catalysts for methanol-to-olefin (MTO) reactions because of the confinement effect of their featured cages^41,42,43.

**Figure 1: Enumeration and interpretation of ABC-6 stacking sequences.**

Due to these important applications, speculating how many unknown ABC-6 structures are realizable as new catalysts with desired properties is of great significance for the development of such materials. However, to answer this question is challenging. First, we need a highly efficient computational method to enumerate all possible ABC-6 structures. Second, we need to evaluate all enumerated structures and remove the unrealizable ones. More importantly yet more difficultly, we need a high-throughput structure screening method to identify candidate ABC-6 structures with desired properties according to functional needs. An early attempt was made towards answering this question, but failed in structure evaluation and structure identification⁴⁴.

Here we propose a new genomic approach towards the solution of these problems. In this work, we focus on the one-dimensional digital stacking sequences, that is, the genes of ABC-6 structures. By enumerating all possible stacking sequences, we are able to predict every ABC-6 topology that is chemically feasible. We have developed a ternary numeral coding system, in which each stacking sequence is expressed as a specific ternary numeral. To enumerate all possible stacking sequences, we went through all ternary numerals from the smallest one to the largest allowed and evaluated the chemical feasibility for each one of them. During this enumeration process, equivalent stacking sequences (for instance, (BCA), (CAB), (CBA), (ACB) and (ABCABC), and so on, are all equivalent to (ABC)) and chemically infeasible ones (for instance, (AAA) is chemically infeasible because each stacking layer in it is highly distorted from the ideal tetrahedral coordination) were removed. At the end of the enumeration, every one of our saved stacking sequences corresponded to a topologically unique and chemically feasible ABC-6 topology.

Besides structure enumeration, our genomic approach provides a high-throughput way to extract the most important structural information directly from the enumerated stacking sequences. For instance, our computer program can locate all constituent cages hidden in the stacking sequences, which are the most important structural features for ABC-6 zeolites. To do this, our computer program went through the corresponding stacking sequence back and forth to look for a string of any length that could be interpreted as a valid ABC-6 cage. Such a string should start and end with the same letter, and this letter should not appear in the middle of this string. By finding all such strings in a stacking sequence, we have located all constituent cages in every enumerated ABC-6 topology (Fig. 1). Besides constituent cages, some other structural features, such as the channels and the stacking compactness of six-ring layers are also important to ABC-6 zeolites. Channels link up ABC-6 cages to form a three-dimensional porous system, so their widths and orientations are crucial to the adsorption and diffusion of guest species. Some ABC-6 structures may possess narrow channels only, the openings of which are no wider than a six-ring; other structures may possess interconnecting 8-ring channels perpendicular to the c-axis or/and 12-ring channels running along the c-axis. Besides cages and channels, how compactly the six-ring layers are stacked is another important structural feature influencing the porosity and other related properties of ABC-6 zeolites. Highly compact stacking of six-ring layers leads to dense ABC-6 frameworks, whereas less compact stacking gives rise to frameworks with more accessible void spaces for guest species, which are highly desired for many applications. Because of the intrinsic nature of ABC-6 structures, compact stackings only occur between successive distinct layers, and those between successive identical layers are non-compact stackings. Here we define, for the first time, the stacking compactness of an ABC-6 structure as the difference in the numbers of compact and non-compact stackings divided by the total number of layer stackings. According to this definition, the highest stacking compactness of an ABC-6 structure is 1, corresponding to the densest framework where all layers are compactly stacked. The lowest stacking compactness is 0, corresponding to the most porous framework where only half of the layer stackings are compact. Via high-throughput interpretation of the stacking sequences, the information on channels and stacking compactness can be extracted by our computer program. Details regarding the enumeration and interpretation of ABC-6 stacking sequences can be found in the Methods section.

Results

Enumeration of ABC-6 structures

Considering the computational cost, we have enumerated 84,292 stacking sequences corresponding to all topologically unique and chemically feasible ABC-6 topologies comprised of N stacking layers (N≤16). The results are summarized in Supplementary Table 2. In all, 98.8% of the enumerated ABC-6 topologies possess 8-ring channels, far outnumbering the ones with 12-ring channels (0.2%) and the ones with 6-ring channels only (1.1%). The distribution of ABC-6 topologies among seven possible symmetries is also uneven. 95.7% of the ABC-6 topologies have the symmetry of P3m1, 2.3 and 1.7% belong to P-3m1 and P-6m2, respectively, and those belonging to other symmetries amount only to 0.3%. Most of the enumerated ABC-6 topologies consist of 5∼9 types of constituent cages.

From the stacking sequences we enumerated, we built the corresponding 84,292 atomic models. All of these models were fully optimized as silica polymorphs through a classic molecular mechanics method (see the Methods section and our online database⁴⁵ for more details). The framework energies relative to quartz for all of these models vary between 12.5 and 20.6 kJ (mol Si)⁻¹, and the framework densities vary between 15.6 and 18.7 Si nm⁻³ (Fig. 2a), agreeing well with those of existing ABC-6 zeolites. Moreover, statistics on Si–O, O–Si–O, and Si–O–Si distances in these models well obey the local interatomic distances (LIDs) criteria¹⁹ recently discovered among all existing zeolites, indicating that all of our enumerated topologies are chemically feasible as tectosilicates (see the Methods section for more details). Figure 2b plots the framework density versus stacking compactness for 84,292 optimized ABC-6 models. The stacking compactness is proportional to framework density, just as it is defined. According to this plot, we are able to estimate the framework density of an unknown ABC-6 structure directly from its corresponding stacking sequence.

**Figure 2: Structural attributes of 84,292 optimized ABC-6 models.**

Grouping of ABC-6 structures

Figure 3a demonstrates the plot of lattice dimensions c versus a for all of the optimized atomic models. The a dimensions of the optimized models vary between 1.230 and 1.356 nm, and the c dimensions vary between 0.241 × N and 0.259 × N nm, where N is the number of stacking layers. Surprisingly, all of these models seem to cluster into several groups even for those with identical N. We believe that the grouping of ABC-6 models should arise from the discreteness of their stacking compactness values. For N-layered ABC-6 topologies, the stacking compactness may have N/2+1 or (N-1)/2+1 possible values, depending on whether N is an even or odd number. Figure 3b is the plot of c/a versus stacking compactness, showing the perfect grouping of 84,292 optimized ABC-6 models according to N and the stacking compactness. Thus, all ABC-6 topologies comprised of ≤16 stacking layers can be divided into 78 groups, 20 of which have at least one end member realized already (underlined with short bars in Fig. 3b).We can name each individual group in the form of N–M, where M (written as a Roman numeral) is the rank of its corresponding stacking compactness among all possible values for N-layered structures. For instance, six-layered ABC-6 structures may have four possible stacking compactness values, that is, 6/6, 4/6, 2/6, and 0/6. Thus, among all 6-layered structures, liottite ((ABABAC)) with the highest stacking compactness of 6/6 belongs to Group 6-I, erionite ((AABAAC)) and bellbergite ((AABCCB)) with a stacking compactness of 2/6 belong to Group 6-III, and chabazite ((AABBCC)) with the lowest stacking compactness of 0/6 belongs to Group 6-IV. Figure 3 can be used as a reference to determine the framework structures of new ABC-6 zeolites. When the lattice dimensions of a new ABC-6 zeolite are known, we may refer to these plots to determine which groups the new structure may belong to. Then, the most probable atomic models will be determined from these groups by examining whether their simulated X-ray diffraction patterns match the observed one.

**Figure 3: Grouping of 84,292 optimized ABC-6 models.**

Identification of the most realizable ABC-6 topologies

Although all of our enumerated ABC-6 topologies are chemically feasible as tectosilicates, only 23 of them have been realized as natural minerals or synthetic materials. Among these realized ABC-6 topologies, half possess six-ring channels only, contradicting the enumeration result that only 1.1% of the enumerated topologies do (Supplementary Table 2). We believe these contradictions arise from the fact that many of our enumerated topologies are not practically realizable. Thus far, we have only considered the chemical feasibility of the host frameworks, yet neglecting the contribution of extra-framework cations, anion groups, or water molecules inside the ABC-6 cages. As a matter of fact, all of the realized ABC-6 frameworks can only form when extra-framework species are present, implying that they are highly important to the formation of ABC-6 structures. Considering the strong host-guest interactions between ABC-6 cages and extra-framework species, we believe that these featured constituent cages may hold the key to improve our prediction. After careful examination of the structural information we have extracted from the stacking sequences, we determine, for the first time, that all realized ABC-6 topologies are comprised of no more than four types of constituent cages, as is the case even for 36-layered kircherite, the most complex ABC-6 zeolite ever (Supplementary Table 1). This phenomenon is reasonable because every type of ABC-6 cage holds a specific collection of extra-framework species, which can form only under specific reaction conditions. Structures comprised of many types of cages can form only when the reaction conditions for all constituent cages are simultaneously fulfilled, which will be too difficult to occur in reality. Among the 23 already-realized ABC-6 topologies with ≤16 stacking layers, 2 are comprised of 1 type of cages, 6 comprised of 2, 14 comprised of 3, and the remaining 1 comprised of 4. In contrast, nearly 99% of the enumerated ABC-6 topologies are comprised of 5∼9 types of constituent cages (Supplementary Table 2). After removing all enumerated structures that are comprised of more than four types of cages, only 1,150 remained in the end (Table 1 and Supplementary Fig. 1; see our online database⁴⁵ for more details). Half of these 1,150 topologies possess six-ring channels only, which is consistent with the situation of realized ABC-6 zeolites. The cell dimensions, space groups, largest channel openings, framework energies, framework densities, stacking compactness and extracted constituent cages for these 1,150 ABC-6 structures are provided in Supplementary Data 1. In addition, we have calculated the theoretical solvent-accessible pore volumes and surface areas with respective to H₂O, H₂, CO₂, N₂, and CH₄ for these ABC-6 structures (Supplementary Data 1; see the Methods section for more details). The fractional pore volumes for these five important probe molecules are in the ranges of 5.24–11.93%, 4.05–9.91%, 2.48–7.40%, 1.55–5.75% and 1.21–5.09%, respectively, and the surface areas are in the ranges of 5.55–11.30 Å² Si⁻¹, 4.66–10.07 Å² Si⁻¹, 3.41–7.62 Å² Si⁻¹, 2.54–6.03 Å² Si⁻¹ and 2.17–5.39 Å² Si⁻¹, respectively. These data can be used to prescreen candidate structures for specific gas adsorption or separation applications. Among the 1,150 ABC-6 topologies constructed by no more than four types of constituent cages, 23 have already been realized. We deem the remaining 1,127 ABC-6 structures the most realizable synthetic candidates, because they are both chemically feasible and practically easy to form together with extraframework species. Recently, we have successfully realized two of these candidates, that is, magnesium aluminophosphate JU-60 and zinc aluminophosphate JU-61. These two new ABC-6 zeolites were both synthesized using 1,2-diaminocyclohexane as the structure-directing agent under hydrothermal conditions, and both of their structures were determined through single-crystal X-ray diffraction (see the Methods section for more details about their synthesis and structure determination). JU-60 belongs to Group 10-V, and its corresponding stacking sequence is (AABAACCBCC). JU-60 is comprised of four types of cages, including hexagonal prisms ((aa)), cancrinite cages ((aba)), chabazite cages ((abbcca)) and erionite cages ((abbcbba)), respectively (Fig. 4a). JU-61 belongs to Group 15-VII, and it is the first ABC-6 zeolite comprised of 15 stacking layers ((AABAABBCBBCCACC)). JU-61 consists of four types of cages, including the hexagonal prisms, cancrinite cages, gmelinite cages ((abba)), and a new type of ABC-6 cage ((abbcbbcca)), respectively (Fig. 4b). The synthesis of these new ABC-6 zeolites once again validates our prediction of the most realizable ABC-6 topologies.

Table 1 Numbers of topologically unique and practically realizable ABC-6 topologies*.

Full size table

**Figure 4: Two new ABC-6 topologies recently realized by the authors.**

Discussion

Focusing on the stacking sequences of ABC-6 zeolites, our genomic approach has provided a straightforward and reliable way to predict the most realizable synthetic candidates. More importantly, the key structural information, especially regarding the constituent cages, can be directly extracted from these stacking sequences, the genes of ABC-6 zeolites. Through a computer procedure similar to the enumeration of ABC-6 structures, we have enumerated 57 types of ABC-6 cages comprised of no more than 10 six-ring layers (Fig. 5 and Supplementary Fig. 3; see the Methods section for more details). As the physical and chemical properties of ABC-6 zeolites are mainly determined by their constituent cages, examining these ABC-6 cages enables the high-throughput screening of ABC-6 zeolites for specific applications. For instance, methanol-to-olefin (MTO) conversion over acidic zeolite catalysts has been an important non-petrochemical industrial process to produce highly demanded light olefins via natural gas, coal, or even biomass^46,47. Chabazite and its synthetic counterparts are currently among the best catalysts for MTO reactions, and the shape and size of their featured cage ((abbcca)) are believed to play the key role in this type of reactions by providing suitable confined void space⁴³. To find new ABC-6 catalysts with better MTO performance than chabazite, we have performed density functional theory (DFT) calculations on the methylation of hexamethylbenzene within different ABC-6 cages assuming the same ‘hydrocarbon pool’ mechanism^48,49. This reaction occurs at the beginning of an MTO process and is believed to be the key step to initiate the MTO process⁵⁰. We have calculated seven ABC-6 cages that are in similar size to the chabazite cage and possess many eight-ring windows in favour of olefin diffusion. The reaction barriers and reaction energies of these ABC-6 cages, as well as those of the chabazite cage, are listed in Supplementary Table 9. Two of these ABC-6 cages ((abbccbba) and (abbccbca)) exhibit significantly lower reaction barriers and reaction energies than the chabazite cage, indicating that they may provide more suitable confinement effect on the hydrocarbon species than the chabazite cage (Supplementary Fig. 4). By checking the stacking sequences of the 1,127 most realizable synthetic candidates, we have found that only seven of them possess these ‘superior’ cages (Supplementary Data 1). In particular, two of these seven ABC-6 structures ((AABBAACCAABBCC) and (AABBAACCBBAABBCC)) possess large accessible pore volumes comparable to chabazite, making them the most promising synthetic candidates as new MTO catalysts.

**Figure 5: Enumerated ABC-6 cages constructed by ≤8 six-ring layers.**

Notably, we assume all enumerated ABC-6 topologies are silicate zeolites in this work. In fact, these topologies may also be realizable as other tetrahedrally coordinated materials, such as silicon sulfides, alkali halides, sp³ carbon or silicon allotropes, and Zintl phases, which may have interesting mechanical, electronic, optical and chemical properties^51,52. In particular, a series of zeolitic imidazolate frameworks with ABC-6 topologies have been reported recently, which exhibit the highly desired capability for the capture of fission product⁵³ and CO₂ (ref. 54). Moreover, our genomic approach is valid not only for ABC-6 structures but also for other crystalline materials that are constructed from the stacking of well-defined modular layers.

Methods

Enumeration of ABC-6 stacking sequences

Our computer programs for the enumeration and interpretation of ABC-6 stacking sequences were written in FORTRAN. To enumerate all possible stacking sequences of length N, our computer program went through every N-digit ternary numeral from the smallest one to the largest allowed. Because only unique stacking sequences were needed, we fixed the first digit of every ternary numeral to be ‘0’ and enumerated the remaining (N−1) digits. To guarantee that only the chemically feasible and topologically unique stacking sequences were retained, our computer program performed a two-step examination procedure for each numeral visited. First, our program checked if the current numeral consisted of three or more successive identical digits. If not, this numeral should represent a chemically feasible stacking sequence. Then, our program generated all equivalent numerals for the current one and examined whether any of these equivalent numerals was smaller than the current one. If not, then the current numeral should represent a new stacking sequence. Only the ternary numerals passing both examinations were saved by our computer program. This examination procedure was repeated until the largest allowed N-digit numeral was achieved. The enumeration of ABC-6 cages followed a similar procedure. The only difference was that the ternary numeral for a valid ABC-6 cage should start and end with the same digit, and this digit must be absent in the middle of this ternary numeral. To ensure that all our enumerated cages are topologically unique, we fixed the starting and ending digits to be ‘0’ and enumerated the middle part with digits ‘1’ and ‘2’ only.

Structural information extraction

To extract the channel information from the stacking sequences, our computer program checked the following situations: (1) if a stacking sequence was comprised of only two types of letters, it represented an ABC-6 topology with 12-ring channels running along the c-direction; (2) if a stacking sequence consisted of successive identical letters, it indicated the existence of interconnecting 8-ring channels perpendicular to the c-direction; (3) other stacking sequences corresponded to ABC-6 topologies with six-ring channels only. To calculate the stacking compactness of an ABC-6 topology from its stacking sequence, our computer program counted the number of letters that were distinct from both of their neighbours and divided that value by N, the number of stacking layers.

Geometry optimization of ABC-6 models

The atomic models for 84,292 enumerated stacking sequences were built as silica polymorphs using Materials Studio (Accelrys Software Inc., 2005). The highest symmetries of these models were identified by the ‘Find Symmetry’ tool implemented in Materials Studio. These models were fully optimized without symmetry constraints by GULP⁵⁵ with the Sanders-Leslie-Catlow potentials⁵⁶. All structural models were confirmed to have no imaginary phonon mode.

Evaluation of the optimized ABC-6 models

To evaluate the chemical feasibility of the optimized ABC-6 structures, we have also optimized 208 existing zeolites known to date⁵⁷ as silica polymorphs using the same empirical potentials. The framework densities and framework energies of our enumerated ABC-6 structures agreed well with those of 208 existing zeolite structures. Recently, we proposed a set of LIDs criteria¹⁹, which have proved to be more effective and reliable for structure evaluation than other methods. According to these criteria, the means, standard deviations, and ranges of LIDs in a chemically feasible zeolite structure, including T–O, O–T–O, and T–O–T distances, should obey a set of relationships. In this work, the LIDs in 84,292 optimized ABC-6 structures were calculated using the program FraGen⁵⁸, and the results showed that all of these enumerated ABC-6 structures were chemically feasible. The LIDs for 1,150 most realizable ABC-6 structures are provided as Supplementary Data 2. The solvent-accessible pore volumes and surface areas for 1,150 most realizable ABC-6 structures were calculated using the ‘Volume’ tool implemented in Materials Studio. Rigid spheres with diameters of 2.65, 2.89, 3.30, 3.64 and 3.80 Å were used as the probes, corresponding to the kinetic diameters of H₂O, H₂, CO₂, N₂ and CH₄, respectively.

Synthesis of JU-60 and JU-61

Magnesium aluminophosphate JU-60 and zinc aluminophosphate JU-61 were both synthesized using 1,2-diaminocyclohexane (DACH) as the structure-directing agent under hydrothermal conditions. To synthesize JU-60, 0.1 g of pseudoboehmite (Al₂O₃, 74.3%) and 0.3 g of magnesium acetate were dispersed in 10 ml of H₂O with stirring for 2 h. A volume of 0.5 ml of DACH (99 wt %) was then added into the mixture with stirring, followed by the addition of 0.2 ml of H₃PO₄ (85 wt %). A homogeneous gel was formed with an overall molar composition of 1.0 MgO: 0.5 Al₂O₃: 2.1 H₃PO₄: 2.9 DACH: 404 H₂O. The gel was transferred into a 15-ml Teflon-lined stainless steel autoclave and heated at 180 °C for 3 days. The obtained crystals of JU-60 were separated by filtration, washed with distilled water and dried in air at room temperature. To synthesize JU-61, 0.25 g of pseudoboehmite (Al₂O₃, 62.5%) was dispersed in a mixture of 8 ml of H₂O and 0.4 ml of H₃PO₄ (85 wt%), followed by the addition of 0.41 g of ZnCl₂. After stirring for 2 h, 1.5 ml of DACH (99 wt%) was added. A homogeneous gel was formed after stirring for another 2 h, with an overall molar composition of 1.0 ZnO: 0.5 Al₂O₃: 2.0 H₃PO₄: 4.0 DACH: 152 H₂O. The gel was transferred into a 15-ml Teflon-lined stainless steel autoclave and heated at 180 °C for 5 days. The obtained crystals of JU-61 were separated by filtration, washed with distilled water and dried in air at room temperature.

X-ray structure determination for JU-60 and JU-61

Powder X-ray diffraction data were collected on a Rigaku D/max-2550 diffractometer with Cu Kα radiation (λ=1.5418 Å). Single-crystal X-ray diffraction data were collected on a Bruker AXS SMART APEX II diffractometer using graphite-monochromated Mo Kα radiation (λ=0.71073 Å) at the temperature of 23±2 °C. Data processing was accomplished with the SAINT processing program. The framework structures of JU-60 and JU-61 were solved by direct methods and refined on F² by full matrix least-squares techniques with SHELXTL. Parts of the extra-framework species, such as DACH and water molecules, were located during least-squares refinement. JU-60 consists of three crystallographically distinct Al and three P sites. One of the Al site exhibited an average Al-O bond distance of 1.85 Å, indicating that it was half occupied by Mg. JU-61 consisted of five crystallographically distinct tetrahedrally coordinated sites. Considering the restrictions of the odd number of layers and the Loewenstein’s rule⁵⁹, we had to refine the structure of JU-61 assuming that all of the five tetrahedrally coordinated sites were co-occupied by disordered Al, P, and Zn. The occupancy ratio of Zn to Al was fixed to 2:3 according to the average bond distance in JU-61 (1.66 Å). To remove these disorders, we have also made several attempts to index JU-61 in a doubled unit cell, but the data collected in this way were not good enough for a feasible structure solution. The crystallographic tables, atomic coordinates, selected bond distances and angles, and powder X-ray diffraction patterns for JU-60 and JU-61 are provided in Supplementary Tables 3–8 and Supplementary Fig. 2.

Density functional theory calculations

All of the cage models were cut from the optimized ABC-6 structures. For each ABC-6 cage, one of the Si atoms in the eight-ring window was replaced by Al to produce the Brönsted acid site. The dangling bonds in all cages were saturated by H atoms. All atoms in ABC-6 cages and extra-framework species were fully optimized without any constraint at ONIOM(B3LYP/6–31G(d,p):AM1) level^60,61,62, where the acid site (SiO₃–O–AlO₂–OH–SiO₃ cluster) and extraframework species were in the high level (Supplementary Fig. 4). The achievement of energy minima or saddle points was checked by frequency calculations at the same level. The reaction barrier was calculated as the energy difference between the transition state and the reactant (hexamethylbenzene, methanol and the protonated ABC-6 cage). The reaction energies were calculated as the energy differences between the product (heptamethylbenzenium cation, water, and the deprotonated ABC-6 cage) and the reactant. To improve the precision of weak interaction energy calculations, we have performed single-point energy calculations at ωB97XD/6–31+G(d,p) level⁶³ for all optimized models. All density functional theory calculations were carried out using the Gaussian 09 package⁶⁴.

Additional information

How to cite this article: Li, Y. et al. In silico prediction and screening of modular crystal structures via a high-throughput genomic approach. Nat. Commun. 6:8328 doi: 10.1038/ncomms9328 (2015).

References

Jain, A. et al. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013) .
Article ADS Google Scholar
Deem, M. W. & Newsam, J. M. Determination of 4-connected framework crystal structures by simulated annealing. Nature 342, 260–262 (1989) .
Article ADS CAS Google Scholar
Delgado Friedrichs, O., Dress, A. W. M., Huson, D. H., Klinowski, J. & Mackayk, A. L. Systematic enumeration of crystalline networks. Nature 400, 644–647 (1999) .
Article ADS Google Scholar
Mellot Draznieks, C., Newsam, J. M., Gorman, A. M., Freeman, C. M. & Férey, G. De novo prediction of inorganic structures developed through automated assembly of secondary building units (AASBU method). Angew Chem. Int. Ed. 39, 2270–2275 (2000) .
Article CAS Google Scholar
Mellot-Draznieks, C., Dutour, J. & Férey, G. Hybrid organic-inorganic frameworks: Routes for computational design and structure prediction. Angew Chem. Int. Ed. 43, 6290–6296 (2004) .
Article CAS Google Scholar
Treacy, M. M. J., Rivin, I., Balkovsky, E., Randall, K. H. & Foster, M. D. Enumeration of periodic tetrahedral frameworks. II. Polynodal graphs. Microporous Mesoporous Mater. 74, 121–132 (2004) .
Article CAS Google Scholar
Férey, G., Mellot-Draznieks, C., Serre, C. & Millange, F. Crystallized frameworks with giant pores: are there limits to the possible? Acc. Chem. Res. 38, 217–225 (2005) .
Article PubMed Google Scholar
Fischer, C. C., Tibbetts, K. J., Morgan, D. & Ceder, G. Predicting crystal structure by merging data mining with quantum mechanics. Nat. Mater. 5, 641–646 (2006) .
Article ADS CAS PubMed Google Scholar
Woodley, S. M. & Catlow, R. Crystal structure prediction from first principles. Nat. Mater. 7, 937–946 (2008) .
Article ADS CAS PubMed Google Scholar
O’Keeffe, M., Peskov, M. A., Ramsden, S. J. & Yaghi, O. M. The reticular chemistry structure resource (RCSR) database of, and symbols for, crystal nets. Acc. Chem. Res. 41, 1782–1789 (2008) .
Article PubMed Google Scholar
Nørskov, J. K., Bligaard, T., Rossmeisl, J. & Christensen, C. H. Towards the computational design of solid catalysts. Nat. Chem. 1, 37–46 (2009) .
Article PubMed Google Scholar
Oganov, A. R., Lyakhov, A. O. & Valle, M. How evolutionary crystal structure prediction works-and Why. Acc. Chem. Res. 44, 227–237 (2011) .
Article CAS PubMed Google Scholar
Pophale, R., Cheeseman, P. A. & Deem, M. W. A database of new zeolite-like materials. Phys. Chem. Chem. Phys. 13, 12407–12412 (2011) .
Article CAS PubMed Google Scholar
Dyer, M. S. et al. Computationally assisted identification of functional inorganic materials. Science 340, 847–852 (2013) .
Article ADS CAS PubMed Google Scholar
Curtarolo, S. et al. The high-throughput highway to computational materials design. Nat. Mater. 12, 191–201 (2013) .
Article ADS CAS PubMed Google Scholar
Foster, M. D. et al. Chemically feasible hypothetical crystalline networks. Nat. Mater. 3, 234–238 (2004) .
Article ADS CAS PubMed Google Scholar
Walker, A. M., Slater, B., Gale, J. D. & Wright, K. Predicting the structure of screw dislocations in nanoporous materials. Nat. Mater. 3, 715–720 (2004) .
Article ADS CAS PubMed Google Scholar
Sartbaeva, A., Wells, S. A., Treacy, M. M. J. & Thorpe, M. F. The flexibility window in zeolites. Nat. Mater. 5, 962–965 (2006) .
Article ADS CAS PubMed Google Scholar
Li, Y., Yu, J. & Xu, R. Criteria for zeolite frameworks realizable for target synthesis. Angew Chem. Int. Ed. 52, 1673–1677 (2013) .
Article CAS Google Scholar
Combariza, A. F., Gomez, D. A. & Sastre, G. Simulating the properties of small pore silica zeolites using interatomic potentials. Chem. Soc. Rev. 42, 114–127 (2013) .
Article CAS PubMed Google Scholar
Li, Y. & Yu, J. New stories of zeolite structures: their descriptions, determinations, predictions, and evaluations. Chem. Rev. 114, 7268–7316 (2014) .
Article CAS PubMed Google Scholar
Greeley, J., Jaramillo, T. F., Bonde, J., Chorkendorff, I. & Nørskov, J. K. Computational high-throughput screening of electrocatalytic materials for hydrogen evolution. Nat. Mater. 5, 909–913 (2006) .
Article ADS CAS PubMed Google Scholar
Yang, K., Setyawan, W., Wang, S., Buongiorno Nardelli, M. & Curtarolo, S. A search model for topological insulators with high-throughput robustness descriptors. Nat. Mater. 11, 614–619 (2012) .
Article ADS CAS PubMed Google Scholar
Dubbeldam, D., Krishna, R., Calero, S. & Yazaydın, A. Ö. Computer-assisted screening of ordered crystalline nanoporous adsorbents for separation of alkane isomers. Angew Chem. Int. Ed. 51, 11867–11871 (2012) .
Article CAS Google Scholar
Lin, L.-C. et al. In silico screening of carbon-capture materials. Nat. Mater. 11, 633–641 (2012) .
Article ADS CAS PubMed Google Scholar
Wilmer, C. E. et al. Large-scale screening of hypothetical metal-organic frameworks. Nat. Chem. 4, 83–89 (2012) .
Article CAS Google Scholar
Kim, J., Abouelnasr, M., Lin, L.-C. & Smit, B. Large-scale screening of zeolite structures for CO2 membrane separations. J. Am. Chem. Soc. 135, 7545–7552 (2013) .
Article CAS PubMed Google Scholar
Kim, J. et al. New materials for methane capture from dilute and medium-concentration sources. Nat. Commun. 4, 1694 (2013) .
Article PubMed Google Scholar
Colón, Y. J. & Snurr, R. Q. High-throughput computational screening of metal-organic frameworks. Chem. Soc. Rev. 43, 5735–5749 (2014) .
Article PubMed Google Scholar
Bai, P. et al. Discovery of optimal zeolites for challenging separations and chemical transformations using predictive materials modeling. Nat. Commun. 6, 5912 (2015) .
Article CAS PubMed Google Scholar
Simon, C. M. et al. The materials genome in action: identifying the performance limits for methane storage. Energy Environ. Sci. 8, 1190–1199 (2015) .
Article CAS Google Scholar
Willhammar, T. et al. Structure and catalytic properties of the most complex intergrown zeolite ITQ-39 determined by electron crystallography. Nat. Chem. 4, 188–194 (2012) .
Article CAS PubMed Google Scholar
Esters, M. et al. Synthesis of inorganic structural isomers by diffusion-constrained self-assembly of designed precursors: a novel type of isomerism. Angew Chem. Int. Ed. 54, 1130–1134 (2015) .
Article CAS Google Scholar
Reinen, D. & Lindner, G.-G. The nature of the chalcogen colour centres in ultramarine-type solids. Chem. Soc. Rev. 28, 75–84 (1999) .
Article CAS Google Scholar
Lezhnina, M., Laeri, F., Benmouhadi, L. & Kynast, U. Efficient near-infrared emission from sodalite derivatives. Adv. Mater. 18, 280–283 (2006) .
Article CAS Google Scholar
Shang, J. et al. Discriminative separation of gases by a ‘molecular trapdoor’ mechanism in chabazite zeolites. J. Am. Chem. Soc. 134, 19246–19253 (2012) .
Article CAS PubMed Google Scholar
Hudson, M. R. et al. Unconventional, highly selective CO2 adsorption in zeolite SSZ-13. J. Am. Chem. Soc. 134, 1970–1973 (2012) .
Article CAS PubMed Google Scholar
Xu, S. et al. Direct observation of cyclic carbenium ions and their role in the catalytic cycle of the methanol-to-olefin reaction over chabazite zeolites. Angew. Chem. Int. Ed. 52, 11564–11568 (2013) .
Article CAS Google Scholar
Xie, D. et al. SSZ-52, a zeolite with an 18-layer aluminosilicate framework structure related to that of the DeNOx catalyst Cu-SSZ-13. J. Am. Chem. Soc. 135, 10519–10524 (2013) .
Article CAS PubMed Google Scholar
Moliner, M., Martínez, C. & Corma, A. Synthesis strategies for preparing useful small pore zeolites and zeotypes for gas separations and catalysis. Chem. Mater. 26, 246–258 (2014) .
Article CAS Google Scholar
Olsbye, U. et al. Conversion of methanol to hydrocarbons: how zeolite cavity and pore size controls product selectivity. Angew. Chem. Int. Ed. 51, 5810–5831 (2012) .
Article CAS Google Scholar
Van Speybroeck, V. et al. Mechanistic studies on chabazite-type methanol-to-olefin catalysts: insights from time-resolved UV/Vis microspectroscopy combined with theoretical simulations. Chem. Cat. Chem. 5, 173–184 (2013) .
CAS Google Scholar
Li, X. et al. Confinement effect of zeolite cavities on methanol-to-olefin conversion: a density functional theory study. J. Phys. Chem. C 118, 24935–24940 (2014) .
Article CAS Google Scholar
Smith, J. V. & Bennett, J. M. Enumeration of 4-connected 3-dimensional nets and classification of framework silicates: the infinite set of ABC-6 nets: the Archimedean and σ-related nets. Am. Mineral. 66, 777–788 (1981) .
CAS Google Scholar
Li, Y. & Yu, J. Hypothetical Zeolite Frameworks. Available at < http://mezeopor.jlu.edu.cn/hypo/ > (2015) .
Haw, J. F., Song, W., Marcus, D. M. & Nicholas, J. B. The mechanism of methanol to hydrocarbon catalysis. Acc. Chem. Res. 36, 317–326 (2003) .
Article CAS PubMed Google Scholar
Van Speybroeck, V. et al. First principle chemical kinetics in zeolites: the methanol-to-olefin process as a case study. Chem. Soc. Rev. 43, 7326–7357 (2014) .
Article CAS PubMed Google Scholar
Dahl, I. M. & Solboe, S. On the reaction mechanism for hydrocarbon formation from methanol over SAPO-34: 1. Isotopic labeling studies of the co-reaction of ethene and methanol. J. Catal. 149, 458–464 (1994) .
Article CAS Google Scholar
Dahl, I. M. & Kolboe, S. On the reaction mechanism for hydrocarbon formation from methanol over SAPO-34: 2. Isotopic labeling studies of the co-reaction of propene and methanol. J. Catal. 161, 304–309 (1996) .
Article CAS Google Scholar
Lesthaeghe, D., De Sterck, B., Van Speybroeck, V., Marin, G. B. & Waroquier, M. Zeolite shape-selectivity in the gem-methylation of aromatic hydrocarbons. Angew Chem. Int. Ed. 46, 1311–1314 (2007) .
Article CAS Google Scholar
Wang, H., Tse, J. S., Tanaka, K., Iitaka, T. & Ma, Y. Superconductive sodalite-like clathrate calcium hydride at high pressures. Proc. Natl Acad. Sci. USA 109, 6463–6466 (2012) .
Article ADS CAS PubMed PubMed Central Google Scholar
Kim, D. Y., Stefanoski, S., Kurakevych, O. O. & Strobel, T. A. Synthesis of an open-framework allotrope of silicon. Nat. Mater. 14, 169–173 (2015) .
Article ADS CAS PubMed Google Scholar
Sava, D. F. et al. Capture of volatile iodine, a gaseous fission product, by zeolitic imidazolate framework-8. J. Am. Chem. Soc. 133, 12398–12401 (2011) .
Article CAS PubMed Google Scholar
Nguyen, N. T. T. et al. Selective capture of carbon dioxide under humid conditions by hydrophobic chabazite-type zeolitic imidazolate frameworks. Angew Chem. Int. Ed. 53, 10645–10648 (2014) .
Article CAS Google Scholar
Gale, J. D. GULP: Capabilities and prospects. Z. Kristallogr. 220, 552–554 (2005) .
CAS Google Scholar
Schröder, K.-P., Sauer, J., Leslie, M., Catlow, C. R. A. & Thomas, J. M. Bridging hydroxyl groups in zeolitic catalysts: a computer simulation of their structure, vibrational properties and acidity in protonated faujasites (H-Y zeolites). Chem. Phys. Lett. 188, 320–325 (1992) .
Article ADS Google Scholar
Baerlocher, C. & McCusker, L. B. Database of Zeolite Structures. Available at < http://www.iza-structure.org/databases/ > (2015) .
Li, Y., Yu, J. & Xu, R. FraGen: a computer program for real-space structure solution of extended inorganic frameworks. J. Appl. Cryst. 45, 855–861 (2012) .
Article CAS Google Scholar
Loewenstein, W. The distribution of aluminum in the tetrahedra of silicates and aluminates. Am. Mineral. 39, 92–96 (1954) .
CAS Google Scholar
Chung, L. W. et al. The ONIOM method and its applications. Chem. Rev. 115, 5678–5796 (2015) .
Article CAS PubMed Google Scholar
Tirado-Rives, J. & Jorgensen, W. L. Performance of B3LYP density functional methods for a large set of organic molecules. J. Chem. Theory Comput. 4, 297–306 (2008) .
Article CAS PubMed Google Scholar
Dewar, M. J. S., Zoebisch, E. G., Healy, E. F. & Stewart, J. J. P. AM1: a new general purpose quantum mechanical molecular model. J. Am. Chem. Soc. 107, 3902–3909 (1985) .
Article CAS Google Scholar
Chai, J.-D. & Head-Gordon, M. Long-range corrected hybrid density functionals with damped atom-atom dispersion corrections. Phys. Chem. Chem. Phys. 10, 6615–6620 (2008) .
Article CAS PubMed Google Scholar
Frisch, M. J. et al. Gaussian 09 Gaussian, Inc. (2013) .

Download references

Acknowledgements

This work was supported by the State Basic Research Project of China (Grant No. 2011CB808703) and the National Natural Science Foundation of China (Grant Nos. 91122029; 21273098; 21320102001). Y.L. acknowledges the support by Program for New Century Excellent Talents in University (NCET-13-0246).

Author information

Authors and Affiliations

State Key Laboratory of Inorganic Synthesis and Preparative Chemistry, Jilin University, Qianjin Street 2699, Changchun, 130012, China
Yi Li, Xu Li, Jiancong Liu, Fangzheng Duan & Jihong Yu

Authors

Yi Li
View author publications
You can also search for this author in PubMed Google Scholar
Xu Li
View author publications
You can also search for this author in PubMed Google Scholar
Jiancong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Fangzheng Duan
View author publications
You can also search for this author in PubMed Google Scholar
Jihong Yu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.Y. supervised and coordinated all aspects of the project. Y.L. wrote the computer programs and performed the enumeration, geometry optimization, evaluation, and high-throughput screening of all ABC-6 structures. X.L. performed density functional theory calculations for the selected ABC-6 cages. J.L. and F.D. synthesized JU-60 and JU-61.

Corresponding author

Correspondence to Jihong Yu.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-4, Supplementary Tables 1-9 (PDF 7717 kb)

Supplementary Data 1

1,150 enumerated ABC-6 structural models constructed from no more than four types of cages (including known ABC-6 types). (XLSX 275 kb)

Supplementary Data 2

The means (<D>), standard deviations (σ), and ranges (R) of the local Interatomic Distances (T-O, O-O, and T-T) in 1,150 enumerated ABC-6 structures with no more than four types of constituent cages. (XLSX 182 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Li, Y., Li, X., Liu, J. et al. In silico prediction and screening of modular crystal structures via a high-throughput genomic approach. Nat Commun 6, 8328 (2015). https://doi.org/10.1038/ncomms9328

Download citation

Received: 01 May 2015
Accepted: 11 August 2015
Published: 23 September 2015
DOI: https://doi.org/10.1038/ncomms9328

This article is cited by

High-throughput Screening of Aluminophosphate Zeolites for Adsorption Heat Pump Applications
- Chao Shi
- Jiaze Wang
- Yi Li
Chemical Research in Chinese Universities (2022)
A Cage-based Porous Metal-organic Framework for Efficient C2H2 Storage and Separation
- Hengbo Li
- Kuikui Wang
- Maochun Hong
Chemical Research in Chinese Universities (2022)
Tradeoffs and Compatibilities of Chemical Properties in CpHqFrOs System
- Yasuharu Okamoto
Scientific Reports (2019)
Accelerating the discovery of insensitive high-energy-density materials by a materials genome approach
- Yi Wang
- Yuji Liu
- Yong Tian
Nature Communications (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.