Abstract
The chemical and structural properties of atomically precise nanoclusters are of great interest in numerous applications, but the structures of the clusters can be computationally expensive to predict. In this work, we present the largest database of cluster structures and properties determined using ab-initio methods to date. We report the methodologies used to discover low-energy clusters as well as the energies, relaxed structures, and physical properties (such as relative stability, HOMO-LUMO gap among others) for 63,015 clusters across 55 elements. We have identified clusters for 593 out of 1595 cluster systems (element-size pairs) explored by literature that have energies lower than those reported in literature by at least 1 meV/atom. We have also identified clusters for 1320 systems for which we were unable to find previous low-energy structures in the literature. Patterns in the data reveal insights into the chemical and structural relationships among the elements at the nanoscale. We describe how the database can be accessed for future studies and the development of nanocluster-based technologies.
Similar content being viewed by others
Background & Summary
Small nanoclusters possess novel physical properties which differ from those of their bulk counterparts, including discrete energy levels1,2, nonlinear optical properties3, magnetism4, high catalytic activity5,6,7, multiple absorption bands7,8, and enhanced photoluminescence9,10,11,12. These properties emerge as a consequence of their small sizes and relatively high number of uncoordinated atoms on the surface, and they can be tuned by altering the size and shape of the cluster13,14. The past few decades have shown significant progress in computational methods to predict these properties, but before a property can be calculated it is necessary to first determine the atomic structure of the cluster. A number of algorithms have been developed to predict the ground state atomic structure by comprehensively sampling the potential energy surface (PES) of the cluster. These methods include genetic algorithms15,16,17, simulated annealing18, particle swarm optimization19, Bayesian optimization20 and basin-hopping21 methods. In each of these methods, identification of the ground state structure is accomplished by first calculating the energies of a large number of candidate structures and then selecting the structures with the lowest energies.
Due in part to strong quantum finite-size effects, accurate determination of the relative cluster energies is best accomplished using ab-initio calculations, and the number of low-energy configurational isomers is estimated to grow exponentially with the number of atoms in the cluster22,23. For these reasons searching for the ground state atomic structure can be computationally demanding. An alternative approach to identifying low-energy structures is to search through a reference database of known structures. In recent years, materials databases for crystalline materials have transformed materials research24,25,26,27,28. However, current nanocluster structure datasets are either unavailable to the public, limited in scope, or primarily utilize lower levels of theory like interatomic potential models29,30,31,32 and tight binding models33.
Here we present the Quantum Cluster Database (QCD) of low-energy cluster structures for 55 elements, for clusters of 3–55 atoms, calculated using density functional theory (DFT)34. The database contains structures obtained through an ab-initio genetic algorithm, an accelerated genetic algorithm using machine-learned interatomic potentials, regression analysis of chemically similar elements, and a survey of the scientific literature. The structures from the scientific literature include 3682 nanoclusters collected or derived from the Cambridge Cluster Database (CCD)35, which were identified mostly using interatomic potentials. The 55 elements encompass different regions of the periodic table, including alkali and alkaline earth metals, transition metals, post transition metals, metalloids, and non-metals. Although we are continuing to add to this data set, to the best of our knowledge, this dataset already constitutes the most extensive collection of computed cluster structures at the DFT level of theory. The data set can be used to guide experimental synthesis of predicted nanoclusters, to guide searches for low-energy clusters in different chemical environments, to computationally screen for clusters suitable for a variety of applications, or to train machine learning models. Since the structural energies were obtained using a consistent computational method, the data also serves as a direct source for comparative benchmark studies of different DFT or other electronic structure techniques within the context of atomic cluster modelling. All atomic structures and their calculated properties are openly accessible, enabling researchers across the world to access it for free and use it for further analysis.
Methods
We have used the following methods to populate the Quantum Cluster Database with atomically precise nanoclusters:
-
1.
We have searched the literature for coordinates of previously discovered candidate low-energy clusters. The atomic structures are available on our QCD website (http://muellergroup.jhu.edu/qcd) and their literature sources are summarized in Table 1 and Fig. 5.
-
2.
We have used a genetic algorithm with ab-initio calculations. This method36 was primarily used to identify structures of sizes and elements that are computationally cheap as determined by the number of valence electrons in the projector augmented wave potentials used (Supplementary Table 1), such as Mg, Li, Sb, Na, Ga, Si, Al, B, C, P and S.
-
3.
We have used a genetic algorithm accelerated by actively learned moment tensor potentials (MTP)37,38,39,40 trained on-the-fly. This method has been used to search clusters for a few sizes of Al36, B, C, P and S.
-
4.
We have used correlations among energies of same structures but different elements to generate low-energy clusters of elements from known low-energy structures of chemically similar elements.
A brief description of each of these methods is provided below, with additional details in the Supplementary Note 1.
Low-energy structures mined from existing literature
Many of the clusters in the QCD have been studied before, including systematic DFT studies of small and large clusters across different elements. We collected atomic structures of clusters from publications that provide atomic coordinates of reported low-energy structures, as calculated using DFT, and from the Cambridge Cluster Database, which consists primarily of structures discovered using empirical potentials36,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94. All structures from the literature were relaxed using our DFT settings as described in the section on DFT calculations. In Table 1, we provide the literature references for these cluster structures grouped by element.
Low-energy structures from ab initio genetic algorithm searches
Low-energy cluster structures were also identified by means of a genetic algorithm (GA)95,96, an optimization algorithm based on natural evolution in which beneficial characteristics prevail over successive generations. In our implementation36, a GA run begins by populating a pool of clusters with random structures and/or seed structures assembled from previous GA runs. All cluster energies are evaluated by relaxing the atomic positions using DFT34. Child clusters are generated from pool clusters using one of two types of operations: crossover, in which parts of each parent cluster are combined to form a child cluster, and mutation, in which a subset of atoms of a cluster structure are randomly relocated. If a child cluster has lower energy than the highest-energy pool cluster and is not structurally equivalent to other pool clusters, it replaces the highest-energy cluster of the pool. The cycle continues until the total number of clusters in the GA run is at least 1000. After the search, the 10 lowest-energy clusters (where available) for each element-size pair were collected and added to QCD. Additional details of the genetic algorithm method can be found in SI section 1 as well as the work by Wang et al.36.
Learning on the Fly (LOTF)-GA
We have recently developed a way to accelerate the genetic algorithm using machine-learned interatomic potentials trained on-the-fly using active learning36,37,38. The machine-learned interatomic potentials are used to quickly identify candidate low-energy clusters, which are further relaxed locally by DFT to refine the energies. This method has been used to identify low-energy structures for some sizes of Al, B, C, P and S. Additional details about this method can be found in reference36.
Clusters built from low-energy structures of chemically similar elements
We have constructed additional low-energy cluster structures by taking advantage of the fact that for some elements there are strong correlations between the total energies of chemically similar cluster structures. Low-energy clusters of one element can be used as template to quickly generate low-energy clusters of the positively correlated elements by rescaling the template in proportion to the ratio of nearest neighbor distances. To identify these relationships, we created 55 representative cluster structure prototypes in a two-step process.
In the first step, we used the genetic algorithm to identify low energy structures for clusters of 5, 10, 15 and 20 atoms for Al, Be, Li, Mg, Na, Si, Ta, and Ti. These elements were chosen because they cover different parts of the periodic table and are computationally inexpensive relative to others because of the small numbers of valence electrons. The low energy configurations are provided in Supplementary Note 5.
In the second step, we used these clusters as templates to create clusters of all the other elements. For each target element, the interatomic distances in the cluster were scaled by the ratio of the nearest neighbor distances of the target element and the template element. The nearest neighbor distances (Supplementary Table 2) are the bond lengths in their most stable bulk form retrieved from the Materials Project26. To identify a chemically diverse set of elements, we used least-squares regression to express the DFT-calculated energies of unrelaxed clusters for each element as a linear combination of the energies of the remaining 54 elements. The residual errors for these fits provide a measure of the extent by which each element is different from the other 54 elements. We selected 13 elements with the highest errors: B, Ba, Be, Ca, Cr, Cs, K, Li, Mg, Na, Rb, Sr, and Zn, as these are likely to have distinct ground state structures. We then used the genetic algorithm to search for low-energy structures for clusters of 10, 15, 20, 25, and 30 atoms for these 13 elements. The low-energy structures discovered by the genetic algorithm are shown in Fig. 1, and their coordinates are provided in the Supplementary Note 6. These 65 structures were used as structural templates to determine again correlations among energies of different elements, following the same procedure described above. The correlation values are plotted in the heat map of Fig. 2. A positive correlation between a pair of elements means that a cluster structure having high energy for one element also tends to have high energy for the other element, while the negative correlation indicates the reverse relationship.
To evaluate the diversity of the 65 template structures, we compared the clusters using a structural similarity score as described in reference97, where perfectly similar structures have a score of 0.0, and we consider structures with a score above 0.3 to be dissimilar. Across the 5 different sizes and 13 different elements, only five pairs have a similarity score less than 0.3, indicating that the remaining pairs of structures are structurally distinct.
After discovering low-energy clusters with the genetic algorithm, we filled gaps on the database (i.e., elements and sizes where no clusters were available) using correlations among the energies of elements (as shown in Fig. 2). For a gap of a given system (an element-size pair), we identified the most correlated element and used its 5 lowest-energy clusters of the same size as templates to generate new clusters that were likely to have low energy. We followed the process of re-scaling the interatomic distances using the bulk nearest-neighbor bond lengths.
DFT calculations
All DFT local energy minimizations were carried out using the Vienna ab initio Simulation Package98 (VASP) with the Perdew-Burke-Ernzerhof99 (PBE) generalized gradient approximation exchange-correlation functional. We found that VASP was particularly efficient for clusters with a large number of atoms, which consumed the greatest amount of computational resources. We used the projector-augmented wave100 method, with the pseudopotentials and the corresponding default cutoff energies listed in Supplementary Table 1. The convergence criterion for electronic self-consistency was set to 10−5 eV per cluster. Structures were optimized using the conjugate gradient algorithm101,102 or the RMM-DIIS algorithm103 as implemented in VASP until all the atomic forces were less than 0.1 eV/Å. Low-energy clusters within 1 eV from the lowest-energy cluster of each system and all clusters collected from the literature (in total 31,911 clusters) were re-optimized with a tighter force-convergence criterion of 0.025 eV/Å, to increase the accuracy of the low-energy isomers. All calculations were run at the gamma point with spin polarization. The magnetic moments were initialized as 1 µB/atom for non-magnetic elements. For magnetic elements, local magnetic moments were written out for most calculations and the detailed initialization scheme can be found in a separate section below. Gaussian smearing104 with σ = 0.0001 eV was used to achieve high accuracy when calculating final energies (2283 clusters out of the total 63,015 clusters used σ = 0.001). To accelerate convergence for some clusters, a two-step minimizations scheme was adopted with smearing of 0.1 eV in the initial step for faster convergence and a smaller value of 0.0001 eV for the final step. Symmetry was turned off for all DFT calculations to increase the chance of completing the calculations successfully. We found the inclusion of spin-orbit coupling (SOC) had little effect on the ranking of low-energy structures. To maintain the consistency of settings of DFT calculations, we did not include SOC-predicted total energies and atomic structures in the QCD database.
All DFT calculations were performed using VASP which can only perform periodic calculations, so each cluster is in effect surrounded by translationally equivalent clusters. Hence it is essential to use a simulation cell that is sufficiently large to avoid interactions among periodic images. For all elements, we enforced that the minimum distance between atoms in periodic images must be greater than 10 angstroms. Additionally, for elements in the Groups 1 A and 2 A of the periodic table, the minimum distance between neighboring images must also be greater than 3.5 times the nearest-neighbor distances listed in Supplementary Table 2. If, after relaxation, the minimum distance between neighboring images shrank below the aforementioned values, we increased the supercell size and ran the DFT calculation again. We found that these “box size” criteria are sufficient to reach energy convergence within 2 meV/atom in all 1135 tested cases (see Supplementary Fig. 7 and Supplementary Note 4 for more details) with a root mean squared error of 0.118 meV/atom.
Workflow
We identified candidate low-energy cluster structures using one of the four methods listed above. DFT calculations were performed on these clusters structures before adding them to the database. An outline of the high-throughput workflow used in these DFT calculations is provided in Fig. 3. We first initialized calculations of magnetic elements with proper magnetic values (details in next section). Then we ran ionic relaxations and checked convergence. For calculations that did not converge, we adjusted input parameters such as the step size of optimization algorithms and charge mixing parameters and reran them until convergence was achieved.
The atoms sometimes form periodic configurations that correspond to nanowires or slabs. We filtered out these types of structures by discarding clusters that had a minimum distance between periodic images smaller than 1.5 times the atomic nearest neighbor distance. We also screened for discontiguous clusters using this same criterion and discarded any discontiguous clusters that were identified.
To ensure the Quantum Cluster Database contains only unique clusters for a given element and size, when two clusters had a structural similarity score97 less than 0.3, the cluster with higher energy was discarded. If the higher-energy cluster was from the literature, the appropriate literature references would be assigned to the structurally similar low-energy cluster. All filters that ensure quality of the clusters in the database are summarized in the Fig. 3.
The properties and metadata described in the Data Records section were calculated for each cluster and stored in a PostgreSQL database. Finally, the data are displayed in the Quantum Cluster Database website (http://muellergroup.jhu.edu/qcd) and output as a JSON file and a CSV file.
Treatment of magnetic clusters
The final magnetic state of a cluster may depend on the atomic magnetic moments used to initialize the calculation. The final magnetic state is particularly likely to be non-zero for elements with non-zero magnetic moments in their elemental bulk phase, specifically Fe, Mn, Co, Ni, Ru, Rh, V, Cu, and Cr. For elements other than these, we initialized spin-polarized calculations with the default magnetic moment (1 μB/atom). For the magnetic elements, we performed a benchmark on 2228 clusters with 3 to 55 atoms selected from an early version of QCD and initialized spin-polarized calculations with 4 different magnetic moments, namely 1 μB/atom, 2 μB/atom, 3 μB/atom, and 5 μB/atom, to evaluate the effect of the initial magnetic moments on the final magnetic states and the total energies. We found that the final magnetic states for Fe, Mn, Ru, Rh, V, and Cr clusters are particularly likely to depend on the initial magnetic moments, whereas for Cu and Co the initialization with 3 μB/atom relaxed into the lowest energy configurations in almost all of the benchmarked clusters. For Ni clusters, the final states were independent of initialization, so we used the default 1 μB/atom in the QCD calculations. Supplementary Table 4 showed the effects of different initial magnetic moments on the final magnetic states. To mitigate the chance of missing the correct final magnetic states, multiple initial magnetic moments were used for Fe, Mn, Ru, Rh, V, and Cr clusters, and the calculations yielding the lowest total energies were included in QCD. The set of initial magnetic moments of each element were chosen such that they led to the lowest energy states for more than 97% of all benchmarked clusters of the corresponding element. Table 2 lists the selected set of magnetic moments for the six elements, together with the single initialization value for Co, Cu, and Ni.
Data Records
We have created a website at http://muellergroup.jhu.edu/qcd to host the database. It provides downloadable links to the correlation table (Fig. 2) and an archive of all relaxed cluster structures, and individual webpages for each cluster with interactive visualization and tabulated cluster properties (discussed in the next section). The input and output files of the DFT calculations of all 63,015 clusters are publicly available (licensed under CC-BY-4.0) in the NOMAD database105 at https://doi.org/10.17172/NOMAD/2023.02.01-1106. A link is created at the individual cluster webpage on the QCD website connecting to the corresponding data entry in the NOMAD database, where the DFT files can be easily downloaded.
File format
Properties of all clusters are available for download as a JSON file and as a.csv file on the Quantum Cluster Database website. In the JSON file, each cluster is stored as a key/value pair with “cluster_id” as the key and an object composed of all quantities listed in Table 3 as the value. Within the object, properties of the corresponding cluster are also stored as key/value pairs with keys being those listed in Table 3. The columns of the.csv file correspond to the keys described in Table 3. The input and output VASP files for DFT calculation of each cluster are available in the NOMAD repository at https://doi.org/10.17172/NOMAD/2023.02.01-1.
Properties
For each cluster of a given number of atoms N and element type k, the database contains the energy relative to the lowest energy structure of size N and species k, the formation energy with respect to the lowest-energy cluster of size N-1 of species k (Eq. (1)), the formation energy with respect to the lowest-energy cluster with N + 1 atoms of the same species (Eq. (2)), the HOMO-LUMO gap, the number of valence electrons considered by DFT, the magnetic moment, a list of similar structures within the Quantum Cluster Database, a list of literature references for the cluster (“http://muellergroup.jhu.edu/qcd” if it was generated by GA or low-energy clusters of chemically similar elements), the coordinates (downloadable in XYZ format and the VASP POSCAR format), and an interactive visualization of the cluster. The formation energies are calculated using the following equations:
where EN is the energy of this cluster of size N, EN-1 is the energy of the lowest-energy cluster of size N-1, EN+1 is the energy of the lowest-energy cluster of size N + 1, and Eatom is the energy of an isolated atom. The energies for isolated atoms used in these calculations are provided in Supplementary Table 3.
The sizes of simulation cells determine the distances between periodic images and can be important for reproducing our results. Therefore, at the structure-view page of each cluster on the QCD website, we provide a link to the relaxed structure in the VASP POSCAR format, which contains the cell lattice vectors and from which the lengths of the simulation cell can be readily calculated.
As a summary, we listed below in Table 4 the links through which readers can access the information discussed in this work.
Technical Validation
Comparison of literature clusters and newly reported QCD clusters
For a given cluster size and element, we compared the lowest-energy cluster from the literature against the lowest-energy cluster newly reported in the database to assess which had lower energy. There are 1595 systems for which there is at least one literature structure in the database. Out of those, the database has discovered new lowest-energy clusters for 593 systems that are lower in energy by at least 1 meV/atom (Fig. 4).
The Quantum Cluster Database contains 1379 structure types or templates (i.e., relative arrangements of atoms) that were not previously reported in the literature (Fig. 5). The 1379 templates were identified from the set of all clusters with calculated energies within 1 meV/atom of the lowest energy cluster with the same element and size. In comparison, there are 582 templates of low-energy clusters from the literature.
Before our work, there were 1595 cluster systems, or approximately 55% of the total 2915 systems, that had at least one structure whose atomic coordinates are available in literature (including the CCD). With the Quantum Cluster Database, the percentage increases to 100%. Table 5 provides a summary of the statistics of cluster systems and total number of clusters from different approaches. We note the sum of the numbers of clusters from different sources does not equal the total number of clusters in the QCD because some clusters are found in multiple sources, as shown in Table 5.
Magnetization of magnetic elements
We performed a more in-depth analysis on the final magnetic moments of the nine magnetic elements listed above (Fe, Mn, Co, Ni, Ru, Rh, V, Cu, and Cr). We performed our analysis on 12,171 DFT calculations which listed local atomic magnetic moments. We calculated for each cluster the ratio of opposite local magnetic moments:
where μup is the sum of local magnetic moments with positive values, μdown represents the sum of local magnetic moments with negative values, and the function min() takes the minimum of the two values. Figure 6 shows the distribution of the ratio of opposite local magnetic moments across the investigated cluster sizes. Although Cr is the only antiferromagnetic element in bulk phase at room temperature, we found antiferromagnetic nanoclusters with significant local magnetic moments (≥0.5 μB in spin-up and down direction) in five elements, Cr, Mn, Ru, Rh, V. As the cluster size increases, the lower bound of the opposite moment ratios in Cr gradually increases, indicating more and more spins of Cr atoms tend to order antiparallelly. This is likely because there are more high-coordinated atoms as cluster size rises, whose local atomic environments mimic that of the bulk phase. Aside from the antiferromagnetic isomers with the ratio of opposite moments close to 1, there are many clusters with ratios between 0 and 1 for the five elements (Cr, Mn, Ru, Rh and V), exhibiting states similar to ferrimagnetic configurations, which might result from geometric frustration. Because of the complex geometry of nanoclusters and the lack of a periodic lattice, it is hard for the spins of neighboring atoms to order perfectly in an antiparallel pattern. The ferromagnetic elements Fe, Co, and Ni remain ferromagnetic as atomic clusters. Cu mostly alternates between non-magnetic and ferromagnetic with a total magnetic moment of 0 and 1 μB for even and odd-sized clusters because of the odd number of valance electrons. For the few cases of Cu where the ratio of opposite moments is large, the magnitudes of local moments are very small, suggesting a non-magnetic nature.
Effect of spin-orbit coupling
We investigated the effect of spin-orbit coupling (SOC) on heavy-metal elements by performing additional PBE + SOC calculations on the lowest energy clusters of 11 heavy-metal elements, namely Au, Bi, Hf, Hg, Ir, Os, Pb, Pt, Re, Ta, and Tl, selected based on the work by Piotrowski et al.42, with sizes ranging from 3 to 55 atoms. Additionally, we choose the six (where available) lowest energy isomers for small (size 10), medium (size 30), and large (size 55) clusters to study the effect of SOC on relative ordering. The energies calculated by PBE + SOC linearly correlated with energies computed by PBE, as shown in Fig. 7. We also developed a cheap proxy to approximate PBE + SOC computed energies from PBE computed energies for these 11 different elements using the least-squares regression. The conversion factors (slope and intercept) for converting to PBE + SOC energies from PBE energies for these elements are provided in Supplementary Fig. 3. We also found that the use of SOC has little effect on the energy rankings of isomers (Supplementary Fig. 4–6). For three small clusters of systems Pb10, Tl10 and Hf10, the PBE + SOC relaxed structures are geometrically dissimilar to the relaxed PBE structures with similarity scores larger than 0.3, and therefore are excluded from the comparisons in Supplementary Fig. 4–6.
Usage Notes
The homepage of the QCD website (http://muellergroup.jhu.edu/qcd) provides a view of the periodic table and downloadable links to an archive of all relaxed structures and the files summarizing properties of all clusters (in JSON and CSV format). When clicking on one of the 55 elements reported in this work, the other elements will be colored according to the energy correlations (Fig. 8a) and a list of clusters with 3 to 55 atoms of this element will be displayed below the periodic table (Fig. 8b). By default, clusters within 200 meV from the lowest-energy cluster of each size will be displayed. This range can be changed in the input box right beneath the periodic table. For each cluster in the list, a snapshot of the relaxed structure is provided, along with the energy relative to the lowest energy cluster of the corresponding system.
Clicking on a particular cluster from the list brings users to the structure-view page of this specific structure (Fig. 8c). This webpage tabulates the properties of the cluster listed in Table 3 (except for the DFT total energy and the atomic coordinates in POSCAR format). Beside the table, the webpage provides an interactive visualization of the relaxed structure and downloadable links to the structure in XYZ format and POSCAR format of VASP, the VASP input file (INCAR) and a file containing references of this cluster in the BibTex format. The entire set of input and output files of this cluster can also be downloaded from the link to the NOMAD repository at the header of the individual entry page.
Code availability
The implementations of the DFT and MTP genetic algorithms used to search for low-energy structures are available via GitLab: https://gitlab.com/muellergroup/cluster-ga. The scripts and code for managing QCD, for example merging new clusters into QCD, updating existing clusters with DFT calculations using updated parameters, generating metadata of clusters listed in Table 3, are also open-sourced at https://gitlab.com/muellergroup/qcd_mgmt.
References
Jena, P. & Sun, Q. Super Atomic Clusters: Design Rules and Potential for Building Blocks of Materials. Chem. Rev. 118, 5755–5870 (2018).
Wilcoxon, J. P. & Abrams, B. L. Synthesis, structure and properties of metal nanoclusters. Chemical Society Reviews 35, 1162–1194 (2006).
Jin, R. Atomically precise metal nanoclusters: stable sizes and optical properties. Nanoscale 7, 1549–1565 (2015).
Cramer, C. J. & Truhlar, D. G. Density functional theory for transition metals and transition metal chemistry. Physical Chemistry Chemical Physics 11, 10757–10816 (2009).
Li, G. & Jin, R. Atomically Precise Gold Nanoclusters as New Model Catalysts. Acc. Chem. Res. 46, 1749–1758 (2013).
Batista, K. E. et al. Ab Initio Investigation of CO2 Adsorption on 13-Atom 4d Clusters. Journal of chemical information and modeling 60, 537–545 (2020).
Felício-Sousa, P. & Andriani, K. F. & Da Silva, J. L. Ab initio investigation of the role of the d-states occupation on the adsorption properties of H 2, CO, CH 4 and CH 3 OH on the Fe 13, Co 13, Ni 13 and Cu 13 clusters. Physical Chemistry Chemical Physics 23, 8739–8751 (2021).
Jia, X., Li, J. & Wang, E. Cu Nanoclusters with Aggregation Induced Emission Enhancement. Small 9, 3873–3879 (2013).
Zhang, Y. et al. Unique size-dependent nanocatalysis revealed at the single atomically precise gold cluster level. Proceedings of the National Academy of Sciences 115, 10588 (2018).
Chakraborty, I. & Pradeep, T. Atomically Precise Clusters of Noble Metals: Emerging Link between Atoms and Nanoparticles. Chemical Reviews 117, 8208–8271 (2017).
Watanabe, Y. Atomically precise cluster catalysis towards quantum controlled catalysts. Science and Technology of Advanced Materials 15, 063501 (2014).
Zhu, Y., Qian, H. & Jin, R. Catalysis opportunities of atomically precise gold nanoclusters. Journal of Materials Chemistry 21, 6793–6799 (2011).
Li, Z. Y. et al. Three-dimensional atomic-scale structure of size-selected gold nanoclusters. Nature 451, 46–48 (2008).
Castleman, A. W. & Khanna, S. N. Clusters, Superatoms, and Building Blocks of New Materials. The Journal of Physical Chemistry C 113, 2664–2675 (2009).
Wu, S. Q. et al. An adaptive genetic algorithm for crystal structure prediction. J. Phys.: Condens. Matter 26, 035402 (2013).
Curtis, F. et al. GAtor: A First-Principles Genetic Algorithm for Molecular Crystal Structure Prediction. Journal of Chemical Theory and Computation 14, 2246–2264 (2018).
Jennings, P. C., Lysgaard, S., Hummelshøj, J. S., Vegge, T. & Bligaard, T. Genetic algorithms for computational materials discovery accelerated by machine learning. npj Computational Materials 5, 46 (2019).
Kirkpatrick, S., Gelatt, C. D. & Vecchi, M. P. Optimization by simulated annealing. science 220, 671–680 (1983).
Lv, J., Wang, Y., Zhu, L. & Ma, Y. Particle-swarm structure prediction on clusters. The Journal of Chemical Physics 137, 084104 (2012).
Yamashita, T. et al. Crystal structure prediction accelerated by Bayesian optimization. Physical Review Materials 2, 013803 (2018).
Yang, S. & Day, G. M. Exploration and Optimization in Crystal Structure Prediction: Combining Basin Hopping with Quasi-Random Sampling. Journal of Chemical Theory and Computation 17, 1988–1999 (2021).
Stillinger, F. H. Exponential multiplicity of inherent structures. Phys. Rev. E 59, 48–51 (1999).
Heard, C. J. & Johnston, R. L. in Challenges and Advances in Computational Chemistry and Physics Vol. 23 (eds M. Nguyen & B. Kiran) (Springer, Cham, 2017).
Curtarolo, S. et al. AFLOW: An automatic framework for high-throughput materials discovery. Computational Materials Science 58, 218–226 (2012).
Saal, J. E., Kirklin, S., Aykol, M., Meredig, B. & Wolverton, C. Materials design and discovery with high-throughput density functional theory: the open quantum materials database (OQMD). Jom 65, 1501–1509 (2013).
Jain, A. et al. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Materials 1, 011002 (2013).
Choudhary, K. et al. The joint automated repository for various integrated simulations (JARVIS) for data-driven materials design. npj Computational Materials 6, 1–13 (2020).
Zhou, J. et al. 2DMatPedia, an open computational database of two-dimensional materials from top-down and bottom-up approaches. Scientific data 6, 1–10 (2019).
Joswig, J.-O. & Springborg, M. Genetic-algorithms search for global minima of aluminum clusters using a Sutton-Chen potential. Physical Review B 68, 085408 (2003).
Shao, X., Liu, X. & Cai, W. Structural optimization of silver clusters up to 80 atoms with Gupta and Sutton-Chen potentials. Journal of chemical theory and computation 1, 762–768 (2005).
Grigoryan, V. G., Alamanova, D. & Springborg, M. Structure and energetics of CuN clusters with (2 ≤ N ≤ 150): An embedded-atom-method study. Phys. Rev. B 73, 115415 (2006).
Loeffler, T. D. et al. Active Learning A Neural Network Model For Gold Clusters & Bulk From Sparse First Principles Training Data. ChemCatChem 12, 4796–4806 (2020).
Dong, X., Wang, G. M. & Blaisten-Barojas, E. Tight-binding model for calcium nanoclusters: Structural, electronic, and dynamical properties. Physical Review B 70, 205409 (2004).
Kohn, W. & Sham, L. Phys. Rev. A. Self–Consistent Equations Including Exchange and Correlation Effects 140, A1133–A1138 (1965).
Wales, D. J. et al. The Cambridge Cluster Database https://www-wales.ch.cam.ac.uk/CCD.html (2022).
Wang, Y. et al. Accelerated prediction of atomically precise cluster structures using on-the-fly machine learning. npj Comput. Mater. 8, 173 (2022).
Shapeev, A. V. Moment Tensor Potentials: A Class of Systematically Improvable Interatomic Potentials. Multiscale Model. Simul. 14, 1153–1173 (2016).
Podryabinkin, E. V. & Shapeev, A. V. Active learning of linearly parametrized interatomic potentials. Computational Materials Science 140, 171–180 (2017).
Zuo, Y. et al. Performance and cost assessment of machine learning interatomic potentials. The Journal of Physical Chemistry A 124, 731–745 (2020).
Novikov, I. S., Gubaev, K., Podryabinkin, E. V. & Shapeev, A. V. The MLIP package: moment tensor potentials with MPI and active learning. Machine Learning: Science and Technology 2, 025002 (2020).
Chaves, A. S., Piotrowski, M. J. & Da Silva, J. L. F. Evolution of the structural, energetic, and electronic properties of the 3d, 4d, and 5d transition-metal clusters (30 TMn systems for n = 2–15): a density functional theory investigation. Physical Chemistry Chemical Physics 19, 15484–15502 (2017).
Piotrowski, M. J. et al. Theoretical Study of the Structural, Energetic, and Electronic Properties of 55-Atom Metal Nanoclusters: A DFT Investigation within van der Waals Corrections, Spin–Orbit Coupling, and PBE+U of 42 Metal Systems. J. Phys. Chem. C 120, 28844–28856 (2016).
Doye, J. P. K. & Wales, D. J. Structural consequences of the range of the interatomic potential A menagerie of clusters. J. Chem. Soc., Faraday Trans. 93, 4233–4243 (1997).
Doye, J. P. K., Wales, D. J. & Berry, R. S. The effect of the range of the potential on the structures of clusters. J. Chem. Phys. 103, 4234–4249 (1995).
Wales, D. J. & Doye, J. P. K. in Large Clusters of Atoms and Molecules (ed Martin, T. P.) 241–279 (Springer Netherlands, 1996).
Chen, M., Dyer, J. E., Li, K. & Dixon, D. A. Prediction of Structures and Atomization Energies of Small Silver Clusters, (Ag)n, n <100. J. Phys. Chem. A 117, 8298–8313 (2013).
Doye, J. P. K. & Wales, D. J. Global minima for transition metal clusters described by Sutton–Chen potentials. New Journal of Chemistry 22, 733–744 (1998).
Galvão, B. R. L. & Viegas, L. P. What Electronic Structure Method Can Be Used in the Global Optimization of Nanoclusters? J. Phys. Chem. A 123, 10454–10462 (2019).
Doye, J. P. K. & Wales, D. J. The effect of the range of the potential on the structure and stability of simple liquids: from clusters to bulk, from sodium to C60. J. Phys. B: At. Mol. Opt. Phys. 29, 4859–4894 (1996).
Aguado, A. & López, J. M. Structures and stabilities of Aln+, Aln, and Aln− (n = 13–34) clusters. J. Chem. Phys. 130, 064704 (2009).
Song, W., Lu, W.-C., Wang, C. Z. & Ho, K. M. Magnetic and electronic properties of the nickel clusters Nin (n ≤ 30). Comput. Theor. Chem. 978, 41–46 (2011).
Liu, X. J., Hamilton, I., Krawczyk, R. P. & Schwerdtfeger, P. The stability of small helical gold nanorods: a relativistic density functional study. J Comput Chem 33, 311–318 (2012).
Fa, W., Luo, C. & Dong, J. Bulk fragment and tubelike structures of AuN (N = 2-26). Phys. Rev. B 72, 205428 (2005).
Fernández, E. M., Soler, J. M., Garzón, I. L. & Balbás, L. C. Trends in the structure and bonding of noble metal clusters. Phys. Rev. B 70, 165403 (2004).
Zhao, H.-Y. et al. Structural evolution of Aun (n = 20–32) clusters: Lowest-lying structures and relativistic effects. Phys. Lett. A 374, 1033–1038 (2010).
Sai, L. et al. Structural Evolution of Medium-Sized Phosphorus Clusters (P20–P36) from Ab Initio Global Search. J. Cluster Sci. 31, 567–574 (2020).
Tai, T. B. & Nguyen, M. T. Electronic structure and photoelectron spectra of Bn with n = 26–29: an overview of structural characteristics and growth mechanism of boron clusters. Physical Chemistry Chemical Physics 17, 13672–13679 (2015).
Tai, T. B., Duong, L. V., Pham, H. T., Mai, D. T. T. & Nguyen, M. T. A disk-aromatic bowl cluster B30: toward formation of boron buckyballs. Chem. Commun. 50, 1558–1560 (2014).
Pham, H. T., Duong, L. V., Tam, N. M., Pham-Ho, M. P. & Nguyen, M. T. The boron conundrum: Bonding in the bowl B30 and B36, fullerene B40 and triple ring B42 clusters. Chem. Phys. Lett. 608, 295–302 (2014).
Tai, T. B. & Nguyen, M. T. A new chiral boron cluster B44 containing nonagonal holes. Chem. Commun. 52, 1653–1656 (2016).
Pham, H. T., Duong, L. V., Pham, B. Q. & Nguyen, M. T. The 2D-to-3D geometry hopping in small boron clusters: The charge effect. Chem. Phys. Lett. 577, 32–37 (2013).
Tai, T. B., Grant, D. J., Nguyen, M. T. & Dixon, D. A. Thermochemistry and Electronic Structure of Small Boron Clusters (Bn, n = 5−13) and Their Anions. J. Phys. Chem. A 114, 994–1007 (2010).
Arvanitidis, A. G., Tai, T. B., Nguyen, M. T. & Ceulemans, A. Quantum rules for planar boron nanoclusters. Physical Chemistry Chemical Physics 16, 18311–18318 (2014).
Doye, J. P. K. & Hendy, S. C. On the structure of small lead clusters. The European Physical Journal D - Atomic, Molecular, Optical and Plasma Physics 22, 99–107 (2003).
Doye, J. P. K. Lead clusters: Different potentials, different structures. Computational Materials Science 35, 227–231 (2006).
Götz, D. A., Shayeghi, A., Johnston, R. L., Schwerdtfeger, P. & Schäfer, R. Structural evolution and metallicity of lead clusters. Nanoscale 8, 11153–11160 (2016).
Nava, P., Sierka, M. & Ahlrichs, R. Density functional study of palladium clusters. Physical Chemistry Chemical Physics 5, 3372–3381 (2003).
Dieterich, J. M., Gerke, S. & Mata, R. A. A First-Principles-Based Potential for the Description of Alkaline Earth Metals. Journal of Atomic, Molecular, and Optical Physics 2012, 648386 (2012).
Doye, J. P. K. Identifying structural patterns in disordered metal clusters. Phys. Rev. B 68, 195418 (2003).
Kohaut, S. & Springborg, M. Growth patterns and structural motifs of cadmium clusters with up to 60 atoms: disordered or not? Physical Chemistry Chemical Physics 18, 28524–28537 (2016).
Johansson, M. P. & Pyykkö, P. The importance of being tetrahedral: the cadmium pyramids CdN; N = 4, 10, 20, 35 and 56. Physical Chemistry Chemical Physics 6, 2907–2909 (2004).
Zhan, L., Chen, J. Z. Y., Liu, W.-K. & Lai, S. K. Asynchronous multicanonical basin hopping method and its application to cobalt nanoclusters. The Journal of Chemical Physics 122, 244707 (2005).
Jin, Y. et al. Geometries, stabilities and fragmental channels of neutral and charged sulfur clusters: SnQ (n = 3–20, Q = 0, ±1). Physical Chemistry Chemical Physics 17, 13590–13597 (2015).
Aguado, A. Discovery of Magnetic Superatoms and Assessment of van der Waals Dispersion Effects in Csn Clusters. J. Phys. Chem. C 116, 6841–6851 (2012).
Calaminici, P., Pérez-Romero, M., Vásquez-Pérez, J. M. & Köster, A. M. On the ground state structure of neutral Cun (n = 12,14,16,18,20) clusters. Comput. Theor. Chem. 1021, 41–48 (2013).
Alparone, A. Density functional theory Raman spectra of cyclic selenium clusters Sen (n = 5–12). Comput. Theor. Chem. 988, 81–85 (2012).
Elliott, J. A., Shibuta, Y. & Wales, D. J. Global minima of transition metal clusters described by Finnis–Sinclair potentials: A comparison with semi-empirical molecular orbital theory. Philosophical Magazine 89, 3311–3332 (2009).
Zhou, R. L. & Pan, B. C. Structural features of silicon clusters Sin (n = 40–57,60). Phys. Lett. A 368, 396–401 (2007).
Yoo, S. & Zeng, X. C. Structures and relative stability of medium-sized silicon clusters. IV. Motif-based low-lying clusters Si21–Si30. J. Chem. Phys. 124, 054304 (2006).
Goedecker, S., Hellmann, W. & Lenosky, T. Global Minimum Determination of the Born-Oppenheimer Surface within Density Functional Theory. Phys. Rev. Lett. 95, 055501 (2005).
Yoo, S., Shao, N., Koehler, C., Fraunhaum, T. & Zeng, X. C. Structures and relative stability of medium-sized silicon clusters. V. Low-lying endohedral fullerenelike clusters Si31–Si40 and Si45. J. Chem. Phys. 124, 164311 (2006).
Yoo, S. & Zeng, X. C. Motif Transition in Growth Patterns of Small to Medium-Sized Silicon Clusters. Angew. Chem. Int. Ed. 44, 1491–1494 (2005).
Wang, J., Zhou, X., Wang, G. & Zhao, J. Optimally stuffed fullerene structures of silicon nanoclusters. Phys. Rev. B 71, 113412 (2005).
Yoo, S., Zhao, J., Wang, J. & Zeng, X. C. Endohedral Silicon Fullerenes SiN (27 ≤ N ≤ 39). Journal of the American Chemical Society 126, 13845–13849 (2004).
Bazterra, V. E. et al. Modified genetic algorithms to model cluster structures in medium-size silicon clusters. Phys. Rev. A 69, 053202 (2004).
Yoo, S., Zeng, X. C., Zhu, X. & Bai, J. Possible Lowest-Energy Geometry of Silicon Clusters Si21 and Si25. Journal of the American Chemical Society 125, 13318–13319 (2003).
Núñez, S., López, J. M. & Aguado, A. Neutral and charged gallium clusters: structures, physical properties and implications for the melting features. Nanoscale 4, 6481–6492 (2012).
Drebov, N., Weigend, F. & Ahlrichs, R. Structures and properties of neutral gallium clusters: A theoretical investigation. J. Chem. Phys. 135, 044314 (2011).
Yoo, S. & Zeng, X. C. Search for global-minimum geometries of medium-sized germanium clusters. II. Motif-based low-lying clusters Ge21–Ge29. The Journal of Chemical Physics 124, 184309 (2006).
Aguado, A. Structures, relative stabilities, and electronic properties of potassium clusters Kn (13 ≤ n ≤ 80). Comput. Theor. Chem. 1021, 135–143 (2013).
Hu, H.-S. et al. Theoretical studies of the global minima and polarizabilities of small lithium clusters. Chem. Phys. Lett. 644, 235–242 (2016).
Belyaev, S. N., Panteleev, S. V., Ignatov, S. K. & Razuvaev, A. G. Structural, electronic, thermodynamic and spectral properties of Mgn (n = 2–31) clusters. A DFT study. Comput. Theor. Chem. 1079, 34–46 (2016).
Aguado, A., Vega, A., Lebon, A. & von Issendorff, B. Are zinc clusters really amorphous? A detailed protocol for locating global minimum structures of clusters. Nanoscale 10, 19162–19181 (2018).
Noya, E. G., Doye, J. P. K., Wales, D. J. & Aguado, A. Geometric magic numbers of sodium clusters: Interpretation of the melting behaviour. The European Physical Journal D 43, 57–60 (2007).
Oganov, A. R., Lyakhov, A. O. & Valle, M. How Evolutionary Crystal Structure Prediction Works—and Why. Acc. Chem. Res. 44, 227–237 (2011).
Trimarchi, G., Freeman, A. J. & Zunger, A. Predicting stable stoichiometries of compounds via evolutionary global space-group optimization. Physical Review B 80, 092101 (2009).
Li, X.-T., Yang, X.-B. & Zhao, Y.-J. Geometrical eigen-subspace framework based molecular conformation representation for efficient structure recognition and comparison. J. Chem. Phys. 146, 154108 (2017).
Kresse, G. & Furthmuller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169 (1996).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized Gradient Approximation Made Simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953–17979 (1994).
Gillan, M. Calculation of the vacancy formation energy in aluminium. Journal of Physics: Condensed Matter 1, 689 (1989).
Štich, I., Car, R., Parrinello, M. & Baroni, S. Conjugate gradient minimization of the energy functional: A new method for electronic structure calculation. Physical Review B 39, 4997 (1989).
Pulay, P. Convergence acceleration of iterative sequences. The case of scf iteration. Chem. Phys. Lett. 73, 393–398 (1980).
Aarons, J., Sarwar, M., Thompsett, D. & Skylaris, C.-K. Perspective: Methods for large-scale density functional calculations on metallic systems. The Journal of chemical physics 145, 220901 (2016).
Draxl, C. & Scheffler, M. NOMAD: The FAIR concept for big data-driven materials science. MRS Bull. 43, 676–682 (2018).
Manna, S. et al. Quantum Cluster Database Version 2022-11-20. NOMAD Repository https://doi.org/10.17172/NOMAD/2023.02.01-1 (2022).
Acknowledgements
This work is supported by a Multidisciplinary University Research Initiatives (MURI) grant from the Office of Naval Research, N00014-15-1-2681. Calculations were performed using resources provided by the Department of Defense HPC Modernization Program through grant STVONRDC40463482, the Maryland Advanced Research Computing Cluster (MARCC), and the Stampede2 supercomputer at the Texas Advanced Computer Center (TACC) through NSF award DMR-140068. We would also like to acknowledge Caitlin McCandler (Lawrence Berkeley National Laboratory), Chuhong Wang, Chenyang Li, Sam Norwood, Kit Bowen (Johns Hopkins University), and Joseph Hooper (Naval Postgraduate School) for helpful discussions.
Author information
Authors and Affiliations
Contributions
Y.W., P.L. and S.M. developed the genetic algorithm code with input from T.M. S.M., Y.W., A.H. and P.L. extracted clusters from the literature. S.M., P.L. and T.M. developed the data driven method using the structural correlation to accelerate the search. Y.W., S.M. and S.L. performed structure searches using genetic algorithms. S.M., Y.W. and A.H. performed D.F.T. calculations using consistent settings. S.M., Y.W. and A.H. collected clusters from different sources and organized them in the database. A.H. and Y.W. wrote the Q.C.D. management code for organizing clusters, updating database, and computing cluster properties. Y.W. and A.H. improved and installed the website interface in collaboration with an outside team. S.M., Y.W., A.H. and T.M. processed data, generated graphs, and analyzed the graphs. S.M., Y.W., A.H. and T.M. wrote the manuscript. S.M. and Y.W. contributed equally. T.M. conceived of and managed the project. All authors contributed to the discussion.
Corresponding author
Ethics declarations
Compteting interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Manna, S., Wang, Y., Hernandez, A. et al. A database of low-energy atomically precise nanoclusters. Sci Data 10, 308 (2023). https://doi.org/10.1038/s41597-023-02200-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02200-4
This article is cited by
-
A Continuous Action Space Tree search for INverse desiGn (CASTING) framework for materials discovery
npj Computational Materials (2023)
-
Multi-reward reinforcement learning based development of inter-atomic potential models for silica
npj Computational Materials (2023)