Computational scanning tunneling microscope image database

Choudhary, Kamal; Garrity, Kevin F.; Camp, Charles; Kalinin, Sergei V.; Vasudevan, Rama; Ziatdinov, Maxim; Tavazza, Francesca

doi:10.1038/s41597-021-00824-y

Download PDF

Data Descriptor
Open access
Published: 11 February 2021

Computational scanning tunneling microscope image database

Kamal Choudhary ORCID: orcid.org/0000-0001-9737-8074¹,
Kevin F. Garrity¹,
Charles Camp¹,
Sergei V. Kalinin²,
Rama Vasudevan²,
Maxim Ziatdinov ORCID: orcid.org/0000-0003-2570-4592² &
…
Francesca Tavazza¹

Scientific Data volume 8, Article number: 57 (2021) Cite this article

10k Accesses
18 Citations
3 Altmetric
Metrics details

Subjects

Abstract

We introduce the systematic database of scanning tunneling microscope (STM) images obtained using density functional theory (DFT) for two-dimensional (2D) materials, calculated using the Tersoff-Hamann method. It currently contains data for 716 exfoliable 2D materials. Examples of the five possible Bravais lattice types for 2D materials and their Fourier-transforms are discussed. All the computational STM images generated in this work are made available on the JARVIS-STM website (https://jarvis.nist.gov/jarvisstm). We find excellent qualitative agreement between the computational and experimental STM images for selected materials. As a first example application of this database, we train a convolution neural network model to identify the Bravais lattice from the STM images. We believe the model can aid high-throughput experimental data analysis. These computational STM images can directly aid the identification of phases, analyzing defects and lattice-distortions in experimental STM images, as well as be incorporated in the autonomous experiment workflows.

Measurement(s)	material entity • 2-dimensional material
Technology Type(s)	Scanning Tunneling Microscopy • density functional theory
Factor Type(s)	exfoliable 2D material

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.13573820

Synthesis of goldene comprising single-atom layer gold

Article Open access 16 April 2024

Scaling deep learning for materials discovery

Article Open access 29 November 2023

Visualizing the structural evolution of individual active sites in MoS2 during electrocatalytic hydrogen evolution reaction

Article 15 April 2024

Background & Summary

Since the invention of the scanning tunneling microscope (STM)¹, this technique has become an essential tool for characterizing material surfaces and adsorbates. In addition to providing atomic insights, STM has been proven useful for characterizing the electronic structure, shapes of molecular orbitals, and vibrational and magnetic excitations^2,3. It can also be used for manipulating adsorbates and adatoms, and for catalysis and quantum information processing applications^3,4,5,6,7,8. Quantum mechanics-based density functional theory (DFT) has often been used to produce virtual STM images for these applications^9,10. However, a systematic database of such computational STM data is still lacking. As DFT-STM images are constructed using defect-free materials, they provide standard reference images (SRI) that are useful to aid in identifying phases^11,12, analyzing defects^13,14 and quantifying lattice-distortions¹⁵ in experimental STM images. A DFT-STM database is therefore essential to provide a direct link between atomic positions and images, which can aid experimental analysis. Moreover, the orbital-projected electronic density of states available in our database can help explain which atoms and orbitals contribute to an experimental STM image. Finally, a computational database can provide an accurate training set for developing machine learning (ML) models to rapidly analyze experimental STM images.

STM imaging is particularly well-suited to studying two-dimensional (2D) materials, such as graphene¹⁶, MoS₂¹⁷, NbSe₂¹⁸, WSe₂¹⁹, WTe₂²⁰, FeSe²¹, black-phosphrous^22,23 and SnSe²⁴. 2D materials^25,26 has opened diverse areas of application, such as sub-micron level electronics²⁷, flexible and tunable electronics²⁸, superconductivity²⁹, photo-voltaics³⁰, water-purification³¹, sensors³², thermal-management³³, energy-storage³⁴, medicine³⁵, quantum dots^36,37 and composites^38,39,40. The surfaces of 2D materials are unique because they lack dangling bonds, allowing them to be exfoliated. This property makes them ideal candidates for building a database of computational STMs images because they don’t require thick slabs perpendicular to the surface, which are computationally expensive to simulate accurately, and they do not have surface reconstructions. The generation of STM images for perfect systems is an initial step, and we will extend this project to include defective systems and the effect of thermal noise in the future.

In this work, we use DFT to generate STM images of exfoliable 2D materials. We use the recently developed JARVIS-DFT database (https://jarvis.nist.gov/jarvisdft) and select 2D materials with exfoliation energy less than 200 meV/atom. The JARVIS-DFT database contains about 40000 bulk and 1000 two-dimensional materials with their DFT-computed structural, energetic²⁶, elastic⁴¹, optoelectronic⁴², thermoelectric⁴³, piezoelectric, dielectric, infrared⁴⁴, solar-efficiency⁴⁵, and topological^46,47 properties. We note that there are several factors that can influence the appearance of experimental or DFT-based STM image predictions, such as the STM-tip material, bias voltage, and the scanning mode, i.e. constant-height mode (CHM) vs. constant current mode (CCM). Similarly, there are several methods for simulating STM images using DFT, including Bardeen⁴⁸, Tersoff-Hamman⁴⁹ and Chen³ methods. Here, we present results for constant height and constant current DFT-STM images computed using the Tersoff-Hamann approach⁴⁹, which assumes a non-functionalized (s-wave) STM tip. Hence, in the simulation we don’t explicitly model the tip and its interactions. The ML model training is based on CHM images. The DFT-STM database currently contains images for 716 exfoliable 2D materials, with additional computations ongoing. All the DFT-STM data will be uploaded into the JARVIS-DFT database.

As a first example application of this database and use artificial intelligence methods^50,51,52,53, we use the computational STM images to train a convolution neural network ML classification model for Bravais-lattices. This model is able to quickly classify STM images into the five lattice classes (square, hexagon, rhombus/centered-rectangle, rectangle and parallelogram/oblique) that are possible for 2D systems. Such classifications are of importance, for example when dealing with phase transitions⁵⁴. They can also be used as an aid to automatic conventional crystallographic image processing of big datasets and to obtain information from noisy images. This work acts as a starting point for identifying the defects in experimental images by providing a collection of ideal STM images for comparison purposes⁵⁵. Ideally one would use an information-theoretic approach, as opposed to deep learning, to enable space group determination with uncertainty quantification, as demonstrated by Moeck, to distinguish the specific subgroups of selected group⁵⁶. However, a pre-screening step can be rapidly accomplished with a suitably trained neural network as shown here, which should then be verified using the approach outlined in ref. ⁵⁴. Later, these computational STM trained models can be integrated with experiments for active learning processes^50,57.

Methods

All DFT calculations are carried out with Vienna ab initio simulation package (VASP)^58,59 using projected augmented wave (PAW) formalism and using vdW-DF-OptB88 functional⁶⁰. Note that for monolayers, vdW functionals are not strictly necessary. But we include vdW interactions to be consistent with our JARVIS-DFT 3D dataset. Also, we plan to develop multi-layer materials databases, which do require vdW interactions. The vdW functional works for both strongly and weakly bonded systems⁴¹. All the machine learning trainings are carried using Keras with TensorFlow backend⁶¹. Note that commercial software is identified to specify procedures, and such identification does not imply recommendation by the National Institute of Standards and Technology. The k-point and plane-wave cut-off convergence for each material are obtained using the workflow detailed in ref. ⁴⁵. The high-throughput computation and analysis tools will be made available at JARVIS-Tools github page: https://github.com/usnistgov/jarvis. The 2D materials are provided with at least 20 Å vacuum in the z-direction to avoid self-interactions. The force and energy convergence for DFT self-consistent calculations are 10⁻⁶ eV and 0.001 eV/Å respectively.

The STM images are calculated using the Tersoff-Hamann approach, which is a simple model of an s-wave STM tip⁴⁹:

$$n\left(r,\,E\right)=\sum _{\mu }{\left|{\psi }_{\mu }\left(r\right)\right|}^{2}\delta \left({\varepsilon }_{\mu }-E\right)$$

(1)

$$I\left(r,\,V\right)\,\propto \,\underset{{E}_{F}}{\overset{{E}_{F}+eV}{\int }}dE\,n(r,\,E)$$

(2)

In this approach, the tunneling current I, which depends on the tip position r and the applied voltage V, is proportional to the integrated local density of states (ILDOS). The ILDOS is calculated from the Kohn-sham eigenvectors, ${\psi }_{\mu }$, and eigenvalues, ${\varepsilon }_{\mu }$, where μ labels different states. E_F is the Fermi-energy. Different experiments will choose different applied voltages, but we concentrate on two values, + 0.5 eV for positive bias and −0.5 eV for negative bias, which require integrating from E_F to ${E}_{F}\pm 0.5\,eV$. We choose 0.5 eV range for simplicity sake, and other values usually produce qualitatively similar images for metals or small gap semiconductors. However, simulations for other voltages should also be possible with the method and tools discussed in this work.

This method is readily available in DFT software such as VASP⁶². Please note that plane-wave codes like VASP will not accurately describe the exponential decay of the wave functions far away from the atoms, and wave functions may need to be extrapolated in order for STM simulations at large heights such as 7 Å else it can show unphysical effects⁶³. Hence, we choose image height relatively close to surface. All the STM images are made at least 20 Å long in the xy plane by repeating the primitive unit cell. We choose height 2 Å above the surface (maximum of z-coordinate) during the simulations. For constant-current images, we identify iso-surfaces that have a constant ILDOS. The height of these iso-surfaces at each xy-coordinates produces the images.

For the machine learning model, we simplify the constant-height STM images using a black/white color-scheme and choose a pixel value of 170 (out of maximum 255) for finding atomic features. We simplify the images because the image produced from the wavefunction is still on a continuous scale (i.e. grey image), while for the Bravais lattice classification only requires information on whether an atom is there or not. Based on the lattice-parameters and angles the 2D materials can be classified in five classes: 1) hexagonal, 2) square, 3) rhombus/centered-rectangle, 4) rectangle, 5) parallelograms/oblique. Deep-learning image recognition tasks typically require thousands of training images. To increase the size of our training set, we use several commonly applied image augmentations: random rotations, flipping, zooming in and zooming out. We apply augmentations until all the five classes have at least 10000 images leading 53508 images. Image processing ML models are usually non invariant to the operations mentioned above, which is why the initial dataset is augmented with such operations. We use a multi-layer network with four convolution layers (with 16, 32, 48 and 64 feature-maps and with kernel-size of 3), four max-pooling layers (with pool-size of (2, 2)) activated by a rectified linear unit (ReLU), one fully-connected 600-nodes layer with ReLU activations, and a fully-connected softmax layer with five outputs. Since the entire dataset is too big to feed to the GPU memory at once, we divide it into multiple smaller batches. The total number of training examples present in a single batch (batch size) is 32 for our NN model. We have 20% dropout before the softmax layer to avoid overfitting. We use ADAM stochastic optimization method for gradient descent with ‘sparse categorical crossentropy’ as loss function. We split the dataset into training, validation, and test sets. We use a 90%-10% train-test split for the entire dataset in such a way that both training and testing data have a proportionate amount of all the five classes. Furthermore, we apply a 90%-10% split on the training data for model-training, validation and generating the learning curve. We apply ‘Early-stopping’ to avoid over-fitting of the model. After the model development, we apply this model on the 10% test-data to evaluate the accuracy of the model. Note that the 10% test-dataset was never used during model development.

During the training, we monitor the train-validation curve (discussed later) to avoid overfitting. We use accuracy, precision, recall, and F1-score to measure the overall and individual class performances. The precision is the ratio $\frac{{\rm{TP}}}{{\rm{TP}}+{\rm{FP}}}$ where TP is the number of true positives and FP the number of false positives. The recall is the ratio $\frac{{\rm{TP}}}{{\rm{TP}}+{\rm{FN}}}$ where TP is the number of true positives and FN the number of false negatives. The recall is intuitively the ability of the classifier to find all the positive samples. The F1-score can be interpreted as a weighted harmonic mean of the precision and recall, where an F1-score reaches its best value at 1 and worst score at 0. The overall classification accuracy of the model is given as $\frac{{\rm{TP}}+{\rm{TN}}}{{\rm{TP}}+{\rm{TN}}+{\rm{FP}}+{\rm{FN}}}$, where TN represents the number of true-negatives. We also use the confusion matrix to show the percentage correct and incorrect predictions of each class. Both the model and the associated dataset will be made publicly available soon at the JARVIS-DFT website.

Data Records

After the calculations, the metadata is stored in the Javascript Object Notation Files (JSON) format which can be easily integrated with databases such as MongoDB. The dataset is made publicly available through the JARVIS-STM (https://jarvis.nist.gov/jarvisstm) web-app. The web-app provides both constant-height and constant-current simulation features and allows the user to change the chosen height or current value. We have made the dataset publicly available through Figshare repository⁶⁴ as well. The dataset consists of positive and negative bias constant-height images in Joint Photographic Experts Group (JPEG) format for the 2D materials under investigation. In addition to the images, we provide the raw input/output files for the calculations (including PARCHG files) at to enhance reproducibility of the work that could be used for generating both constant height and constant current images and for a given size of the xy-dimension.

Technical Validation

Validation of DFT simulated images

We simulate computational STM images of 716 exfoliable materials (E_f <200 meV/atom) using the Tersoff-Hamann approach. We compare computational STM images with those from experiments for graphene¹⁶, 2H-MoS₂¹⁷, 2H-NbSe₂¹⁸, 2H-WSe₂¹⁹, 1T’-WTe₂²⁰, FeSe²¹, black-P^22,23, SnSe²⁴, Bismuth^64,65. We chose these systems because we could find well-characterized experimental images in the literature. Qualitatively, we observe that the patterns in the computational and experimental STMs are very similar (see Fig. 1).

Experimental STM images for each system can be found in appropriate reference. Note that we are able to predict the STM for 2D vdW materials very well because they lack dangling bonds. Such images with non-vdW systems such as Si (111)⁶⁶ would require bigger simulation cells in the xy direction to accommodate reconstructions, as well as many additional layers to converge the calculations.

The DFT-STM can be used for distinguishing phases such as the 2D-monolayer 2H-MoTe₂ (JVASP-670) and 1T’-MoTe₂ (JVASP-673) phases, as shown in constant height positive bias conditions in Fig. 2. Such phase-identifications can be helpful in providing insight into phase-transformation mechanisms during experiments.

The 2H-phase is semiconducting material with hexagonal symmetry, as is evident from the crystal structure in Fig. 2a. The positive + 0.5 eV bias constant height image of this structure is shown in Fig. 2b. The electronic states in this range are dominated by Mo (d-orbital) states, hence the brighter spots in the STM are dominated by Mo d-orbitals, which can be understood by analyzing the projected density of states (Fig. 3). As shown in Fig. 2c, the fast Fourier transform (FT) of the simulated STM image in Fig. 2b shows hexagonal symmetry. Similarly, the crystal structure, STM image and FT of rectangular 1T’-MoTe₂ is shown in Fig. 2d–f respectively. We note that the FT of the STM image of a rectangular system with a multi-atom cell is not a simple rectangle. We show examples of variation of height in Å and current in arbitrary units in Fig. 2g,h and i for 2H-MoTe₂. The constant height for 2H-MoTe₂ in Fig. 2b is for 3 Å while that in Fig. 2g is for 5 Å with respect to the highest atom in the cell. Clearly, the hexagonal patterns remain the same, but the structure around the atoms slightly changes due to the change in height. This is because as we move the hypothetical STM tip, we probe different layers of charge density. Similarly, we show the current variation based STM images for 0.01 and 0.05 a.u.⁻³ eV⁻¹ in Fig. 2h,i. Note that it is difficult to quantitatively compare the computational and experimental STM images because the tunneling-current is critically dependent on the specific experimental setup.

Based on lattice parameter information in 2D plane, the 2D materials lattices can be classified in 5 types: hexagon, square, rectangle, rhombus/centered-rectangle, and parallelograms/oblique. We classify all the 2D materials in our database, with the distribution shown in Fig. 4a. Most of the 2D materials in our database are hexagonal, followed by rectangular and square lattices. In Fig. 4, we give examples of materials in each lattice type, in each case showing the atomic positions, a constant height STM image, and the fast Fourier transform (FT) of the STM image. An example of hexagonal lattice is shown in Fig. 4b graphene (JVASP-667). It is one of the most widely investigated 2D materials. The STM positive bias image for graphene is shown in Fig. 4c. An FT of the image Fig. 4c is shown in Fig. 4d. It is clear from Fig. 4d that there is a hexagonal pattern due to hexagonal symmetry in graphene. Similarly, for the square lattice example, FeTe (JVASP-6667), the crystal structure, STM, and FT are shown in Fig. 4e–g. Fe d-states mainly contributes to the STM image in Fig. 4f. The FT of this image shows square-like patterns in Fig. 4g. Similarly, Fig. 4h gives the crystal structure of VClO (JVASP-8933), and its STM and FT show a rectangular pattern (Fig. 4i,j). AuI (JVASP-6187) has a centered-rectangle structure, as shown in Fig. 4k. The lattice constants are 4.274 Å and the angle between them is 93.2 degrees. The Au d-orbitals contribute most to the STM image. The atomic and orbital projected density of systems for all the systems here is given in the supplementary information (Supplementary Fig. S1) and the respective webpages for each material. The FT in the Fig. 4m,p shows a noticeable blur, which can be caused by the truncation of the infinite slab to a finite image. Note that the mathematical FT of a perfectly periodic system would have ideal/sharp peaks. However, we purposefully truncate the images and include white spaces to mimic experimental images. Hence, they won’t be perfectly sharp. Figure 4n shows As₂Se₃ (JVASP-13544), an example material with an oblique unit cell with lattice constants of 4.4 and 12.9 Å and an angle of 109.9 degrees. The FT of the STM in Fig. 4p is difficult to interpret.

Machine learning model development

Having prepared our database, we now train a ML model (JARVIS-STMnet) following the flow-chart in Fig. 5.

In Fig. 6 we show the convolution neural network training and the learning curves for the deep learning model. We monitor the learning curve as in Fig. 6a. We see that after the 5^th epoch the training and validation accuracy curves begin to diverge, so we stop further training. We obtain 90.1% accuracy on the validation set and 90.0% accuracy on the 10% test-set, which was never used during the training process. The difference between the training and the validation curve is small, implying low overfitting. We apply the trained model on the 10% test-set data and the confusion matrix is shown in Fig. 6b. We also provide precision, recall and F1 scores in Table 1. The baseline accuracy of the model is 1/5 = 20%. Clearly, the overall accuracy is more than 4 times higher than the random-guessing baseline model. Also, all the scores in Table 1 are more than 0.85, indicating that the model performs much better than a random guessing model. Note that although the accuracy is a measure of the overall model, it is important to investigate the prediction accuracy for each class of the model. A confusion matrix with high diagonal element values signifies high accuracy. It is clear from the Fig. 6b that the model performs excellently for hexagonal, centered rectangle and square lattices, and less well for the rectangle and oblique lattice types. Moving beyond simulated STM images, as an initial validation, we apply the model to nine experimental images discussed above for an initial more realistic test step for graphene¹⁶, 2H-MoS₂¹⁷, 2H-NbSe₂¹⁸, 2H-WSe₂¹⁹, 1T’-WTe₂²⁰, FeSe²¹, black-P^22,23, SnSe²⁴, Bismuth^64,65. We find that the model predicts the correct class for seven of them. Performing a more systemic analysis of our model’s accuracy on experimental images would require a database of hundreds of experimental images, and such a database is currently not available. We hope this work will spur the development of such a database. Also, as we make the entire dataset publicly available, and we hope that other researchers could apply their machine-learning models on this dataset.

Table 1 Classification report of classifying 2D constant-height STM images into lattice-types.

Full size table

Usage Notes

We introduce the first systematic database of scanning tunneling microscope (STM) images obtained using density functional theory (DFT) for two-dimensional (2D) materials. Specifically, the database is constructed using the Tersoff-Hamann method for constant-height images. Although only defect free materials are considered in this work, STM image dataset with defects will be developed soon. We anticipate that this dataset and methods used will provide a useful tool in fundamental and application-related studies of materials. Experimental verification provides insight into understanding the applicability and limitation of our DFT data. Based on the list of data, the user will be able to choose particular materials for specific applications. Data mining, data analytics, and artificial-intelligence tools then can be added to guide screening of materials.

Code availability

Python-language-based codes with examples are given at the JARVIS-Tools page https://github.com/usnistgov/jarvis.

References

Binnig, G., Rohrer, H., Gerber, C. & Weibel, E. Surface studies by scanning tunneling microscopy. Phys. Rev. Lett. 49, 57 (1982).
Article ADS Google Scholar
Mugarza, A. et al. Spin coupling and relaxation inside molecule–metal contacts. Nat. Commun. 2, 490 (2011).
Article ADS PubMed CAS Google Scholar
Chen, C. J. Introduction to scanning tunneling microscopy. Vol. 4 (Oxford University Press on Demand, 1993).
Gross, L. et al. High-resolution molecular orbital imaging using a p-wave STM tip. Phys. Rev. Lett. 107, 086101 (2011).
Article ADS PubMed CAS Google Scholar
Eigler, D. M. & Schweizer, E. K. Positioning single atoms with a scanning tunnelling microscope. Nature 344, 524 (1990).
Article ADS CAS Google Scholar
Stipe, B., Rezaei, M. & Ho, W. Single-molecule vibrational spectroscopy and microscopy. Science 280, 1732–1735 (1998).
Article ADS CAS PubMed Google Scholar
Hirjibehedin, C. F. et al. Large magnetic anisotropy of a single atomic spin embedded in a surface molecular network. Science 317, 1199–1203 (2007).
Article ADS CAS PubMed Google Scholar
Yang, K. et al. Coherent spin manipulation of individual atoms on a surface. Science 366, 509–512 (2019).
Article ADS CAS PubMed Google Scholar
Barth, J., Brune, H., Ertl, G. & Behm, R. Scanning tunneling microscopy observations on the reconstructed Au (111) surface: Atomic structure, long-range superstructure, rotational domains, and surface defects. Phys. Rev. B 42, 9307 (1990).
Article ADS CAS Google Scholar
Magonov, S. N. & Whangbo, M.-H. Surface analysis with STM and AFM: experimental and theoretical aspects of image analysis. (John Wiley & Sons, 2008).
Poirier, G. et al. Identification of the facet planes of phase I TiO2 (001) rutile by scanning tunneling microscopy and low energy electron diffraction. J. Vac. Sci. Tech. B 10, 6–15 (1992).
Article CAS Google Scholar
Burk, B., Thomson, R., Zettl, A. & Clarke, J. Charge-density-wave domains in 1T-TaS 2 observed by satellite structure in scanning-tunneling-microscopy images. Phys. Rev. Lett. 66, 3040 (1991).
Article ADS CAS PubMed Google Scholar
Vancsó, P. et al. The intrinsic defect structure of exfoliated MoS2 single layers revealed by Scanning Tunneling Microscopy. Sci. Rep. 6, 29726 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Liu, H. et al. Line and point defects in MoSe2 bilayer studied by scanning tunneling microscopy and spectroscopy. ACS Nano 9, 6619–6625 (2015).
Article CAS PubMed Google Scholar
Dubout, Q. et al. Giant apparent lattice distortions in STM images of corrugated sp2-hybridised monolayers. New J. Phys. 18, 103027 (2016).
Article ADS CAS Google Scholar
Li, G., Luican, A. & Andrei, E. Y. Scanning tunneling spectroscopy of graphene on graphite. Phys. Rev. Lett. 102, 176804 (2009).
Article ADS PubMed CAS Google Scholar
Mills, A. et al. Ripples near edge terminals in MoS2 few layers and pyramid nanostructures. App. Phys. Lett. 108, 081601 (2016).
Article ADS CAS Google Scholar
Wang, J. et al. A variable-temperature scanning tunneling microscope operated in a continuous flow cryostat. Rev. Sci. Instr. 90, 093702 (2019).
Article ADS CAS Google Scholar
Liu, H. et al. Molecular-beam epitaxy of monolayer and bilayer WSe2: a scanning tunneling microscopy/spectroscopy study and deduction of exciton binding energy. 2D Maters. 2, 034004 (2015).
Article CAS Google Scholar
Jia, Z.-Y. et al. Direct visualization of a two-dimensional topological insulator in the single-layer 1 T′− WT e 2. Phys. Rev. B 96, 041108 (2017).
Article ADS Google Scholar
Song, C.-L. et al. Direct observation of nodes and twofold symmetry in FeSe superconductor. Science 332, 1410–1413 (2011).
Article ADS CAS PubMed Google Scholar
Kumar, A. et al. STM study of exfoliated few layer black phosphorus annealed in ultrahigh vacuum. 2D Maters. 6, 015005 (2018).
Article CAS Google Scholar
Kiraly, B., Hauptmann, N., Rudenko, A. N., Katsnelson, M. I. & Khajetoorians, A. A. Probing single vacancies in black phosphorus at the atomic level. Nano Lett. 17, 3607–3612 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Duvjir, G. et al. Origin of p-type characteristics in a SnSe single crystal. App. Phys. Lett. 110, 262106 (2017).
Article ADS CAS Google Scholar
Choudhary, K., Kalish, I., Beams, R. & Tavazza, F. High-throughput Identification and Characterization of Two-dimensional Materials using Density functional theory. Sci. Rep. 7, 5179 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Choudhary, K. et al. The joint automated repository for various integrated simulations (JARVIS) for data-driven materials design. npj Comput Mater 6, 173 (2020).
Article ADS Google Scholar
Fiori, G. et al. Electronics based on two-dimensional materials. Nat. Nanotechnol. 9, 768–779 (2014).
Article ADS CAS PubMed Google Scholar
Akinwande, D., Petrone, N. & Hone, J. Two-dimensional flexible nanoelectronics. Nat. Comm. 5 (2014).
Navarro-Moratalla, E. & Jarillo-Herrero, P. Two-dimensional superconductivity: The Ising on the monolayer. Nat. Phys. 12, 112–113 (2016).
Article CAS Google Scholar
Bubnova, O. 2D materials: Hybrid interfaces. Nat. Nanotechnol. 16, 497 (2016).
Google Scholar
Dervin, S., Dionysiou, D. D. & Pillai, S. C. 2D nanostructures for water purification: graphene and beyond. Nanoscale (2016).
Cui, S. et al. Ultrahigh sensitivity and layer-dependent sensing performance of phosphorene-based gas sensors. Nat. Commun. 6, 8632 (2015).
Article ADS CAS PubMed Google Scholar
Lee, M.-J. et al. Thermoelectric materials by using two-dimensional materials with negative correlation between electrical and thermal conductivity. Nat. Commun. 7, 12011 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, X., Hou, L., Ciesielski, A. & Samorì, P. 2D Materials Beyond Graphene for High‐Performance Energy Storage Applications. Adv. Energy Mater. 6, 1600671 (2016).
Article CAS Google Scholar
Boland, C. S. et al. Sensitive, high-strain, high-rate bodily motion sensors based on graphene–rubber composites. ACS nano 8, 8819–8830 (2014).
Article CAS PubMed Google Scholar
Wang, X., Sun, G., Li, N. & Chen, P. Quantum dots derived from two-dimensional materials and their applications for catalysis and energy. Chem. Soc. Rev. 45, 2239–2262 (2016).
Article CAS PubMed Google Scholar
Chakraborty, C., Kinnischtzke, L., Goodfellow, K. M., Beams, R. & Vamivakas, A. N. Voltage-controlled quantum light from an atomically thin semiconductor. Nat. nanotechnol. 10, 507–511 (2015).
Article ADS CAS PubMed Google Scholar
Castellanos-Gomez, A. Why all the fuss about 2D semiconductors? Nat. Photon. 10, 202 (2016).
Article CAS Google Scholar
Flat talk. Nat. Photon. 10, 205 (2016).
Article CAS Google Scholar
Rodenas, T. et al. Metal–organic framework nanosheets in polymer composite materials for gas separation. Nat. Mater. 14, 48–55 (2015).
Article ADS CAS PubMed Google Scholar
Choudhary, K., Cheon, G., Reed, E. & Tavazza, F. Elastic properties of bulk and low-dimensional materials using van der Waals density functional. Phys. Rev. B 98, 014107 (2018).
Article ADS CAS Google Scholar
Choudhary, K. et al. Computational screening of high-performance optoelectronic materials using OptB88vdW and TB-mBJ formalisms. Sci. Data 5, 180082 (2018).
Article CAS PubMed PubMed Central Google Scholar
Choudhary, K., Garrity, K. & Tavazza, Data-driven Discovery of 3D and 2D Thermoelectric Materials. J. Phys.: Condens. Matter 32, 475501 (2020).
ADS CAS Google Scholar
Choudhary, K. et al. High-throughput Density Functional Perturbation Theory and Machine Learning Predictions of Infrared, Piezoelectric and Dielectric Responses. npj Comput. Mat. 6, 64 (2020).
Article Google Scholar
Choudhary, K. et al. Accelerated Discovery of Efficient Solar Cell Materials Using Quantum and Machine-Learning Methods. Chem. Mater. 31(15), 5900 (2019).
Article CAS Google Scholar
Choudhary, K., Garrity, K. F., Jiang, J., Pachter, R. & Tavazza, Computational Search for Magnetic and Non-magnetic 2D Topological Materials using Unified Spin-orbit Spillage, Screening. npj Comput. Mat. 6, 49 (2020).
Article Google Scholar
Choudhary, K., Garrity, K. F. & Tavazza, F. High-throughput Discovery of topologically Non-trivial Materials using spin-orbit spillage. Sci. Rep. 9, 1–8 (2019).
Article CAS Google Scholar
Bardeen, Tunnelling from a many-particle point of view. Phys. Rev. Lett. 6, 57 (1961).
Article ADS CAS Google Scholar
Tersoff, J. & Hamann, D. R. Theory of the scanning tunneling microscope. Phys. Rev. B 31, 805 (1985).
Article ADS CAS Google Scholar
Vasudevan, R. K. et al. Materials science in the artificial intelligence age: high-throughput library generation, machine learning, and a pathway from correlations to the underpinning physics. MRS Comm. 9(3), 821 (2019).
Article CAS Google Scholar
Rickman, J. M., Lookman, T. & Kalinin, S. V. Materials informatics: From the atomic-level to the continuum. Acta Mater. 168, 473 (2019).
Article ADS CAS Google Scholar
Hill, J., Mannodi-Kanakkithodi, A., Ramprasad, R. & Meredig, B. Computational Materials System Design 193–225 (Springer, 2018).
Choudhary, K., DeCost, B. & Tavazza, F. Machine learning with force-field-inspired descriptors for materials: Fast screening and mapping energy landscape. Phy. Rev. Mat. 2, 083801 (2018).
CAS Google Scholar
Vasudevan, R. K. et al. Mapping mesoscopic phase evolution during E-beam induced transformations via deep learning of atomically resolved images. npj Comput. Mat. 4, 30 (2018).
Article CAS Google Scholar
Moeck, Peter. On classification approaches for crystallographic symmetries of noisy 2D periodic patterns. IEEE Transactions on Nanotechnology 18, 1166–1173 (2019).
Article ADS CAS Google Scholar
Ziatdinov, M. et al. Deep learning of atomically resolved scanning transmission electron microscopy images: chemical identification and tracking local transformations. ACS Nano 11, 12742 (2017).
Article CAS PubMed Google Scholar
Sk, R., Deshpande, A. & Engineering. Unveiling the emergence of functional materials with STM: metal phthalocyanine on surface architectures. Mol. Syst. Design & Engineerin 4, 471 (2019).
Article CAS Google Scholar
Kresse, G. & Furthmüller, Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169 (1996).
Article ADS CAS Google Scholar
Kresse, G. & Furthmüller, Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mat. Sci. 6, 15–50 (1996).
Article CAS Google Scholar
Klimeš, J., Bowler, D. R. & Michaelides, A. Chemical accuracy for the van der Waals density functional. J. Phys. Cond. Mat. 22, 022201 (2009).
Article ADS CAS Google Scholar
Abadi, M. et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. arXiv: 1603, 04467 (2016).
Google Scholar
Lounis, S. Theory of scanning tunneling microscopy. arXiv 1404, 0961 (2014).
Google Scholar
Tersoff, J. Method for the calculation of scanning tunneling microscope images and spectra. Phys. Rev. B 40, 11990 (1989).
Article ADS CAS Google Scholar
Choudhary, K. et al. Computational Scanning Tunneling Microscope Image Database. figshare https://doi.org/10.6084/m9.figshare.c.3883270 (2020).
Song, F. et al. Low-temperature growth of bismuth thin films with (111) facet on highly oriented pyrolytic graphite. ACS Appl. Mater. Interfaces 7, 8525 (2015).
Article CAS PubMed Google Scholar
Smeu, M., Guo, H., Ji, W. & Wolkow, R. A. Electronic properties of Si (111)-7×7 and related reconstructions: Density functional theory calculations. Phys. Rev. B 85, 195315 (2012).
Article ADS CAS Google Scholar

Download references

Acknowledgements

K.C., K.F.G., C.C. and F.T. thank National Institute of Standards and Technology for funding, computational and data-management resources. S.V.K acknowledges support by the U.S. Department of Energy, Office of Science, Basic Energy Sciences, Materials Sciences and Engineering Division. This research was in part supported by and conducted at the Center for Nanophase Materials Sciences (R.V.K., M.Z.), which is a DOE Office of Science User Facility. We also thank the computational support from XSEDE computational resources.

Author information

Authors and Affiliations

Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, 20899, USA
Kamal Choudhary, Kevin F. Garrity, Charles Camp & Francesca Tavazza
Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
Sergei V. Kalinin, Rama Vasudevan & Maxim Ziatdinov

Authors

Kamal Choudhary
View author publications
You can also search for this author in PubMed Google Scholar
Kevin F. Garrity
View author publications
You can also search for this author in PubMed Google Scholar
Charles Camp
View author publications
You can also search for this author in PubMed Google Scholar
Sergei V. Kalinin
View author publications
You can also search for this author in PubMed Google Scholar
Rama Vasudevan
View author publications
You can also search for this author in PubMed Google Scholar
Maxim Ziatdinov
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Tavazza
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.C. developed the workflow, carried out the DFT calculations and trained the machine learning model. K.C. and K.G. analyzed the DFT data. C.C., S.K., R.V., M.Z. helped in the machine learning training. R.V., S.K. and M.Z. helped in the experimental validation. All contributed in writing the manuscript.

Corresponding author

Correspondence to Kamal Choudhary.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and permissions

About this article

Cite this article

Choudhary, K., Garrity, K.F., Camp, C. et al. Computational scanning tunneling microscope image database. Sci Data 8, 57 (2021). https://doi.org/10.1038/s41597-021-00824-y

Download citation

Received: 23 June 2020
Accepted: 06 January 2021
Published: 11 February 2021
DOI: https://doi.org/10.1038/s41597-021-00824-y

This article is cited by

Deep learning based atomic defect detection framework for two-dimensional materials
- Fu-Xiang Rikudo Chen
- Chia-Yu Lin
- Chun-Liang Lin
Scientific Data (2023)
Machine learning the microscopic form of nematic order in twisted double-bilayer graphene
- João Augusto Sobral
- Stefan Obernauer
- Mathias S. Scheurer
Nature Communications (2023)
Recent advances and applications of deep learning methods in materials science
- Kamal Choudhary
- Brian DeCost
- Chris Wolverton
npj Computational Materials (2022)
Large scale dataset of real space electronic charge density of cubic inorganic materials from density functional theory (DFT) calculations
- Fancy Qian Wang
- Kamal Choudhary
- Ming Hu
Scientific Data (2022)
Ordering a rhenium catalyst on Ag(001) through molecule-surface step interaction
- Ole Bunjes
- Lucas A. Paul
- Martin Wenderoth
Communications Chemistry (2022)