Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# A library of ab initio Raman spectra for automated identification of 2D materials

## Abstract

Raman spectroscopy is frequently used to identify composition, structure and layer thickness of 2D materials. Here, we describe an efficient first-principles workflow for calculating resonant first-order Raman spectra of solids within third-order perturbation theory employing a localized atomic orbital basis set. The method is used to obtain the Raman spectra of 733 different monolayers selected from the Computational 2D Materials Database (C2DB). We benchmark the computational scheme against available experimental data for 15 known monolayers. Furthermore, we propose an automatic procedure for identifying a material based on an input experimental Raman spectrum and apply it to the cases of MoS2 (H-phase) and WTe2 (T$${}^{\prime}$$-phase). The Raman spectra of all materials at different excitation frequencies and polarization configurations are freely available from the C2DB. Our comprehensive and easily accessible library of ab initio Raman spectra should be valuable for both theoreticians and experimentalists in the field of 2D materials.

## Introduction

Following the discovery of graphene in 20041, the field of two-dimensional (2D) materials has grown tremendously during the last decade. Today, more than 50 different monolayer compounds including metals2,3, semiconductors4,5,6, insulators7, ferromagnets8, and superconductors9,10, have been chemically grown or mechanically exfoliated from layered bulk crystals11. The enormous interest in 2D materials has mainly been driven by their unique and easily tunable properties (as compared to 3D bulk crystals), which make them attractive for both fundamental research and technological applications in areas such as energy conversion/storage, (opto)-electronics, and photonics6,12,13. Among the various experimental techniques used for characterizing 2D materials, Raman spectroscopy plays a pivotal role14 thanks to its simplicity, non-destructive nature, and high sensitivity towards key materials properties such as chemical composition, layer thickness (number of layers), inter-layer coupling, strain, crystal symmetries and sample quality15,16,17.

Raman spectroscopy is a versatile technique for probing the vibrational modes of molecules and crystals from inelastically scattered light, and is widely used for identifying materials through their unique vibrational fingerprints18. There are various types of Raman spectroscopies that differ in the number of photons or phonons involved in the scattering process18. Here we focus on the first-order Raman processes in which only a single phonon is involved. Typically, this is the dominant scattering process in defect-free samples (which are considered here). Note that Raman processes involving defect states or several phonons may also play important roles in some 2D crystals such as graphene19. As shown schematically in Fig. 1(a), the light scattered from a crystal appears in three distinct frequency bands: A strong resonance at the incident frequency ωin due to Rayleigh (elastic) scattering, and weaker resonances due to Raman (inelastic) scattering at ωin − ων and ωin + ων forming Stokes and anti-Stokes bands, respectively. Here, ων is the frequency of a (Raman active) vibrational mode of the crystal, i.e. a phonon. Depending on the symmetry of the phonon modes and polarization of the electromagnetic fields, a phonon mode may be active or inactive in the Raman spectrum.

While semi-classical theories of Raman spectroscopy can provide some qualitative insight18, a full quantum mechanical treatment is necessary for a quantitatively accurate description. In particular, ab initio techniques have been employed successfully to calculate Raman spectra of both molecules18,20 and solids21,22 typically showing good agreement with experimental spectra. The parameter-free nature of such computational schemes endow them with a high degree of predictive power, although their computational cost can be significant, thus, in practice limiting them to relatively simple, i.e. crystalline, materials. In the realm of 2D materials, ab initio Raman studies have been limited to a handful of the most popular 2D crystals including graphene19, hBN23, WTe224, SnS, and SnSe25, as well as MoS2 and WS226. In view of the significant experimental efforts currently being devoted to the synthesis and application of future 2D materials and the important role of Raman spectroscopy as a main characterization tool, it is clear that the compilation of a comprehensive library of Raman spectra of 2D materials across different crystal structures and chemical compositions is a critical and timely endeavor.

Recently, we have introduced the open Computational 2D Materials Database (C2DB)11, which contains various calculated properties for several thousands 2D crystals using state of the art ab initio methods. The properties currently provided in the C2DB include the relaxed crystal structures, thermodynamic phase diagrams (convex hull), electronic band structures and related quantities (effective masses, deformation potentials, etc.), elastic properties (stiffness tensors, phonon frequencies), and optical conductivity/absorbance spectra. We stress that the materials in the C2DB comprise both experimentally known as well as hypothetical materials, i.e. materials that may or may not be possible to synthesize in reality.

In this paper, we present an ab initio high-throughput computation of the resonant first-order Raman spectra of more than 700 monolayers selected as the most stable 2D crystals from the C2DB. The calculations are based on an efficient density functional theory (DFT) implementation of the first-order Raman process employing a localized atomic orbital (LCAO) basis set27. We describe the implementation and the automated workflow for computing the Raman spectra at three different excitation frequencies and nine polarization setups. All calculated Raman spectra are provided in Supplementary Figs. 2–734, and can be found at the C2DB website (http://c2db.fysik.dtu.dk). In addition, the applied computational routines are freely available online through the website. Our numerical results are benchmarked against available experimental data for selected 2D crystals (15 different monolayers) such as MoS2, MoSSe, and MoSe2. The calculated spectra show excellent agreement with experiments for the Raman peak positions and acceptable agreement for the relative peak intensities. Finally, we analyze the inverse problem of identifying a material based on an input (experimental) Raman spectrum as shown schematically in Fig. 1(b). Using MoS2 (H-phase) and WTe2 (T$${}^{\prime}$$-phase) as two examples, we find that a simple descriptor consisting of the first and second moments of the Raman spectrum combined with the Euclidean distance measure suffices to identify the correct material among the 700+ candidate materials in the database. In particular, this procedure can be used to differentiate clearly the distinct structural phases of MoS2 and WTe2. Incidentally, the library of calculated Raman spectra provides a useful dataset for training machine learning algorithms28,29. As such, our work is not only a valuable reference for experimentalists and theoreticians working in the field of 2D materials, but also represents a step in the direction of autonomous (in situ) characterization of materials.

## Results

### Theory of Raman Scattering

We first briefly review the theory of Raman scattering in the context of third-order perturbation theory. As discussed above, accurate modeling of Raman processes requires a quantum mechanical treatment to obtain the electronic properties. Regarding the electromagnetic field, it can be shown that a classical description of the field30,31 yields the same results as the full quantum mechanical theory that quantizes the photon field18,32. The most common approach to Raman calculations is the Kramers–Heisenberg–Dirac approach31, in which the Raman tensor is obtained as a derivative of the electric polarizability with respect to the vibrational normal modes21,26,30,31. Nonetheless, here we employ a more direct and much less-explored approach based on time-dependent third-order perturbation theory to obtain the rate for coherent electronic processes involving creation/annihilation of two photons and one phonon. While the two approaches can be shown to be equivalent18, at least when local field effects can be ignored as is the case for 2D materials, the third-order perturbative approach can be readily extended to higher order Raman processes (e.g. scattering on multiple phonons), and provides a more transparent physical picture of the Raman processes in terms of individual scattering events33. Hence, our computational framework is prepared for future extensions to multi-phonon processes. Note that in terms of computational effort, the perturbative approach is comparable to the polarizability derivative method for typical crystals, for which the matrix element calculation dominates the computation time. In this case, both approaches scale as $${N}_{\nu }{N}_{b}^{2}$$, where Nν and Nb denote the number of phonon modes and electronic bands, respectively.

To derive an expression for the Raman intensity, both electron–light and electron–phonon Hamiltonians are treated as perturbations (the exact forms of these Hamiltonians are given in the method section). A general time-dependent perturbation can be written as $$\hat{H^{^\prime}} (t)\equiv {\sum }_{{\omega }_{1}}{\hat{H^{^\prime}}}({\omega }_{1})\exp (-{\mathrm{i}}{\omega }_{1}t)$$ (ω1 runs over positive and negative frequencies and can also be zero). Note that, in our study, there are three distinct frequency components in $$\hat{H^{^\prime}} (t)$$: input and output frequencies (ωin and ωout) due to the electron–light interaction and zero frequency (i.e. time-independent) for electron–phonon coupling. Within third-order perturbation theory, the transition rate $${P}_{i\to f}^{(3)}$$ from an initial state $$\left|{\Psi }_{i}\right\rangle$$ to a final state $$|{\Psi }_{f}\rangle$$ due to the perturbative Hamiltonian $$\hat{H^{^\prime}} (t)$$, is given by34

$${P}_{i\to f}^{(3)}=\frac{2\pi }{\hslash }{\left|\sum _{ab}\sum_{({\omega }_{1}{\omega }_{2}{\omega }_{3})}\frac{\langle {\Psi }_{f}| {\hat{H^{^\prime}}}({\omega }_{1})| {\Psi }_{a}\rangle \langle {\Psi }_{a}| {\hat{H^{^\prime}}}({\omega }_{2})| {\Psi }_{b}\rangle \langle {\Psi }_{b}| {\hat{H^{^\prime}}}({\omega }_{3})| {\Psi }_{i}\rangle }{({E}_{i}-{E}_{a}+\hslash {\omega }_{2}+\hslash {\omega }_{3})({E}_{i}-{E}_{b}+\hslash {\omega }_{3})}\right|}^{2}\delta ({E}_{f}-{E}_{i}-\hslash \omega)\ .$$
(1)

Here, ab summations are performed over all eigenstates of the unperturbed system (here a set of electrons and phonons) and the sums over ωn with n = 1, 2, 3 are over all three involved frequencies in the perturbative Hamiltonian $$\hat{H}^{\prime} (t)$$. The notation (ω1ω2ω3) indicates that, in performing the summation over ωn, the sum ω1 + ω2 + ω3 = ω is to be held fixed. In addition, Eα with $$\alpha \in \left\{i,f,a,b\right\}$$ denote the energies associated with $$\left|{\Psi }_{\alpha }\right\rangle$$ and the Dirac delta ensures energy conservation. The light field is written as $${\boldsymbol{\mathcal{F}}}(t)={\boldsymbol{\mathcal{F}}}_{\mathrm{in}}{\bf{u}}_{{\mathrm{in}}}\,{\mathrm{exp}}\, (-{\mathrm{i}}{\omega }_{\mathrm{in}}t)+{\boldsymbol{\mathcal{F}}}_{{\mathrm{out}}}{{\mathbf{u}}}_{{\mathrm{out}}}\,{\mathrm{exp}}\,(-{\mathrm{i}}{\omega }_{{\mathrm{out}}}t)+{\mathrm{complex}}$$ conjugate, where $${\boldsymbol{\mathcal{F}}}_{{\mathrm{in}}/{\mathrm{out}}}$$ and ωin/out are the amplitudes and frequencies of the input/output electromagnetic fields, respectively, see Fig. 1(a). In addition, $${{\mathbf{u}}}_{{\rm{in}}/{\rm{out}}}={\sum }_{\alpha }{u}_{{\rm{in}}/{\rm{out}}}^{\alpha }{{\bf{e}}}_{\alpha }$$ denote the corresponding polarization vectors, where eα is the unit vector along the α-direction with α {xyz}.

We now specialize to the case where the initial and final states of the system are given by $$\left|{\Psi }_{i}\right\rangle =\left|0\right\rangle \otimes \left|{n}_{\nu }\right\rangle$$ and $$|{\Psi }_{f}\rangle =|0\rangle \otimes |{n}_{\nu }+1\rangle$$, respectively32 so that Ef − Ei = ων. Here, $$\left|0\right\rangle$$ denotes the ground state of the electronic system and $$\left|{n}_{\nu }\right\rangle$$ is a state with nν phonons at frequency of ων. In this case, the intensity of the Stokes Raman process for a phonon mode is proportional to $${P}_{i\to f}^{(3)}$$, in which the transition rate involves a photon absorption, followed by an emission of a single phonon and photon. For this type of processes, (ω1ω2ω3) are any permutation of (ωin, −ωout, 0), e.g. ω1 = ωin, ω2 = −ωout, ω3 = 0 and five similar terms (all six terms contribute to the response at frequency of ω = ωin − ωout). The total Raman intensity I(ω) is then obtained by summing over all possible final states, i.e. phonon modes ν. Inserting the perturbative Hamiltonians [c.f. Eqs. (6)–(8) in method section] in Eq. (1), the expression for the Stokes Raman intensity involving scattering events by only one phonon can be written

$$I(\omega )={I}_{0}\sum_{\nu }\frac{{n}_{\nu }+1}{{\omega }_{\nu }}{\left|\sum_{\alpha \beta }{u}_{{\rm{in}}}^{\alpha }{R}_{\alpha \beta }^{\nu }{u}_{{\rm{out}}}^{\beta }\right|}^{2}\delta (\omega -{\omega }_{\nu })\ .$$
(2)

Here, I0 is an unimportant constant (since Raman spectra are always reported normalized) that is proportional to the input intensity and depends on the input frequency, and nν is given by the Bose–Einstein distribution, i.e. $${n}_{\nu }\equiv {(\exp [\hslash {\omega }_{\nu }/{k}_{B}T]-1)}^{-1}$$ at temperature T. Due to momentum conservation, only phonons at the center of the Brillouin zone contribute to the one-phonon Raman processes19. Furthermore, $${R}_{\alpha \beta }^{\nu }$$ denotes the Raman tensor for phonon mode ν, see method section. Eq. (2) is used for computing the Raman spectra in this work for a given excitation frequency and polarization setup. It may be noted that one can derive a similar expression for the anti-Stokes Raman intensity by replacing nν + 1 by nν in Eq. (2) and ων by −ων in Eq. (10) in method section. Note, also, that the Raman shift ω is expressed in cm−1 with 1 meV equivalent to 8.0655 cm−1.

### Computational workflow

An overview of the automated workflow for computing the Raman tensor of the materials in the C2DB is shown in Fig. 2. First, the relaxed structures are extracted from the database. In this work, we consider only compounds that are dynamically stable. Next, the electronic band energies and wavefunctions are obtained from a DFT calculation. In parallel, a zone-center phonon calculation is performed to obtain the optical vibrational modes. From the obtained electronic states and phonon modes, the momentum and electron–phonon matrix elements are evaluated and stored. In the final step, for a given excitation frequency and input/output polarization vectors, the Raman spectrum is calculated using Eq. (2). The key feature of the approach outlined here is that the calculation process can be automatized, allowing one to perform thousands of calculations in parallel without human intervention.

For simplicity, we have restricted the study to non-magnetic materials, but our routines can be readily extended to include magnetic materials. The Raman spectra presented in this paper are computed for in-plane polarization, where the incoming and outgoing photons are polarized along the x- or y-directions, i.e. uin/out are either [1, 0, 0] or [0, 1, 0]. The four possible combinations are referred to as xx, xy, yx, and yy polarization setups.

### Raman spectra and comparison with experiments

Figure 3 compares the calculated Raman spectrum of three different monolayer transition metal dichalcogenides (TMDs), namely MoSe2, MoSSe and MoS2, with the experimental data extracted from ref. 35. For all three monolayers, a good agreement is observed both for the peak positions and relative amplitudes of the main peaks. Additional peaks in the experimental spectra presumably originate from the substrate or defects in the samples. The differences between the Raman spectra of the three materials provide valuable information about the crystal structure. Symmetry and Raman activity of phonon modes are determined by the irreducible point group representations. MoS2 and MoSe2 are members of point group D3h, whereas MoSSe lacking a horizontal mirror plane σh belongs to the point group C3v. In Mulliken notation, the irreducible representation of MoS2 and MoSe2 is $$2{{\rm{A}}}_{2}^{{\prime\prime} }+{{\rm{A}}}_{1}^{\prime}+2{{\rm{E}}}^{\prime}+{{\rm{E}}}^{^{\prime\prime} }$$, whereas for MoSSe the lowered symmetry leads to 3A1 + 3E. For MoS2 and MoSe2, one member of both $${{\rm{A}}}_{2}^{{\prime\prime} }$$ and $${\rm{E}}^{\prime}$$ is an acoustic mode, and the other $${{\rm{A}}}_{2}^{{\prime\prime} }$$ mode is Raman inactive. For MoSSe, A1 and E each contain an acoustic mode, and all other modes are Raman active. The relevant modes are shown schematically in Fig. 3(b). In general, a Raman active mode will only appear in certain polarization configurations. The tensorial Raman selection rules follow from the irreducible point group representations36,37 as shown for point groups D3h and C3v in Supplementary Note 1.

Next, we focus on the case of MoS2, and investigate the dependency of the Raman spectrum on the excitation frequency and polarization, see Figs. 4(a) and 4(b), respectively. In Fig. 4(a), the Raman spectra are computed for three commonly used wavelengths of blue, green and red laser sources. In this case both in- and outgoing polarization vectors are along the y-direction (or x-direction). While the relative strength of the first Raman active peak in the spectrum is enhanced slightly for shorter wavelengths, the shape of the spectrum does not change significantly. Note that, in reality, the relative amplitudes of the $${\rm{E}}^{\prime}$$ and $${{\rm{A}}}_{1}^{\prime}$$ modes may change considerably if the excitation frequency coincides with an exciton resonance38,39. This is because excitons can selectively enhance specific Raman modes due to their symmetry40,41. Although this effect is not captured properly in our independent-electron model, it is in principle straightforward to include by using the many-body eigenstates obtained by diagonalizing the Bethe–Salpeter equation (BSE) when evaluating the matrix elements in Eq. (1)41,43,44,45,45. Moreover, the absolute magnitudes of the Raman peaks can vary substantially by changing the excitation wavelength due to the possible resonance with electronic states (resonance Raman spectroscopy)40. Nonetheless, the overall magnitude of the Raman spectra is usually of little practical importance compared to the spectral positions and spectra are typically normalized as done here. Changing the polarization of electromagnetic fields not only influences the relative amplitudes of Raman peaks, but may switch certain modes on and off as shown in Fig. 4(b). For instance, the MoS2$${\rm{E}}^{\prime}$$ mode becomes completely inactive for the perpendicular polarization setup (zz) due to symmetry26. This is easily confirmed using Supplementary Eq. (1) of Supplementary Note 1 predicting an inactive $${\rm{E}}^{\prime}$$ mode for zz-polarization. Note that, although the E mode is Raman active for xz-, yz-, zx-, and zy-polarizations, the intensity is too small to be observed in Fig. 4(b).

We have assessed the quality of the Raman library for a wide range of material compositions and crystal structures. Fig. 5 compares experimental and calculated Raman spectra for 12 monolayers including graphene, hBN, several conventional TMDs in the H- or T$${}^{\prime}$$-phase as well as anisotropic crystals such as phosphorene and Pd2Se4. In general, the number of Raman active modes increases with the number of atoms in the unit cell, as expected. For instance, there are more than eight peaks in the Raman spectrum of Pd2Se4. Furthermore, as a rule of thumb, Raman modes of materials containing heavier atoms are at lower frequencies and vice versa, e.g. the Raman peaks for graphene and hBN appear at frequencies above 1000 cm−1. The experimental data are obtained under various experimental conditions such as different excitation wavelengths and polarizations or diverse sample substrates. Note that if polarized Raman spectra were not available (or in the case of unspecified polarization), an average of all four in-plane polarization settings, i.e. xx, xy, yx, and yy, has been used for generating the theoretical spectra. In general, there is quite good agreement between our calculations and experimental results in all cases, particularly, for the peak positions. The deviations can be attributed to various factors such as substrate and excitonic effects, which are not captured in our calculations, as well as the quality of the experimental samples and other experimental uncertainties, all of which can influence the spectra considerably.

### Identifying materials from their Raman spectra

At this point, we turn to a critical test of the ab initio Raman library: given an experimental Raman spectrum, is it possible to identify the underlying material by comparing the experimental spectrum to a library of calculated spectra? The answer to this question will depend on several factors including: (1) the quality of the experimental spectrum. (2) The quality of the calculated spectra, i.e. the ability of theory to reproduce a (high quality) experimental spectrum for a given material. (3) The size/density of the calculated Raman spectrum database. Obviously, a more densely populated database increases the chances that the experimental sample is, in fact, contained in the database. But, at the same time, this increases the risk of obtaining a false positive, i.e. matching the experimental spectrum by a calculated spectrum of a different material.

Putting the above idea into practice requires a quantitative measure for comparing Raman spectra. In the present work, we use the two lowest moments to fingerprint the Raman spectrum. In general, the Nth Raman moment of the spectrum is given by

$$\langle {\omega }^{N}\rangle \equiv \int_{0}^{\infty }I(\omega ){\omega }^{N}{\rm{d}}\omega =\sum_{\nu }{I}_{\nu }{\omega }_{\nu }^{N}\ ,$$
(3)

where Iν denotes the amplitude of mode ν, i.e. $${I}_{\nu }={I}_{0}({n}_{\nu }+1)| {\sum }_{\alpha \beta }{u}_{{\rm{in}}}^{\alpha }{R}_{\alpha \beta }^{\nu }{u}_{{\rm{out}}}^{\beta }{| }^{2}/{\omega }_{\nu }$$. Note that, for these calculations, we normalize the Raman spectrum such that its zeroth moment becomes one, i.e. $$\mathop{\int}\nolimits_{0}^{\infty }I(\omega ){\rm{d}}\omega ={\sum }_{\nu }{I}_{\nu }=1$$. Therefore, the first Raman moment corresponds to the mean value of the spectrum. Rather than using the second moment, we use the standard deviation of the spectrum as the selected measure, given by

$$\delta \omega =\sqrt{\langle {\omega }^{2}\rangle -{\langle \omega \rangle }^{2}}\ .$$
(4)

Figure 6 shows a scatter plot of 〈ω〉 and δω/〈ω〉 for the 733 monolayers at an excitation wavelength of 532 nm and xx-polarization setup obtained at the room temperature. In this plot, crystals composed of lighter elements appear further to the right because their optical phonons generally have higher energies. Furthermore, crystals with fewer atoms in the unit cell and/or higher degree of symmetry, appear in the bottom of the plot because they have fewer (non-degenerate) phonons and thus fewer peaks in their Raman spectrum resulting in a reduced frequency spread. In particular, δω vanishes for materials with only a single Raman peak such as graphene and hBN.

To test the feasibility of inverse Raman mapping, we evaluate the lowest Raman moment fingerprint for five experimental Raman spectra of MoS2 (H-phase) and three spectra of WTe2 (T$${}^{\prime}$$-phase) obtained from independent studies, see stars in Fig. 6. Similar analyses have been performed for the eleven additional crystals found in Fig. 5, and is provided in Supplementary Note 2. Firstly, note that the fingerprint of MoS2 in the T$${}^{\prime}$$-phase (WTe2 in H-phase) is located relatively far from the H-phase (T$${}^{\prime}$$-phase) fingerprint in the plot, which suggests that the lowest Raman moments are indeed able to distinguish different structural phases of the same material. The insets highlight the regions surrounding the experimental data. The variation in the experimental fingerprints is due to small differences in the Raman spectra, originating from the variations in sample quality, substrate effects, measurement techniques/conditions, etc. Consequently, the precise peak positions and, in particular, their amplitudes can vary from one experiment to another. Clearly, the fingerprints of the calculated spectra for both MoS2 and WTe2 lie close to the experimental data. In a few cases, such as Pd2Se4, the experimental fingerprints lie further from the theoretical predictions, as illustrated in Supplementary Fig. 1. This may partly be due to insufficient sample quality for these less-explored 2D crystals. In fact, the deviation between theory and experiments is comparable to the variation between the different experiments. Importantly, only a few other materials show a similar agreement with the experimental data. This suggests that fingerprints including higher order moments could single out the correct material with even higher precision. For instance, the skewness (based on the third Raman moment) can be used to distinguish MoS2 from CrS2. By manual inspection of the Raman spectra, one readily confirms that the calculated spectra of MoS2 and WTe2 are in fact the best match to the experimental spectra, e.g. other candidates have Raman peaks that are not observed in the experimental spectra or the relative amplitudes of the peaks are completely different from the experimental data. Nonetheless, the procedure of manual inspection can be replaced by a more rigorous and unbiased approach as discussed below.

To compare the experimental and calculated Raman spectra quantitatively, we focus on the experimental data of Tongay et al.46 and Cao et al.47 for MoS2 and WTe2, respectively. The experimental spectra for MoS2 are obtained without any polarizer at 77 K at an excitation wavelength of 488 nm. For WTe2 in Cao et al.47, the experiment is performed at room temperature using a 532 nm laser linearly polarized in-plane. To account for the unspecified polarization, we take the average of Raman spectra for the xx and xy polarization setups in the case of WTe2, while for MoS2 the average of all Raman spectra for transverse components (xx, xy, yx, and yy) is used as the theoretical spectrum. For quantitative comparison with the experimental data, one can use Euclidean distances between the experimental and theoretical spectra as a measure. For two Raman spectra I1(ω) and I2(ω), the Euclidean distance (or L2-norm) I1 − I2 is defined as

$$| | {I}_{1}-{I}_{2}| | \equiv {\left(\int_{0}^{\infty }{\left|{I}_{1}(\omega )-{I}_{2}(\omega )\right|}^{2}{\rm{d}}\omega \right)}^{1/2}\ .$$
(5)

Note that the spectra are normalized such that the total area is unity. Figure 7 shows the computed Euclidean distances from the calculated Raman spectra to the experimental data for both MoS2 and WTe2. We highlight the points corresponding to the materials in the insets of Fig. 6. In both cases, identifying the smallest Euclidean distance confirms that the Raman spectra closest to the experimental data are indeed the calculated spectra of MoS2 and WTe2. This shows that the quality and accuracy of, respectively, the experimental and computed 2D materials Raman spectra, is sufficient for automatic structure identification.

## Discussion

We have introduced a comprehensive library of ab initio computed Raman spectra for more than 700 2D materials spanning a variety of chemical compositions and crystal structures. The 2D materials comprise both experimentally known and hypothetical compounds, all dynamically stable and with low formation energies. Using an efficient first-principles implementation of third-order perturbation theory, the full resonant first-order Raman tensor was calculated including all nine possible combinations for polarization vectors of the input/output photons and three commonly used excitation wavelengths. All spectra are freely available as part of the C2DB and should comprise a valuable reference for both theoreticians and experimentalists in the field. The reliability of the computational approach was demonstrated by comparison with experimental spectra for 15 monolayers such as graphene, hBN, phosphorene and several TMDs in the H-, T-, and T$${}^{\prime}$$-phases.

We carefully tested the feasibility of inverse Raman mapping, i.e. to what extent the library of computed Raman spectra can be used to identify the composition and crystal structure of an unknown material from its Raman spectrum. For the specific cases of MoS2 in H-phase and WTe2 in T$${}^{\prime}$$-phase, we showed that a simple fingerprint based on the lowest moments of the Raman spectrum is sufficient to identify the materials from their experimental Raman spectrum. This represents a significant step in the direction of autonomous identification/characterization of materials. In addition, apart from being a useful reference for 2D materials research, the Raman library can be used to train machine learning algorithms to predict Raman spectra directly from the atomic structure similarly to recent work on prediction of linear optical spectra for molecules48. This is of particular importance in the currently attractive trend of employing machine learning algorithms in materials science28,29.

In the present work, we have focused on Raman processes involving only a single phonon, i.e. first-order Raman processes, since these are typically the dominant contributions to the Raman spectrum. Nonetheless, the presented methodology can be readily extended to include two-phonon scattering processes, although the computational cost will be significantly increased. Excitonic effects in the Raman spectrum have been neglected since most experimental Raman spectra are recorded off-resonance where excitons play a minor role. The inclusion of excitonic effects can be achieved within the presented methodology by employing the many-body eigenstates obtained from the BSE42,49,50 instead of Slater determinantal electron–hole excitations. However, this will mainly affect the amplitude of the Raman peaks which is of secondary importance in practice. We only compute the Raman spectra of monolayers in the present work, but the library can be extended to multi-layer structures. For some 2D materials such as graphene or MoS2 this can be done by employing existing exchange-correlation functionals capable of accurate modeling of van der Waals forces. But for other 2D systems such as phosphorene, further development of exchange-correlation functionals is required to describe the complex inter-layer couplings, particularly for low-frequency Raman modes51. The symmetry of phonons modes have previously been investigated for graphene52, the TMD family37 and phosphorene53 using group theory analysis. Based on the Raman library, such analysis could be performed for a much wider range of materials in future work. Finally, the current work has been restricted to non-magnetic materials, and the ab initio Raman response of magnetic materials is an interesting future research field.

## Methods

### Theory

In the independent-particle approximation, the Hamiltonian of a system of electrons interacting with phonons and electromagnetic fields takes the form $$\hat{H}={\hat{H}}_{0}+{\hat{H}}_{{\rm{e}}\gamma }+{\hat{H}}_{{\rm{e}}\nu }$$, where $${\hat{H}}_{0}$$ is the unperturbed Hamiltonian of the electrons (e) and phonons (ν), $${\hat{H}}_{{\rm{e}}\gamma }$$ describes the electron–light interaction (here written in the velocity or minimal coupling gauge54,55), and $${\hat{H}}_{{\rm{e}}\nu }$$ describes the electron–phonon coupling. In second quantization, they are given by56

$${\hat{H}}_{0}\equiv \sum_{n{\bf{k}}}{\varepsilon }_{n{\bf{k}}}{\hat{c}}_{n{\bf{k}}}^{\dagger }{\hat{c}}_{n{\bf{k}}}+\sum_{\nu {\bf{q}}}\hslash {\omega }_{\nu {\bf{q}}}\left({\hat{a}}_{\nu {\bf{q}}}^{\dagger }{\hat{a}}_{\nu {\bf{q}}}+\frac{1}{2}\right)\ ,$$
(6)
$${\hat{H}}_{{\rm{e}}\gamma }(t)=\frac{e}{m}{\boldsymbol{\mathcal{A}}}(t)\cdot \sum _{nm{\bf{k}}}{{\bf{p}}}_{nm{\bf{k}}}{\hat{c}}_{n{\bf{k}}}^{\dagger }{\hat{c}}_{m{\bf{k}}}\ ,$$
(7)
$${\hat{H}}_{{\rm{e}}\nu }=\sum _{ ^{nm\nu} _{{\bf{k}}{\bf{q}}}}\sqrt{\frac{\hslash }{{\omega }_{\nu {\bf{q}}}}}{g}_{nm{\bf{k}}}^{\nu {\bf{q}}}{\hat{c}}_{n{\bf{k}}}^{\dagger }{\hat{c}}_{m{\bf{k}}}\left({\hat{a}}_{\nu {\bf{q}}}+{\hat{a}}_{\nu (-{\bf{q}})}^{\dagger }\right)\ .$$
(8)

Here, $${\hat{c}}^{\dagger }/\hat{c}$$ and $${\hat{a}}^{\dagger }/\hat{a}$$ are the creation/annihilation operators of electrons and phonons, respectively, $${\boldsymbol{\mathcal{A}}}$$ denotes the vector potential ($${\boldsymbol{\mathcal{F}}}=-\partial {\boldsymbol{\mathcal{A}}}/\partial t$$), εnk is the energy of the single-particle electronic state $$\left|n{\bf{k}}\right\rangle$$, and ωνq denotes the phonon energy of normal mode ν and wavevector q. Furthermore, $${{\bf{p}}}_{nm{\bf{k}}}=\langle n{\bf{k}}| \hat{{\bf{p}}}| m{\bf{k}}\rangle$$ and $${g}_{nm{\bf{k}}}^{\nu {\bf{q}}}=\langle n{\bf{k}}+{\bf{q}}| {\partial }_{\nu {\bf{q}}}{V}^{{\rm{KS}}}| m{\bf{k}}\rangle$$ are the momentum and electron–phonon matrix elements (to the first order in the atomic displacements56), respectively, with the Kohn–Sham potential VKS. The summation over k implies an integral over the first Brillouin zone, i.e. (2π)Dk → VDBZdDk where V is the D-dimensional volume (D = 2 for 2D systems). Note that the $${\boldsymbol{\mathcal{A}}}^{2}$$ term does not contribute to the linear Raman response and, hence, is absent here. Moreover, we neglect the Coulomb interaction between electrons and holes, i.e. excitonic effects. If the Raman spectroscopy is performed with an excitation frequency that matches the exciton energy38, the electron–hole interactions should be included, ideally within the GW and BSE framework41,43,43.

We now insert the Hamiltonians given in Eqs. (6)–(8) in the third-order perturbation rate, Eq. (1). As mentioned in the main text, for the Stokes processes involving one phonon, six permutations of (ωin, − ωout, 0) are used for (ω1ω2ω3). Furthermore, the eigenstates of the unperturbed Hamiltonian $$\left|{\Psi }_{a}\right\rangle$$ (or $$\left|{\Psi }_{b}\right\rangle$$) can be written as $$\left|{\psi }_{e}\right\rangle \otimes \left|{n}_{\nu ^{\prime} }\right\rangle$$, where $$\left|{\psi }_{e}\right\rangle$$ and $$\left|{n}_{\nu ^{\prime} }\right\rangle$$ are the many-body electronic and phononic states, respectively (the index e runs only over many-body electronic states). Since $${\hat{H}}_{{\rm{e}}\nu }$$ and $${\hat{H}}_{{\rm{e}}\gamma }$$ are, respectively, linear in and independent of the phononic operator, the phonon state $$\left|{n}_{\nu ^{\prime} }\right\rangle$$ contributes to the Stokes response only if $$\left|{n}_{\nu ^{\prime} }\right\rangle$$ is either $$\left|{n}_{\nu }\right\rangle$$ or $$\left|{n}_{\nu }\pm 1\right\rangle$$. Consequently, $$\langle {\Psi }_{a}| {\hat{H}}_{{\rm{e}}\gamma }| {\Psi }_{i}\rangle =\langle {\psi }_{e}| {\hat{H}}_{{\rm{e}}\gamma }| 0\rangle {\delta }_{\nu ^{\prime} \nu }{\delta }_{{n}_{\nu ^{\prime} }{n}_{\nu }}$$ and $$\langle {\Psi }_{a}| {\hat{H}}_{{\rm{e}}\nu }| {\Psi }_{i}\rangle =\langle {\psi }_{e}| {\hat{H}}_{{\rm{e}}\nu }| 0\rangle {\delta }_{\nu ^{\prime} \nu }{\delta }_{{n}_{\nu ^{\prime} }({n}_{\nu }\pm 1)}$$ (δij denotes the Kronecker delta). The total Raman intensity I is obtained by summing over all final states, i.e. phonon modes, and given by I(ω) = I0ν(nν + 1)Pν2δ(ω − ων)/ων, where Pν is defined as

$${P}_{\nu }\equiv \,\sum _{ed}\left[\frac{\langle 0| {\bf{u}}_{\rm{in}}\cdot \hat{{\bf{P}}}| {\psi }_{e}\rangle \langle {\psi }_{e}| {\hat{G}}_{\nu} | {\psi }_{d}\rangle \langle {\psi }_{d}| {\bf{u}}_{\rm{out}}\cdot {\hat{{\bf{P}}}}| 0\rangle }{(\hslash {\omega }_{{\rm{in}}}-{{\mathcal{E}}}_{e})(\hslash {\omega }_{{\rm{out}}}-{{\mathcal{E}}}_{d})}\, +\, \frac{\langle 0| {\bf{u}}_{\rm{in}}\cdot {\hat{{\bf{P}}}}| {\psi }_{e}\rangle \langle {\psi }_{e}| {\bf{u}}_{\rm{out}}\cdot {\hat{{\bf{P}}}}| {\psi }_{d}\rangle \langle {\psi }_{b}| {\hat{G}}_{\nu} | 0\rangle }{(\hslash {\omega }_{{\rm{in}}}-{{\mathcal{E}}}_{e})(\hslash {\omega }_{\nu }-{{\mathcal{E}}}_{d})}\right.\\ \, + \, \frac{\langle 0| {\bf{u}}_{\rm{out}}\cdot {\hat{{\bf{P}}}}| {\psi }_{e}\rangle \langle {\psi }_{e}| {\hat{G}}_{\nu} | {\psi }_{d}\rangle \langle {\psi }_{d}| {\bf{u}}_{\rm{in}}\cdot {\hat{{\bf{P}}}}| 0\rangle }{(-\hslash {\omega }_{{\rm{out}}}-{{\mathcal{E}}}_{e})(-\hslash {\omega }_{{\rm{in}}}-{{\mathcal{E}}}_{d})}\, +\, \frac{\langle 0| {\bf{u}}_{\rm{out}}\cdot {\hat{{\bf{P}}}}| {\psi }_{e}\rangle \langle {\psi }_{e}| {\bf{u}}_{\rm{in}}\cdot {\hat{{\bf{P}}}}| {\psi }_{d}\rangle \langle {\psi }_{d}| {\hat{G}}_{\nu} | 0\rangle }{(-\hslash {\omega }_{{\rm{out}}}-{{\mathcal{E}}}_{e})(\hslash {\omega }_{\nu }-{{\mathcal{E}}}_{d})}\\ \, + \, \left.\frac{\langle 0| {\hat{G}}_{\nu} | {\psi }_{e}\rangle \langle {\psi }_{e}| {\bf{u}}_{\rm{in}}\cdot {\hat{{\bf{P}}}}| {\psi }_{d}\rangle \langle {\psi }_{d}| {\bf{u}}_{\rm{out}}\cdot {\hat{{\bf{P}}}}| 0\rangle }{(-\hslash {\omega }_{\nu }-{{\mathcal{E}}}_{e})(\hslash {\omega }_{{\rm{out}}}-{{\mathcal{E}}}_{d})}\, +\, \frac{\langle 0| {\hat{G}}_{\nu} | {\psi }_{e}\rangle \langle {\psi }_{e}| {\bf{u}}_{\rm{out}}\cdot {\hat{{\bf{P}}}}| {\psi }_{d}\rangle \langle {\psi }_{d}| {\bf{u}}_{\rm{in}}\cdot {\hat{{\bf{P}}}}| 0\rangle }{(-\hslash {\omega }_{\nu }-{{\mathcal{E}}}_{e})(-\hslash {\omega }_{{\rm{in}}}-{{\mathcal{E}}}_{d})}\right].$$
(9)

Here, $${{\mathcal{E}}}_{e/d}$$ denote the electronic energies (with respect to the electronic ground state), the summations over e and d include only electronic states, and $$\hat{{\bf{P}}}$$ and $$\hat{G}_\nu$$ are many-body electronic operators given by $$\hat{{\bf{P}}}\equiv {\sum }_{nm{\bf{k}}}{{\bf{p}}}_{nm{\bf{k}}}{\hat{c}}_{n{\bf{k}}}^{\dagger }{\hat{c}}_{m{\bf{k}}}$$ and $$\hat{G}_\nu \equiv {\sum }_{nm{\bf{k}}}{g}_{nm{\bf{k}}}^{\nu {\bf{0}}}{\hat{c}}_{n{\bf{k}}}^{\dagger }{\hat{c}}_{m{\bf{k}}}$$. Note that the momentum conservation implies that only phonons at q = 0 contribute to the response here19, i.e. ων ≡ ων0. Since both $$\hat{{\bf{P}}}$$ and $$\hat{G}_\nu$$ are bi-linear in the electronic operator, for a non-vanishing matrix elements, $$|{\psi }_{e/d}\rangle$$ must include singly-excited states, i.e. terms in the form $${\hat{c}}_{c{\bf{k}}}^{\dagger }{\hat{c}}_{v{\bf{k}}}\left|0\right\rangle$$ (indices c and v imply conduction and valence bands, respectively)50. Excitonic effects can readily be introduced at this stage by incorporating the BSE solution45,50. However, we neglect the excitonic effects in the present work, and hence, each singly-excited state contributes individually to the response, i.e. $$|{\psi }_{e/d}\rangle ={\hat{c}}_{c{\bf{k}}}^{\dagger }{\hat{c}}_{v{\bf{k}}}|0\rangle$$ with an energy of $${{\mathcal{E}}}_{e/d}={\varepsilon }_{c{\bf{k}}}-{\varepsilon }_{v{\bf{k}}}$$. At finite temperature, the expression for $$|{\psi }_{e/d}\rangle$$ should be taken as $$|{\psi }_{e/d}\rangle ={f}_{i}(1-{f}_{j}){\hat{c}}_{j{\bf{k}}}^{\dagger }{\hat{c}}_{i{\bf{k}}}|0\rangle$$, where $${f}_{i}\equiv {(1+\exp [({\varepsilon }_{i{\bf{k}}}-\mu )/{k}_{B}T])}^{-1}$$ is the Fermi–Dirac distribution with chemical potential μ.

Rewriting Pν in terms of the single-particle variables and polarization vectors leads to Eq. (2) for the Raman intensity, where the Raman tensor component, $${R}_{\alpha \beta }^{\nu }$$, reads

$${R}_{\alpha \beta }^{\nu }\equiv \sum _{ijmn{\bf{k}}}\left[\frac{{p}_{ij}^{\alpha }({g}_{jm}^{\nu }{\delta }_{in}-{g}_{ni}^{\nu }{\delta }_{jm}){p}_{mn}^{\beta }}{(\hslash {\omega }_{{\rm{in}}}-{\varepsilon }_{ji})(\hslash {\omega }_{{\rm{out}}}-{\varepsilon }_{mn})}\, +\, \frac{{p}_{ij}^{\alpha }({p}_{jm}^{\beta }{\delta }_{in}-{p}_{ni}^{\beta }{\delta }_{jm}){g}_{mn}^{\nu }}{(\hslash {\omega }_{{\rm{in}}}-{\varepsilon }_{ji})(\hslash {\omega }_{\nu }-{\varepsilon }_{mn})}\right.\\ \quad \quad +\, \frac{{p}_{ij}^{\beta }({g}_{jm}^{\nu }{\delta }_{in}-{g}_{ni}^{\nu }{\delta }_{jm}){p}_{mn}^{\alpha }}{(-\hslash {\omega }_{{\rm{out}}}-{\varepsilon }_{ji})(-\hslash {\omega }_{{\rm{in}}}-{\varepsilon }_{mn})}\, +\, \frac{{p}_{ij}^{\beta }({p}_{jm}^{\alpha }{\delta }_{in}-{p}_{ni}^{\alpha }{\delta }_{jm}){g}_{mn}^{\nu }}{(-\hslash {\omega }_{{\rm{out}}}-{\varepsilon }_{ji})(\hslash {\omega }_{\nu }-{\varepsilon }_{mn})}\\ \quad \quad +\,\left.\frac{{g}_{ij}^{\nu }({p}_{jm}^{\alpha }{\delta }_{in}-{p}_{ni}^{\alpha }{\delta }_{jm}){p}_{mn}^{\beta }}{(-\hslash {\omega }_{\nu }-{\varepsilon }_{ji})(\hslash {\omega }_{{\rm{out}}}-{\varepsilon }_{mn})}\, +\, \frac{{g}_{ij}^{\nu }({p}_{jm}^{\beta }{\delta }_{in}-{p}_{ni}^{\beta }{\delta }_{jm}){p}_{mn}^{\alpha }}{(-\hslash {\omega }_{\nu }-{\varepsilon }_{ji})(-\hslash {\omega }_{{\rm{in}}}-{\varepsilon }_{mn})}\right]{f}_{i}(1-{f}_{j}){f}_{n}(1-{f}_{m})\ .$$
(10)

Here, εij ≡ εik − εjk, $${p}_{ij}^{\alpha }\equiv \langle i{\bf{k}}| {\hat{p}}^{\alpha }| j{\bf{k}}\rangle$$, $${g}_{ij}^{\nu }\equiv \langle i{\bf{k}}| {\partial }_{\nu {\bf{0}}}{V}^{{\rm{KS}}}| j{\bf{k}}\rangle$$, and (ijmn)/ν are the electron/phonon band index. The line-shape broadening is accounted for by adding a small phenomenological imaginary part, iη, to the photon frequencies ωin/out → ωin/out + iη. We set the frequency broadening to η = 200 meV in our calculations.

### First-principles calculations

All DFT calculations are performed with the projector-augmented wave code, GPAW57,58, in combination with the atomic simulation environment (ASE)59. The Perdew–Burke–Ernzerhof (PBE) exchange-correlation functional is used60 and the Kohn–Sham orbitals are expanded using the double zeta polarized (dzp) basis set27. Despite its fairly small size, the dzp basis set provides sufficiently accurate phonon modes. This has been tested by benchmarking the phonon frequencies obtained from this basis set against the results using the commonly-employed plane waves for 700+ monolayers (more than 7000 phonon modes). We confirm that for approximately 80% of all phonons, the discrepancy between the two approaches is less than 5%. Also, the choice of exchange-correlation functional may slightly influence the Raman spectra. For instance, it is known that the PBE functional tends to overestimate the lattice parameters and underestimate the phonon frequencies in crystals61, whereas the opposite occurs for the local-density approximation (LDA) functionals. Nonetheless, this choice only slightly influences our calculated Raman spectra, and PBE usually provides sufficiently accurate phonon frequencies in the range of theoretical and experimental uncertainties62. The monolayers are placed between two vacuum regions with thicknesses of 15 Å. A convergence test of Raman spectra with respect to the wavevector density is performed for several materials, and a mesh with the density of 25 Å−1 for ground state calculations was chosen. The phonon modes are obtained using the standard approach based on calculating the dynamical matrices in the harmonic approximation63. The dynamical matrix is evaluated using the small-displacement method64, where the change of forces on a specific atom caused by varying the position of neighboring atoms is computed. Since only the zone-centered (Γ-point) phonons are required, the phonon modes can be computed based on the crystal unit cell. A k-mesh with a density of 12 Å−1 is used for phonon calculations, and the forces are converged within 10−6 eVÅ−1. Since the wavefunctions and Kohn–Sham potentials in GPAW are evaluated on a real-space grid57, a convergence test with respect to this grid spacing is performed and a real-space grid of 0.2 Å is chosen for calculations. The electron–phonon matrix elements are then obtained within the adiabatic approximation using a finite difference technique for evaluating the derivative of the Kohn-Sham potential65. Similarly, the momentum matrix elements are calculated using the finite difference technique and the correction terms due to projector-augmented waves66 are added. The width of the Fermi–Dirac occupations is set to kBT = 50 meV for faster convergence of the DFT results. For generating the Raman spectra, a Gaussian [$$G(\omega )={(\sigma \sqrt{2\pi })}^{-1}\exp (-{\omega }^{2}/2{\sigma }^{2})$$] with a variance σ = 3 cm−1 is used to replace the Dirac delta function, which accounts for the inhomogeneous broadening of phonon modes. The temperature of the Bose–Einstein distributions is set to 300 K for all calculations except for the results in top panel of Fig. 7, where a temperature of 77 K is used. The calculations are submitted, managed, and received using the simple MyQueue workflow tool67, which is a Python front-end to job scheduler.

### Experimental Raman spectra

The experimental Raman spectra are extracted from the figures in the corresponding references using a common plot digitizer. To remove the noise in the experimental data, they are filtered using a Savitzky–Golay filter68 of order three with a filter window length of eleven. For a fair comparison with our theoretical spectra in Fig. 5, we have convolved the experimental spectra with a Gaussian function with variance of 10 cm−1 to reduce the effect of possible but unimportant small frequency shifts between the experimental and theoretical spectra. Furthermore, the Raman moments have been calculated over a frequency range where the main Raman peaks appear, from 350 to 450 cm−1 for MoS2 and from 75 to 260 cm−1 for WTe2. For calculating the Euclidean distance, both the experimental and theoretical spectra are convolved with a Gaussian function with variance of 6 cm−1.

## Data availability

All calculated Raman spectra are freely available online through https://cmrdb.fysik.dtu.dk/c2db/. Other data is available from the corresponding author upon reasonable request.

## Code availability

GPAW is an open-source DFT Python code based on the projector-augmented wave method and the ASE, which is available at https://wiki.fysik.dtu.dk/gpaw/. The Raman code used for generating Raman spectra in this work will be available in future releases of the code.

## References

1. 1.

Novoselov, K. S. et al. Electric field effect in atomically thin carbon films. Science 306, 666–669 (2004).

2. 2.

Anasori, B., Lukatskaya, M. R. & Gogotsi, Y. 2D metal carbides and nitrides (MXenes) for energy storage. Nat. Rev. Mater. 2, 16098 (2017).

3. 3.

Xi, X. et al. Strongly enhanced charge-density-wave order in monolayer NbSe2. Nat. Nanotechnol. 10, 765–769 (2015).

4. 4.

Liu, H. et al. Phosphorene: An unexplored 2D semiconductor with a high hole mobility. ACS Nano 8, 4033–4041 (2014).

5. 5.

Mak, K. F., Lee, C., Hone, J., Shan, J. & Heinz, T. F. Atomically thin MoS2: a new direct-gap semiconductor. Phys. Rev. Lett. 105, 136805 (2010).

6. 6.

Wang, Q. H., Kalantar-Zadeh, K., Kis, A., Coleman, J. N. & Strano, M. S. Electronics and optoelectronics of two-dimensional transition metal dichalcogenides. Nat. Nanotechnol. 7, 699–712 (2012).

7. 7.

Ci, L. et al. Atomic layers of hybridized boron nitride and graphene domains. Nat. Mater. 9, 430–435 (2010).

8. 8.

Huang, B. et al. Layer-dependent ferromagnetism in a van der Waals crystal down to the monolayer limit. Nature 546, 270–273 (2017).

9. 9.

Wang, H. et al. High-quality monolayer superconductor NbSe2 grown by chemical vapour deposition. Nat. Commun. 8, 394 (2017).

10. 10.

Cao, Y. et al. Unconventional superconductivity in magic-angle graphene superlattices. Nature 556, 43–50 (2018).

11. 11.

Haastrup, S. et al. The Computational 2D Materials Database: high-throughput modeling and discovery of atomically thin crystals. 2D Mater. 5, 042002 (2018).

12. 12.

Bonaccorso, F., Sun, Z., Hasan, T. & Ferrari, A. C. Graphene photonics and optoelectronics. Nat. Photon. 4, 611–622 (2010).

13. 13.

Gjerding, M. N., Petersen, R., Pedersen, T. G., Mortensen, N. A. & Thygesen, K. S. Layered van der Waals crystals with hyperbolic light dispersion. Nat. Commun. 8, 320 (2017).

14. 14.

Ferrari, A. C. Raman spectroscopy of graphene and graphite: Disorder, electron–phonon coupling, doping and nonadiabatic effects. Solid State Commun. 143, 47–57 (2007).

15. 15.

Ferrari, A. C. & Basko, D. M. Raman spectroscopy as a versatile tool for studying the properties of graphene. Nat. Nanotechnol. 8, 235–246 (2013).

16. 16.

Neumann, C. et al. Raman spectroscopy as probe of nanometre-scale strain variations in graphene. Nat. Commun. 6, 8429 (2015).

17. 17.

Li, X.-L. et al. Layer-number dependent optical properties of 2D materials and their application for thickness determination. Adv. Funct. Mater. 27, 1604468 (2017).

18. 18.

Long, D. A. The Raman Effect: A Unified Treatment of the Theory of Raman Scattering by Molecules. (John Wiley & Sons Ltd, Chichester, 2002).

19. 19.

Saito, R., Hofmann, M., Dresselhaus, G., Jorio, A. & Dresselhaus, M. S. Raman spectroscopy of graphene and carbon nanotubes. Adv. Phys. 60, 413–550 (2011).

20. 20.

Wilson E. B., C. P., Decius J. C. (ed.) Molecular Vibrations. The Theory of Infrared and Raman vibrational Spectra (Dover Publications, New York, 1980).

21. 21.

Umari, P. & Pasquarello, A. First-principles analysis of the Raman spectrum of vitreous silica: comparison with the vibrational density of states. J. Phys. Condens. Matter 15, S1547–S1552 (2003).

22. 22.

Liang, Q., Dwaraknath, S. & Persson, K. A. High-throughput computation and evaluation of Raman spectra. Sci. Data 6, 135 (2019).

23. 23.

Cai, Q. et al. Raman signature and phonon dispersion of atomically thin boron nitride. Nanoscale 9, 3059–3067 (2017).

24. 24.

Jiang, Y. C., Gao, J. & Wang, L. Raman fingerprint for semi-metal WTe2 evolving from bulk to monolayer. Sci. Rep. 6, 19624 (2016).

25. 25.

Dewandre, A., Verstraete, M. J., Grobert, N. & Zanolli, Z. Spectroscopic properties of few-layer tin chalcogenides. J. Phys.: Mater. 2, 044005 (2019).

26. 26.

Liang, L. & Meunier, V. First-principles Raman spectra of MoS2, WS2 and their heterostructures. Nanoscale 6, 5394 (2014).

27. 27.

Larsen, A. H., Vanin, M., Mortensen, J. J., Thygesen, K. S. & Jacobsen, K. W. Localized atomic basis set in the projector augmented wave method. Phys. Rev. B 80, 195112 (2009).

28. 28.

Butler, K. T., Davies, D. W., Cartwright, H., Isayev, O. & Walsh, A. Machine learning for molecular and materials science. Nature 559, 547–555 (2018).

29. 29.

Schmidt, J., Marques, M. R. G., Botti, S. & Marques, M. A. L. Recent advances and applications of machine learning in solid-state materials science. Npj Comput. Mater. 5, 83 (2019).

30. 30.

Albrecht, A. C. On the theory of Raman intensities. J. Chem. Phys. 34, 1476–1484 (1961).

31. 31.

Lee, S.-Y. & Heller, E. J. Time-dependent theory of Raman scattering. J. Chem. Phys. 71, 4777 (1979).

32. 32.

Ganguly, A. K. & Birman, J. L. Theory of lattice Raman scattering in insulators. Phys. Rev. 162, 806–816 (1967).

33. 33.

Yu, P. Y. & Cardona, M. Fundamentals of Semiconductors: Physics and Materials Properties (Springer, Berlin, 2010).

34. 34.

Grosso, G. & Parravicini, G. P. Solid State Physics. (Academic Press, San Diego, CA, 2000).

35. 35.

Zhang, J. et al. Janus monolayer transition-metal dichalcogenides. ACS Nano 11, 8192–8198 (2017).

36. 36.

Loudon, R. The Raman effect in crystals. Adv. Phys. 13, 423–482 (1964).

37. 37.

Ribeiro-Soares, J. et al. Group theory analysis of phonons in two-dimensional transition metal dichalcogenides. Phys. Rev. B 90, 115438 (2014).

38. 38.

Carvalho, B. R., Malard, L. M., Alves, J. M., Fantini, C. & Pimenta, M. A. Symmetry-dependent exciton-phonon coupling in 2D and bulk MoS2 observed by resonance Raman scattering. Phys. Rev. Lett. 114, 136403 (2015).

39. 39.

Scheuschner, N., Gillen, R., Staiger, M. & Maultzsch, J. Interlayer resonant Raman modes in few-layer MoS2. Phys. Rev. B 91, 235409 (2015).

40. 40.

Lee, J.-U., Kim, K. & Cheong, H. Resonant Raman and photoluminescence spectra of suspended molybdenum disulfide. 2D Mater. 2, 044003 (2015).

41. 41.

del Corro, E. et al. Atypical exciton–phonon interactions in WS2 and WSe2 monolayers revealed by resonance Raman spectroscopy. Nano Lett. 16, 2363–2368 (2016).

42. 42.

Gillet, Y., Giantomassi, M. & Gonze, X. First-principles study of excitonic effects in Raman intensities. Phys. Rev. B 88, 094305 (2013).

43. 43.

Wang, Y., Carvalho, B. R. & Crespi, V. H. Strong exciton regulation of Raman scattering in monolayer MoS2. Phys. Rev. B 98, 161405 (2018).

44. 44.

Taghizadeh, A. & Pedersen, T. G. Nonlinear optical selection rules of excitons in monolayer transition metal dichalcogenides. Phys. Rev. B 99, 235433 (2019).

45. 45.

Taghizadeh, A. & Pedersen, T. G. Nonlinear excitonic spin Hall effect in monolayer transition metal dichalcogenides. 2D Mater. 7, 015003 (2019).

46. 46.

Tongay, S. et al. Two-dimensional semiconductor alloys: Monolayer Mo1−xWxSe2. Applied Physics Letters, 104, 012101 (2014).

47. 47.

Cao, Y. et al. Anomalous vibrational modes in few layer WTe2 revealed by polarized Raman scattering and first-principles calculations. 2D Materials, 4, 035024 (2017).

48. 48.

Ghosh, K. et al. Deep learning spectroscopy: Neural networks for molecular excitation spectra. Adv. Sci. 6, 1801367 (2019).

49. 49.

Cudazzo, P. et al. Exciton band structure in two-dimensional materials. Phys. Rev. Lett. 116, 066803 (2016).

50. 50.

Taghizadeh, A. & Pedersen, T. G. Gauge invariance of excitonic linear and nonlinear optical response. Phys. Rev. B 97, 205432 (2018).

51. 51.

Liang, L. et al. Low-frequency shear and layer-breathing modes in Raman scattering of two-dimensional materials. ACS Nano 11, 11777–11802 (2017).

52. 52.

Malard, L. M., Guimarães, M. H. D., Mafra, D. L., Mazzoni, M. S. C. & Jorio, A. Group-theory analysis of electrons and phonons in n-layer graphene systems. Phys. Rev. B 79, 125426 (2009).

53. 53.

Ribeiro-Soares, J., Almeida, R. M., Cançado, L. G., Dresselhaus, M. S. & Jorio, A. Group theory for structural analysis and lattice vibrations in phosphorene systems. Phys. Rev. B 91, 205421 (2015).

54. 54.

Taghizadeh, A., Hipolito, F. & Pedersen, T. G. Linear and nonlinear optical response of crystals using length and velocity gauges: Effect of basis truncation. Phys. Rev. B 96, 195413 (2017).

55. 55.

Reichardt, S. & Wirtz, L. Theory of resonant Raman scattering: Towards a comprehensive ab initio description. Phys. Rev. B 99, 174312 (2019).

56. 56.

Giustino, F. Electron–phonon interactions from first principles. Rev. Mod. Phys. 89, 015003 (2017).

57. 57.

Mortensen, J. J., Hansen, L. B. & Jacobsen, K. W. Real-space grid implementation of the projector augmented wave method. Phys. Rev. B 71, 035109 (2005).

58. 58.

Enkovaara, J. et al. Electronic structure calculations with GPAW: a real-space implementation of the projector augmented-wave method. J. Condens. Matter Phys. 22, 253202 (2010).

59. 59.

Larsen, A. H. et al. The atomic simulation environment: a Python library for working with atoms. J. Phys. Condens. Matter 29, 273002 (2017).

60. 60.

Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).

61. 61.

Zhang, G.-X., Reilly, A. M., Tkatchenko, A. & Scheffler, M. Performance of various density-functional approximations for cohesive properties of 64 bulk solids. N. J. Phys. 20, 063020 (2018).

62. 62.

Skelton, J. M. et al. Influence of the exchange-correlation functional on the quasi-harmonic lattice dynamics of II–VI semiconductors. J. Chem. Phys. 143, 064710 (2015).

63. 63.

Fritsch, J. Density functional calculation of semiconductor surface phonons. Phys. Rep. 309, 209–331 (1999).

64. 64.

Alfè, D. PHON: A program to calculate phonons using the small displacement method. Comput. Phys. Commun. 180, 2622–2633 (2009).

65. 65.

Kaasbjerg, K., Thygesen, K. S. & Jacobsen, K. W. Phonon-limited mobility in n-type single-layer MoS2 from first principles. Phys. Rev. B 85, 115317 (2012).

66. 66.

Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953–17979 (1994).

67. 67.

Mortensen, J., Gjerding, M. & Thygesen, K. MyQueue: task and workflow scheduling system. J. Open Source Softw. 5, 1844 (2020).

68. 68.

Savitzky, A. & Golay, M. J. E. Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 36, 1627–1639 (1964).

69. 69.

Wu, J. B., Lin, M. L., Cong, X., Liu, H. N., & Tan, P. H. Raman spectroscopy of graphene-based materials and its applications in related devices. Chemical Society Reviews, 47, 1822–1873 (2018).

70. 70.

Xia, F., Wang, H., & Jia, Y. Rediscovering black phosphorus as an anisotropic layered material for optoelectronics and electronics. Nature communications, 5, 1–6 (2014).

71. 71.

Ruppert, C., Aslan, O. B., & Heinz, T. F. Optical properties and band gap of single-and few-layer MoTe2 crystals. Nano letters, 14, 6231–6236 (2014).

72. 72.

Liu, H. L. et al. Deep-ultraviolet Raman scattering spectroscopy of monolayer WS 2. Scientific reports, 8, 1–10 (2018).

73. 73.

Yang, M. et al. Anharmonicity of monolayer MoS2, MoSe2, and WSe2: A Raman study under high pressure and elevated temperature. Applied Physics Letters, 110, 093108 (2017).

74. 74.

Zhao, Y. et al. Extraordinarily strong interlayer interaction in 2D layered PtS2. Advanced materials, 28, 2399–2407 (2016).

75. 75.

Yan, M. et al. High quality atomically thin PtSe2 films grown by molecular beam epitaxy. 2D Materials, 4, 045015 (2017).

76. 76.

Gupta, U. et al. Characterization of few-layer 1T-MoSe2 and its superior performance in the visible-light induced hydrogen evolution reaction. APL Materials, 2, 092802 (2014).

77. 77.

Chen, K. et al. A Simple Method for Synthesis of High‐Quality Millimeter‐Scale 1T′ Transition‐Metal Telluride and Near‐Field Nanooptical Properties. Advanced Materials, 29, 1700704 (2017).

78. 78.

Yu, J. et al. Direct observation of the linear dichroism transition in two-dimensional palladium diselenide. Nano Letters, 20, 1172–1182 (2020).

79. 79.

Li, H. et al. From bulk to monolayer MoS2: evolution of Raman scattering. Advanced Functional Materials, 22, 1385–1390 (2012).

80. 80.

Lanzillo, N. A. et al. Temperature-dependent phonon shifts in monolayer MoS2. Applied Physics Letters, 103, 093102 (2013).

81. 81.

Zhou, J. et al. Large‐area and high‐quality 2D transition metal telluride. Advanced Materials, 29, 1603471 (2017).

## Acknowledgements

The authors thank M. N. Gjerding for helpful discussions throughout the project. This work was supported by the Center for Nanostructured Graphene (CNG) under the Danish National Research Foundation (project DNRF103). K.S.T. acknowledges support from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (Grant No. 773122, LIMA).

## Author information

Authors

### Contributions

U.L. wrote the initial computer code. A.T. developed the formalism, modified the code and carried out the calculations. All authors discussed the results. A.T., T.G.P., and K.S.T. wrote the manuscript. T.G.P. and K.S.T. supervised the project.

## Ethics declarations

### Competing Interests

The authors declare no competing interests.

Peer review informationNature Communications thanks Ado Jorio, Vincent Meunier, and the other, anonymous reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Taghizadeh, A., Leffers, U., Pedersen, T.G. et al. A library of ab initio Raman spectra for automated identification of 2D materials. Nat Commun 11, 3011 (2020). https://doi.org/10.1038/s41467-020-16529-6

• Accepted:

• Published:

• ### Superconductivity in two-dimensional η-Mo3C2 films

• Wei Shi
• Junyao Yu
• Libo Gao

Science China Materials (2021)