Introduction

The diversity among known terpenes and terpene derivatives (terpenoids) is a testament to the power of terpene synthases in transforming simple biosynthetic building blocks into complex, stereodense natural products.1, 2, 3, 4, 5, 6, 7 Despite decades of work, scientists continue to isolate and identify terpenes and terpenoids with new carbon frameworks from plants, fungi and bacteria.2, 3, 4, 5, 6, 7, 8, 9, 10 Moreover, the capacity of wild-type terpene synthases can be further magnified through genetic engineering. For instance, Cane, Ikeda and co-workers recently described the frameworks of a variety of new terpenes isolated from an engineered Streptomyces host, although relative configurations for many were not fully assigned.11 Here we show how NMR chemical shifts computed with density functional theory (DFT) methods can be used to assign the relative configurations of such structures, using the three structures in Scheme 1 as representative examples.12, 13, 14 This work builds on our past work on confirming, assigning and reassigning the structures of terpenes15, 16, 17 and terpenoids,18, 19 and provides a missing piece of the structural puzzle defined by Cane and Ikeda. This work represents only the second case of assigning the relative stereochemical configuration of complex hydrocarbon natural products, that is, those lacking polar functional groups, using DFT.15

Results and Discussion

Sesquiterpene 8 (isohirsut-1-ene)

As structure 8 has four stereogenic centers, there are eight possible diastereomers (Scheme 2; each has a corresponding enantiomer, but absolute configuration is not addressed here). First, we constructed and optimized the structures of the eight diastereomers with B3LYP/6-31+G(d,p) in the gas phase (see Methods section). The resulting geometries were then subjected to NMR calculations using SMD(CHCl3)–mPW1PW91/6-311+G(2d,p). Only structure 8F was found to have more than one conformer close enough in energy to contribute to the overall chemical shifts (here there were three); the shifts reported in Table 1 for 8F therefore represent a Boltzmann-weighted average. The mean absolute deviations (MADs) between the experimental and calculated values were determined for each set of carbon and proton chemical shifts for each diastereomer versus the reported experimental values and are shown in Table 1.11 Structure 8D is the best match to the experimental data, based on MAD values for both 1H (0.10 p.p.m.) and 13C (1.60 p.p.m.). Structure 8D also does not contain any large outliers when comparing to the experimental values. To confirm our assignment, DP4 analysis was performed. This statistical analysis, which makes comparisons on an atom-by-atom basis, indicated that 8F is actually the best match (87.4% probability) in terms of 1H shifts (note that 8F also has a small 1H MAD: 0.12 p.p.m.), but 8D is the best match in terms of 13C shifts (99.6%) and when both 1H and 13C shifts are combined in the same analysis (98.8%). Although there is no consensus as to which shifts are generally more diagnostic—1H or 13C—it is clear that 13C shifts are for compound 8 (based on both MAD and DP4 analysis).

Table 1 Differences between computed and experimental (center column) chemical shifts for each atom in each of the diastereomers of compound 8 (computed minus experimental)

Sesquiterpene 9 (isohirsut-4-ene)

Compound 9 is a constitutional isomer of compound 8. As a result, in addition to assigning the relative configuration as we did for 8, we can also confirm that our analysis, simply based on chemical shifts (even in the absence of coupling constants, DEPT data and so on), would differentiate between the hydrocarbon frameworks of 8 and 9. Compound 9 also has four stereogenic centers, again giving rise to eight diastereomers (Scheme 3). Conformational searching yielded three distinct conformers for 9A and 9D after optimization, but for each case, two of these were >3 kcal mol−1 higher in free energy than the lowest-energy conformer, meaning that they would not contribute significantly to the conformational population. Diastereomer 9D is clearly the best match, in terms of 1H or 13C MADs and DP4 probabilities, for the experimental data (Table 2).

Table 2 Differences between computed and experimental (center column) chemical shifts for each atom in each of the diastereomers of compound 9 (computed minus experimental)

Also, as reflected in the data in Table 3, our computed chemical shifts clearly distinguish between the two constitutional isomers 8 and 9. We compared the conformers that were predicted to be the best match for each isomer with the experimental data for the other, that is, computed shifts for 8D were compared with the experimental shifts for 9 and computed shifts for 9D were compared with the experimental shifts for 8. The MADs for both 1H and 13C were much larger for the incorrect isomers (compare values in Table 3 with those in Tables 1 and 2).

Table 3 Differences between computed chemical shifts for 8D and experimental shifts for 9 (left) and between computed chemical shifts for 9D and experimental shifts for 8 (right)

Diterpene 11 (tsukubadiene)

We examined compound 11 because it is larger and more flexible than compounds 8 and 9, and it contains an uncommon nine-membered ring, thereby providing a more challenging test of our methods. Compound 11 also contains four stereogenic centers, leading again to eight diastereomers (Scheme 4). Due to its somewhat flexible nine-membered ring, conformational searching gave many possible conformations for each diastereomer (see Supplementary Information for energies of conformers). After optimization with B3LYP/6-31+G(d,p), five conformers were found for 11A that are within 2.1 kcal mol−1 (free energy) of the lowest-energy conformer found (a conformer that is 2 kcal mol−1 higher in energy accounts for ~2% of the population). For 11B, two conformers were found to be within 0.015 kcal mol−1 of the lowest, whereas the third conformer was 6.5 kcal mol−1 higher in energy. Diastereomer 11C had two relevant conformers (spanning, in this case, a 1.4 kcal mol−1 range), 11D had five relevant conformers (spanning, in this case, a 2.3 kcal mol−1 range), 11E had five relevant conformers (spanning, in this case, a 2.4 kcal mol−1 range), 11F had 10 relevant conformers (spanning, in this case, a 1.9 kcal mol−1 range), 11G had four relevant conformers (spanning, in this case, a 2.0 kcal mol−1 range) and 11H had two relevant conformers (spanning, in this case, a 0.5 kcal mol−1 range). Due to the presence of more than one contributing conformer, a Boltzmann-weighted average was used to represent the computed chemical shifts for each diastereomer of compound 11. Listed in Table 4 are the experimental chemical shifts for compound 11 and the differences between computed shifts for each diastereomer and the experimental values. These results indicate that 11A is the best match to the experimental data, with MADs of 2.12 p.p.m. for 13C and 0.09 p.p.m. for 1H and very high DP4 probabilities. Note that diastereomer 11E actually has a lower 13C MAD, but a higher 1H MAD than 11A, but when comparisons are made on a shift-by-shift basis (DP4 analysis), 13C shifts for 11E are a much poorer match for the experimental data than are those for 11A.

Table 4 Differences between computed and experimental (center column) chemical shifts for each atom in each of the diastereomers of compound 11 (computed minus experimental)

Conclusion

We have shown in this report that 1H and 13C chemical shifts computed using quantum chemistry can be used to assign relative configurations of hydrocarbon natural products with high confidence. Specifically, we disclose here the relative configurations of isohirsut-1-ene, isohirsut-4-ene and tsukubadiene, three terpenes previously isolated by Cane, Ikeda and co-workers from an engineered Streptomyces (three-dimensional structures shown in Figure 1).11 Note that if the experimental chiroptical data were available, quantum chemistry could also be used to assign absolute configurations.20 This work builds on our previous assignment of the structure of rhizathalene,15 providing further evidence that this approach is generally useful for terpenes. It is remarkable that such simple methods can be used to distinguish one exceedingly nonpolar molecule from many slightly different nonpolar molecules with high confidence.

Figure 1
figure 1

Computed three-dimensional structures of the lowest-energy conformers of (top to bottom) isohirsut-1-ene, isohirsut-4-ene and tsukubadiene.

Methods

Quantum chemical calculations were performed with Gaussian09.21 All structural optimizations and frequency calculations (to verify that structures were minima) were performed with B3LYP/6-31+G(d,p) (gas phase).22, 23 1H and 13C chemical shift calculations were performed using mPW1PW91/6-311+G(2d,p)24 and the SMD implicit continuum solvation model25 for chloroform (the solvent used experimentally). Chemical shifts were scaled as described previously13, 26 using scaling factors (slope=−1.0936 for 1H and −1.0533 for 13C; intercept= 31.8018 for 1H and 186.5242 for 13C) available at the cheshirenmr.info website.27 For methyl groups, averages of the computed values of the three hydrogens were used to compare with the experimental data. DP4 statistical analysis28 was performed to assess the probability that our assignments are correct (given the possible alternatives examined, vide infra). For each diastereomer of each compound, a systematic conformational search was first performed using Spartan10.29 For those searches that resulted in only a few conformers, optimizations and NMR calculations were carried out for all of them. For those searches that resulted in many structures, we first calculated the electronic energy of each using B3LYP/6-31G(d) and then, only for the conformers that were within 15 kcal mol−1 of the lowest-energy conformer, we performed optimization with B3LYP/6-31+G(d,p) and NMR calculations (see Supplementary Information for details). To facilitate comparisons with the work of Cane, Ikeda and co-workers, structure numbers correspond to the numbers used in reference 11. Also, note that the chemical shifts for geminal dimethyl groups and diastereotopic CH2 protons are computed separately, but these were averaged to compare with reported values (see Supplementary Information for additional details).

scheme 1

Two sesquiterpenes and a diterpene from reference 11; structure numbers correspond to the numbers used in this reference.

scheme 2

Eight diastereomers of compound 8. Note that, although we do not address herein absolute configuration, the configuration of C9 is shown as identical for all diastereomers to facilitate comparisons.

scheme 3

Eight diastereomers of compound 9. Note that, although we do not address herein absolute configuration, the configuration of C9 is shown as identical for all diastereomers to facilitate comparisons.

scheme 4

Eight diastereomers of compound 11. Note that, although we do not address herein absolute configuration, the configuration of C9 is shown as identical for all diastereomers to facilitate comparisons.