The ability to store information is believed to have been crucial for the origin and evolution of life; however, little is known about the genetic polymers relevant to abiogenesis. Nitrogen heterocycles (N-heterocycles) are plausible components of such polymers as they may have been readily available on early Earth and are the means by which the extant genetic macromolecules RNA and DNA store information. Here, we report the reactivity of numerous N-heterocycles in highly complex mixtures, which were generated using a Miller-Urey spark discharge apparatus with either a reducing or neutral atmosphere, to investigate how N-heterocycles are modified under plausible prebiotic conditions. High throughput mass spectrometry was used to identify N-heterocycle adducts. Additionally, tandem mass spectrometry and nuclear magnetic resonance spectroscopy were used to elucidate reaction pathways for select reactions. Remarkably, we found that the majority of N-heterocycles, including the canonical nucleobases, gain short carbonyl side chains in our complex mixtures via a Strecker-like synthesis or Michael addition. These types of N-heterocycle adducts are subunits of the proposed RNA precursor, peptide nucleic acids (PNAs). The ease with which these carbonylated heterocycles form under both reducing and neutral atmospheres is suggestive that PNAs could be prebiotically feasible on early Earth.
The ability to store information was a critical step for the origin and evolution of life as we know it. Today life on Earth relies on DNA to store genetic information; however, the catalytic properties of RNA (among other evidence) has led to the widely-accepted hypothesis that early life relied solely on RNA as its primary genetic molecule1. Numerous studies have shown how the constituents of RNA (i.e. nucleobases, sugars, phosphate) may have formed in situ or were delivered to the early Earth2,3,4,5,6, and several studies have shown that ribonucleotides, the subunits of RNA, can be generated under prebiotic conditions7,8,9,10.
Alternatively, RNA was possibly preceded by a polymer that could spontaneously form from simpler precursors, aptly named a pre-RNA11,12. Proposed pre-RNAs have backbones different from that of RNA or utilize nitrogen heterocycles (N-heterocycles) other than the canonical nucleobases to store information. Of particular interest are those that are capable of spontaneously forming double helices with RNA via Watson-Crick base pairing as such supramolecular ordering provides a facile mechanism for the chemical evolution of pre-RNAs to RNA. Several such pre-RNAs have been studied including threose nucleic acid (TNA)13, glyoxylate-ribonucleic acid (gaRNA)14, glycol nucleic acid (GNA)15, and peptide nucleic acids (PNAs)16.
A further complication to elucidating the structures of plausible pre-RNA molecules stems from the diverse inventory of N-heterocycles from which they may have pooled (as abiotic reactions that generate the nucleobases also yield a variety of other N-heterocycles)2,17,18. In addition, a wide-range of N-heterocycles has been identified in carbonaceous meteorites, suggesting that delivery of extraterrestrial materials may have also served as a source for these molecules4,19. Unlike the canonical nucleobases, some of these alternative heterocycles (e.g. 2,4,6-triaminopyrimidine (TAP), melamine, and barbituric acid) can spontaneously form glycosides with ribose or other sugars and adopt Watson-Crick-like base pairs in aqueous solutions20,21. Given this, it has been proposed that the biological nucleobases are the result of evolutionary pressures and that the earliest genetic polymers incorporated alternative N-heterocycles22.
Considering the vast number of N-heterocycles and molecular backbones that could form a pre-RNA molecule, the number of plausible structures for genetic polymers relevant to the origin of life seem endless. Here, we investigated the chemistry of various N-heterocycles in complex mixtures (produced by spark discharge experiments) in order to simulate conditions approaching the chemical complexity expected for the prebiotic environment. We chose to focus on Miller-Urey mixtures as they generate a plethora of diverse organic compounds, including N-heterocycles, under conditions that may have been widespread on early Earth3,17,23. Moreover, many organics produced by spark discharges have been found in meteorites23,24 and are formed by various prebiotic reactions including HCN polymerization2, Fischer-Tropsch type synthesis18, and laboratory simulations of hydrothermal vent chemistry25. These observations imply that the canonical nucleobases and other N-heterocycles—whether they formed in situ or were delivered to primordial Earth—probably existed in complex mixtures like those generated by spark discharge experiments.
The reactivity of N-heterocycles in their chemical environment may not only have a profound effect on the heterocycle’s chemical properties (e.g. increasing solubility or facilitating the formation of more complex molecules), but also on whether they maintain the ability to base pair and store information. We show here that multiple prebiotically plausible pathways exist for the robust formation of carbonylated N-heterocycles in Miller-Urey mixtures and discuss how these structures could serve as precursors for the formation of PNAs on the early Earth.
Method Rationale and Limitations
The primary goals of this study were to (1) elucidate the adducts that N-heterocycles commonly form in mixtures simulating the chemical complexity expected on early Earth and (2) evaluate plausible pre-RNA molecules compatible with these more complex structures. To this end, we studied the reactivity of 53 N-heterocycles representing five classes (pyridines, pyrimidines, triazines, purines, and pteridines) with complex mixtures produced under both reducing and neutral atmospheres to elucidate chemical trends, which resulted in hundreds of samples. Given the high number of samples and the complexity of each reaction mixture, it was not feasible to pursue product purification and structure elucidation via liquid chromatography-mass spectrometry, which precluded us from estimating product yields and determining which conformational isomer formed in any particular case. Instead we chose to analyze samples using high-resolution mass spectrometry (HRMS) with a direct analysis in real-time (DART) ion source to enable high sample throughput. The high mass resolution (>30,000 at m/z 400) and mass accuracy (typically <5 ppm) of the instrument permitted the assignment of molecular formulas, which enabled the recognition and identification of N-heterocycle adducts. DART was also selected for ionization because it readily protonates nitrogen-containing compounds, is less sensitive to matrix effects, does not usually produce salt adducts, and we found Penning ionization to be minimal, enabling us to rapidly analyze complex mixtures with minimal sample preparation26.
Results and Discussion
Reactivity trends in Miller-Urey mixtures
Miller-Urey spark discharge mixtures generate a plethora of organic compounds including aldehydes, nitriles, hydroxy acids, and amino acids23,27,28,29; these organics hold the potential to generate sugar, alcohol, electrophilic carbonyl, or nucleophilic side chains on N-heterocycles. The resulting side chain would impact the subsequent reactivity of the N-heterocycles and hence influence the types of molecular polymers they may form. For example, sugar and alcohol side chains can be readily phosphorylated10 or glyoxylated14 forming monomers akin to RNA or gaRNA; alternatively, hydroxymethylated N-heterocycles can spontaneously oligomerize30. Meanwhile, the formation of carboxylic acid and aldehyde side chains could serve as the precursor for PNAs should their carbonyls be attacked by amino acids. Lastly, nucleophilic side chains could facilitate the formation of organometallic clusters (e.g. with iron) as good nucleophiles often make decent ligands. To investigate whether N-heterocycles generate adducts that would favor the formation of a specific type of genetic polymer, we characterized the reactivity of N-heterocycles (Supplementary Table S1) in Miller-Urey spark discharge mixtures by searching for adducts with sugar, alcohol, carbonyl (i.e. electrophilic), and nucleophilic side chains (Supplementary Table S2).
Figure 1(a–d) summarizes the strategy used to characterize select products observed in heterocycle-spark reaction mixtures using a uracil adduct (m/z 185.0556 = uracil-C3H5O2, Fig. 1a) that was formed under a reducing atmosphere (N2-CO2-CH4-H2) as an example. Tandem mass spectrometry (MS/MS) of the uracil adduct (Fig. 1b) shows a pattern consistent with a terminal carboxylic acid group (m/z 167.0447 = [M-H2O]+; m/z 139.0497 = [M-CH2O2]+). When uracil is incubated with acrylic acid in a separate reaction, we observed a product with identical mass (m/z = 185.0555) as the uracil-spark adduct. Comparison of the MS/MS spectra in Fig. 1(b,c) reveals that the fragmentation pattern of the uracil-spark and uracil-acrylic acid adducts match. The fragmentation pattern of a commercial reference standard of uracil-N1-propanoic acid (Fig. 1d) and 15N-NMR spectra (Fig. 1g,h) of uracil incubated with acrylic acid indicates that the reaction proceeded at the N1 position in the spark reaction mixture. Thus, the N-heterocycle adduct at m/z 185.0556 in the heterocycle-spark reaction mixture was identified as uracil-N1-propanoic acid.
Using these methods, we confirmed that specific organics in the spark discharge mixture react readily with a range of N-heterocycles (Supplementary Tables S3–S6; Supplementary Figs S3–S25); from this information we inferred that products having side chains with identical chemical compositions and thus the same Δm/z (i.e. the difference in mass between the N-heterocycle adduct and parent N-heterocycle) were generated via the same mechanisms.
N-heterocycles were incubated with spark discharge mixtures generated under a strongly reducing atmosphere as they were originally perceived to be more conducive for organic synthesis than neutral gases31. Although, Earth’s early atmosphere was likely near neutral (N2, CO2) with trace amounts of H2, CO, and H2S32, volcanic outgassing and bolide impacts could have generated large, but transient amounts of reduced gases33.
Figure 2 summarizes the reactivity of 53 N-heterocycles in spark discharge mixtures produced from a 1 bar atmosphere containing 0.4 N2, 0.1 CO2, 0.25 H2, and 0.25 CH4 (i.e. reducing spark mixture) over a 0.2 M phosphate buffer solution (adduct details listed in Supplementary Table S3). Markedly, with the exception of 4-pyridinecarboxylic acid , N-heterocycles with cyano or carboxylic acid groups did not produce detectable adducts. Conversely, the majority of N-heterocycles without these electron withdrawing groups (EWGs), including the canonical nucleobases, formed at least one adduct containing a carbonyl carbon. These carbonylated heterocycles were 1–3 carbons in length, sometimes methylated (at either the α or β-carbon on the side chain), and contained a terminal aldehyde, nitrile, amide, or carboxylic acid functional group.
Given that Earth’s early atmosphere was likely near neutral based on current models32, we wanted to see whether N-heterocycles would carbonylate from a mixture generated under a N2-CO2 atmosphere lacking any reduced gases. Previous work has shown that neutral gas mixtures (1:1 N2:CO2) over an aqueous buffer form amino acids in higher yields34 than previously thought28, albeit at levels lower than that of reducing atmospheres. It is predicted that neutral atmospheres are not as conducive for amino acid synthesis partially because they produce less HCN and NH334, which incidentally are likely precursors of the reactants that carbonylated N-heterocycles in our reducing spark mixture. Moreover, neutral atmospheres generate higher levels of nitrates and nitrites, the latter of which can go on to form nitrosamines from amino acids34 and N-heterocycles35.
Twenty-one N-heterocycles were incubated with a spark discharge mixture formed under a 1 bar atmosphere of 0.5 N2 and 0.5 CO2 gases (i.e. neutral spark mixture). Heterocycles were selected because of either their reactivity in the reducing spark mixture (Fig. 2 Reducing Atmosphere) or their relevance to previous pre-RNA studies20,21,36,37,38 (see Supplementary Table S1). In the majority of cases, the number of adducts formed were the same or higher compared to those using spark reaction mixtures from a reducing atmosphere. In addition, we identified at least one carbonylated adduct for almost every N-heterocycle tested (Fig. 2 Neutral Atmosphere), with many forming the same products as when they were incubated with the reducing spark mixture (see Supplementary Table S4 for a complete list of adducts). It should be noted that in addition to forming carbonylated adducts, several N-heterocycles (including the canonical purines) formed products with masses corresponding to various sugar-like side chains, including 4-, 5-, and 6-carbon sugars or sugar alcohols (see Supplementary Tables S4 and S6). However, these adducts were uncommon (i.e. each only formed by 2–5 N-heterocycles) and additional work is needed to verify the identity of the side chains.
Given that almost all the N-heterocycles with carboxylic acid groups were unreactive under a reducing atmosphere, we were intrigued to find that all three pyridine carboxylic acids tested were reactive [7, 8, 9], with [8 and 9] forming carbonylated heterocycles; 2-pyridinecarboxylic acid  formed a single adduct containing an alcohol side chain. One possible explanation for the reactivity of these compounds in the neutral but not reducing spark mixture may be that the former produces considerably fewer nucleophiles like HCN and ammonia39 that would otherwise sequester the carbonylating reactants23,40. This suggests that heterocycles with carboxylic acids can act as a nucleophile to form carbonylated heterocycles under certain favorable reaction conditions; indeed, previous studies have shown that such heterocycles undergo carbonylation in isolated reactions41,42.
Reactivity trends of N-heterocycles
We surmised that most of the N-heterocycle adducts were formed by the N-heterocycles attacking electrophiles; thus, those that were unreactive (and ionized by DART) can be considered poor nucleophiles. To some extent pKa can be used to predict whether heterocycles will be unreactive as those having an exceptionally low pKa are strong acids, but very poor nucleophiles. Being strong acids, these heterocycles hold on to their electrons tightly, hindering their interaction with electrophiles. This value is especially important for heterocycles whose conjugate base is neutral (e.g. pyridine as the conjugate base for pyridinium cations) and thus significantly less nucleophilic than conjugate bases that are negatively charged (e.g. deprotonated uracil). In accordance with this pattern the reported pKa values of the conjugate acids (i.e. pKaH) for 2-, 3-, and 4-cyanopyridines (all of which were unreactive in the reducing spark mixture) are very low (−0.26, 1.36, and 1.90, respectively). The pKaH of the ring nitrogen in the cyanopyridines reflects how the cyano group pulls electrons from the ring nitrogen via resonance effects, decreasing its nucleophilicity and deactivating the molecule towards electrophilic reactions. In addition to decreasing the electron density surrounding the ring nitrogen of pyridine, cyano groups also decrease the nucleophilicity of the ring N via steric hindrance as the bulky cyano group partially blocks the ring N from electrophilic attack.
Although carboxylic acids (R-COOH) are EWGs, once they deprotonate and form carboxylates (R-COO−) they are only slightly deactivating. In our reaction mixtures (pH 8), all of our carboxylic acid heterocycles exist as carboxylates. In consequence, the pKaH of the ring nitrogen is only slightly less than pyridine (~5 for pyridine carboxylic acids). Despite what their pKaH indicates, the pyridine carboxylic acids remain relatively weak nucleophiles; therefore, their low reactivity is probably due to steric hindrance from the bulky side chain near the ring nitrogen. This would also explain why 4-pyridinecarboxylic acid  is the only reactive pyridine carboxylic acid in the reducing spark mixture and why 2-pyridinecarboxylic acid  is the least reactive of the three in the neutral spark mixture (Supplementary Table S4). This trend is also consistent with 4- and 2-hydroxymethylpyridine [5, 6] being unreactive in the reducing spark mixture despite the ring nitrogen being a decent base (pKaH ~9); 3-hydroxymethylpyridine  is slightly reactive, forming two detectable adducts, only one of which is from electrophilic attack. In comparison, 4-aminopyridine , which has a similar pKaH (9.2) to 4-hydroxymethylpyridine  (pKaH 8.9) but is not sterically hindered, generated 6 detectable adducts while  was unreactive.
Robust Reactions of N-Heterocycles in Miller-Urey Mixtures
We investigated whether N-heterocycles gain sugar, alcohol, carbonyl, or nucleophilic side chains in prebiotically plausible complex mixtures. Our results show that in both reducing and neutral spark mixtures the majority of adducts contained carbonyl side chains (see Supplementary Table S6); in comparison, the formation of nucleophilic, sugar, and alcohol side chains (with the exception of acetaldehyde and cyanamide adducts in the neutral spark mixture) were exceptionally low (see Supplementary Table S6). Figure 2 summarizes the most common reactions of N-heterocycles observed in spark discharge mixtures (only the hydrolysis product of each adduct (i.e. acids) will be referred to hereafter). We assigned reactants based on their availability in spark mixtures (i.e. compounds that have been reported in the literature), demonstrated reactivity with N-heterocycles based on isolated reactions resulting in the same N-heterocycle adduct m/z (Table 1, Supplementary Table S5), and (for adducts with sufficient abundance) MS/MS spectra (Supplementary Figs S7–S25); in select cases we also compared the MS/MS of the spark adduct with that from an isolated reaction—in every case the fragmentation pattern matched. Notably, the only reactions that were major in both reducing and neutral spark mixtures were those that generated carboxylic acid side chains (Fig. 3).
In the aqueous buffer (0.2 M phosphate buffer, pH 8), about 10% of HCN is deprotonated (pKa 9.2); the resulting anion is an excellent nucleophile in reactions with N-heterocycles whose rings are sufficiently electron deficient. Accordingly, we found that the majority of cyanide adducts in reducing atmospheres were formed by pteridines and triazines that possessed a strongly electrophilic carbon susceptible to nucleophilic attack (e.g. , , and ). Thus, it was surprising to find that in the neutral spark mixture, pyrimidines and triazines containing two or more electron donating amine groups (e.g. melamine  and TAP ) formed cyano adducts, especially considering that Cleaves and coworkers34 measured significantly less HCN in spark discharge mixtures generated under an identical atmospheric ratio (1:1 N2:CO2). Given this, we propose that these cyano-adducts are formed via an intermediate that was present in the neutral but not reducing spark mixture. One such possibility is nitrosamines via nitrous acid generated from the spark discharge through a N2-CO2 atmosphere34. In water nitrous acid readily forms nitrosonium ions (Fig. 4a) that then react with nucleophilic amine groups forming nitrosamines (Fig. 4b). For aromatic compounds like the N-heterocycles, loss of water from the nitrosamine gives aryl diazonium cations (Fig. 4c)—excellent leaving groups that lower the activation energy for nucleophilic attack. Subsequent attack of the aryl diazonium cations by nucleophilic cyanamide would generate an adduct with identical mass to that expected from the parent heterocycle undergoing HCN nucleophilic substitution (Fig. 4d).
Intriguingly, N-heterocycles with an acetic acid group, our most consistently observed adduct, are the “nucleoside” subunit of the PNA shown to readily base pair with RNA16 (i.e. PNAs whose heterocycles are linked to a N-(2-aminoethyl)glycine (AEG) backbone by an acetamide bridge: Het-CH2-CO-AEG), hereafter referred to as aegPNA. These carbonylated heterocycles formed via a Strecker-like synthesis on the N-heterocycle with glycolonitrile (or alternatively, formaldehyde, followed by HCN). Notably, when we tested the reactivity of glycolamide with guanazole we did not observe the corresponding acetic amide nor acetic acid adduct (a detailed discussion of the reactivity of glycolonitrile with guanazole can be found in the Supplementary text, section 2.2 and Supplementary Fig. S5). These results coincide with previous observations that glycolonitrile—and by extension, its precursors, HCN and formaldehyde—are abundant products of spark discharges under reducing atmospheres39. Our results also show that N-heterocycles form the heterocycle-acetic acid molecules even in mixtures produced under neutral atmospheres where the concentrations of HCN are suspected to be relatively low31,34. While concentrations of HCN decrease under increasingly neutral atmospheres, the proportional amounts of formaldehyde and glycolonitrile may increase. Accordingly, it was found that glycolonitrile yields were highest under a N2-H2-CO2 atmosphere (compared to the same atmosphere but with CO or CH4 instead of CO2); notably the highest yields were found when the H2:CO2 ratio was 3 (compared to 0.5)39.
The majority of the reactions depicted in Fig. 3 are Michael additions of N-heterocycles with 3-carbon-long α,β-conjugated compounds with a terminal carbonyl group (derived from methylcyanoacetylene, cyanoacetylene, crotonitrile, methylpropiolaldehyde, propiolaldehyde, acrolein, and acrylonitrile). Previous work has repeatedly shown that these types of reactions are regioselective with carbonylation being thermodynamically favored at the N1 and N9 positions of pyrimidines and purines, respectively42,43,44,45—the sites where ribose attaches to the canonical nucleobases in RNA. In accordance with this, we found that the Michael acceptors propiolic acid and acrylic acid preferentially add to the N1 position of uracil (Fig. 1; Supplementary Fig. S6).
Reactions unique to either reducing or neutral atmospheres
We identified several reactions that were common in either the reducing or the neutral spark mixtures, but not in both. The addition of the Michael acceptors acrolein, propiolaldehyde, and methylpropiolaldehyde were frequently observed only in reactions with a reducing atmosphere, which concurs with the fact that all three are easily generated via methane46,47 (Table 1). However, it should be noted that regarding the methylpropiolaldehyde and acrolein adducts, this observation may be misleading as the majority of the N-heterocycles that formed these adducts in the reducing spark mixture were not studied in the neutral spark mixture.
On the other hand, a greater presence of acetaldehyde, acetic acid, and formic acid adducts under a N2-CO2 atmosphere are consistent with measurements of excess formaldehyde and proportionally less HCN in neutral spark mixtures39. Since HCN sequesters aldehydes, low HCN yields correspond to higher acetaldehyde concentrations48 (for a discussion of the lack of observed formaldehyde adducts see Supplementary text, section 2.1). The paucity of free HCN may have been due to the efficient formation of HCN products (such as glycolonitrile) by cyanohydrin reactions as deduced by our results here and as seen in prior work34; alternatively, hydrolysis of HCN to formic acid could have been exacerbated by nitric and nitrous acids present in the neutral spark mixture34. Presumably, relatively high formic acid-to-HCN (and in parallel acetic acid-to-acetonitrile) ratios accounts for the robust production of formyl and acetyl adducts in the neutral mixtures.
Discussion of Other Prebiotic Reactions Relevant to Our Results
Remarkably, the majority of the organics that carbonylated N-heterocycles were Michael acceptors that can also act as N-heterocycle precursors (e.g. urea + cyanoacetylene, propiolic acid, or acrylonitrile → cytosine/uracil)46,49,50,51. In addition, it has been shown that carbonylated nucleobases can be generated via purine and pyrimidine precursors that reacted with the carbonylating organics (e.g. hydantoic acid (from urea + glycolonitrile + 2 H2O) + cyanoacetaldehyde → cytosine/uracil-acetic acid)52. Overall, these previous studies, together with our results, suggest that there are multiple chemical pathways by which carbonylated heterocycles may have formed on early Earth. For an in-depth discussion of trends of nucleobase carbonylation and the possibility for a one-pot synthesis of carbonylated nucleobases in Miller-Urey-type reaction mixtures (based on results from previous publications and our observations of the spark mixtures generated for this study) see Supplementary text, sections 2.4 and 2.5; see Table S7 for yields of carbonylated nucleobases in isolated reactions obtained by previous studies.
Under both reducing and neutral atmospheres, the most common adducts were those containing carbonyl side chains 1- to 3-carbons long. The carbonyl carbon of these side chains, being electron deficient, is susceptible to nucleophilic attack by electron-rich molecules (e.g. amino acids)—the exception being the carbonyl resulting from nitrosation and cyanamide (see Supplementary text, section 2.3). Previous work has shown that amino acids readily react with N-heterocycles containing aldehyde side chains in aqueous solution, forming nucleobase-peptide molecules connected via an imine bond41. The formation of amide bonds may be facilitated by subjecting solutions containing N-heterocycle adducts, amino acids, and hydroxy acids to wet-dry cycles, as the latter compound promotes peptide bond formation53. Intriguingly, both amino acids and hydroxy acids are generated in relatively high yields from Miller-Urey reactions23,27. In fact, AEG, the backbone of aegPNA, has been identified in spark discharges of CH4, N2, NH3, and H2O atmospheres52, albeit in low yields and in the presence of numerous other amino acids23. Given this, a diverse set of PNA monomers containing AEG and various other amino acids is likely to be generated from carbonylated heterocycles within complex mixtures.
The resulting PNA monomers with aldehyde, acrylic, and carboxylic acid side chains could subsequently polymerize via Knoevenagel condensations41, free radical-induced polymerization54, or continuous wet-dry cycles (akin to amino acid polymerization53), respectively. As a diverse set of PNA monomers would be available during polymerization, the resulting macromolecule would likely be composed of a hetero-peptide backbone rather than a uniform backbone like aegPNA. Alternatively, carbonylated heterocycles can be attacked by an existing nucleophilic backbone such as poly-AEG55 or HCN polymer. Moreover, the carbonyl side chains, being hydrophilic, help solubilize N-heterocycles which in turn may facilitate the supramolecular assembly of carbonylated heterocycles and peptide-nucleobase monomers that base pair in solution56. The diversity of these polymerization mechanisms suggests that a facile transition of carbonylated heterocycles to complex polymers is prebiotically plausible under a broad range of conditions.
Our results suggest that formylation of N-heterocycles, including the aminopyrimidines studied here, is more favorable under a neutral atmosphere. Intriguingly, formylated 5,6-diaminopyrimidines can become N9 purine nucleosides in formose reaction mixtures subjected to drying7. As neutral atmospheres generate more free aldehydes than reducing atmospheres39, they may also be more conducive than reducing atmospheres for sugar synthesis and hence the conversion of formylated 5,6-diaminopyrmidines to purine glycosides. Therefore, formylation of N-heterocycles under neutral atmospheres could serve as a first-step towards PNA monomers as well as TNA and RNA purine nucleosides.
Here we report plausible prebiotic pathways for the robust formation of carbonylated N-heterocycles in highly complex organic mixtures, which are expected to be present on early Earth (Fig. 5). The majority of these carbonylated heterocycles are formed by a wide range of N-heterocycles via both a Strecker-like synthesis with glycolonitrile and Michael additions with 3-carbon acrylic and propiolic derivatives. Intriguingly, previous work has shown that Michael additions tend to primarily occur at the N1 and N9 position of pyrimidines and purines, respectively42,43,44,45 (Fig. 1, Supplementary Fig. S6), hence generating pyrimidine and purine carbonylated heterocycles capable of forming Watson-Crick base pairs. Notably, strongly acidic N-heterocycles (e.g. cyanopyridines) were unreactive and those with bulky side chains conjugated to the ring (e.g. carboxylates, hydroxymethyl, etc.) were mostly inert in spark discharge mixtures; thus, once formed, these compounds are not as likely as other N-heterocycles to be incorporated into a primitive genetic polymer. Conversely, the majority of N-heterocycles which react readily with sugars were also reactive with a range of electrophiles in both reducing and neutral spark mixtures. Thus, although these heterocycles could have readily formed glycosides via ribose, their versatile reactivity with carbonylating organics may have impeded their incorporation into ribonucleoside polymers.
While it has been argued that a reducing atmosphere would have been more favorable for the origins of life on Earth39, we have demonstrated here that neutral atmospheres are just as conducive at producing carbonylated heterocycles. These results are in accordance with the consensus that the early Earth had a near neutral atmosphere32 and demonstrate how carbonylated heterocycles could have been produced from N-heterocycles on the early Earth under a broad range of atmospheric conditions.
Although we did not determine yields, the reproducibility of products in replicates (see Supplementary text, section 2.6) and with a large number of N-heterocycles in both heated and unheated reactions (see Supplementary Tables S3–S5) confirms that the formation of such adducts is robust. Furthermore, as the very organics that form N-heterocycles may also carbonylate them, the co-formation of N-heterocycles and their corresponding PNA precursors is prebiotically feasible. Given that there are many chemical pathways by which PNA precursors may form, and that these reactions occur under a broad range of conditions, it is possible that the carbonylated heterocycles identified here would have been readily available for the formation of PNAs on early Earth. PNAs are particularly interesting in regards to the origins of life as those with a backbone composed of AEG have been shown to form double helices with themselves and RNA and, importantly, are capable of auto-catalytic and cross-catalytic template-based replication57,58—characteristics that are important for any viable genetic polymer. Additionally, as PNAs are composed of N-heterocycles and amino acids, they provide a plausible avenue for the co-evolution of proteins and nucleic acids.
Intriguingly, the organics generated from spark discharges—including amino and hydroxy acids in addition to N-heterocycles and the derivatives of organics that carbonylate them (e.g. β-amino acids and glycine which are amine derivatives of Michael acceptors and glycolonitrile, respectively)—have been identified in meteorites23,24 and experiments modeling organic synthesis at hydrothermal vents25, Titan’s atmosphere59, as well as the hydrolysis of tholins expected on Titan and Triton60,61. In fact, glycolonitrile, acrylonitrile, cyanoacetylene, acrolein, and propiolaldehyde have all been detected in the interstellar medium62,63,64,65; some of these organics have also been identified in interstellar ice analogs (i.e. acrolein, glycolonitrile)66,67, comets (i.e. cyanoacetylene, propiolaldehyde)68,69, and Titan’s atmosphere (i.e. acrylonitrile and cyanoacetylene)70,71. Furthermore, AEG and diamino acids, both of which could serve as the backbone for PNAs, have been detected in interstellar ice analogs72; diamino acids have also been identified in the Murchison meteorite73. These observations illustrate that the reactants that produce carbonylated heterocycles and potentially PNA monomers form in a wide variety of environments. In consequence, the chemical evolution of N-heterocycles to carbonylated adducts in Miller-Urey mixtures, as described here, may be a common phenomenon in the Solar System, potentially extending the formation of PNA precursors from early Earth environments to meteorites and other planetary bodies.
Materials and Methods
All glassware was thoroughly washed with ultrapure water, wrapped in aluminum foil, and heated at 480 °C for 8 hours to ensure removal of organic compounds. All solutions were stored in anoxic vials stoppered with gas-tight blue rubber butyl stoppers; solutions were prepared with 0.2 M N2-purged potassium phosphate buffer (pH 8.0) with sodium sulfide nonahydrate (0.006 g/L) to act as an oxygen scrub.
Spark discharge experiments
A 1000 mL round bottom flask attached to an Electrotechnics BD50E Tesla coil and two tungsten electrodes (tips 1 cm apart) was used to conduct spark discharge experiments (Supplementary Fig. S1). The flask was filled with 250 mL of a 0.2 M phosphate buffer solution (pH 8.0) and injected (in bars) with either a reducing (0.4 N2, 0.1 CO2, 0.25 H2, 0.25 CH4) or neutral (0.5 N2, 0.5 CO2) gas mixture. The flask was placed inside a water bath (~5 °C) and sparked at 40 kV for 72 hours; the water bath was employed to maintain a constant temperature in the flask. To evaluate the reactivity of N-heterocycles in complex mixtures, 1 mL from a heterocycle stock solution was reacted with 2 mL aliquots of fresh spark product (resulting in an initial heterocycle concentration of 1 mM). The reaction mixture was either incubated at 80 °C for 1–7 days or frozen immediately and kept at −80 °C; the latter was done to determine whether (1) adducts formed in the absence of heating and (2) if heating facilitated their formation. Reactions were stopped by flash freezing with dry ice and stored at −80 °C until their analysis.
Samples were analyzed using a Thermo Scientific LTQ Orbitrap XL hybrid mass spectrometer equipped with an IonSense DART ion source (IonSense, Saugus, MA, USA). The resolving power of the orbitrap was set to 30,000 (at m/z 400) and mass spectra were collected over a range of m/z 50–500. The mass spectrometer was run under positive ion mode and the DART ion source operated at 350 °C with He gas; guanine was analyzed at 450 °C. N-heterocycles that did not ionize were re-analyzed in negative mode using a DART ion source temperature of 450 °C. Due to the complexity of the reaction mixtures, a targeted approach was used to identify adducts (see Supplementary Table S2). Heterocycle adducts were identified using molecular formulae as determined by accurate mass measurements (typically <5 ppm mass error) and comparison to controls that were processed in parallel. Some adducts were confirmed by incubating N-heterocycles (1 mM) with the predicted reactant (1 mM) and matching product ion spectra (MS/MS) along with comparison to reference standards when available. NMR spectra were obtained using a Bruker Avance III HD 500 spectrometer at 50.7 MHz and 298 K. Additional experimental details are described in the SI text (see Supplementary text, section 1.0).
Individual mass spectra will be deposited at the public Penn State data commons (http://www.datacommons.psu.edu) upon publication. All other data generated or analyzed during this study are included in this published article (and its Supplementary Information files).
Cech, T. R. The chemistry of self-splicing RNA and RNA enzymes. Science 236, 1532–1539 (1987).
Ferris, J. P., Joshi, P. C., Edelson, E. H. & Lawless, J. G. HCN: A plausible source of purines, pyrimidines and amino acids on the primitive Earth. J. Mol. Evol. 11, 293–311 (1978).
Ferus, M. et al. Formation of nucleobases in a Miller–Urey reducing atmosphere. Proc. Natl. Acad. Sci. 114, 4306–4311 (2017).
Callahan, M. P. et al. Carbonaceous meteorites contain a wide range of extraterrestrial nucleobases. Proc. Natl. Acad. Sci. 108, 13995–13998 (2011).
Butlerow, A. Formation of a sugar-like substance by synthesis (translated from German). Justus Liebigs Annalen der Chemie. 120, 295–298 (1861).
Pasek, M. A. Schreibersite on the early Earth: scenarios for prebiotic phosphorylation. Geosci. Front. 8, 329–335 (2017).
Becker, S. et al. A high-yielding, strictly regioselective prebiotic purine nucleoside formation pathway. Science 352, 833–836 (2016).
Kim, H. J. & Benner, S. A. Prebiotic stereoselective synthesis of purine and noncanonical pyrimidine nucleotide from nucleobases and phosphorylated carbohydrates. Proc. Natl. Acad. Sci. 14, 11315–11320 (2017).
Powner, M. W., Gerland, B. & Sutherland, J. D. Synthesis of activated pyrimidine ribonucleotides in prebiotically plausible conditions. Nature 459, 239–242 (2009).
Gull, M. et al. Nucleoside phosphorylation by the mineral schreibersite. Sci. Rep. 5, 17198, https://doi.org/10.1038/srep17198 (2015).
Shapiro, R. Prebiotic ribose synthesis: A critical analysis. Orig. Life Evol. Biosph. 18, 71–85 (1988).
Joyce, G. F., Schwartz, A. W., Miller, S. L. & Orgel, L. E. The case for an ancestral genetic system involving simple analogues of the nucleotides. Proc. Natl. Acad. Sci. 84, 4398–4402 (1987).
Schöning, K. U. et al. Chemical etiology of nucleic acid structure: The α-threofuranosyl-(3′→2′) oligonucleotide system. Science 290, 1347–1351 (2000).
Bean, H. D., Anet, F. A. L., Gould, I. R. & Hud, N. V. Glyoxylate as a backbone linkage for a prebiotic ancestor of RNA. Orig. Life Evol. Biosph. 36, 39–63 (2006).
Zhang, L., Peritz, A. & Meggers, E. A simple glycol nucleic acid. J. Am. Chem. Soc. 127, 4174–4175 (2005).
Nielsen, P. E., Egholm, M., Berg, R. H. & Buchardt, O. Sequence-selective recognition of DNA by strand displacement with a thymine-substituted polyamide. Science 254, 1497–1500 (1991).
Menor-Salván, C., Ruiz-Bermejo, D. M., Guzmán, M. I., Osuna-Esteban, S. & Veintemillas-Verdaguer, S. Synthesis of pyrimidines and triazines in ice: implications for the prebiotic chemistry of nucleobases. Chem. Eur. J. 15, 4411–4418 (2009).
Hayatsu, R., Studier, M. H., Oda, A., Fuse, K. & Anders, E. Origin of organic matter in early solar system-II. Nitrogen compounds. Geochim. Cosmochim. Acta 32, 175–190 (1968).
Smith, K. E., Callahan, M. P., Gerakines, P. A., Dworkin, J. P. & House, C. H. Investigation of pyridine carboxylic acids in CM2 carbonaceous chondrites: potential precursor molecules for ancient coenzymes. Geochim. Cosmochim. Acta 136, 1–12 (2014).
Cafferty, B. J., Fialho, D. M., Khanam, J., Krishnamurthy, R. & Hud, N. V. Spontaneous formation and base pairing of plausible prebiotic nucleotides in water. Nat. Commun. 7, 11328, https://doi.org/10.1038/ncomms11328 (2016).
Fialho, D. et al. Glycosylation of a model proto-RNA nucleobase with non-ribose sugars: implications for the prebiotic synthesis of nucleosides. Org. Biomol. Chem. 16, 1263–1271 (2017).
Engelhart, A. E. & Hud, N. V. Primitive genetic polymers. Cold Spring Harb. Perspect. Biol. 2, a002196, https://doi.org/10.1101/cshperspect.a002196 (2010).
Miller, S. L. The atmosphere of the primitive earth and the prebiotic synthesis of amino acids. Orig. Life 5, 139–151 (1974).
Wolman, Y., Haverland, W. J. & Miller, S. L. Nonprotein amino acids from spark discharges and their comparison with the Murchison meteorite amino acids. Proc. Natl. Acad. Sci. 69, 809–811 (1972).
McCollom, T. M. Miller-Urey and beyond: what have we learned about prebiotic organic synthesis reactions in the past 60 years? Annu. Rev. Earth Planet. Sci. 41, 207–229 (2013).
Li, X., Wang, X., Li, L., Bai, Y. & Liu, H. Direct analysis in real time mass spectrometry: A powerful tool for fast analysis. Mass Spectrom. Lett. 6, 1–6 (2015).
Parker, E. T., Cleaves, H. J., Bada, J. L. & Fernández, F. M. Quantitation of α-hydroxy acids in complex prebiotic mixtures via liquid chromatography/tandem mass spectrometry. Rapid Commun. Mass Spectrom. 30, 2043–2051 (2016).
Schlesinger, G. & Miller, S. L. Prebiotic synthesis in atmospheres containing CH4, CO, and CO2. I. Amino Acids. J. Mol. Evol. 19, 376–382 (1983).
Wollrab, E. et al. Chemical analysis of a “Miller-Type” complex prebiotic broth part I: chemical diversity, oxygen and nitrogen based polymers. Orig. Life Evol. Biosph. 46, 149–169 (2016).
Smith, K. E., House, C. H., Dworkin, J. P. & Callahan, M. P. Spontaneous oligomerization of nucleotide alternatives in aqueous solutions. Orig. Life Evol. Biosph. 47, 3–11 (2017).
Miller, S. L. & Schlesinger, G. The atmosphere of the primitive earth and the prebiotic synthesis of organic compounds. Adv. Sp. Res. 3, 47–53 (1983).
Kasting, J. F. Earth’s Early Atmosphere. Science 259, 920–926 (1993).
Kasting, J. F. Bolide impacts and the oxidation state of carbon in the Earth’s early atmosphere. Orig. Life Evol. Biosph. 20, 199–231 (1990).
Cleaves, H. J., Chalmers, J. H., Lazcano, A., Miller, S. L. & Bada, J. L. A reassessment of prebiotic organic synthesis in neutral planetary atmospheres. Orig. Life Evol. Biosph. 38, 105–115 (2008).
Shapiro, R. & Pohl, S. H. The reaction of ribonucleosides with nitrous acid. side products and kinetics. Biochemistry 7, 448–455 (1968).
Chen, M. C. et al. Spontaneous prebiotic formation of a β-ribofuranoside that self-assembles with a complementary heterocycle. J. Am. Chem. Soc. 136, 5640–5646 (2014).
Kolb, V. M., Dworkin, J. P. & Miller, S. L. Alternative bases in the RNA world: The prebiotic synthesis of urazole and its ribosides. J. Mol. Evol. 38, 549–557 (1994).
Bean, H. D. et al. Formation of a β-pyrimidine nucleoside by a free pyrimidine base and ribose in a plausible prebiotic reaction. J. Am. Chem. Soc. 129, 9556–9557 (2007).
Schlesinger, G. & Miller, S. L. Prebiotic synthesis in atmospheres containing CH4, CO, and CO2. II. Hydrogen cyanide, formaldehyde, and ammonia. J. Mol. Evol. 19, 383–390 (1983).
Sanchez, R. A., Ferris, J. P. & Orgel, L. E. Cyanoacetylene in prebiotic synthesis. Science 154, 784–785 (1966).
Nelsestuen, G. L. Origin of life: Consideration of alternatives to proteins and nucleic acids. J. Mol. Evol. 15, 59–72 (1980).
Cleaves, H. J. The reactions of nitrogen heterocycles with acrolein: scope and prebiotic significance. Astrobiology 2, 403–415 (2002).
Crippa, S., Di Gennaro, P., Lucini, R., Orlandi, M. & Rindone, B. Characterization of adducts of nucleic bases and acrylic monomers. Soc. Chim. Ital. 123, 197–203 (1993).
Boncel, S., Gondela, A. & Walczak, K. Michael-type addition as a convenient method for regioselective N-alkylation of ambident uracils. Synthesis 10, 1573–1589 (2010).
Lira, E. P. & Huffman, C. W. Some Michael-type reactions with adenine. J. Org. Chem. 31, 2188–2191 (1966).
Dowler, M. J., Fuller, W. D., Orgel, L. E. & Sanchez, R. A. Prebiotic synthesis of propiolaldehyde and nicotinamide. Science 169, 1320–1321 (1970).
Van Trump, J. E. & Miller, S. L. Prebiotic synthesis of methionine. Science 178, 859–860 (1972).
Schlesinger, G. & Miller, S. L. Equilibrium and kinetics of glyconitrile formation in aqueous solution. J. Am. Chem. Soc. 95, 3729–3735 (1973).
Ferris, J. P., Sanchez, R. A. & Orgel, L. E. Studies in prebiotic synthesis. III. Synthesis of pyrimidines from cyanoacetylene and cyanate. J. Mol. Biol. 33, 693–704 (1968).
Harada, K. & Suzuki, S. The new synthesis of uracil and 1,3-dimethyluracil. Tetrahedron Lett. 27, 2321–2322 (1976).
Oró, J. Non-enzymatic formation of purines and pyrimidines. Fed. Proc. 22, 681 (1963).
Nelson, K. E., Levy, M. & Miller, S. L. Peptide nucleic acids rather than RNA may have been the first genetic molecule. Proc. Natl. Acad. Sci. 97, 3868–3871 (2000).
Forsythe, J. G. et al. Ester-mediated amide bond formation driven by wet-dry cycles: a possible path to polypeptides on the prebiotic earth. Angew. Chem. Int. Ed. Engl. 54, 9871–9875 (2015).
Thomas, W. M. Mechanism of acrylonitrile polymerization in Fortschritte Der Hochpolymeren-Forschung 401–441 (Springer, Berlin, Heidelberg, 1961).
Heemstra, J. M. & Liu, D. R. Templated synthesis of peptide nucleic acids via sequence-selective base-filling reactions. J. Am. Chem. Soc. 131, 11347–11349 (2009).
Cafferty, B. J. et al. Efficient self-assembly in water of long noncovalent polymers by nucleobase analogues. J. Am. Chem. Soc. 135, 2447–2450 (2013).
Plöger, T. A. & von Kiedrowski, G. A self-replicating peptide nucleic acid. Org. Biomol. Chem. 12, 6908–6914 (2014).
Singhal, A. & Nielsen, P. E. Cross-catalytic peptide nucleic acid (PNA) replication based on templated ligation. Org. Biomol. Chem. 12, 6901–6907 (2014).
Hörst, S. M. et al. Formation of amino acids and nucleotide bases in a Titan atmosphere simulation experiment. Astrobiology 12, 809–817 (2012).
Cleaves, H. J. et al. Amino acids generated from hydrated Titan tholins: comparison with Miller-Urey electric discharge products. Icarus 237, 182–189 (2014).
McDonald, G. D., Thompson, W. R., Heinrich, M., Khare, B. N. & Sagan, C. Chemical investigation of Titan and Triton tholins. Icarus 108, 137–145 (1994).
Zeng, S. et al. First detection of the pre-biotic molecule glycolonitrile (HOCH2CN) in the interstellar medium. Mon. Not. R. Astron. Soc. Lett. 484, L43–L48 (2019).
Gardner, F. F. & Winnewisser, G. The detection of interstellar vinyl cyanide /acrylonitrile/. Astrophys. J. 195, L127–L130 (2002).
Alakoz, A. V., Kalenskii, S. V., Voronkov, M. A. & Slysh, V. I. Observations of cyanoacetylene sources. Astron. Reports 47, 75–79 (2003).
Hollis, J. M., Jewell, P. R., Lovas, F. J., Remijan, A. & Møllendal, H. Green Bank Telescope detection of new interstellar aldehydes: propenal and propanal. Astrophys. J. Lett. 610, L21–L24 (2004).
de Marcellus, P. et al. Aldehydes and sugars from evolved precometary ice analogs: importance of ices in astrochemical and prebiotic evolution. Proc Natl Acad Sci 112, 965–970 (2015).
Danger, G., Duvernay, F., Theulé, P., Borget, F. & Chiavassa, T. Hydroxyacetonitrile (HOCH2CN) formation in astrophysical conditions. Competition with the aminomethanol, a glycine precursor. Astrophys. J. 756, 11–19 (2012).
Lis, D. C. et al. New molecular species in comet C/1995 O1 (Hale-Bopp) observed with the Caltech Submillimeter Observatory. Earth Moon Planets 78, 13–20 (1997).
Krasnopolsky, V. A. C3 and CN parents in comet P/Halley. Astron Astrophys 245, 310–315 (1991).
Kunde, V. G. et al. C4H2, HC3N and C2N2 in Titan’s atmosphere. Nature 292, 686–688 (1981).
Palmer, M. Y. et al. ALMA detection and astrobiological potential of vinyl cyanide on Titan. Sci. Adv. 3, e1700022, https://doi.org/10.1126/sciadv.1700022 (2017).
Meinert, C., Filippi, J. J. & de Marcellus, P. Le Sergeant D’Hendecourt, L. & Meierhenrich, U. J. N-(2-aminoethyl)glycine and amino acids from interstellar ice analogues. Chempluschem 77, 186–191 (2012).
Meierhenrich, U. J., Muñoz Caro, G. M., Bredehöft, J. H., Jessberger, E. K. & Thiemann, W. H.-P. Identification of diamino acids in the Murchison meteorite. Proc. Natl. Acad. Sci. 101, 9182–9186 (2004).
Robertson, M. P. & Miller, S. L. Prebiotic synthesis of 5-substituted uracils: a bridge between the RNA world and the DNA- protein world. Science 268, 702–705 (1995).
Smith, K. E. Exogenous and endogenous sources of organic compounds on the early earth: investigating carbonaceous meteorites and plausibly prebiotic complex mixtures by liquid chromatography-mass spectrometry. Ph.D. Dissertation, The Pennsylvania State University, University Park, PA (2014).
Miller, S. L. The mechanism of synthesis of amino acids by electric discharges. Biochim. Biophys. Acta 23, 480–489 (1957).
Johnson, F., Pillai, K. M. R., Grollman, A. P., Tseng, L. & Takeshita, M. Synthesis and biological activity of a new class of cytotoxic agents: N-(3-oxoprop-1-enyl)-substituted pyrimidines and purines. J. Med. Chem. 27, 954–958 (1984).
Furukawa, Y., Miyashita, O. & Honjo, M. A new reaction of nucleotides with cyanoacetylene. Chem. Pharm. Bull. 22, 2552–2556 (1974).
Balucani, N. Elementary reactions and their role in gas-phase prebiotic chemistry. Int. J. Mol. Sci. 10, 2304–2335 (2009).
Petrie, S. Can gas-phase radical/neutral reactions account for observed abundances of propynal within dense interstellar clouds? Astrophys. J. 454, L165–L168 (1995).
Skell, P. S. & Engel, R. R. Reactions of carbon vapor. III. Reactions of metastable atomic carbon (1S) with saturated hydrocarbons. J. Am. Chem. Soc. 88, 4883–4890 (1966).
Hennion, G. F. & Nelson, K. W. The kinetics of the hydrolysis of acetylenic chlorides and their reactions with primary and secondary aliphatic amines. J. Am. Chem. Soc. 79, 2142–2145 (1957).
Mola, L. et al. Nucleophile-catalyzed additions to activated triple bonds. Protection of lactams, imides, and nucleosides with MocVinyl and related groups. J. Org. Chem. 78, 5832–5842 (2013).
Um, I. H., Lee, E. J. & Min, J. S. Remarkable catalytic effect of H+ in Michael-type additions of anilines to 3-butyn-2-one. Tetrahedron 57, 9585–9589 (2001).
Prasad, H. S., Srinivasa, G. R. & Channe Gowda, D. Convenient, cost-effective, and mild method for the N-acetylation of anilines and secondary amines. Synth. Commun. 35, 1189–1195 (2005).
Rahman, M., Kundu, D., Hajra, A. & Majee, A. Formylation without catalyst and solvent at 80 °C. Tetrahedron Lett. 51, 2896–2899 (2010).
We thank Dr. Carlos Pacheco (Penn State) for assistance with NMR, Zhidan Zhang (Penn State) for technical support, as well as Dr. Eric Parker and Dr. Jason Dworkin (NASA Goddard Space Flight Center) for assistance with and providing access to the orbitrap mass spectrometer. This work was funded by the NASA Exobiology program (grant nos 13-EXO13-0063 and NNX16AT71G) and the NASA Astrobiology Institute via the Goddard Center for Astrobiology and the Penn State Astrobiology Research Center (cooperative agreement #NNA09DA76A). L.E.R. acknowledges support from the Alfred P. Sloan Foundation and the Ford Foundation via a Predoctoral Fellowship and. M.R.R. acknowledges support from a Boise State University HERC fellowship.
The authors declare no competing interests.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.