Here we report the first recovery, sequencing, and identification of fossil biomineral proteins from a Pleistocene fossil invertebrate, the stony coral Orbicella annularis. This fossil retains total hydrolysable amino acids of a roughly similar composition to extracts from modern O. annularis skeletons, with the amino acid data rich in Asx (Asp + Asn) and Glx (Glu + Gln) typical of invertebrate skeletal proteins. It also retains several proteins, including a highly acidic protein, also known from modern coral skeletal proteomes that we sequenced by LC–MS/MS over multiple trials in the best-preserved fossil coral specimen. A combination of degradation or amino acid racemization inhibition of trypsin digestion appears to limit greater recovery. Nevertheless, our workflow determines optimal samples for effective sequencing of fossil coral proteins, allowing comparison of modern and fossil invertebrate protein sequences, and will likely lead to further improvements of the methods. Sequencing of endogenous organic molecules in fossil invertebrate biominerals provides an ancient record of composition, potentially clarifying evolutionary changes and biotic responses to paleoenvironments.
Endogenous organic molecules in fossil biominerals have the potential to provide records of past molecular composition with implications for evolutionary and environmental reconstruction. Fossil DNA has frequently been the target biomolecule in fossil samples and is useful for phylogenetic reconstructions of past organisms’ relationships to similar extinct and extant taxa and their responses to environmental change (review by1 e.g.,2,3,4). However, DNA’s preservation for successful sequencing is limited to the past several hundred thousand years for vertebrate taxa5,6 and less than 10,000 years for ancient invertebrate DNA7.
In contrast, proteins can persist in the fossil record for hundreds of thousands to tens of millions of years, and they can inform identification and significant phylogenetic reconstructions of extinct organisms, including for hadrosaurs (e.g.,8,9,10). Preservation of these biomolecules in vertebrate skeletons may in part be due to the embedding of cells in the biomineral. For instance, vertebrate bone, which contains osteoblasts, has a relatively high organic content (~ 20–30% by weight11).
Unfortunately, cells are not generally embedded in most invertebrate biominerals12; therefore, organic matter preservation for these taxa is minimal. However, invertebrate skeletons and shells retain specialized extracellular proteins associated with the biomineralization process12. Thus, these invertebrate biomineralization proteins provide a potential sequencing target. For corals, such proteins become embedded in individual skeleton crystals13, protecting them from degradation for long periods of time, and nitrogen likely derived from biomolecules in invertebrate biominerals has been analyzed for isotopic composition and presence of peptide bonds in invertebrate samples tens to hundreds of millions of years old (e.g.,14,15,16,17). For instance, δ15 N analyses of coral skeletons from the Triassic reveal that amino acids are preserved in fossil corals for hundreds of millions of years15, nearly as long as there have been reef-building scleractinian corals18. Yet, invertebrate biominerals contain much lower starting amounts of organic matter than their vertebrate counterparts, ~ 1% or less19,20,21,22, potentially leaving little sequenceable protein material intact. Until the present work, the oldest sequenced invertebrate “skeletome” data derive from ancient mollusk shells dating to several thousand years23,24, although there have been suggestions that somewhat intact proteins older than the Holocene do indeed remain in fossil invertebrate remains. For instance, glycoproteins extracted from an 80 myo Trigoniida bivalve mollusk shell were characterized as having ~ 5% sequence proportion of DYDY, which is nearly half such content found in a related extant molluscan taxon19, although the identities of the proteins were not determined.
The mechanisms of coral biomineralization have been intensively studied for over 150 years (review by25). Advances in the past decade include sequencing the skeletal proteomes of several modern taxa26,27,28,29, characterizing the carbonate chemistry conditions of the calcifying space that likely impact calcification in response to ocean conditions30,31,32, and developing coral cell cultures that precipitate aragonite at comparable rates to intact corals26,33,34. There has also been a general proliferation of sequenced coral genomes and transcriptomes (e.g.,35,36,37). Roles of several biomineralization proteins have been revealed including the ability of the highly acidic proteins called coral acid rich proteins (CARPs) or skeletal aspartic acid-rich proteins (SAARPs) to precipitate aragonite from seawater38, and the high enzyme activity of the coral skeletal carbonic anhydrase STPCA239,40. However, the function of most of the approximately 100 known coral skeletal proteins remain to be established. Some protein roles may be suggested by their persistent interactions with the mineral once formed. For instance, proteins involved in nucleation of aragonite or in adhering amorphous calcium carbonate nanoparticles together toward their recrystallization to aragonite13,41, may be more tightly bound within the mineral and may resist degradation. In this context, fossil biomolecular data may help clarify roles of proteins currently of unknown function. Thus, important information may be forthcoming in addition to the utility of these protein sequences in organismal phylogenetic reconstruction.
To examine the persistence of coral skeletal proteins older than the Holocene epoch, we analyzed in-depth one modern and several Pleistocene Stage 5E Caribbean corals. As aragonite can recrystallize into secondary aragonite or calcite, with a potential loss or degradation of proteins in the process, we determined the samples’ mineral integrity by x-ray diffraction and inductively coupled plasma mass spectrometry analysis of element/calcium abundance. We then used racemization analysis of free and hydrolyzed amino acids and protein visualization to establish that any sequenced proteins are likely endogenous rather than modern contaminants. Finally, we sequenced extracted proteins using liquid chromatography with tandem mass spectrometry. This work yielded the oldest known invertebrate protein sequences and suggests that highly acidic proteins resist degradation through intimate interactions with the mineral phase and may be useful targets for further analysis within the invertebrate fossil record.
Skeleton integrity and organic matter preservation
Five modern and three fossil coral specimens were interrogated for the quality of their mineral preservation (SI Table 1). Fossil corals that were collected in 1975 from exposed Key Largo Formation deposits, with youngest ages of 125 to 138 kiloanna (ka)42,43, and later donated to the Natural History Museum of Los Angeles County (NHMLA) were loaned to the authors. X-ray diffraction of these fossil specimens showed that Orbicella anularis-2 (hereafter: Mann2) has recrystallized to Mg-calcite and calcite, Montastraea cavernosa-1 (hereafter: Mcav1) is 70–85% aragonite and 15–30% calcite, while O. annularis-4 (hereafter Mann4) is 93–100% aragonite and 0–7% calcite (Fig. 1a–l). A modern O. annularis specimen used for in-family comparison is 100% aragonite (Fig. 1m,n). Of the fossil corals, only Mann4 exhibited Mg/Ca, Sr/Ca, and B/Ca ratios within the range of modern and fossil primary aragonite (Fig. 1o–t). Together, our data suggest that the fossil specimen Mann4 remains mostly primary aragonite.
SDS-PAGE indicated that proteins were among the biomolecules extracted by acid hydrolysis of cleaned fossil skeleton powder for all fossil specimens (Fig. 2). However, these proteins were degraded as evidenced by a bias toward small peptides (i.e., in the low molecular weight area of the gel) in all specimens. Further, fossil corals Mcav1 (70–85% aragonite), Mann2 (fully recrystallized to calcite), and Mann4 (> 90% primary aragonite) exhibited total hydrolysable amino acid (THAA) D/L Asx (aspartic acid plus asparagine) values of 0.634, 0.611, and 0.380, respectively (Table 1). For comparison, THAA D/L Asx of a modern O. annularis skeleton was 0.212.
Fossil and modern coral skeletal protein sequencing
No coral proteins were sequenced from cleaned skeleton powder of Mann2 (fully recrystallized to calcite) or Mcav1 (75–80% calcite). In contrast, Mann4 (> 90% primary aragonite) yielded six coral proteins within our stringent criteria containing peptides that were sequenced by LC–MS/MS after trypsin or trypsin-then-GluC digestion; all proteins had either more than one peptide detected at least once, or one peptide detected multiple times (Table 2 and SI Table 2). Five proteins were detected in the acid-insoluble fraction and one was detected in the soluble fraction, with no overlap between the two fractions. This is contrasted by the 61 proteins detected across all solubility and digestion fractions of proteins extracted from the newest growth of a modern O. annularis (SI Table 3). Proteins sequenced from fossil Mann4 skeleton include one of the highly acidic skeletal proteins known as acidic skeletal organic matrix protein (acidic SOMP)27 or SAARP328 in Acropora spp. and P27 in Stylophora pistillata26, and also detected here in our modern O. annularis (SI Table 3). Although the Blast2GO annotation calls this protein acidic SOMP-like, we use SAARP3 here to be consistent with the most recently published terminology28 and because the gene is clearly a sub-group within the CARPs4/5 or SAARPs1-3 gene family (SI Fig. 2). Other fossil proteins sequenced include a coadhesin which was also detected in our modern O. annularis, a lanC-like protein, a polyamine-modulated factor-binding protein, and two uncharacterized proteins.
Sequencing of six fossil coral proteins (Table 2), three of which are also found in modern coral skeleton (26,27, present study), provides further confirmation that coral skeletal proteins are specific components of the organic matrix embedded within individual aragonite crystals rather than simply cellular contamination13,44, as peripheral proteins in the fossil samples would be accessible to degradation processes over the past ~ 100 ka. In the case of SAARP3, strong interactions likely persist between the acidic domains of the protein and calcium atoms in the biomineral13. This protein is a member of the CARPs4/5 or SAARPs1-3 family (SI Fig. 2), one member of which has been shown to lead to the precipitation of calcium carbonate from unamended seawater38. Further, blasting indicates that the peptide detected in this protein does not have a BLAST hit in Homo sapiens, so it is not a contaminant of the extraction and sequencing. Having shown that the proteins sequenced from the fossil specimen are not likely contaminants (SI Tables 4 and 5), we can focus on the remaining fossil proteins. Like SAARP3, coadhesin has been sequenced previously from modern coral skeleton26,27,28. Coadhesin may persist in a sequenceable form in fossil corals due to its relatively high abundance, as suggested in modern skeleton (SI Table 3). Coadhesin, a transmembrane protein, contains multiple extracellular thrombospondin type-1 repeats, possibly allowing this protein to serve a role in adhesion of calicoblastic cells to organic matrix on and in the skeleton27. Further, thrombospondins may form functional triple helices similar to collagen and peroxidasin45 two proteins known from coral skeleton26,27. They have been implicated in structuring the other framework proteins of the ECM in mammalian osteoblasts46 and inhibition of osteoblast differentiation47. This structural set-up of the coral ECM is engulfed by the growing mineral where it is preserved for, potentially, 100s ka. Two other proteins, a polyamine-modulated factor-binding protein that may play an adhesion and scaffolding role in the biomineralization process, as suggested by the protein’s adhesive properties in sperm48, and a lanC-like protein which may associate with the cell membrane49, were also detected in the fossil O. annularis skeleton but not in the modern one. One uncharacterized protein is similar to a protein previously sequenced from S. pistillata (P1626) while the other is unique to O. annularis skeleton among all coral skeletal proteomes, although orthologs are present in genomes and transcriptomes across many scleractinian taxa (reefgenomics.org, NCBI). In general, annotation of several of the detected fossil proteins suggests that they were likely involved in adhering coral cells to the growing skeleton and in structuring the physical environment of the calcifying space, bringing them into direct contact with the mineral.
While 61 proteins were sequenced from the multiple fractions of modern O. annularis skeleton, only three of these, and six proteins total across all digestion fractions, were detected in fossil skeleton. This is likely due to two issues. As analysis by SDS-PAGE shows, fossil proteins had likely degraded into smaller peptides (Fig. 2). Further, racemization of Arg and Lys would minimize the efficacy of trypsin50,51,52, although the stochastic nature of racemization means that, even after 100s ka, at least half of the enzyme target amino acids should be in a cleavable form. We attempted to overcome diminished effectiveness of tryptic digestion on fossil samples by further digesting the post-trypsin peptides with GluC, which selectively cleaves after Asp or Glu. Increased protein sequencing has been observed when using multiple enzymes on modern specimens53,54, and we detected two additional proteins above our cutoff settings in post-GluC digestions of fossil O. annularis acid-insoluble skeletal protein. Coral skeleton is notoriously low in organic matter abundance with estimates of skeletal organic matter content as low as 0.01%22 and as high as several percent55, so that the combined effects of low starting material, protein degradation, and racemization of enzymatic digestion targets places severe limits on the sequenceability of fossil coral proteins that would be encountered. Skeletons of invertebrates with comparable amounts of starting organic matter content such as mollusk shells19 and echinoderm tests and spines20,21 at 0.3–4% and ~ 0.1% by weight, respectively, may face this issue as well.
In work of this nature, care must be taken to ensure that reported proteins are not modern contaminants56; well-preserved specimens must be chosen and best practices in handling samples must be employed57. Our crystallographic and trace element analyses allowed us to select a fossil coral specimen that retained its primary aragonite mineralogy (Fig. 1). Further, in addition to extensively cleaning all skeleton powders, we handled fossil and modern powders separately in age specific glove bags and months apart, with fossil O. annularis skeletons handled first. We also examined biochemical signatures of age and persistence of intact proteins.
Amino acid racemization has been used for the past 50 years to study fossil samples ranging in age from 500 to 300,000 years old58,59,60. In the present study, Mann4 exhibited THAA D/L Asx lower than similarly-aged Atlantic and Caribbean Pleistocene-aged corals61,62. This may be due to the fact that most of the Mann4 THAA Asx was made up of polymerized amino acids (33.7% FAA) whereas FAA accounted for most of the coral THAA Asx in61 (~ 60% FAA based on their Supplementary Figure EA 1; see our Supplementary document for details on quantitation), with the FAA pool being drawn from hydrolysis of terminal amino acids in degraded proteins for which racemization had already occurred63,64,65. In contrast to Mann4, the higher D/L Asx in Mann2 and Mcav1 may be due to degradation of proteins, potentially linked with recrystallization, as observed in protein extracted from the partially recrystallized Mcav1 and analyzed by SDS-PAGE (Fig. 2). This is similar to the process by which Asx racemization is greater when demineralization of bone results in degradation of the collagen triple helix66,67. Differences in protein content between the fossil coral specimens is likely not due to differences in collection or preservation as documentation for the fossil skeletons indicates that they were collected by the same person at the same time and then stored together in warehouses at the NHMLA after their donation.
All three Pleistocene corals also exhibited relative amino acid molar concentrations similar to archived modern specimens reported here as well as archived material from a modern Porites skeleton and modern and fossil Acropora skeletons for which amino acid relative quantification was performed as part of racemization analysis61,63 (Table 1). Even Mann2, fully recrystallized to calcite, retained a THAA relative composition roughly similar to its modern counterparts with a persistent bias toward the acidic residue-containing amino acid groups, Asx and Glx. However, compared with modern coral skeletons, we observed decreased relative Asx and Ser and increased relative Val in Mann4 and Mcav1, decreased relative Glx in Mann2, and increased relative Ala in all three fossil specimens (Table 1, SI Table 6). The difference in amino acid composition could be due to loss of highly acidic proteins, of which there are multiple types in coral skeleton13,41,68, and which would reduce the remaining Asx pool while increasing the relative abundance of the remaining non-acidic residues. We also observed a bias toward smaller peptides observed by SDS-PAGE (Fig. 2). Together with the non-modern D/L Asx and D/L Glx values, these are indicative of both very old proteins and of protein degradation and loss in the fossil coral skeletons. Further confirmation that we did indeed sequence fossil coral skeletons is provided by degradation signatures in detected peptides. In particular, the detected peptide in SAARP3 in the acid-insoluble SOM fraction is suggested by MS1 data to be deamidated in both asparagines in the peptide. This deamidation is a known degradation feature observed in other paleoproteomic analyses (e.g.69,70). Further effects of proteins being locked in aragonite crystals may include differential production of isoaspartate (or gamma glutamic acid) during asparagine (or glutamine) deamidation71,72; unfortunately, the low peptide yield was insufficient to pursue this analysis in the present study. Finally, our phylogenetic analysis shows that SAARP3 is a coral-specific protein (SI Fig. 2). In sum, we are confident that the six proteins sequenced from the fossil O. annularis, Mann4, are coral proteins of the same age as the skeleton.
It should be noted that extraction of coral proteins for sequencing is destructive. While owners of gifted or loaned coral skeletons were informed of this ahead of time and approved such use of the specimens, we remain cognizant that our use of portions of the samples precludes analysis of these portions in the future. We therefore sought to minimize the amount of material used for sequencing. Some modern coral skeletal proteome sequencing has used 10–30 g of cleaned skeleton powder27,28, but comparable protein detection can be obtained from approximately 1 g of cleaned modern material29. In this study, we found that this smaller amount of skeleton allowed sequencing of some fossil proteins and reinforces that these biomolecules are potentially available for sequencing from invertebrate biominerals aged over 100 ka. More material may yield better detection, and hence more sequenced peptides, in future studies but specimens should be carefully chosen to minimize loss of irreplaceable samples. Further, enzymatic digestion success is likely minimized by amino acid racemization. While we show here that we can successfully sequence several known modern skeletal proteins in fossil coral specimens, future method refinement may consider inclusion of other, non-enzymatic, peptide cleavage methods; this could include the use of cyanogen bromide (e.g.73), although significant accumulation of methionine oxidation in fossil specimens would minimize its effectiveness74.
In summary, we show that fossil coral skeletons that retain their primary aragonite mineralogy preserve endogenous proteins that we extracted and sequenced by standard methods, while proteins in recrystallized fossil coral skeleton are too degraded for sequencing. To be retained in the skeleton for over 100 ka, these proteins must have been intimately associated with the aragonite crystal. Our work supports this and pushes back the age at which these phylogenetically informative skeleton biomolecules can be obtained from the invertebrate fossil record.
Colonies of fossil Orbicella annularis and Montastraea cavernosa skeletons were borrowed from the Natural History Museum of Los Angeles (NHMLA) Invertebrate Paleontology Department (Fig. 1). All specimens were originally collected from Pleistocene deposits in the Key Largo Formation (FL) aged 125 to 138 ka42,75. These and modern corals of several species, also NHMLA collections, and the surface layer of a privately-owned O. annularis were analyzed for skeleton integrity (SI Table 1).
Slabbed coral fragments were soaked in equal parts 30% hydrogen peroxide and 3% sodium hypochlorite after Stoll76 and then ground to 125 µm . Skeleton powder was cleaned three additional times before being dried at 40 °C. Cleaning was sufficient to remove contaminant proteins, as determined from phosphate buffered saline solutions soaked on cleaned skeleton powders and then concentrated on 3 kDa Amicon Ultra centrifugal filter units (Millipore), at the detection level of bichronoic acid assays (SI Table 4), of Stain-Free SDS-PAGE (Bio-Rad) and imaging (SI Fig. 3), and were at concentrations three orders of magnitude lower than that observed in cleaned coral skeleton powder by amino acid analysis61. All clean powder was only handled in age-specific glove bags (i.e. separate bags for fossil and modern), and modern samples were never handled at the same time as fossil samples for any biochemical analysis.
Duplicate milled sub-samples of cleaned skeleton were cleaned after Stoll76 and dried at 60 °C before analysis on a Panalytical X’Pert Pro X-ray Powder Diffractometer (Malvern) on a zero diffraction background plate. Spectra were analyzed in X’Pert Hi-Score software and relative amounts of aragonite and calcite were determined by the Reference Intensity Ratio method77 using 01–072-1652 calcite and 00–041-1475 aragonite references for all samples. After x ray diffraction analysis, elemental composition of powders was measured on an Element XR HR-ICP-MS (Thermo Fisher) using Cibicides wuellerstorfi (CAM-wuell) as a consistency standard78.
Amino acid racemization
Amino acids for racemization analysis, both free and total hydrolysable amino acids, were extracted, hydrolyzed, and evaporated to dryness from cleaned skeleton powders by standard methods79. All samples were prepared in duplicate and analyzed at the Northern Arizona University Amino Acid Geochronology Laboratory using standard methods with modifications for microfossils80,81,82. Rehydrated samples were spiked with L-homo-arginine as an internal standard and then injected into an HPLC fitted with a reverse-phase C18-packed column. ‘Blank’ samples were included.
Approximately 1 g cleaned powder from each coral was decalcified in 0.5 M glacial acetic acid. Acid insoluble matrix (AIM) pellets were rinsed twice in ice-cold 80% acetone whereas acid soluble matrix (ASM) was precipitated in ice-cold 100% acetone and then rinsed twice in ice-cold 80% acetone. Pellets were immediately submitted for protein sequencing. A subset of extracted proteins were separated by SDS-PAGE on 4–20% Mini-PROTEAN TGX Stain-Free™ precast gels (Bio-Rad) which were imaged on a Bio-Rad ChemiDoc XRS + imager following UV light exposure for 5 min.
LC–MS/MS protein sequencing
Skeletal protein AIM and ASM samples were dissolved in 2% SDS buffer and digested using either a filter aided sample preparation (FASP; Mann4)83 or a multi-enzyme digestion filter aided sample preparation protocol (MED-FASP; Mcav1, Mann2, Mann4, and the surface layer of a modern O. annularis) modified from54. Briefly, protein was dissolved in SDS buffer and placed in a 30 kDa Microcon Centrifugal Unit (Sigma Aldrich), SDS was displaced using an 8 M urea solution and then the sample was diluted to 2 M urea and digested with trypsin (Promega). Digested peptides were moved through the filter into a micro-centrifuge tube (low retention; Fisher). Any undigested material that remained on the filter was then digested with Glu-C (Promega) and peptides centrifuged into a second micro-centrifuge tube. Each fraction was analyzed separately on an nano-liquid-chromatography system coupled to a benchtop high-resolution orbitrap mass spectrometer (QE-Plus; Thermo Fisher) and operated in positive ion mode with data-dependent acquisition. MS1 was performed at resolution of 70,000 (at 400 m/z) and MS2 at 17,500. Peak lists were extracted from raw spectra and processed using a Mascot (2.4; Matrix Science) server against Montastraea cavernosa, M. faveolata, and Platygyra carnosus protein databases downloaded from comparative.reefgenomics.org35, an O. faveolata protein database84, and the O. annularis genome predicted protein database85 under NCBI BioProject 550266. A common contaminants database downloaded from the Max Planck Institute of Biochemistry, Martinsried, and a UniProt-Human database were included in the analysis to test for contaminants. We also separately ran the LC–MS/MS data in Mascot against UniProt-bacteria, UniProt-cyanobacteria, and UniProt-fungi databases and confirmed that all detected peptides from those databases did not match peptides assigned to corals (SI Table 5). We also ran LC–MS/MS data for a preparation blank sample against the coral, common contaminants, and UniProt-Human databases to confirm that other contaminant from the preparation process were not assigned to corals (SI Table 5). For all Mascot runs, we applied carbamidomethylation of cysteine as a fixed modification and oxidation of methionine, acetylation, and deamidation of asparagine and glutamine as variable modifications. Enzyme specificity was set to trypsin with one missed cleavage allowed. Mass tolerances were set to 10 ppm and 20 mmu for precursor and product ions, respectively, and precursor charge was set to 2 + , 3 + , or 4 + .
Initial decoy searches were performed in Mascot using a 1% false discovery rate to determine the appropriate significance value setting. Next, we performed Mascot error-tolerant searches with this significance setting. Only protein sequences above the cutoff score with at least two independent significant peptides detected, or one peptide detected significantly multiple times, were retained. We blasted these sequences against the NCBI nr database in Blast2GO. Further, we BLASTed returned proteins against the NCBI Homo sapiens database and manually checked hits with high sequence similarity for identity of LC–MS/MS detected peptides; if ‘coral’ and human peptides were identical, we manually removed the protein sequence from our list of coral skeletal proteins. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE86 partner repository with the dataset identifiers found in SI Table 7.
Phylogenetic analysis of SAARP3
CARP4/SAARP1, CARP5/SAARP2, and P27/acidic SOMP/SAARP3, previously sequenced from coral skeletons26,27,28,29, were blasted against the cnidarian predicted protein databases in comparative.reefgenomics.org35. They were also blasted against NCBI and the top non-cnidarian hits with E-values better than e-20, two Crassostrea gigas sequences, were retained. Multiple sequence alignments were generated in T-Coffee87,88. Aligned protein sequences were trimmed using the TrimAl v1.3 alignment utility in Phylemon2 using the gappyout method89,90. The most appropriate model was chosen in ProtTest 391, and then maximum likelihood trees were constructed in PhyML using the WAG + G + I substitution model with bootstrap set to 1000 and all other pre-set parameters92.
Amino acid composition statistics
Relative content of each amino acid (THAA) was compared between the replicate data for each fossil coral skeleton reported on here versus average values from modern O. annularis, Fungia sp. Pocillopora damicornis, P. acuta, and Porites lobata, plus previously reported values from modern Acropora palmata61 and Porites australiensis63, as reported or reproduced in Table 1, in RStudio93. Shapiro–Wilk normality tests showed that all modern amino acids exhibited normal distribution, so that Student’s t-tests were applied (SI Table 6).
Stoneking, M. & Krause, J. Learning about human population history from ancient and modern genomes. Nat. Rev. Genetics 12, 603 (2011).
Orlando, L. et al. Ancient DNA and the population genetics of cave bears (Ursus spelaeus) through space and time. Mol. Biol. Evol. 19, 1920–1933 (2002).
Green, R. E. et al. Analysis of one million base pairs of Neanderthal DNA. Nature 444, 330 (2006).
Buckner, J. C., Ellingson, R., Gold, D. A., Jones, T. L. & Jacobs, D. K. Mitogenomics supports an unexpected taxonomic relationship for the extinct diving duck Chendytes lawi and definitively places the extinct Labrador Duck. Mol. Phylogen. Evol. 122, 102–109 (2018).
Orlando, L. et al. Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature 499, 74 (2013).
Bokelmann, L. et al. A genetic analysis of the Gibraltar Neanderthals. Proc. Natl. Acad. Sci. 116, 15610–15615 (2019).
Der Sarkissian, C. et al. Ancient DNA analysis identifies marine mollusc shells as new metagenomic archives of the past. Mol. Ecol. Resour. 17, 835–853 (2017).
Schroeter, E. R. et al. Expansion for the Brachylophosaurus canadensis collagen I sequence and additional evidence of the preservation of Cretaceous protein. J. Proteome Res. 16, 920–932 (2017).
Buckley, M., Larkin, N. & Collins, M. Mammoth and mastodon collagen sequences; survival and utility. Geochim. Cosmochim. Acta 75, 2007–2016. https://doi.org/10.1016/j.gca.2011.01.022 (2011).
Presslee, S. et al. Palaeoproteomics resolves sloth relationships. Nat. Ecol. Evol. 3(7), 1121–1130 (2019).
Eastoe, J. & Eastoe, B. The organic constituents of mammalian compact bone. Biochem. J. 57, 453 (1954).
Lowenstam, H. A. & Weiner, S. On Biomineralization (Oxford University Press, Oxford, 1989).
Mass, T., Drake, J. L., Peters, E. C., Jiang, W. & Falkowski, P. G. Immunolocalization of skeletal matrix proteins in tissue and mineral of the coral Stylophora pistillata. Proc. Natl. Acad. Sci. 111, 12728–12733 (2014).
Muscatine, L. et al. Stable isotopes (δ13C and δ15N) of organic matrix from coral skeleton. Proc. Natl. Acad. Sci. USA 102, 1525–1530. https://doi.org/10.1073/pnas.0408921102 (2005).
Tornabene, C., Martindale, R. C., Wang, X. T. & Schaller, M. F. Detecting photosymbiosis in fossil scleractinian corals. Sci. Rep. 7, 9465 (2017).
Nance, J., Armstrong, J., Cody, G., Fogel, M. & Hazen, R. Preserved macroscopic polymeric sheets of shell-binding protein in the Middle Miocene (8 to 18 Ma) gastropod Ecphora. Geochem. Perspect. Lett. 1, 1–9 (2015).
Myers, C. E. et al. Exceptional preservation of organic matrix and shell microstructure in a Late Cretaceous Pinna fossil revealed by photoemission electron spectromicroscopy. Geology 46, 711–714 (2018).
Stanley, G. D. & Fautin, D. G. The origins of modern corals. Science 291, 1913–1914 (2001).
Weiner, S. & Hood, L. Soluble protein of the organic matrix of mollusk shells: a potential template for shell formation. Science 190, 987–989. https://doi.org/10.1126/science.1188379 (1975).
Mann, K., Poustka, A. & Mann, M. The sea urchin (Strongylocentrotus purpuratus) test and spine proteomes. Proteome Sci. 6, 22 (2008).
Mann, K., Poustka, A. & Mann, M. In-depth, high-accuracy proteomics of sea urchin tooth organic matrix. Proteome Sci. 6, 33 (2008).
Wainwright, S. A. Skeletal organization in the coral, Pocillopora damicornis. Quarterly J. Microsc. Sci. s3–104, 169–183 (1963).
Sakalauskaite, J. et al. 'Palaeoshellomics’ reveals the use of freshwater mother-of-pearl in prehistory. eLife 8, e45644 (2019).
Sakalauskaite, J., Marin, F., Pergolizzi, B. & Demarchi, B. Shell palaeoproteomics: first application of peptide mass fingerprinting for the rapid identification of mollusc shells in archaeology. J. Proteomics 227, 103920 (2020).
Drake, J. et al. How corals made rocks through the ages. Global Change Biol. 26, 31 (2019).
Drake, J. L. et al. Proteomic analysis of skeletal organic matrix from the stony coral Stylophora pistillata. Proc. Natl. Acad. Sci. 110, 3788–3793 (2013).
Ramos-Silva, P. et al. The skeletal proteome of the coral Acropora millepora: the evolution of calcification by cooption and domain shuffling. Mol. Biol. Evol. https://doi.org/10.1093/molbev/mst109 (2013).
Takeuchi, T., Yamada, L., Shinzato, C., Sawada, H. & Satoh, N. Stepwise evolution of coral biomineralization revealed with genome-wide proteomics and transcriptomics. PLoS ONE 11, e0156424. https://doi.org/10.1371/journal.pone.0156424 (2016).
Peled, Y. et al. Optimization of skeletal protein preparation for LC–MS/MS sequencing yields additional coral skeletal proteins in Stylophora pistillata. BMC Mater. 2, 8. https://doi.org/10.1186/s42833-020-00014-x (2020).
Al-Horani, F. A., Al-Moghrabi, S. M. & De Beer, D. Microsensor study of photosynthesis and calcification in the scleractinian coral, Galaxea fascicularis: active internal carbon cycle. J. Exp. Mar. Biol. Ecol. 288, 1–15 (2003).
McCulloch, M. T. et al. In Boron Isotopes. 145–162 (Springer, Berlin, 2018).
Sevilgen, D. S. et al. Full in vivo characterization of carbonate chemistry at the site of calcification in corals. Sci. Adv. 5, eaau7447 (2019).
Mass, T. et al. Aragonite precipitation by “proto-polyps” in coral cell cultures. PLoS ONE 7, e35049. https://doi.org/10.1371/journal.pone.0035049 (2012).
Mass, T., Drake, J. L., Heddleston, J. M. & Falkowski, P. G. Nanoscale visualization of biomineral formation in coral proto-polyps. Curr. Biol. 27, 3191.e3193–3196.e3193 (2017).
Bhattacharya, D. et al. Comparative genomics explains the evolutionary success of reef-forming corals. eLife 5, e13288 (2016).
Voolstra, C. R. et al. The ReFuGe 2020 Consortium—using “omics” approaches to explore the adaptability and resilience of coral holobionts to environmental change. Front. Mar. Sci. 2, 68 (2015).
Shinzato, C. et al. Using the Acropora digitifera genome to understand coral responses to environmental change. Nature 476, 320–323. https://www.nature.com/nature/journal/v476/n7360/abs/nature10249.html#supplementary-information (2011).
Mass, T. et al. Cloning and characterization of four novel coral acid-rich proteins that precipitate carbonates in vitro. Curr. Biol. 23, 1126–1131. https://doi.org/10.1016/j.cub.2013.05.007 (2013).
Bertucci, A., Tambutté, S., Supuran, C., Allemand, D. & Zoccola, D. A new coral carbonic anhydrase in Stylophora pistillata. Mar. Biotechnol. 13, 992–1002. https://doi.org/10.1007/s10126-011-9363-x (2011).
Zoccola, D. et al. Coral carbonic anhydrases: regulation by ocean acidification. Mar. Drugs 14, 109 (2016).
Akiva, A. et al. Minerals in the pre-settled coral Stylophora pistillata crystallize via protein and ion changes. Nat. Commun. 9, 1880 (2018).
Fruijtier, C., Elliott, T. & Schlager, W. Mass-spectrometric 234U–230Th ages from the Key Largo Formation, Florida Keys, United States: constraints on diagenetic age disturbance. Geol. Soc. Am. Bull. 112, 267–277 (2000).
Multer, H. G., Gischler, E., Lundberg, J., Simmons, K. R. & Shinn, E. A. Key Largo limestone revisited: Pleistocene shelf-edge facies, Florida Keys, USA. Facies 46, 229–271 (2002).
Von Euw, S. et al. Biological control of aragonite formation in stony corals. Science 356, 933–938 (2017).
Nelson, R. E. et al. Peroxidasin: a novel enzyme-matrix protein of Drosophila development. EMBO J. 13, 3438–3447 (1994).
Alford, A. I., Terkhorn, S. P., Reddy, A. B. & Hankenson, K. D. Thrombospondin-2 regulates matrix mineralization in MC3T3-E1 pre-osteoblasts. Bone 46, 464–471 (2010).
DuBose, K. B., Zayzafoon, M. & Murphy-Ullrich, J. E. Thrombospondin-1 inhibits osteogenic differentiation of human mesenchymal stem cells through latent TGF-β activation. Biochem. Biophys. Res. Commun. 422, 488–493 (2012).
Zhu, F. et al. Mutations in PMFBP1 cause acephalic spermatozoa syndrome. Am. J. Hum. Genetics 103, 188–199 (2018).
Landlinger, C., Salzer, U. & Prohaska, R. Myristoylation of human LanC-like protein 2 (LANCL2) is essential for the interaction with the plasma membrane and the increase in cellular sensitivity to adriamycin. Biochim. Biophys. Acta (BBA)-Biomembr. 1758, 1759–1767 (2006).
Mares-Guia, M. & Shaw, E. Studies on the Active Center of Trypsin—the binding of amidines and guanidines as models of the substrate side chain. J. Biol. Chem. 240, 1579–1585 (1965).
Hayashi, R. & Kameda, I. Racemization of amino acid residues during alkali-treatment of protein and its adverse effect on pepsin digestibility. Agric. Biol. Chem. 44, 891–895 (1980).
Hayashi, R. & Kameda, I. Decreased proteolysis of alkali-treated protein: consequences of racemization in food processing. J. Food Sci. 45, 1430–1431 (1980).
Meyer, J. G. In silico proteome cleavage reveals iterative digestion strategy for high sequence coverage. ISRN Computational Biology (2014).
Wiśniewski, J. R. & Mann, M. Consecutive proteolytic digestion in an enzyme reactor increases depth of proteomic and phosphoproteomic analysis. Anal. Chem. 84, 2631–2637 (2012).
Cuif, J.-P., Dauphin, Y., Berthet, P. & Jegoudez, J. Associated water and organic compounds in coral skeletons: Quantitative thermogravimetry coupled to infrared absorption spectrometry. Geochem. Geophys. Geosyst. 5, Q11011. https://doi.org/10.1029/2004gc000783 (2004).
Buckley, M., Warwood, S., van Dongen, B., Kitchener, A. C. & Manning, P. L. A fossil protein chimera; difficulties in discriminating dinosaur peptide sequences from modern cross-contamination. Proc. R. Soc. B. 284, 20170544 (2017).
Hendy, J. et al. A guide to ancient protein studies. Nat. Ecol. Evol. 2, 791–799 (2018).
Hare, P. & Mitterer, R. Non-protein amino acids in fossil shells: Carnegie Institution of Washington Year Book, Vol. 65 (1967).
Abelson, P. & Hare, P. Recent origin of amino acids in the Gunflint chert. Spec. Pap. Geol. Soc. Am. 121, 2 (1968).
Bravenec, A. D., Ward, K. D. & Ward, T. J. Amino acid racemization and its relation to geochronology and archaeometry. J. Sep. Sci. 41(6), 1489–1506 (2018).
Tomiak, P. et al. The role of skeletal micro-architecture in diagenesis and dating of Acropora palmata. Geochim. Cosmochim. Acta 183, 153–175 (2016).
Wehmiller, J. F., Hare, P. & Kujala, G. Amino acids in fossil corals: racemization (epimerization) reactions and their implications for diagenetic models and geochronological studies. Geochim. Cosmochim. Acta 40, 763–776 (1976).
Goodfriend, G. A., Hare, P. & Druffel, E. R. Aspartic acid racemization and protein diagenesis in corals over the last 350 years. Geochim. Cosmochim. Acta 56, 3847–3850 (1992).
Demarchi, B. et al. Amino acid racemization dating of marine shells: a mound of possibilities. Quat. Int. 239, 114–124 (2011).
Hendy, E. J. et al. Assessing amino acid racemization variability in coral intra-crystalline protein for geochronological applications. Geochim. Cosmochim. Acta 86, 338–353 (2012).
High, K., Milner, N., Panter, I. & Penkman, K. Apatite for destruction: investigating bone degradation due to high acidity at Star Carr. J. Archaeol. Sci. 59, 159–168 (2015).
Collins, M., Waite, E. & Van Duin, A. Predicting protein decomposition: the case of aspartic–acid racemization kinetics. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 354, 51–64 (1999).
Neder, M. et al. Mineral formation in the primary polyps of pocilloporoid corals. Acta Biomater. https://doi.org/10.1016/j.actbio.2019.07.016 (2019).
Demarchi, B. et al. Protein sequences bound to mineral surfaces persist into deep time. eLife 5, e17092 (2016).
Welker, F. et al. Ancient proteins resolve the evolutionary history of Darwin/’s South American ungulates. Nature 522, 81–84 (2015).
Cournoyer, J. J. et al. Deamidation: differentiation of aspartyl from isoaspartyl products in peptides by electron capture dissociation. Protein Sci. 14, 452–463 (2005).
Li, X., Lin, C. & O’Connor, P. B. Glutamine deamidation: differentiation of glutamic acid and γ-glutamic acid in peptides by electron capture dissociation. Anal. Chem. 82, 3606–3615 (2010).
Thomas, O. R. et al. The inner ear proteome of fish. FEBS J. 286, 66–81 (2019).
Gross, E. in Methods Enzymol. Vol. 11, pp. 238–255 (Elsevier, Amsterdam, 1967).
Osmond, J., Carpenter, J. & Windom, H. Th230/U234 age of the Pleistocene corals and oolites of Florida. J. Geophys. Res. 70, 1843–1847 (1965).
Stoll, H. M. et al. A first look at paleotemperature prospects from Mg in coccolith carbonate: cleaning techniques and culture measurements. Geochem. Geophys. Geosyst. https://doi.org/10.1029/2000GC000144 (2001).
Chung, F. H. Quantitative interpretation of X-ray diffraction patterns of mixtures. I. Matrix-flushing method for quantitative multicomponent analysis. J. Appl. Crystallogr. 7, 519–525 (1974).
Misra, S., Owen, R., Kerr, J., Greaves, M. & Elderfield, H. Determination of δ11B by HR-ICP-MS from mass limited samples: application to natural carbonates and water samples. Geochim. Cosmochim. Acta 140, 531–552 (2014).
Penkman, K., Kaufman, D., Maddy, D. & Collins, M. Closed-system behaviour of the intra-crystalline fraction of amino acids in mollusc shells. Quat. Geochronol. 3, 2–25 (2008).
Kaufman, D. S. & Manley, W. F. A new procedure for determining DL amino acid ratios in fossils using reverse phase liquid chromatography. Quat. Sci. Rev. 17, 987–1000 (1998).
Kaufman, D. S. In Perspectives in Amino Acid and Protein Geochemistry 145 (2000).
Kaufman, D. Dating deep-lake sediments by using amino acid racemization in fossil ostracodes. Geology 31, 1049–1052 (2003).
Wiśniewski, J. R., Zougman, A., Nagaraj, N. & Mann, M. Universal sample preparation method for proteome analysis. Nat. Methods 6, 359 (2009).
Anderson, D. A., Walz, M. E., Weil, E., Tonellato, P. & Smith, M. C. RNA-Seq of the Caribbean reef-building coral Orbicella faveolata (Scleractinia-Merulinidae) under bleaching and disease stress expands models of coral innate immunity. PeerJ 4, e1616 (2016).
Kamel, B., Avila-Magaña, V., Sunagawa, S., Diaz-Almeyda, E., Gonzales, A.M., Ohdera, A.H., Prada, C., Pollock, F.J., Ardell, D., Belmonte, A. Briggs, C., Capo, T., Closek, C., Coykendall, D.K., Galindo-Martínez, C.T., Gillette, P., Green, E., Grimwood, J., Grigoriev, I., Jegla, T., Klueter, A., Lawrence, T.J., Morrison, C.L., Mydlarz, L., Pinzon, J.H., Schmutz, J., Levitan, D.R., Weber, M., Iglesias-Prieto, R., Knowlton, N., Coffroth, M.A., Kitano, H., Woodley, C., Medina, M. Microbes stabilize coral-algal symbiosis (in review).
Perez-Riverol, Y. et al. The PRIDE database and related tools and resources in 2019: Improving support for quantification data. Nucleic Acids Res. 47, D442–D450 (2019).
Di Tommaso, P. et al. T-Coffee: a web server for the multiple sequence alignment of protein and RNA sequences using structural information and homology extension. Nucleic Acids Res., gkr245 (2011).
Notredame, C., Higgins, D. G. & Heringa, J. T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 302, 205–217 (2000).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Sánchez, R. et al. Phylemon 2.0: a suite of web-tools for molecular evolution, phylogenetics, phylogenomics and hypotheses testing. Nucleic Acids Res. 39, W470-W474 (2011).
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics 27, 1164–1165 (2011).
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Team, R. RStudio: Integrated Development for R. https://www.rstudio.com/ (2019).
Hathorne, E. C. et al. Interlaboratory study for coral Sr/Ca and other element/Ca ratio measurements. Geochem. Geophys. Geosyst. 14, 3730–3750 (2013).
Swart, P., Elderfield, H. & Greaves, M. A high-resolution calibration of Sr/Ca thermometry using the Caribbean coral Montastraea annularis. Geochem. Geophys. Geosyst. 3, 1–11 (2002).
de Villiers, S., Shen, G. & Nelson, B. The Sr/Ca-temperature relationship in coralline aragonite: influence of variability in (Sr/Ca) seawater and skeletal growth parameters. Geochim. Cosmochim. Acta 58, 197–208 (1994).
Quinn, T. M. & Sampson, D. E. A multiproxy approach to reconstructing sea surface conditions using coral skeleton geochemistry. Paleoceanography 17 (2002).
Hendy, E., Gagan, M., Lough, J., McCulloch, M. & DeMenocal, P. Impact of skeletal dissolution and secondary aragonite on trace element and isotopic climate proxies in Porites corals. Paleoceanography 22 (2007).
Nothdurft, L. D., Webb, G. E., Bostrom, T. & Rintoul, L. Calcite-filled borings in the most recently deposited skeleton in live-collected Porites (Scleractinia): Implications for trace element archives. Geochim. Cosmochim. Acta 71, 5423–5438 (2007).
Allison, N., Finch, A. A., Webster, J. M. & Clague, D. A. Palaeoenvironmental records from fossil corals: the effects of submarine diagenesis on temperature and climate estimates. Geochim. Cosmochim. Acta 71, 4693–4703 (2007).
Gothmann, A. M. et al. Fossil corals as an archive of secular variations in seawater chemistry since the Mesozoic. Geochim. Cosmochim. Acta 160, 188–208. https://doi.org/10.1016/j.gca.2015.03.018 (2015).
Griffiths, N., Müller, W., Johnson, K. G. & Aguilera, O. A. Evaluation of the effect of diagenetic cements on element/Ca ratios in aragonitic Early Miocene (~ 16Ma) Caribbean corals: Implications for ‘deep-time’ palaeo-environmental reconstructions. Palaeogeogr. Palaeoclimatol. Palaeoecol. 369, 185–200 (2013).
We thank Austin Hendy and Kathy Omura at the Natural History Museum of Los Angeles for access to modern and fossil coral skeleton samples; Lisa Greer for the modern Orbicella annularis specimen; Whitaker Cohn and Lucy Gao at UCLA for mass spectrometry sequencing optimization; Saeed Kahn at UCLA for assistance with XRD; Tina Treude at UCLA for use of instrumentation; Jill Sutton, Maxence Guillermic, Rob Eagle, and Aradhna Tripati at the Institut Universitaire Européen de la Mer, Technopôle Brest Iroise and UCLA for aquiring trace element data, with methods and analyses using the support of an IAGC student grant 2017 and LabexMer grant (no. ANR-10-LABX-19-01); Jordon Bright, Katherine Whitacre, and Darrell Kaufman at the Northern Arizona University Amino Acid Geochronology Laboratory for amino acid racemization analysis; and Dr. Wendy Wooten and students in the after-school Proteomics Training Program at Reseda Charter High School for assistance with data processing. We also thank the reviewers for helping to improve the manuscript. JD was supported by an NSF Postdoctoral Research Fellowship in Biology award #1611943 and the Zuckerman STEM postdoctoral leadership program. JW acknowledges support from the NIDDK P30 DK063491.
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Drake, J.L., Whitelegge, J.P. & Jacobs, D.K. First sequencing of ancient coral skeletal proteins. Sci Rep 10, 19407 (2020). https://doi.org/10.1038/s41598-020-75846-4
Journal of The Royal Society Interface (2021)
Assessing the predictive taxonomic power of the bony labyrinth 3D shape in horses, donkeys and their F1-hybrids
Journal of Archaeological Science (2021)
Fossil Corals With Various Degrees of Preservation Can Retain Information About Biomineralization-Related Organic Material
Frontiers in Earth Science (2021)