Abstract
Collagen has been postulated to be the most abundant protein in our body, making up one-third of the total protein content in mammals. However, a direct assessment of the total collagen levels of an entire mammal to confirm this estimate is missing. Here we measured hydroxyproline levels as a proxy for collagen content together with total protein levels of entire mice or of individual tissues. Collagen content normalized to the total protein is approximately 0.1% in the brain and liver, 1% in the heart and kidney, 4% in the muscle and lung, 6% in the colon, 20–40% in the skin, 25–35% in bones, and 40–50% in tendons of wild-type (CD1 and CB57BL/6) mice, consistent with previous reports. To our surprise, we find that collagen is approximately 12% in females and 17% in males of the total protein content of entire wild-type (CD1 and CB57BL/6) mice. Although collagen type I is the most abundant collagen, the most abundant proteins are albumin, hemoglobulin, histones, actin, serpina, and then collagen type I. Analyzing amino acid compositions of mice revealed glycine as the most abundant amino acid. Thus, we provide reference points for collagen, matrisome, protein, and amino acid composition of healthy wild-type mice.
Similar content being viewed by others
Introduction
Progress in biomedical science depends on building models using existing data to predict the outcome of a future experiment and either confirm or adapt the working model1. These models then should be derived from data and not from assumptions. One such assumption used as a point of reference is the statement: “collagen is the most abundant protein making up one-third or 30% of total protein in mammals or vertebrates.” We found over 50 publications making such a statement without any reference or citing a review paper. It seems as if this statement is a given textbook fact (BNID 109731)2 (https://en.wikipedia.org/wiki/Collagen, accessed 05.10.22); however, the actual quantifications are elusive. The biochemical properties of collagens have been studied since the end of the 18th hundred3. Given the recent increased interest in collagen research of over 8000 papers published per year (Fig. 1a, Supplementary Table 1), it might be essential to have a reference point for collagen levels in tissues and entire organisms. For instance, quantification of collagen levels is important in different disease settings, such as fibrosis (too much collagen deposition4 and 45% of all human deaths5), immune-induced degradation of collagen in arthritis6,7), more collagen and stiffer tumor environment8,9, excessive collagen in skin diseases (scleroderma, keloids)10,11, cancer patient survival correlates with specific collagen types12, fibrotic collagen remodeling in cardiovascular diseases13,14, and aging usually associated with lower collagen levels15.
Collagen is a fascinating biomaterial that is highly biocompatible and used for many biomedical applications in processes, such as wound healing, tissue repair, and dental implants16. Collagen-based materials have been used for medical applications already reported by Galan two millennia ago for reconstructive surgery on tendons and wound closure on gladiators17. Collagen can survive longer than other biomolecules, such that collagen has been extracted from dinosaur bones18, and is used for estimating the age of a specimen19,20. During evolution, the invention of collagen probably enabled the transitions from unicellular to multicellular organisms solving three major problems, cell and tissue adhesion, mechanical support (skeleton), and protection in the form of barrier function and mechanical shock absorption21. Collagen is characterized by its triple helix repeats of (Gly-X-Y)n whereby Gly stands for glycine, X and Y can be any amino acid but most commonly are proline and hydroxyproline (reviewed in22,23). For sterical reasons, the glycine on every third position is essential for adequately forming the triple helix composed of the three collagen chains (reviewed in22,23). Hydroxyproline is relatively unique to collagens (only in a few other proteins found, e.g., elastin, argonaute-2, HIF-1⍺)24 and is essential for stabilizing the collagen triple helix (reviewed in22,23).
There are several methods for the detection of collagen levels: quantifying hydroxyproline levels, collagen stainings, densitometry of collagen bands in sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS-PAGE), enzyme-linked immunosorbent assay (ELISA) coupled with collagen antibodies, immunohistochemistry using collagen antibodies, quantitative mass spectrometry enriched for extracellular matrix proteins4,25,26,27,28,29. The gold standard in the field for assessing collagen levels is via hydroxyproline quantification25,29.
Here, we hydrolyzed entire mouse samples to amino acid levels and used hydroxyproline and amino acid levels as a proxy to estimate the total collagen over total protein levels. We find two times lower total collagen levels than previously claimed in the literature. Our proteomics data suggest that collagen might not be the most abundant protein.
Results
Quantifying collagen levels of tissues
The importance of collagens for engineering, biological, and medical research is reflected in the exponential increase in publications about collagens over the last 60 years (Fig. 1a, Supplementary Table 1). In the literature, it has been stated that collagen makes up about 60–90% of the dry weight of bone, skin, sclera, and tendon (Fig. 1b). These percentages seem to be a given in the field since many of these statements have either no reference or cite review publications (Fig. 1b, Supplementary Table 2). Our literature search to identify the source for these percentages led to a review paper by Grant from 197230 that summarized the collagen content of various tissues and animals. For the original work, we found two publications by Lowry and colleagues from 194131 and Neuman and Logan 195032 that measured the collagen content of dried tissue. However, we found no current experimental or quantitative evidence. To fill this gap, we measured total collagen levels in mice. Instead of normalizing to the organs’ dry weight, we decided to normalize to total protein levels. We harvested mouse tissues and hydrolyzed them with hydrogen chloride to the amino acid levels (Fig. 1c, “Materials and methods” for details). We made aliquots of these amino acid homogenates to quantify total collagen and protein levels. A colorimetric read-out was used to determine free hydroxyproline amino acids that were oxidized with chloramine-T to pyrrole and stained with Ehrlich’s reagent33. We normalized these hydroxyproline amino acid levels to a standard curve of rat tail collagen to determine total collagen levels (Fig. 1c). We validated our hydroxyproline and collagen quantification with an alternative but comparable protocol approach (Supplementary Fig. 1 and Supplementary Fig. 2a–e). Since standard protein quantification does not work with free amino acids in an acidic environment, we used the color change of genipin when it reacts with amino acids as a read-out34. We normalized these amino acid levels to a standardized curve of bovine serum albumin protein (BSA) to determine total protein levels (Fig. 1c; “Materials and methods” for details). Thus, with this protocol, we overcame the obstacle of the insolubility of collagens in the extracellular matrix and were able to quantify total collagen over total protein (Supplementary Fig. 2f.).
Collagen levels across mice tissues range from 0.1 to 51% of total protein levels
We chose 4 weeks old outbred wild-type CD1 female mice that started to reach sexual maturity35 and 41 weeks old inbred wild-type C57BL/6 male mice that are considered middle-aged mice used for healthspan assays36, harvested their organs, and quantified collagen over protein levels (Fig. 1d–i). We found that the brain and liver had the least amount of collagen to protein levels (0.1%; Fig. 1d), heart and kidney around 1%, muscle and lung around 3.5%, colon 5.7%, skin 20%, bone 25–35%, and tendon about 40% of total collagen to the protein of wild-type CD1 female mice (Fig. 1d; Supplementary Table 3). To provide a reference point, we quantified collagen or protein weight per weight of wet tissue (Fig. 1e,f; Supplementary Table 3). Next, we quantified collagen over protein levels of 41 weeks old wild-type C57BL/6 male mice and found higher collagen levels for skin (38.5%) and tendon (50.5%) (Fig. 1g; Supplementary Table 4). This difference could be either due to age, gender, or genotype. Furthermore, we noticed a heterogeneity of collagen to protein levels for individual animals’ organs within the same cohort (dots in the bar graph represent individual mice; Fig. 1d,e). For comparison, we searched the literature for collagen measurements normalized to protein levels (BSA, amino acids, nitrogen; Fig. 1j; Supplementary Table 5). We only found a few studies and included rat tissues as additional reference points. Reassuringly, the collagen over protein values were in similar ranges. Thus, our measurements provide reference values for collagen over protein levels for several organs of wild-type mice.
Quantifying total collagen levels normalized to total proteins of entire mice
An accumulating body of literature states the quote: “30% of total protein is collagen in mammals” and the quote: “collagen is the most abundant protein”; however, a reference to the original work that quantified these percentages is missing (Fig. 2a, Supplementary Table 6). None of the over 50 publications we found had an original reference for these statements (Supplementary Table 6). Our literature search to identify the original work for these percentages led to a review paper from 1961 as a probable source37. In this review article, R.D. Harkness states: “A complete analysis of a whole animal has only been performed on mice38. Collagen formed about a fifth of the total body protein.”37. In that referenced study38, Harkness and colleagues dissected nine adult male albino mice, each into five parts (skin, carcass, viscera, femora, quadriceps). They quantified the collagen content of these tissues by multiplying hydroxyproline levels by 7.46 and normalized them to total nitrogen (N) levels as a proxy for protein levels38. Then they added the values of these five parts to estimate the collagen levels of the entire mice. They found that: “Collagen formed 2.6 ± 0.1% of the body weight and 17.7 ± 0.7% of total N in the control animals”38.
There are limitations to this approach of combining and estimating total collagen levels from individual tissue parts. Moreover, quantification of the total collagen of an entire mouse has not been performed before. To address this, we homogenized individual entire mice and hydrolyzed aliquots of these whole mice homogenates with hydrogen chloride to the amino acid levels to determine total collagen over total protein levels of entire mice (Fig. 2b, “Materials and methods” for details). We found that 4-week-old outbred wild-type CD1 female mice contained 14.8% of collagen over total protein levels, inbred wild-type C57BL/6 female mice at the age of 4 weeks contained 12.0%, 7- or 21- weeks old had 12.8% of total collagen over total protein (Fig. 2c–e, Supplementary Table 7). Although in wild-type C57BL/6 female mice from 4 to 21 weeks, there was no significant change in collagen over protein levels, 7-week C57BL/6 male mice showed higher collagen over protein levels compared to corresponding 7-week C57BL/6 female mice, 16.9% versus 12.8%, respectively (Fig. 2c, Supplementary Table 7). This suggests a gender-specific effect on total collagen over protein levels. Furthermore, the 16.9% collagen to the protein content of our 7-week C57BL/6 male mice is comparable to the 17.7% of the male albino mice determined by Harkness et al.38.
To complement our colorimetric measurements of hydroxyproline and total amino acids levels, we analyzed amino acid levels within the samples with high-performance liquid chromatography (HPLC; Fig. 2f). We found that hydroxyproline (4HP) over total amino acids is 1.6% and 1.3% of 4-week old wild-type CD1 and C57BL/6 female mice, respectively (Fig. 2g, Supplementary Table 8). To convert the hydroxyproline levels to collagen levels, we used the conversion factor of 7.46. We found 11.8% and 10.0% of 4-week-old wild-type CD1 and C57BL/6 female mice, respectively (Fig. 2h, Supplementary Table 8). The conversion factor of 7.46 is an estimate based on hydroxyproline levels of collagens from various tissues of cows, pigs, sheep, chickens, kangaroos, and rats resulting in 13.0–14.4% of hydroxyproline in mammalian collagens32,39. Since experimental quantification of this conversion factor for mice is missing, we assessed and compared this conversion factor from our data. We found that a conversion factor of 7.46 reflects well collagen estimation from hydroxyproline in whole mouse samples (Supplementary Fig. 2g–i; Supplementary Table 9). Furthermore, in our HPLC, we also ran a collagen standard to know how much hydroxyproline levels correspond to total collagen levels. When normalizing with this collagen standard, we found 7.75× (Fig. 2i, Supplementary Table 8). Thus, we obtained comparable results either using a colorimetric approach or HPLC. Taken together, our measurements suggest that, on average female wild-type mice have about 12.3 ± 1.5% of collagen over total protein. Consistent with the previous measurements by Harkness et al.38, male wild-type mice showed about 17% of collagen over total protein.
To assess total collagen levels differently than by using hydroxyproline as a read-out for collagen, we digested our samples with pepsin. We used an adapted version of Sircol staining to quantify soluble collagens28. For Sircol staining, the dye Sirius red binds collagens and other non-collagenous proteins. By running the samples over columns that retain larger proteins, such as collagens, the accuracy for Sircol collagen quantification is improved28. We validated this approach using a collagen standard and found similar collagen content with the Sircol method compared to when assessed by hydroxyproline measurements for pure protein samples (Fig. 2j, Supplementary Table 10). From our whole mouse lysates digested with pepsin, we recovered about 5–14 mg collagen per g wet weight, which is about 19–41% of estimated collagen by hydroxyproline measurement (Fig. 2k,l, Supplementary Table 10). Normalizing the Sircol-derived collagen levels (mg collagen/g wet weight) to our total protein estimates (mg protein/g wet weight), resulted in, on average, 5% of collagen over total protein (Supplementary Table 10). This estimation is lower compared to the 12–17% of collagen content by the hydroxyproline measurements. We speculate that the underestimation is in part due to the lack of proper solubilization of collagen using this approach.
Total protein content and amino acid composition of mice
Next, we asked how much total collagen and protein is found in entire mice. In the literature, we found only one publication from 1977 that estimated a total protein of 1.5 g and 1.4 g per 3-week-old male or female CB57BL/6 J ob/ob mice, respectively, based on nitrogen levels40. We quantified higher protein levels of 3.8 g of total protein per 4-week-old BL6 wild-type mice (Fig. 3a). We found wild-type mice contain about 4–8 g of total protein and 0.5–1.2 g of total collagen per mouse (Fig. 3a,b), which corresponds to 265 mg of protein and 37 mg of collagen per g body weight on average (Supplementary Table 7). The total protein and collagen per mouse were poorly correlated with its total body wet weight (Fig. 3c–f). The collagen levels correlated better, which makes sense since collagens are structural and scaffold proteins important for tissue geometry and size (Fig. 3d).
Next, we asked what is the relative amino acid composition of mice. We found that glycine with 15% was the most abundant amino acid in mice (Fig. 3g,h, Supplementary Table 8). Proline was 6.5%, and hydroxyproline (4HP) was about 1.5% relative to all other amino acids (Fig. 3g,h, Supplementary Table 8). Thus, we provide the first relative amino acid composition of entire mice.
Collagen quantification using proteomics
Next, we used a quantitative proteomics approach. To enrich extracellular matrix proteins, we separated whole mouse lysates by SDS-PAGE, gel lanes were cut into five slices per sample, and proteins therein were digested with trypsin overnight. Generated peptides were extracted and analyzed by LC–MS/MS (see Methods for details). Extracted ion currents of peptides were used for label-free quantification employing the iBAQ algorithm41. For each sample, the relative proportion of all quantified collagen proteins was calculated in comparison to the overall intensity of all proteins. We quantified 4.6 ± 0.3% for 4-week-old CD1 mice, 4.6 ± 1.4% for 4-week-old C57BL/6 mice, and 6.4 ± 0.4% for 21-week-old C57BL/6 mice (Supplementary Table 11). Although these percentages are similar to the Sircol collagen quantifications, these percentages were lower compared to the hydroxyproline measurement, probably due to the crosslinked nature of collagens, and we would expect that trypsin is less efficient in generating measurable peptides.
Matreotype of entire mice
To establish matreotypes15, which is the composition of the ECM (i.e., matrisome) associated with entire mice, we analyzed our ECM-enriched proteomics data according to matrisome category42. We detected 167–233 of the 1110 mouse matrisome43 proteins in 4-week-old CD1, 4-week-old C57BL/6, and 21-week-old C57BL/6 wild-type mice (Fig. 4, Supplementary Table 11). To our surprise, serine protease inhibitors (Serpina1,3) were more abundant than collagen type I (Col1a1,2), followed by fibrillin (Fbn1), elastin (Eln), and collagen type 6 (Col6a1,2) (Supplementary Fig. 3, Supplementary Table 11). Furthermore, considering all detected proteins, albumin (Alb), hemoglobin (Hbb), and parvalbumin (Pvalb) were the most abundant protein, followed by histones, actin (Acta1), serine protease inhibitors (Serpina1,3), and then collagen type I (Col1a1,2) (Supplementary Fig. 3, Supplementary Table 11). Thus, we provide the matrisome atlas of entire mice and found that collagens are not the most abundant protein.
Discussion
Biomedical engineering and science rely on previously quantified reference points to build models and compare health versus disease states. Collagen has been assumed to be the most abundant protein of mammals making up two-thirds of total protein. However, over 50 studies ranging from material engineering to cancer research relied on these reference points without an original reference of such quantification.
Although nowhere directly stated, this statement might be based on a review paper from 1961 stating that “collagen formed about a fifth of the total body protein”37. This statement was based on the authors’ previous study from 1958, where nine male mice carcasses were divided into five parts, and hydroxyproline levels over total nitrogen levels were determined, which were added together, resulting in 17.7% of total collagen over protein38. Astonishingly, our quantification revealed a similar percentage of 16.8% of total collagen over protein of entire male mice. This suggests that we had a correct reference point but, for an unknown reason, was inflated to 30%, an interesting hyperbole that might be worth examining by science philosophers.
Besides this, we find sex-specific differences in collagen over protein for females of 12% and males 17%. We provide reference points for collagen over protein levels for ten tissues. Using proteomics, we find that collagen is not the most abundant protein. We provide the matreotype of entire mice as a reference point for an ECM protein atlas. In addition, we provide reference points for milligram protein per gram mice and amino acid compositions, whereby glycine is the most abundant amino acid.
Although not systematically examined, there were few reports of collagen over protein ratios for mice and rat tissues. Our estimates were similar to previous estimates even across species. Also, estimating collagen over protein levels using colorimetric compared to HPLC of hydroxyproline levels was similar. However, using collagen staining and proteomics approaches showed internally consistent collagen percentages of 5–6%, which are lower than when quantified the same animals with hydroxyproline quantifications. For both methods, samples need to be separated via either running on a gel or on a column, probably losing material. Furthermore, a lower yield might also be due to the inherent insolubility of the collagen networks44, and in the case of proteomics, limited protease access to crosslinked peptides and lack of identification of crosslinked peptides, limitations we had not addressed here. A previous study on tendons and ligaments achieved complete solubilization by successive protease and elastase digestion, which allowed the harvesting of more collagen and elastin45. Thus, such an approach might allow quantifying more collagen over protein levels, which might asymptote to our hydroxyproline to collagen estimations.
Our proteomics analysis revealed collagen type I (Col1a1,2), fibrillin (Fbn1), elastin (Eln), and collagen type 6 (Col6a1,2) as the most abundant core matrisome proteins. This is comparable to a recent meta-analysis of different proteomics studies on articular cartilage, bone, ligament, skeletal muscle, and tendon46. It is also similar to tissues with less ECM, such as mammary glands and liver47,48. As in our study of entire mouse proteomics, collagen type I is the most abundant collagen in all tissues but shows a wide range across different tissues. This suggests that collagen type I is the most abundant collagen in mammals across tissues.
In summary, we quantified collagen levels for mice tissues and entire mice. We reconcile assumptions made in textbooks and research papers. Thus, we provide several important reference points to help interpret new findings related to collagen levels.
Materials and methods
Sample preparation
Mouse tissue
CD1 mice were provided by the institute of laboratory animal science (University of Zürich; AniMatch). The ethics committee of the veterinary office of the Canton of Zürich (Licence Nr. ZH092-19) has approved the study of all other animals, and all methods were carried out in accordance with relevant guidelines and regulations. The study was carried out in compliance with the ARRIVE guidelines. All experiments were performed post-mortem.
Total mouse lysate
We used inbred C57BL/6 and outbred CD1 mice for the total lysate experiments (9 to 10 CD1 and 10 C57BL/6 mice, all 4 weeks old female, and 12 C57BL/6 mice (male: 5 × 7 weeks, female: 5 × 7 weeks and 2 × 21 weeks). These mice were euthanized by placing them under CO2 narcosis followed by cervical dislocation, frozen at − 20 °C and then individually mixed with a NutriBullet 600 Series after having added 40 mL of distilled water (Sigma-Aldrich) to the 4 weeks old C57BL/6 mice, 50 mL to the CD1, and 20 mL to the 7–21 weeks old C57BL/6 mice. This mixture was filled into 50 mL tubes and was bead-bounced for 3 × 10 to 6 × 10 min. Then 500 µL was aliquoted into cryotubes and sonicated for 20 s to homogenize it completely. Despite homogenizing, the hair remained in the samples and was not filtered out. The aliquots were snap-frozen and stored at − 80 °C.
Organ samples
Mice were euthanized by placing them under CO2 narcosis followed by cervical dislocation, and skin punches (5 mm, Biopsy punch, Stiefel) were collected from the shaved back. To harvest the tail skin, the tail was cut, and the skin was flayed. A 5 mm piece of the tail skin was cut with a razor blade 3 cm and 3.5 cm away from the tip because, at this distance, the skin looked uniformly in the different age cohorts. Lung, heart, liver, colon, kidney, skull, brain, femur, tail, and hamstring muscle were dissected, snap frozen, and stored at − 80 °C.
Amino acid analysis
Whole mouse lysate aliquots were pestled in liquid nitrogen and sent to Analytical Research Services (Bern, Switzerland) for amino acid analysis. They measured the probes according to Bidlingmeyer et al.49. In brief, 25 µL of the homogenate was pipetted for hydrolysis into a glass tube (6 × 50 mm) and vacuum-speed dried, then 25 µL 6 M HCl with 0.1% phenol was added. For hydrolysis, 200 µL 6 M HCl/0.1% phenol was pipetted into a vessel with frit, and the glass tube was placed inside. The vessel was flooded with nitrogen and sealed under a vacuum (ca. 200 mbar). Samples were hydrolyzed for 22 h at 115 °C. Then they were dried and resuspended in 350 µL 0.1% trifluoroacetic acid (TFA). After a short centrifugation, the supernatant was diluted 1:10 with water. An aliquot of 10 µL was transferred into a tube and dried. After the reaction with phenylisothiocyanate (PITC) the aliquot was dried again and dissolved in 50 µL high-performance liquid chromatography (HPLC) Eluent A. 20 µL was injected into an HPLC (Dionex with P680 HPLC pump, ASI 100 (automated sample injector), thermostatted column compartment TCC-100 and ultimate 3000 RS diode array detector).
Quantification of total collagen and protein
Total mouse lysates
Collagen and protein assay
To quantify the total collagen normalized to the total protein of the total lysate, the QuickZyme total collagen and total protein assays (Biosciences) were performed according to the manufacturer's protocol. The QuickZyme total collagen assay is based on a colorimetric read-out for free hydroxyproline amino acids. Free hydroxyproline is oxidized with Chloramine-T to pyrrole and then stained with Ehrlich’s reagent50. Pure rat tail collagen was hydrolyzed, and different concentrations were loaded according to the protocol on the assay plate to be able to generate a standard collagen curve, which was used to determine the total collagen content in our samples.
The QuickZyme total protein assay is based on a colorimetric read-out. Hydrolyzed free amino acids are assessed by using genipin, and the color change acts as a read-out34. According to the protocol, a standard curve of pre-hydrolyzed bovine serum albumin (BSA) was employed to determine total protein concentration. Before conducting the assays, the samples were vortexed for 10 s. 300 mg of the aliquot was used for the lysis, ensuring that all hair in the sample was weighed. 300 µL 12M HCl (VWR chemicals) and 400 µL 6M HCl were added. This mixture was hydrolyzed at 95 °C for 20 h to the amino acids. The hydrolyzed samples were centrifuged for 10 min at 13,000×g. The amino acid homogenate was split to assess collagen and protein levels.
For the collagen assay, the hydrolysate was diluted (2:3) with water to reach a final HCl concentration of 4M. The same dilution was conducted with the samples for the protein assay but with 6M HCl. Then the samples were further diluted to be in the range of the standard curve (collagen samples dilution was 1:17 with 4M HCl, and protein samples dilution was 1:15 with 6M HCl). These dilutions were loaded on the 96-well plates (one for the collagen assay and one to conduct the protein assay), and the buffer and color reagent required were added. After 1 h incubation time at 60 °C for the collagen plate and at 85 °C for the protein plate, the plates were placed on ice for 5 min or 7 min, respectively, and absorbance was measured at 570 nm (CLARIOstar, BMG Labtech, CLARIOstar-Data Analysis, 3.1).
To calculate the total milligram collagen or protein per gram of an entire mouse, the dilution factor for homogenizing the mouse was considered.
Hydroxyproline assay
Hydroxyproline concentrations were measured according to the Hydroxyproline Assay Kit (Sigma-Aldrich) protocol. This assay also detects free 4HP that is oxidized with Chloramine-T to pyrrole and stained with Ehrlich’s reagent50. Samples were diluted 1:2 with 12M HCl and hydrolyzed at 120 °C for 3 h. Activated charcoal was added, and tubes were centrifuged at 10,000×g for 3 min. 50 µL of the standard collagen samples (see 3.2.3.1.3) were loaded onto the plate, and the other samples were diluted further in two steps (1:10 and 1:9) to be in the range of the hydroxyproline standard curve. Into the plate, 50 µL of the final dilution was loaded and dried at 60 °C. 100 L of Chloramine T/Oxidation Buffer mixture was added and incubated at room temperature for 5 min. 100 µL of the diluted DMAB reagent (DMAB: perchloric acid/Isopropanol = 1:1) was added to each well and incubated for 90 min at 60 °C. Absorbance was measured at 560 nm. The results were multiplied by the hydroxyproline to collagen conversion factor (7.46), as stated in Neuman et al.51.
Collagen and hydroxyproline standard preparation
To verify the collagen content of the samples, standard collagen concentrations, and hydroxyproline standard concentrations were loaded on the assay plates used for the assays comparison and prepared as follows.
For the collagen assay, the collagen assay standard (rat tail collagen, 1200 µg/mL (Bioscience)) was prepared according to protocol. To be able to generate a 4HP standard curve on the collagen assay plate, the 4HP standard (1 mg/mL (Sigma-Aldrich)) was diluted 1:10 with HCl (0.1 mg/mL in 6M HCl). 125 µL of this concentration was mixed with 62.5 µL 4M HCl and 62.5 µL water. 80 µL of this dilution was transferred and mixed with 20 µL 4M HCl (40 µg/mL). The further standard concentrations were reached by consecutive 1:2 dilutions. (20 µg/mL, 10 µg/mL, 5 µg/mL, 2.5 µg/mL). 4M HCl was used as blank.
For the 4HP assay, 250 µL of the collagen standard was diluted with 250 µL water (600 µg/mL). This solution was diluted 1:2 with water (300 µg/mL). 240 µL of the last dilution was mixed with 120 µL of water (200 µg/mL). A 180 µL aliquot was transferred to a new tube and diluted with 180 µL H2O (100 µg/mL). This 1:2 dilution was repeated 4 times (50 µg/mL, 25 µg/mL, 12.5 µg/mL, 6.25 µg/mL), and a water blank was prepared. To be in the range of the 4HP standard curve, 150 µL of each sample and blank were pipetted into a new tube and diluted with 150 µL 12M HCl. After hydrolyzation (3 h at 120 °C), 50 µL of each sample was pipetted on a 96-well plate. The hydroxyproline standard (1 mg/mL) was prepared according to the protocol for the 4HP assay.
Sircol™ Soluble Collagen Assay
Total collagen concentrations were also measured with the Sircol™ Soluble Collagen Assay (biocolor life science assay). 100 mg of the total lysates were mixed with 1 mL 0.1 mg/mL pepsin in 0.5 M acetic acid (Merck) and put at 4 °C overnight. The next day, 500 µL 0.32 mg/mL pepsin in 0.5M acetic acid (Merck) was added and again left overnight at 4 °C. 1 mL of this solution was then transferred into an Amicon Ultra-2 Centrifugal Filter Unit (Millipore). This unit was centrifuged for 60 min at 3220 rcf. The flow-through was discarded, and the collagen remaining in the filter was collected by spinning the inverted tube for 2 min at 1000 rcf. 0.5M acetic acid (Merck) was added to reach 500 µL. 50 µL of this solution was pipetted into a new tube, and the Sircol assay was performed according to protocol. Finally, 1 mL alkali reagent was added. 200 µL was pipetted on a 96-well plate.
The rat tail collagen standard, 1200 µg/mL (Bioscience) used in the collagen assay was also measured with three different pre-treatments. In the first treatment, 100 µL of the standard was digested and filtered like the samples described above. 150 µL 0.5M acetic acid was then added to the collagen collected in the tube. For the second, 120 µL of the standard was diluted with 880 µL 0.5M acetic acid and only filtered like the samples. This time 0.5M acetic acid was added to reach 200 µL. 50 µL of each collagen standard sample was transferred in separate tubes, and 50 µL 0.5M acetic acid was added. Finally, 30 µL of the untreated collagen standard was diluted with 70 µL 0.5M acetic acid. The Sircol assay was performed according to protocol. After having added 1 mL alkali reagent, 200 µL of the differently treated standards were transferred to a 96-well plate.
Bicinchoninic acid (BCA) protein assay
Total protein concentrations were also measured with the Pierce™ BCA Protein Assay Kit (Thermo Scientific). This assay combines the protein-induced biuret reaction with the colorimetric detection of the resulting cuprous cation (Cu1+) by BCA52.
The standard for this assay was prepared according to the protocol. In the first replicate, samples were vortexed, and hairs were included. For the second replicate, the samples were sonicated for 20 s, and hairs were excluded, and for the last replicate, samples were not pretreated. The total lysate was diluted (3:10 and 2:3). 25 µL of the final dilution was pipetted into a 96-well microplate. 200 µL of the BCA working reagent was added before shaking the plate for 30 s on a plate shaker. Afterward, the plate was covered and incubated at 37 °C for 30 min. The plate was cooled to room temperature, and absorbance was measured at 562 nm.
Organs
To quantify collagen normalized to protein levels in organs, the QuickZyme total collagen and total protein assays (Biosciences) were conducted. For most organs, 0.1 g of frozen organ tissue was weighted and used for the assay. For skin tissue, a skin punch (or half of a skin punch) was weighted, and for the skull and femur, 0.01 g was used. Organs were diluted with 6M HCl using the following formula: v = aw × Dorgan − aw (v = µL 6M HCl added to organs, aw = organ assay weight (mg), Dorgan = experimentally determined organ specific dilution factor, Dcolon = 12.5, Dfemur = 74.2, Dlung = 10.9, Dskin or tendon = 73.5, Dmuscle = 11.2, Dbrain = 5.7, Dheart = 11.0, Dliver = 7.3, Dskull = 98.2, Dkidney = 10.8) and hydrolyzed at 95 °C for 20 h. The hydrolyzed samples were processed like the total mouse lysates, meaning diluted to 4M HCl (2:3) with water if collagen was intended to be measured. Protein samples were diluted 2:3 with 6M HCl. To finally be within the range of the standard curve, the samples were diluted further. The liver and brain samples were diluted 1:1 for collagen and 1:10 for protein measurements. Tendon samples were processed like skin samples, but collagen samples were diluted 1:16. All other organs were diluted 1:8 for protein and collagen measurements.
Statistical analysis
Statistical analysis was performed with GraphPad Prism 8.0 using a one-way or two-way analysis of variance (ANOVA) followed by a posthoc Tukey’s, Dunnett, or Sidak multiple comparisons test or a two-tailed unpaired t test with a 95% confidence interval. Normal distribution of the values was assumed. Values represent means ± SD. *P ≤ 0.05, **P ≤ 0.01, ***P ≤ 0.001, and ****P ≤ 0.0001.
Quantitative proteomic analysis of mouse samples
Mouse samples were dissolved in 10 mM Tris–HCl buffer containing 4% SDS, pH 7.5, followed by sonication for 2 min. SDS-PAGE loading buffer was added, and protein samples were heated for 10 min at 95 °C, treated with 1 mM DTT, and alkylated using 5.5 mM iodoacetamide for 10 min at room temperature. Samples were fractionated on 4–12% gradient gels, and proteins were in-gel digested with trypsin (Promega), five fractions per sample. Tryptic peptides were purified by STAGE tips, and LC–MS/MS measurements were performed on a QExactive Plus mass spectrometry coupled to an EasyLC 1200 nanoflow-HPLC (all Thermo Scientific). MaxQuant software (version 1.6.2.10)53 was used for analyzing the MS raw files for peak detection, peptide quantification, and identification using a Uniprot mouse database (version April 2016). Carbamidomethylcysteine was set as fixed modification, and oxidation of methionine was set as variable modification. The MS/MS tolerance was set to 20 ppm, and four missed cleavages were allowed for Trypsin/P as enzyme specificity. Based on a forward-reverse database, protein and peptide FDRs were set to 0.01, the minimum peptide length was set to seven, and at least one unique peptide had to be identified. The match-between run option was set to 0.7 min. MaxQuant results were analyzed using Perseus software (version 1.6.2.3)54.
To calculate the relative abundance of collagen proteins compared to the detected proteome, replicate iBAQ values of respective groups were averaged, and the percentage of collagens was obtained by this formula: sum of collagen iBAQ values*100/sum of iBAQ values of all proteins.
Matrisome annotation of proteomics data
Peptide annotations below a Q-value of 0.05 were augmented with the murine matrisome annotations (http://matrisomedb.pepchem.org)42. Protein abundances for each mouse strain and age group are illustrated by the matrisome category as a tile map. Mean abundances above the overall median protein abundance are displayed in full color, transparent if below, and white if not detected.
Data availability
All data generated or analyzed during this study are included in this published article and its supplementary information files.
References
Glass, D. J. Experimental Design for Biologists 2nd edn. (Cold Spring Harbor Laboratory Press, 2014).
Milo, R., Jorgensen, P., Moran, U., Weber, G. & Springer, M. BioNumbers—The database of key numbers in molecular and cell biology. Nucleic Acids Res. 38, D750–D753 (2010).
Whitslar, W. H. A study of the chemical composition of the dental pulp. Am. J. Dent. Sci. 23, 350–355 (1889).
Teuscher, A. C., Statzer, C., Pantasis, S., Bordoli, M. R. & Ewald, C. Y. Assessing collagen deposition during aging in mammalian tissue and in Caenorhabditis elegans. Methods Mol. Biol. Clifton N. J. 1944, 169–188 (2019).
Wynn, T. A. Fibrotic disease and the TH1/TH2 paradigm. Nat. Rev. Immunol. 4, 583–594 (2004).
Landewé, R. B. M. et al. Arthritis instantaneously causes collagen type I and type II degradation in patients with early rheumatoid arthritis: A longitudinal analysis. Ann. Rheum. Dis. 65, 40 (2006).
Elango, J. et al. Paradoxical duel role of collagen in rheumatoid arthritis: Cause of inflammation and treatment. Bioengineering 9, 321 (2022).
Xu, S. et al. The role of collagen in cancer: From bench to bedside. J. Transl. Med. 17, 309 (2019).
Socovich, A. M. & Naba, A. The cancer matrisome: From comprehensive characterization to biomarker discovery. Semin. Cell Dev. Biol. 89, 157–166 (2019).
Andrews, J. P., Marttala, J., Macarak, E., Rosenbloom, J. & Uitto, J. Keloids: The paradigm of skin fibrosis—Pathomechanisms and treatment. Matrix Biol. 51, 37–46 (2016).
Perlish, J. S., Lemlich, G. & Fleischmajer, R. Identification of collagen fibrils in scleroderma skin. J. Invest. Dermatol. 90, 48–54 (1988).
Izzi, V., Davis, M. N. & Naba, A. Pan-cancer analysis of the genomic alterations and mutations of the matrisome. Cancers 12, 2046 (2020).
Rodriguez-Feo, J., Sluijter, J., Kleijn, D. & Pasterkamp, G. Modulation of collagen turnover in cardiovascular disease. Curr. Pharm. Des. 11, 2501–2514 (2005).
Horn, M. A. & Trafford, A. W. Aging and the cardiac collagen matrix: Novel mediators of fibrotic remodelling. J. Mol. Cell Cardiol. 93, 175–185 (2015).
Ewald, C. Y. The matrisome during aging and longevity: A systems-level approach toward defining matreotypes promoting healthy aging. Gerontology 66, 266–274 (2020).
Meyer, M. Processing of collagen based biomaterials and the resulting materials properties. Biomed. Eng. Online 18, 24 (2019).
Nutton, V. Ancient Medicine (Routledge, 2012). https://doi.org/10.4324/9780203081297.
Schweitzer, M. H. et al. Biomolecular characterization and protein sequences of the Campanian hadrosaur B. canadensis. Science 324, 626–631 (2009).
Harvey, V. L., Egerton, V. M., Chamberlain, A. T., Manning, P. L. & Buckley, M. Collagen fingerprinting: A new screening technique for radiocarbon dating ancient bone. PLoS ONE 11, e0150650 (2016).
Rybczynski, N. et al. Mid-Pliocene warm-period deposits in the High Arctic yield insight into camel evolution. Nat. Commun. 4, 1550 (2013).
Garrone, R. Collagen, a common thread in extracellular matrix evolution. Proc. Indian Acad. Sci. Chem. Sci. 111, 51–56 (1999).
Ricard-Blum, S. The collagen family. Cold Spring Harb. Perspect. Biol. 3, a004978–a004978 (2011).
Shoulders, M. D. & Raines, R. T. Collagen structure and stability. Annu. Rev. Biochem. 78, 929–958 (2009).
Belostotsky, R. & Frishberg, Y. Catabolism of hydroxyproline in vertebrates: Physiology, evolution, genetic diseases and new siRNA approach for treatment. Int. J. Mol. Sci. 23, 1005 (2022).
Bielajew, B. J., Hu, J. C. & Athanasiou, K. A. Collagen: Quantification, biomechanics and role of minor subtypes in cartilage. Nat. Rev. Mater. 5, 730–747 (2020).
Naba, A. et al. The matrisome: In silico definition and in vivo characterization by proteomics of normal and tumor extracellular matrices. Mol. Cell. Proteomics: MCP 11, M111.014647 (2012).
Capella-Monsonís, H., Coentro, J. Q., Graceffa, V., Wu, Z. & Zeugolis, D. I. An experimental toolbox for characterization of mammalian collagen type I in biological specimens. Nat. Protoc. 13, 507–529 (2018).
Lareu, R. R., Zeugolis, D. I., Abu-Rub, M., Pandit, A. & Raghunath, M. Essential modification of the Sircol Collagen Assay for the accurate quantification of collagen content in complex protein solutions. Acta Biomater. 6, 3146–3151 (2010).
Gameil, A. H. M., Yusof, F., Azmi, A. S. & Puad, N. I. M. Progress in the detection and quantification of collagens: A review. IOP Conf. Ser. Mater. Sci. Eng. 1192, 012005 (2021).
Grant, M. E. & Prockop, D. J. The biosynthesis of collagen. 1. N. Engl. J. Med. 286, 194–199 (1972).
Lowry, O. H., Gilligan, D. R. & Katersky, E. M. The determination of collagen and elastin in tissues with results obtained in various normal tissues from different species. J. Biol. Chem. 139, 795–804 (1941).
Neuman, R. E. & Logan, M. A. The determination of collagen and elastin in tissues. J. Biol. Chem. 186, 549–556 (1950).
Prockop, D. J. & Udenfriend, S. A specific method for the analysis of hydroxyproline in tissues and urine. Anal. Biochem. 1, 228–239 (1960).
Lee, S.-W., Lim, J.-M., Bhoo, S.-H., Paik, Y.-S. & Hahn, T.-R. Colorimetric determination of amino acids using genipin from Gardenia jasminoides. Anal. Chim. Acta 480, 267–274 (2003).
Hubrecht, R. & Kirkwood, J. The UFAW Handbook on the Care and Management of Laboratory and Other Research Animals (Wiley, 2022). https://doi.org/10.1002/9781444318777.
Mitchell, S. J. et al. Daily fasting improves health and survival in male mice independent of diet composition and calories. Cell Metab. 29, 221-228.e3 (2019).
Harkness, R. D. Biological functions of collagen. Biol. Rev. Camb. Philos. Soc. 36, 399–463 (1961).
Harkness, M. L., Harkness, R. D. & James, D. W. The effect of a protein-free diet on the collagen content of mice. J. Physiol. 144, 307–313 (1958).
Neuman, R. E. & Logan, M. A. The determination of hydroxyproline. J. Biol. Chem. 184, 299–306 (1950).
Lin, P. Y., Romsos, D. R. & Leveille, G. A. Food intake, body weight gain, and body composition of the young obese (ob/ob) mouse. J. Nutr. 107, 1715–1723 (1977).
Schwanhäusser, B. et al. Global quantification of mammalian gene expression control. Nature 473, 337–342 (2011).
Shao, X., Taha, I. N., Clauser, K. R., Gao, Y. & Naba, A. MatrisomeDB: The ECM-protein knowledge database. Nucleic Acids Res. 48, D1136–D1144 (2019).
Naba, A. et al. The extracellular matrix: Tools and insights for the “omics” era. Matrix Biol. 49, 10–24 (2016).
Last, J. A. & Reiser, K. M. Collagen biosynthesis. Environ. Health Perspect. 55, 169–177 (1984).
Sato, N. et al. Proteomic analysis of human tendon and ligament: Solubilization and analysis of insoluble extracellular matrix in connective tissues. J. Proteome Res. 15, 4709–4721 (2016).
McKee, T. J., Perlman, G., Morris, M. & Komarova, S. V. Extracellular matrix composition of connective tissues: A systematic review and meta-analysis. Sci. Rep. UK 9, 10542 (2019).
Arteel, G. E. & Naba, A. The liver matrisome—Looking beyond collagens. JHEP Rep. 2, 100115 (2020).
Goddard, E. T. et al. Quantitative extracellular matrix proteomics to study mammary and liver tissue microenvironments. Int. J. Biochem. Cell Biol. 81, 223–232 (2016).
Bidlingmeyer, B. A., Cohen, S. A. & Tarvin, T. L. Rapid analysis of amino acids using pre-column derivatization. J. Chromatogr. B Biomed. Sci. Appl. 336, 93–104 (1984).
Prockop, D. J. & Udenfriend, S. A specific method for the analysis of hydroxyproline in tissues and urine. Anal. Biochem. 1, 228–239 (1960).
Neuman, R. E. & Logan, M. A. The determination of collagen and elastin in tissues. J. Biol. Chem. 186, 549–556 (1950).
Smith, P. K. et al. Measurement of protein using bicinchoninic acid. Anal. Biochem. 150, 76–85 (1985).
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).
Tyanova, S. et al. The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat. Methods 13, 731–740 (2016).
Acknowledgements
We thank Dr. Cyril Statzer for his assistance in the analysis and visualization of the proteomic dataset, Jasmin Mantl for help harvesting samples and initial work establishing protocols, Philippe Bugnon (LTK1 Kurse) and AniMatch for CD1 mice, Frank Zaucke for literature references, John Hourihan, Jan Gebauer, Mats Paulsson, Matthias Chiquet, Peter Bruckner, and Ewald lab members for discussion and comments on the manuscript.
Funding
This work was supported by the Swiss National Science Foundation [163898 and 190072] to CYE.
Author information
Authors and Affiliations
Contributions
All authors participated in designing the research, executing the experiments, and analyzing and interpreting the data. K.T. and C.Y.E. wrote the manuscript in correspondence with the other authors.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Tarnutzer, K., Siva Sankar, D., Dengjel, J. et al. Collagen constitutes about 12% in females and 17% in males of the total protein in mice. Sci Rep 13, 4490 (2023). https://doi.org/10.1038/s41598-023-31566-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-31566-z
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.