A large dataset of protein dynamics in the mammalian heart proteome

Lau, Edward; Cao, Quan; Ng, Dominic C.M.; Bleakley, Brian J.; Dincer, T. Umut; Bot, Brian M.; Wang, Ding; Liem, David A.; Lam, Maggie P.Y.; Ge, Junbo; Ping, Peipei

doi:10.1038/sdata.2016.15

Download PDF

Data Descriptor
Open access
Published: 15 March 2016

A large dataset of protein dynamics in the mammalian heart proteome

Edward Lau^1,2,
Quan Cao^1,2,3,
Dominic C.M. Ng^1,2,
Brian J. Bleakley^1,2,
T. Umut Dincer^1,2,4,
Brian M. Bot ORCID: orcid.org/0000-0002-2412-6826^1,5,
Ding Wang^1,2,
David A. Liem^1,2,
Maggie P.Y. Lam^1,2,4,
Junbo Ge³ &
…
Peipei Ping^1,2,4,6

Scientific Data volume 3, Article number: 160015 (2016) Cite this article

10k Accesses
64 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Protein stability is a major regulatory principle of protein function and cellular homeostasis. Despite limited understanding on mechanisms, disruption of protein turnover is widely implicated in diverse pathologies from heart failure to neurodegenerations. Information on global protein dynamics therefore has the potential to expand the depth and scope of disease phenotyping and therapeutic strategies. Using an integrated platform of metabolic labeling, high-resolution mass spectrometry and computational analysis, we report here a comprehensive dataset of the in vivo half-life of 3,228 and the expression of 8,064 cardiac proteins, quantified under healthy and hypertrophic conditions across six mouse genetic strains commonly employed in biomedical research. We anticipate these data will aid in understanding key mitochondrial and metabolic pathways in heart diseases, and further serve as a reference for methodology development in dynamics studies in multiple organ systems.

Design Type(s)	parallel group design • strain comparison design • time series design
Measurement Type(s)	protein-turnover assay
Technology Type(s)	mass spectrometry assay
Factor Type(s)	selectively maintained organism • alternate assay conditions • life cycle stage
Sample Characteristic(s)	Mus musculus • heart • blood plasma

Machine-accessible metadata file describing the reported data (ISA-Tab format)

Proteome-wide profiling and mapping of post translational modifications in human hearts

Article Open access 26 January 2021

A mass spectrometry workflow for measuring protein turnover rates in vivo

Article 04 November 2019

Quantitative analysis of global protein stability rates in tissues

Article Open access 29 September 2020

Background & Summary

Cellular proteomes are under constant insults. Regulation of proteome integrity requires chaperone-assisted folding of unfolded proteins, dissolution of misfolded aggregates, proteolytic removal of proteins, and other concerted proteostatic processes^1,2. Recent studies have associated proteostatic disruptions causatively to an expanding list of disorders, including cystic fibrosis, neurodegenerations, and cardiovascular diseases^3–8. In the heart, decreased proteolytic capacity and accumulating proteotoxcity have been shown to directly exacerbate outcomes in cardiac infarcts, hypertrophy, and failure^9,10. Because proteostatic events often trigger zero net change in protein abundance but instead alter protein temporal dynamics^11–13, they typically elude conventional experiments that measure only the steady-state abundance of proteins. Protein dynamics data are therefore sought to better describe homeostatic processes and enhance the utility of phenotyping-by-omics approaches. However, large-scale protein dynamics datasets have remained scarce, due to the specialized technologies necessary to measure turnover of individual proteins on a global scale.

We report here a large dataset of protein turnover dynamics in the heart of six common genetic strains of mice, acquired under both normal and hypertrophic conditions. The dataset contains over 1.92 million data points in protein isotope labeling kinetics, culminating as the in vivo turnover rates of 3,228 cardiac proteins and the expression levels of 8,064 proteins. Proteins with quantified dynamics belong to over 10 major cellular compartments and over 200 known pathways. Key proteins in mitochondria and metabolic pathways are encompassed, in addition to contractile machineries and sarcolemmal signaling proteins. To promote data reusability, we describe four example use cases where this dataset may be re-analyzed to support basic research, translational investigation, omics data integration, and kinetic modeling.

The present dataset was collected using a technology platform we recently developed, which overcame several technical challenges in quantifying individual protein turnover rates on a proteome scale. The history of protein dynamics traces back to 1935, when Schoenheimer and Rittenberg synthesized the first isotopologs of biological molecules to demonstrate the continuous renewal of proteins throughout life^14,15. With recent progresses in shotgun proteomics, methodologies began to reach the sophistication and throughput required to understand turnover dynamics on a proteome scale. In contrast to steady-state protein abundance, which may be quantified directly in mass spectrometry (MS) by spectral intensity^16,17 or sampling frequency^18,19, protein turnover rates cannot be predicted from steady-state data²⁰, requiring instead methods that can distinguish old and new protein molecules in mass spectra^11,21. A common strategy is to introduce synthesized, isotope-tagged amino acids into cultured cells, such as in dynamic stable isotope labeling by amino acids in cell culture (dynamic SILAC) experiments that measure the time lapse required to fully label cellular proteins in culture^22,23. Despite successful applications of dynamic SILAC in vitro in bacterial^24,25, yeast²⁶, and cultured mammalian cells^20,27–29, protein turnover in freely-growing cultured cells does not recapitulate protein turnover in animals in vivo^8,30. Intact animal studies with synthetic essential amino acids such as [²H₈]-valine or [¹³C₆]-lysine have been demonstrated to allow in vivo dynamics measurements^11,28. These approaches have the advantage of labeling relatively immediate protein precursors, but require dietary modifications and may be financially costly in large studies. As an alternative strategy, we and others have proposed the use of minimal levels of deuterium oxide (²H₂O) introduced in the drinking water of living animals to label their entire proteomes^12,13,31.

In ²H₂O labeling, protein turnover is measured as the rate of deuterium atom incorporation into the tissue protein pool following protein synthesis and degradation. The incorporation of deuterium is reflected as a gradual shift in peptide isotope clusters towards higher masses in mass spectra during the course of labeling^32–34. We previously demonstrated several operational desiderata of ²H₂O labeling, which include its safety, bio-orthogonality, ease of monitoring label enrichment, rapid clearance following label withdrawal^35,36, and low cost in prolonged longitudinal studies^12,36,37. We developed a data science software application, ProTurn, to solve the precursor-product relationship of peptide ions in mass spectra³⁸ and automate large-scale ²H₂O-labeled data analysis^12,37. ProTurn reads in mass spectra and protein identification results to integrate the areas of all peptide isotope signals. Isotope patterns from multiple experimental time points are then tabulated for curve-fitting using a unified kinetic model, which corrects for potential labeling delays and computes turnover rates (k). The platform has proven applicable to recent mouse^12,37 and human³⁶ studies. The present study greatly expands on these previous efforts, encompassing the proteome-wide expression and dynamics of cardiac proteins across multiple sub-cellular compartments and six genetic backgrounds. All raw MS data have been deposited onto ProteomeXchange/PRIDE (PXD002870). Processed data tables and codes are available on Synapse (syn2289125), an open source platform for collaborative analyses provided by Sage Bionetworks.

Methods

Summary

The overall strategy for data acquisition, analysis, and dissemination is summarized in Fig. 1.

**Figure 1: Workflows for data acquisition, analysis, and dissemination.**

Animal models and in vivo isotope labeling

We performed ²H₂O labeling and the cardiac hypertrophy model on A/J, BALB/cJ, C57BL/6J, CE/J, DBA/2J, and FVB/NJ mice. The animals (male, 9–12 weeks of age) were purchased from The Jackson Laboratory, and upon arrival were acclimatized at the UCLA housing facilities for 48 h. To initiate deuterium labeling, each animal received two intraperitoneal injections of 500-μl 99.9% (molar ratio) ²H₂O-saline spaced 4 h apart, at 12 noon and at 4 pm on the starting day of labeling, respectively (Fig. 1a). Following the injections, the animals had access ad libitum to 8% (v/v; 7.25% molar ratio) ²H₂O in the drinking water supply for up to 14 days, along with standard lab chow (Harlan Teklad 7013). From each experimental group we euthanized two mice at each of day 0, 1, 3, 5, 7, 10, and 14 following the first ²H₂O injection at 12:00 noon to collect heart and plasma samples. In the cardiac hypertrophy groups, we surgically implanted subcutaneous micro-osmotic pumps (Alzet) at the initiation of the first priming dose of ²H₂O labeling¹². The micro-osmotic pumps were calibrated to deliver 15 mg⋅kg⁻¹⋅d⁻¹ of isoproterenol over 14 days. All animal procedures were performed in accordance with the Guide for the Care and Use of Laboratory Animals by the National Research Council and approved by the Animal Research Committee at UCLA.

Gas chromatography-mass spectrometry

To measure the rate and level of label enrichment in the animals, we performed gas chromatography-mass spectrometry (GC-MS) on the body water samples of the labeled animals. Mouse plasma (20 μl) was mixed with 2 μl of 10 N NaOH and 4 μl of 5% (v/v) acetone in acetonitrile. In parallel, standard curves were created by adding, to the 5% acetone in acetonitrile, 0 to 20% (molar ratio) of ²H₂O at 11 regular intervals in 1×phosphate-buffered saline in lieu of the mouse plasma. The sample and standard curve mixtures were incubated at room temperature for 12 h, after which the acetone portion was extracted by the introduction of 500 μl of chloroform and 0.5 g of anhydrous sodium sulfate. The extracted solution (1 μl) was analyzed directly by GC-MS (Agilent 6890/5975) using a J&W DB17-MS capillary column (Agilent, 30 m×0.25 mm×0.25 μm). The column temperature gradient was as follows: 60 °C initial; then 20 °C⋅min⁻¹ ramp to 100 °C; then 50 °C⋅min⁻¹ ramp to 220 °C; then 1 min hold. The MS operated in the electron impact mode (70 eV) and selective ion monitoring at m/z 58 and 59 with 10 ms dwell time.

Protein extraction and processing

Cardiac proteins were extracted following subcellular fractionation by differential centrifugation over density gradients³⁹. Excised cardiac tissues were homogenized in an extraction buffer composed of 250 mM sucrose, 10 mM HEPES, 10 mM Tris, 1 mM EGTA, 10 mM dithiothreitol, and a protease/phosphatase inhibitor cocktail (Pierce Halt), at pH 7.4. The homogenate was centrifuged (800×g, 4 °C, 7 min). The pellet was collected and resuspended on 880 mM sucrose, 500 μM MgCl₂ at 3,000×g at 4 °C for 15 min and collected as the nuclear and extracellular fraction. The supernatant from the first (800×g) centrifugation step was again centrifuged (4,000×g, 4 °C, 30 min) to collect the supernatant as the organelle-depleted intracellular fraction. The pellet from this step was washed and centrifuged again (4,000×g, 4 °C, 30 min) to collect the pellets as the mitochondrial and microsomal fraction. We estimated protein yields using bicinchoninic acid assays (Thermo Pierce), then solubilized the protein fractions with RIPA buffer and performed proteolysis on 100 μg protein from each fraction on 10,000-Da polyethersulfone filters (Nanosep; Pall Life Sciences). The RIPA buffer was exchanged on-filter with ammonium bicarbonate (100 mM, 100 μl). Specifically, the samples were reduced (70 °C, 5 min) with dithiothreitol (3 mM) and alkylated in the dark (ambient temperature, 30 min) with iodoacetamide (9 mM). Proteins were digested on-filter (16 h, 37 °C) with sequencing-grade modified trypsin (50:1, Promega). Proteolysis was terminated and peptides were eluted by incubation with 20 μl of 10% trifluoroacetic acid (Pierce) (30 min, 37 °C) followed by centrifugation (13,000×g, ambient temperature, 15 min).

Liquid chromatography-tandem mass spectrometry

We analyzed the peptide samples by liquid chromatography-tandem mass spectrometry (LC-MS/MS) to discern peptide abundance, isotope incorporation, and sequences. To reduce sample complexity and increase protein coverage, we performed high-pH/low–pH two-dimensional reversed-phase chromatography to separate peptide samples prior to MS/MS^40,41. The different pH values alter peptide charges to achieve orthogonal separation over hydrophobic stationary phases. First-dimension (high-pH) separation was conducted off-line on a Phenomenex C18 column (Jupiter Proteo C₁₂, 4-μm particle, 90-Å pore, 100 mm×1 mm dimension) at high pH using a Finnigan Surveyor liquid chromatography system. The solvent gradient profile was established by mixing solvent A (20 mM ammonium formate, pH 10) and solvent B (20 mM ammonium formate, 90% acetonitrile, pH 10) as follows: 0–2 min, 0–5% solvent B in solvent A; 3–32 min, 5–35% solvent B in solvent A; 32–37 min, 80% solvent B in solvent A; at 50 μl⋅min⁻¹ flow-rate. Fifty μg of proteolytic peptides were injected with a syringe into a manual 6-port/2-position switch valve. Twelve fractions were collected every 2 min from min 16–40, then desiccated in a vacuum concentrator and re-dissolved in 20 μl 0.5% formic acid with 2% acetonitrile prior to low-pH reversed-phase separation.

We performed on-line second-dimension (low-pH) reversed-phase chromatography on all samples using a single Easy-nLC 1000 nano-UPLC system (Thermo Scientific) on an EasySpray C18 column (PepMap, 3-μm particle, 100-Å pore; 75 μm×150 mm dimension; Thermo Scientific). Throughout the LC-MS/MS experiment, column temperature was held at a constant 50 °C. Each high-pH fraction was injected (10 μl) and analyzed sequentially using the auto-sampler installed on the nano-UPLC system. The solvent gradient profile was established by mixing solvent A (0.1% formic acid, 2% acetonitrile) and solvent B (0.1% formic acid, 80% acetonitrile) as follows: 0–110 min: 0–40% solvent B in solvent A; 110–117 min: 40–80% solvent B in solvent A; 117–120 min: 80% solvent B in solvent A; at 300 nl⋅min⁻¹. Column pressure was monitored to be within approximately 150 bar. High-resolution tandem mass spectrometry (MS/MS) was performed on a single LTQ Orbitrap Elite instrument (Thermo Scientific), coupled on-line to the nano-UPLC system through a Thermo EasySpray interface. MS signals were acquired in Fourier-Transform/Ion-Trap (FT/IT) mode: each FT MS1 survey scan was analyzed at 60,000 resolving power in profile mode, followed by rapid IT MS2 scans on the top 15 ions with monoisotopic peak selection. MS1 and MS2 target ion accumulation targets were 10⁴ and 10⁶, respectively. MS1 lock mass (m/z 425.120025) and dynamic exclusion (90 s) were used.

Peptide identification and protein inference workflow

The acquired mass spectra were analyzed according to Fig. 1b. MS2 spectra were converted to.ms2 format using the MSConvert application from the ProteoWizard software package (v.2.1)⁴². Peptide identification was performed using the database search algorithm ProLuCID⁴³ against a reverse-decoyed protein sequence database (Uniprot Reference Proteome Mus Musculus, reviewed, accessed April-08–2014, 16,672 forward entries and 16,672 decoy entries)⁴⁴. Static cysteine carbamidomethylation (C +57.02146 Da) modification and up to three of the following variable modifications were allowed: methionine oxidation (M +15.9949 Da), lysine acetylation (K +42.0106 Da), serine/threonine/tyrosine phosphorylation (S/T/Y +79.9663 Da), lysine ubiquitylation (K +114.0403 Da), and asparagine deamidation (N +0.9840 Da). Tryptic, semi-tryptic, and non-tryptic peptides within a 20-ppm parent mass window surrounding the candidate precursor mass were searched. Peptide ions from up to 3 isotopic peaks with fragment mass tolerance of 600 ppm were allowed. Protein inference was performed by DTASelect v.2.0 (ref. 45), requiring ≤1% global peptide false discovery rate and 2 unique peptides per protein for the protein to be considered identified. Modified or non-tryptic peptides were subjected to separate statistical filters to limit false discovery using the –modstat and –trypstat options in DTASelect, such that the inclusion of the variable modifications had no negative impact on the total protein identification counts.

Kinetic data processing strategy

We analyzed protein turnover kinetics and estimated fitting errors using a method we previously described^12,37. ProTurn automatically retrieved identified peptides that were uniquely assigned to proteins for area integration. Specifically, the acquired Orbitrap mass spectra in Thermo .raw formats were first converted into. mzML format using ProteoWizard (v.2.1)⁴², then input to ProTurn (v.2.0.5) for analysis. The ProTurn parameters were set as follows: area-under-curve integration width: 60 p.p.m., extracted ion chromatograph smoothing: Savitzky-Golay filter⁴⁶ over 7 data points. To further control against peptide false positive identifications, only peptides that were explicitly identified (1% FDR) and integrated in ≥4 time points were accepted for the calculation of protein abundance and turnover. The ‘Allow Peptide Modification’ option in ProTurn was turned on to include any potentially identified post-translationally modified (PTM) peptides in kinetic curve-fitting.

We fitted the kinetic data using the non-steady-state fitting method in ProTurn, which corrects for any time delay in label enrichment in the experimental model using a first-order kinetic curve to approximate the equilibration of ²H₂O in the total body water. The enrichment curve is described by two parameters: the rate (k_p) and plateau level (p_ss) of deuterium enrichment in body water, both of which were empirically derived from GC-MS measurements at the sampled time points as described above, such that:

\begin{matrix} (1) & p = p_{s s} \cdot (1 - e^{- k_{p} t}) \end{matrix}

The GC-MS data on enrichment kinetics were utilized in the unified kinetic equation to calculate the corrected fraction of newly synthesized peptides from the isotope envelope fractional abundance, which accounts for the fewer labels contributed by a newly synthesized protein early in the labeling procedure. This correction does not shift the kinetic curve rightward but instead retards its initial ascent to the plateau. Briefly, the overall change in the fraction of unlabeled peptide isotopomers in a peptide pool is assumed to follow first-order kinetics:

\begin{matrix} (2) & \frac{d A_{0}}{d t} = k \cdot (A_{0, m a x} - A_{0}) \end{matrix}

where A₀ is the fraction of the 0th isotopomer of a peptide isotope envelope at a given time, and A_0,max is the fraction of the 0th isotopomer of a pool of newly synthesized peptides. The value for A_0,max is given by A_0,max=a⋅(1–p)^N where a is the fraction of the 0th isotopomer in unlabeled samples as calculated by the natural abundance of heavy isotopes⁴⁷ and N is the number of labeling sites in the peptide sequences as estimated according to literature values⁴⁸. Further substituting into equation (1) and equation (2) gives a combined equation with two kinetic rate constants:

\begin{matrix} (3) & \frac{d A_{0}}{d t} = k \cdot (a {(1 - p_{s s} \cdot (1 - e^{- k_{p} t}))}^{N} - A_{0}) \end{matrix}

The integrated rate equation for equation (3) can be derived analytically as described in our previous publication¹², and is used in ProTurn optimizations to derive the turnover rate constants (k) of peptides (see below). The GC-MS measurements of animal labeling kinetics used to calculate p_ss and k_p are deposited on Synapse (ID: syn4847184).

To calculate peptide isotope cluster abundance in ProTurn, the summed area of the peptide isotope envelope was normalized to total spectral intensity, then normalized to the number of possible tryptic peptides 6–30 amino acids in length from in silico digestion of the protein¹⁷, and further normalized to the total relative areas within the sample. To calculate peptide turnover rates, the fractional abundance of the 0th mass isotopomer (m₀) from each integrated time point was modeled with the combined kinetic model above. To estimate the values of k, the model was iteratively fitted to the data points to minimize the squared residual values using the Nelder-Mead simplex method⁴⁹. Gradient optimization using the Broyden-Fletcher-Goldfarb-Shanno method⁵⁰ yielded same optimized values of k in our kinetic model. Peptide isotopomer time-series with R²≥0.8 or standard error of estimate (s.e.) ≤0.05 are considered to pass the stringency filter employed here, but all peptide time-series data are made available regardless of their R² or s.e. values. Protein turnover rates are reported as the median and median absolute deviation of the optimized turnover rate constants of all accepted constituent peptides.

Code availability

All software packages used for processing MS data are publicly available. ProTurn (v.2.0.5) is freely available on our website (http://heartproteome.org/proturn). Additional data analysis codes that perform summary statistics and create the presented figures were written in R (v.3.2.1) and may be freely accessed on Synapse (ID: syn2289125).

Data Records

Quantification of absolute in vivo cardiac protein turnover rates from mass spectrometry data

We acquired the dataset with the goal of examining the temporal dynamics of the mammalian heart proteome. Because protein expression is known to be highly variable among naturally occurring healthy genetic backgrounds of the same species^51–53, the genetic composition of animals on which experimentation is performed can impact the generalizability of discoveries from one experimental model to another. We therefore replicated the proteomics analysis of cardiac tissue samples six times in total (A/J, BALB/cJ, C57BL/6J, CE/J, DBA/2J, FVB/NJ). The six strains were chosen for their common usage in diverse genetic, surgical, and pharmacological models in human disease research, with noted prevalence of A/J in immunological studies, BALB/cJ mice in cancer and cardiovascular research; CE/J mice in cancer, neurobiological, and metabolic research, DBA/2J mice in cardiovascular and developmental studies; and C57BL/6J and FVB/NJ mice in general-purpose and genetics applications⁵⁴.

The entire dataset comprises 1,404 LC-MS/MS experiments performed over eight months of instrument time. All raw MS data files (in Thermo .raw files format) used for protein identification and quantification have been deposited to the PRIDE repository in the ProteomeXchange Consortium⁵⁵ and can be accessed with the dataset identifier PXD002870 (Table 1) (Fig. 1c) (Data Citation 1). A compressed archive is also available on PRIDE that contains all protein identification results from ProLuCID/DTASelect. All protein database search results including peptide and protein identification are recorded on Synapse⁵⁶, with the dataset identifier syn2289125 (Data Citation 2).

Table 1 Samples and Experimental Files in the Dataset.

Full size table

In total, we acquired over 1.92 million data points of quantified peptide isotopomer distributions corresponding to 341,353 peptide time series quantified at 4 or more time points over the course of labeling. We derived confident kinetic rate constants from 120,454 peptide time-series that passed our stringency filter (see ‘Technical Validation’ for discussions on filter selection below) (Data Citation 3). Label enrichment kinetics from GC-MS data, used to calculate peptide turnover rates, are available on Synapse (Data Citation 4). The peptides have median R² of 0.93 (interdecile range: 0.84–0.99) and median standard errors of 7.1% (interdecile range: 4.3–11.9%). Protein turnover rates were calculated as the median of the turnover rates of all member peptides that passed our stringency filter. The turnover kinetics of 3,228 cardiac proteins were quantified at a total of 23,929 protein-sample pairs (Table 2), averaging 7.4 samples in which each protein was quantified, with 2,599 (81%) proteins that were quantified in at least three samples, and 863 (27%) proteins quantified in all 12 samples. The median turnover rate of the quantified peptides is 0.094 d⁻¹ (9.4% replacement, half-life 7.3 d) with interdecile range of 0.037 d⁻¹ to 0.30 d⁻¹ (half-life 2.3 d to 18.9 d) (Fig. 2a).

Table 2 Protein identification and quantification by sample in the dataset.

Full size table

**Figure 2: Distributions of measured protein expression and turnover rates.**

The dataset includes protein dynamics information from diverse cellular compartments and 201 non-overlapping known cellular pathway groups (Tables 3 and 4). We retrieved pathway information from Reactome (release V53; July 2nd, 2015)⁵⁷. Pathways are considered to be covered in the dataset if five or more proteins in the pathway contain quantified half-life information amongst the 3,228 proteins with quantified half-life. Pathways with 50% or more overlap are combined into pathway groups. The dataset captures turnover rates that span >20-fold, and includes long-lived proteins such as histone H4 (Uniprot ID P62806; median half-life 54.6 days) and lamin-B1 (Uniprot ID P14733; median half-life 36.4 days), as well as fast turnover proteins such as apolipoprotein E (Uniprot ID P08226; median half-life 8 h). As previously reported, abundant proteins in the dataset have slower turnover on average (Fig. 2b). Data from each of the six mouse strains under normal and hypertrophy conditions show good correlations and overlaps (Fig. 2c).

Table 3 Selected major organelles and cellular components covered in this dataset.

Full size table

Table 4 Selected biological pathways covered in this dataset.

Full size table

The turnover rate output files from ProTurn have been uploaded to Synapse for public access and collaborative data analysis (syn2289125), including data tables on the outputs of all protein turnover calculations. An index file linking each sample to individual data table can be found at ID: syn4725236. Each ProTurn analysis contains three output files, which contains protein half-life information organized by peptide sequence (hl.out); isotopomer envelope quantities at each time point (hl-data.out); and the total normalized intensity of each protein in each MS experiment for protein quantification purposes (intensity.txt). The hl.out files contain 14 columns; each row is a unique peptide (sequence- and charge- unique) time series that passed the time point filter. The column labels are as follows: ID: internal index for referring to corresponding data in hl.out. UniProt: UniProt ID of the protein the peptide was assigned to in database search. Peptide: sequence (with PTM, if applicable). DP: number of data points. This corresponds to the number of time points in most runs. z: peptide ion charge state. mi: index of the isotopomer whose proportional abundance is quantified (0 in most ProTurn analyses). SS: residual sum of squares of fitting. a: initial isotopomer fractional abundance prior to labeling based on peptide sequence. pss: experimental steady-state relative label enrichment level. kp: experimental rate constant of label enrichment (d⁻¹). N: number of accessible labeling sites based on sequence information, as calculated according to literature values⁴⁸. k: fitted rate constant of peptide turnover (d⁻¹). dk: fitting error. R2: goodness-of-fit. The hl-data.out files contain 3 columns; each row is a unique peptide isotope cluster at a particular time point. The columns are as follow: ID: internal ID for referring to corresponding data in hl.out; t: time point (d); A₀: experimental fractional abundance of the isotopomer. The intensity.txt files contain eight columns, with the first column being the Uniprot ID of the protein, and each successive column thereafter denoting the relative intensity of the protein in each of the examined time point of analysis respectively (day 0, day 1, day 3, day 5, day 7, day 10, day 14).

Quantification of relative cardiac protein expression from mass spectrometry data

The dataset also provides protein expression information from two parallel methods of label-free quantification, based on the extracted ion chromatogram (XIC) intensity-based quantification carried out natively in ProTurn, and the normalized spectral abundance factor (NSAF) from the proteomics database search workflow ProLuCID/DTASelect^43,45. The XIC intensity-based quantification is normalized by the number of tryptic peptides that a protein is estimated to produce¹⁷. From the 8,227 identified proteins, ProTurn quantified the expression levels of 8,064 proteins by XIC areas-under-curves in over 58,748 protein-samples, averaging 7.29 samples in which each protein was quantified. A total of 5,806 (72%) proteins were quantified in at least three samples and 3,205 (40%) were quantified in all 12 samples. In parallel, the spectral count methods quantified the abundance of 3,484 proteins in at least three samples. Results from the two methods are correlated (Spearman’s correlation coefficient ρ: 0.73), but XIC intensity-based quantification achieves a broader dynamic range than spectral counts.

An index file containing all individual protein identification results and expression quantification values is uploaded at Synapse (syn4720849). Each record is in standard text format as output from ProLuCID/DTASelect without modification. The files contain search metadata, and, in each row, a peptide or protein identification record containing scores, spectral counts, mass shift, and other properties as described.

Technical Validation

Strategies for data quality assurance

To assure data quality, we use a multi-step strategy (see Methods), selecting only proteins that are identified at 1% FDR and quantified at 4 or more data points for curve-fitting, and accepting only the turnover rate values that pass a stringency filter (Fig. 3a). Data accuracy may be assessed both with regard to mass spectrometry measurements (i.e., whether the mass spectrometer measures the correct isotopomer fractional abundance for a given peptide) and to biological reality (e.g., whether the measured turnover rate accurately reflects the true turnover rate in vivo). To determine the former, we investigated the accuracy of experimental measurements of peptide isotopomer relative abundance in unlabeled sample against theoretical isotope distributions, which can be calculated from peptide sequences and the natural abundance of heavy isotopes in the biosphere⁴⁷. We find good agreement between theoretical and experimental values, with Spearman’s correlation coefficient ρ of 0.97 and root mean square error between experimental and theoretical isotope abundance of 2.6%, supporting overall excellent accuracy in the measurement of isotope relative abundance on mass spectra.

**Figure 3: Technical Validation of acquired turnover rates.**

Because gold standards of in vivo protein turnover rates are largely unavailable, we assayed the biological validity of the dataset using indirect methods. First, we derived and optimized data filtering criteria to maximize the number of peptides and proteins with quantitative information while simultaneously controlling intra-protein variability. These filter targets are predicated on the assumptions that for most scenarios the entire protein sequence are synthesized and degraded as a unit, and that turnover measurements are self-normalized and insensitive to differential yields across experimental fractions. Hence, multiple quantified peptides originating from the same measured cellular protein pool ought to yield identical turnover rates if data quality is pristine, barring any unresolved isoforms or identification ambiguity. We used goodness-of-fit (R²) as the primary criterion to filter out the peptide time-series in which the experimental A₀ data points deviated significantly from the optimized kinetic curves (Fig. 3b). Relaxing the acceptable R² value to below 0.8 resulted in gradual admittance of peptide time-series with increasing residuals, which in turn led to increases in intra-protein variances. In our experience, we found intra-protein variance of <30% in an experiment to be acceptable, when measured as the median of the median absolute deviations of best-fitted peptide turnover rate constants within each protein in an experiment. Peptide time series that are quantified at more time points show better fitting quality (Fig. 3c). On average, 35% of peptides were fitted with R²≥0.8. In peptides that were quantified in more time points, a higher proportion was closely fitted to the curves (R²≥0.8) (31.2% of peptides quantified in 4 time points; 31.8% of peptides quantified in 5 time points; 35.1% of peptides quantified in 6 time points; 40.5% of peptides quantified in 7 time points). As expected, peptides passing the stringency filters which are quantified in more time points also show lower standard errors of estimate (s.e.) (Fig. 3c). This discrepancy may be due to peptides possessing higher mass spectral intensity leading to improved detection as well as more accurate peak area integration.

Peptides undergoing relatively slow turnover, such as those originating from long-lived nuclear and mitochondrial proteins, lend to flat and horizontal kinetic curves (Fig. 3b, lower right panel). In these peptides, the kinetic curve has limited power to predict the value of A₀ within the studied timeframe, and consequently results in high residual variances. We therefore used a second filter based on total variances of the data points to include peptides with slower turnover and extend the overall dynamic range of the dataset. Stepwise permutation of R² and s.e. values (Fig. 3d) suggest that a small subset of peptides with low R² values are nevertheless well-fitted and contribute to consistent intra-protein turnover rates. These peptides are distinguished by their lower average turnover rates (Fig. 3d). To balance the quality and quantity of protein turnover measurements, we use a stringency filter of R²≥0.8 in the analysis here, and additionally include peptides whose standard errors are ≤0.05. This filter preserves a reasonable fraction of fitted peptides, whereas accepted peptides from each protein demonstrate consistent turnover rates, as can be seen in 14 distinct peptides independently measured from mitochondrial ATP synthase subunit d (ATP5H) (Fig. 3e). Other stringent filters may be easily applied to the raw data by the data consumers, if intra-protein variance is to be further minimized.

Usage Notes

Exploration of turnover kinetics in protein pathways

The mouse strain data documented here may be used to explore natural variations in protein expression and dynamics across healthy genetic backgrounds in a population, whereas the isoproterenol-treated samples may be used to explore variations in pathological responses towards a common stimulus. A number of post-analysis routes are available with the current dataset. For instance, the absolute turnover rate data may be used to interrogate whether particular protein pathways or sub-proteome (such as from an immunoprecipitation study) are co-regulated in their expression and turnover, which may be used to inform on regulatory mechanisms. Subcellular or extracellular localization may influence protein half-life due to the predominant proteolytic mechanisms presented in various compartments. For some proteins, exit from the tissue-of-origin may represent a considerable removal mechanism, which should be taken into account when interpreting data on proteins for which the total equilibrating pool may not reside entirely within the sampled proteome. Exported proteins may appear to have shorter half-life in the heart than when measured in extracellular locations (e.g., in blood), if it may be assumed that a majority of the proteins are quickly lost from the intracellular pool upon their synthesis in the heart. Under rapid export, only recently synthesized proteins may be sampled within the heart, and the sampled proteins resemble the completely turned-over protein pool (i.e., A_0,t resembles A_0,max), a relationship which has been exploited to measure label enrichment of precursor pools from exported liver protein products⁵⁸. The heart is not a major secretory organ, but a small number of proteins are actively secreted from the heart in health or in disease (e.g., natriuretic peptides A and B plus certain cytokines⁵⁹) whereas other proteins may be passively shredded upon cell death or injury (e.g., cardiac troponin I and T⁶⁰). Thus a number of different pathophysiological scenarios including protein differential expression, export, and loss may manifest as alterations in effective turnover rates. Kinetic information may also be used in computational modeling of cellular processes that require kinetic rate constants as input parameters, such as in the simulation of protein damage, protein homeostasis and long-term alterations in expression⁶¹, the stability of disease markers and agonist receptors⁶², or the stochasticity of transcript/protein expression regulation in vivo^25,29.

Alternative routes for data analysis

We present the following four specific examples of use cases in both basic and translational investigations as preliminary guidance on how the data may be analyzed or re-analyzed to gain biological insights. Specific step-by-step instructions of the four use cases below can be found on COPaKB (http://heartproteome.org) and on Synapse (http://doi.org/doi:10.7303/syn2289125).

Use case 1: Turnover Lookup. The half-life and abundance of proteins may be easily and individually retrieved via the data records on the Synapse project (http://doi.org/doi:10.7303/syn2289125) to support targeted queries. An investigator in cardiovascular medicine may be interested in mitochondrial biology in cardiovascular diseases. He or she may study a particular cardiac protein, such as mitofusin 2 (Uniprot ID: Q80U63), and its role in mitochondrial turnover during the development of heart failure. The absolute half-life of the protein in the heart can be looked up by downloading the protein turnover rate tables (e.g., see syn4725236 for an index of result files from all strains) and querying it with the protein’s UniProt ID for various investigation objectives, such as to calibrate the rate of decay of fluorescence timers in other experimental systems with the absolute time of mitochondrial turnover in the intact heart. It may also be possible to study the relationship between the half-life of mitofusin with that of the mitochondrial median and use the information to contextualize whole-mitochondrion dynamics data. Combined with additional data on the absolute quantification of mitofusin, one may calculate the absolute copy numbers of mitofusins that are being synthesized and degraded, and how they relate to mitochondrial turnover rates.

Use case 2: Pathway Analysis. Protein complexes, pathways, and cellular compartments may be analyzed to understand their individual protein half-life and expression. A basic scientist who studies Golgi proteins in the yeast may utilize the data records to ask fundamental questions on the cell biology of the organelle. For example, the median half-life of all Golgi proteins may be assayed and contrasted with other cellular components such as the endoplasmic reticulum (Table 3). Protein half-life and expression in relevant pathways may be retrieved (Table 4). Moreover, because biological fractionation was performed in acquiring this dataset, raw MS data files on PRIDE may be re-analyzed separately to compare the half-life of potential protein subpopulations that may be enriched in either the nuclear/insoluble, cytosol, or mitochondrial/ER isolations. In our previous investigations, we have noticed potential differences in hexokinase isoforms when assayed from cytosol and from the mitochondria³⁴.

Use case 3: MS Re-analysis. Proteomics data form an important component of omics-based disease phenotyping strategies. Nevertheless, it is generally appreciated that substantial amounts of information remain unextracted in shotgun proteomics datasets, in the form of unidentified spectra not matched to any peptides in the utilized protein identification workflow. By our estimation, more than half of the MS2 spectra in the uploaded .raw files may be unidentified under the protein identification filters we used. An omics scientist or informatician interested in proteome dynamics may therefore re-analyze the uploaded raw MS data with more sophisticated protein identification workflows in the future. For instance, one may explore the kinetic regulations of single amino acid variants using proteogenomics databases⁶³, or unknown protein identifications with improved search engine parameters⁶⁴. The re-analyzed data may be further combined with available complementary omics data (e.g., microarray data) from identical models for deeper insights. To support omics data re-analysis, we have built into ProTurn compatibility with database search results from multiple common search engines including Mascot (Matrix Science), SEQUEST/ProLuCID^43,45, MaxQuant/Andromeda⁶⁵, ProteomeDiscoverer (Thermo), and COPaKB⁶⁶.

Use case 4: Method development. This study provides a complete reference dataset from raw data to quantitative results, thus presenting opportunities for new software and kinetic models for protein turnover analysis to be developed and tested. Data analytical methods on the detection of differential protein turnover remain at infancy. Data scientists interested in protein dynamics may reanalyze the quantified peptide time-series for novel insights, for instance, on the behaviors of peptide series that do not conform to the current kinetic model or pass the current stringency filters. It is known that some proteins may have sequential, biphasic responses to stimuli^20,67, which may cause deviation from our kinetic model. On the other hand, some long-lived proteins including histones and nucleopore channel subunits^8,68 may not accumulate sufficient deuterium atoms during the labeling time period, and may be identified by data analysis routes that specifically target the absence of labels. We anticipate that further method developments will lead to a virtuous cycle of reusability of the proteome dynamics data in this dataset.

In summary, we describe here one of the largest experimental datasets on proteome dynamics in an animal model of human diseases, covering over 10 major organelles and 200 distinct cellular pathways. We envision the data will provide new molecular information on disease phenotypes and support further development in dynamics research.

Additional Information

How to cite this article: Lau, E. et al. A large dataset of protein dynamics in the mammalian heart proteome. Sci. Data 3:160015 doi: 10.1038/sdata.2016.15 (2016).

References

Balch, W. E., Morimoto, R. I., Dillin, A. & Kelly, J. W. Adapting proteostasis for disease intervention. Science 319, 916–919 (2008).
Article ADS CAS Google Scholar
Cohen, F. E. & Kelly, J. W. Therapeutic approaches to protein-misfolding diseases. Nature 426, 905–909 (2003).
Article ADS CAS Google Scholar
Day, S. M. The ubiquitin proteasome system in human cardiomyopathies and heart failure. American journal of physiology. Heart and circulatory physiology 304, H1283–H1293 (2013).
Article CAS Google Scholar
Hebert, D. N., Lamriben, L., Powers, E. T. & Kelly, J. W. The intrinsic and extrinsic effects of N-linked glycans on glycoproteostasis. Nat. Chem. Biol. 10, 902–910 (2014).
Article CAS Google Scholar
Hetz, C., Chevet, E. & Harding, H. P. Targeting the unfolded protein response in disease. Nat. Rev. Drug. Discov. 12, 703–719 (2013).
Article CAS Google Scholar
Popovic, D., Vucic, D. & Dikic, I. Ubiquitination in disease pathogenesis and treatment. Nat. Med. 20, 1242–1253 (2014).
Article CAS Google Scholar
Tai, H.-C. & Schuman, E. M. Ubiquitin, the proteasome and protein degradation in neuronal function and dysfunction. Nat. Rev. Neurosci. 9, 826–838 (2008).
Article CAS Google Scholar
Toyama, B. H. & Hetzer, M. W. Protein homeostasis: live long, won't prosper. Nature reviews. Molecular cell biology 14, 55–61 (2013).
Article CAS Google Scholar
Li, J., Powell, S. R. & Wang, X. Enhancement of proteasome function by PA28α overexpression protects against oxidative stress. FASEB J. 25, 883–893 (2011).
Article CAS Google Scholar
Wang, X. & Robbins, J. Heart failure and protein quality control. Circ. Res. 99, 1315–1328 (2006).
Article CAS Google Scholar
Claydon, A. J. & Beynon, R. Proteome dynamics: revisiting turnover with a global perspective. Mol. Cell Proteomics 11, 1551–1565 (2012).
Article Google Scholar
Lam, M. P. et al. Protein kinetic signatures of the remodeling heart following isoproterenol stimulation. J. Clin. Invest. 124, 1734–1744 (2014).
Article CAS Google Scholar
Price, J. C. et al. The effect of long term calorie restriction on in vivo hepatic proteostatis: a novel combination of dynamic and quantitative proteomics. Mol. Cell Proteomics 11, 1801–1814 (2012).
Article Google Scholar
Schoenheimer, R., Ratner, S. & Rittenberg, D. STUDIES IN PROTEIN METABOLISM: X. THE METABOLIC ACTIVITY OF BODY PROTEINS INVESTIGATED WITH l (-)-LEUCINE CONTAINING TWO ISOTOPES. J. Biol. Chem. 130, 703–732 (1939).
CAS Google Scholar
Simoni, R. D., Hill, C. P. & Vaughan., M. The Use of Isotope Tracers to Study Intermediary Metabolism: Rudolf Schoenheimer. J. Biol. Chem. 277, e31 (2002).
CAS Google Scholar
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).
Article CAS Google Scholar
Nagaraj, N. et al. Deep proteome and transcriptome mapping of a human cancer cell line. Mol. Syst. Biol. 7, 548 (2011).
Article Google Scholar
Carvalho, P. C., Hewel, J., Barbosa, V. C. & Yates, J. R. Identifying differences in protein expression levels by spectral counting and feature selection. Genetics and Molecular Research 7, 342–356 (2008).
Article CAS Google Scholar
Farrah, T. et al. State of the human proteome in 2013 as viewed through peptideatlas: Comparing the kidney, urine, and plasma proteomes for the biology- and disease-driven human proteome project. Journal of Proteome Research 13, 60–75 (2014).
Article CAS Google Scholar
Jovanovic, M. et al. Dynamic profiling of the protein life cycle in response to pathogens. Science 347 (2015).
Papageorgopoulos, C., Caldwell, K., Shackleton, C., Schweingrubber, H. & Hellerstein, M. K. Measuring protein synthesis by mass isotopomer distribution analysis (MIDA). Analytical biochemistry 267, 1–16 (1999).
Article CAS Google Scholar
Andersen, J. S. et al. Nucleolar proteome dynamics. Nature 433, 77–83 (2005).
Article ADS CAS Google Scholar
Schwanhäusser, B., Gossen, M., Dittmar, G. & Selbach, M. Global analysis of cellular protein translation by pulsed SILAC. Proteomics 9, 205–209 (2009).
Article Google Scholar
Jayapal, K. P. et al. Multitagging proteomic strategy to estimate protein turnover rates in dynamic systems. Journal of Proteome Research 9, 2087–2097 (2010).
Article CAS Google Scholar
Maier, T. et al. Quantification of mRNA and protein and integration with protein turnover in a bacterium. Molecular systems biology 7, 511 (2011).
Article Google Scholar
Pratt, J. M. et al. Dynamics of protein turnover, a missing dimension in proteomics. Molecular & cellular proteomics: MCP 1, 579–591 (2002).
Article CAS Google Scholar
Cambridge, S. B. et al. Systems-wide proteomic analysis in mammalian cells reveals conserved, functional protein turnover. Journal of Proteome Research 10, 5275–5284 (2011).
Article CAS Google Scholar
Doherty, M. K., Hammond, D. E., Clague, M. J., Gaskell, S. J. & Beynon, R. J. Turnover of the human proteome: determination of protein intracellular stability by dynamic SILAC. J Proteome Res 8, 104–112 (2009).
Article CAS Google Scholar
Schwanhausser, B. et al. Global quantification of mammalian gene expression control. Nature 473, 337–342 (2011).
Article ADS Google Scholar
Chan, X. C., Black, C. M., Lin, A. J., Ping, P. & Lau, E. Mitochondrial protein turnover: methods to measure turnover rates on a large scale. Journal of molecular and cellular cardiology, In Print 78, 54–61 (2014).
Article Google Scholar
Price, J. C. et al. Measurement of human plasma proteome dynamics with (2)H(2)O and liquid chromatography tandem mass spectrometry. Anal. Biochem. 420, 73–83 (2012).
Article CAS Google Scholar
Busch, R. et al. Measurement of protein turnover rates by heavy water labeling of nonessential amino acids. Biochim Biophys. Acta. 1760, 730–744 (2006).
Article CAS Google Scholar
Kasumov, T. et al. Assessment of Cardiac Proteome Dynamics with Heavy Water: Slower Protein Synthesis Rates in Interfibrillar than Subsarcolemmal Mitochondria. Am. J. Physiol. Heart Circ. Physiol. 304, H1201–H1214 (2013).
Article CAS Google Scholar
Lau, E. et al. Spatial and temporal dynamics of the cardiac mitochondrial proteome. Expert Review of Proteomics 12, 133–146 (2015).
Article CAS Google Scholar
Dufner, D. & Previs, S. F. Measuring in vivo metabolism using heavy water. Current opinion in clinical nutrition and metabolic care 6, 511–517 (2003).
Article CAS Google Scholar
Wang, D. et al. Characterization of human plasma proteome dynamics using deuterium oxide. Proteomics Clin. Appl. 8, 610–619 (2014).
Article Google Scholar
Kim, T. Y. et al. Metabolic labeling reveals proteome dynamics of mouse mitochondria. Mol Cell Proteomics 11, 1586–1594 (2012).
Article Google Scholar
Hellerstein, M. K. & Neese, R. A. Mass isotopomer distribution analysis at eight years: theoretical, analytic, and experimental considerations. The American journal of physiology 276, E1146–E1170 (1999).
CAS PubMed Google Scholar
Lam, M. P. et al. Site-specific quantitative analysis of cardiac mitochondrial protein phosphorylation. J Proteomics 81, 15–23 (2013).
Article CAS Google Scholar
Lam, M. P. et al. Online combination of reversed-phase/reversed-phase and porous graphitic carbon liquid chromatography for multicomponent separation of proteomics and glycoproteomics samples. Electrophoresis 32, 2930–2940 (2011).
Article CAS Google Scholar
Siu, S. O. et al. Fully automatable two-dimensional reversed-phase capillary liquid chromatography with online tandem mass spectrometry for shotgun proteomics. Proteomics 11, 2308–2319 (2011).
Article CAS Google Scholar
Kessner, D., Chambers, M., Burke, R., Agus, D. & Mallick, P. ProteoWizard: Open source software for rapid proteomics tools development. Bioinformatics 24, 2534–2536 (2008).
Article CAS Google Scholar
Xu, T. et al. ProLuCID: An improved SEQUEST-like algorithm with enhanced sensitivity and specificity. Journal of Proteomics 129, 16–24 (2015).
Article ADS CAS Google Scholar
Magrane, M. & Consortium, U. P. UniProt Knowledgebase: A hub of integrated protein data. Database 2011, bar009 (2011).
Article Google Scholar
Tabb, D. L., McDonald, W. H. & Yates, J. R. 3rd DTASelect and Contrast: tools for assembling and comparing protein identifications from shotgun proteomics. J. Proteome Res. 1, 21–26 (2002).
Article CAS Google Scholar
Abraham Savitzky & Golay, M. J. E. Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 36, 1627–1639 (1964).
Article ADS Google Scholar
Berglund, M. & Wieser, M. E . Isotopic compositions of the elements 2009 (IUPAC Technical Report, 2011).
Google Scholar
Commerford, S. L., Carsten, A. L. & Cronkite., E. P. The distribution of tritium among the amino acids of proteins obtained from mice exposed to tritiated water. Radiat. Res. 94, 151–155 (1983).
Article ADS CAS Google Scholar
Nelder, J. A. & Mead., R. A simplex-method for function minimization. Computer Journal 7, 308–313 (1965).
Article MathSciNet Google Scholar
Broyden, C. G. The Convergence of a Class of Double-rank Minimization Algorithms 1. General Considerations. IMA Journal of Applied Mathematics 6, 76–90 (1970).
Article Google Scholar
Ghazalpour, A. et al. Comparative analysis of proteome and transcriptome variation in mouse. PLoS Genetics 7, e1001393 (2011).
Article CAS Google Scholar
Melzer, D. et al. A genome-wide association study identifies protein quantitative trait loci (pQTLs). PLoS genetics 4, e1000072 (2008).
Article Google Scholar
Wu, L. et al. Variation and genetic control of protein abundance in humans. Nature 499, 79–82 (2013).
Article ADS CAS Google Scholar
The Jackson Laboratory Handbook on Genetically Standardized Mice, Sixth Edition (eds Currer, J. M., Leiter, E. H. & Witham, B.) 379 (The Jackson Laboratory, Bar Harbor, 2009).
Vizcaíno, J. A., Deutsch, E. W. & Wang, R. ProteomeXchange provides globally coordinated proteomics data submission and dissemination. Nature 32, 223–226 (2014).
Google Scholar
Derry, J. M. J. et al. Developing predictive molecular maps of human disease through community-based modeling. Nat. Genet. 44, 127–130 (2012).
Article CAS Google Scholar
Croft, D. et al. The Reactome pathway knowledgebase. Nucleic. Acids Research 42, D472–D477 (2014).
Article CAS Google Scholar
Claydon, A., Thom, M., Hurst, J. & Beynon, R. J. Protein turnover: measurement of proteome dynamics by whole animal metabolic labelling with stable isotope labelled amino acids. Proteomics 12, 1194–1206 (2012).
Article CAS Google Scholar
Lindsey, M. L., Iyer, R. P., Jung, M., DeLeon-Pennell, K. Y. & Ma, Y. Matrix metalloproteinases as input and output signals for post-myocardial infarction remodeling. Journal of Molecular and Cellular Cardiology 91, 134–140 (2016).
Article CAS Google Scholar
Lin, Y., Fu, Q., Zhu, J., Miller, J. M. & Van Eyk, J. E. Development of a qualitative sequential immunoassay for characterizing the intrinsic properties of circulating cardiac troponin I. Clinical chemistry 56, 1307–1319 (2010).
Article CAS Google Scholar
Proctor, C. J., Tsirigotis, M. & Gray, D. A. An in silico model of the ubiquitin-proteasome system that incorporates normal homeostasis and age-related decline. BMC systems biology 1, 17 (2007).
Article Google Scholar
Kuhar, M. J. On the Use of Protein Turnover and Half-Lives. Neuropsychopharmacology 34, 1172–1173 (2008).
Article Google Scholar
Nesvizhskii, A. I. Proteogenomics: concepts, applications and computational strategies. Nat. Meth. 11, 1114–1125 (2014).
Article CAS Google Scholar
Chick, J. M. et al. A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides. Nat Biotech 33, 743–749 (2015).
Article CAS Google Scholar
Cox, J. et al. Andromeda: a peptide search engine integrated into the MaxQuant environment. J. Proteome Res. 10, 1794–1805 (2011).
Article ADS CAS Google Scholar
Zong, N. et al. Integration of Cardiac Proteome Biology and Medicine by a Specialized Knowledgebase. Circ. Res. 113, 1043–1053 (2013).
Article CAS Google Scholar
Rowell, J., Koitabashi, N., Kass, D. A. & Barth, A. S. Dynamic Gene Expression Patterns in Animal Models of Early and Late Heart Failure Reveal Biphasic-Bidirectional Transcriptional Activation of Signaling Pathways. Physiological Genomics 46, 779–787 (2014).
Article CAS Google Scholar
Savas, J. N., Toyama, B. H., Xu, T., Yates, J. R. & Hetzer, M. W. Extremely Long-Lived Nuclear Pore Proteins in the Rat Brain. Science 335, 942 (2012).
Article ADS CAS Google Scholar

Data Citations

Lau, E. ProteomeXchange http://www.ebi.ac.uk/pride/archive/projects/PXD002870 (2015)
Lau, E. Synapse https://doi.org/10.7303/syn2289125 (2015)
Lau, E. Figshare https://doi.org/10.6084/m9.figshare.c.2171334 (2016)
Lau, E. Synapse https://doi.org/10.7303/syn4847184.2 (2016)

Download references

Acknowledgements

This work was supported by NIH awards R01HL129723, R01HL123060, and U54GM114833; and the UCLA Laubisch endowment, to P.P. and American Heart Association fellowship 12PRE11610024 to E.L.

Author information

Authors and Affiliations

The NIH Big Data to Knowledge (BD2K) Center of Excellence in Biomedical Computing at UCLA, Los Angeles, California, 90095, USA
Edward Lau, Quan Cao, Dominic C.M. Ng, Brian J. Bleakley, T. Umut Dincer, Brian M. Bot, Ding Wang, David A. Liem, Maggie P.Y. Lam & Peipei Ping
Department of Physiology, University of California at Los Angeles, Los Angeles, 90095, California, USA
Edward Lau, Quan Cao, Dominic C.M. Ng, Brian J. Bleakley, T. Umut Dincer, Ding Wang, David A. Liem, Maggie P.Y. Lam & Peipei Ping
Department of Shanghai Institute of Cardiovascular Diseases, Zhongshan Hospital, Fudan University, Shanghai, 200032, China
Quan Cao & Junbo Ge
Department of Bioinformatics, University of California at Los Angeles, Los Angeles, California, 90095, USA
T. Umut Dincer, Maggie P.Y. Lam & Peipei Ping
Department of Sage Bionetworks, Seattle, 98109, Washignton, USA
Brian M. Bot
Department of Medicine,University of California at Los Angeles, Los Angeles, 90095, California, USA
Peipei Ping

Authors

Edward Lau
View author publications
You can also search for this author in PubMed Google Scholar
Quan Cao
View author publications
You can also search for this author in PubMed Google Scholar
Dominic C.M. Ng
View author publications
You can also search for this author in PubMed Google Scholar
Brian J. Bleakley
View author publications
You can also search for this author in PubMed Google Scholar
T. Umut Dincer
View author publications
You can also search for this author in PubMed Google Scholar
Brian M. Bot
View author publications
You can also search for this author in PubMed Google Scholar
Ding Wang
View author publications
You can also search for this author in PubMed Google Scholar
David A. Liem
View author publications
You can also search for this author in PubMed Google Scholar
Maggie P.Y. Lam
View author publications
You can also search for this author in PubMed Google Scholar
Junbo Ge
View author publications
You can also search for this author in PubMed Google Scholar
Peipei Ping
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.L. designed experiments, performed data analysis, and wrote the manuscript. Q.C. performed animal surgery and sample processing. D.C.M.N performed sample processing, GC-MS, and LC-MS/MS data acquisition. B.J.B. wrote software for data analysis. T.U.D. wrote software for data analysis. BMB designed and executed data dissemination strategy. D.W. collected animal samples. D.A.L collected animal samples. M.P.Y.L designed experiments, performed sample processing, and MS data acquisition. J.G. critiqued the manuscript. P.P. designed experiments, interpreted data, and wrote the manuscript.

Corresponding authors

Correspondence to Edward Lau or Peipei Ping.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

ISA-Tab metadata

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0 Metadata associated with this Data Descriptor is available at http://www.nature.com/sdata/ and is released under the CC0 waiver to maximize reuse.

Reprints and permissions

About this article

Cite this article

Lau, E., Cao, Q., Ng, D. et al. A large dataset of protein dynamics in the mammalian heart proteome. Sci Data 3, 160015 (2016). https://doi.org/10.1038/sdata.2016.15

Download citation

Received: 30 September 2015
Accepted: 12 January 2016
Published: 15 March 2016
DOI: https://doi.org/10.1038/sdata.2016.15

Subjects

Abstract

Similar content being viewed by others

Proteome-wide profiling and mapping of post translational modifications in human hearts

A mass spectrometry workflow for measuring protein turnover rates in vivo

Quantitative analysis of global protein stability rates in tissues

Background & Summary

Methods

Summary

Animal models and in vivo isotope labeling

Gas chromatography-mass spectrometry

Protein extraction and processing

Liquid chromatography-tandem mass spectrometry

Peptide identification and protein inference workflow

Kinetic data processing strategy

Code availability

Data Records

Quantification of absolute in vivo cardiac protein turnover rates from mass spectrometry data

Quantification of relative cardiac protein expression from mass spectrometry data

Technical Validation

Strategies for data quality assurance

Usage Notes

Exploration of turnover kinetics in protein pathways

Alternative routes for data analysis

Additional Information

References

References

Data Citations

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

ISA-Tab metadata

ISA-Tab metadata

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links