Serum Proteomic Profiling to Identify Biomarkers of Premature Carotid Atherosclerosis

Bhosale, Santosh D.; Moulder, Robert; Venäläinen, Mikko S.; Koskinen, Juhani S.; Pitkänen, Niina; Juonala, Markus T.; Kähönen, Mika A. P.; Lehtimäki, Terho J.; Viikari, Jorma S. A.; Elo, Laura L.; Goodlett, David R.; Lahesmaa, Riitta; Raitakari, Olli T.

doi:10.1038/s41598-018-27265-9

Download PDF

Article
Open access
Published: 15 June 2018

Serum Proteomic Profiling to Identify Biomarkers of Premature Carotid Atherosclerosis

Santosh D. Bhosale¹,
Robert Moulder ORCID: orcid.org/0000-0003-4742-0566¹,
Mikko S. Venäläinen ORCID: orcid.org/0000-0003-1777-4259¹,
Juhani S. Koskinen ORCID: orcid.org/0000-0002-5920-9728^2,6,
Niina Pitkänen⁶,
Markus T. Juonala²,
Mika A. P. Kähönen³,
Terho J. Lehtimäki⁴,
Jorma S. A. Viikari²,
Laura L. Elo¹,
David R. Goodlett^1,5,
Riitta Lahesmaa¹ &
…
Olli T. Raitakari^6,7

Scientific Reports volume 8, Article number: 9209 (2018) Cite this article

4441 Accesses
18 Citations
1 Altmetric
Metrics details

Subjects

Abstract

To evaluate the presence of serum protein biomarkers associated with the early phases of formation of carotid atherosclerotic plaques, label-free quantitative proteomics analyses were made for serum samples collected as part of The Cardiovascular Risk in Young Finns Study. Samples from subjects who had an asymptomatic carotid artery plaque detected by ultrasound examination (N = 43, Age = 30–45 years) were compared with plaque free controls (N = 43) (matched for age, sex, body weight and systolic blood pressure). Seven proteins (p < 0.05) that have been previously linked with atherosclerotic phenotypes were differentially abundant. Fibulin 1 proteoform C (FBLN1C), Beta-ala-his-dipeptidase (CNDP1), Cadherin-13 (CDH13), Gelsolin (GSN) and 72 kDa type IV collagenase (MMP2) were less abundant in cases, whereas Apolipoproteins C-III (APOC3) and apolipoprotein E (APOE) were more abundant. Using machine learning analysis, a biomarker panel of FBLN1C, APOE and CDH13 was identified, which classified cases from controls with an area under receiver-operating characteristic curve (AUROC) value of 0.79. Furthermore, using selected reaction monitoring mass spectrometry (SRM-MS) the decreased abundance of FBLN1C was verified. In relation to previous associations of FBLN1C with atherosclerotic lesions, the observation could reflect its involvement in the initiation of the plaque formation, or represent a particular risk phenotype.

Discovery of novel biomarkers for atherosclerotic aortic aneurysm through proteomics-based assessment of disease progression

Article Open access 14 April 2020

Quantitative proteomic analysis of human serum using tandem mass tags to predict cardiovascular risks in patients with psoriasis

Article Open access 17 February 2023

A urinary peptidomics approach for early stages of cardiovascular disease risk: The African-PREDICT study

Article 17 November 2022

Introduction

Atherosclerotic cardiovascular diseases are amongst the leading causes of death globally¹. Atherosclerosis is characterized by the accumulation of pro-atherogenic lipoprotein particles in the sub-endothelial space of large- and medium-sized arteries². The process is initiated by the trapping of apolipoprotein B particles in arterial intima by proteoglycans, followed by subsequent modifications, such as aggregation and oxidation, leading to the development of atherosclerotic plaques³. The rupture of a critically located atherosclerotic plaques can result in myocardial infarction and stroke⁴.

Ultrasound examination can be used to identify thickening of the intima-media in the carotid arteries and is used to detect and monitor both subclinical and clinical atherosclerosis⁵. Although it is unclear how the diffuse thickening of carotid artery wall represents the subclinical phase, there is a consensus that local carotid plaques, defined as distinct protrusions from the carotid vessel wall into the lumen, are indicative of specific phenotypic changes associated with an active atherosclerotic process⁶. As there is compelling evidence that the atherosclerotic process often starts early in life and may remain asymptomatic for several decades⁷, the study of young and middle-aged adults with non-obstructive carotid plaques provides an opportunity to explore biomarkers for early-stage preclinical atherosclerosis. Such markers may have potential in diagnostics and in understanding the disease etiology. Since atherosclerosis is a circulatory manifestation, the extracellular proteome, which includes proteins like collagens, elastin, proteoglycans, lipoproteins and glycoproteins, is an important analytical target^8,9. Accordingly, a number of studies have used serum and plasma to establish or identify protein markers for atherosclerosis^10,11. These have ranged from targeted comparisons, i.e. based on prior hypothesis, through to untargeted profiling discovery measurements^12,13. As an example of the former, Malaud et al. determined proteomic profiles of atherosclerotic lesions, followed by Luminex immunoassays of likely targets in blood from the same subjects¹⁴. Using a discovery approach, DeGraba et al. employed surface-enhanced laser desorption/ionization (SELDI)-time of flight (TOF) mass spectrometry to identify distinguishing proteomics patterns from serum samples of atherosclerotic and non-atherosclerotic groups¹². Employing a multi-faceted discovery strategy, Kristensen et al. compared subjects with different circulatory diseases using a combination of immuno-affinity depletion, isobaric labeling, and an additional consideration of phosphorylated and sialyated peptides. Vinculin was identified as a novel marker of acute coronary syndrome¹³.

In contrast to the published studies, in which the comparisons have mostly addressed advanced atherosclerotic phenotypes, we have analyzed serum samples taken early on from the subjects with a non-obstructive plaque in their carotid artery along with the matched controls. The samples were collected as part of The Cardiovascular Risk in Young Finns Study (YFS), which was established to investigate how childhood lifestyle, biological and psychological measures contribute to cardiovascular risk. Follow-up data on cardiovascular risk factors (e.g. body weight, blood pressure and other biochemical parameters) have been periodically determined at three to six years intervals from over two thousand YFS participants during the past thirty years. Ultrasound assessment of carotid plaque formation has also been performed during the last fifteen years¹⁵. On the basis of these ultrasound measurements, serum samples were selected from subjects (N = 43), in whom early signs of plaque development was discerned, together with the equivalent material from carefully matched controls (N = 43). With the view to identify markers of disease risk and onset, we have applied a label-free quantitative mass spectrometry approach to analyze this unique sample set¹⁶. Selected reaction monitoring mass spectrometry (SRM-MS) was subsequently used for the verification of the observed differences. The label-free strategy employed was advantageous in terms of ease of implementation and scalability. Additionally, targeted mass spectrometry-based validation assays could be quickly developed from the discovery data and applied to validate the results.

Results

Discovery phase of premature carotid atherosclerosis biomarkers

Label-free quantitative proteomics was performed on serum samples obtained from 43 subjects who developed premature carotid artery plaques and 43 matched controls (Figure 1 and Table 1). Overall 296 proteins were detected with more than 1 peptide, according to the defined filtering criteria (see Methods section). For statistical analysis, 249 proteins with valid values in at least 50% of the samples were further considered (Supplementary Table 1). Notably, whilst atherosclerosis is characterized as an inflammatory disease, our clinical data from this study of the early phases of plaque formation revealed that there were no differences in the level of inflammatory C-reactive protein between cases and controls (Table 1).

Table 1 Clinical parameters at the time of plaque detection.

Full size table

The comparison of the samples from the plaque bearing subjects and their controls revealed the differential abundance of seven proteins (p < 0.05) as shown in Table 2 and depicted as a volcano plot in Fig. 2. Fibulin 1 proteoform C (FBLN1C), Beta-ala-his-dipeptidase (CNDP1), Cadherin-13 (CDH13), Gelsolin (GSN) and 72 kDa type IV collagenase (MMP2) were lower in abundance in cases, whilst apolipoprotein C-III (APOC3) and apolipoprotein E (APOE) were more abundant. After correction for multiple hypothesis testing only the difference in the FBLN1C levels was statistically significant (FDR < 0.05). On the basis of the known genetic associations, we evaluated the APOE genotype data but found no difference in the frequency of the carotid atherosclerosis risk related alleles between the case and control groups (Supplementary Fig. S2 and Supplementary Table 2).

Table 2 List of proteins found to be significantly differentially abundant between cases and their matched controls.

Full size table

Machine learning classification

To gain an overview of whether there was a panel of proteins that could be used to distinguish the subjects we applied Lasso penalized logistic regression to the serum proteomics data. On the basis of this, a panel of three proteins, FBLN1C, APOE and CDH13, was observed to provide the best discrimination between the cases and controls. With the inclusion of APOE and CDH13, there was a statistically significant improvement in the AUROC (0.79, 95% CI: 0.69–0.88, p = 0.03) (Fig. 3). FBLN1C alone classified cases from controls with an AUROC value of 0.67 (95% CI: 0.56–0.79).

SRM Verification

SRM measurements were performed for the seven proteins that were indicated to be differentially abundant in the initial profiling data. Additional targeted measurements were made for two housekeeping proteins (selected based on their consistency in our data), APOB (an established CVD risk factor) and standard retention time peptides (iRT)¹⁷ (Table 3). The analysis supported the downregulation of FBLN1C by a ratio of 0.85 (99% CI: 0.73–0.98; FDR < 0.05) in cases compared to their matched controls (Fig. 4). There were, however, no significant differences observed in any of the other targets. The SRM measurements for the panel did not improve the classification of the cases from controls as in the discovery phase (Supplementary Figure S3).

Table 3 List of proteins along with their proteotypic peptides used for the SRM-MS assay.

Full size table

Discussion

In this comparison of serum from matched controls and subjects in whom the early stages in the development of carotid plaques were detected, the proteomic analysis indicated the differential abundance of several proteins. Amongst these were a number of proteins that have been previously linked with atherosclerosis, i.e. APOC3, APOE, CNDP1, CDH13, GSN, MMP2 and fibulin1⁸. Using a machine learning approach the combination of FBLN1C, APOE and CDH13 was found to provide the best classification of the cases from controls. A targeted SRM assay was subsequently developed for the measurement of these differentially abundant proteins. Due to the unavailability of a similar sample set, it was used for verification only in the same samples as analyzed in the discovery phase. The verification measurements were performed for the samples prepared without depletion. The use of undepleted samples removed potential biases created by the depletion step and succeeded against the background of abundant serum proteins due to the intrinsic sensitivity of the targeted method. Based on this verification data, the quantitative difference of FBLN1C remained significant. The failure to confirm the dissimilarities detected for APOE and CDH13 in the discovery data could reflect the small magnitude and variability of these intra-individual differences.

The Fibulins are a family of six moderately abundant serum proteins (FBLN1 – FBLN6) that are linked with the extracellular matrix (ECM) proteins¹⁸. Differences in FBLN1 abundance have previously been observed in several studies, including its relationship to atherosclerosis, cardiovascular risk, arterial stiffness and type 2 diabetes (T2D). For example, Kawata et al. first reported reduced plasma levels of FLBN1 in patients with acute myocardial infarction and stable angina¹⁹. Both lower and higher levels have been reported in the plasma of T2D patients^20,21, but differences in the duration of the disease (recently diagnosed vs. established disease) could reflect upon the latter division. In relationship to cardiovascular disease, FLBN1 was detected as a component of atherosclerotic lesions, and Argraves et al. suggested that decreased plasma FBLN1 could reflect its accumulation in the plaque²². Similarly, the accumulation of FBLN1 in the arterial wall has been detected in patients with T2D²¹, although in a situation in which the FBLN1 plasma levels were higher than in the matched controls. In contrast, in newly diagnosed T2D patients, lower plasma FBLN1 was found and correlated with carotid-femoral arterial stiffness²⁰. On the basis of the latter observation, Paapstel et al. studied the relationship between arterial stiffness and serum FBLN1 levels in patients with atherosclerosis²³. Here they found higher levels of FBLN1 in the patients.

In addition to the complex interplay between arterial stiffness and atherosclerotic risk factors²⁴, the relationship between plasma levels of FBLN1 and the early stages of plaque formation is contradictory and yet to be clearly established.

In the above examples, there are differences in the study sizes, diseases and duration, as well as specificity of the controls. Further, whilst alternative splicing produces four FBLN1 proteoforms (A, B, C and D)²⁵, the former the studies have not made any distinction between these. These variants may differ from both a structural and functional perspective. In this respect their distinction as proteoforms and this terminology implicates protein variants that are not coded explicitly in the genome, i.e. including alternative RNA splicing and post-translational modifications²⁶. Our proteomic data has specifically highlighted significant differences for the C proteoform. Fibulin1C has been reported to be the predominant form in plasma²⁵ and has been identified in the tissue secretome analysis of coronary arteries²⁷. Within the limitations of this knowledge, we can only speculate whether this difference in abundance represents a phenotype that is more susceptible to plaque formation or an early indication of onset. Potentially, it may be that the structural differences for this specific proteoform of FBLN1 could, following some trigger, contribute to its interaction with extracellular matrix protein (ECM) molecules and accumulation on the arterial intimal walls, and thus be reflected by its lower serum abundance.

The progression of the atherosclerotic process is influenced by a range of factors including age, diet, stress and other risk factors such as smoking^28,29. In the present study the subjects were carefully matched by age, gender, BMI and blood pressure. However, seven cases and five controls were smokers, and the influence of smoking was not separately evaluated. Additional limitations of the current study is the sample size and the need for further validation studies in a larger independent cohort. Furthermore, structural studies explaining the interaction of the FBLN1C proteoform with ECM proteins could provide insights into its potential role in plaque formation.

In summary, from these measurements from a cohort selected to study the risk and development of atherosclerotic lesions, distinguishing proteomics profiles were identified in subjects showing early signs of plaque development. In particular, FBLN1C is implicated as a target for further investigation

Methods

Study Population and design

The samples were selected from participants in the Cardiovascular Risk in Young Finns cohort¹⁵. On the basis of carotid ultrasound measurements, samples from 43 individuals with a distinct carotid artery plaque were selected together with samples from 43 controls. The controls were matched by age, sex, body weight and systolic blood pressure. The age range of the subjects was 30–45 years. The study design is depicted in Fig. 1, and the clinical characteristics of cases and controls are shown in Table 1. The measurements and data included in this manuscript have been acquired following the guidelines of the Declaration of Helsinki for research on human participants and were conducted with the permission of the Ethical Committees of the University Hospitals of Turku with written informed consent.

Carotid intima-media thickness measurement

Ultrasound examination of the left carotid artery, including common carotid artery and carotid bifurcation, were performed using B-mode ultrasound (Acuson Sequoia 512, Siemens) with the 13.0-MHz linear-array transducer, according to a standardized protocol³⁰. Intima-media thickness was measured from digitally stored scans by one reader blinded to participant details. The best-quality end-diastolic frame was selected, and ultrasonic calipers were used to measure carotid intima-media thickness from the far wall of the common carotid artery 10 mm proximal to the bifurcation. To detect the carotid plaques, the images were scanned and the presence of atherosclerotic plaque defined as a distinct area of the carotid vessel wall protruding into the lumen >50% of the adjacent intima-media layer³¹. All the observed plaques were detected in the carotid bifurcation.

ApoE genotype determination

APOE genotyping was performed by using TaqMan SNP Genotyping Assays (rs429358 assay C 3084793_20; rs7412 assay C_904973_10) and the ABI Prism 7900HT Sequence Detection System (Applied Biosystems, Foster City, CA, USA).

Sample preparation

Immunodepletion of high abundant proteins

An Agilent MARS-14 immunoaffinity column was used for the targeted removal of the most abundant serum proteins. The isolated, lower abundance proteins were reduced, alkylated, digested and desalted prior to mass spectrometry (MS) analysis as described previously³².

Preparation of undepleted serum

For the verification measurements, the serum samples were diluted in the denaturant, reduced, alkylated and digested using sequencing grade modified trypsin (Promega)³².

Heavy labeled synthetic analogs of proteotypic peptides³³ of the differentially abundant proteins, housekeeping proteins (Alpha-1B-glycoprotein and complement C1s subcomponent) and the known CVD risk factor, Apolipoprotein B-100, were spiked into the digests together with indexed retention time standard (iRT) peptides (Biognosys). These were selected from discovery phase peptide data with the consideration of the consistent detection, absence of potentially modified residues and missed cleavages (Table 3). The heavy-labeled synthetic analogs (lysine ¹³C₆ ¹⁵N₂ and arginine ¹³C₆ ¹⁵N₄) equivalent of proteotypic peptides were obtained (Thermo Fischer Scientific).

Mass spectrometry analysis

Discovery phase

Aliquots of the depleted serum digests (500 ng) were analyzed with an Easy-nLC-II coupled to a LTQ Orbitrap Velos Pro mass spectrometer (Thermo Fisher Scientific). The peptides were separated on 150 mm × 75 µm ID column packed with 5 µm magic C18-bonded silica (200 Å). The peptides were eluted with an increasing gradient of 5–35% acetonitrile at a flow rate of 300 nl/min using a binary mixture of water and acetonitrile with 0.2% formic acid. The mass spectrometer was operated in data-dependent acquisition mode with a selection of top 15 precursors followed by fragmentation using collisional induced dissociation (CID) method. All the samples were analyzed in quadruplicate as randomized batches³². To ensure comparable instrument performance throughout the time span of the discovery phase study, an in-house standard was periodically analyzed to establish the consistency of the signal intensity and chromatographic separation (Supplementary Figure S1).

Verification

Aliquots of the digested peptides (250 ng), spiked with heavy labeled peptides and index retention time (iRT) peptides, were analyzed with Easy-nLC-II coupled to a TSQ Vantage mass spectrometer (Thermo Fisher Scientific)³². The peptide mixture was separated with a 150 mm × 75 µm ID column packed with ReproSil-Pur C18-AQ 5 µm resin (Dr. Maisch GmbH). An unscheduled analysis of the sample was carried out to generate iRT values of target peptides and their heavy counterparts. Skyline software was used to build up the scheduled method for the selected targets using the unscheduled run. The scheduled method was then edited by removing interfering signals³² to monitor 99 transitions from 33 peptides, representing 10 proteins and the iRT peptides.

The 86 serum samples (N = 43 vs. 43) were prepared without depletion and analyzed as randomized batches. To monitor the variation of peak areas and retention time across and within the batches, a pooled digest of the undepleted serum was included in each batch.

Data processing

Protein informatics analysis (Discovery phase)

The tandem mass spectra data were searched against a UniProt human isoform protein sequence database (UniProt release, August 2017, entries = 42,210) using the Andromeda³⁴ search algorithm and MaxQuant 1.5.5.1³⁵. The search parameters were set to allow two missed tryptic cleavages, methyl methanethiosulfonate (MMTS) modification of cysteine and variable modification of methionine and acetylation of the protein N terminus. A false discovery rate (FDR) of 1% was applied at peptide and protein level. The “match between run” option (matching and alignment time window = 0.7 and 20 minutes respectively) was selected in order to enable the transfer of identifications across the mass spectrometric measurements³⁶. The label-free normalized intensity values (MaxQuant output) were further analyzed using Perseus software^36,37. Briefly, the output was filtered to exclude reverse hits and proteins only inferred by the detection of single variable modifications. Furthermore, only proteins identified with >1 unique plus razor peptides were considered. The razor peptides are defined as those that are shared between different protein groups and are assigned to the protein that has the most peptides^35,38. The data was then log₂ transformed followed by filtering to at least 50% valid values. Missing values were imputed by “imputation from normal distribution” (width = 0.3, downshift = 1.8)³⁹ followed by taking the average of quadruplicate analyses. The subsequent statistical analysis of data was then performed using R⁴⁰.

SRM data analysis and transition selection

Skyline version 4.1 was used to develop and analyze SRM assay transitions⁴¹. The quality of the transitions and confirmation of light/heavy pairs were manually inspected. On account of interferences in the transitions for one of the FBLN1C peptides, DLLLTVK alone was considered for its statistical analysis in the SRM data. Out of the two housekeeping proteins included for normalization of the data, the TNFDNDIALVR peptide from complement C1s subcomponent was used.

Statistical analysis

Reproducibility-optimized test statistic (ROTS)

The label-free normalized protein intensity abundance values obtained from MaxQuant analysis were used as input for ROTS analysis^36,42,43. Briefly, the log2 transformed data were analyzed using non-parametric method relying on the family of t-type statistics which ranks the proteins based on their differential expression in two group conditions and the calculation was made with 1000 permutations (FDR < 0.05).

Machine learning classification

To identify the protein panel with the highest discriminative performance, Lasso penalized logistic regression⁴⁴, implemented in the R package glmnet⁴⁵, was applied to the serum proteomics data. First, all candidate predictors were identified by shrinking the coefficients of non-informative predictors to zero using Lasso with 3-fold cross-validation, repeating the randomization procedure 200 times. In each fold, only significantly differentially abundant proteins (ROTS; P < 0.05) were considered. Finally, among the top 20 most frequent candidate proteins, the Lasso model with the protein panel having the largest improvement in discriminative performance in terms of area under the receiver-operating characteristic curve (AUROC) and with the least number of predictors was identified. Statistical significance of the differences in the AUROC values between the models was determined using the DeLong method⁴⁶ implemented in the R package pROC⁴⁷.

MSStats

The MSStats (3.8.4) plugin included in the Skyline software was used for the group comparison between cases and controls⁴⁸. Briefly, after normalizing the data to the housekeeping protein the statistics were calculated on the basis of the Turkey’s median polish method. The latter uses a linear mixed model to give a robust estimation of differentially abundant proteins between conditions.

Data availability

The LC-MS/MS proteomics discovery data are available from the ProteomeXchange Consortium via the PRIDE⁴⁹ partner repository with the dataset identifier PXD008278. The SRM verification data are available from the ProteomeXchange Consortium via the PASSEL⁵⁰ partner repository with dataset identifier PASS01146.

References

Kim, A. S. & Johnston, S. C. Global variation in the relative burden of stroke and ischemic heart disease. Circulation 124, 314–323 (2011).
Article PubMed Google Scholar
Williams, K. J. & Tabas, I. The response-to-retention hypothesis of early atherogenesis. Arterioscler. Thromb. Vasc. Biol. 15, 551–61 (1995).
Article PubMed PubMed Central CAS Google Scholar
Williams, K. J. & Tabas, I. The response-to-retention hypothesis of atherogenesis reinforced. Curr. Opin. Lipidol. 9, 471–474 (1998).
Article PubMed CAS Google Scholar
Libby, P. Inflammation in atherosclerosis. Nature 420, 868–74 (2002).
Article ADS PubMed CAS Google Scholar
O’Leary, D. H. & Bots, M. L. Imaging of atherosclerosis: carotid intima-media thickness. Eur. Heart J. 31, 1682–1689 (2010).
Article PubMed Google Scholar
Rundek, T. et al. The relationship between carotid intima-media thickness and carotid plaque in the Northern Manhattan Study. Atherosclerosis 241, 364–370 (2015).
Article PubMed PubMed Central CAS Google Scholar
McGill, H. C. J. et al. Origin of atherosclerosis in childhood and adolescence. in American Journal of Clinical Nutrition 72 (2000).
Eberini, I. et al. A proteomic portrait of atherosclerosis. J. Proteomics 82, 92–112 (2013).
Article PubMed CAS Google Scholar
Didangelos, A., Stegemann, C. & Mayr, M. The -omics era: Proteomics and lipidomics in vascular research. Atherosclerosis 221, 12–17 (2012).
Article PubMed CAS Google Scholar
Saarikoski, L. A. et al. Adiponectin is related with carotid artery intima-media thickness and brachial flow-mediated dilatation in young adults–the Cardiovascular Risk in Young Finns Study. Ann. Med. 42, 603–611 (2010).
Article PubMed CAS Google Scholar
Oikonen, M. et al. Tissue inhibitor of matrix metalloproteinases 4 (TIMP4) in a population of young adults: relations to cardiovascular risk markers and carotid artery intima-media thickness. The Cardiovascular Risk in Young Finns Study. Scand. J. Clin. Lab. Invest. 72, 540–546 (2012).
Article PubMed CAS Google Scholar
DeGraba, T. J. et al. Biomarker Discovery in Serum from Patients with Carotid Atherosclerosis. Cerebrovasc. Dis. Extra 1, 115–129 (2011).
Article PubMed PubMed Central Google Scholar
Kristensen, L. P. et al. Plasma proteome profiling of atherosclerotic disease manifestations reveals elevated levels of the cytoskeletal protein vinculin. J. Proteomics 101, 141–153 (2014).
Article PubMed CAS Google Scholar
Malaud, E. et al. Local carotid atherosclerotic plaque proteins for the identification of circulating biomarkers in coronary patients. Atherosclerosis 233, 551–558 (2014).
Article PubMed CAS Google Scholar
Raiko, J. R. et al. Follow-ups of the Cardiovascular Risk in Young Finns Study in 2001 and 2007: levels and 6-year changes in risk factors. J. Intern. Med. 267, 370–384 (2010).
Article PubMed CAS Google Scholar
Bantscheff, M., Lemeer, S., Savitski, M. M. & Kuster, B. Quantitative mass spectrometry in proteomics: Critical review update from 2007 to the present. Analytical and Bioanalytical Chemistry 404, 939–965 (2012).
Article PubMed CAS Google Scholar
Escher, C. et al. Using iRT, a normalized retention time for more targeted measurement of peptides. Proteomics 12, 1111–1121 (2012).
Article PubMed PubMed Central CAS Google Scholar
Argraves, W. S., Greene, L. M., Cooley, M. A. & Gallagher, W. M. Fibulins: physiological and disease perspectives. EMBO Rep. 4, 1127–1131 (2003).
Article PubMed PubMed Central CAS Google Scholar
Kawata, K., Tanaka, A., Arai, M., Argraves, W. S. & Fukutake, K. Alteration of plasma fibulin-1 concentrations in ischemic heart diseases. Jpn. J. Thromb. Hemost. 12, 126–132 (2001).
Article CAS Google Scholar
Laugesen, E. et al. Plasma levels of the arterial wall protein fibulin-1 are associated with carotid-femoral pulse wave velocity: A cross-sectional study. Cardiovasc. Diabetol. 12 (2013).
Cangemi, C. et al. Fibulin-1 is a marker for arterial extracellular matrix alterations in type 2 diabetes. Clin. Chem. 57, 1556–1565 (2011).
Article PubMed CAS Google Scholar
Argraves, W. S. et al. Fibulin-1 and fibrinogen in human atherosclerotic lesions. Histochem. Cell Biol. 132, 559–565 (2009).
Article PubMed CAS Google Scholar
Paapstel, K. et al. Association Between Fibulin-1 and Aortic Augmentation Index in Male Patients with Peripheral Arterial Disease. Eur. J. Vasc. Endovasc. Surg. 51, 76–82 (2016).
Article PubMed CAS Google Scholar
Van Popele, N. M. et al. Association between arterial stiffness and atherosclerosis: The Rotterdam study. Stroke 32, 454–460 (2001).
Article PubMed Google Scholar
Overgaard, M., Cangemi, C., Jensen, M. L., Argraves, W. S. & Rasmussen, L. M. Total and isoform-specific quantitative assessment of circulating fibulin-1 using selected reaction monitoring MS and time-resolved immunofluorometry. Proteomics - Clin. Appl. 9, 767–775 (2015).
Article PubMed CAS Google Scholar
Smith, L. M. & Kelleher, N. L. Proteoform: A single term describing protein complexity. Nature Methods 10, 186–187 (2013).
Article PubMed PubMed Central CAS Google Scholar
de la Cuesta, F. et al. Secretome analysis of atherosclerotic and non-atherosclerotic arteries reveals dynamic extracellular remodeling during pathogenesis. J. Proteomics 75, 2960–2971 (2012).
Article PubMed CAS Google Scholar
Rafieian-Kopaei, M., Setorki, M., Doudi, M., Baradaran, A. & Nasri, H. Atherosclerosis: Process, indicators, risk factors and new hopes. International Journal of Preventive Medicine 5, 927–946 (2014).
PubMed PubMed Central Google Scholar
Gambardella, J., Sardu, C., Sacra, C., Del Giudice, C. & Santulli, G. Quit smoking to outsmart atherogenesis: Molecular mechanisms underlying clinical evidence. Atherosclerosis 257, 242–245 (2017).
Article PubMed CAS Google Scholar
Raitakari, O. T. et al. Cardiovascular risk factors in childhood and carotid artery intima-media thickness in adulthood: the Cardiovascular Risk in Young Finns Study. JAMA 290, 2277–83 (2003).
Article PubMed CAS Google Scholar
Tonstad, S. et al. Risk factors related to carotid intima-media thickness and plaque in children with familial hypercholesterolemia and control subjects. Arterioscler. Thromb. Vasc. Biol. 16, 984–91 (1996).
Article PubMed CAS Google Scholar
Bhosale, S. D., Moulder, R., Kouvonen, P., Lahesmaa, R. & Goodlett, D. R. Mass Spectrometry-Based Serum Proteomics for Biomarker Discovery and Validation. Methods Mol. Biol. 1619, 451–466 (2017).
Article PubMed CAS Google Scholar
Mallick, P. et al. Computational prediction of proteotypic peptides for quantitative proteomics. Nat. Biotechnol. 25, 125–131 (2007).
Article PubMed CAS Google Scholar
Cox, J. et al. Andromeda: A Peptide Search Engine Integrated into the MaxQuant Environment. J. Proteome Res. 10, 1794–1805 (2011).
Article ADS PubMed CAS Google Scholar
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotech 26, 1367–1372 (2008).
Article CAS Google Scholar
Cox, J., Hein, M. Y., Luber, Ca & Paron, I. Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol. Cell. … 13, 2513–2526 (2014).
CAS Google Scholar
Tyanova, S. et al. The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat. Methods, https://doi.org/10.1038/nmeth.3901 (2016).
Nesvizhskii, A. I. & Aebersold, R. Interpretation of Shotgun Proteomic Data: The Protein Inference Problem. Mol Cell Proteomics 4, 1419–1440 (2005).
Article PubMed CAS Google Scholar
Deeb, S. J. et al. Machine Learning Based Classification of Diffuse Large B-cell Lymphoma Patients by their Protein Expression Profiles. Mol. Cell. Proteomics 2947–2960, https://doi.org/10.1074/mcp.M115.050245 (2015).
R Development Core Team, R. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing 1 (2011).
MacLean, B. et al. Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics 26, 966–968 (2010).
Article PubMed PubMed Central CAS Google Scholar
Elo, L. L., Filén, S., Lahesmaa, R. & Aittokallio, T. Reproducibility-optimized test statistic for ranking genes in microarray studies. In IEEE/ACM Transactions on Computational Biology and Bioinformatics 5, 423–431 (2008).
Suomi, T., Seyednasrollah, F., Jaakkola, M. K., Faux, T. & Elo, L. L. ROTS: An R package for reproducibility-optimized statistical testing. Plos Comput. Biol. 13 (2017).
Tibshirani, R. Regression Selection and Shrinkage via the Lasso. Journal of the Royal Statistical Society B 58, 267–288 (1996).
MATH Google Scholar
Friedman, J., Hastie, T. & Tibshirani, R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J. Stat. Softw. 33 (2010).
DeLong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the Areas under Two or More Correlated Receiver Operating Characteristic Curves: A Nonparametric Approach. Biometrics 44, 837 (1988).
Article PubMed MATH CAS Google Scholar
Robin, X. et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 12, 77 (2011).
Article PubMed PubMed Central Google Scholar
Choi, M. et al. MSstats: an R package for statistical analysis of quantitative mass spectrometry-based proteomic experiments. Bioinformatics, https://doi.org/10.1093/bioinformatics/btu305 (2014).
Vizcaíno, J. A. et al. 2016 update of the PRIDE database and its related tools. Nucleic Acids Res. 44, D447–D456 (2016).
Article PubMed CAS Google Scholar
Farrah, T. et al. PASSEL: The PeptideAtlas SRMexperiment library. Proteomics 12, 1170–1175 (2012).
Article PubMed CAS Google Scholar

Download references

Acknowledgements

The work presented in this paper was completed at the Turku Centre for Biotechnology Proteomics core facility from which Arttu Heinonen and Pekka Haapaniemi are greatly appreciated for their excellent technical support. The facility is supported by Biocenter Finland. We appreciate the help of Irina Lisinen and Johanna Ikonen in data management and statistical consultation. We thank Niina Lietzen for sharing her insights. We would like to thank the participants of the study for their continued commitment. The work was financially supported by the National Technology Agency of Finland (Grant number 40398/11, Finland Distinguish Professor Program to RL and DRG), RL was supported by The Academy of Finland Centre of Excellence in Molecular Systems Immunology and Physiology Research 2012–2017 (AoF grant 250114); Academy of Finland Personalized Medicine Program (AoF grant 292482) and the Academy of Finland grants 294337, 292335. The Young Finns Study has been financially supported by the Academy of Finland: grants 286284 (T.L.), 134309 (Eye), 126925, 121584, 124282, 129378 (Salve), 117787 (Gendi), and 41071 (Skidi); the Social Insurance Institution of Finland; Kuopio, Tampere and Turku University Hospital Medical Funds; Juho Vainio Foundation; Paavo Nurmi Foundation; Finnish Foundation for Cardiovascular Research; Finnish Cultural Foundation; Tampere Tuberculosis Foundation; Emil Aaltonen Foundation; and Yrjö Jahnsson Foundation.

Author information

Authors and Affiliations

Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, Turku, Finland
Santosh D. Bhosale, Robert Moulder, Mikko S. Venäläinen, Laura L. Elo, David R. Goodlett & Riitta Lahesmaa
Department of Medicine, University of Turku and Division of Medicine, Turku University Hospital, Turku, Finland
Juhani S. Koskinen, Markus T. Juonala & Jorma S. A. Viikari
Department of Clinical Physiology, Tampere University Hospital and University of Tampere, Faculty of Medicine, Tampere, Finland
Mika A. P. Kähönen
Department of Clinical Chemistry, Fimlab Laboratories and Finnish Cardiovascular Research Center-Tampere, Faculty of Medicine and Life Sciences, University of Tampere, Tampere, Finland
Terho J. Lehtimäki
Department of Pharmaceutical Science, University of Maryland, Baltimore, Maryland, USA
David R. Goodlett
Research Centre of Applied and Preventive Cardiovascular Medicine, University of Turku, Turku, Finland
Juhani S. Koskinen, Niina Pitkänen & Olli T. Raitakari
Department of Clinical Physiology and Nuclear Medicine, Turku University Hospital, Turku, Finland
Olli T. Raitakari

Authors

Santosh D. Bhosale
View author publications
You can also search for this author in PubMed Google Scholar
Robert Moulder
View author publications
You can also search for this author in PubMed Google Scholar
Mikko S. Venäläinen
View author publications
You can also search for this author in PubMed Google Scholar
Juhani S. Koskinen
View author publications
You can also search for this author in PubMed Google Scholar
Niina Pitkänen
View author publications
You can also search for this author in PubMed Google Scholar
Markus T. Juonala
View author publications
You can also search for this author in PubMed Google Scholar
Mika A. P. Kähönen
View author publications
You can also search for this author in PubMed Google Scholar
Terho J. Lehtimäki
View author publications
You can also search for this author in PubMed Google Scholar
Jorma S. A. Viikari
View author publications
You can also search for this author in PubMed Google Scholar
Laura L. Elo
View author publications
You can also search for this author in PubMed Google Scholar
David R. Goodlett
View author publications
You can also search for this author in PubMed Google Scholar
Riitta Lahesmaa
View author publications
You can also search for this author in PubMed Google Scholar
Olli T. Raitakari
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.D.B. conceived and designed the experiment, prepared and analyzed the samples, analyzed and interpreted the data and wrote the paper. R.M. supervised the study design, critically reviewed, involved in the interpretation of results throughout the study and contributed to the writing of the paper. M.S.V. performed machine learning analysis and helped in fine-tuning the R script. N.P. analyzed the APOE genotype data. L.L.E. supervised M.S.V for implementing machine learning analysis. D.R.G. conceived study design, involved in discussion and data interpretation. J.S.K. and M.T.J. analyzed the ultrasound data. O.T.R., M.K., T.J.L., M.T.J. and J.V. organized data collections, obtained funding, supervised the study. R.L. and O.T.R. initiated the study, designed the study, supervised the study and contributed to the writing of the paper.

Corresponding authors

Correspondence to Riitta Lahesmaa or Olli T. Raitakari.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementry file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bhosale, S.D., Moulder, R., Venäläinen, M.S. et al. Serum Proteomic Profiling to Identify Biomarkers of Premature Carotid Atherosclerosis. Sci Rep 8, 9209 (2018). https://doi.org/10.1038/s41598-018-27265-9

Download citation

Received: 16 February 2018
Accepted: 31 May 2018
Published: 15 June 2018
DOI: https://doi.org/10.1038/s41598-018-27265-9

This article is cited by

The serum proteome of VA-ECMO patients changes over time and allows differentiation of survivors and non-survivors: an observational study
- Patrick Malcolm Siegel
- Bálint András Barta
- Philipp Diehl
Journal of Translational Medicine (2023)
Benchmarking tools for detecting longitudinal differential expression in proteomics data allows establishing a robust reproducibility optimization regression approach
- Tommi Välikangas
- Tomi Suomi
- Laura L. Elo
Nature Communications (2022)
Trans-ethnic association study of blood pressure determinants in over 750,000 individuals
- Ayush Giri
- Jacklyn N. Hellwege
- Todd L. Edwards
Nature Genetics (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Discovery of novel biomarkers for atherosclerotic aortic aneurysm through proteomics-based assessment of disease progression

Quantitative proteomic analysis of human serum using tandem mass tags to predict cardiovascular risks in patients with psoriasis

A urinary peptidomics approach for early stages of cardiovascular disease risk: The African-PREDICT study

Introduction

Results

Discovery phase of premature carotid atherosclerosis biomarkers

Machine learning classification

SRM Verification

Discussion

Methods

Study Population and design

Carotid intima-media thickness measurement

ApoE genotype determination

Sample preparation

Immunodepletion of high abundant proteins

Preparation of undepleted serum

Mass spectrometry analysis

Discovery phase

Verification

Data processing

Protein informatics analysis (Discovery phase)

SRM data analysis and transition selection

Statistical analysis

Reproducibility-optimized test statistic (ROTS)

Machine learning classification

MSStats

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Supplementry file

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

The serum proteome of VA-ECMO patients changes over time and allows differentiation of survivors and non-survivors: an observational study

Benchmarking tools for detecting longitudinal differential expression in proteomics data allows establishing a robust reproducibility optimization regression approach

Trans-ethnic association study of blood pressure determinants in over 750,000 individuals

Comments

Search

Quick links