Renal cell carcinoma (RCC) is difficult to diagnose at early stages due to a lack of clear clinical symptoms. When symptoms do occur, about 30% of patients already have metastatic disease. In addition, a similar part of patients with resection of localised disease will have a recurrence.1 Therefore, a need remains for reliable markers for diagnosis and follow-up of RCC, preferably in easy-accessible body fluids. Surface Enhanced Laser Desorption Ionisation-Time of Flight Mass Spectrometry (SELDI-TOF MS)2, 3 is being increasingly used to search for new and better tumour markers in (serum) protein profiles, for example, for ovarian,4 breast,5 prostate6 and colorectal cancer.7 Its appeal lies in the ease with which a multitude of samples can be analysed with a minimum of sample preparation in a single SELDI-TOF MS analysis. Indeed, SELDI-TOF MS has also been performed to identify biomarker proteins for early detection of RCC. Two studies describe protein profiling of serum with SELDI-TOF MS.8, 9 Tolson et al8 reported a peak cluster at a mass-to-charge ratio (m/z) of approximately 11 000 from serum amyloid
-1 and variants, which was 100% specific for RCC compared to HC, although not very sensitive (detection in eight of 25 patients). Five m/z values of 3900, 4107, 4153, 5352 and 5987 were reported by Won et al9 in a classification tree separating RCC from healthy and non-RCC urologic disease, with sensitivities and specificities ranging from 80 to 100%.
Although these results are promising, concerns have risen regarding reproducibility of protein profiles and the nature of detected biomarkers.10, 11, 12, 13, 14, 15, 16, 17 For the same cancer types, different biomarkers have been found by different research groups.13, 18 Yet, reproducible protein profiles can hardly be presumed when using different assay procedures or sample handling, or when samples from a patient population with other characteristics are used.19 To ensure that supposed biomarker proteins are not due to chance or bias and are robust enough for detection in different laboratories, validation with an independent sample set originating from a different institute, but handled using standard analytical procedures, is imperative. Yet, only few groups have performed such validation. Attempts to validate biomarker proteins for ovarian and breast cancer with samples from other institutes have been reported, although not all with similar success.4, 20, 21 For prostate cancer, a biomarker validation project is ongoing,22, 23 also standardising analyses among different laboratories. However, the RCC serum protein profiles in the two above-mentioned studies have not been validated in other laboratories or with new populations, leaving the question of their validity and robustness. Thus, instead of starting a new search for RCC biomarkers using SELDI-TOF MS, we attempted to move forward and validate the ones postulated by previous studies. We applied the previously developed analytical protocols8, 9 to two distinct sample sets of patients and controls from two different institutes and assessed robustness and validity of reported RCC serum protein profiles.
Materials and methods
Chemicals
All used chemicals were obtained from Sigma, St Louis, MO, USA, unless stated otherwise.
Patient Samples
Our first set of samples (set 1) consisted of sera from 40 patients with renal cell carcinoma and 32 healthy controls (HC) obtained from the Netherlands Cancer Institute, Amsterdam, The Netherlands. The second set (set 2) consisted of sera from 26 patients with renal cell carcinoma and 27 HC obtained at the University Medical Centre in Utrecht. All serum samples originated from a serum bank, where they had been collected according to institutional protocols. Sample collection was performed with individuals' informed consent after approval by the institutional review boards.
Protein Profiling
The two serum sample sets were analysed at separate time points in our laboratory. Analyses for both sample sets were repeated after storage at -20°C for a year. A scheme of the experimental set-up is shown in Figure 1. For quality control, we analysed two separate pools of HC sera from set 1 and set 2, respectively, on seven chips in each analysis. Assay procedures were identical to those reported,8, 9 using CM10 chips with a weak cation exchange chromatography and as matrix a 50% solution of sinapinic acid (SPA) for Tolson's analysis and a saturated solution of
-cyano-4-hydroxycinnamic acid (CHCA) for Won's analysis. Only the albumin depletion for Tolson's method was performed slightly different, using Cibacron Blue spin columns from a different manufacturer (Ciphergen Biosystems Inc., Freemont CA, USA). Briefly, serum was denatured and depleted from albumin according to manufacturer's instructions. Equivalent quantities of denatured serum to Tolson's were loaded onto these columns, taking into account the different binding capacity for albumin. Protein chips were analysed using the PBS-IIc ProteinChip Reader (Ciphergen Biosystems). Data acquisition parameters in our experiments were optimised for detection of the reported biomarkers. Using Tolson's method, spectra were collected to 80 kDa with focus mass kept at 8000 Da according to reported procedures. M/z values for the detected proteins were calibrated externally with a standard peptide mixture (Ciphergen Biosystems) containing [Arg8] vasopressine (1084.3 Da), somatostatine (1637.9 Da), dynorphine (2147.5 Da), ACTH (2933.5 Da), insulin
-chain (bovine) (3495.9 Da), insulin (human recombinant) (5807.7 Da) and hirudin (7033.6 Da).
Figure 1.
Experimental set-up. Two separate samples sets were analysed according to two previously published methods. In addition, a modified version of one of the methods was used to yield better peaks (see Results section).
Full figure and legend (18K)Statistics and Bioinformatics
Data were analysed with the ProteinChip Software package, version 3.1 (Ciphergen Biosystems). Per sample set, all acquired spectra were compiled and analysed as a whole. Spectra were baseline subtracted and normalised to the total ion current from 1500 Da to the spectrum's end. Spectra with normalisation factors above 2.00 or lower than 0.50 were excluded from clustering. Biomarker Wizard (BMW) software (Ciphergen Biosystems) was used for peak clustering. Peaks were autodetected when occurring in at least 30% of spectra and with first and second pass signal-to-noise (S/N) of 5 and 2, respectively, in a 0.3% cluster mass window. In case of no autodetection of peaks of interest, peak detection was performed manually and the BMW applied including these user-detected peaks. Mean peak intensities between groups were compared by nonparametric statistical tests with P<0.01 defined as statistically significant. The mean coefficient of variation (CV) in each sample set was calculated with replicate peak intensities from the quality-control samples. Biomarker Patters Software (BPS; Ciphergen Biosystems) was used to generate classification trees for patients and controls.
Protein Identification
The identity of proteins of interest was elucidated using immunocapture with appropriate antibodies. For confirmation of the 11-kDa peak identities, the same mouse monoclonal antibody to serum amyloid-
(SAA) was used as reported (Abcam Ltd., Cambridge, UK). In short, antibody was coupled to protein A ceramic HyperD beads (Pall/Biosepra, Saint-Germain-en-Laye, France) and washed thrice with phosphate-buffered saline (PBS). After a 30-min incubation of serum diluted in PBS, the unbound fraction was collected. Beads were washed five times with PBS and once with deionised water. Finally, bound proteins were eluted from the beads with 0.1 M acetic acid. Unbound fractions as well as eluates were profiled both on gold chips or NP20 chips and on the original CM10 chip surface.
Measurement of Serum SAA Levels
Results of protein profiling for SAA were validated by ELISA using a commercially available kit (Tridelta Development Ltd, Maynooth, Ireland). All sera were analysed in duplicate according to manufacturer's instructions.
Results
Patient Samples
Patient and sample characteristics are summarised in Table 1. Two distinct patient populations were gathered: In our first sample set, most samples had been drawn before surgery and patients had mostly early-stage disease, as was the case for the samples in the published studies. In contrast, many samples from set 2 were from patients with metastatic stage-IV disease who had undergone nephrectomy. This enabled us to assess the abundance of the previous markers across several disease stages and the influence of primary tumour resection on their presence. No group of patients with other urological diseases was included in our analysis. Sample collection procedures differed between the institutes regarding clotting time and storage temperature (Table 1).
Table 1 - Patient and sample characteristics of sample sets 1 and 2 compared to those of Won and Tolson.
Protein Profiling: Tolson Method
In the first analysis, we obtained optimal spectra using laser intensity 142 and detector sensitivity 7, collecting an average of 165 laser shots per spot. For the second analysis a year later, acquisition parameters had to be re-optimised to laser intensity 155 and detector sensitivity 8, whereas other parameters were kept constant. On both occasions, protein patterns were highly reproducible within and between the sample sets and the two analyses. Mean peak CV's for the quality-control sera from each sample set were 20 and 30% for set 1 and 2, respectively, based on the 11 most prominent peaks common to both sample sets. Few samples had to be excluded due to aberrant normalisation factors (Table 2). Comparison of spectra from the duplicate analysis with the one from Tolson shows the presence of the same peaks (Figure 2). Results of the duplicate analysis were highly congruent, except for the 9.2-kDa peak, which was seen more frequently in the second analysis. The 11-kDa peaks were present in both our sample sets in a roughly similar patient subset as Tolson's (Table 2). In two healthy controls from set 1 and one from set 2 this peak was also visible, although with S/N<2 (Figure 3). None of the excluded spectra showed the 11-kDa peaks. The peak at 10.85 kDa, also indicated as a potential biomarker by Tolson, was found in very few of our patients (Table 2). The relative abundances of Tolson's biomarkers in our patients in relation to their surgery status and disease stage are summarised in Table 3. Also for some other (non-biomarker) peaks, the frequencies we found were lower than reported, such as for the 9.2-kDa peak from haptoglobin in the first analysis and for the peaks at 15.8 and 15.1 kDa (
- and
-globin).
Figure 2.
Spectra parts generated according to Tolson's procedure. (a) Samples from set 1 and set 2 are shown. The 11-kDa peaks are visible in a subset of the RCC sera (arrow), but not in those from HC. (b) Comparison of one of our spectra with the one reported by Tolson et al. Nearly all peaks present in the Tolson spectrum were visible in our spectrum (arrows), although some with lower peak intensities, which is possibly due to the fact that the published spectrum was acquired with extra high laser intensity for peaks >10 kDa. Tolson spectrum reprinted by permission from Macmillan Publishers Ltd: [Lab Invest],8 copyright 2004.
Full figure and legend (46K)Figure 3.
Detection of the 11-kDa cluster in healthy controls (HC) from sample sets 1 and 2. The straight line represents the noise level.
Full figure and legend (28K)Table 3 - Abundances of Tolson peaks (peak height 'averages' of duplicate analysis and concentrations measured by ELISA) and patient characteristics.
When applying the BMW application to these spectra, the 11-kDa peaks came up as most significantly different in both analyses and both sample sets (P<0.000). However, as these peaks only occur in a subset of RCC patients, their sensitivity for RCC detection is limited. Therefore, we also evaluated other discriminating peaks that were present in all patients and/or controls. Peaks at 8.7, 14.1, 17.3 and 28 kDa were consistently decreased in patients in both analyses and both sample sets (P<0.000, except for 14.1 kDa in set 2: P<0.03). Classification trees were constructed with BPS, but none of the above-mentioned discriminatory peaks, nor any other peaks yielded trees with suitable sensitivity and specificity for both sample sets (data not shown).
Protein Profiling: Won Method
Figure 3 shows spectra from the optimisation of laser intensity and detector sensitivity for the Won peaks at 3.90, 4.10, 4.15, 5.35 and 5.99 kDa. In the first analysis, we obtained optimal spectra using laser intensity 135 and detector sensitivity 7, collecting an average of 165 laser shots per spot. Again, for the second analysis, re-optimisation of acquisition parameters was needed, resulting in laser intensity 128 and detector sensitivity 7 keeping other parameters constant. On both occasions, protein patterns were highly reproducible across the sample sets and the two analyses. However, many spectra were 'empty' and had to be excluded from further data analysis (Table 4). We noted that this was related to the presence of a glassy layer on the chip spot for these samples. This layer is probably due to lack of a washing step with water in the procedure, leaving salt remnants on the chip that cause ion suppression. Owing to this, only two quality-control samples from set 1 and three from set 2 were assessable. Mean peak CVs of the seven most prominent peaks in these spectra common to both sample sets were 60 and 23% for set 1 and 2, respectively. Our peak intensities for the peaks of interest were markedly lower than the intensity values reported by Won, except for the peak at 4.10 kDa (Table 5, original procedure and Figure 4). Varying acquisition parameters did not result in higher intensities for these peaks. As a result, only the 4.10-kDa peak met the criterion of an S/N >5 for clustering and the others had to be manually detected. Although the mean peak intensities from the two analyses were comparable, there were some differences between the first and second analysis regarding the group with the highest mean peak intensity and the observed P-values. None of the Won peaks was significantly different at the 0.01 level in our analyses (Table 5, original procedure). Some of the peaks did show a (nonsignificant) expression difference between patients and controls, but this was only observed in one of the analyses (m/z 4097, 5348), or was discordant with the expression difference reported by Won (m/z 5350, 5991). The lack of an expression difference in patients and controls for m/z 4153 was in agreement with Won's result, however (Figure 5).
Figure 4.
Optimisation of SELDI-TOF MS acquisition parameters for detection of the Won peaks (1–5). Two RCC samples were used for optimisation (A and B). Both laser intensity and detector sensitivity were optimised separately.
Full figure and legend (115K)Figure 5.
Classification tree reported by Won et al. The primary node includes the peak at 4107lDa, which roughly separates between patients (left branch) and controls (right branch). Reprinted with permission from Wiley-VCH verlag GmbH & Co KG.
Full figure and legend (22K)Table 5 - Observed peak expression differences detected in our sample sets with Won's method (original procedure) and a slightly modified procedure.
In an attempt to obtain better intensities for the peaks of interest, we also used a slightly modified assay for these samples (single analysis). Instead of 2% 2-mercaptoethanol in the denaturation solution, 2% dithiotreïtol (DTT), a more effective reductant, was used and the protease inhibitor phenylmethanesulphonyl fluoride (PMSF) was left out. Instead of saturated CHCA, a 50% SPA solution in 50% acetonitrile (ACN)+0.5% sinapinic acid (TFA) was used as energy-absorbing matrix. We have applied this matrix solution successfully for masses below 15 kDa in other experiments, among which the Tolson procedure. SELDI-TOF MS acquisition parameters were optimised for these conditions to laser intensity 142 and detector sensitivity 7. From Figure 6 it can be seen that with the modified method, the same samples yielded higher peak intensities for m/z 3901, 4107, 4153, 5352 and 5987, with reproducible spectra among sample sets 1 and 2. In addition, many more peaks were present in the mass spectrum than with the original procedure, and few spectra did not meet normalisation criteria (Table 4). Two significant differences in agreement with Won's result could now be seen at m/z 5996 and 3894 in set 1. However, the found increase in RCC patients of a peak at 4161 Da in set 2 (P=0.004, Table 5, modified procedure) was not according to Won's results.
Figure 6.
Spectra parts generated with Won's original procedure (a) and with a slightly modified one (b). The same samples in set 1 and set 2 are shown for either assay. Peak intensities for the peaks of interest at 3901, 4107, 4153, 5352 and 5987 Da (peaks 1–5) are higher in the modified procedure.
Full figure and legend (112K)To assess whether the many patients who had undergone nephrectomy in sample set 2 could be the cause of the lack of significant expression differences, only data from the combined presurgery sera in set 1 and 2 were analysed (41 RCC and 59 HC). After normalisation, 49 (22 RCC, 27 HC) and 39 (15 RCC, 24 HC) spectra from the two analyses with the original procedure and 85 (35 RCC, 50 HC) spectra from the modified procedure were assessable. None of the peaks of interest were discriminative in the presurgery samples with the original procedure. Only in the second analysis we saw a slight difference at m/z 4159 (P=0.09, intensity HC 3.89 vs RCC 4.33), which was not in agreement with Won's result, however. In data from the modified assay, the peak at 4097 Da was seen slightly, but significantly decreased in patients compared to controls (intensity 44.8 vs 52.6, P=0.003). Although an expression difference at m/z 5996 and 3894 was still present, significance was lost in this analysis (P=0.11 and P=0.12).
Lastly, we looked for other discriminative peaks than those reported by Won. Few significant differences between the RCC and control groups were found using the original procedure. A significant peak at 8597 Da was observed, which was decreased in RCC patients (P<0.000), both with the original and modified procedure using either the combined samples from set 1 and 2, or only the presurgery ones. Considering the presurgery samples only, in both analyses following the original procedure, consistent expression differences were seen for m/z 2826 (P=0.01 and P=0.09, respectively) and 4291 (P=0.04 and P=0.01). However, with the modified procedure many more peaks differed significantly, notably m/z 4289, 3960, 8151, 6198, 3163, 8707, 4303 and 6456 (P<0.000 in the combined sample sets and in the presurgery samples).
Generating classification trees, we found some with reasonably good classification of RCC and healthy persons. The (nonsignificant) Won peaks at 4097 and 5348 Da in the second original analysis did not come up as tree classifiers, nor did the significant m/z's (5996 and 3894) in the data from the modified method (data not shown). Yet, from these data a tree could be built using m/z 4289 as single classifier with sensitivity 73.3% and specificity 66.0% upon 10-fold cross-validation. Also, m/z 8151 showed promising performance as a single classifier (cross-validation sensitivity 90.6%, specificity 81.5%). However, these peaks were good classifiers only for sample set 1. We also investigated whether the significantly different 4097-Da peak in the presurgery serum samples using the modified procedure was a useful classifier. Ten-fold cross-validation with this m/z as single classifier yielded limited sensitivity and specificity of 62 and 59%. The other significant m/z's we found by clustering were no classifiers or did not yield trees with reasonable sensitivity and specificity (data not shown).
Protein Identification
The identity of the 11-kDa cluster was confirmed by using a serum amyloid-
mouse monoclonal antibody (Abcam Ltd, Cambridge, UK). A similar peak cluster as the one in the profiling experiment was seen in the bound fraction of this antibody, both on NP20 and on CM10 chips (Figure 7). The identities of the discriminating peaks at 14.1 and 17.3 kDa in the Tolson analysis and that of 8597 Da in the Won analysis could be deduced from the protein profiling results and were confirmed by similar immunocapture experiments. Both these discriminating Tolson peaks were lost with the Won analysis. Although highly similar, these methods differ in the addition of a reducing agent to the denaturation buffer in the latter procedure. A peak cluster around 14 kDa has been described, identified as transthyretin and several isoforms, among which a glutathionylated form at 14.1 kDa.24 Similarly, from the Swiss-Prot database (http://www.expasy.org/) it is known that a 17-kDa homodimer of apolipoprotein A-II is formed by disulphide linkage of two monomers. With the Won procedure, reduction would result in loss of the glutathion group for the 14.1-kDa peak to form a mass of 13.7 kDa and the formation of two 8.5-kDa monomers from the 17-kDa dimeric protein, which was indeed the case in our results. Therefore, we performed immunocapture with a transthyretin rabbit polyclonal antibody (Abcam Ltd, Cambridge, UK) and an apolipoprotein A-II goat polyclonal antibody (Chemicon Europe Ltd, Hampshire, UK). Thus, the identity of glutathionylated transthyretin was confirmed, as was that of the apolipoprotein A-II fragment missing a terminal glutamine residue in one chain of the dimer (Tolson method, 17.3 kDa) and in the monomer (Won method, 8597 Da).
Figure 7.
Identification of the serum amyloid-
(SAA) peak cluster by immunocapture. The spectra parts successively show protein profiling on NP20 chip of the serum fraction that has not bound to the SAA-1 antibody, the fraction that has bound to this antibody on NP20 chip and on CM10 chip, the unbound serum fraction on CM10 chip and the original protein profile of whole serum on CM10 chip. Peaks 1–3 represent des-RS SAA-1 (11.4 kDa), des-R SAA-1 (11.5 kDa) and SAA-1 (11.68 kDa), respectively. The peak at 11.9 kDa (4) may be another post-translationally modified form of SAA-1. The fifth peak at 10.8 kDa may be the same as found by Tolson, although it was not reported to be a form of SAA-1.
Measurement of Serum SAA Levels
We observed good agreement between the results of SELDI-TOF MS protein profiling and ELISA regarding the abundance of SAA (Figure 8). It is also clear from this figure that protein profiling is less sensitive than ELISA for the detection of SAA. Similar to Tolson's result, SAA levels below 20
g/ml were generally not detectable with SELDI-TOF MS in our analyses. However, in several RCC samples, the 11-kDa peaks were detectable despite a measured SAA concentration below 10
g/ml (Table 3). Most HC had SAA levels below 10
g/ml, except for 6 samples with levels between 10 and 25
g/ml.
Figure 8.
Correlation of serum levels of SAA as measured by ELISA with peak intensities of the 11.68-kDa peak from SELDI-TOF MS protein profiling.
Full figure and legend (10K)Discussion
Serum protein profiling of RCC was performed to assess the validity and robustness of previously reported biomarker proteins for this type of cancer. We were able to detect the discriminating cluster of serum amyloid-
peaks described by Tolson et al in about the same subset of RCC patients in both sample sets. Furthermore, the protein profiling result was validated with an ELISA for SAA. The observed expression difference between patients and controls was detected despite differences in sample handling and patient characteristics compared to Tolson's samples. Moreover, it was stably present a year after the first analysis. This indicates SAA is a robust biomarker candidate for RCC, which is also supported by the fact that we found it not only with the assay from Tolson, but also with both the original and modified assay from Won, implying that albumin depletion does not influence the presence of this peak cluster. However, for some other peaks, we found markedly lower frequencies in both analyses, such as for the postulated biomarker of 10.85 kDa. As we did detect it in some of the patients, it is unlikely that the lower frequency is due to the analytical procedure, but more probable, these differences reflect real biological variation in patients. The co-presence of this peak with the SAA peaks as reported by Tolson could not be confirmed in our populations. Furthermore, its postulated prognostic value suggested by its presence in four out of six patients with metastatic disease in Tolson's population does not seem to hold, as most of our metastatic patients did not have this peak.
As we found a similar number of patients with the SAA peak cluster as Tolson, that is, SAA levels >
20
g/ml, this possibly represents a specific sub-population of RCC patients. As can be concluded from Table 3, disease stage or surgical status do not account for this distinction, nor seems the abundance of this peak correlated with disease stage. A study by Le et al25 described a high sensitivity and specificity for the presence of SAA and isoforms in prostate cancer patients suffering from bone metastases, but this does not explain the abundance of these peaks in some of our patients with early-stage disease. The cluster of SAA peaks has also been mentioned in several other reports. In a study in ovarian cancer it was found in about half of the patients,26 and a study in nasopharyngeal cancer27 showed an increase of SAA in
70% of these patients and in
40% of lung cancer patients. These peaks are thus not specific for RCC, but might reflect a process that occurs only in a subset of these cancer patients. Despite that we found some other expression differences in both sample sets, these were not large enough for good classification of patients and controls. Furthermore, both a decrease of a 17.3-kDa peak (possibly des-glutamine apolipoprotein A-II) and glutathionylated transthyretin have been described in the sera from patients with other cancers before24, 28, 29, 30 and thus they seem not specific for RCC either. Yet, these proteins might have a role as markers in disease or therapy monitoring.
We experienced difficulties in the analysis with the Won procedure. Many spectra were not assessable, possibly because of ion suppression caused by salt remnants on the chip. Moreover, peak intensities for the peaks of interest were generally much lower than those reported, despite optimisation of MS acquisition parameters, and Won's discriminating peaks were not readily detectable. Thus, this analytical protocol seems less robust than, for example, Tolson's. Furthermore, only nonsignificant expression differences were observed with this procedure, but only in sample set 2, which was the least comparable to Won's. The only similarity with Won's data for this procedure was a lack of an expression difference for m/z 4153. A slightly modified method, including a better reductant and omission of protease inhibitor in the denaturation buffer, as well as a different energy-absorbing matrix generally improved peak detection. Moreover, much fewer spectra suffered from ion suppression. Although also in this modified procedure a final washing step with water was lacking, the use of a different matrix might have precluded ion suppression effects. Using this method, the observed peak differences proved reflective of the patient characteristics: only in sample set 1, most comparable to Won's, we found two of the previously described expression differences. Yet, in the combined presurgery sera from set 1 and set 2, these expression differences became statistically insignificant. Possibly, other patient characteristics than nephrectomy status are responsible for this discrepancy, for example disease stage. Indeed, all presurgery sera in set 2 were from patients with metastatic disease, whereas in set 1 these included 20 patients with RCC stages I–III. In addition, with this modified method, only for the combined presurgery sera we found a peak at 4097 Da higher in controls compared to patients, as reported by Won. However, the difference in mean peak intensities between patients and controls was only slight, a marked difference compared to Won's result. Therefore, despite the observed expression difference, this peak was not identified by the BPS as a main classifier for the classification tree, nor was it able to split patients and controls correctly when manually selected as the main classifier. Thus, we could not reproduce the classification performance of this peak that was suggested by Won's classification tree.
For a peak at 4161 Da, we found an expression difference in sample set 2 with the modified procedure, but not in set 1, nor in the combined presurgery sera. This protein (increased in non-RCC disease vs RCC according to Won's classification tree) might be a specific inflammatory protein that is neither deregulated in RCC nor in health, which is concordant with the similar mean peak intensities for patients and controls mentioned by Won (8.442
4.496 and 9.192
6.746, respectively9). The increase of this peak in RCC that we saw in sample set 2 might be due to differences in patient characteristics or sample handling compared to Won's samples. Interestingly, a more recent report on serum protein profiling for RCC also describes the increase of a peak at m/z 4151 in RCC patients compared to controls.31 This peak was observed under similar assay conditions and thus may well be the same protein as Won's. Its performance to detect RCC in stage I was reported lower than for later stages.31 This could explain why we only saw a significant difference in sample set 2 (mostly stage-IV disease) and not in set 1 or the presurgery sera (mainly early-stage samples), and, if Won and co-workers also evaluated earlier disease stages, why they did not find this difference.
Altogether, some of Won's postulated biomarkers seem prone to lack of reproducibility (at least with the original method) and lack of validity. Although differences with our results might be caused by different sample handling conditions or other disease stages, this at least stresses the necessity of careful control of these parameters if the Won proteins were to be used as biomarkers. Furthermore, the few assessable spectra with the original procedure may have hampered our power to detect the expression differences described previously. However, the 4107-Da peak splitting patients and controls in the first node of Won's classification tree is only based on a total of six controls and 15 patients, which is even less than our assessable individuals. With the modified method, we could evaluate many more spectra, but the expression difference at m/z 4107 that we found for the presurgery samples was not large enough for correct classification of patients and controls. Thus, the classification tree reported by Won might well be the result of overfitting of data. For five peaks to fit a population of 36 samples in three groups is a rather high number, as peaks in the lower branches of a classification tree are discriminative for successively lower numbers of samples, thus more likely influenced by outliers or artefacts in the data. Possibly, the other discriminative m/z's at 4289 and 8151 that we found with this procedure have more potential as biomarkers for RCC. A decrease of a peak at 4289 Da has been mentioned for several other cancers,32, 33 one group attributing this mass to a fragment of inter-
trypsin inhibitor heavy chain 4.33 Also, peaks with an m/z similar to 8151 have been reported as biomarker candidates.6, 34 However, the consistent decrease of these peaks in RCC remains to be established in new sets of presurgery serum samples including patients with early-stage disease. In addition, their identity and their specificity for RCC or cancer in general must be addressed.
Considering our goal to validate previously reported biomarkers for RCC, lack of robustness was the first hurdle to take, especially for the Won analysis. It is unlikely that chip characteristics, (manual) sample processing or instrument integrity were the cause of this, as the spectra that were assessable with both the original and modified analytical protocol from Won, and those from the Tolson protocol did show reproducible profiles across the sample sets and the duplicate analyses. More probably, critical steps in the analytical protocol were the cause, such as the use of a good reductant (DTT) and suitable energy-absorbing matrix (SPA). Furthermore, duration of all steps might prove critical: as DTT reduces full-length haptoglobin, resulting in the release of the 9.2-kDa 1-
chain, the lack of detection of this peak in the first Tolson analysis may have been caused by incomplete reduction of the full-length protein. Because of this potential sensitivity of the technique to such subtle changes, the structural characterisation of any postulated biomarkers and development of more quantitative assays are needed for routine clinical analyses. The second hurdle in validation is related to the law of large numbers. Even when peaks are reproducibly detected, failure to validate expression differences between patients and controls may still occur as a result of limited sample size: too few samples could result in insufficient statistical power to identify a significant difference for the alleged biomarkers. Alternatively, statistically significant differences found previously could rather be the result of chance or bias, in which case former data were overfitted. These problems seem likely to have occurred with Won's result, which was based on very few samples. However, we cannot rule out that this was also the case for our data, as our sample size was still limited.
It might be argued that the only expression difference we could validate was that of the acute phase protein SAA, and that this is due to the extreme increase of this protein during acute phase reactions, which makes detection with SELDI-TOF MS more feasible. However, the change in protein expression detectable with SELDI-TOF MS does not so much depend on the size of this change, as on the concentration range in which it occurs. As for all equilibrium reactions, the binding of proteins such as SAA to the chromatographic chip surface describes an S-curve. If an expression difference occurs in the linear range of this S-curve, a rather small increase can result in binding of much more protein and thus a large change in SELDI-TOF MS peak intensity. Alternatively, a small change in peak intensity might represent a large expression difference, when this happens in the plateau regions of this S-curve. Unfortunately, in complex matrices such as serum, one can never be sure when to be in the linear or plateau range, due to competition for binding sites from other proteins (which causes a shift of the S-curve to the right). Furthermore, the more abundant proteins, often acute phase proteins, compete and bind preferably to the chip, causing limited sensitivity of SELDI-TOF MS for less abundant proteins compared to antibody assays such as ELISA. Thus, the ability to detect expression differences with SELDI-TOF MS depends more on the abundance of the protein under investigation than on the extent of the expression difference.
Concluding, we show that, depending on the analytical protocol, some of the alleged RCC biomarkers were difficult to reproduce. From the detectable candidate biomarkers, only the increased expression in RCC of SAA and variants could be unambiguously validated in our populations, with potential specificity for a certain subset of cancer patients. However, two newly identified biomarker candidates merit further evaluation in new sample sets.
References
- Cohen HT, McGovern FJ. Renal-cell carcinoma. N Engl J Med 2005;353:2477–2490. | Article | PubMed | ChemPort |
- Fung ET, Thulasiraman V, Weinberger SR, et al. Protein biochips for differential profiling. Curr Opin Biotechnol 2001;12:65–69. | Article | PubMed | ISI | ChemPort |
- Issaq HJ, Veenstra TD, Conrads TP, et al. The SELDI-TOF MS approach to proteomics: protein profiling and biomarker identification. Biochem Biophys Res Commun 2002;292:587–592. | Article | PubMed | ISI | ChemPort |
- Zhang Z, Bast Jr RC, Yu Y, et al. Three biomarkers identified from serum proteomic analysis for the detection of early stage ovarian cancer. Cancer Res 2004;64:5882–5890. | Article | PubMed | ISI | ChemPort |
- Li J, Zhang Z, Rosenzweig J, et al. Proteomics and bioinformatics approaches for identification of serum biomarkers to detect breast cancer. Clin Chem 2002;48:1296–1304. | PubMed | ISI | ChemPort |
- Adam BL, Qu Y, Davis JW, et al. Serum protein fingerprinting coupled with a pattern-matching algorithm distinguishes prostate cancer from benign prostate hyperplasia and healthy men. Cancer Res 2002;62:3609–3614. | PubMed | ISI | ChemPort |
- Chen YD, Zheng S, Yu JK, et al. Artificial neural networks analysis of surface-enhanced laser desorption/ionization mass spectra of serum protein pattern distinguishes colorectal cancer from healthy population. Clin Cancer Res 2004;10:8380–8385. | Article | PubMed | ISI | ChemPort |
- Tolson J, Bogumil R, Brunst E, et al. Serum protein profiling by SELDI mass spectrometry: detection of multiple variants of serum amyloid alpha in renal cancer patients. Lab Invest 2004;84:845–856. | Article | PubMed | ISI | ChemPort |
- Won Y, Song HJ, Kang TW, et al. Pattern analysis of serum proteome distinguishes renal cell carcinoma from other urologic diseases and healthy persons. Proteomics 2003;3:2310–2316.
- Baggerly KA, Morris JS, Coombes KR. Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments. Bioinformatics 2004;20:777–785. | Article | PubMed | ISI | ChemPort |
- Diamandis EP. Re: diagnostic potential of serum proteomic patterns in prostate cancer. J Urol 2004;171:1244–1245.
- Diamandis EP, van der Merwe D-E. Plasma protein profiling by mass spectrometry for cancer diagnosis: opportunities and limitations. Clin Cancer Res 2005;11:963–965.
- Diamandis EP. Point: proteomic patterns in biological fluids: do they represent the future of cancer diagnostics? Clin Chem 2003;49:1272–1275. | Article | PubMed | ChemPort |
- Petricoin EF, Liotta LA. Counterpoint: the vision for a new diagnostic paradigm. Clin Chem 2003;49:1276–1278.
- Baggerly KA, Morris JS, Edmonson SR, et al. Signal in noise: evaluating reported reproducibility of serum proteomic tests for ovarian cancer. J Natl Cancer Inst 2005;97:307–309.
- Liotta LA, Lowenthal M, Mehta A, et al. Importance of communication between producers and consumers of publicly available experimental data. J Natl Cancer Inst 2005;97:310–314.
- Sorace JM, Zhan M. A data review and re-assessment of ovarian cancer serum proteomic profiling. BMC Bioinformatics 2003;4:24. Available from URL: http://www.biomedcentral.com/1471-2105/4/24. | Article | PubMed |
- Diamandis EP. Mass spectrometry as a diagnostic and a cancer biomarker discovery tool: opportunities and potential limitations. Mol Cell Proteomics 2004;3:367–378. | PubMed | ChemPort |
- Grizzle WE, Meleth S, Petricoin EF, et al. Clarification in the point/counterpoint discussion related to surface-enhanced laser desorption/ionization time-of-flight mass spectrometric identification of patients with adenocarcinomas of the prostate. Clin Chem 2004;50:1475–1477.
- Li J, Orlandi R, White CN, et al. Independent validation of candidate breast cancer serum biomarkers identified by mass spectrometry. Clin Chem 2005;51:2229–2235.
- Mathelin C, Cromer A, Wendling C, et al. Serum biomarkers for detection of breast cancers: a prospective study. Breast Cancer Res Treat 2006;96:83–90.
- Grizzle WE, Adam BL, Bigbee WL, et al. Serum protein expression profiling for cancer detection: validation of a SELDI-based approach for prostate cancer. Dis Markers 2003;19:185–195.
- Grizzle WE, Semmes OJ, Basler J, et al. The early detection research network surface-enhanced laser desorption and ionization prostate cancer detection study: a study in biomarker validation in genitourinary oncology. Urol Oncol 2004;22:337–343.
- Fung ET, Yip TT, Lomas L, et al. Classification of cancer types by measuring variants of host response proteins using SELDI serum assays. Int J Cancer 2005;115:783–789. | Article | PubMed | ISI | ChemPort |
- Le L, Chi K, Tyldesley S, et al. Identification of serum amyloid A as a biomarker to distinguish prostate cancer patients with bone lesions. Clin Chem 2005;51:695–707. | PubMed | ChemPort |
- Moshkovskii SA, Serebryakova MV, Kuteykin-Teplyakov KB, et al. Ovarian cancer marker of 11.7 kDa detected by proteomics is a serum amyloid A1. Proteomics 2005;5:3790–3797.
- Cho WC, Yip TT, Yip C, et al. Identification of serum amyloid a protein as a potentially useful biomarker to monitor relapse of nasopharyngeal cancer by serum proteomic profiling. Clin Cancer Res 2004;10:43–52.
- Langlois RG, Trebes JE, Dalmasso EA, et al. Serum protein profile alterations in hemodialysis patients. Am J Nephrol 2004;24:268–274.
- Honda K, Hayashida Y, Umaki T, et al. Possible detection of pancreatic cancer by plasma protein profiling. Cancer Res 2005;65:10613–10622.
- Yu JK, Chen YD, Zheng S. An integrated approach to the detection of colorectal cancer utilizing proteomics and bioinformatics. World J Gastroenterol 2004;10:3127–3131. | PubMed | ChemPort |
- Hara T, Honda K, Ono M, et al. Identification of 2 serum biomarkers of renal cell carcinoma by surface enhanced laser desorption/ionization mass spectrometry. J Urol 2005;174:1213–1217.
- Banez LL, Prasanna P, Sun L, et al. Diagnostic potential of serum proteomic patterns in prostate cancer. J Urol 2003;170:442–446. | Article | PubMed | ChemPort |
- Koomen JM, Shih LN, Coombes KR, et al. Plasma protein profiling for diagnosis of pancreatic cancer reveals the presence of host response proteins. Clin Cancer Res 2005;11:1110–1118. | PubMed | ChemPort |
- Gourin CG, Xia ZS, Han Y, et al. Serum protein profile analysis in patients with head and neck squamous cell carcinoma. Arch Otolaryngol Head Neck Surg 2006;132:390–397.
MORE ARTICLES LIKE THIS
These links to content published by NPG are automatically generated
RESEARCH
Laboratory Investigation Research Article
Molecular Psychiatry Original Article
Molecular Psychiatry Original Article
Oncogene Original Article

