Main

Head and neck squamous cell carcinoma (HNSCC) is the fifth most common malignancy worldwide, representing a major international health problem.1 The 5-year survival rate for HNSCC (ca.50%) has improved only marginally over the past decade; as a result, it is estimated that 45 641 cases and 11 210 deaths will occur in 2007 in the United States from HNSCC.2 These tumors constitute an anatomically heterogeneous group of neoplasms arising from the oral cavity, oropharynx, hypopharynx, larynx and nasopharynx. The initial anatomic site of HNSCC is highly correlated with early metastasis3, 4, 5 and treatment planning depends largely on anatomical staging of the disease at presentation. Conventional treatment with surgery and/or radiation therapy, and/or chemotherapy, is associated with significant morbidity, affecting speech, swallowing and overall quality of life. Despite these interventions, recurrence of the disease is observed in about 50% of patients, locally, regionally or at a distant site, with high rates of associated mortality.6 Treatment failures for HNSCC can be attributed to multiple factors, but remain difficult to predict.

Although several biomarkers have been associated with HNSCC, none as yet have proven to be useful clinically; that is, no biomarker is used to guide treatment selection at initial diagnosis. For example, studies of individual biomarkers such as p53, EGFR, Bcl-2, MMPs, cyclins and molecular markers have demonstrated inconsistent, and at times contradictory, results.7, 8, 9 Molecular characterization of this genetically complex disease has provided some insight into individual genetic abnormalities that contribute to tumor progression.10 Tumors originating from different locations can exhibit varying behavior that is not predictable by histopathology of the primary tumor but is discernable by gene profiling.11 Our ability to provide effective treatment will depend upon our ability to determine that surgical margins are free of tumor, and on our ability to predict tumor behavior, such as, metastatic potential, potential for local recurrence and response to therapy.12 The goal of this study was to establish a method for detecting biologically significant differences in protein expression in head and neck tumors, using the same samples utilized in analyses of gene expression. This strategy would minimize sampling error and enhance integration of data sets for future analysis of protein, mRNA, DNA methylation and tissue microarray data.

Therefore, we developed a method to take advantage of the total protein mixture remaining after the extraction of total RNA from the TRIzol extraction procedure. The method employs offline strong cation exchange (SCX) and liquid chromatography-mass spectrometry (LC-MS), to find biologically distinct peptide ions, based on charge, hydrophobicity, mass and signal intensity. These peptide ions can be identified subsequently by targeted analysis, using the inclusion list for tandem mass spectrometry (MS/MS). In order to be able to achieve the goal of linking protein profiles to patient outcome or treatment responsiveness in an HNSCC cell carcinoma study, we tested the method for reproducibility. The data presented below demonstrate that the procedure can discriminate biological differences among complex protein mixtures derived from cultured cells, and should be applicable to analysis of other cancers and disease conditions.

Materials and methods

Cell Lines and Tissue

The cell lines obtained from the American Type Culture Collection (Manassas, VA, USA), representing two tumor sites, tongue (SCC-25, CRL-1628),13 and pharynx (FaDu, HTB-43),14 were grown under cell culture conditions as recommended by the vendor. Resection of an oral cavity HNSCC, collected at surgery by an IRB approved protocol to study HNSCC, provided both normal squamous oral mucosa and HNSCC carcinoma.

Gel Electrophoresis and Western Blot Analysis

TRIzol Reagent (Invitrogen, Carlsbad, CA, USA) can be used to extract DNA, RNA and protein from tumor tissue. To determine the quality obtained following TRIzol extraction, we compared 2D gel patterns from normal and HNSCC samples extracted either with a mix of detergents and chaotropic agents15 or with TRIzol (https://www.invitrogen.com/content/sfs/manuals/15596026.pdf). One-dimensional SDS-PAGE and two-dimensional gel electrophoresis (2D gels) were performed by following standard procedures used in other studies.15 Protein concentration in cell lysates was determined using Pierce's BCA Protein Assay reagents (Rockford, IL, USA). Western blotting was carried out as described previously, using mouse monoclonal anti-human annexin II antibody (BD Biosciences, San Jose, CA, USA), mouse monoclonal anti-human vimentin antibody (BD Biosciences, San Jose, CA, USA) and goat anti-mouse Ig secondary antibody conjugated with horseradish peroxidase (Southern Biotech, Birmingham, AL, USA). To summarize: Western blots were blocked in 1 × TBS/T (20 mM Tris, pH 7.6, 137 mM NaCl, 0.1% Tween 20 containing 5% non-fat dry milk), then incubated for 1 h at room temperature in blocking solution containing mouse monoclonal anti-annexin II at a dilution of 1:5000. After washing in 1 × TBS/T, blots were incubated for 1 h at room temperature in blocking solution containing goat anti-mouse horseradish peroxidase-conjugated secondary antibody at 1:5000. Bands were visualized after further washes using Pierce's ECL Western Blotting Substrate (Rockford, IL, USA). Western blotting for vimentin followed the same protocol; mouse monoclonal anti-vimentin antibody (BD Biosciences, San Jose, CA, USA) was used at a dilution of 1:2500.

Preparation of Protein Digests

To optimize the protocol before applying to human tumor tissue, proteins from two HNSCC cell lines, SCC-25 and FaDu,14 were extracted using the TRIzol protocol. The protein pellet derived from the TRIzol procedure was washed three times with 0.3 M guanidine HCl in 95% ethanol, each time for 20 min at room temperature, followed by a final wash with 90% isopropanol. This final wash removed any color remaining from the TRIzol that interfered with the LC-MS measurement, and removed any residual guanidine HCl. The pellet was then dissolved in 70% trifluoroacetic acid (TFA; Pierce, Rockford, IL, USA), Tris(2-carboxyethyl)-phosphine hydrochloride (TCEP; Pierce, Rockford, IL, USA) added to 50 μM to reduce disulfide bonds, and cleaved with 20 mM cyanogen bromide (CNBr; Sigma-Aldrich, St Louis, MO, USA) for 18 h at room temperature under nitrogen in the dark. The reaction mixture was evaporated under nitrogen, and the protein pellet resuspended in 100 mM ammonium bicarbonate, pH 8.5 (Sigma-Aldrich, St Louis, MO, USA), with sonication for 30 min at room temperature. To improve solubility, solid urea (Fluka, Sigma-Aldrich, St Louis, MO, USA) was added to make the final urea concentration 8 M. The mixture was first digested with 300 ng of endoproteinase Lys-C, 1.5 ng/μl (Roche, Indianapolis, IN, USA) at 37°C for 18 h. The solution was then diluted to reduce the urea concentration to 4 M with 100 mM ammonium bicarbonate, pH 8.5, calcium chloride (JT Baker, Phillipsburg, NJ, USA) was added to a concentration of 1 mM, followed by the addition of 15 μl of washed immobilized trypsin bead suspension (Pierce, Rockford, IL, USA), and incubated at 37°C for 18 h. After dilution of the sample to 2 M urea with 100 mM ammonium bicarbonate, pH 8.5, additional trypsin (15 μl) was added and the mixture was incubated for an additional 4–6 h at 37°C. Following digestion, the immobilized trypsin was removed from the digest mixture by centrifugation (16 000 r.p.m., 1 min) and the pH of the sample adjusted to 3 using formic acid (Acros Organics USA, Morris Plains, NJ, USA). The polypeptide content of the sample was measured using a Nanodrop spectrophotometer and a 100 μg aliquot was removed, and acetonitrile (Fisher, Fairlawn, NJ, USA) added to 25% v/v.

Offline SCX LC-MS Method Development

Since reproducibility in sample processing and analysis is key to the reliability of results, internal and external controls were used during method development. Pilot studies were carried out with serine acetyltransferase from Haemophilus influenzae. The initial ion-exchange separation was best performed offline, because of the large volumes. Thus, peptide mixtures obtained from the digestion were separated into 10 fractions using ion-exchange spin columns, as noted below. In addition, a mixture of four standard peptides (with different retention times and masses) was added to each salt cut fraction as an internal control and analyzed using LC-MS on a QqTOF mass spectrometer equipped with a microionspray source. The peptides served as markers for retention time alignment of the LC-MS spectra and used for correcting mass measurement error drift. The mass spectrometer was calibrated every day and instrument performance (retention time and sensitivity) was determined using our quality control 16-peptide mixture before sample analysis.16 Peak picking, LC alignment and intensity normalization procedures were verified manually in comparison to the raw data and internal standards.

Approximately 100 μg of the sample was loaded on 50 mg of polysulfoethyl A resin (12 μm, 300 Å, PolyLC, Columbia, MD, USA) in a spin column and eluted with 200 μl of ammonium acetate pH 3, stepwise, at salt concentrations of 0, 10, 20, 30, 40, 60, 100, 200, 300 and 600 mM, in 0.1% formic acid, 25% acetonitrile. To reduce the acetonitrile concentration before reversed-phase HPLC separation, 40 μl of each fraction was diluted to 100 μl with 0.1% formic acid. A mixture of four synthetic peptides (200 fmol each, used as internal standards) with different masses and retention times was added to every sample before injection. Sequences and monoisotopic masses of the internal standard peptides are as follows: VFLQYLKN (1023.6 Da), VFL*QYL*KN (1025.6 Da), KYIPGTK (805.5 Da) and NFL*QYL*KD (1041.5 Da), where L* is the 15N-labeled amino acid. These standard peptide peaks were used for retention time alignment of the HPLC chromatograms in data analysis and mass measurement error drift correction.

After the addition of internal standard peptides, samples were separated by reversed-phase chromatography, using an LC Packings Ultimate HPLC System (Dionex Corporation, Sunnyvale, CA, USA), and analyzed using a QStar Pulsar i mass spectrometer (AB/Sciex, Foster City, CA, USA). HPLC-grade acetonitrile (CH3CN, Fisher, Fairlawn, NJ, USA), 18 Mohm Milli-Q water (Millipore, Bedford, MA, USA) and high-purity formic acid (Acros Organics USA, Morris Plains, NJ, USA) were used to prepare the mobile phases for LC-MS. A C18 Pepmap100 column (3 μm, 100 Å, 300 μm × 15 cm, Dionex, Sunnyvale, CA, USA) at a flow rate of 4 μl/min, with 2% CH3CN/0.1% formic acid (solvent A) and 80% CH3CN/0.1% formic acid (solvent B), was eluted with the following gradient: 0–30 min, 5% B (desalting); 30–80 min, 5–55% B; 80–90 min, 55% B; 90–100 min, 55–95% B; 100–110 min, 95% B; 110–125 min, 95–5% B; 125–135 min, 5% B. The QStar was equipped with a microionspray source using 20 μm I.D. fused silica capillary. TOF-MS was acquired in the mass range of 300 to 1800 m/z, with a scan time of 1 s. The QStar was calibrated daily using a CsI-peptide standard at 0.1 and 0.15 μM, respectively; consisting of cesium iodide (Sigma-Aldrich, St Louis, MO, USA) and the iPD1 peptide (amino-acid sequence—ALILTLVS; Bachem Bioscience Inc., King of Prussia, PA, USA) dissolved in 50% methanol (Fisher, Fairlawn, NJ, USA)/water containing 0.1% formic acid. A mixture of 16 peptides, each containing 2.5 × 10−13 M, was used as a quality control for monitoring the daily performance of the HPLC and mass spectrometer, before and after each sample set. Peptide ions identified by statistical analysis (see below) to have 90% biological variability were placed in an inclusion list for tandem mass spectrometry fragmentation. An LTQ (Thermo Scientific, Waltham, MA, USA) linear ion trap instrument was used to carry out tandem mass spectrometry measurements to identify a few peptides discriminated by the arrays. DTA files were created from the raw data, merged into one file and searched against the NCBInr database, using the Mascot search engine.

Data Analysis

The aim of our study was to determine whether the biological variability was much larger than the extraction and analytical variability, thus, providing confidence to proceed to the analysis of human tumor samples. In this experiment, three extractions were prepared from each cell line. As described earlier, the protein fraction from each of these extracts was fragmented into peptides and analyzed in triplicate by LC-MS. Use of internal and external protein and peptide standards provided relative quantitation. Each peptide in the final pattern derived from LC-MS had a characteristic mass, elution time from the reversed-phase column and elution characteristic (salt concentration) from a SCX spin column.

Data analysis was performed using an in-house software,16, 17 to build arrays with monoisotopic mass, salt cut, retention time and signal intensity. There are four steps in the process: (1) generation of peptide lists from individual LC-MS runs; (2) retention time alignment to correct any retention time shifts in the LC runs; (3) intensity normalization and (4) construction of arrays for each salt cut. For each salt cut analyzed by LC-MS, a peptide list with monoisotopic mass, retention time and intensity is first produced by performing denoising, deisotoping and charge state assignments. To remove chemical noise due to solvent peaks, all singly charged ions below 600 Da are removed from the list. To be conservative, higher charge-state ions (+4 or +5) that do not have lower charge-state ions of the same mass are also removed. In the second step, the retention time drift is first corrected by using the retention time of the internal standard peptides, followed by a refinement procedure to maximize the number of matches, that is, peptides with similar masses and retention times. The refinement procedure corrects non-linear shift in the chromatograms by using robust spline fitting. Dividing the individual intensities with the average intensity of the LC-MS run normalized the peptide intensities. Peak picking, LC alignment and intensity normalization procedures were verified by manually checking the raw data, including, in several cases, the internal standard peptides. Arrays with normalized mass, retention time and intensities for each peptide in the 18 LC-MS runs (nine repeats for each cell line) were constructed. An array was constructed for each of the 10 salt fractions. Peptides having similar mass (±0.1 Da) and retention time (±40 s) were considered to be the same peptide, and clustered with average linkage clustering. Their normalized intensities form a row in the peptide array, while each column in the array represents an individual run.

In order to quantify the reliability of our approach, we employed a random effects ANOVA model. Based on this model, we computed the ratio of the variation due to differences between cell lines for a given peptide ion to the total variation. We refer to this variation as the percentage biologic variation.18 A peptide ion with ≥90% biologic variation has high reliability in differentiating the two cell lines. These data are presented in a second set of arrays for each salt fraction and combined into a single array. The statistical array was used to identify those peptide ions that should be studied further.

Results

Our long-term goal is to identify clinically useful biomarkers using a systems approach that will predict tumor behavior or response to therapy in patients with HNSCC. The TRIzol extraction protocol permits the isolation of DNA, RNA and protein from the same sample, which facilitates the comparison of proteomics results with DNA and RNA analyses. Proteins from normal human squamous oral mucosa and HNSCC carcinoma were extracted using a standard protocol employing detergent and chaotropic agents, and the TRIzol protocol. Proteins derived from both extraction methods were separated by 2D gel electrophoresis and compared, as shown in Figure 1. The protein patterns for each extraction method are nearly the same for both normal mucosa (Figure 1a and c) and tumor (Figure 1b and d), indicating that TRIzol extraction can be used to generate a global proteomic profile.

Figure 1
figure 1

Two-dimensional gel electrophoresis, comparing proteins extracted by direct solubilization of normal human oral mucosa and HNSCC with proteins extracted during the TRIzol procedure. (a) Normal tissue, standard extraction (detergent/chaotropic salts); (b) HNSCC, standard extraction (detergent/chaotropic salts); (c) normal tissue, TRIzol extraction; (d) HNSCC, TRIzol extraction.

An overview of the analytical method and data analysis developed using TRIzol extracted proteins is given in the flow chart (Figure 2). Due to the insoluble nature of the proteins, samples were dissolved in aqueous TFA and cleaved with CNBr before digestion with multiple enzymes. Data analysis was performed using in-house, publicly available software, to build arrays of each salt fraction with retention time, normalized mass and signal intensity.16, 17

Figure 2
figure 2

Schema showing overview of analytical method and data analysis. (Step 1) FaDu and SCC-25 cells in culture were lysed and their protein extracted using TRIzol, according to the manufacturer's protocol. (Step 2) The resulting protein pellet was dissolved in 70% TFA. (Step 3) Tris(2-carboxyethyl)-phosphine hydrochloride was added to a final concentration of 50 μM, CNBr to a final concentration of 20 mM, and samples were left overnight at room temperature in the dark. After 16 h, CNBr and TFA were evaporated using a gentle stream of nitrogen; proteins were redissolved by adding 0.1 M ammonium bicarbonate, sonication for 30 min and addition of urea to 8 M. pH was checked and adjusted to 8.5, if necessary, using ammonium hydroxide. (Step 4) Endoproteinase Lys-C was added to a concentration of 1.5 ng/μl and sample incubated with shaking, overnight at 37°C. (Step 5) Urea in sample was diluted to 4 M with 0.1 M ammonium bicarbonate, calcium chloride (CaCl2) added to 1 mM and 15 μl washed, immobilized trypsin added. Sample was left overnight at 37°C, with shaking. After 18 h, 0.1 M ammonium bicarbonate was added to further dilute the urea to 2 M, CaCl2 added to 1 mM, an additional 15 μl of trypsin was added, and the sample incubated for a further 4–6 h. (Step 6) Approximately 100 μg peptide digest was loaded onto 50 mg of polysulfoethyl A resin (12 μm, 300 Å, PolyLC) in a spin column and eluted with 200 μl of ammonium acetate, pH 3, stepwise at salt concentrations of 0, 10, 20, 30, 40, 60, 100, 200, 300 and 600 mM, in 0.1% formic acid, 25% acetonitrile. An aliquot of each fraction was diluted, so that the acetonitrile concentration was 10%. A mixture of four synthetic peptides (200 fmol each) with different masses and retention times was added to every fraction before injection. (Step 7) Each fraction was separated by reversed-phase chromatography using an LC Packings HPLC and analyzed using a QStar Pulsar i mass spectrometer (AB/Sciex, Foster City, CA, USA). A C18 Pepmap100 column (3 μm, 100 Å, 300μm × 15 cm, Dionex, Sunnyvale, CA, USA) at a flow rate of 4 μl/min, with 2% acetonitrile/0.1% formic acid (solvent A) and 80% acetonitrile/0.1% formic acid (solvent B). After desalting for 30 min, the peptides were separated using a 5–55% B gradient over 50 min. (Steps 8–9) Arrays for each salt cut containing HPLC retention time, monoisotopic mass and signal intensity were built after noise removal, deisotoping, retention time alignment and intensity normalization of the raw data.

Replicate Analysis of Peptide Mixtures

After optimizing the offline SCX and LC-MS procedure, two cell lines (SCC-25 and FaDu) derived from human HNSCC cell carcinoma were compared. To assess variation in sample processing and instrument variation, as compared to biologic variation, three replicates of peptide mixtures were prepared from each cell line and each replicate was analyzed by LC-MS in triplicate, yielding nine data sets for each cell line.

A range of 4000 to 9000 peptide ions were identified in each LC-MS run as follows: 0 mM salt cut=9574; 10 mM salt cut=7021; 20 mM salt cut=4026; 30 mM salt cut=4639; 40 mM salt cut=5337; 60 mM salt cut=5247; 100 mM salt cut=5264; 200 mM salt cut=6685; 300 mM salt cut=5593; 600 mM salt cut=lack of useful peaks; total=53 386. The base peak chromatograms and mass spectra produced from analyses of two replicates of the 60 mM salt fraction of the SCC-25 cell line, carried out 2 days apart are, shown in Figure 3. The base peak chromatograms of the two runs shown in Figure 3a and c provide a graphic representation of reproducibility. The accumulated mass spectrum obtained between 65.9 and 66.3 min in Figure 3a is shown in Figure 3b. Nearly 20 peaks are observed in the mass spectrum. The same set of peaks was observed between 66.3 and 66.7 min of Figure 3c, in the replicate LC-MS run (Figure 3d). The spectra are reproducible, but with a 24 s shift between these two runs.

Figure 3
figure 3

Reproducibility of LC-MS analysis. (a) Base peak chromatogram (50–80 min) of the 60 mM salt fraction from the second extract of the cell line SCC-25. (b) Mass spectrum between 65.9–66.3 min. Nearly 20 peaks are observed in the mass spectrum. (c) Base peak chromatogram of the same sample 2 days later. (d) Mass spectrum between 66.3–66.7 min. The same set of peaks is observed in the mass spectrum, showing that the LC-MS analysis is reproducible.

The shift in retention time is corrected in the alignment step of the data analysis program, using internal standard peptides. All the LC-MS runs were aligned using the SCC-25 first extraction data as reference. Dividing the peptide intensities with average intensity of the LC-MS run, normalized the intensities. Average intensities in the LC-MS runs were 80–100 counts. Retention times in different runs were first aligned by using internal marker peptides, then by fitting a smooth curve over a large number of potential matches, that is, peptide pairs from different runs. The alignment curve between two of the replicate runs is shown in the scatter plot in Figure 4. Black circles are matched peptide pairs with similar mass and slightly different retention times. The line is the fitted alignment curve, showing excellent run to run correlation (r=0.997).

Figure 4
figure 4

Retention time correlation across different LC-MS runs. The data from SCC-25, extract 1, replicate 1 is compared to the data from SCC-25, extract 3, replicate 2. The line is the fitted RT alignment curve with R statistical package (r=0.997).

Determination of Biological Differences that Exceed Technical Differences

Comprehensive arrays corresponding to each salt cut were constructed including retention time, mass and signal intensity of all peptide ions from 180 LC-MS runs. Each salt fraction was analyzed individually for the purpose of comparing the two cell lines. These arrays were then subjected to statistical analysis, generating a new set of arrays for each salt cut that included only those peptide signals with ≥90% biological variation. The 300 and 600 mM fractions did not yield biologically significant peptide ions. From the remaining eight ion-exchange fractions, 90 peptide ions were identified that differentiated the two cell lines; an array for the peptide ions differentiating the two cell lines is given in Table 1. Figure 5 shows a mass spectrum of the signal for one of the differentiated ions, m/z 714.82, which is set in italic in Table 1 (peptide mass 1427.77 Da). This biologically significant peptide ion is present in SCC-25 cells (Figure 5a, arrow) and absent at the appropriate m/z in the FaDu cell line (Figure 5b).

Table 1 Statistically generated array of normalized intensity values obtained from the salt fractions showing ≥90% biological differences between the cell lines
Figure 5
figure 5

Biological differences between cell lines. Spectrum shown on the left is from one of the nine replicates of cell line SCC-25, and spectrum on the right is from one of the nine replicates of cell line FaDu. Both spectra are from LC-MS runs of the 60 mM salt fraction. The doubly charged ion at m/z 513.2 and the singly charged ion at m/z 831.4 arise from such proteins, which are present in both cell lines. The doubly charged ion at m/z 714.8 is present in all the nine replicates of SCC-25, while it is absent in all nine replicates of FaDu.

Identification of Two Biologically Differentiated Peptide Ions

To confirm that differences revealed in the statistical arrays represent differential protein expression in the two cell lines, preliminary tandem mass spectrometry experiments were performed. The MS/MS spectrum of one differentially expressed peptide from the above array (Figure 5; Table 1) is represented graphically in Figure 6a. This peptide, SLYASSPGGVYATR from the protein vimentin, is expressed in the SCC-25 cell line but undetectable in the FaDu cell line (Table 1). The monoisotopic mass of this peptide is 1427.77 Da, with an intensity of 3.0±0.6 counts (normalized average intensity±s.d., n=9) and a retention time of 59.7 min. Five additional peptides were identified from this protein. Another protein that found to be more highly expressed in the SCC-25 is annexin II, whose MS/MS spectrum is shown in Figure 6b. The monoisotopic mass of this peptide, DALNIETAIK, is 1086.57 Da (Table 1), retention time 62.6 min, with an intensity of 2.3±0.2 counts for SCC-25 and 0.2±0.2 counts for FaDu (normalized average intensity±s.d., n=9). Three additional peptides were identified from this protein. Orthogonal validation of these expression differences was further provided by Western blot analysis with antibodies specific to these proteins (Figure 7). The relative difference in protein expression was found for both peptides, when the results from LC-MS and Western blot analysis were compared. Vimentin was expressed in SCC-25 but undetectable in FaDu, using either method, while annexin II showed a 12-fold difference by LC-MS and a six-fold difference by Western blot analysis.

Figure 6
figure 6

Product ion (MS/MS) spectra of the doubly charged precursor ions m/z 714.85 (a; shown in Figure 4 and Table 1) and m/z 544.30 (b) obtained from the 60 and 0 mM salt fractions of SCC-25, respectively. The MS/MS spectrum of m/z 714.85 (a) matched the peptide SLYASSPGGVYATR of human vimentin (residues 51–6; accession number CAA79613; peptide mass 1427.7 Da). The MS/MS spectrum of m/z 544.3 (b) matched the peptide DALNIETAIK of human annexin A2 (residues 38–47; accession number AAAH09561, one of many isoforms; peptide mass 1086.57 Da). The matched b and y ions are shown in both MS/MS spectra. MS/MS was obtained on a linear ion trap mass spectrometer (LTQ, Thermo Scientific).

Figure 7
figure 7

Western blots of annexin/vimentin. Equal amounts of FaDu and SCC-25 cell lysates were separated on 12% SDS polyacrylamide mini-gels and transferred to PVDF membranes. Western blots were blocked in 1 × TBS/T (20 mM Tris, pH 7.6, 137 mM NaCl, 0.1%Tween 20, containing 5% non-fat dry milk), then incubated for 1 h at room temperature in blocking solution containing mouse monoclonal anti-annexin II at a dilution of 1:5000. After washing in 1 × TBS/T, blots were incubated for 1 h at room temperature in blocking solution containing goat anti-mouse horseradish peroxidase-conjugated secondary antibody at 1:5000. Bands were visualized after further washes using Pierce's ECL Western blotting substrate. Protocol for vimentin followed the same outline; mouse monoclonal anti-vimentin antibody (BD Biosciences) was used at a dilution of 1:2500.

Discordant RNA and Protein Expression

Our long-term goal is to integrate data from multiple data sets (genomic and proteomic), to identify the most reliable prognostic indicators for HNSCC. RNA expression profiling of SCC-25 and FaDu was performed using our custom cDNA arrays carrying 28 000 cDNAs.19, 20 Using high stringency (five-fold differential gene expression) we found 290 SCC-25-specific RNAs and 113 FaDu-specific RNAs. LC-MS/MS analysis identified one of the SCC-25-specific peptides as keratin (KRT6E), which was also identified by RNA expression profiling (data not shown). In contrast, vimentin RNA expression was increased slightly in FaDu cells (1.3 to 1.6-fold), but vimentin protein was expressed only in SCC-25 cells, exemplifying discordant expression of RNA and protein. There are many possible mechanisms for the discordance and some may be responsible for pathology, indicating the advantage of measuring both RNA and protein expression in trying to predict tumor behavior. Thus, the ability to integrate genomic and proteomic data will strengthen our ability to dissect pathologic processes and to identify prognostic biomarkers.

Discussion

Treatment selection for patients with squamous cell carcinoma of the head and neck is guided by clinical examination, imaging studies and histopathology.21 There are no biomarkers in routine use that provide additional information on tumor behavior or predict response to therapy.7, 8 Our long-term goal is to identify biomarkers that can be used to guide initial treatment selection. Our approach is multifaceted, analyzing DNA methylation,22 RNA expression11 and in situ protein expression23 in HNSCC, in relation to clinical outcome. We predict that integration of multiple data sets will be more successful than a single analytic approach. This study was undertaken to develop a method to analyze the proteome from the same tissue sample used for high-throughput DNA methylation and RNA expression, which would permit integration of data sets with clinical outcome.

Many methods are available and have been used to study the proteome of HNSCC.24 Usage of precious human tissue for proteomics demands the usage of reliable methods. There has been significant commentary on biomarker development, emphasizing the need for validated, reproducible protocols.25, 26, 27 Some methods provide a selective analysis of the proteome, while others provide a more global approach. While no single method can assess every protein in a tissue, each experimental approach must be tested for reproducibility, sensitivity, dynamic range of detection and quantitative reliability.

Proteomic Approaches to HNSCC Biomarker Discovery

Surface-enhanced laser desorption ionization-time of flight mass spectrometry (SELDI-TOF-MS) is a profiling method that has been extensively used in cancer biomarker studies.28, 29 It is becoming increasingly recognized that reproducibility and validation of these biomarkers should be addressed carefully.30, 31 In an effort to correlate RNA expression with protein expression, Bosch and co-workers analyzed HNSCC, as compared to normal esophageal mucosa, using cDNA microarrays, quantitative RT-PCR and a variety of MS methods, including, primarily, SELDI-TOF-MS.32 Because of the multiple approaches used, they were able to identify calgranulins A and B and annexin I and II as proteins that were downregulated in HNSCC tumor tissue. It is interesting that the calgranulins are expressed in stratum spinosum of normal epithelium; if HNSCC represents an undifferentiated cell, perhaps the calgranulins are not down-regulated but simply not expressed because the differentiation program has not been initiated. In a similar study comparing normal oral mucosa to HNSCC in five patients, Baker et al,33 using laser-capture microdissection, followed by LC-MS/MS, identified several proteins that were differentially expressed but not correlated with clinical outcome. Roesch-Ely et al,34 using SELDI-TOF-MS, showed that features of the proteomic profile of HNSCC can exist in histologically normal tumor-adjacent and tumor-distant mucosa. Expression of the HNSCC protein profile in tumor-distant mucosa correlated with local regional recurrence, suggesting a prognostic role for this profile in predicting recurrent disease.

LC-MS for Detecting Biologically Significant Peptide Ions

LC-MS is an alternative method used for quantitative profiling.35, 36, 37, 38, 39 In quantitative profiling, signal intensity is compared across many LC-MS spectra. Reproducibility in sample processing and analysis is the key factor in determining the reliability of results in these studies. Few reports have tested the reproducibility in LC-MS-based analysis.40, 41 Sample preparation and separation, the resolution capability of the mass spectrometer and the data analysis highly influence the outcome of studies. Stewart and co-workers have systematically studied the technical variation in LC-MS-based analysis, using two breast cancer cell line samples. They could not identify proteins with significant differential expression in the two cell lines. In their study, the differences between the cell lines were less than the technical variation in the method.41

Gaspari et al40 have studied the reproducibility using secreted proteins from culture media of human U937 macrophages, under three different culture conditions. Macrophages were exposed to lipopolysaccharide, in order to generate an inflammatory reaction; 2D LC-MS was then used to identify differential protein expression in the absence or presence of β2-adrenergic receptor inhibitors. They differentiated the following two sets of samples, namely the inflammation state and inhibition of inflammation, based on principle component analysis. Although they could differentiate the two states, they have reported that the poor resolution of mass spectrometer used in their study and the limitations in their data analysis have increased the technical variation. These studies show that there is still a need for validating label-free LC-MS-based methods and data analysis programs used for biomarker studies.

In the present study, quantitative profiling was used to study the differential expression of proteins in two human head and neck cancer cell line samples. The method employs offline ion-exchange chromatography and LC-MS to identify differential candidates, based on their normalized salt cut fraction, retention time, mass, and signal intensity. Differential candidates can be identified subsequently by targeted analysis, using MS/MS, thus eliminating the variation due to precursor ion selection for MS/MS. Since signal intensities are compared across mass spectra acquired on several days, quality of the data highly influences the outcome of the analysis. We enhanced reproducibility in two ways. A reference standard peptide mixture was analyzed by LC-MS between samples to ensure optimal instrument performance. Internal standards spiked into each ion-exchange fraction provided points of reference for retention time.

Because our purpose was to develop a rugged and reproducible method, identification of peptide ions with a high reliability coefficient for distinguishing biologically different cells or tissue is more important initially than identification of the peptide itself. Once biologically significant peptide ions have been identified, their identity can easily be obtained, as shown for vimentin and annexin II in Figure 6.

The goal of this study was to develop a reliable mass spectrometry method with high resolution and sensitivity, for cancer biomarker discovery in HNSCC. To be able to compare RNA expression, DNA methylation and protein expression in the same tumor sample, an experimental protocol was developed to isolate and analyze the protein fraction after extraction of RNA. Before embarking on analysis of patient tumor samples, testing and validation of the method were carried out with two HNSCC cell lines to provide a statistical basis for the reproducibility of the combined steps of protein preparation and analysis, and to demonstrate that biologic variation can be discriminated from technical variation. Our results show that the method presented is highly reproducible (ie, with a reliability coefficient >90%), such that combined technical variation due to extraction, digestion, separation and LC-MS analysis is less than the biological variation between the two cell lines. Future clinical studies will integrate the findings from our proteomic analysis with genomic analyses from the same patient tissue, to develop diagnostic biomarkers that will guide treatment selection at initial diagnosis for patients with HNSCC.