Main

Diffuse large-B-cell-lymphoma (DLBCL) is the most common type of aggressive lymphoma, accounting for nearly 40% of lymphomas overall. The clinical International Prognostic Index (IPI) score for DLBCL helps to stratify patients into risk categories based on clinical features.1 However, even within risk categories, patient outcome is variable. Because of the variable patient outcome, it has long been suspected that DLBCL may actually encompass more than one disease. In recent years, the results of extensive gene expression profiling (GEP) experiments have confirmed that DLBCL is not a uniform disease entity and that differences in GEP-defined subtypes can be related to patient survival.2, 3, 4 GEP has yielded four different sets of prognostic genes based on the work of four different research groups.

The first group, known as the Leukemia and Lymphoma Molecular Profiling Project (LLMPP), studied 240 DLBCL using the LymphoChip. This chip was a spotted oligonucleotide microarray containing numerous elements known to represent genes expressed by B- or T-lymphoid cells, genes involved in the immune response, and genes expressed by lymphoma and leukemia cell lines.5 The investigators discovered four gene expression signatures highly correlated to patient survival called ‘germinal center,’ ‘major histocompatibility (MHC) class II,’ ‘lymph node,’ and ‘proliferation.’ Representative genes from these four signatures with the addition of the gene BMP6 were used to create a 17-gene outcome predictor score, which provided additional prognostic value independent of the clinical IPI score.2 Another group used the Affymetrix high-density oligonucleotide array platform and a supervised learning classification approach to develop a DLBCL outcome predictor score with best accuracy obtained using a set of 13 genes.3 A third group of investigators performed a meta-analysis of the literature for genes that were previously reported to be significantly associated with survival in DLBCL. Using quantitative real-time PCR (qRT-PCR), these researchers assessed a series of 66 DLBCL and determined the six most predictive genes (LM02, BCL6, FN1, CCND2, SCYA3, and BCL2).4 Only two genes were common to both the LymphoChip and qRT-PCR studies (BCL6 and FN1). Thus, a total of 34 genes were previously identified in these studies as being significantly associated with patient outcome in DLBCL. Through our own analysis of the LymphoChip data set, we had previously identified several redox-related genes highly correlated with patient survival and had created a ‘redox score,’ including the representative genes, manganese superoxide dismutase (SOD2) and catalase (CAT).6 All four of these studies used quantification of a small key set of genes that correlate with survival in DLBCL in their particular set of cases. All used techniques that relied on snap-frozen materials as the basis of analysis, making comparisons between the gene sets or expansion to wider groups of patients difficult. Thus, development of a method that is applicable to paraffin-embedded samples might be helpful in further exploring the prognostic utility of these and other, yet to be identified, gene sets.

Quantitative nuclease protection assays hold the promise of analyzing mRNA without the difficult steps of mRNA extraction followed by qRT-PCR and therefore may be applicable to formalin-fixed, paraffin-embedded (FFPE) tissues. We chose to use the ArrayPlate assay from High Throughput Genomics Inc. (HTG, Tucson, AZ, USA) that has been described previously for pharmaceutical discovery and development applications.7 In this assay, thin tissue sections are treated with a permeabilizing/lysing buffer (HTG lysis buffer) and can be tested immediately or frozen for later analysis. Oligonucleotide probes designed for the genes of interest are incubated with the lysate to allow hybridization of specific DNA probe–mRNA heteroduplexes. Unhybridized probes and RNA are digested by a nuclease specific for single-stranded nucleic acids. Alkaline hydrolysis is then used to destroy the mRNA component from the heteroduplexes, leaving intact probes in concentrations proportional to the amounts of specific mRNA originally present. These remaining probes are then transferred to an ArrayPlate, which uses an oligonucleotide array and a sandwich hybridization linker to capture and label the probes for chemiluminescent detection and quantification. Originally designed for high-throughput screening, the assay has the potential to handle large sample numbers using small amounts of variously preserved tissue.

Using this technique, we developed an assay for all 36 genes of interest as well as housekeeping genes and genes aimed at determining the cellular composition of the sample (B cell, T cell, and macrophage genes). We performed several studies to test and validate the performance of the assay in both frozen and paraffin-embedded materials. We then turned to a subset of paraffin blocks from DLBCL cases for which matching frozen tissue had been previously analyzed by GEP, described in the study by Rosenwald et al to determine whether the ArrayPlate technique could similarly quantify the prognostically significant genes and whether this would correlate with the microarray data. The overall goal was to develop and validate an assay that would be useable for outcome prediction in all patients with DLBCL, not limited to the minority with available snap-frozen tissue. In this study, we describe our technique, compare fresh and frozen samples to paraffin samples, compare new paraffin blocks to nearly 20-year-old paraffin blocks, and compare the relationship between the results and microarray results as well as patient hazard ratio of death in a previously published set of 40 patients. These studies demonstrate the usefulness of this technique especially as applied to archived materials with potential consequent far-reaching applications for the field.

MATERIALS AND METHODS

Cell Line and Patient Materials, Preparation of FFPE and Frozen Blocks

To validate the design of the ArrayPlate, we began by using the RJ2.2.5 (RJ) cell line,8 which is an MHC II-negative non-Hodgkin's lymphoma cell line. We also used a recently created RJ2.2.5 transfectant, which includes a CIITA expression vector that induces expression of MHC class II genes, including HLA-DRA, -DRB, -DPA, and -DQA.9 The cells were grown under 5% CO2 in RPMI media (with 10% fetal bovine serum, 1 mM sodium pyruvate, 2 mM L-glutamine, 100 U/ml penicillin, and 100 μg/ml streptomycin). Cells were initially seeded at 1 × 105 cells/ml, allowed to grow for 24 h, then harvested. Sixteen million cells were centrifuged to create cell pellets that were either (1) made into FFPE material by fixing in formalin for 4 h, routine overnight tissue processing on a Tissue-Tek (Sakura, McGaw Park, IL, USA) instrument, and paraffin embedding; (2) made into snap-frozen material by snap freezing in embedding media (optimal cutting temperature compound (OCT), Sakura Finetechnical Co., Torrance, CA, USA) in liquid nitrogen quenched in isopentane; or (3) put into HTG lysis buffer (HTG) at 4 million cells/ml for fresh samples.

Tissue used for this study came from retrospective samples of excess diagnostic tissue under an institutional review board-approved protocol. FFPE tissue used for testing the performance of the ArrayPlate included three benign lymph nodes with reactive follicular hyperplasia and two DLBCL chosen from our hospital files for having both multiple snap-frozen and paraffin blocks. FFPE blocks were from the routine surgical pathology archives and had been fixed in 10% neutral-buffered formalin and stored at room temperature. In addition, the two malignant samples consisted almost entirely of tumor. Samples were from incisional biopsies, which had been fixed in approximately 5 mm3 portions. A new paraffin block was created from a previously snap-frozen block of benign lymphoid tissue from 1989 by thawing to approximately −20°C, then immediately fixing in 10% formalin as described above and processing for ArrayPlate analysis as described below.

To investigate whether the ArrayPlate results on FFPE samples were similar to GEP discovery results from snap-frozen tissue, we used FFPE samples, which had matching frozen tissue already analyzed by GEP on the LymphoChip and Affymetrix microarrays. We previously contributed 45 snap-frozen samples to the LLMPP study of DLBCL. Of these, 24 were included in the LLMPP study by Rosenwald et al, describing the clinical utility of an outcome predictor score in reflecting patient survival in de novo DLBCL, and paraffin blocks were available for 18 samples. We also had paraffin blocks on 22 cases profiled by the LLMPP but not included in the de novo DLBCL composed of 10 transformed and 12 relapsed DLBCL cases. These blocks ranged from 6 to 24 years of age, with an average of 14.8 years. Ten blocks were greater than 20 years.

Sample Preparation for the ArrayPlate Assay

Fresh cell pellets were put into HTG lysis buffer (4 million cells/ml), vortexed briefly, heated at 95°C for 10 min, and then frozen at −70°C until analysis. Three frozen or FFPE sections were cut at 5-μm thickness and immediately placed into 75 μl HTG lysis buffer (25 μl/section), vortexed briefly, heated at 95°C for 10 min, revortexed briefly, and then frozen at −70°C until analysis. One cut of the 5-μm-thick tissue was used per well on the ArrayPlate, but the same HTG lysis buffer containing multiple sections was used for the three ArrayPlate wells required to measure all the genes in the assay, or for multiple replicates of the same wells in the validation assays. It was important to cut the paraffin blocks into thin sections, as FFPE tissue not thinly cut gave poor results. Tissue quantities for the dilutional experiments were measured by comparing the weight of the microcentrifuge tubes with and without tissue sections from which most paraffin been removed.

ArrayPlate Assay

The ArrayPlate assay has been described previously.7 After cells or tissue had been lysed, denatured, and permeabilized by heating in HTG lysis buffer as described above, the samples were frozen and sent to HTG for analysis. At HTG, the 50-mer probes specific for the genes of interest were incubated with the samples, forming specific probe–mRNA duplexes, and then unhybridized probes were digested by S1 nuclease. Next, alkaline hydrolysis destroyed the mRNA in the duplexes, leaving intact probes with stoichiometric concentrations proportional to the amounts of specific mRNA originally present. After neutralization, samples were transferred to ArrayPlates for probe detection. The ArrayPlates contained a universal array of 16 unique, covalently bound, 25-mer ‘anchor’ oligonucleotides spotted in a 4 × 4 grid on the bottom of each well. This universal array was modified to bind 50-mer probes for the genes of interest at preselected positions by exposing the array to a mixture of 50-mer programming linker oligonucleotides that contained a 25-mer sequence to bind one of the probes at one end, and a 25-mer sequence to bind one of the anchor oligonucleotides on the other end. Three different mixtures of programming linker oligonucleotides distributed across three ArrayPlate wells were required to measure all the genes of interest in our assay.

After hybridization, probes from the sample were bound to array elements by the programming linker oligonucleotides. A mixture of detection linker oligonucleotides was added. The 50-mer detection linkers contained a 25-mer sequence that bound sample probe on the end not bound by the programming linker probe on one end, and a common 25-mer sequence to bind a detection probe on the other. Detection probe was added, which bound to all the detection linkers. The detection probe contained bound horseradish peroxidase. Upon the addition of chemiluminescent peroxidase substrate (Lumigen PS-atto, Lumigen Inc., Southfield, MI, USA), each array element gave off light proportional to the amount of sample probe bound at that position. Figure 1 shows a schematic of the assay.

Figure 1
figure 1

Diagram of ArrayPlate assay. HTG prints a Universal Array in every well of a 96-well plate with 16 different anchor oligonucleotides. A mixture of custom programming linkers is added, each designed to hybridize to a specific anchor and capture a gene-specific nuclease protection probe (shown for one linker). The array now captures probes for a custom set of genes. HTG lysis buffer lyses and/or permeabilizes sample, and then nuclease protection probes are added. The probes hybridize to all mRNA, soluble and cross-linked. S1 nuclease destroys nonspecific RNA producing a stoichiometric amount of target-mRNA/probe duplexes. Base hydrolysis releases probe from duplexes. Probes are transferred to a programmed ArrayPlate well, detection linker added, and both probes and detection linkers captured onto the array. The ArrayPlate is washed, HRP-labeled detection probe added, incubated, washed, then chemiluminescent substrate is added and the ArrayPlate imaged to measure the expression of every gene in all wells.

The signals for all 1536 elements in an ArrayPlate were recorded simultaneously by imaging the plate from the bottom with a CCD-based Omix Imager (HTG). Images were analyzed using Vuescript software (HTG), which calculated average pixel intensity for each element to determine expression levels for each gene. Expression levels were normalized to the housekeeping gene TBP for which the signal level was arbitrarily set at 1000, except for the linearity experiments, where we used a weighted normalization of all genes to allow for titration.

For some array elements, signals were commonly too strong relative to other elements in the well. For these, signal was decreased by adding a 25-mer ‘attenuation fragment’ oligonucleotide that competed with one of the detection linkers for binding to sample probe, but that could not bind peroxidase-labeled detection probe. For rare mRNA transcripts, assay signal was increased by targeting the message with several non-overlapping probes and subsequently binding these non-overlapping probes at the same position within the array by several distinct programming linkers. The same amounts of attenuating or amplifying oligos were always added, so expression levels were calculated for these array elements in the same way as for non-altered elements.

Selection of Genes and Probes for Programming of ArrayPlate Assay

As described at the beginning of this article, we used the key genes identified as prognostically important in four previous studies in DLBCL, which accounted for 36 genes of interest.2, 3, 4, 6 Because of the heterogeneity of cellular composition in human tumor samples, we also included probes designed to test the tumor composition for B cells (CD19, CD20), T cells (CD3), and histiocytes (CD68). Two housekeeping genes, TBP and PRKG1, were chosen based on previously published work assessing the utility of different endogenously expressed genes as housekeeping genes, which identified these two genes as stably expressed at moderate or low levels in different types of lymphomas by qRT-PCR.10 These two housekeeping genes were repeated at diagonal corners in each of the three wells used to create the assay. An oligo dT probe was added to assess the quantity of mRNA in the sample (since an oligo dT probe should detect all mRNA which has a poly-A tail). However, for technical reasons owing to the stringency of the assay, this probe was non-functional and not further utilized. A probe for cytochrome oxidase was also initially included because it is coded in mitochondrial DNA, and should be expressed at high levels. This turned out to bind both DNA and RNA, and so gave an extremely bright and generally oversaturated signal and was therefore not further considered, except that it could be used to distinguish whether there was insufficient material for the assay, or whether, if it had disappeared entirely, the sample was too degraded for use.

For each of the 44 genes of interest, four specific probes were designed, although not all were synthesized. ArrayBuilder 2.0 software (HTG) was used to design the oligonucleotides required for the assay to measure target transcripts in groups of 16. Briefly, with the user providing the accession numbers for the target genes and assigning their position in the array, the software retrieved each mRNA sequence from GenBank and ranked successive 50-mer stretches of the target gene sequences according to the melting temperature (Tm) of their 5′ and 3′ constituent 25-mers, giving preference to those 50-mers for which the Tm of each of the two 25-mer halves was nearest to 68°C. The four highest ranked and non-overlapping 50-mer sequences for each of the 16 target mRNA species were subjected to BLAST to identify homologous sequences. Sequences with homology to other genes were rejected and replaced with the next highest-ranking 50-mer sequence that was in turn submitted to BLAST. Sequences without significant homology were retained. Typically, 3–4 iterations were required to yield four suitable target 50-mer sequences within each of the 16 genes. The software then created output files containing the sequences of the four oligonucleotides (programming linker, protection probe, detection linker, and attenuation fragment) required to measure a given 50-mer target in the assay.

For four genes, the signal had to be lowered with undetectable competing oligonucleotide. For nine genes, the signal had to be increased, as shown in Figure 2. Between the two housekeeping genes, PRKG1 and TBP, TBP had stronger and substantially more uniform luminescence (data not shown). PKRG1 expression was variable and expression was low. Results were therefore normalized only to TBP set at 1000. Only one of the probes, for HLA-DRB, had to be redesigned, because the original single GenBank sequence did not identify the common polymorphisms in this highly polymorphic gene.

Figure 2
figure 2

Setup of custom ArrayPlate for prognostic genes in DLBCL. The schematic of the arrays is above with a representative luminescence image from that array below. Genes circled in dashed lines had their signal attenuated with an undetectable competing oligonucleotide. Genes circled in black had their signal enhanced with multiple probes. Genes in gray circles were designed as housekeeping genes. Each array is one of three wells in 96-well plates with 16 locations for an anchor probe, which binds to a custom linker probe, which in turn binds to the specific probe (Figure 1). The putative housekeeping genes are repeated in each well.

Table 1 lists the names of the genes of interest from the original papers, references, gene symbols for the plate, position at which probes begin for that gene, and the sequence of the designed probes. Figure 2 shows the DLBCL assay design, including which signals were reduced (attenuated) or increased (amplified) (described above). Test samples were assayed in replicates, as indicated in figure legends, to provide measures of reproducibility, but because of the low variability found between replicates, one replicate per well of the assay was considered to be sufficient for our analysis of tumor samples.

Table 1 Gene list for DLBCL ArrayPlate

Array Validation Experiments

Unlike fresh or frozen samples, when FFPE sections were treated with HTG lysis buffer, they tended not to dissolve completely. Even though there was remaining visible FFPE tissue in many cases, the assay performed similarly to when a similar weight of fresh or frozen sample that dissolved completely was used. To better understand this phenomenon, we treated a benign FFPE sample and a comparable frozen sample in HTG lysis buffer. The FFPE sample was centrifuged, and the pellet and supernatant were separated. The pellet was resuspended in an equal volume of HTG buffer, and all three samples assayed on the ArrayPlate.

Because the gene probes are not designed across exon boundaries, we wanted to be sure that the ArrayPlate measured concentrations of mRNA and not DNA. We therefore took benign tissue FFPE samples treated with HTG lysis buffer, and then the sample was either treated with RNase, DNase, or was left untreated and assayed on the ArrayPlate.

Assays were also performed with replicates and with dilutions to demonstrate the linearity and reproducibility of the assay with different sample types, including fresh, frozen, FFPE archived and recently made samples. Limited quantities of patient tissue forced us to limit some assays to only a single of the three arrays.

Immunohistochemistry

Immunohistochemistry (IHC) was performed on three samples to detect protein products for genes present on the array for which we had clinical IHC assays routinely performed in paraffin in our hospital. These included CD20, CD3, CD68 (data not shown for these first three), BCL2, BCL6, and HLA-DR. All staining was performed on the Ventana Benchmark XT instrument with Ventana I-View detection (Ventana Medical Systems Incorporated (VMSI), Tucson, AZ, USA). Our standard clinical laboratory staining procedures with on-instrument antigen retrieval were employed. Monoclonal antibodies were used against CD20 (VMSI, clone L26), CD3 (VMSI, clone PS1), CD68 (VMSI, clone KP1), BCL2 (VMSI, clone B4-2/100/D5), BCL6 (CNIO, Madrid, Spain, clone GI191E/A8), and HLA-DR (Biogenix, Bogota, Columbia, clone LN3). Photography was performed on a Labophot-2 microscope using a × 10 eyepiece and a × 40/0.65 objective lens (Nikon, Melville, NY, USA). A SPOT-RT 2.2.0 color camera and SPOT Advanced 4.0.9 software (Diagnostic Instruments, Sterling Heights, MI, USA) were used to capture and digitally acquire images, which were then inserted into PowerPoint 10 (Microsoft, Redmond, WA, USA) and cropped for publication.

Statistical Methods

Simple statistical calculations such as averages, s.d., linear correlations were performed in Excel 10 (Microsoft). Statistical analyses were performed on the 18 de novo untreated and 21 non-de novo relapsed or transformed DLBCL cases with results from both ArrayPlate and Affymetrix/LymphoChip analysis of gene expression. Analyses of hazard ratios and survival were performed on all samples, because of the small sample size, unlike in the study by Rosenwald et al,2 which used only de novo untreated samples. Spearman rank-order correlations for each pairwise comparison of three studied methods were derived for each gene. For a given comparison, analyses with fewer than 10 available observations were not performed due to power considerations. The median overall correlation between available genes for each pairwise comparison was computed; only genes with available data for all three pairwise comparisons were included to avoid bias. Univariate analysis results (hazard ratios, 95% confidence intervals, and P-values) of gene expression as measured by ArrayPlate on overall survival were obtained from the Cox regression model.11 For each method, the genes were defined as either positively or negatively associated with outcome based on univariate Cox model estimates. Two-way tables, representing the level of agreement, were calculated between the LymphoChip vs the ArrayPlate, Affymetrix vs the ArrayPlate, and the LymphoChip vs Affymetrix. The kappa statistic,12 which adjusts for chance agreement, was used to statistically summarize the level of agreement between each method. Given that genes were not independent, confidence intervals for the kappa statistics were calculated using bootstrap resampling.13

RESULTS

ArrayPlate Measures All mRNA and Only mRNA in Samples

To characterize the performance of the assay when FFPE tissues were tested, sections were pretreated with RNase or DNase. As can be seen at the bottom of Figure 3, the assay chemiluminescence was abrogated after RNase but not DNase treatment. Therefore, RNA not DNA was measured in the assay. RNA is difficult to measure in fixed tissue by methods that require extraction, such as qRT-PCR, but relatively easy to measure in situ by methods such as in situ hybridization. RNA is believed to be crosslinked to other tissue molecules by fixative, preserving the structure but making it difficult to remove intact. With extended archiving, the crosslinking may increase. In FFPE samples, the tissue is denatured and permeabilized, but not totally dissolved, as fresh or frozen tissue is, by HTG lysis buffer. Nevertheless, the mRNA profiles measured are similar across samples, regardless of preservation method. In FFPE sections, some mRNA was not released from the tissue, but the probes bound to the mRNA in situ and were protected from nuclease digestion there. The top of Figure 3 shows that the assay signal came mainly from the pelleted material, not the solubilized supernatant. The specimen shown in Figure 3 was an extreme case of fixation. We saw variability across other cases in the relative amounts of mRNA solubilized in HTG lysis buffer to mRNA remaining bound to the tissue section. This finding underscored the importance of measuring both soluble and fixed mRNA to compensate for variations in fixation from one case to another. For this reason, FFPE tissue must be analyzed in thin sections and extra paraffin removed from around the edges of the section, so that the probes are freely diffusible.

Figure 3
figure 3

Array detection of RNA. At top is a demonstration that the nuclease protection assay detects both soluble and fixed RNA from FFPE samples. A highly crosslinked FFPE section and a comparable size frozen section from the same sample were treated with HTG lysis buffer. The FFPE sample was centrifuged, and the pellet and supernatant separated, then pellet resuspended in an equal volume of HTG lysis buffer, and all three samples assayed in an array of a somewhat different customization from the our ArrayPlate. Even in this case where almost all mRNA is fixed in the sample pellet, the assay measures similar amounts to the unfixed sample. At bottom, adding nucleases to samples demonstrates that the nuclease protection assay detects mRNA but not DNA. Sample lysates were either treated with DNase or RNase, or left untreated, before hybridization to an array of a somewhat different customization from the our ArrayPlate. DNase does not change the signal substantially, but RNase abrogates the signal.

ArrayPlate Performs Similarly on Differently Preserved Samples

Figure 4 shows that assay performance on diluted frozen tissue sample replicates was reasonably linear from the detection limits up to about 1 mg of sample/well. Table 2 contains the same data, normalized to the TBP housekeeping gene, demonstrating that concentrations of starting material differing by up to 10-fold gave comparable results, with average %CVs (coefficients of variation) ranging from 15 to 19%.

Figure 4
figure 4

ArrayPlate dilution study. At top is an image of replicates from serial dilutions of case 0, frozen benign hyperplastic lymph node tissue diluted from 1.25 to 0.058 mg/well. Graphs at the bottom show the assay linearity for each array after weighted normalization. Table 2 demonstrates the reproducibility of this experiment.

Table 2 Reproducibility of gene expression in frozen tissue in a dilution study

We then studied the RJ2.2.5 cell line, to obtain sufficient quantities of homogeneous material, unlike biopsy material, which may differ from section to section, and is of limited quantity. Replicate cell pellets, either fresh, frozen in OCT, or fixed and paraffinized, were run in the ArrayPlate assay. Figure 5 shows the linear correlations of assay signals between the differently treated samples, with R2>0.98 for all three. Figure 6 plots the signal intensity for the three treatments ±s.d., showing that RNA expression as measured by the assay was substantially the same for fresh, frozen, or FFPE samples.

Figure 5
figure 5

Comparison replicate cell line pellets after fresh preservation in HTG lysis buffer, snap freezing in OCT, or FFPE. Top graph has FFPE on the x-axis vs fresh on the y-axis, left graph FFPE vs frozen, and right graph fresh vs frozen. Only genes on the ArrayPlate expressed above background in the RJ2.2.5 cell line are shown. Results are nearly identical regardless of whether sample was fresh, frozen, or FFPE.

Figure 6
figure 6

ArrayPlate gene expression measurements by method of preservation in cell lines. Another view of data from Figure 5 showing that gene expression in the RJ2.2.5 cell line does not differ with fresh, frozen, or FFPE cell pellets. Graph shows the average ±s.d. of four replicate wells. Only genes on the ArrayPlate expressed above background in the RJ2.2.5 cell line are shown. Differences between preservation methods are generally within assay variability for samples where sections contain essentially identical material.

Because we wanted to do similar experiments with FFPE tissue samples, we first looked at the differences in gene expression in clinical FFPE tissue samples between sections. Since our sections are 5 μm thick, the three sets of patient sections from the same block are separated by about 10 sections. With the blocks being 5000 μm on a side, the sections from separate blocks can be quite distant. Figure 7 indicates that, while sections have some variability, the relatively small differences show that different sections or even blocks from the same patient have very similar results by ArrayPlate, while preserving the variability between patients.

Figure 7
figure 7

Similarities between non-adjacent FFPE sections from patient biopsies. One tissue section/well was run on array 1. Start section for each sample was the section that first contained significant tissue as the block was cut, mid section was 50–65 μm from first section, and last section was 115–150 μm from the first. Other section was from a different FFPE block of the same patient biopsy. Case 1 at top was an anaplastic DLBCL, case 2 in the middle an immunoblastic DLBCL, and case 4 at bottom a benign lymph node hyperplasia. While sections have some variability, the relatively small differences show that different sections or even blocks from the same patient have very similar GEP by ArrayPlate, while preserving the GEP variability between patients.

Then, on a benign hyperplastic human lymph node biopsy for which we had sufficient material, we compared preservation methods. The new paraffin block was made from a portion of an 18-year-old snap-frozen biopsy, which was quickly defrosted, fixed in formalin, and embedded. The results were compared to a sister paraffin block made from the same biopsy at the time of the patient's surgery 18 years ago. As shown in Figure 8, the correlation between the new vs the old paraffin block from the same biopsy was good (R2=0.71), with no more variability than what is seen in Figure 7, resulting from different sections of the same biopsy. This result indicated the applicability of the assay to very old archival material. ArrayPlate reproducibility with time of assay by method of preservation in tissue sections was also addressed. Frozen, recently FFPE, and archived FFPE tissue from case 0 above was assayed on array 2 on different days 1 week apart. The histogram in Figure 9 shows differences between samples, especially between the recent and archived FFPE samples, is largely within assay variability.

Figure 8
figure 8

Comparison of tissue samples after snap freezing in OCT, archived or recent FFPE. Case 0, a large benign hyperplastic lymph node tissue biopsy from 18 years ago was preserved with both multiple frozen and archived FFPE blocks. A frozen block was recently FFPE. Top graph has recent FFPE on the x-axis vs archived FFPE on the y-axis, left graph recent FFPE vs frozen, and right graph archived FFPE vs frozen. The HLA-DRB gene was not included in the comparisons with frozen samples because the probe design for this gene changed between the frozen and FFPE experiments. Results are similar regardless of whether sample was FFPE either recently or 18 years ago, and reasonably similar if sample was frozen.

Figure 9
figure 9

ArrayPlate reproducibility with time by method of preservation in tissue sections. Frozen, recently FFPE, and archived FFPE tissue from case 0 was assayed on array 2 on different days. Graph shows the average ±s.d. of 2–4 replicate wells. Differences between samples are largely within assay variability.

Comparison of ArrayPlate to Other Expression Measures

The ArrayPlate results accurately reflected that mRNA of HLA-DRA and HLA-DQA increased in the RJ2.2.5 cell line transfectant clone compared to the parent line as demonstrated with qRT-PCR (data not shown). ArrayPlate technology has been previously shown to have a good correlation with qRT-PCR in cell lines.14 Because not all of the genes of interest are expected to be expressed in our B cell line or indeed in any single cell line, it would be difficult to make a uniform, large sample, which would express all of our genes of interest at concentrations relevant to our assay. The samples expressing all the genes are clinical samples containing limited amounts of tissue. Therefore, qRT-PCR verification was not performed for all of our genes of interest. We have performed IHC for verification, and will in the long run validate the technology against the clinical ‘gold standard’ of patient outcome.

IHC on FFPE tissue sections was performed for CD20, CD3, CD68, BCL2, BCL6, and HLA-DR. Protein expression for these tissue sections correlated well with the gene expression results from FFPE sections on the ArrayPlate for all the genes tested. Results for BCL2, BCL6, and HLA-DR are shown in Figure 10.

Figure 10
figure 10

Comparison of results of ArrayPlate and IHC. ArrayPlate gene expression on the left and IHC on the right are shown for three FFPE patient tissue samples. Case 1 was an anaplastic DLBCL, case 2 an immunoblastic DLBCL, and case 3 a benign lymph node hyperplasia. ArrayPlate results correlate well with IHC.

Correlation between the ArrayPlate and Discovery GEP Techniques

We ran 40 cases of archived FFPE DLBCL on the ArrayPlate analysis. Thirty-nine cases were successfully analyzed. One case of transformed DLBCL had no signal at all. In situ hybridization with a poly-T probe demonstrated that the block from this case had no intact mRNA. Results on the remaining 39 cases were compared to previous Affymetrix and LymphoChip GEP results from matched frozen samples using Spearman rank statistics, as shown in Table 3. Since not all the gene results were available on all platforms, there were frequent missing data as indicated. Of 33 genes comparable between the two microarray platforms, 27 (82%) were significantly correlated. Of 32 genes comparable between the ArrayPlate and the LymphoChip, 20 (61%) were significantly correlated, and for the ArrayPlate to Affymetrix correlation there were 37 comparable genes with 19 (51%) significantly correlated. Of the 32 genes comparable between all three technologies, 19 (59%) were significantly correlated between all three. The median correlation for ArrayPlate vs Affymetrix was 0.52, for ArrayPlate vs LymphoChip 0.55, and Affymetrix vs LymphoChip 0.78, as shown in Figure 11. The correlation between microarrays is understandably higher since those two analyses were performed on the same aliquot of RNA derived from a single frozen tissue block, while the ArrayPlate analysis was performed on a different FFPE block from the same specimen. Overall, these are excellent correlations for this type of technology, since correlations between Affymetrix and spotted arrays have been reported to be around 0.4,15, 16 and indicate that relationships between genes and survival that have been detected in discovery experiments could also be seen with the ArrayPlate. The genes were clustered into two groups as either being positively or negatively associated with survival outcome based on Cox regression. The crude agreements between the methods were 79% for the LymphoChip vs the ArrayPlate, 75% for Affymetrix vs the ArrayPlate, and 89% for the LymphoChip vs Affymetrix. The kappa statistics were 0.76, 0.67, and 0.75, respectively. Landis and Koch17 have characterized kappa statistics between 0.40 and 0.60 to represent ‘moderate’ agreement between measurements.

Table 3 Spearman rank correlations by gene
Figure 11
figure 11

Correlations between the ArrayPlate assay and the LymphoChip and Affymetrix microarrays. Histograms show the distribution of correlations for each pair of comparisons across the gene sets, and the median correlation for each is indicated.

Hazard Ratios for Prognostic Genes with ArrayPlate Compared to Discovery GEP Techniques

The results for the 36 previously discovered prognostic genes were then compared to survival for the 39 cases successfully analyzed using the ArrayPlate. We first performed univariate analyses of gene expression levels vs patient survival; however, none of the genes was significantly correlated with survival, which we attributed to the low number of cases, specifically the few de novo previously untreated cases, in this study group (since all of these genes were associated with survival in larger groups of patients). The survival predictors for all the different groups were calculated from only de novo untreated samples, whereas some of our samples were previously treated relapsed or transformed DLBCL. We then calculated the hazard ratios of death for each gene and method. For hazard ratios >1, there is an increased risk of death and for ratios <1, there is a decreased risk of death. These hazard ratios usually trended in the same direction for each gene as they did in the larger data sets from which their significance was derived. Table 4 shows two-way comparisons of agreement (results for hazard ratios above or below 1) between each of the three different quantitative mRNA methods and the directions predicted by individual genes for outcome. For methods, the Affymetrix and Lymphochip microarrays agree 88% in this data set, ArrayPlate agrees with Affymetrix 83%, and with Lymphochip 87%. Agreement with previously predicted outcome is 79% for Affymetrix, 91% for Lymphochip, and 80% for ArrayPlate. The higher agreement of Lymphochip methods with the predictors is expected because the greatest number of prognostic genes (17) was derived from Lymphochip data. Altogether, there are 19 genes concordant and predictive by hazard ratio for all three methods, and four more concordant and predictive for the ArrayPlate and one of the other microarray methods, while not being present on the other microarray. These genes are divided into signatures as follows: 12/17 Rosenwald signature genes, 5/6 Lossos, 6/13 Shipp, 1 Tome, suggesting that the Rosenwald and Lossos signatures are more robust.

Table 4 Agreement between hazard ratios for different technologies and predicted outcomesa

DISCUSSION

The medical importance of developing a prognostic model in DLBCL, which moves beyond the currently used clinical IPI staging system, is obvious given the broad range of outcomes for similarly staged patients. DLBCL prognosis has been the focus of recent intense research and publication in this field. Several groups have investigated and created their own methods of outcome prediction based on the expression of from 6 to 17 genes quantified from samples of snap-frozen tumor biopsies.2, 3, 4, 6 We did not attempt to select new genes of interest. Our goal was to take the best leads from the previously published comprehensive microarray studies and see if the key genes could be quantified in FFPE tissues. The significance of the current work is in the development of an assay that can be used for outcome prediction on routinely fixed, paraffin-embedded tissues, which is the only material available for analysis for many patients.

The ArrayPlate assay developed for this study was extremely reproducible with low %CVs on replicate and repeated assays. The correlation among the various types of preparations, including snap-frozen pellets in OCT, fresh samples, and old and new paraffin blocks, was very good. The ability to obtain similar results whether using frozen or paraffin-embedded materials is particularly notable and most likely results from measuring both soluble and cross-linked mRNA from the fixed tissues. All blocks analyzed in the current work had been fixed and handled at the University Medical Center, Tucson, which may have improved their quality; however, we have since analyzed an additional 120 FFPE blocks from a variety of hospitals with good success (data not shown). A typical qRT-PCR procedure using paraffin-embedded materials requires a long series of steps to first isolate RNA, which can include deparaffinization, protease digestion, phenol:chloroform extraction, alcohol precipitation of the nucleic acids, followed by resuspension, DNase digestion, a second phenol:chloroform extraction, alcohol precipitation, and resuspension. The mRNA template is then reverse transcribed to create cDNA that undergoes PCR amplification. In the ArrayPlate assay, the mRNA is hybridized in situ therefore bypassing several steps in the qRT-PCR process, including extraction and amplification, which can both introduce bias.

The comparison of an old paraffin block (18 years) vs a newly created paraffin block is particularly important since it demonstrates the feasibility of using the ArrayPlate assay to test archival material from previous clinical trials or stored patient samples to validate gene expression correlations to survival from discovery experiments. This aspect may be critical to generating meaningful results from patients who may not otherwise have available material for techniques that require snap-frozen samples, such as patients who are biopsied in smaller hospitals or outlying clinics, those with minimal tissue available for biopsy, or those in whom the possibility of lymphoma was not anticipated before biopsy so that tissue was not snap-frozen.

The level of agreement between the ArrayPlate and both the LymphoChip and Affymetrix arrays, as shown by the kappa statistics, suggests that ArrayPlate technology may be suited to detect outcome associations. While confidence intervals are wide because of the limited number of de novo samples tested, we are collecting samples from a much larger cohort of patients for further testing.

While we were not able to demonstrate a direct relationship between the ArrayPlate and patient survival, attributed to the low number of de novo DLBCL samples in the study, we were able to demonstrate that the hazard ratios agreed with predictions from the larger data sets of Rosenwald, Shipp, Tome, and Lossos 80% of the time, similar to the microarray methods, indicating that it may be possible for clinically significant information to be generated with this technique. Analysis of a larger data set is underway.

In summary, this is an early report on use of the ArrayPlate, a quantitative gene expression protocol for testing paraffin-embedded materials. While this technique does not replace the strength of GEP, which can analyze the entire genome for discovery studies, it does indicate that the significant genes identified by GEP studies can be adapted into a useful platform applicable to nearly all patients. The platform is easily modifiable and can be adjusted as new genes of interest are discovered.