Main

Clear cell renal cell carcinoma (ccRCC) represents the most common subtype of adult kidney cancer.1 Although early diagnosis and treatment of the disease is associated with a favorable prognosis, patients diagnosed at the metastatic stage have abysmal prospects, with only a 9% 5-year survival rate.2 Neoplasms resected at a pre-metastatic stage can also recur, with the greatest risk of recurrence in the first 3–5 years.3 Detecting recurrences early can impact patient outcome since the likelihood of a good response to systemic treatment is greater when the metastatic burden is small.4 In addition, surgical resection of a single or limited number of metastases can result in longer survival.5 Furthermore, ∼3% of patients will develop a second primary renal tumor, either synchronously or metachronously.6

The strongest validated prognostic factors in RCC are stage (anatomic extent of disease) and nuclear grade.7 These parameters are, unfortunately, not very reliable and from a clinical standpoint, ccRCC is notoriously unpredictable. Reports have documented the metastatic potential of even low grade, early stage ccRCC. At the same time, only a portion of advanced stage tumors result in metastatic disease. To address these difficulties, a prognostic model utilizing multiple factors has been developed at Memorial Sloan-Kettering based on a multivariate analysis of patients with metastatic disease.8 This model has had further enhancements based on data from the Cleveland Clinic.9

Recently, moving into the era of personalized medicine, new markers have shown promise in predicting prognosis in ccRCC, including B7H1, B7H410 IMP311 and CAIX.12 Molecular profiling approaches have also identified new biomarkers.13, 14, 15, 16, 17, 18 However, none has gained widespread acceptance as a clinical tool. The identification of additional independent prognostic markers is urgently needed to help in more accurately identifying aggressive ccRCC, as this will have a significant impact on treatment protocols in the current era of targeted therapy. The assessment of microvascular density (MVD) may prove useful in this endeavor and has several advantages as a clinical test, being simple and relatively quick to assess, while using widely available techniques like immunohistochemistry (IHC) on formalin-fixed paraffin-embedded tissues. Currently, digital imaging technology allows for more reproducible automated analysis of IHC results.

The assessment of tumor MVD has been reported as a prognostic indicator in several human malignancies. Specifically, studies have shown that higher tumor vascularity correlates with worse patient outcomes in breast,19, 20, 21 prostate22 and ovarian cancers.23 To date, studies assessing the prognostic value of MVD in ccRCC have shown conflicting results.24, 25, 26, 27, 28, 29, 30, 31 Maclennan and Bostwick25 and Sandlund et al29 both concluded that the extent of angiogenesis has no prognostic significance in relation to patient survival. On the other hand, Yoshino et al31 and Nativ et al27 found that increased MVD was a poor prognostic factor associated with shorter survival. In Yoshino et al's31 case, higher MVD was also correlated with a higher metastatic rate. In contrast to the above studies, Imao et al24 and Yildiz et al30 showed that increased MVD in RCC correlated with longer patient survival. These contradictions are not unique to kidney tumors and one of the apparent reasons is the methodology of MVD assessment. An early and popular method of assessing MVD used ‘hot-spots,’ based on manually counting vessels in the most vascularized areas of histological sections. Originally, it was used in breast cancer and showed a direct relationship between MVD and tumor aggressiveness. Many subsequent studies confirmed these results. However, a substantial number of reports showed no relationship, or an inverse relationship between the vascularity and clinical outcome.32, 33, 34, 35, 36 This discrepancy was attributed to methodological differences, distributional heterogeneity of microvasculature and biological differences between tumor types. In addition, low MVD has been reported as a poor prognostic marker in pancreatic endocrine tumors,37 bladder carcinoma38, 39 and cervical cancer.40 MVD has also been found to vary between different tumor types of the same organ.41, 42 What became apparent, was that the hot-spot methodology for determining MVD was poorly reproducible34, 43, 44 and several studies employed a Chalkley graticule,44, 45 which improved reproducibility, but could not resolve the issue of conflicting results.44, 46

The current advances in digital pathology allow MVD to be assessed with much higher expected accuracy. Computerized approaches offer the capability of analyzing much larger areas and also consistently apply analysis algorithms. Our goal was to improve the accuracy of measurements and test the old hypothesis that MVD correlates with clinical outcome.

In clinical practice, the significance of MVD in ccRCC may have a dual function: prognostic assessment and treatment guidance. Recently, a number of clinical trials have tested the effectiveness of anti-angiogenic therapy for advanced RCC. In particular, Sunitinib and Sorafenib have been approved by the FDA for the treatment of metastatic renal carcinoma.47, 48 The rationale for anti-angiogenic therapy originates from studies documenting the necessity of tumor vasculogenesis for the maintenance, growth and metastatic ability of the tumor.49 However, present anti-angiogenic therapies have shown only modest effects on disease progression while adding a significant cost.50, 51 Detected responses were seen mainly with higher drug doses, which had higher incidences of side effects. Therefore, there is a need for a predictive marker to identify tumors likely to respond in order to justify the higher risk of toxicity and cost of treatment.

In this study, we report the first automated digital assessment of MVD in ccRCC. MVD was assessed in parallel with vascular endothelial growth factor (VEGF), a major driver of angiogenesis.36 We aimed to improve accuracy of measurements by applying strict criteria for quality of staining, tissue integrity and selection of tissue of interest. We also employed data normalization by analyzing normal tissue and calculating tumor/normal ratio, or adjusted values. The unadjusted and adjusted data were then compared for their correlation with prognostic clinical parameters to provide insight into the effect of data normalization as well as the potential use of MVD as a prognostic marker in ccRCC.

MATERIALS AND METHODS

Tissue Collection

With Research Ethics Board approval, 57 consecutive cases of surgically removed ccRCC's between 2001 and 2008 were obtained from the division of Pathology at St Michael's Hospital (Toronto, ON, Canada). Clinico-pathologic data were obtained from the patient medical records and included sex, age, disease-free survival (DFS), overall survival, tumor size, TNM stage and nuclear grade. As per routine reporting protocol, RCC tumors were graded based on the Fuhrman four-scale grading system52 and staged according to the American Joint Committee on Cancer (AJCC) 7th edition. Hematoxylin–eosin sections were reviewed by pathologists (GMY and VVI) to reconfirm the diagnosis and grading and to select appropriate paraffin blocks for IHC.

Immunohistochemistry

All specimens had been fixed in 10% neutral-buffered formalin, dehydrated in graded ethanol and paraffin embedded. Sections of 4 μm thickness were stained with hematoxylin and eosin, as well as CD34 and VEGF. The primary antibodies anti-CD34 (pre-diluted; Ventana Medical Systems, Tucson, AZ, USA) and VEGF (1:50; Santa Cruz Biotechnology, Santa Cruz, CA, USA) were used for staining on a Benchmark staining platform (Ventana Medical Systems), with the BMK iVIEW DAB Paraffin detection kit (Ventana Medical Systems). Heat antigen retrieval was used for VEGF antibody: kit cell conditioner #1 (Ventana Medical Systems) with heating to 95 °C for 8 min, followed by 20 min at 100 °C. I-VIEW inhibitor was incubated for 4 min at 42 °C. For CD34 antigen retrieval, protease 2 was used for 10 min incubation. All slides were then incubated with primary antibody for 32 min, followed by I-VIEW biotin immunoglobulin for 8 min, I-VIEW SA-HRP for 8 min, I-VIEW DAB and H2O2 for 8 min and I-VIEW copper for 4 min. Slides were counterstained with hematoxylin and bluing reagent. After drying, slides were scanned using ScanScope CS (Aperio Technologies, CA, USA) at × 20 magnification and saved as an eight-bit RGB TIFF file at 2 μm/pixel resolution.

Image Analysis

Image analysis was performed similarly to previously described methods.53 The initial image processing of generating signal-mask pairs was achieved using ‘Photoshop CS3’ (Adobe, http://www.adobe.com). The scanned images (representative areas are shown in Figure 1) were duplicated into ‘tumor’ and ‘normal’ folders and manually processed to remove all parts of the image except either tumor or normal kidney medulla within the images of corresponding folders. All areas of necrosis, hemorrhage or non-specific background staining, which could be removed by the eraser tool, were excluded from assessment. Additionally, for normal tissue, only areas without significant changes (fibrosis, inflammation etc) were selected for analysis. In cases where no useable normal tissue was present, only unadjusted values from the tumor areas were recorded.

Figure 1
figure 1

Representative images of CD34 and VEGF staining in tumor and normal kidney medulla.

Within both folders, the images were processed to generate two sets of files: tissue masks to limit analysis within the selected areas (tumor or medulla) and signal images to assess the markers. The tissue masks were obtained by thresholding through a value separating the clear background of histological slides from the tissue. The procedure generated binary black-and-white images. The sequence of transformations was recorded as an ‘action’ (Photoshop, Adobe) and images were processed in batches to save as bitmap files. Next, original images were processed to generate signal images of CD34 and VEGF brown stain. Since CD34 signal was used to count objects, its brown color was converted into solid white color for binary bitmap images (Figure 2), whereas VEGF was assessed as intensity and so its signal was converted into grayscale TIFFs. For CD34, the range of positive brown pixels was collected to convert into black using the ‘convert color’ tool and then thresholding to generate binary images. Both conversion and thresholding steps were adjusted to achieve selection of vascular structures without fragmentation or fusion of vessels. We de-noised the images by 1 pixel using the ‘dust and scratches’ filter (Figure 2). The sequence of transformations was recorded as an ‘action,’ tested on a representative set of images to assure the correct settings and used for batch processing. Then images were checked for quality and fragmented large vessels were additionally restored manually. The vascular objects were counted as vascular density (vessel/mm2) using an image analysis program based on language IDL 6.3 (ITT Visual Information Solutions, Boulder, CO, USA), which has been used in earlier studies.53, 54, 55 The VEGF set of images was processed similarly in batches, but instead of thresholding, the brown color was converted into the grayscale signal using a previously described CMYK approach.56 The grayscale VEGF signal was measured as intensity of staining (mean optical density, MOD) within the tissue masks.

Figure 2
figure 2

Selection of vascular structures by analysis algorithm within normal kidney cortex and medulla. Note that the glomeruli in the cortex with their dense capillary networks each produce a complex, however single, vascular object. Also note that the sclerotic glomerulus is an avascular scar. Variability of glomerular density between individuals and within the cortex (denser in the outer parts) did not allow us to use the renal cortex as a standardized internal control for MVD assessment.

Statistical Analysis

The MVD and VEGF data were collected for both tumor and normal kidney medulla and tumor/normal ratios, or ‘adjusted’ tumor values were calculated. Cutoff points for high and low groups were selected as follows: unadjusted MVD—median of 536.9 vessels/mm2; adjusted MVD, 1.0 (<normal vs >normal); unadjusted VEGF—median of 40.22 MOD, adjusted VEGF, 1.0. Since the cases were split into two groups by MVD and VEGF measurements, for direct comparison, the cases were also stratified into two groups by their stage. The stages were either combined as stages 1 and 2 vs 3, or 1 vs 2 and 3. By tumor grade, the cases were stratified as two groups combining I and II and III and IV grades.

Statistical analysis was performed using XLSTAT version 2010.4.02 (Addinsoft, http://www.xlstat.com). Continuous variables were compared between groups by the non-parametric two-tailed Mann–Whitney test and correlations between continuous variables were assessed by the Spearman test. We used Cox model and Kaplan–Meier analysis to assess associations with DFS. Since tumors with adjusted MVD of <1 had no recurrence events, the hazard ratio could not be calculated, but we were able to calculate the P-value of the likelihood ratio test to assess the significance of associations between the parameters and DFS.

RESULTS

The descriptive statistics of the patients involved in this study is summarized in Table 1. As described in the Materials and Methods section, only technically acceptable and specific tissue areas were included in the analysis. Their number varied within the CD34 and VEGF sets.

Table 1 Descriptive statistics of the numerical variables in the studya

As a preparatory step, we tested kidney cortex and medulla separately for their usability as an internal reference of MVD. Figure 1 shows representative images of CD34 and VEGF staining. Both MVD and VEGF were assessed as unadjusted and adjusted (tumor/normal ratio) values. Our analysis showed that the cortex could not be used as a normal reference because dense glomerular capillary loops could not be reliably separated into individual vascular rings (Figure 2); therefore, we used kidney medulla as an internal reference. For VEGF, we did not observe a specific advantage of either using cortex or medulla as an internal reference and used medulla for consistency.

Associations with DFS

Both tumor stage and adjusted MVD showed similar associations with DFS by Kaplan–Meier analysis (Figure 3). Higher adjusted MVD was associated with significantly shorter DFS. All cases with recurrences occurred in the tumors where MVD was higher than in the normal kidney medulla (log-rank P=0.037) (Figure 3a). Higher stages were also associated with a worse prognosis (log-rank P=0.041 when stages 1 and 2 were combined together, and P=0.03 when stages 2 and 3 were merged in one group). We also applied the Cox proportional hazards model to test the significance of associations with DFS. Higher adjusted MVD showed significant association with tumor recurrences by univariate (likelihood ratio test P=0.02) as well as multivariate analysis in combination with tumor stage (adjusted MVD P=0.032). As expected, higher stage tumors had a higher chance to recur (P=0.056 when stages 1 and 2 were grouped together, and P=0.013 when stages 2 and 3 were grouped together; no significant difference for stage 1 vs 2 or 2 vs 3). The scatter of recurrences vs adjusted MVD and stage are shown in Figure 4a. Interestingly, all recurrent cases fell into the high tumor/normal MVD and stage 3 quadrant (Figure 4a). Figure 4b demonstrates the distribution of recurrences in tumors of different grades. Tumor recurrence did not show an association with higher grade. All recurrent cases showed, however, tumor/normal MVD ratio >1.0, indicating that adjusted MVD is a better indicator for recurrence than grade. Figure 4c shows raw MVD data in relation to stage.

Figure 3
figure 3

Kaplan–Meier plots for disease free survival (DFS) by (a) tumor/normal MVD ratio, as assessed by CD34 staining, and (b, c) tumor stage. Tumor stage data were organized in two groups by either combining stages 1 and 2 (b) or 2 and 3 (c).

Figure 4
figure 4

Distribution of tumor recurrences vs microvascular density (MVD), stage and grade of the tumors. (a) MVD assessed by adjusted MVD/normal medulla compared with stage (b) MVD assessed by adjusted MVD/normal medulla compared with grade and (c) MVD assessed by unadjusted MVD compared with stage. Normal medulla for MVD measurements was available for four tumors with recurrence. Note that all four of these fall in the ‘high stage—high tumor/normal MVD’ quadrant (a).

Adjusted VEGF ratio was available in two tumors, which recurred after surgery. Although this number was too low to assess the significance, both cases had an adjusted VEGF expression >1.0 (Figure 5a). The unadjusted VEGF data did not show an association with DFS (Figure 5b).

Figure 5
figure 5

Vascular endothelial growth factor (VEGF) and grade data distribution. Tumor/normal VEGF ratio was available for assessment in only two tumors, both showed values higher than the median (a). Also note that most of the tumors had VEGF expression higher than normal tissue, which is consistent with the common genetic mechanism in RCC. There was a trend of higher VEGF expression in the higher-grade tumors. Unadjusted data are shown for comparison in panel b.

Associations Between MVD, VEGF and Other Clinical Parameters

An important finding was that, although both adjusted MVD and stage showed comparable associations with DFS, there was no correlation between them as either continuous (Spearman=−0.028; Table 2) or ordinate variables (MVD medians 0.87 and 0.89, Mann–Whitney P=0.8). Figure 4 shows that there was an approximately symmetrical distribution of tumors with adjusted MVD values <1 and >1 in the stage groups: 14 with the MVD <1 and 11 with the ratio >1 for stage 1, compared with 12 and 8 for stages 2 and 3 combined. Another important finding was that the raw MVD measurements, unlike the adjusted ratios, had no association with DFS. As shown in Table 2, we observed a weak trend of decreasing unadjusted MVD in larger tumors (Spearman=0.29, P=0.04). However, there was no difference of unadjusted MVD between the stage groups, which reflect tumor size and the trend was not seen for adjusted MVD data.

Table 2 Correlation matrix (Spearman) between the parameters used for analysis

It is well known that the VHL gene is inactivated in most RCC while VEGF expression is suppressed by VHL in normal non-hypoxic cells. In our data set, 35 out of 46 (76%) tumors showed VEGF expression higher than in normal tissue (Figure 5a). Adjusted VEGF correlated moderately with tumor size (Table 2; Spearman=0.36, P=0.014) and grade (Table 2; Spearman=0.3, P=0.039; Table 3; Figure 5a). We observed higher adjusted VEGF in higher-grade tumors (grouped as grades I and II and III and IV, median 1.23 vs 1.46, respectively, Mann–Whitney P=0.049). Raw VEGF data did not correlate with either grade (Figure 5b), stage, MVD or other parameters (Table 3).

Table 3 MVD and VEGF data vs tumor stage and grade

DISCUSSION

Our study was based on the old hypothesis that solid neoplasms require a vascular network for growth, and a rapidly growing tumor would have a higher vascularity to support the proliferation demands. Despite the fact that the topic has been studied for over 40 years, there is still no clear understanding of what the relationship is between tumor microvasculature and its aggressiveness. Since previous studies employed manual counting methods, their analyses were limited to small areas and were subject to observer-related variability. Without a gold standard, it is difficult to estimate the magnitude of the combined error, but it is likely that a large proportion of previous studies had an error too large to resolve any true biological differences. For example, in hypoxia studies, distribution of the hypoxia marker carbonic anhydrase IX was shown to be heterogeneous by immunohistochemical measurements and 3D reconstruction.53, 57 Only extensive tumor sampling revealed its correlation with tissue oxygen tension measured in vivo. Noticeably, the degree of correlation was increasing with the increasing amount of tumor tissue sampled for immunohistochemical analysis.

Our approach was to minimize both technical and sampling errors. The latest computerized imaging techniques allow for analysis of large areas, which reduces the sampling error caused by tumor heterogeneity. In addition, computer analysis algorithms are consistent, which reduces analytical error. We also aimed to address the recently raised issue of variable antigen preservation within archived tissues, which can have an effect on sensitivity of staining. The time lag between surgery and fixation, size of the specimen and overall fixation time are known to have an effect on tissue preservation. Other factors, such as storage time of paraffin blocks, tissue water content and variability of section thickness can also have an effect on intensity of staining. Practically speaking, it is difficult or impossible to control for all of these factors. These variables directly affect measurements of staining intensity (type of analyses similar to VEGF assessment) and can alter analyses based on object counting (like MVD). Even for the latter type of image analysis, the filters still have a threshold to exclude non-specific background staining. The same filter setting can produce a different number of individual objects due to either fusion of adjacent bright objects or fragmentation of objects with pale staining. Setting a filter manually for each image would introduce an operator-dependent variability. Instead, we employed an internal reference to normalize the data. Since both the tumor and normal tissue within the same section are exposed to the same fixation variables, their ratio is expected to be affected minimally. Due to the inherent even distribution of vessels, we used kidney medulla as an internal reference and calculated tumor/normal ratios, or ‘adjusted’ values.

Using adjusted MVD, we observed an association between high vascular density and reduced DFS (Figure 3). The association was comparable to that of the established prognostic factor, tumor stage. Importantly, only the adjusted MVD, but not the raw MVD measurements, showed a correlation with poorer prognosis. Adjusted MVD and stage were independently associated with decreased DFS, and all tumors with recurrences fell into the ‘high MVD–high stage’ quadrant of the distribution (Figure 4). As seen in Figure 4, the data suggest that, for clinical purposes, adjusted MVD measurements may have a role in selecting more aggressive tumors within the higher-stage groups. A possible biological explanation could be that the more aggressive tumors drive angiogenesis, which either supports or stimulates tumor cell proliferation and invasiveness. The more indolent tumors reach similar size and stage slowly exhausting their vascular supply.

VEGF is the best-characterized mediator of endothelial sprouting, the most common method by which tumors acquire new vascular channels.58 In the presence of VEGF, endothelial cell survival is enhanced and helps to form the vascular network enabling tumor metastases.58, 59 Inclusion of normal tissue into the analysis allowed us to compare the levels of VEGF expression in tumors and normal tissue. Due to the common loss of VHL gene function, the expression level of the normally suppressed VEGF, was expected to be higher in RCCs.60 In fact, in 35 out of 46 cases we did observe VEGF expression higher in tumors compared with normal tissues (76%; Figure 5a). Also, adjusted VEGF levels were higher in higher-grade tumors, which were not observed for unadjusted values. We did not detect any correlation between VEGF expression and tumor stage or MVD. The reason for these results is likely multifactorial. Although VEGF is a major factor in angiogenesis, tumors utilize numerous other mechanisms to develop new vasculature, some of which do not rely on VEGF.58 In addition, it has been shown that VEGF levels are variable within a tumor, with hypoxic areas of tumor secreting higher levels of VEGF compared with well-vascularized areas.61 The degree of VEGF expression in tumors is likely a combination of levels predetermined by the molecular abnormalities with superimposed VEGF production as a physiological response to hypoxia. Since the latter component has an inverse relationship with the microvasculature, it can distort the association between the inherent VEGF expression levels and MVD.

Over several decades, the assessment of tumor microvasculature has drawn attention as a prognostic marker; however a large number of studies produced equivocal results.21, 24, 26, 28, 62, 63, 64, 65, 66 Practically, all studies were based on manual counting of vessels within small areas of tumor utilizing either ‘hot-spots’26, 30, 67 or small tissue microarray cores.68 Both factors, small tissue sample and manual input have a large potential for errors; it is logical to replace those methods with standardized computer analyses of large areas. Moreover, the inclusion of an internal reference point is easily achieved by the new methodology and can further increase the accuracy of measurements. It can be argued that the normal tissue adjacent to the tumor can be affected by it. For example, blood flow can be shunted and the normal tissue may react by an increase in VEGF production and higher MVD. In practice, we observed the associations only using the adjusted data. Therefore, the effect is either not present or smaller than the benefit of normalization and we believe that inclusion of an internal reference has a large potential for immunohistochemical analyses in general.

Both researchers and clinicians should fully understand that accurate assessment of biomarkers is extremely important in the era of targeted therapies. The accumulated experience of currently used targeted medications shows that only a subset of patients benefit from the treatment. The result has been a thorough standardization of clinical tests and development of quality assurance programs for the laboratories performing them. Similarly, researchers should aim to use discovery tools of sufficient accuracy to resolve any potential biological differences. In this study, we aimed to increase the accuracy of immunohistochemical assessment of MVD and VEGF and our main finding was that higher adjusted MVD in ccRCC is predictive of increased tumor aggressiveness. This may prove useful for prognostic and treatment purposes in patients with ccRCC.