Main

The mechanisms for the metastasis of epithelial neoplasms via lymphatic vessels have recently become the focus of intensive research activity.1, 2, 3, 4, 5 Researchers probe into the biology of lymphatics, the interactions of tumor cells with the lymphatic vessels, and the mechanisms of lymphangiogenesis. Central to all these efforts was the discovery of lymphatic endothelium markers, such as LYVE-1, the antibody against podoplanin, D2-40, prox-1 and VEGFR-3.1, 5 It has been postulated that the density of the lymphatic vessels is a predictor of clinically meaningful outcomes in a number of malignancies.6, 7, 8, 9, 10, 11, 12, 13 One of the most critical steps in the assessment of lymphatic vessel density is the selection of the optimal lymphatic endothelium marker,1, 2, 5, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23 in order to avoid spurious findings arising from the low accuracy of the staining. However, given the absence of a gold standard, the evaluation of the sensitivity and the specificity of the lymphatic endothelium markers is a daunting task.

Antibody against podoplanin is considered by many researchers to be the most specific lymphatic endothelium marker available.1, 5, 9, 14, 15, 17, 21 Such claims are not based on formal quantitative comparisons between different lymphatic endothelium markers in the same samples, despite the numerous relative calls in the literature.3, 12, 16, 20, 21, 22 We set out to compare the diagnostic accuracy of podoplanin and D2-40 in the same samples, employing a Bayesian approach to account for the absence of a gold standard.

Materials and methods

Tissue Samples

We used 4-μm-thick sections from a sample of 30 formalin-fixed, paraffin-embedded archival tissue blocks of head and neck squamous cell carcinoma specimens. In all, 25 of these were biopsy specimens, while five were taken from excised tumors. We counted the total number of vessels (both blood and lymphatic) using the pan-endothelial marker CD34.24, 25 A monoclonal antibody raised against podoplanin, as described previously26, 27 (kindly provided by Dr Geleff, Department of Pathology, University of Vienna, Austria), and the commercial monoclonal antibody D2-40 (SIG-730, Signet Laboratories, Dedham, MA, USA, dilution 1:40) were used to stain lymphatic vessels in two adjacent sections. Immunohistochemistry was performed using the EnVision System (DAKO, Copenhagen, Denmark), as previously described.13

Evaluation of Staining Patterns

We assessed the expression of the antibody against podoplanin and D2-40 in each section using an Olympus BX-51 microscope at × 400 magnification. We aimed to identify whether each single vessel in any section was stained with both antibodies, only with podoplanin or only with D2-40. For this reason, we recognized the continuity of each vessel in the adjacent sections. We decided on a set of a priori formulated rules to evaluate the staining. First, we specified that vessels containing red blood cells were most likely blood vessels.28 We therefore considered all stained vessels with intraluminal erythrocytes as potential ‘false-positives’. We recorded these potential ‘false-positives’ in order to check the plausibility of our calculations. Second, we considered vessels, which appeared joined in one section and separated in the other, as one in both sections. Third, we excluded any vessels that were identified only in one section (stained by either antibody) but could not be identified in the adjacent section. Hence, we only counted vessels that could be identified in both sections stained or unstained by the lymphatic markers. Finally, to ensure that no double counting occurred, we photographed areas with high lymphatic vessel density and compared them in the neighboring sections with image processing software. Counts of positively stained vessels with either antibody, as well as potential ‘false-positive’ counts were tabulated in 2 × 2 contingency tables. We compared whether the two antibodies had any difference in the number of identified lymphatic vessels and in their potential ‘false-positive’ counts using McNemar's test.

Assessment of Diagnostic Accuracy

The two antibodies may be conceived as two diagnostic tests to identify lymphatics. In order to assess their accuracy, one has to calculate their sensitivity (the proportion of positively stained vessels that are indeed lymphatic vessels) and their specificity (the proportion of the unstained vessels that are not lymphatic vessels).29 In the absence of a gold standard, it is not possible to use ‘classical’, frequentist approaches to estimate these quantities.30 In simple terms, there are five unknown parameters (the sensitivities and the specificities of the two antibodies as well as the true prevalence of the lymphatic vessels in the examined sections), which cannot be estimated directly from the data. However, such problems can be addressed with nonfrequentist, Bayesian approaches, using information on staining combinations with both antibodies, as previously proposed.30 The basic idea behind any Bayesian assessment of diagnostic accuracy is to utilize external information in order to bypass the restrictions posed by the unknown quantities. Prior knowledge (knowledge not stemming from the data) can be used to describe the unknown quantities mathematically, as distributions of probable values. The available data from the current study are summarized in the table of positively stained vessels with either antibody; the potential ‘false-positive’ counts have not been taken into account in the assessment of the diagnostic accuracy. Finally, the information conveyed by the data is combined with the prior information using Bayes’ theorem, to yield the final, ‘posterior’ knowledge on the specificities and sensitivities of the two antibodies. One can then easily draw inferences on the diagnostic accuracy of the two tests.

The first step in any Bayesian analysis is to obtain reasonable prior distributions, based on sound judgments. In our case, a priori very little was known about the true number of the lymphatic vessels in the sections. We approximated this uncertainty by using a uniform prior distribution for the prevalence of lymphatics (the proportion of true lymphatic vessels among all vessels) in the sections. This distribution considers all prevalence values between 0 and 100% as equally probable, and thus is completely noninformative. Limited information was also available on the sensitivity of each antibody. We assigned a broad range of probable values between 60 and 100% for the sensitivities of both antibodies (Figure 1). This was based on observations from electron microscopy studies,26 where all small lymphatic vessels were intensely stained with podoplanin and larger lymphatics were not intensely stained. Since the vast majority of the lymphatic vessels are small, this prior seems to be valid. We specified the same prior for D2-40. These monoclonal antibodies are considered to be highly specific.1, 5, 9, 14, 15, 17, 21, 26 Therefore, we assigned to both antibodies high specificities. Also, prior beliefs regarded the antibody against podoplanin as the most specific lymphatic endothelium marker;1, 5, 9, 14, 15, 17, 21 hence, we set priors between 90 and 100% for the specificity of the antibody against podoplanin, and between 80 and 100% for D2-40 (Figure 1). In secondary analyses, we examined if the results changed when we assigned the same priors between 80 and 100% to the specificity of both antibodies. Various other vague (noninformative) prior distributions were also used. A first set of analyses considered the two tests (antibodies) statistically independent. This assumption may not hold if D2-40 recognizes podoplanin protein.31 This would mean that the two tests are correlated (not statistically independent), and should be taken into account mathematically.32, 33 Thus, in a second set of analyses, we allowed for possible correlation between the two tests. We specified a prior for this correlation that ranged between negligible and very high. (Details on the technicalities of the approaches, as well as the results of sensitivity analyses, are provided as online supplementary information.)

Figure 1
figure 1

Prior distributions for the unknown quantities. Bayesian approaches borrow strength from prior information (information not stemming from the data) on the five unknown parameters in the problem. Prior information is expressed as distributions of probable values. The same Beta(12,3) prior was set for the sensitivities of the two tests (thin solid line). A Beta(71.25,3.75) prior was used for the specificity of the antibody against podoplanin (thick solid line) and a Beta(31.5,3.5) for the specificity of D2-40 (thick dashed line). The prior for the prevalence (thin dashed line) was a uniform distribution. All distributions extend from 0 to 100%. Narrow distributions imply that a narrow range of values is more probable (has a higher probability density).

Based on the estimates of diagnostic accuracy, we calculated how many of the positively stained vessels would truly be lymphatics in different tissues, where lymphatics may be commonly or sparsely found. This proportion is otherwise called the positive predictive value of each antibody and is dependent on the prevalence of the lymphatics in a section. Finally, we compared the predicted number of false positively stained vessels in our samples with the number of stained vessels with intraluminal erythrocytes in order to assess the plausibility of our findings. Given that the counts of the potential ‘false-positive vessels’ were not taken into account in the estimation of the diagnostic accuracies, this may serve as an external validation of the calculations.

Analyses were conducted in R 2.0.1 (R Foundation for Statistical Computing, Vienna), JAGS 0.80 (Martyn Plummer, 2005) and Intercooled Stata 8.2 (Stata Corp., College Station, TX, USA). Results are presented in the form of posterior medians and 95% equally tailed posterior credible intervals (CrI) (95% CrI). CrIs are analogues of the usually encountered (frequentist) 95% confidence intervals. McNemar test P-values are two tailed, and are considered significant for P<0.05.

Results

Immunostaining

Typical staining patterns with both antibodies for lymphatic vessels are shown in Figure 2. Generally, positively stained vessels with either lymphatic marker were thin walled and irregularly shaped, and were often comprised by two to three endothelial cells. Overall, we identified 23 542 different blood vessels and lymphatics in the 30 specimens we examined. Of these, 1983 and 2255 were stained with the antibody against podoplanin and with D2-40, respectively (Table 1). The distribution of the vessel counts in the 2 × 2 tables across the 30 specimens did not differ (Breslow-Day P=1.00). Significantly more vessels were positive for D2-40 (P<0.001). There were 394 vessels that were positive either for podoplanin or D2-40, but had intraluminal red blood cells and were considered as potential ‘false-positives’ (Table 2). These potential ‘false-positives’ were significantly more common with D2-40 compared to podoplanin (P<0.001).

Figure 2
figure 2

(a) Adjacent sections from the same tumor sample (laryngeal squamous cell carcinoma, SCC), in which the same vessels (indicated by arrows of the same color) are positively stained with both podoplanin and D2-40 (magnification × 400). Vessels appearing joined in one section and separated in the other were counted as one, for both the sections. (b) Adjacent sections from the same tumor sample (laryngeal SCC), in which a vessel with luminal red blood cells (potential ‘false-positive’) is negative for podoplanin and positive for D2-40 (magnification × 1000). Red arrows indicate red blood cells. (c) Adjacent sections from the same tumor sample (SCC of the floor of the mouth), in which a vessel with luminal red blood cells (potential ‘false-positive’) is positively stained with both podoplanin and D2-40 (magnification × 1000). Notice the vessel close to the positive one, which has also luminal red blood cells (red arrow), but is negative for both the antibodies. (d) Adjacent sections from the same tumor sample (laryngeal SCC), in which both true- and false-positive vessels can clearly be seen (magnification × 400). All of these vessels are positive for both markers. Red arrows indicate red blood cells; arrows of the same color indicate the same vessels in the two sections. (e) Adjacent sections from the same tumor sample (SCC of the tongue), in which a vessel with luminal red blood cells (potential ‘false-positive’) is negative for podoplanin and positive for D2-40 (blue arrow, magnification × 4000). There is also a vessel without red blood cells, which is positively stained with both markers (green arrow). (f) Adjacent sections from the same tumor sample (SCC of the lower lip), showing a vessel without luminal red blood cells, which is podoplanin negative and D2-40 positive (magnification × 1000). (g) Adjacent sections from the same tumor sample (laryngeal SCC), showing a vessel filled with lymphocytes, without red blood cells, which is positively stained with both markers.

Table 1 Counts of stained or unstained vessels with either antibody
Table 2 Counts of vessels with intraluminal erythrocytes with either antibody

Diagnostic Accuracy

Under the assumption that the two tests are statistically independent, both antibodies were found to be extremely specific for the identification of lymphatic vessels. The posterior estimates of the specificities of the two antibodies were clearly separated (Figure 3a), although their absolute difference was very small. Specificities were 99.7% (95% CrI: 99.5–99.9%) and 98.8% (95% CrI: 98.3–99.5%) for the antibody against podoplanin and D2-40, respectively. Podoplanin had lower sensitivity (92.6% (95% CrI: 86.1–97.9%)) compared to D2-40 (97.3% (95% CrI: 94.9–99.2%)) as shown in Figure 3b. Similar inferences were drawn when we used a different set of priors.

Figure 3
figure 3

Posterior distributions assuming that the two antibodies do not yield statistically correlated results. The combination of the prior knowledge and the data yields the final estimates of the analysis (posterior knowledge). (a) Posterior distributions of the specificity of the antibody against podoplanin (thick solid line) and D2-40 (thick dashed line). For comparative reasons, the prior distributions for the specificities of the antibody against podoplanin (thin solid line) and D2-40 (thin dashed line) are also shown. (b) Posterior distributions of the sensitivity of the antibody against podoplanin (thick solid line), and of D2-40 (thick dashed line). The thin solid line is the common prior for the sensitivities of the two antibodies.

The estimated prevalence of the lymphatic vessels was 8.8% (95% CrI: 8.1–9.5%). We calculated that the proportion of positively stained vessels (positive predictive values) that are truly lymphatics in our samples was 96.3% (95% CrI: 94.2–98.6%) and 88.9% (95% CrI: 83.9–95.7%) for podoplanin and D2-40, respectively. As evident from Figure 4, although the two antibodies had very similar diagnostic accuracy, the positive predictive values may differ, especially when lymphatics are sparse. Based on these estimates, the predicted proportion of falsely positively stained vessels would be 3.7 and 11.1% for the two antibodies, respectively. The corresponding observed proportions of stained vessels with intraluminal erythrocytes were 4.9 and 13.2% (Table 2).

Figure 4
figure 4

Positive predictive values for different prevalences of lymphatic vessels assuming that the two antibodies do not yield statistically correlated results. The proportion of the stained vessels that are truly lymphatic vessels (positive predictive value) for the antibody against podoplanin (solid line) and for D2-40 (dashed line) is plotted against the prevalence of the lymphatic vessels in a tissue section. When lymphatics are sparse, the difference in the positive predictive value increases. The dots represent the calculated values for our samples, where the estimated prevalence of the true lymphatics was 8.8%.

The second set of analyses that allowed for correlation between the two tests yielded qualitatively similar results. Podoplanin and D2-40 were estimated to have specificities 95.6% (95% CrI: 92.0–98.9%) and 94.8% (95% CrI: 90.9–98.3%) and sensitivities 82.6% (95% CrI: 62.7–94.7%) and 90.0% (95% CrI: 74.0–97.6%), respectively. The positive predictive value was 50.9% for podoplanin and 48.9% for D2-40.

Discussion

In the current study, we compare two lymphatic endothelium markers, and provide handy numerical estimates for their diagnostic accuracy. Our results suggest that both markers have extremely high specificity and sensitivity for the identification of lymphatic vessels, and have comparable positive predictive values in most settings.

Up to date, experts’ opinions favored the antibody against podoplanin as the most reliable and specific marker at hand.1, 5, 9, 14, 15, 17, 21 Nevertheless, scientists emphasize the necessity for a quantitative evaluation of the accuracy of the so-called ‘specific lymphatic endothelium markers’.1, 5, 19, 20, 21, 22 To our knowledge, our work is the first report that pursues direct comparisons between two lymphatic endothelium markers and provides quantitative estimates for their specificity, sensitivity and positive predictive value. Contrary to prior beliefs, we found that both antibodies are essentially equally specific, in all sets of analyses. Thus, there may be little reason to prefer either one of the antibodies in most settings.

In our sections, 5% of the vessels stained by the antibody against podoplanin and 13% of those stained by D2-40 contained intraluminal red blood cells. These counts were very close to those calculated based on the positive predictive values of the two antibodies (approximately 4 and 11%, respectively) under the assumption that the tests are statistically independent. This may be seen as an indirect support in favor of these findings, since the potential ‘false-positive’ counts were not taken into account in the Bayesian analyses. On the contrary, allowing for correlation between the two tests, one would calculate that almost half of the stained vessels were not lymphatics. Analyses allowing for correlation between the two antibodies are heavily dependent on the used prior distributions, and this might be a reasonable explanation for the high number of expected ‘false-positives’. That high proportions of ‘falsely positively’ stained vessels would be incompatible with electron microscopic immunohistology findings,26 where all examined podoplanin-stained vessels in normal skin were found to be lymphatics, and reciprocally, all lymphatics were stained with podoplanin. Taking all the above into account, we lend more credence to the inferences drawn from the model that assumes that the two tests are statistically independent.

The approach we used may yield reliable estimates for the prevalence30 of the lymphatic vessels in a tissue section. One of the main drawbacks of the assessment of the lymphatic vessel density in histological sections is the very high interobserver variability in the identification of the areas with the highest vessel concentration, the so-called ‘hot-spots’.34 On the contrary, the prevalence of the lymphatics in a section is not so subjective, since the assessor simply counts everything. These prevalence estimates might be equally good or better predictors of clinical outcomes compared to the assessment of the lymphatic vessel density. Moreover, the whole procedure could be automated through specialized software, using a PC connected to a light microscope.35

Some limitations of our study should be discussed. First, we estimated the total number of vessels (blood and lymphatics) using CD34. It has been reported that CD34 is expressed both in blood vessels and lymphatics, but blood vessels stain more intensely.24, 25, 36 Thus, the total number of vessels we counted may be an underestimation of the true value. However, this did not bias our findings, as evident from sensitivity analyses. Second, we did not perform double immunostaining, as this would be technically more difficult. Nevertheless, given our elaborate methodology for the evaluation of the staining patterns, large misclassification is unlikely. Third, there is an inherent subjectivity in Bayesian approaches, expressed by the morphology of the prior distributions. The choice of the prior distributions mostly affects inferences based on the assumption that the two antibodies are statistically dependent. However, the used priors are based on the available knowledge and a comprehensive literature search.1, 5, 9, 14, 15, 17, 21, 26 Finally, we considered stained vessels with intraluminal red blood cells as potential ‘false-positives’. Intraluminal erythrocytes might be a histologic artifact resulting from pressure-induced leakage of a neighboring blood vessel in a true lymphatic vessel.26 This phenomenon is observed in lymphangiomas and might also happen in rapidly growing tissues, like malignant tumors.26 However, we believe that the majority of the potential ‘false-positives’ are indeed blood vessels; otherwise, the two antibodies ought to yield roughly the same number of potential ‘false-positive’ vessels.

Allowing for these caveats, both antibodies can be regarded as reference standards for the identification of lymphatic vessels in most settings. Similar approaches could be employed for the evaluation of diagnostic accuracy of other markers, such as the remaining lymphatic endothelium markers, antibodies detecting blood vessels or antibodies from different manufacturers detecting the same antigen.