Introduction

Prostate cancer is the highest incidence malignancy in the US male population, with an estimated 160,000 new cases and 29,000 deaths attributed to the disease in 2018 [1]. Treating localized prostate cancer with radical prostatectomy provides good oncological outcomes and long-term survival benefits [2, 3]. Nerve-sparing radical prostatectomies are favored if cancer does not involve the neurovascular bundles, as patients have better recovery of sexual function and urinary continence [4,5,6,7,8,9,10,11,12,13]. However, identifying patients eligible for nerve sparing can be challenging using existing preoperative staging methods, leading to a higher rate of non-nerve-sparing radical prostatectomies than necessary [14,15,16,17]. Comprehensive intraoperative evaluation of prostate surgical margins using frozen sections has been shown to increase rates of nerve-sparing radical prostatectomies, while decreasing positive surgical margin rates [18, 19]. However, comprehensive frozen section assessment of margins is time and labor intensive, which is impractical and expensive for many hospitals [18, 20]. Techniques that can rapidly evaluate fresh surgical specimens without freezing and microtome sectioning are therefore needed.

Nonlinear microscopy (NLM) is a fluorescence microscopy technique that can generate subcellular resolution images that closely resemble H&E stained sections of formalin-fixed, paraffin-embedded tissue (subsequently referred to as “paraffin H&E”), however imaging can be performed in freshly excised tissue without freezing or microtome sectioning. NLM generates images by scanning a focused short pulse laser beam on the specimen, which nonlinearly excites fluorescence only at the laser focus, producing an optical sectioning effect over a limited depth range comparable with the thickness of a frozen section [21]. The fluorescence signals are detected and displayed as a function of the focused laser position in order to generate a digital image. Prostate specimen imaging has also been investigated using other optical techniques such as structured illumination microscopy [22, 23], microscopy with UV surface excitation [24], light sheet microscopy [25], and confocal microscopy [26]. Initial results using these methods have been promising, however studies have largely focused on feasibility and larger scale validation of tissue handling methods, imaging, and quantitative diagnostic accuracy are necessary. Compared with other methods NLM has the advantage that it can image without physical sectioning, so images are acquired rapidly without loss or destruction of tissue. NLM can image at depths of up to 100 µm [27,28,29,30], avoiding surgical debris, surface artifacts, or areas of electrocautery [31]. This capability is analogous to serial sectioning in histology, except that image depth can be continuously and rapidly adjusted, providing the pathologist with additional information beyond frozen sections. NLM achieves superior contrast and image quality compared with other imaging modalities [25, 29,30,31], making it particularly well-suited for visualizing nuclear and cytoplasmic/stromal detail.

We have previously demonstrated a rapid tissue preparation and imaging protocol where freshly excised prostate tissue is stained in a nuclear and cytoplasmic/stromal fluorescent dye then evaluated using NLM [27, 32]. This protocol generated images which closely resembled those of corresponding paraffin H&E [32]. Pathologists were able to visualize prostate tissue architecture, secretory and basal cells, inflammation, and stromal and glandular hyperplasia. We also showed that NLM enabled visualization of prostate carcinoma including Gleason patterns, perineural invasion, extraprostatic extension, and positive margins. This previous study provided a qualitative, descriptive analysis of fresh prostate tissue visualized using NLM compared with paraffin H&E, an important first step in interpreting NLM images, but did not quantitatively assess the diagnostic capability of NLM image interpretation. In this manuscript, we report a multi-pathologist blinded reading study of NLM images from 122 freshly excised prostate tissue specimens from 40 patients to assess the accuracy of NLM for detecting carcinoma.

Materials and methods

Specimen preparation and imaging

Freshly excised prostate tissue was collected from patients who underwent a radical prostatectomy using protocols approved by Beth Israel Deaconess Medical Center Committee on Clinical Investigations and Institutional Review Board and Massachusetts Institute of Technology Committee on the Use of Humans as Experimental Subjects. Informed consent was waived by both committees.

The freshly excised prostate tissue was prepared using protocols previously described and shown in Fig. 1a [28, 32]. The fresh prostates were grossed following standard protocols then dissected into specimens of ~10 × 10 mm with 1–4 mm thickness. The unprocessed specimens were stained in a 50% ethanol solution containing the fluorescent contrast agents acridine orange (40 µg/ml; #10050, Electron Microscopy Sciences) and sulforhodamine 101 (40 µg/ml; S7635, Sigma-Aldrich) for 2 min (Fig. 1a). Acridine orange stains DNA similar to hematoxylin, while sulforhodamine 101 stains cytosol and stroma similar to eosin [33]. The specimens were then rinsed for 30 s in saline to remove excess dye and placed on a specimen holder with a glass window for imaging with a nonlinear microscope.

Fig. 1
figure 1

Method for evaluating fresh prostate tissue using nonlinear microscopy (NLM). a Fresh tissue was stained in acridine orange and sulforhodamine 101 for 2 min, then rinsed in saline for 30 s. The specimen was placed on a glass specimen holder and transferred to the nonlinear microscope. The nonlinear microscope was operated in two modes: real-time NLM mode and auto-scan NLM mode. b Real-time NLM mode: A white-light photograph of the specimen surface was displayed with a fiducial marker (in red) indicating the current NLM imaging field, providing a navigational guide. b(i) Pathologists examined the specimens on a computer monitor showing NLM images in an H&E color scale at 16 frames/second while translating the specimen to select the nonlinear microscopy field of view. The NLM evaluation procedure was recorded for offline, postprocedural review (b(ii)). c Auto-scan NLM mode: a NLM image of the entire specimen cross section was generated by automatically acquiring a series of overlapping, high magnification regions and stitching them together

The nonlinear microscope [27] used a short-pulsed laser at 1030 nm wavelength to excite acridine orange and sulforhodamine 101 fluorescence in a narrow focus, providing visualization of a thin section without physical sectioning. The microscope had two interchangeable objectives: a 10×, 0.45 numerical aperture (CFI Plan Apo Lambda, Nikon) and a 5×, 0.25 numerical aperture objective (Fluar, Carl Zeiss). Fluorescent light from acridine orange and sulforhodamine 101 was detected using two photomultiplier tubes (H7422-40p, Hamamatsu). A white-light camera was integrated into the nonlinear microscope to record a gross view of the specimen.

Specimens were evaluated on the nonlinear microscope in two modes: real-time nonlinear microscopy mode which emulates the procedure pathologists use with a standard microscope to evaluate histology and auto-scan nonlinear microscopy mode which emulates a digital slide scanner. In real-time NLM mode, shown in Fig. 1b, pathologists examined the specimens on a computer monitor showing NLM images at 16 frames/second while translating the specimen to select the NLM field of view. Images resembling paraffin H&E were generated from NLM by displaying the fluorescence signals from the nuclear (acridine) and stromal (sulforhodamine) detector channels in an H&E color scale using an algorithm called Virtual Transillumination Microscopy [34]. A fiducial marker (shown in red in Fig. 1b) was displayed on the white-light gross image of the specimen indicating the current position of the NLM image to aid in navigation. The objectives could be rapidly changed for variable magnification and focus depth adjusted for visualizing tissue below the specimen surface. The real-time mode evaluation, including NLM images at known positions, was saved for postprocedural analysis and training. In this mode of operation, a pathologist can rapidly and efficiently evaluate large specimen areas, similar to slide evaluation on a standard histology microscope.

In auto-scan NLM mode (Fig. 1c), a NLM image of the entire specimen was generated by automatically acquiring a series of overlapping, high magnification 1 × 1 mm, 2048 × 2048 pixel frames with 1.2 µm lateral optical resolution [27] and 0.49 µm pixel size. The NLM frames were stitched together (using Microsoft Image Composite Editor) and viewed analogously to a whole slide image from a digital slide scanner. The auto-scan mode is more time-consuming than real-time mode and only acquires a limited number of image depths, however it enables multiple readers to evaluate NLM image data offline.

After evaluation with NLM, the fresh specimens were fixed in formalin on the glass surface of the specimen holder to avoid distortion of the imaged region by specimen handling, then processed for conventional paraffin H&E. The paraffin H&E slides were scanned with a digital slide scanner (20× magnification; Aperio AT2, Leica Biosystems Inc.).

Blinded reading

Three pathologists (a senior pathologist with over 20 years of experience, a junior pathologist with <5 years of experience, and a pathology resident) evaluated NLM images and corresponding paraffin H&E in a prospective blinded reading that consisted of training, pretesting, and reading phases. This protocol, outlined in Fig. 2, was used to assess the accuracy of detecting carcinoma on radical prostatectomy specimens using NLM. Intraoperative Gleason scoring of radical prostatectomies is typically not required and thus excluded in our analysis.

Fig. 2
figure 2

Blinded reading protocol of nonlinear microscopy (NLM) images. The three phase protocol consisted of training, pretesting, and blinded reading to assess the sensitivity and specificity of detecting cancer using NLM versus the gold standard of paraffin H&E

Training

The pathologists were trained in three steps. In the first step, pathologists evaluated fresh tissue in real-time NLM mode for at least 30 min to gain familiarity with NLM imaging and ergonomics. In the second step, pathologists reviewed eight prerecorded real-time NLM evaluation procedure videos (Fig. 1b(ii)) (each ~5–10 min in length) from eight different patients. This step provided experience with various prostate pathologies visualized in fresh tissue on NLM. In the final-training step, the pathologists reviewed images from real-time imaging procedures that emphasized the differences and similarities between NLM and paraffin H&E. These differences and similarities were described previously [32]. The NLM training was performed using real-time NLM mode due to the ease and rapidity of capturing large areas of varying pathologies. The training step took ~3 h for each pathologist.

Pretesting and reading

A total of 122 freshly excised prostate specimens with an average size of 10 × 10 mm were collected from 40 patients and imaged in auto-scan NLM mode. Corresponding paraffin H&E slides were made and scanned on a slide scanner. The nonlinear microscope was operated in auto-scan NLM mode to enable evaluation of the same specimen area by multiple pathologists and enable high correspondence between the NLM imaging plane and the paraffin H&E slide. The NLM images and scanned paraffin H&E slides were viewed in OpenSeaDragon, a web-based viewer that enables variable magnification review. NLM data from six specimens were discarded due to poor correspondence between the NLM and paraffin H&E image planes or because of poor tissue preservation unrelated to the study. These mismatches between the plane of NLM imaging and paraffin H&E slide occur because the NLM image is acquired on freshly excised tissue and the paraffin H&E slide requires tissue processing and microtome sectioning.

In pretesting, pathologists read 15 NLM images for the presence or absence of cancer. Corresponding paraffin H&E slides were then read to provide immediate feedback. After training and pretesting, the pathologists read NLM images of the remaining 101 specimens in a randomized order, blinded to paraffin H&E results, and recorded whether cancer was present or absent. After a washout period of 1 week, the pathologists read the paraffin H&E slides. A consensus diagnosis was obtained for paraffin H&E readings in the event of reader discrepancy in order to have a gold standard diagnosis for comparison.

Statistical analysis

The sensitivity, specificity, positive predictive value, negative predictive value, and accuracy of NLM assessment for the presence of cancer was calculated for each NLM reader using the consensus paraffin H&E diagnosis as the gold standard. The 95% confidence intervals were calculated on the pooled results from the 3 readers. The interobserver variability was calculated using Fleiss kappa. A two-sample t test was used to compare the mean time required to evaluate the NLM images and paraffin H&E.

Results

NLM images of fresh prostate specimens

Example NLM images of fresh prostate specimens are shown in Figs. 3 and 4. In these figures, differences between NLM images of fresh specimens and paraffin H&E including cytoplasmic color difference, increased eosinophilic secretions in carcinoma, diminishment of cell border clarity, expanded cellular appearance, and images that appear thicker than paraffin H&E, are apparent, but these differences do not impair interpretation [32]. Prostate carcinoma with poorly formed and fused glands (Gleason 4) is shown in Fig. 3a (NLM) and b (paraffin H&E). Fig 3c, d show an example of foamy gland adenocarcinoma visualized with NLM and paraffin H&E, respectively. The foamy appearance is not as apparent in the NLM image and instead the cytoplasm appears more eosinophilic. The bright intraluminal eosinophilic secretions seen in the NLM image commonly appear in fresh tissue visualization of carcinoma and are particularly evident in foamy gland variant. An example of mucinous fibroplasia visualized with NLM and paraffin H&E is shown in Fig. 3e, f. Collagenous micronodules are seen with associated eosinophilic stroma. Figure 4a, b shows glomeruloid and cribriform patterns, respectively. Figure 4c is an example of perineural invasion. Figure 4d shows prominent nucleoli in a malignant gland, exemplifying the cytological detail present in NLM images of fresh tissue. In these example images, typical histological features of carcinoma (prominent nucleoli, glandular, and stromal architectural patterns) are readily apparent in NLM and enable specimen reading and interpretation.

Fig. 3
figure 3

Example nonlinear microscopy (NLM) images of fresh tissue pathology and corresponding paraffin H&E slides. a, b Poorly formed and fused glands (Gleason 4); c, d Foamy gland adenocarcinoma; e, f Mucinous fibroplasia

Fig. 4
figure 4

Example nonlinear microscopy (NLM) images of carcinoma in fresh prostate tissue. a Glomeruloid pattern. b Cribriforming. c Perineural invasion. d Large nucleoli visualized in malignant glands

Sensitivity and specificity

Table 1 summarizes the blinded reading results of the 101 specimens (61 with cancer present on paraffin H&E, 40 without cancer present). All three pathologists had a 95% or greater sensitivity and a 100% specificity. The pooled sensitivity was 97.3% (93.7–99.1%; 95% confidence interval) and specificity was 100.0% (97.0–100.0%). Interobserver agreement between the three pathologists was almost perfect for the NLM readings with a Fleiss κ = 0.95. The three individual pathologists required an average of 39.2, 53.5, and 73.7 s to evaluate each NLM image and 38.6, 57.0, and 59.4 s to evaluate each paraffin H&E slide giving a combined average time of 55.4 s per NLM image and 51.7 s per paraffin H&E slide. These mean times were not statistically different in a two-sample t test (p = 0.76).

Table 1 Sensitivity and specificity of nonlinear microscopy for detecting carcinoma in fresh prostate tissue versus paraffin H&E

Discussion

In this study, we show that NLM with rapid fluorescent staining enables cancer detection with nearly equivalent accuracy (98%) to that of paraffin H&E with only a short training period. Specimen preparation requires <3 min after grossing, is nondestructive (no freezing or microtome sectioning), and enables gold standard histological postoperative analysis of the same specimen. In this blinded reading, small fragments of tissue (10 × 10 mm) were used to represent localized detection of prostate cancer, however, multi-centimeter, fresh whole-mount specimens can also be evaluated on the nonlinear microscope without increasing specimen preparation times. Furthermore, several specimens can be prepared in parallel without additional specialized equipment because staining only requires a container for the fluorescent solution.

Because the nonlinear microscope can operate in real-time and auto-scan modes, it combines the analogous capabilities of traditional light microscopy and whole slide scanning in a single instrument. In both of these modes, the time to scan an area increases as the image resolution increases. In this study, we acquired data using auto-scan mode, which restricts imaging to a single objective for the entire specimen. Using this mode, we found that acquiring data using a 10×, 0.45 numerical aperture objective was sufficient for cancer detection in large specimens, while maintaining rapid data acquisition times. Real-time mode enables rapid surveying of large areas of tissue at user-specified magnifications and is thus the proposed mode for intraoperative evaluation of fresh tissue. Real-time mode was also used for pathologist training since the trainees could directly operate the microscope and observe a multitude of different specimens efficiently. This mode of operation, however, does not provide unbiased images of specimens because the operator controls the specimen translation, speed, and magnification, and therefore it is not appropriate for a blinded reading. Although the auto-scan mode is slower to acquire data, it enables evaluation of the same specimen by multiple different pathologists unbiased to the other’s evaluation. Because it also enables specimen image archiving for offline viewing, the auto-scan mode was naturally conducive to this reading study.

There are several differences observed when evaluating fresh, unprocessed tissue with NLM. These differences include: variations in cytoplasmic color which can be more basophilic in carcinoma on NLM than paraffin H&E; an increased frequency and volume of eosinophilic intraluminal secretions in carcinoma when visualized in fresh tissue with NLM than on processed tissue with paraffin H&E; loss of cell border sharpness and a thicker appearance of NLM images on fresh tissue due to the tissue not being dehydrated or processed with paraffin. These differences, however, are consistent and, combined with the ability to rapidly image multiple depths (analogous to serial sections), can augment interpretation of fresh tissue. Therefore, with training and experience, pathologists became accurate and comfortable interpreting these differences as evidenced by the high sensitivity and specificity (97.3 and 100%, respectively) and interobserver agreement reported in this study.

Blinded reading of surgical margins with NLM was not practical in this study because positive-margin rates are low and large numbers of specimens representing surgical margins were not available. However, we have previously demonstrated NLM imaging of extraprostatic extension and positive surgical margins [32] and typical histological features of carcinoma (prominent nucleoli, glandular, and stromal architectural patterns) are independent of its location and were readily apparent in NLM images. Furthermore, this study used small fragments of tissue (average 10 × 10 mm) to assess localized cancer detection. Finally, diagnostic performance is expected to further improve when NLM is operated in real-time mode because the pathologist can adjust the instrument and view different depths continuously, while data from auto-scan NLM mode is limited to a few depth planes. This study provides evidence that NLM has a high diagnostic performance for assessing prostate cancer and may be a promising method for intraoperative specimen evaluation.

Gleason scoring would typically not be performed intraoperatively and therefore is not included in this study. Instead, we plan to analyze Gleason scoring accuracy in a separate study on needle core biopsies where it might be used clinically. Our initial studies, which include use of higher magnification objectives with smaller fields of view, suggest that Gleason scoring is feasible but requires increased reading time. There are challenges when interpreting Gleason patterns due to the differences in NLM image for fresh tissue versus paraffin H&E as described above, including thicker appearing sections, increased luminal secretions, and variation in simulated staining colors. However, NLM can generate images at adjustable depths up to 100 µm below the tissue surface, analogous to serial sectioning, which may enhance interpretation of Gleason patterns.

Many studies have investigated the utility of frozen sections in the intraoperative evaluation of prostate margins [35,36,37]. In order to avoid excessively prolonging surgical time, these studies have often restricted frozen sections to a small number of specimens representing a limited fraction of the margin or grossly suspicious areas and have reported wide variations in sensitivity for detecting positive margins [35,36,37,38]. Recent studies using more comprehensive frozen section sampling of prostatectomy specimens demonstrated high sensitivities for detecting positive surgical margins, enabling an increase in the rate of nerve-sparing radical prostatectomies and a reduction in positive surgical margins [18, 19, 39, 40]. For example, in a study of over 11,000 patients, NeuroSAFE [18] demonstrated a significant increase in nerve-sparing radical prostatectomy rates (81 to 97% in all tumor stages) with a decrease in positive margins rates when using frozen sections. However, NeuroSAFE required processing up to 25 frozen sections per patient, with five cryostats, two pathologists, and four technicians to maintain a 35-min average evaluation time. This and other studies demonstrated that intraoperative margin evaluation can improve nerve-sparing radical prostatectomy and positive surgical margin rates, but comprehensive sampling required extensive personnel, which is impractical and expensive for most hospitals and surgical workflows [18,19,20].

The present study suggests that NLM has sufficient accuracy for the rapid evaluation of prostate tissue, achieving a pooled 97.3% sensitivity and 100% specificity for detecting carcinoma compared with paraffin H&E in a blinded reading by three pathologists. In contrast to frozen section analysis, NLM imaging does not require freezing and microtome sectioning. Multiple specimens can be prepared for imaging in parallel and large specimens can be imaged without requiring dissection into smaller sizes. Pathologists can begin NLM assessment within 3 min after inking and grossing. These advances support future studies investigating comprehensive NLM evaluation of radical prostatectomy margins, comparable with the NeuroSAFE, with many fewer personnel and shorter evaluation times. Accurate, safe, and efficient intraoperative margin assessment could improve decisions regarding nerve sparing without increasing positive-margin rates.