Layer thickness prediction and tissue classification in two-layered tissue structures using diffuse reflectance spectroscopy

Geldof, Freija; Dashtbozorg, Behdad; Hendriks, Benno H. W.; Sterenborg, Henricus J. C. M.; Ruers, Theo J. M.

doi:10.1038/s41598-022-05751-5

Download PDF

Article
Open access
Published: 01 February 2022

Layer thickness prediction and tissue classification in two-layered tissue structures using diffuse reflectance spectroscopy

Freija Geldof¹,
Behdad Dashtbozorg¹,
Benno H. W. Hendriks^2,3,
Henricus J. C. M. Sterenborg^1,4 &
…
Theo J. M. Ruers^1,5

Scientific Reports volume 12, Article number: 1698 (2022) Cite this article

2415 Accesses
10 Citations
1 Altmetric
Metrics details

Subjects

Abstract

During oncological surgery, it can be challenging to identify the tumor and establish adequate resection margins. This study proposes a new two-layer approach in which diffuse reflectance spectroscopy (DRS) is used to predict the top layer thickness and classify the layers in two-layered phantom and animal tissue. Using wavelet-based and peak-based DRS spectral features, the proposed method could predict the top layer thickness with an accuracy of up to 0.35 mm. In addition, the tissue types of the first and second layers were classified with an accuracy of 0.95 and 0.99. Distinguishing multiple tissue layers during spectral analyses results in a better understanding of more complex tissue structures encountered in surgical practice.

Evaluation of wavelength ranges and tissue depth probed by diffuse reflectance spectroscopy for colorectal cancer detection

Article Open access 12 January 2021

Marcelo Saito Nogueira, Siddra Maryam, … Stefan Andersson-Engels

Extended-wavelength diffuse reflectance spectroscopy dataset of animal tissues for bone-related biomedical applications

Article Open access 26 January 2024

Celina L. Li, Carl J. Fisher, … Stefan Andersson-Engels

Multi-wavelength interference phase imaging for automatic breast cancer detection and delineation using diffuse reflection imaging

Article Open access 03 January 2024

Alaaeldin Mahmoud & Yasser H. El-Sharkawy

Introduction

In oncological surgery, the aim is usually to remove the entire tumor together with a margin of healthy tissue, while other surrounding healthy structures are spared as much as possible. However, identifying the tumor during surgery can be challenging. Inadequate tumor resection increases the risk of local tumor recurrence and decreases survival rates, making the resection margin status an important prognostic factor for patient outcome¹. This emphasizes the demand for a real-time tissue discrimination technique to provide intra-operative guidance.

In the last decades, optical techniques have been introduced to analyze the composition of materials in various fields, such as environmental monitoring by remote sensing^2,3,4, agriculture^5,6, food quality inspection^7,8,9, chemical and pharmaceutical industry^10,11, and forensics^12,13. In the medical field, multiple optical techniques have been introduced for tissue discrimination and margin assessment. Examples include hyperspectral imaging^14,15, elastic scattering spectroscopy^16,17, and Raman spectroscopy^18,19. Common advantages of these optical techniques are that they are fast, non-invasive, and do not require administration of contrast agents. Another such technique is diffuse reflectance spectroscopy (DRS)^{20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36}. DRS is a point-based technique, in which broadband light with wavelengths in the visible and/or near-infrared range is sent into tissue using optical fibers. After the light undergoes various scattering and absorption events in the tissue, a part of the light is reflected back to the tissue surface. The detected diffuse reflectance spectrum of this light represents an “optical fingerprint” of the measured tissue, which can be analyzed. DRS has been successfully evaluated for detection of cancer in breast^20,21,29, colorectal^{30,31,32,33,34}, head and neck^22,35,36, liver^23,24, lung^25,26, and brain^27,28 tissue, in both ex vivo and in vivo studies. These studies showed that tumor tissue could be discriminated from healthy tissue with classification accuracies of 0.77–1.00, suggesting this technique has a great potential for real-time tissue assessment during surgery.

The aforementioned studies focused primarily on identifying the tissue directly at the measurement surface, assuming a single-layer homogeneous tissue structure. When analyzing the DRS spectra, often fit models were used to estimate optical and physiological parameters of the measured tissue such as water, fat, hemoglobin, β-carotene and collagen concentration, reduced scattering amplitude, and mie scatter slope. In surgical practice, however, the measured tissue is often inhomogeneous and may consist of several layers, which decreases the performance of current analytical models, such as described by Farrell et al.^37,38.

In previous studies, different DRS models for two-layered media have been proposed, mainly to analyze multi-layered skin epithelium^{39,40,41,42,43,44,45,46,47}. However, most of these studies are based on only Monte Carlo simulations to generate diffuse reflectance spectra of two-layered models with various physiological parameters and did not use any experimental data as input. Inverse fitting was then used to quantify and validate the optical properties and epidermal thickness, some combined with liquid phantom measurements^40,41 or comparisons with known values for human skin^44,45,46. The large number of parameters in these two-layer models greatly increases the computational time, which is not desirable for real-time in vivo use during surgery. Moreover, the modeled epidermal layer thicknesses were in the order of 100 µm, while during surgery, layer thicknesses of multiple millimeters can be encountered.

In recent years, machine learning models are progressively replacing physical models. This paper will investigate whether advances in machine learning can address the challenges in analyzing two-layered tissue structures, using experimental data from both a two-layered phantom and two-layered animal tissue. Distinguishing multiple tissue layers during spectral analyses would ultimately facilitate the analysis of more complex tissue structures and the assessment of complete resection margins up to multiple millimeters in depth. First, the feasibility of extracting structural information from fiberoptic DRS data will be examined. To this end, we will explore whether DRS can be used to predict the thickness of the top layer in two-layered tissue. This will be performed in a controlled experiment, using a two-layered tissue-mimicking phantom. To the best of our knowledge, no similar approaches have been explored before. In clinical practice, the tissue type discrimination of the different layers is important as well. Therefore, a second experiment will be performed using animal tissue, containing a layer of fat and a layer of muscle tissue. In addition to predicting the top layer thickness, we will examine whether it is possible to predict the tissue types of both the top layer and the layer beneath it. Lastly, the thickness prediction performance using DRS will be compared to the performance using ultrasound (US) imaging, and to a combination of both techniques.

This study aims to use DRS to predict the top layer thickness in two-layered tissue structures and classify the tissue type of both layers. After acquiring the data, two types of features will be extracted and used in regression and classification models to evaluate the performances of the developed machine learning algorithms.

Methods

Measurement setup

The fiberoptic DRS system consisted of a Tungsten halogen broadband light source and two spectrometers; one for the visible domain (400–1100 nm, Andor Technology, DU420ABRDD) and one for the infrared domain (900–1700 nm, Andor Technology, DU492A-1.7)⁴⁸. A probe with two source-detector fiber distances of 2 and 6 mm was used to retrieve information from both superficial and deeper sampling depths. In DRS, the sampling depth (penetration depth of the detected photons) depends on the distance between the source and detector fiber. Even though the sampling depth is also influenced by the sample optical properties, it has been shown that the sampling depth is approximately equal to the source-detector distance⁴⁹. Ultrasound images were acquired using the portable Philips CX50 machine (Philips Research, Eindhoven, The Netherlands) in combination with the Philips L15-7io transducer (Philips Research, Eindhoven, The Netherlands), a high-frequency 15–7 MHz ultrasound transducer specially designed for superficial imaging.

Phantoms

Tissue-mimicking phantom

For the first phase of this study, an artificial phantom was created that consisted of two homogeneous layers with a sharp boundary in between. The thickness of the top layer gradually increased from 0 to 15 mm, see Fig. 1a. For both layers, 70% polyacrylamide gel (100 mg/mL water) was used as base material and 30% Intralipid (20% stock solution) was added for fat contrast. In the bottom layer, patent blue (5 µg/mL) and BaSO4 particles (20 mg/mL) were added for optical contrast in the visible part of the spectrum and US/X-ray contrast, respectively. A CT scan of the artificial phantom was made as ground truth for dimensions (Fig. 1b), using a clinical CT scanner (Siemens Sensation Open, Siemens Medical Solutions, Erlangen, Germany).

Animal tissue

For the second phase of this study, multiple pieces of two-layered bovine and porcine tissue were used. The tissue consisted of a thick bottom layer and a thin top layer with a thickness varying between 0 and 10 mm. Two types of animal tissue phantoms were examined; muscle tissue with a top layer of fat and fat tissue with a top layer of muscle, as demonstrated in Fig. 1c,d. The animal tissue was obtained from a local supermarket, so no ethical approval was required.

Data acquisition and preprocessing

DRS and US measurements were performed at fifteen locations on the artificial phantom. Data was acquired using four different probe orientations per measurement location, each time the probe was turned a quarter turn. A grid with cutouts of the desired probe locations and orientations was used to ensure that the center of the probes remained at the same location when changing orientations. All measurements were repeated three times, which resulted in 180 measurements in total. The ground truth for the top layer thickness was determined based on the obtained CT scan. The corresponding axial CT slide was extracted for each measurement location, and the top layer thickness was measured using annotation tools and then converted from pixels to millimeters based on CT image resolution.

In the second phase, measurements were performed in a grid pattern at multiple locations on the animal tissue. First, fiberoptic DRS measurements were performed at every location. Since it was not possible to obtain CT images for animal tissue samples, the ground truth for the top layer thickness was determined by US imaging. To this end, an US image was acquired in such a way that the center of the US image corresponded to the location of the DRS measurement. DRS and US measurements were performed at 250 locations in total; 122 fat on top of muscle measurements and 128 muscle on top of fat measurements. The top layer thickness was manually measured in the US image. All measurement locations associated with a top layer thickness smaller than 6 mm (n = 186) were included for further analyses since it is expected that the sampling depth of DRS will be roughly equal to the maximum fiber distance of 6 mm⁴⁹ and there is also no clinical interest to assess tissue deeper than 6 mm. The average top layer thicknesses were equal to 2.81 ± 1.90 mm (0–5.95 mm) and 3.25 ± 1.57 mm (0–5.96 mm) for top layers of fat and muscle, respectively.

After data acquisition, the DRS spectra from the visible and near-infrared wavelength ranges were stitched together to create one spectrum for each measurement location, ranging from 400 to 1600 nm (1200 features). The data from 1600 to 1700 nm was removed, due to a low signal-to-noise ratio. The spectra were calibrated using white and dark reference measurements, which were acquired at the beginning of every measurement session, see Eq. (1):

$${R}_{cal}= \frac{{R}_{meas} - {R}_{dark}}{{R}_{white}- {R}_{dark}},$$

(1)

where ${R}_{cal}$ is the calibrated measurement, ${R}_{meas}$ is the uncalibrated measurement, ${R}_{dark}$ is the dark reference measurement and ${R}_{white}$ is the white reference measurement. The white reference measurement was obtained using Spectralon (Avantes WS-2, Avantes, Apeldoorn, The Netherlands), the dark reference measurement was obtained by switching off the fiberoptic light source. The spectra were normalized with respect to the reflectance value at 800 nm to compensate for any intensity differences that might have been present. This wavelength was chosen since we do not expect any significant absorption to be present in this region.

Data analysis

In this subsection, we will introduce the methodology for extracting features from DRS spectra and US images. The extracted features will be used to train a regression analysis and tissue classification model for a two-layered structure. The feasibility of predicting layer thicknesses based on DRS spectra was evaluated first in a controlled setting using an artificial phantom. As a next step, two-layered animal tissue was used to facilitate tissue classification of the different layers as well.

Because of the less complex situation in the artificial phantom, no feature extraction methods have been performed in this phase and no US features were used. In the more complex phase with animal tissue, two different DRS feature extraction methods were applied and the obtained features from the small and large fiber distances were combined. The added value of US features was evaluated as well in the second phase of this study. The complete pipeline of data acquisition and data analysis is demonstrated in Fig. 2.

Spectral feature extraction

One DRS measurement consists of 1200 wavelength features as demonstrated in Fig. 3a. To effectively reduce the amount of data and to reduce the risk of overfitting, two different spectral feature extraction techniques were examined in this study; feature extraction based on spectral peaks and based on the wavelet transform. For the peak-based method, the largest dip to peak distance was calculated for three spectral regions with distinct peaks: 935 nm, 985 nm and 1200 nm as demonstrated in Fig. 3b. These wavelength regions were chosen for their dominant water and fat absorption in different ratios. The peak heights were calculated by subtracting the minimum intensity value from the maximum intensity value in the regions 920–960 nm, 960–1150 nm, and 1150–1325 nm, respectively. By concatenating the three peak heights for both fiber distances, this method resulted in six features.

The second feature extraction method was based on the wavelet transform, which can be used to analyze spectra at different spectral scales. For this study, the dyadic implementation of the transform, as described by Denstedt et al., was used⁵⁰. The input spectra were convolved with a Symlet low-pass filter of size 8 to obtain approximation spectra. Subsequently, the obtained spectra were down-sampled by a factor of 2 and the procedure was repeated multiple times, each time using the approximation spectrum of the previous iteration. The approximation spectra of the 6th iteration, consisting of 25 features, were used for the analyses (Fig. 3c). Extracting these features for both fiber distances resulted in 50 features in total.

Ultrasound feature extraction

To be able to determine the added value of US imaging for predicting the layer thickness, features were extracted from the US images based on the graph-cut theory. In this technique, an image is represented as a graph with nodes and edges representing the individual pixels and the connections between two neighboring pixels, respectively. The specific method used in this study was based on an automatic segmentation algorithm provided by Chiu et al.⁵¹. Although the algorithm was designed for the segmentation of retinal layers, it is suitable for layered structures in general. Some modifications were made in order to find the fat to muscle and muscle to fat boundaries. First, the image was smoothed using a Gaussian filter ($5x5; \sigma =1$) and then the vertical gradient image was obtained, see Fig. 4b. Subsequently, a weight was assigned to each of the graph edges to create path preferences, in such a way that the edges of pixel pairs with the highest vertical gradients have the lowest weights. Boundaries were selected based on minimal cost, i.e. the path with the highest vertical gradients, for which Dijkstra's algorithm was used⁵². The upper left and bottom right corner pixels are used as endpoints for the path selection. Therefore, two columns with minimal weights are padded left and right to the image to obtain an adequate graph cut, as explained by Chiu et al.⁵¹.

The path corresponding to the contact surface between the transducer and the tissue was selected using the first twenty pixel rows only (blue path in Fig. 4c). In the remaining part of the image, the tissue boundary was selected. Speckle noise and the presence of tissue fibers in the ultrasound images can sometimes result in the selection of other, incorrect, paths that do not represent a tissue transmission boundary. The best path, with the lowest cost, does therefore not always correspond to the correct path. To avoid missing the correct path, the three best paths were selected in each image. In this way, the correct path is always taken into account during the analysis. Figure 4c shows an example of the three selected paths. In this case, the second selected path is the correct boundary.

The three selected boundaries were translated into six features that were used to estimate the top layer thickness. For each of the three selected boundaries, the distance from the contact surface and the vertical gradient value were used as feature values. This resulted in the following six features: (1) distance from the first boundary to the contact surface, (2) distance from the second boundary to the contact surface, (3) distance from the third boundary to the contact surface, (4) vertical gradient value at the first boundary, (5) vertical gradient value at the second boundary, (6) vertical gradient value at the third boundary. These values were extracted in the center of the US image at the location of the paths, as demonstrated in green in Fig. 4c.

Top layer thickness prediction

The top layer thickness of the artificial phantom was predicted using a Gaussian process regression analysis. All 1200 DRS wavelengths were used as features. The regression was performed twice; for a fiber distance of 2 mm and a fiber distance of 6 mm. The average and standard deviation of the prediction error were used for the evaluation using an iterated 10-fold cross-validation technique, in which the results were averaged over 20 iterations. Per location, all spectra were assigned to one fold to ensure that spectra from one location were not split between the training and test set.

For the top layer thickness estimation of the animal tissue, the data set was divided into two groups. One group consisted of the muscle measurements with and without a top layer of fat and one group consisted of the fat measurements with and without a top layer of muscle. For each group separately, the thickness of the top layer was estimated using regression analysis and the combined features of both fiber distances. Again, an iterated cross-validation technique was used and the results were averaged over the 20 iterations. The performances of two regression models were compared; a linear support vector machine (SVM) model and a Gaussian process regression (GPR) model. The regression models were first trained using only DRS features and then using both DRS and US features to examine the added value of US imaging. A Wilcoxon signed-rank test was performed to examine the presence of any significant differences between the results of the two classification models or the different feature sets. The significance threshold was set at 0.05.

Tissue classification

The tissue classification was only performed on animal tissue. All 186 measurements on animal tissue were used for tissue classification of the first layer; 89 with a top layer of fat and 97 with a top layer of muscle. Tissue classification of the second layer was performed using only the two-layered measurements (73 with a second layer of fat and 81 with a second layer of muscle). The tissue types (muscle/fat) of both layers were classified independently of each other using quadratic SVM classification models, based on the DRS features only. An iterated 10-fold cross-validation was performed and the results were averaged over the 20 iterations. The classifier was trained once using the peak features and once using the extracted wavelet features. The classification performance was evaluated using the accuracy, Matthews correlation coefficient (MCC), the area under the curve (AUC), sensitivity, and specificity.