A generalized deep learning framework for whole-slide image segmentation and analysis

Khened, Mahendra; Kori, Avinash; Rajkumar, Haran; Krishnamurthi, Ganapathy; Srinivasan, Balaji

doi:10.1038/s41598-021-90444-8

Download PDF

Article
Open access
Published: 02 June 2021

A generalized deep learning framework for whole-slide image segmentation and analysis

Mahendra Khened¹^na1,
Avinash Kori¹^na1,
Haran Rajkumar¹^na1,
Ganapathy Krishnamurthi¹ &
…
Balaji Srinivasan²

Scientific Reports volume 11, Article number: 11579 (2021) Cite this article

30k Accesses
81 Citations
31 Altmetric
Metrics details

Subjects

Abstract

Histopathology tissue analysis is considered the gold standard in cancer diagnosis and prognosis. Whole-slide imaging (WSI), i.e., the scanning and digitization of entire histology slides, are now being adopted across the world in pathology labs. Trained histopathologists can provide an accurate diagnosis of biopsy specimens based on WSI data. Given the dimensionality of WSIs and the increase in the number of potential cancer cases, analyzing these images is a time-consuming process. Automated segmentation of tumorous tissue helps in elevating the precision, speed, and reproducibility of research. In the recent past, deep learning-based techniques have provided state-of-the-art results in a wide variety of image analysis tasks, including the analysis of digitized slides. However, deep learning-based solutions pose many technical challenges, including the large size of WSI data, heterogeneity in images, and complexity of features. In this study, we propose a generalized deep learning-based framework for histopathology tissue analysis to address these challenges. Our framework is, in essence, a sequence of individual techniques in the preprocessing-training-inference pipeline which, in conjunction, improve the efficiency and the generalizability of the analysis. The combination of techniques we have introduced includes an ensemble segmentation model, division of the WSI into smaller overlapping patches while addressing class imbalances, efficient techniques for inference, and an efficient, patch-based uncertainty estimation framework. Our ensemble consists of DenseNet-121, Inception-ResNet-V2, and DeeplabV3Plus, where all the networks were trained end to end for every task. We demonstrate the efficacy and improved generalizability of our framework by evaluating it on a variety of histopathology tasks including breast cancer metastases (CAMELYON), colon cancer (DigestPath), and liver cancer (PAIP). Our proposed framework has state-of-the-art performance across all these tasks and is ranked within the top 5 currently for the challenges based on these datasets. The entire framework along with the trained models and the related documentation are made freely available at GitHub and PyPi. Our framework is expected to aid histopathologists in accurate and efficient initial diagnosis. Moreover, the estimated uncertainty maps will help clinicians to make informed decisions and further treatment planning or analysis.

Prediction of tumor origin in cancers of unknown primary origin with cytology-based deep learning

Article Open access 16 April 2024

Microenvironmental reorganization in brain tumors following radiotherapy and recurrence revealed by hyperplexed immunofluorescence imaging

Article Open access 15 April 2024

Segment anything in medical images

Article Open access 22 January 2024

Introduction

Histopathology is considered the gold standard for cancer diagnosis^1,2 and identification of prognostic and therapeutic targets. Early diagnosis of cancer significantly increases the probability of survival³. Unfortunately, pathological analysis is an arduous process that is difficult, time-consuming, and requires in-depth knowledge. A study⁴ examining breast biopsies concordance among pathologists found that pathologists disagreed with each other on a diagnosis 24.7% of the time on average. This high rate of misdiagnosis stresses the need to develop computer-aided methods to aid pathologists in histopathology.

Digital pathology is the method of digitizing the histology slides to produce high-resolution images⁵. Studies have been conducted on collecting, analyzing, and interpreting digitized pathological slide images¹. The increasing prevalence of WSI technology that can scan the entire tissue slide at the subcellular level makes the in-silico pathology analysis more viable⁶. Digital pathology’s array of image analysis activities include identification and counting (e.g. mitotic events), segmentation (e.g. nuclei), and tissue differentiation (e.g. cancerous vs. non-cancerous)^5,7,8. Segmentation analysis helps to detect and separate tumor cells from the normal cells^9,10. Segmentation of WSI images is usually the precursor for performing various other downstream analyses such as classification and tumor burden estimation.

Automated WSI image analysis is plagued by a myriad of challenges¹¹, namely:

1.
Large dimensionality of WSI images A WSI image is obtained by digitizing a glass slide at a very high resolution (of the order of 0.25 micrometers/pixel, which corresponds to 40 $\times$ magnification on a microscope). A typical glass-slide of size 20 mm $\times$ 15 mm results in gigapixel image of size 80,000 $\times$ 60,000 pixels.
2.
Insufficient training samples The main impediments to the development and clinical implementation of deep learning algorithms consist of sufficiently large, curated, and representative training data which includes expert labeling, which is a costly and time-consuming process (e.g., pathologist annotated data). Most clinical research groups currently have restricted access to data. The data is often based on small sample sizes with limited geographic variety, resulting in algorithms with limited utility and poor generalization.
3.
Stain variability across laboratories As the data is acquired from multiple sources, there exists a lack of uniformity in staining protocol. Building a generalized framework that is invariant to stain pattern variability is challenging.
4.
Extraction of clinically relevant features and information Another major challenge is trying to extract features that are meaningful from a clinical point of view. Deep learning does an excellent task of automatic feature extraction, but understanding these extracted features and extracting meaningful information from them is challenging.

In this study, we propose a generalized deep learning-based framework for histopathology tissue analysis that addresses all the aforementioned problems. Our proposed framework is a sequence of individual techniques in the preprocessing-training-inference pipeline which, in conjunction, improve the efficiency and the generalizability of the histopathology image analysis. The combination of techniques we have introduced includes an ensemble segmentation model, division of the WSI into smaller overlapping patches while addressing class imbalances, efficient techniques for inference, and an efficient, patch-based uncertainty estimation framework. The organization of the paper is as follows. Prior work on histopathology image analysis using deep learning-based methods is discussed in “Related work” section. In “Datasets used for this study” section, the datasets used in this study are presented. Discussion on training and inference pipelines is provided in “Training pipeline” and “Inference pipeline” sections respectively. Experimental analysis is described in the supplementary note. The comprehensive results to demonstrate the performance of the proposed method on several open-source datasets are presented in “Challenge results” section. Discussion of the results, conclusion, and limitations of the proposed study with the possible course of research is provided in “Discussion and conclusions” section.

Related work

Deep learning methods for histopathological image analysis

The advent of WSI scanners has enabled the digitization of glass slides at very high resolution. Typical WSI images are in the order of gigapixels and are usually stored in multi-resolution pyramidal format. These slide images are suitable for developing computer-aided diagnosis systems for automating the pathologist workflow. The availability of a large amount of data makes them amenable for analysis with machine learning algorithms.

In tumor pathology, nuclear morphology and cellular anatomy are often significant determinants of disease severity. In order to make the diagnostic and grading task of tumors less subjective, quantifiable features are derived from the images that correlate with the condition of the disease¹. For example, algorithms can be designed to detect invasive tumors by first segmenting nuclei from the background, quantifying several nuclear characteristics, such as size, shape, and distribution, and comparing these characteristics with those of normal cells¹². Yu et al.¹³ predicted non-small cell lung cancer prognosis by applying classical machine learning algorithms that use engineered features derived from pathology images.

Feature-engineered algorithms rely on a predetermined set of features to classify the tissue. They can only classify the tissue as good as the features that differentiate between them. Thus, there is a limit to their efficiency even when there is a large amount of data available to refine the algorithm. Therefore, there has been a significant shift in recent years towards applying deep learning algorithms as they are known for their inherent ability to automatically derive features from input data. Typical deep learning-based approaches for WSI image segmentation or classification are usually made by cropping the slide image into multiple small image patches and treating them independently during training and inference. Furthermore, to make an overall slide-level prediction or generate a heatmap of regions of interest, patch-level predictions are aggregated appropriately. Cruz-Roa et al.¹⁴ were one of the first to use this method and showed promising results in detecting invasive ductal breast carcinoma. Several studies have applied deep learning algorithms for various pathology tasks related to breast cancer, prostate biopsies, colon cancer, etc.

Given the image size and resolution of WSI images, a patch-based approach is used for training deep convolutional neural networks^{15,16,17,18,19}. Hameed et al.²⁰ and Li et al.²¹ proposed an ensemble-based framework for classification and segmentation using histopathology images. Qin et al.²² proposed a feature pyramid-based approach for semantic segmentation, authors combined feature pyramid blocks in the decoder blocks along with ResNet50 as an encoder. Authors claim that including feature pyramid block provided an overall boost of 10–20% dice coefficient, with an overall dice coefficient of 63%. Guo et al.¹⁵ proposed a two-stage approach where the first stage utilizes inception-v3 for classifying the tumor region followed by a cascaded deep convolutional network for fine segmentation. Pedersen et al.²³ proposed a C++-based open-source package to read, visualize, zoom, pan, and analyze WSI images using CNN’s. In another interesting study, Shahidi²⁴ proposed the method to use super-resolution generative adversarial networks to generate histopathology images, they tested their approach on CAMELYON16 dataset²⁵. Priego-Torres et al.¹⁷ propose a segmentation pipeline for breast cancer images, using a patch-based approach, where the patches we extracted from all possible regions in an image and later merged with fully connected conditional random fields (CRF). Roy et al.¹⁹presented a multi-resolution-based deep learning approach along with customized reconstruction loss for viable tumor segmentation in liver WSI images. Hägele et al.²⁶ explored the effect of various biases in histopathology image analysis. The authors provided an explainable method such as layer-wise relevance propagation to analyze latent biases and observed and improved area under receiver operating curve by 5% after reducing a labeling bias.

Colorectal carcinoma is the third most common cancer in the world²⁷. The majority of colorectal carcinoma are adenocarcinomas originating from epithelial cells²⁸. Shapcott²⁹ discuss the application of deep learning methods for cell identification on TCGA data. Kather et al.³⁰ discuss the deep learning methods to predict the clinical course of colorectal cancer patients based on histopathological images. Bychkov et al.³¹ discuss the use of Long short-term memory (LSTM)³² artificial recurrent neural network (RNN) architecture for estimating the patient risk score using spatial, sequential memory.

A review on WSI application for histopathological analysis of liver diseases and for understanding liver biology is given by Melo et al.³³. They explore how WSI can enhance the evaluation and quantification of several histologic hepatic parameters and help to identify various liver diseases with clinical implications. Kiani et al.³⁴ developed a deep learning-based system to aid pathologists in differentiating between two subtypes of primary liver cancer, hepatocellular carcinoma, and cholangiocarcinoma on H&E stained WSI images. Lu and Daigle³⁵ demonstrated the usefulness of extracting image features from hepatocellular carcinoma histopathological images using pre-trained CNN models to reliably differentiate between normal and cancer samples.

Contributions

A deep learning-based framework for the segmentation and analysis of WSI images has been proposed. The framework comprises a segmentation network at its core along with novel algorithms that utilize the segmentation to do pathological analyses such as metastasis classification and viable tumor burden estimation. As discussed in “Introduction” section, challenges in WSI image analysis are mainly due to their large size, variability in staining, and the limited amount of annotated data. Although there exist some studies as described in “Related work” section, none of them seem to provide a framework that generalizes well against multiple cancer sites. In this work, we proposed methods to address most of the challenges associated with WSI analysis with the generic framework, which produces benchmarking results along with uncertainty maps on three large open-source databases. The main contributions in this work are described below :

Ensemble segmentation model The ensemble comprises multiple fully convolutional architectures (FCN) architectures, each independently trained on different subsets of the training data. During inference, the ensemble generated the tumor probability map by averaging the posterior probability maps of all the FCNs. The ensemble approach showed superior segmentation performance when compared to its individual constituting FCNs.
Training pipeline The proposed approach divided the WSIs into smaller-sized image patches for the purpose of training FCN models. For the preparation of the training set, efficient methodologies for sampling patches from the WSI images were introduced. The problem of class imbalance due to the limited number of representative patches from tumor regions in the WSI images was addressed by employing overlapping and oversampling techniques during patch extraction (random patch coordinate perturbation technique) alongside various data augmentation schemes.
Inference pipeline For efficiently generating model inference on the entire WSI image, a concept of generating patch coordinate sampling grid from the post-processed tissue mask was introduced. The sampling grid aided in the reduction of computational time by discarding non-tissue patches during the construction of the tumor probability heatmap. The patch-based segmentation of WSI images introduced edge artifacts due to loss of neighboring context information at patch borders, and this issue was addressed by proposing techniques to average prediction probabilities at overlapping regions and making use of large patch size during inference. Apart from this, we also compute inference on multiple models parallelly for ensemble calculation over patches rather than an entire image.
Lymph node metastases classification from WSI images A Random Forest-based ensemble classification algorithm was trained with hand-crafted features derived from the predicted tumor-probability maps. The class imbalance in the training dataset was addressed by employing strategies such as over-sampling (by synthetically generating under-presented class data points) and under-sampling (balance all classes by removing some noisy data points).
Uncertainty estimation An efficient patch-based uncertainty estimation framework was developed to estimate both data specific and model (parameter) specific uncertainties.
Open-source Packaging: The proposed framework was packaged into an open-source GUI application for the benefit of researchers (DigiPathAI on GitHub).

The performance of the segmentation pipeline was benchmarked by validating it on WSI slide images of three different cancer sites, namely- breast lymph nodes, liver, and colon by participating in CAMELYON³⁶, DigestPath³⁷, and PAIP³⁸ challenges respectively.

Materials and methods

This section goes into the details of the proposed framework. Firstly, the ensemble and the network architectures are detailed. Secondly, the strategies used to train the models are explained. Finally, the segmentation inference method is discussed, followed by the methods used to perform secondary histopathology analyses (Viable tumor burden estimation, pn-staging). Figure 1 provides an overview of the proposed deep learning-based segmentation and downstream analyses framework for WSI slide images corresponding to multiple different cancer sites.

Datasets used for this study

The proposed framework was validated on multiple open-source datasets which included CAMELYON³⁶ with 1399 WSI images with an average image size of $100000 \times 100000$ and image resolution of 0.25microns/pixel, PAIP³⁸ with 90 WSI images with an average image size of $50000 \times 50000$ and image resolution of 0.5microns/pixel, and finally DigestPath³⁷ dataset consists of 872 tissue images with image size of $5000 \times 5000$. Table 1 provides an overview of the datasets used in this study.

Table 1 Summary of histopathological datasets used in this work. The test images were hidden by the competition organizers and used only for leaderboard evaluation.

Full size table

CAMELYON16

The CAMELYON16²⁵ dataset comprised of 399 WSI slide images taken from two medical centers in the Netherlands, out of which 159 WSI images were metastases, and the remaining 240 were negative. Pathologists exhaustively annotated all the WSI images with metastases at the pixel level. In the CAMELYON16 challenge, the 399 WSI images were split into training and testing sets, comprising of 160 negative and 110 metastases WSI images for training, 80 negative and 49 metastases WSI images for testing.

Table 2 Tumour size criteria for assigning metastasis type.

Full size table

CAMELYON17

The CAMELYON17³⁹ dataset consisted of 1000 WSI images taken from five medical centers in the Netherlands. In the CAMELYON17 challenge, 500 WSI images were allocated for training, and the remaining 500 WSI images were allocated for testing. The training dataset of CAMELYON17 included 318 negative WSI images and 182 WSI images with metastases. In the CAMELYON17 dataset, slide-level labels of metastases type were provided for all the WSI images, and exhaustive pixel-level annotations were provided for 50 WSI images. The slide-level labels were negative, Isolated tumor cells (ITC), micro-metastases, and macro-metastases. Table 2 provides the size criteria for metastases type. The pN-stage labels were provided for all the 100 patients in the training set and were based on the simplified rules provided in Table 3. Table 4 provides the metastases type distribution in CAMELYON17 training dataset.

Table 3 Pathologic lymph node classification (pN-stage) in CAMELYON17 Challenge.

Full size table

Table 4 Metastases type distribution in CAMELYON17 training set.

Full size table

PAIP

The PAIP 2019³⁸ dataset contains a total of 100 WSI images scanned from liver tissue samples. Each image has an average dimension of 50,000x50,000 pixels. All the images were H&E stained, scanned at 20x magnification, and prepared from a single center (Seoul National University Hospital). The dataset included pixel-level annotation of the viable tumor and whole tumor regions. It also provided the viable tumor burden metric for each image.

Tumor burden is defined as the ratio of the viable tumor region to the whole tumor region. The viable tumor region is defined as the cancerous region. The whole tumor area is defined as the outermost boundary enclosing all the dispersed viable tumor cell nests, tumor necrosis, and tumor capsule . Each tissue sample contains only one whole tumor region. This metric has applications in assessing the response rates of patients to cancer treatment.

Out of the 100 images, 50 images were the publicly available training set, ten images were reserved for validation set that was made publicly available after the challenge was completed, and the rest 40 images were the test set whose ground truth were not publicly available. The test images weren’t used directly in the study. However, the score generated by the PAIP 2019 server by testing our algorithm against the test images was used in the study.

DigestPath

The DigestPath dataset consists of tissue sections collected during the examination of colonoscopy pathology to identify early-stage colon tumor cells. There are ten or more tissue sections in a single WSI image for colonoscopy pathology review. The challenge organizers selected one or two tissue sections in a WSI image and provided images of these tissue sections along with their corresponding lesion annotations by pathologists in jpg format. On average, each tissue image was of size 5000 $\times$ 5000 pixels. The training dataset of DigestPath consists of 660 tissue images taken from 324 patients, in which 250 tissue images from 93 patients had lesions, and the remaining 410 tissue images from 231 patients had no lesions. The data was collected from multiple medical centers, especially from several small centers in developing countries. All the tissue sections were H&E stained and scanned at 20 $\times$ magnification. The testing dataset consisted of 212 tissue images from 152 patients. The challenge organizers released only the training set, and the testing set was kept confidential.

Network architecture

For the task of segmentation of tumor regions from patches of the WSI images, an ensemble of FCN⁴⁰ architectures were used. A typical FCN based segmentation network comprises an encoder network, a decoder network, and a pixel-wise classification layer. An encoder network comprises a series of operations (like convolution and pooling) that transforms the input (image) to a set of low-resolution feature maps. The decoder network comprises of up-sampling or transposed convolution followed by series of convolution operations that transform the low-resolution encoder feature maps to the original input resolution feature maps for pixel-wise classification.

The ensemble consisted of three encoder-decoder-based FCN architectures. Experiments (Tables Experimental 8, 9 in the supplementary note) showed that using an ensemble of three different networks provided superior segmentation performance compared to using the networks individually. During inference, the predicted tumor posterior probability map from all three models was averaged to generate the ensemble model’s final prediction. We carefully selected these three different architectures based on the number of parameters, multi-scale feature extraction, and their performance on PASCAL VOC⁴¹ open leaderboard. The ensemble comprised of the following FCN architectures:

U-Net⁴² with DenseNet-121⁴³ as the backbone (encoder) pre-trained on ImageNet⁴⁴. The decoder comprised of the bi-linear up-sampling module followed by convolutional layers. Features learned in the down-sampling path of the encoder were concatenated with the features learned in the up-sampling path using skip connection.
U-Net⁴² with Inception-ResNet-V2⁴⁵ as the backbone (encoder) pre-trained on ImageNet⁴⁴. The Inception-ResNet-V2⁴⁵ (also known as Inception-v4) integrates the features of the Inception architecture⁴⁶ and the ResNet architecture⁴⁷. Multi-scale convnet blocks in inception network helps in reducing number of parameters along with encoding large amount of information.
DeeplabV3Plus⁴⁸ with Xception⁴⁹ network as the backbone and pre-trained on PASCAL VOC⁴¹. DeepLabV3⁵⁰ was built to obtain multi-scale context. This was done by using atrous convolutions with different rates. DeeplabV3Plus extends this by having low-level features transported from the encoder to the decoder.

Training pipeline

The training can broadly split into tissue mask generation, patch extraction and training the models patchwise. Figure 2 illustrates the training strategy utilized for training each of the models in the ensemble.

Tissue mask generation

In this step, the entire tissue region was segmented from the background glass region of the WSI image. This step aided in preventing unnecessary computations on non-tissue regions of the slide. An approximate tissue region boundary suffices; therefore the processing was done on a low-resolution version of the WSI image to further reduce computational costs. The RGB color space of the low-resolution image was transformed to the HSV (Hue-Saturation-Value) color space and Otsu’s adaptive thresholding⁵¹ was applied to the saturation component. Post thresholding, binary morphological operations were performed to facilitate proper extraction of patches at the small tissue regions and tissue borders.

Tissue mask generation specific to CAMELYON dataset

In some of the CAMELYON17 cases, the Otsu’s thresholding failed because of the black regions in the WSI image. Hence, before the application of image thresholding operation, the pre-processing involved replacing black pixel regions in the WSI image background with white pixels and median blurring with a kernel of size 7x7 on the entire image. Median blur aided in the smoothing of the tissue regions and removal of noise at the tissue bordering the glass-slide region while preserving the edges of the tissue. Figure 3 illustrates the pipeline for tissue mask generation with an example.

Patch coordinate extraction

Using the tissue mask generated from the previous step, patches of the image were randomly extracted to make the training dataset. An equal number of tumourous and non-tumorous patches were extracted. This was done to prevent class imbalance or manifold shift issues and enforce proper training. A patch was considered tumourous if at least one pixel inside the patch was classified as a tumor. The dimensions of the extracted patches were not fixed; rather, they were a hyperparameter we experimented with. The patches were extracted from the highest resolution of the image.

Data augmentation

To increase the number of data points and to better generalize the models across various staining and acquisition protocols, data augmentation schemes were proposed. Augmentations like “horizontal or vertical flip,” “90-degree rotations”, and “Gaussian blurring” along with color augmentation were performed. Colour augmentation included random changes to brightness, contrast, hue, and saturation with a maximum delta of 64.0/255, 0.75, 0.25, 0.04, respectively.

Additionally, in order to introduce some diversity between patches extracted from the images at different epochs, random coordinate perturbation was introduced. This technique involved randomly offsetting the center of the patch within a specified radius (128 pixels) prior to the extraction from the WSI image. Post augmentation, the images were normalized.

Loss function

Tumour regions were represented by a minuscule proportion of pixels in WSI images, thereby leading to class imbalance. This issue was circumvented by training the network to minimize a hybrid loss function. The hybrid loss function is comprised of cross-entropy loss and a loss function based on the Dice overlap coefficient. The Dice coefficient is an overlap metric used for assessing the quality of segmentation maps. The effect of hybrid loss was extensively studied in⁵² showing an overall improvement in segmentation performance by combining cross-entropy and dice loss. The dice loss is a differentiable function that approximates Dice-coefficient and is defined using the predicted posterior probability map and ground truth binary image as defined in (1). The cross-entropy loss is defined in (2). In the equations, $p_i$ and $g_i$ represent pairs of corresponding pixel values of predicted posterior probability and ground truth. N represents the total number of pixels. DL refers to dice loss and CL refers to cross-entropy loss. $DL_{FG}$ and $DL_{BG}$ represent the foreground pixels that correspond to the tumor regions and the background pixels that corresponded to non-tumor regions, respectively.

$$\begin{aligned} DL= & {} 1 - \frac{2\sum _i ^N p_i g_i}{ \sum _i^N p_i^2+ \sum _i^N g_i^2} \end{aligned}$$

(1)

$$\begin{aligned} CL= & {} \sum _i ^N \left( g_ilog(p_i) + (1 - g_i)log(1-p_i)\right) \end{aligned}$$

(2)

$$\begin{aligned} Loss= & {} \alpha *CL + \beta *DL_{BG} + \gamma *DL_{FG} \end{aligned}$$

(3)

The proposed loss was defined as a linear combination of the two-loss components as defined in (3). The neural networks were trained by minimizing the proposed loss function using ADAM optimizer⁵³. The $\alpha , \beta , \gamma$ were assigned such that the cross-entropy loss and the dice loss are given equal weightage ($\alpha = 0.5, \beta = 0.25$ and $\gamma = 0.25$).

Training

All three models were trained independently, with different cross-validation folds of the data. The FCN architectures in the ensemble whose encoders were based on DenseNet-121 and Inception-ResNet-V2 made use of transfer learning by using ImageNet⁴⁴ pre-trained weights for their respective encoders. In the case of DeeplabV3Plus, the model weights were pre-trained on PascalVOC⁴¹. For the network architectures with encoders based on DenseNet-121 and Inception-ResNet-V2, the encoder weights of the models were frozen for the first two epochs, and only the decoder weights were made trainable. For the remaining epochs, both the encoder and decoder parts were trained. The learning rate was decayed every few epochs in a deterministic manner to allow for the model to gradually converge. The training was stopped when the validation loss between epochs started increasing.

Inference pipeline

The pre-processing step in the inference pipeline included segmentation of tissue region from the WSI image (refer “Training pipeline” section). In order to facilitate extraction of patches from the WSI image within the tissue mask region, a uniform patch-coordinate sampling grid was generated at a lower resolution, as shown in Fig. 4. Each point in the patch sampling grid was re-scaled by a factor to map to the coordinate space corresponding to the WSI image at its highest resolution. With these scaled coordinate points as the center, fixed-size high-resolution image patches were extracted from the WSI image for feeding the trained segmentation model as an input.

The sampling stride was defined as the spacing between consecutive points in the patch sampling grid. The patch size and the sampling stride controlled the overlap between consecutive extracted patches from the WSI image. The main drawback of the patch-based segmentation method for WSI images was that the smaller patch sizes could not capture the wider context of the neighborhood region. Moreover, the stitching of the segmented patches introduced boundary artifacts (blockish appearance) in the tumor probability heatmaps. The generated heatmaps were smooth when the inference was done on overlapping patches with larger patch-size and averaging the prediction probabilities at the overlapping regions. The experimental observation suggested that a 50% overlap between consecutive neighboring patches is the optimal balance between accuracy and computational efficiency. Also, during inference, increasing the patch size by a factor of 4 (1024x1024) when compared to the patch size used during training (256x256) improved the quality of generated heatmaps.

pN-staging pipeline for CAMELYON17 dataset

Table 5 List of features extracted for the purpose of predicting lymph node metastases type. Features were extracted after thresholding tumour probability heatmaps. For feature numbers 5, 6, 7, 8 and 9 the following statistics were computed- maximum, mean, variance, skewness, and kurtosis.

Full size table

Figure 5 illustrates the complete pipeline developed for pN-staging of CAMELYON17 dataset. The pipeline comprises four blocks as described below:

Pre-processing: The tissue regions in the WSI images were detected for patch extraction.
Heatmap generation: The extracted patches from the WSI images were passed through the inference pipeline to generate the down-scaled version of the tumor probability heatmaps.
Feature extraction: The heatmaps were binarized by thresholding at 0.5 and 0.9 probabilities, and at each of these thresholds, the connected components were extracted, and region properties were measured using scikit-image⁵⁴ library. Thirty-two geometric and morphological features from the probable metastases regions were computed (Table 5).
Data balancing: In order to handle the class imbalance problem, one of the techniques proposed in the literature is oversampling by synthetically generating minority class samples using SMOTE algorithm⁵⁵. However, this method can introduce noisy samples when the interpolated new samples lie between marginal outliers and inliers. This problem is usually addressed by removing noisy samples by using under-sampling techniques like Tomek’s link⁵⁶ or nearest-neighbors. SMOTETomek⁵⁷ algorithm was employed for balancing the training data. SMOTETomek algorithm is a combination of SMOTE and Tomek’s link performed consecutively.
Classification: The pN-stage was assigned to the patient based on all the available lymph-node WSI images, taking into account their individual metastases type (Table 2). For predicting the metastases type, an ensemble of Random Forest classifiers⁵⁸ was trained using the extracted features.

Tumour burden estimation for PAIP dataset

The tumor burden computation requires the segmentation of the viable tumor and whole tumor regions in the WSI image of the liver cancer tissue. The viable tumor region was segmented using the proposed deep learning-based segmentation network. However, it was observed that training the same segmentation network for the whole tumor region gave sub-optimal results. Hence, a heuristic method was adopted to approximate the whole tumor region from the viable tumor region.

The tumor burden estimation algorithm consisted of the following steps:

Segment the viable tumor region via the proposed algorithm in “Inference pipeline” section
Apply morphological operations on the prediction to remove false positives and fill the small holes
Find the smallest convex hull containing the entire viable tumor region
Estimate the tissue mask, as discussed in “Training pipeline” section
The whole tumor region is approximated to be the intersection of the convex hull and tissue mask region
The tumor burden is calculated by taking the ratio between the area of the viable and whole tumor regions

Uncertainty analysis

Uncertainty estimation is essential in assessing unclear diagnostic cases predicted by deep learning models. It helps pathologists to concentrate more on the uncertain regions for their analysis. Begoli et al.⁵⁹ argues the need for uncertainty analysis in machine-assisted medical decision-making system. There exist two main sources of uncertainty, namely (i) Aleatoric uncertainty and (ii) Epistemic uncertainty. Aleatoric uncertainty is uncertainty due to the data generation process itself. In contrast, the uncertainty induced due to the model parameters, which is the result of not estimating ideal model architectures or weights to fit the given data, is known as epistemic uncertainty⁶⁰. Epistemic uncertainty can be approximated by using test time Bayesian dropouts⁶¹, which estimates uncertainty by Montecarlo simulations with Bayesian dropout.

In the proposed pipeline, aleatoric uncertainty for each model was estimated using test time augmentations, as introduced in⁶² (4).

$$\begin{aligned} var_{al}(x, \Phi _i) \approx {\mathbf {E}}_{t \sim TTA}[(\Phi _i(x|w,t) - {\mathbf {E}}_{t \sim TTA}[\Phi _i(x|w,t)])^2] \end{aligned}$$

(4)

where $\Phi _i(x|w)$ is the output of the neural network with weights w for input x and TTA denotes the set of possible test time data augmentations allowed. The proposed methodology for aleatoric uncertainty included the following augmentations- $TTA \in \{rotation, vertical flip, horizontal flip\}$.

For epistemic uncertainty, the diversity of model architectures were used to calculate uncertainty (5).

$$\begin{aligned} var_{ep}(p(y|x,w)) \approx {\mathbf {E}}_{\phi \sim \{\Phi _i\}}[(\phi (x|w) - {\mathbf {E}}_{\phi \sim \{\Phi _i\}}[\phi (x | w)])^2] \end{aligned}$$

(5)

where the likelihood distribution p(y|x, w) is a probabilistic model which generates outputs (y) for given inputs (x) for some parameter setting (w) and $\Phi _i$ indicates the trained model.

Challenge results

Performance evaluation on CAMELYON17 challenge

On the CAMELYON17 testing dataset (n=500) the ensemble strategy was employed by combining the predictions from all the four trained Random Forest classifiers. The ensembling was based on the majority voting principle, and in case of a tie, the higher metastases category was selected. The ensemble model is referred to as RF-Ensemble. Table 6 compares the results of the proposed ensemble approach with other published approaches on the CAMELYON17 testing dataset (n=500). The proposed ensemble strategy gave Cohen’s kappa score of 0.9090.

Table 6 Comparison of the proposed with other published approaches for automated pN-Staging in CAMELYON17 challenge. The score reported in the table is from the open public leader board of the CAMELYON17 challenge. The proposed approach (RF-Ensemble) stood rank-3 on the leaderboard (Accessed on 31-Dec-2019). The table additionally provides the performance of individual Random Forest classifiers in the ensemble and RF-Ensemble classifier.

Full size table

Performance evaluation on DigestPath 2019 challenge

Table 7 compares the results of the proposed with other approaches on DigestPath-2019 testing dataset (n=212). The proposed approach obtained a Dice score of 0.78 on the test set. Though the proposed method is ranked fourth, it can be observed that the results of all the top three methods lie inside the estimated confidence bounds of $\pm 0.014$ as described in supplementary section 1.6.

Table 7 Top four entries in DigestPath-2019 challenge.

Full size table

Performance evaluation on PAIP 2019 challenge

Table 8 compares the results of the proposed with other approaches on PAIP-2019 testing dataset (n=40). The challenge comprised of two tasks, described as follows-

Task 1: Liver cancer segmentation performance was evaluated using the average Jaccard index.
Task 2: Viable tumor burden estimation was evaluated as the average of products of absolute accuracy and corresponding Task 1 score (Jaccard index) for each of the cases in the test set.

For Task 1, all the participants utilized deep learning-based methods for the segmentation of viable tumors, albeit with different CNN architectures. For Task 2, all the participants used deep learning-based methods for the segmentation of the whole tumor. The proposed convex hull-based approximation method showed comparable performance with deep learning-based methods.

Table 8 Top five entries of PAIP 2019. Task 1 corresponds to Viable tumour segmentation and Task 2 corresponds to Viable tumour burden estimation. Note: FNLCR: Frederick National Laboratory for Cancer Research.

Full size table

Discussion and conclusions

An automated end-to-end deep learning-based framework for segmentation and downstream analysis of WSI images was developed. The proposed framework showed state-of-the-art results on three publicly available histopathology image analysis challenges, namely, CAMELYON, PAIP 2019, and DigestPath 2019. The problem of segmentation of gigapixel WSI images was approached using the divide-and-conquer strategy by dividing the large image into computationally feasible patch sizes, running segmentation algorithms on the extracted patches, and stitching the individual outputs together to generate the segmentation of the entire WSI image. The patches were segmented using an ensemble of FCNs, which are encoder-decoder-based architectures employed for generating dense pixel-level classification. The encoders in the proposed FCNs were some of the state-of-the-art CNNs used for natural image analysis tasks, and the decoders were a learnable upsampling module to generate dense predictions. The proposed segmentation framework was an ensemble comprising of multiple FCN architectures, each independently trained on different subsets of the training data. The ensemble generated the tumor probability map by averaging the posterior probability maps of all the FCNs. The ensemble approach showed superior segmentation performance when compared to its individual constituting FCNs. The patch-based segmentation methods for large-sized images suffer from loss of neighboring context information at patch borders. This issue was addressed during inference by proposing- (i) to use patch size larger than that used during training and (ii) to overlap patches and average the posterior probabilities of the overlapping regions while stitching the output together. In addition to the generation of tumor probability heatmaps, a methodology for generating uncertainty maps based on model and data variability was also incorporated into the framework. These uncertainty maps would assist in better interpretation by pathologists and fine-tuning the model with uncertain regions.

Further research can be done in the design of efficient and multi-resolution FCN architectures for capturing multi-resolution information from WSI images⁶⁵. The proposed experimental analysis on transfer learning showed that pre-training models with different histopathology datasets could act as good starting points for training models where pathology datasets are limited. Post-processing techniques could be one of the directions to improve the predicted WSI image’s tumor segmentation; techniques such as patch-based conditional random fields^66,67 could be employed to refine the predicted segmentation masks rather than employing hardcoded threshold values. In the current study, the presence of artifacts in WSI images makes it difficult for tissue region sampling, which further results in sub-optimal segmentation results. The addition of a pre-processing stage for filtering these artefacts, or the addition to the training set of a significant number of images that include these elements, could lead to an improvement in the robustness of the framework. Moreover, the majority of the images used in this study were stained using H&E stain. Increasing the heterogeneity of the training samples with other possible stains could increase the generalizability of the framework.

The segmentation of WSI images is usually the first step which precedes other specific analyses such as metastases classification and estimation of tumor burden. In this regard, an automated pipeline for lymph node metastases classification and pN-staging was developed. For the task of lymph node metastases classification, an ensemble of multiple Random Forest classifiers was proposed, and each classifier was trained on different subsets of the training data. The training data was prepared by extracting features based on the pathologist’s viewpoint from the tumor probability maps. Additionally, incorporating synthetically generated training samples into the training data demonstrated its efficacy in addressing class imbalanced datasets for such classification tasks.

The proposed method for viable tumor burden estimation from WSI images of liver cancer utilized an empirical method for estimating the whole tumor region from the predicted viable tumor region. The whole tumor region was proposed to approximate a convex hull around the viable tumor region. This approximation performed on par with other deep learning-based segmentation approaches and was also computationally inexpensive. The proposed method could be refined further by incorporating learning-based methods into the empirical method. For example, the convex hull output could be used as an initial point for active contours-based models⁶⁸ for correcting whole tumor region segmentation.

References

Gurcan, M. N. et al. Histopathological image analysis: a review. IEEE Reviews in Biomedical Engineering 2, 147 (2009).
Article Google Scholar
Shahriar Salamat, M. Robbins and Cotran: Pathologic basis of disease. J. Neuropathol. Exp. Neurol. 69(2), 214. https://doi.org/10.1097/NEN.0b013e3181cd8dbc (2010).
Hawkes, N. Cancer survival data emphasise importance of early diagnosis. BMJ 364, l408. https://doi.org/10.1136/bmj.l408 (2019).
Elmore, J. G. et al. Diagnostic concordance among pathologists interpreting breast biopsy specimens. JAMA 313, 1122–1132 (2015).
Article CAS Google Scholar
Janowczyk, A. & Madabhushi, A. Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases. J. Pathol. Inform. 7, 29. https://doi.org/10.4103/2153-3539.186902 (2016).
Madabhushi, A. & Lee, G. Image analysis and machine learning in digital pathology: challenges and opportunities. Med. Image Anal. 33, 170–175. https://doi.org/10.1016/j.media.2016.06.037 (2016).
Article PubMed PubMed Central Google Scholar
Nanthagopal, A. P. & Rajamony, R. S. Classification of benign and malignant brain tumor ct images using wavelet texture parameters and neural network classifier. J. Vis. 16, 19–28 (2013).
Article Google Scholar
Guray, M. & Sahin, A. A. Benign breast diseases: classification, diagnosis, and management. Oncol. 11, 435–449 (2006).
Article Google Scholar
Wählby, C., Sintorn, I.-M., Erlandsson, F., Borgefors, G. & Bengtsson, E. Combining intensity, edge and shape information for 2d and 3d segmentation of cell nuclei in tissue sections. J. Microsc. 215, 67–76 (2004).
Article MathSciNet Google Scholar
Xu, J., Luo, X., Wang, G., Gilmore, H. & Madabhushi, A. A deep convolutional neural network for segmenting and classifying epithelial and stromal regions in histopathological images. Neurocomputing 191, 214–223 (2016).
Article Google Scholar
Tizhoosh, H. R. & Pantanowitz, L. Artificial intelligence and digital pathology: Challenges and opportunities. J. Pathol. Inform. 9, 38. https://doi.org/10.4103/jpi.jpi_53_18 (2018).
Diamond, J., Anderson, N. H., Bartels, P. H., Montironi, R. & Hamilton, P. W. The use of morphological characteristics and texture analysis in the identification of tissue composition in prostatic neoplasia. Hum. Pathol. 35, 1121–1131 (2004).
Article Google Scholar
Yu, K.-H. et al. Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features. Nat. Commun. 7, 12474 (2016).
Article ADS CAS Google Scholar
Cruz-Roa, A. et al. Automatic detection of invasive ductal carcinoma in whole slide images with convolutional neural networks. In Medical Imaging 2014: Digital Pathology, Vol. 9041, 904103 (International Society for Optics and Photonics, 2014).
Guo, Z. et al. A fast and refined cancer regions segmentation framework in whole-slide breast pathological images. Sci. Rep. 9, 1–10 (2019).
ADS Google Scholar
Chan, L., Hosseini, M. S., Rowsell, C., Plataniotis, K. N. & Damaskinos, S. Histosegnet: semantic segmentation of histological tissue type in whole slide images. In Proceedings of the IEEE/CVF International Conference on Computer Vision 10662–10671 (2019).
Priego-Torres, B. M., Sanchez-Morillo, D., Fernandez-Granero, M. A. & Garcia-Rojo, M. Automatic segmentation of whole-slide h&e stained breast histopathology images using a deep convolutional neural network architecture. Expert Syst. Appl. 151, 113387 (2020).
Article Google Scholar
Oskal, K. R., Risdal, M., Janssen, E. A., Undersrud, E. S. & Gulsrud, T. O. A u-net based approach to epidermal tissue segmentation in whole slide histopathological images. SN Appl. Sci. 1, 1–12 (2019).
Article CAS Google Scholar
Roy, M. et al. Convolutional autoencoder based model histocae for segmentation of viable tumor regions in liver whole-slide images. Sci. Rep. 11, 1–10 (2021).
Article ADS Google Scholar
Hameed, Z., Zahia, S., Garcia-Zapirain, B., Javier Aguirre, J. & María Vanegas, A. Breast cancer histopathology image classification using an ensemble of deep learning models. Sensors 20, 4373 (2020).
Article ADS CAS Google Scholar
Li, Y. et al. U-net ensemble model for segmentation in histopathology images. Openreview (2019).
Qin, P., Chen, J., Zeng, J., Chai, R. & Wang, L. Large-scale tissue histopathology image segmentation based on feature pyramid. EURASIP J. Image Video Process. 2018, 1–9 (2018).
Article Google Scholar
Pedersen, A. et al. Fastpathology: an open-source platform for deep learning-based research and decision support in digital pathology. arXiv preprintarXiv:2011.06033 (2020).
Shahidi, F. Breast cancer histopathology image super-resolution using wide-attention gan with improved wasserstein gradient penalty and perceptual loss. IEEE Access 9, 32795–32809 (2021).
Article Google Scholar
Bejnordi, B. E. et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA 318, 2199–2210 (2017).
Article Google Scholar
Hägele, M. et al. Resolving challenges in deep learning-based analyses of histopathological images using explanation methods. Scientific reports 10, 1–12 (2020).
Article Google Scholar
Fleming, M., Ravula, S., Tatishchev, S. F. & Wang, H. L. Colorectal carcinoma: pathologic aspects. J. Gastrointest. Oncol. 3, 153 (2012).
PubMed PubMed Central Google Scholar
Hamilton, S. Carcinoma of the colon and rectum. World health organization Classification of Tumors. Pathology and Genetics of Tumors of the Digestive System 105–119 (2000).
Shapcott, C. M., Rajpoot, N. & Hewitt, K. Deep learning with sampling for colon cancer histology images. Front. Bioeng. Biotechnol. 7, 52 (2019).
Article Google Scholar
Kather, J. N. et al. Predicting survival from colorectal cancer histology slides using deep learning: a retrospective multicenter study. PLoS Med. 16, e1002730 (2019).
Article Google Scholar
Bychkov, D. et al. Deep learning based tissue analysis predicts outcome in colorectal cancer. Sci. Rep. 8, 3395 (2018).
Article ADS Google Scholar
Greff, K., Srivastava, R. K., Koutník, J., Steunebrink, B. R. & Schmidhuber, J. Lstm: a search space odyssey. IEEE Trans. Neural Netw. Learn. Syst. 28, 2222–2232 (2016).
Article MathSciNet Google Scholar
Melo, R. C. et al. Whole slide imaging and its applications to histopathological studies of liver disorders. Front. Med. 6, 310 (2019).
Article Google Scholar
Kiani, A. et al. Impact of a deep learning assistant on the histopathologic classification of liver cancer. npj Digital Medicine 3, 1–8 (2020).
Lu, L. & Daigle, B. J. Jr. Prognostic analysis of histopathological images using pre-trained convolutional neural networks: application to hepatocellular carcinoma. PeerJ 8, e8668 (2020).
Article Google Scholar
Litjens, G. et al. 1399 h&e-stained sentinel lymph node sections of breast cancer patients: the camelyon dataset. GigaScience 7, giy065 (2018).
Li, J. et al. Signet ring cell detection with a semi-supervised learning framework. In International Conference on Information Processing in Medical Imaging, 842–854 (Springer, 2019).
Kim, Y. J. et al. Paip 2019 - liver cancer segmentation (2019). Dataset available at https://paip2019.grand-challenge.org.
Bandi, P. et al. From detection of individual metastases to classification of lymph node status at the patient level: the camelyon17 challenge. IEEE Trans. Med. imaging 38, 550–560 (2018).
Article Google Scholar
Long, J., Shelhamer, E. & Darrell, T. Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 3431–3440, (2015).
Everingham, M., Van Gool, L., Williams, C. K., Winn, J. & Zisserman, A. The pascal visual object classes (voc) challenge. Int. J. Comput. Vis. 88, 303–338 (2010).
Article Google Scholar
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention, 234–241 (Springer, 2015).
Huang, G., Liu, Z., Van Der Maaten, L. & Weinberger, K. Q. Densely connected convolutional networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 4700–4708, (2017).
Deng, J. et al. Imagenet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, 248–255 (IEEE, 2009).
Szegedy, C., Ioffe, S., Vanhoucke, V. & Alemi, A. A. Inception-v4, inception-resnet and the impact of residual connections on learning. In Thirty-First AAAI Conference on Artificial Intelligence (2017).
Szegedy, C. et al. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 1–9, (2015).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 770–778, (2016).
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F. & Adam, H. Encoder–decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV) 801–818, (2018).
Chollet, F. Xception: Deep learning with depthwise separable convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 1251–1258, (2017).
Chen, L.-C., Papandreou, G., Schroff, F. & Adam, H. Rethinking atrous convolution for semantic image segmentation. arXiv preprintarXiv:1706.05587 (2017).
Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9, 62–66 (1979).
Article Google Scholar
Khened, M., Kollerathu, V. A. & Krishnamurthi, G. Fully convolutional multi-scale residual densenets for cardiac segmentation and automated cardiac diagnosis using ensemble of classifiers. Med. Image Anal. 51, 21–45 (2019).
Article Google Scholar
Kingma, D. & Ba, J. Adam: a method for stochastic optimization. arXiv preprintarXiv:1412.6980 (2014).
van der Walt, S. et al. scikit-image: image processing in Python. PeerJ 2, e453. https://doi.org/10.7717/peerj.453 (2014).
Article PubMed PubMed Central Google Scholar
Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. Smote: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002).
Article Google Scholar
Tomek, I. Two modifications of CNN. IEEE Trans. Syst. Man Cybern. 6, 769–772 (1976).
Batista, G. E., Prati, R. C. & Monard, M. C. A study of the behavior of several methods for balancing machine learning training data. ACM SIGKDD Explor. Newsl. 6, 20–29 (2004).
Article Google Scholar
Liaw, A. et al. Classification and regression by randomforest. R News 2, 18–22 (2002).
Google Scholar
Begoli, E., Bhattacharya, T. & Kusnezov, D. The need for uncertainty quantification in machine-assisted medical decision making. Nat. Mach. Intell. 1, 20 (2019).
Article Google Scholar
Kendall, A. & Gal, Y. What uncertainties do we need in Bayesian deep learning for computer vision?. In Advances in Neural Information Processing Systems 5574–5584, (2017).
Leibig, C., Allken, V., Ayhan, M. S., Berens, P. & Wahl, S. Leveraging uncertainty information from deep neural networks for disease detection. Scientific reports 7, 17816 (2017).
Article ADS Google Scholar
Gal, Y. & Ghahramani, Z. Dropout as a Bayesian approximation: representing model uncertainty in deep learning. In International Conference on Machine Learning 1050–1059, (2016).
Lee, S., Oh, S., Choi, K. & Kim, S. W. Automatic classification on patient-level breast cancer metastases (2019). Accessed 31 Dec 2019.
Pinchaud, N. Camelyon17 grand challenge (2019). Accessed 31 Dec 2019.
Graham, S. et al. Mild-net: minimal information loss dilated network for gland instance segmentation in colon histology images. Med. Image Anal. 52, 199–211 (2019).
Article Google Scholar
Krähenbühl, P. & Koltun, V. Efficient inference in fully connected crfs with gaussian edge potentials. Advances in Neural Information Processing Systems 109–117, (2011).
Li, Y. & Ping, W. Cancer metastasis detection with neural conditional random field. In Medical Imaging with Deep Learning (2018).
Kass, M., Witkin, A. & Terzopoulos, D. Snakes: active contour models. Int. J. Comput. Vis. 1, 321–331 (1988).
Article Google Scholar

Download references

Author information

These authors contributed equally: Mahendra Khened, Avinash Kori and Haran Rajkumar.

Authors and Affiliations

Department of Engineering Design, Indian Institute of Technology Madras, Chennai, 600036, India
Mahendra Khened, Avinash Kori, Haran Rajkumar & Ganapathy Krishnamurthi
Department of Mechanical Engineering, Indian Institute of Technology Madras, Chennai, 600036, India
Balaji Srinivasan

Authors

Mahendra Khened
View author publications
You can also search for this author in PubMed Google Scholar
Avinash Kori
View author publications
You can also search for this author in PubMed Google Scholar
Haran Rajkumar
View author publications
You can also search for this author in PubMed Google Scholar
Ganapathy Krishnamurthi
View author publications
You can also search for this author in PubMed Google Scholar
Balaji Srinivasan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.K. worked on the CAMELYON dataset-related algorithms and experiments. A.K. worked on DigestPath related algorithms and experiments. H.R. worked on PAIP2019 related algorithms and experiments. H.R. prepared images S1, S2 and S3. M.K. prepared images 5 and S4. M.K. mentored A.K. and H.R. on their work. H.R. and A.K. developed the DigiPathAI software. G.K. and B.S. guided and were the principal investigators. All authors reviewed the manuscript.

Corresponding author

Correspondence to Ganapathy Krishnamurthi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Khened, M., Kori, A., Rajkumar, H. et al. A generalized deep learning framework for whole-slide image segmentation and analysis. Sci Rep 11, 11579 (2021). https://doi.org/10.1038/s41598-021-90444-8

Download citation

Received: 04 January 2021
Accepted: 04 May 2021
Published: 02 June 2021
DOI: https://doi.org/10.1038/s41598-021-90444-8

This article is cited by

Mapping cell-to-tissue graphs across human placenta histology whole slide images using deep learning with HAPPY
- Claudia Vanea
- Jelisaveta Džigurski
- Christoffer Nellåker
Nature Communications (2024)
Rapid artefact removal and H&E-stained tissue segmentation
- B. A. Schreiber
- J. Denholm
- E. J. Soilleux
Scientific Reports (2024)
Improving the speed and quality of cancer segmentation using lower resolution pathology images
- Jieyi Li
- Anwar Osseyran
- Damian Podareanu
Multimedia Tools and Applications (2024)
Improved Watershed Segmentation and DualNet Deep Learning Classifiers for Breast Cancer Classification
- Jyoti Kadadevarmath
- A. Padmanabha Reddy
SN Computer Science (2024)
Classification of benign and malignant subtypes of breast cancer histopathology imaging using hybrid CNN-LSTM based transfer learning
- Mahati Munikoti Srikantamurthy
- V. P. Subramanyam Rallabandi
- Junhyung Park
BMC Medical Imaging (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Related work

Deep learning methods for histopathological image analysis

Contributions

Materials and methods

Datasets used for this study

CAMELYON16

CAMELYON17

PAIP

DigestPath

Network architecture

Training pipeline

Tissue mask generation

Tissue mask generation specific to CAMELYON dataset

Patch coordinate extraction

Data augmentation

Loss function

Training

Inference pipeline

pN-staging pipeline for CAMELYON17 dataset

Tumour burden estimation for PAIP dataset

Uncertainty analysis

Challenge results

Performance evaluation on CAMELYON17 challenge

Performance evaluation on DigestPath 2019 challenge

Performance evaluation on PAIP 2019 challenge

Discussion and conclusions

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links