An automatic entropy method to efficiently mask histology whole-slide images

Song, Yipei; Cisternino, Francesco; Mekke, Joost M.; de Borst, Gert J.; de Kleijn, Dominique P. V.; Pasterkamp, Gerard; Vink, Aryan; Glastonbury, Craig A.; van der Laan, Sander W.; Miller, Clint L.

doi:10.1038/s41598-023-29638-1

Download PDF

Article
Open access
Published: 15 March 2023

An automatic entropy method to efficiently mask histology whole-slide images

Yipei Song^1,2,
Francesco Cisternino³,
Joost M. Mekke⁴,
Gert J. de Borst⁴,
Dominique P. V. de Kleijn⁴,
Gerard Pasterkamp⁵,
Aryan Vink⁶,
Craig A. Glastonbury³,
Sander W. van der Laan⁵^na1 &
…
Clint L. Miller^2,7^na1

Scientific Reports volume 13, Article number: 4321 (2023) Cite this article

1374 Accesses
2 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Tissue segmentation of histology whole-slide images (WSI) remains a critical task in automated digital pathology workflows for both accurate disease diagnosis and deep phenotyping for research purposes. This is especially challenging when the tissue structure of biospecimens is relatively porous and heterogeneous, such as for atherosclerotic plaques. In this study, we developed a unique approach called ‘EntropyMasker’ based on image entropy to tackle the fore- and background segmentation (masking) task in histology WSI. We evaluated our method on 97 high-resolution WSI of human carotid atherosclerotic plaques in the Athero-Express Biobank Study, constituting hematoxylin and eosin and 8 other staining types. Using multiple benchmarking metrics, we compared our method with four widely used segmentation methods: Otsu’s method, Adaptive mean, Adaptive Gaussian and slideMask and observed that our method had the highest sensitivity and Jaccard similarity index. We envision EntropyMasker to fill an important gap in WSI preprocessing, machine learning image analysis pipelines, and enable disease phenotyping beyond the field of atherosclerosis.

Rapid artefact removal and H&E-stained tissue segmentation

Article Open access 03 January 2024

B. A. Schreiber, J. Denholm, … E. J. Soilleux

TIAToolbox as an end-to-end library for advanced tissue image analytics

Article Open access 24 September 2022

Johnathan Pocock, Simon Graham, … Shan E Ahmed Raza

A generalized deep learning framework for whole-slide image segmentation and analysis

Article Open access 02 June 2021

Mahendra Khened, Avinash Kori, … Balaji Srinivasan

Introduction

Atherosclerosis is a chronic inflammatory process resulting in arterial stiffening and plaque formation, and is the leading cause of myocardial infarction, ischemic stroke, and peripheral artery disease^1,2. Historically, researchers and pathologists have characterized atherosclerotic plaque through standard histology and light microscopy analysis^3,4,5,6. The composition of atherosclerotic plaques is highly variable, with different plaque types having distinct clinical manifestations^3,7. For instance, more stable, fibrous-rich atheroma plaques are typically asymptomatic, whereas unstable, thin-cap fibroatheroma plaques are more prone to rupture and thrombus formation underlying cerebral or coronary events^5,6. However, the value of atherosclerotic plaque composition in predicting cardiovascular events remains a subject of debate and ongoing research^7,8,9.

Histological analysis of atherosclerosis is mainly used in the research setting and can reveal the extent of disease progression as well as underlying etiology. For example, in pre-clinical models of atherosclerosis, the amount of smooth muscle content and collagen extracellular matrix in the plaque often correlates with greater plaque stability⁸. These features can now be easily captured from stained tissue sections and digitized as high-resolution whole-slide images (WSI). WSI data provides a rich resource for quantitative and qualitative image analysis and has been a focus of digital pathology¹⁰. Overall, this has simplified archiving, enabled remote diagnosis, and accelerated both clinical decision-making and research investigations¹¹.

Accurately identifying tissue (foreground) from background is a necessary initial step in WSI analysis in digital pathology workflows. However, the process of WSI acquisition has its own technical challenges. It is often challenging to identify or "mask" the entire non-tissue region from WSI, e.g., due to factors such as dust, translucent tissue, or weak immunostaining, which may lead to incorrect diagnosis¹². Since rescanning and manually inspecting slides is labor-intensive, automated algorithms that highlight the tissue-specific areas to be analyzed is a more efficient solution¹³. These algorithms also adjust for inter- and intra-image aberrations in color, staining and slide preparation through more robust tissue segmentation.

The most straightforward method for identifying tissue on a white background is to set a specified threshold for the grayscale version of the image¹⁴, specific color channels¹⁵, or the optical density of the red, green, and blue (RGB) color channels¹⁶. Alternatively, tissue can be identified from WSIs split into a uniform patch grid that is subjected to grayscale image threshold^17,18. Otsu's thresholding¹⁹ is a commonly used tissue masking method in WSI analysis studies^13,20,21. Several image segmentation methods have been developed to extend Otsu’s method and improve foreground extraction from WSI, such as GrabCut²² and Foreground Extraction from Structural Information (FESI)²³. While these methods are easy to implement, it has been difficult to generalize to more variable stains or clinical specimens. A more accurate approach is still needed in digital pathology workflows, especially for specimens containing small empty spaces between extracellular matrix fibers (e.g., collagen) or due to decalcifying procedures, and transparent tissue (e.g., adipose or lipid droplets) in atherosclerotic plaques. Image thresholding techniques based on entropy maximization have been routinely applied to infrared image analysis^24,25,26. Similar to infrared images with poor contrast and noise, WSI may also benefit from entropy thresholding to distinguish tissues from the slide background; to our knowledge this approach has not yet been applied to WSI.

In this paper, we propose a fully automated approach for separating foreground (tissue) and background in WSI of HE-stained atherosclerotic plaque samples. Our method, EntropyMasker, is unaffected by changes in scanning or image processing conditions, by using a measure of local entropy and generating corresponding binary tissue masks. This allows for increased scalability across various tissue contexts and implementation in both machine learning and computer vision cell segmentation pipelines, e.g., using CellProfiler²⁷ and slideToolKit²⁸.

Methods

Our method: EntropyMasker

For WSI processing, ideally the slide background is evenly achromatic, with the background mostly one color, e.g., black (fluorescence) or white (brightfield). Unfortunately, in practice, there is often inconsistency of the background color due to variations in image capturing color temperature (e.g., from shades of blue to yellow), resulting in edge artifacts from tile-based scanning methods²⁹. Therefore, the slide background cannot be easily masked by transforming the images from RGB color space to the HSV (Hue-Saturation-Value) and excluding the white color (0°, 0%, 100%).

In information theory³⁰, entropy is defined as the log-base-2 of the number of possible outcomes for a sent/received message. In image analysis, entropy refers to the degree of randomness or complexity of pixels in a defined region or neighborhood of pixels. We defined the local entropy of a specific region as follows:

$$H\left( F \right) = - \mathop \sum \limits_{i = 0}^{255} p_{F} \left( i \right)log_{2} p_{F} \left( i \right),$$

where $p_{F} \left( i \right)$ represents the probability of a grayscale pixel, $i$, in a local footprint region.

In the case of $p_{F} \left( i \right) = 0$ for some $i$, the corresponding sum value and $0log_{2} 0$ is set to $0$, which is consistent with the limit: $\mathop {lim}\limits_{{p_{F} \to 0^{ + } }} p_{F} \log \left( {p_{F} } \right) = 0$.

Based on this approach, homogeneous pixels within local neighborhood regions (disk-shaped footprints with radius of 5) have a low probability and contribute less to the local entropy, whereas heterogeneous pixels within local neighborhood regions have a high probability and contribute more to the local entropy. Also, the background of an image has lower entropy due to higher brightness and more homogeneous texture, whereas foreground tissue has higher entropy due to the diversity of pixel intensities in the local neighborhood. In other words, the heterogenous pixels in tissue with high entropy are more informative than homogenous background pixels with low entropy³¹.

The purpose is to extract the tissue (foreground) from the background. We first calculated local entropies and then we made a histogram of all local entropies. Since the background of the image has a lower entropy value, we first defined the threshold as the local minima of the histogram of the entropy map. Next, we applied this threshold to convert the local entropy map into a binary map which distinguishes background noise from tissue (foreground) in the WSI.

Otsu’s method

Otsu's method is a clustering-based image thresholding algorithm¹⁹, commonly utilized in image analysis and digital histopathology applications^{20,29,32,33,34}. Following a bimodal histogram, the technique assumes that the image has two classes of pixels (tissue pixels and background pixels). It determines the best threshold for dividing the two groups such that their total intra-class variance is as low as possible.

Adaptive thresholding

Adaptive thresholding is another widely used method for calculating the threshold value for smaller areas of WSIs^35,36,37,38. Typically, these threshold values are calculated in two different ways, either using the mean of the neighborhood area, or a Gaussian weighted sum of neighborhood values.

slideMask method from slideToolKit

The slideToolKit²⁸ is an assistive tool set for the histological quantification of whole-slide images (https://github.com/swvanderlaan/slideToolKit). The slideMask tool in slideToolKit uses convert from ImageMagick (https://imagemagick.org) and a miniature version of the WSI to generate a mask (https://github.com/swvanderlaan/slideToolKit/blob/master/slideMask). The image is first blurred to eliminate dust and speckles. Then Fuzz in ImageMagick³⁹ (which is computed as the root mean squared difference between two colors of the pixels) is used to create a fuzzy, non-stringent selection to identify the white background, which is subsequently substituted with black. The slideMask tool has settings for blur and fuzziness that may be explored and altered. Manual adjustments may be made to generated masks in any image editor (such as free GNU Image Manipulation Program; GIMP^38,40). Unwanted regions on the WSI (such as marker stripes or air bubbles beneath the coverslip) may necessitate this procedure.

All proposed methods were implemented in the Python programming language using OpenCV⁴¹ and other required packages (see Data and code availability for more details).

Athero-express biobank study

Patient population

Atherosclerotic plaques were obtained from patients undergoing an arterial endarterectomy procedure and included in the Athero-Express Biobank Study (AE), an ongoing biobank study at the University Medical Centre Utrecht (Utrecht, The Netherlands) and the St. Antonius Hospital (Nieuwegein, The Netherlands)³. The medical ethical committees of the respective hospitals approved the AE which was registered under number TME/C-01.18. This study complies with the Declaration of Helsinki, and all participants provided informed consent. However, considering national laws plaque-material are considered ‘waste biomaterial’ and are always allowed to be used without any personal information regardless of informed consent. In this study we also considered this ‘waste biomaterial’, hence no clinical information, e.g., age, is given here nor when sharing data of the relevant samples (n = 3). The study design of the AE was described before³, but in brief: during surgery blood and plaques are obtained, stored at − 80 °C and plaque material is routinely used for histological analysis^3,42.

Whole-slide staining and scanning

The standardized (immuno)histochemical analysis protocols used in the AE biobank have been described previously^3,42. In short, 4‐micron cross‐sections of the paraffin‐embedded segments were cut using a microtome, and 8 different stains were applied for endothelial cells (CD34), macrophages (CD68), elastic Van Gieson (EvG), fibrin, red blood cells (glycophorin C, GLYCC), hematoxylin and eosin (HE) for nuclei, collagen (picrosirius red, SR), and smooth muscle cells (SMCs, α‐actin) on consecutive slides. We set up ExpressScan to obtain whole-slide images (WSIs) by scanning stained slides at 40× using a Roche Ventana iScan HT (https://diagnostics.roche.com/global/en/products/instruments/ventana-iscan-ht.html) or Hamamatsu C12000-22 Digital slide scanner (https://www.hamamatsu.com/eu/en/product/life-science-and-medical-systems/digital-slide-scanner/index.html). WSIs were stored digitally as z-stacked .TIF (Roche) at 0.25 micron/pixel or .ndpi (Hamamatsu) at 0.23 micron/pixel brightfield microscopy images¹⁰. The slides numbers used for this project are listed in Supplemental Table 1.

Table 1 Comparison of different masking methods.

Full size table

Manual tissue annotation and masking

The ground truth tissue areas for each WSI were annotated manually at 40 × magnification using QuPath⁴³. If the tissue had disintegrated into several disjointed fragments during preparation or staining the annotators marked them with a single enclosing polygon. Binary masks of these tissue annotations were then generated using groovy scripts in QuPath.

Statistical analyses

We assessed the algorithms using several metrics: The Jaccard index, sensitivity, False positive Rate (FPR) and pixel accuracy, comparing the algorithms' output to the ground truth binary reference masks of the 97 images in the test set.

Jaccard index was applied to assess the performance of our masking algorithms, by measuring the overlap area of target masks with the ground truth masks annotated by experts and masks of the algorithm outputs divided by the area of the union of both masks. Mathematically, sensitivity, false positive rate and pixel accuracy can be expressed as:

$$Sensitivity = \frac{TP}{{ TP + FN}},$$

$$False positive rate = { }\frac{FP}{{FP + TN}} ,$$

$$Pixel accuracy = \frac{TP + TN}{{TP + FP + FN + TN}}.$$

Evaluation and results

We developed EntropyMasker, a novel masking method using entropy-filtering (Fig. 1). First, we compared our method against three conventional thresholding methods: Otsu’s, adaptive thresholding using the mean of neighborhood area as the threshold, and adaptive thresholding using the Gaussian weighted sum of neighborhood values as the threshold. We also evaluated these methods against the previously developed slideMask tissue masking method in the slideToolKit²⁸.

Application of quantitative WSI analysis pipelines requires access to uniformly processed and well-characterized tissue biobanks. The Athero-Express study³ is a large-scale vascular tissue biobank comprising over 3600 carotid and femoral endarterectomy surgical specimens, which include detailed clinical outcomes and follow-up. This has led to several prospective studies to correlate local atherosclerotic plaque composition with future local and systemic vascular outcomes, using histology^7,8,9,10, RNA^44,45, genetics⁴⁶, and protein^47,48 data. In the histology-based analyses, local plaque indices such as plaque hemorrhage and neovascularity were shown to correlate with more adverse vascular outcomes^7,9. We evaluated each method using 97 randomly selected WSIs from Athero-Express (Supplemental Table 1 for a minimalistic baseline table and Supplemental Table 2 for details on usage) at 40X magnification on layer 6 (pixel spacing at layer 6 is around 8.36 µm) consisting of 8 different stains: HE, picrosirius red, Fibrin, EVG, a-SMA, CD34, CD68, and GLYCC. We did not apply our method to layer 1 due to memory constraints.

The average Jaccard index, also referred to as the Intersection over Union (IoU) metric, of the adaptive thresholding method where its threshold value is the weighted sum of neighborhood values where weights are a gaussian window was 0.7702 and the average sensitivity was 0.8597 (Table 1, Supplemental Table 3 includes results for all stains and methods). This masking method produced the highest pixel accuracy among all the traditional methods which was 0.9485. Otsu’s method had a much lower false positive rate than any of the approaches examined at the cost of much lower average Dice scores and sensitivity levels. The quality of the outcome of the slideMask algorithm was unevenly distributed resulting in a relatively low Jaccard index (IoU) across all the tested methods (Fig. 2). The average false positive rate was 0.0153, which is the lowest of all masking methods in comparison, and its average pixel accuracy was 0.9133. Our automatic local entropy-based masking method, EntropyMasker, had the best average Jaccard index (IoU) 0.8837 and the highest sensitivity 0.9685, outperforming Otsu’s method, two adaptive methods and slideMask masking method in both metrics. The average false positive rate was 0.0176 which is much lower than Otsu’s method, and two adaptive methods.

The results of comparing the 5 masking methods for an H&E-stained whole-slide image are shown in Fig. 3 where generated tissue masks are overlaid by pseudocolor (green). To show the generalizability of our automatic entropy method, we tested on whole-slide images with 8 different types of staining. Examples of masking results of our proposed method are shown on Fig. 4 where generated tissue masks are overlaid by pseudocolor (green).

Since we used two different scanners, we also evaluated the EntropyMasker performance using different image formats (.ndpi or .TIF), which did not influence the Jaccard index (Supplemental Fig. 1). Also, to mitigate the influence of the porosity of the tissue, we evaluated these methods on both high (20X) and low (5x) resolution WSIs, however the performance was unaffected (Supplemental Table 4).

The computation speed for Otsu’s method, adaptive methods, slideMask and proposed EntropyMasker method on a 3.6 GHz Quad-Core Intel Core i7 processor, 32 GB 2400 MHz DDR4 memory iMac are 0.08, 0.11, 0.25 and 0.34 s per one million pixels, respectively.

Conclusions and discussion

Overall, our automatic entropy masking method, EntropyMasker, performed well on atherosclerotic plaque cross-sectional WSIs derived from 8 different types of stains. With a relatively high average Jaccard index our method was able to separate out the foreground and background accurately and consistently from these complex images (Fig. 2 and 4).

In comparison, we observed the popular masking method, Otsu’s, and our previous method, slideMask, both tended to miss some tissue areas along the tissue boundary. This likely results from the relatively lower intensity of these areas, where Otsu’s method sets a threshold in the middle of two peaks, thus partially omitting the higher intensity pixels compared to the threshold (Fig. 2). The slideMask method computes the root mean squared difference between two colors, however this becomes problematic when the intensity of tissue area pixels is relatively large (Fig. 2).

We also found that the adaptive methods tend to produce ‘porosity’ during the masking, since these methods miss small regions of the image which are supposed to be regarded as tissue (Fig. 2). This large scale missing tissue makes Otsu’s, slideMask and adaptive thresholding methods impractical for use in many image processing workflows^10,28, especially when applied to atherosclerotic plaque images. Given the limited availability and heterogeneous nature of these plaques, an ideal masking method will retain all the plaque components, which can be used for automatic cellular segmentation tasks to discover tissue or compartment-specific markers for disease progression and phenotyping. This is also a valuable feature of EntropyMasker, which can be implemented at the preprocessing stage of various machine learning pipelines for other diseases and research areas which require a complete WSI masking step^49,50,51.

Despite the accurate and consistent performance of EntropyMasker, it is worth pointing out some known limitations. In comparison to the other methods that we evaluated, our method has the highest sensitivity, and thus tends to over-include a few pixels outside the edges of tissue (Fig. 3). While these regions represent a small fraction of the total tissue areas that we tested, this should be more carefully considered when evaluating smaller tissue sections¹¹. Also, our method had slightly lower pixel accuracy than the adaptive methods, which may be important for tasks involving more subtle intensity differences in defining target regions of the tissue⁵². Compared to the other methods discussed, ours is slightly slower (0.34 s per one million pixels compared to 0.11 s for Otsu’s) which may pose an issue when examining thousands of images, although this should be negligible since our method is scalable on any high-performance compute cluster.

In conclusion, we demonstrated the effectiveness of our proposed method for tissue masking on human atherosclerotic plaque WSIs of different types of stains including HE, CD34, CD68, EVG, smooth muscle cell α‐actin, picrosirius red, fibrin and Glycophorin C. By evaluating our method against other popular masking methods and those recently developed, we also demonstrated that our entropy-based masking method is scalable and had optimal performance across these WSIs. Given its scalability in WSI processing, we envision EntropyMasker to be readily adopted across various disease contexts and implemented in existing pipelines as an input to both deep learning and cell counting tasks.

Data availability

The full histological data used in this study are available here https://doi.org/10.34894/GN4YOS including the high- and low-resolution whole-slide images.

All documented code and the tutorial to run EntropyMasker can be found here https://github.com/CirculatoryHealth/EntropyMasker.

References

Ross, R. Atherosclerosis–an inflammatory disease. N. Engl. J. Med. 340, 115–126 (1999).
Article CAS PubMed Google Scholar
Lusis, A. J. Atherosclerosis. Nature 407, 233–241 (2000).
Article CAS PubMed PubMed Central Google Scholar
Verhoeven, B. A. N. et al. Athero-express: differential atherosclerotic plaque expression of mRNA and protein in relation to cardiovascular events and patient characteristics. Rationale and design. Eur. J. Epidemiol. 19, 1127–1133 (2004).
Article CAS PubMed Google Scholar
Burke, A. P., Kolodgie, F. D., Farb, A., Weber, D. & Virmani, R. Morphological predictors of arterial remodeling in coronary atherosclerosis. Circulation 105, 297–303 (2002).
Article PubMed Google Scholar
Burke, A. P. et al. Coronary risk factors and plaque morphology in men with coronary disease who died suddenly. N. Engl. J. Med. 336, 1276–1282 (1997).
Article CAS PubMed Google Scholar
Farb, A. et al. Coronary plaque erosion without rupture into a lipid core. A frequent cause of coronary thrombosis in sudden coronary death. Circulation 93, 1354–1363 (1996).
Article CAS PubMed Google Scholar
Willem, E. H. et al. Composition of carotid atherosclerotic plaque is associated with cardiovascular outcome: A prognostic study. Circulation 121, 1941–1950 (2010).
Article Google Scholar
van Lammeren, G. W. et al. Time-dependent changes in atherosclerotic plaque composition in patients undergoing carotid surgery. Circulation 129, 2269–2276 (2014).
Article PubMed Google Scholar
Vrijenhoek, J. E. P. et al. Sex is associated with the presence of atherosclerotic plaque hemorrhage and modifies the relation between plaque hemorrhage and cardiovascular outcome. Stroke 44, 3318–3323 (2013).
Article PubMed Google Scholar
Mekke, J. M. et al. Glycophorin C in atherosclerotic plaque is associated with major adverse cardiovascular events after carotid endarterectomy. bioRxiv https://doi.org/10.1101/2021.07.15.21260570 (2021).
Article Google Scholar
Baxi, V., Edwards, R., Montalto, M. & Saha, S. Digital pathology and artificial intelligence in translational medicine and clinical practice. Mod. Pathol. 35, 23–32 (2022).
Article CAS PubMed Google Scholar
Bándi, P., Balkenhol, M., van Ginneken, B., van der Laak, J. & Litjens, G. Resolution-agnostic tissue segmentation in whole-slide histopathology images with convolutional neural networks. PeerJ 7, e8242 (2019).
Article PubMed PubMed Central Google Scholar
Bandi, P. et al. From detection of individual metastases to classification of lymph node status at the patient level: The CAMELYON17 challenge. IEEE Trans. Med. Imaging 38, 550–560 (2019).
Article PubMed Google Scholar
Hart, S. N. et al. Classification of melanocytic lesions in selected and whole-slide images via convolutional neural networks. J. Pathol. Inform. 10, 5 (2019).
Article PubMed PubMed Central Google Scholar
Yu, K.-H. et al. Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features. Nat. Commun. 7, 12474 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Xia, T., Kumar, A., Feng, D. & Kim, J. Patch-level tumor classification in digital histopathology images with domain adapted deep learning. Conf. Proc. IEEE Eng. Med. Biol. Soc. 2018, 644–647 (2018).
Google Scholar
Liu, Y. et al. Artificial intelligence-based breast cancer nodal metastasis detection: Insights into the black box for pathologists. Arch. Pathol. Lab. Med. 143, 859–868. https://doi.org/10.5858/arpa.2018-0147-oa (2019).
Article CAS PubMed Google Scholar
Coudray, N. et al. Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning. Nat. Med. 24, 1559–1567. https://doi.org/10.1038/s41591-018-0177-5 (2018).
Article CAS PubMed PubMed Central Google Scholar
Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9, 62–66. https://doi.org/10.1109/tsmc.1979.4310076 (1979).
Article Google Scholar
Campanella, G. et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat. Med. 25, 1301–1309 (2019).
Article CAS PubMed PubMed Central Google Scholar
Vanderbeck, S., Bockhorst, J., Komorowski, R., Kleiner, D. E. & Gawrieh, S. Automatic classification of white regions in liver biopsies by supervised machine learning. Hum. Pathol. 45, 785–792 (2014).
Article PubMed Google Scholar
Rother, C., Kolmogorov, V. & Blake, A. GrabCut. in ACM SIGGRAPH 2004 Papers on - SIGGRAPH ’04 (ACM Press, 2004). https://doi.org/10.1145/1186562.1015720.
Bug, D., Feuerhake, F. & Merhof, D. Foreground extraction for histopathological whole slide imaging. Informatik aktuell https://doi.org/10.1007/978-3-662-46224-9_72 (2015).
Article Google Scholar
Feng, D., Wenkang, S., Liangzhou, C., Yong, D. & Zhenfu, Z. Infrared image segmentation with 2-D maximum entropy method based on particle swarm optimization (PSO). Pattern Recogn. Lett. 26, 597–603. https://doi.org/10.1016/j.patrec.2004.11.002 (2005).
Article ADS Google Scholar
Zhang, H., Qian, W., Wan, M. & Zhang, K. Infrared image enhancement algorithm using local entropy mapping histogram adaptive segmentation. Infrared Phys. Technol. 120, 104000. https://doi.org/10.1016/j.infrared.2021.104000 (2022).
Article Google Scholar
Deng, H., Sun, X., Liu, M., Ye, C. & Zhou, X. Infrared small-target detection using multiscale gray difference weighted image entropy. IEEE Trans. Aerosp. Electr. Syst. 52, 60–72. https://doi.org/10.1109/taes.2015.140878 (2016).
Article ADS Google Scholar
Stirling, D. R. et al. Cell profiler 4: Improvements in speed, utility and usability. BMC Bioinform. 22, 433 (2021).
Article Google Scholar
Nelissen, B. G. L., van Herwaarden, J. A., Moll, F. L., van Diest, P. J. & Pasterkamp, G. SlideToolkit: An assistive toolset for the histological quantification of whole slide images. PLoS ONE 9, e110289 (2014).
Article ADS PubMed PubMed Central Google Scholar
Anghel, A. et al. A high-performance system for robust stain normalization of whole-slide images in histopathology. Front. Med. 6, 193 (2019).
Article Google Scholar
Tsallis, C. Beyond Boltzmann–Gibbs–Shannon in physics and elsewhere. Entropy 21(7), 696 (2019).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Lin, J. Divergence measures based on the Shannon entropy. IEEE Trans. Inf. Theory 37, 145–151. https://doi.org/10.1109/18.61115 (1991).
Article MathSciNet MATH Google Scholar
Arunachalam, H. B. et al. Viable and necrotic tumor assessment from whole slide images of osteosarcoma using machine-learning and deep-learning models. PLoS ONE 14, e0210706 (2019).
Article CAS PubMed PubMed Central Google Scholar
Guo, Z. et al. Publisher correction: A fast and refined cancer regions segmentation framework in whole-slide breast pathological images. Sci. Rep. 10, 8591 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Romo-Bucheli, D., Janowczyk, A., Gilmore, H., Romero, E. & Madabhushi, A. Automated tubule nuclei quantification and correlation with oncotype DX risk categories in ER+ breast cancer whole slide images. Sci. Rep. 6, 32706 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Valkonen, M. et al. Metastasis detection from whole slide images using local features and random forests. Cytometry A 91, 555–565 (2017).
Article PubMed Google Scholar
Wang, P., Li, P., Li, Y., Xu, J. & Jiang, M. Classification of histopathological whole slide images based on multiple weighted semi-supervised domain adaptation. Biomed. Signal Process. Control 73, 103400. https://doi.org/10.1016/j.bspc.2021.103400 (2022).
Article Google Scholar
Kothari, S., Phan, J. H. & Wang, M. D. Eliminating tissue-fold artifacts in histopathological whole-slide images for improved image-based prediction of cancer grade. J. Pathol. Inform. 4, 22. https://doi.org/10.4103/2153-3539.117448 (2013).
Article PubMed PubMed Central Google Scholar
Lu, C. & Mandal, M. Automated analysis and diagnosis of skin melanoma on whole slide histopathological images. Pattern Recogn. 48, 2738–2750. https://doi.org/10.1016/j.patcog.2015.02.023 (2015).
Article ADS Google Scholar
ImageMagick Studio LLC. ImageMagick. ImageMagick https://imagemagick.org/.
GIMP. GIMP https://www.gimp.org/.
Bradski. The openCV library. Dr. Dobb’s Journal: Software Tools for the Professional.
van Lammeren, G. W. et al. Atherosclerotic plaque vulnerability as an explanation for the increased risk of stroke in elderly undergoing carotid artery stenting. Stroke 42, 2550–2555 (2011).
Article PubMed Google Scholar
Bankhead, P. et al. QuPath: Open source software for digital pathology image analysis. Sci. Rep. 7, 16878 (2017).
Article ADS PubMed PubMed Central Google Scholar
Depuydt, M. A. C. et al. Microanatomy of the human atherosclerotic plaque by single-cell transcriptomics. Circ. Res. 127, 1437–1455 (2020).
Article CAS PubMed PubMed Central Google Scholar
Slenders, L. et al. Intersecting single-cell transcriptomics and genome-wide association studies identifies crucial cell populations and candidate genes for atherosclerosis. Eur. Heart J. Open https://doi.org/10.1093/ehjopen/oeab043 (2021).
Article PubMed PubMed Central Google Scholar
van der Laan, S. W. et al. Genetic susceptibility loci for cardiovascular disease and their impact on atherosclerotic plaques. Circ. Genom. Precis. Med. 11, e002115 (2018).
PubMed Google Scholar
Peeters, W. et al. Adipocyte fatty acid binding protein in atherosclerotic plaques is associated with local vulnerability and is predictive for the occurrence of adverse cardiovascular events. Eur. Heart J. 32, 1758–1768 (2011).
Article CAS PubMed Google Scholar
de Kleijn, D. P. V. et al. Local atherosclerotic plaques are a source of prognostic biomarkers for adverse cardiovascular events. Arterioscler. Thromb. Vasc. Biol. 30, 612–619 (2010).
Article PubMed Google Scholar
Nirschl, J. J. et al. A deep-learning classifier identifies patients with clinical heart failure using whole-slide images of H&E tissue. PLoS ONE 13, e0192726. https://doi.org/10.1371/journal.pone.0192726 (2018).
Article CAS PubMed PubMed Central Google Scholar
Lipkova, J. et al. Deep learning-enabled assessment of cardiac allograft rejection from endomyocardial biopsies. Nat. Med. 28, 575–582 (2022).
Article CAS PubMed PubMed Central Google Scholar
Bouteldja, N. et al. Deep learning-based segmentation and quantification in experimental kidney histopathology. J. Am. Soc. Nephrol. 32, 52–68 (2021).
Article CAS PubMed Google Scholar
Khened, M., Kori, A., Rajkumar, H., Krishnamurthi, G. & Srinivasan, B. A generalized deep learning framework for whole-slide image segmentation and analysis. Sci. Rep. https://doi.org/10.1038/s41598-021-90444-8 (2021).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We also acknowledge support from the Netherlands CardioVascular Research Initiative of the Netherlands Heart Foundation (CVON 2011/B019 and CVON 2017-20: Generating the best evidence-based pharmaceutical targets for atherosclerosis [GENIUS I&II]), the ERA-CVD program ‘druggable-MI-targets’ (Grant No: 01KL1802), and the Chan Zuckerberg Initiative Foundation Data Insights program (Grant No: DI-092).We would also like to thank all the (former) employees involved in the Athero-Express Biobank Study of the Departments of Surgery of the St. Antonius Hospital Nieuwegein and University Medical Center Utrecht for their continuing work. We would like to thank (in no particular order) Marijke Linschoten, Arjan Samani, Petra H. Homoed-van der Kraak, Tim Bezemer, Tim van de Kerkhof, Joyce Vrijenhoek, Evelyn Velema, Ben van Middelaar, Sander Reukema, Robin Reijers, Joëlle van Bennekom, and Bas Nelissen. Lastly, we would like to thank all participants of the Athero-Express Biobank Study; without you these studies would not be possible.

Funding

Funding for this research was provided by National Institutes of Health (NIH) grant nos. R00HL125912 and R01HL14823 (to CLM), a Leducq Foundation Transatlantic Network of Excellence (‘PlaqOmics’) grant no. 18CVD02 (to CLM and SWvdL), and the EU H2020 TO_AITION Grant no. 848146 (to SWvdL).

Author information

These authors contributed equally: Sander W. van der Laan and Clint L. Miller.

Authors and Affiliations

Department of Computer Engineering, University of Virginia, Charlottesville, VA, 22908, USA
Yipei Song
Center for Public Health Genomics, University of Virginia, MSB 3231, PO Box 800717, Charlottesville, VA, 22908, USA
Yipei Song & Clint L. Miller
Human Technopole, Viale Rita Levi-Montalcini, 1, 20157, Milano, Italy
Francesco Cisternino & Craig A. Glastonbury
Department of Vascular Surgery, Division Surgical Specialties, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
Joost M. Mekke, Gert J. de Borst & Dominique P. V. de Kleijn
Central Diagnostics Laboratory, Division Laboratories, Pharmacy, and Biomedical Genetics, University Medical Center Utrecht, Utrecht University, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
Gerard Pasterkamp & Sander W. van der Laan
Department of Pathology, Division Laboratories, Pharmacy, and Biomedical Genetics, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
Aryan Vink
Department of Public Health Sciences, Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, 22908, USA
Clint L. Miller

Authors

Yipei Song
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Cisternino
View author publications
You can also search for this author in PubMed Google Scholar
Joost M. Mekke
View author publications
You can also search for this author in PubMed Google Scholar
Gert J. de Borst
View author publications
You can also search for this author in PubMed Google Scholar
Dominique P. V. de Kleijn
View author publications
You can also search for this author in PubMed Google Scholar
Gerard Pasterkamp
View author publications
You can also search for this author in PubMed Google Scholar
Aryan Vink
View author publications
You can also search for this author in PubMed Google Scholar
Craig A. Glastonbury
View author publications
You can also search for this author in PubMed Google Scholar
Sander W. van der Laan
View author publications
You can also search for this author in PubMed Google Scholar
Clint L. Miller
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.S., J.M.M., S.W.vdL., and C.L.M. conceptualized the study. Y.S., J.M.M., C.A.G., and S.W.vdL. contributed to the data curation. Y.S. performed the formal analysis. S.W.vdL. and C.L.M. provided funding support. Y.S. contributed to the primary investigation. C.A.G., S.W.vdL., and C.L.M. contributed to project administration. J.M.M., G.J.dB., D.P.V.dK., G.P., A.V., and S.W.vdL. contributed the Athero-Express resources. Y.S., C.A.G., and S.W.vdL. contributed to software development. G.J.dB., D.P.V.dK., G.P., C.A.G., S.W.vdL., and C.L.M. contributed to project supervision. F.C. contributed to validation studies. Y.S., S.W.vdL., and C.L.M. contributed to visualization. Y.S. contributed to writing the original draft. All authors (Y.S., F.C., J.M.M., G.J.dB., D.P.V.dK., G.P., A.V., C.A.G., S.W.vdL., and C.L.M.) contributed to review and editing the final manuscript.

Corresponding authors

Correspondence to Sander W. van der Laan or Clint L. Miller.

Ethics declarations

Competing interests

Dr. Sander W. van der Laan has received Roche funding for unrelated work. Dr Craig A. Glastonbury has stock options in BenevolentAI and is a paid consultant for BenevolentAI, unrelated to this work. Dr. Clint L. Miller has received AstraZeneca funding for unrelated work. All other authors declare that they have no competing interests relevant to the contents of this paper to disclose.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Song, Y., Cisternino, F., Mekke, J.M. et al. An automatic entropy method to efficiently mask histology whole-slide images. Sci Rep 13, 4321 (2023). https://doi.org/10.1038/s41598-023-29638-1

Download citation

Received: 06 September 2022
Accepted: 08 February 2023
Published: 15 March 2023
DOI: https://doi.org/10.1038/s41598-023-29638-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.