Background & Summary

Since the 1990s, whole-slide scanners have enabled the capture of high-resolution digital images of entire specimens, comparable to conventional microscopic observation. This has led to the development of digital pathology, which employs computers to analyze whole-slide images (WSIs). Alongside rapid advancements in deep learning, researchers are developing artificial intelligence (AI) to help reduce the workload of pathologists, aid in predicting patient prognosis, and provide decision support for treatment plans based on WSIs1.

However, unwanted color and texture heterogeneity1,2,3 is present in digital histology images. This heterogeneity is the primary cause of domain shift in pathological images, thereby restricting the clinical application of deep learning algorithms by decreasing their generalizability4. Heterogeneity results from inconsistencies in the procedures performed before obtaining WSIs, such as tissue preparation, staining, and scanning5,6. For instance, inconsistencies in the formulations of hematoxylin and eosin (H&E), exposure to light, and varying storage conditions lead to color inconsistencies7. Additionally, different scanners have unique imaging properties, resulting in color and texture variations7. Histology images captured through microscopes using smartphones further add to this variability. Smartphones are widely used to capture histological images because they enable pathologists to easily consult with colleagues, seek consensus, and share images of interest8,9. This trend is particularly pronounced in developing countries, where resources might be limited10. Moreover, mobile tools such as content-based image retrieval, which provide image similarity search capabilities, have emerged to address the growing demand for assistance in pathological diagnosis using mobile phones11,12. Nevertheless, images taken with smartphones differ significantly in quality from those produced by WSI scanners. Furthermore, the wide variety of smartphone devices contributes to the variability in image quality, thereby exacerbating the problem of color and texture heterogeneity.

Color augmentation1,13,14 is a common technique used to enhance the robustness and generalizability of deep-learning models against color variation. This augmentation applies random changes in hue, saturation, and brightness to the input image; however, the degree of perturbation is a hyperparameter that is difficult to optimize. Small perturbations are ineffective at increasing robustness, whereas large perturbations may lead to an unnatural color distribution, causing a drop in the performance of the trained model14. While deep learning-based style-transfer methods have been utilized to improve robustness against both color and texture variations, histological images from diverse domains are required to train such models15. Therefore, histopathological datasets encompassing various domains could help researchers optimize the color range of augmentation or develop robust style-transfer models.
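As an illustration of such augmentation, the snippet below applies a random hue, saturation, and brightness jitter to an H&E patch with torchvision. This is only a sketch: the jitter ranges are arbitrary placeholders rather than values used in this study, and `patch` is an assumed input image.

```python
# Illustrative HSV-style colour augmentation (not the configuration used in this study).
# The jitter ranges are the hard-to-tune hyperparameters discussed above.
from torchvision import transforms

color_aug = transforms.ColorJitter(brightness=0.2, contrast=0.0,
                                   saturation=0.2, hue=0.05)
augmented_patch = color_aug(patch)  # patch: PIL.Image or torch.Tensor (C, H, W)
```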

Recent studies have provided histopathological image datasets spanning various domains. For instance, the MIDOG++16 dataset includes 503 cases of H&E-stained WSIs from seven types of cancer, obtained using five distinct WSI scanners for mitotic cell detection. Kuritcyn and colleagues17 created an image dataset consisting of 161 cases of colorectal cancer images captured using six different scanners and found that model performance decreased owing to domain shifts. The CAMELYON dataset18 was developed to construct a model for detecting tumor metastases in sentinel lymph nodes, although it was not primarily designed to address domain shifts; it comprises lymph node specimens from multiple medical institutions scanned using three distinct scanners. However, these studies have several limitations: (1) they target only one organ, except for MIDOG++, which results in a lack of tissue diversity; (2) they are limited to WSI scanners; (3) they do not focus on differences in staining conditions, which results in a lack of diversity in H&E staining; and (4) the same tissues are not captured across domains, limiting the ability to evaluate color and texture differences between domains. Kuritcyn's dataset is an exception; however, it is not publicly available. The features of each dataset are summarized in Table 1.

Table 1 Comparison of the existing datasets for domain adaptation with PathoLogy Images of Scanners and Mobile phones (PLISM).

To address this issue, we developed a dataset named PathoLogy Images of Scanners and Mobile phones (PLISM)19 (Fig. 1a). The dataset contains histopathological images from various domains, including different tissue types, staining conditions, and imaging devices. It covers a wide range of colors, similar to MIDOG++, which comprises images of specimens obtained from multiple laboratories, except for the spectrum with an extremely strong red hue. Given the high prevalence of artifact images within the strongly red-hued region of the MIDOG dataset, we believe that the data distribution of the PLISM dataset aligns with that of existing external datasets (Fig. 1b). The strength of the PLISM dataset lies in its unique design: the images encompass both WSIs and smartphone photographs capturing the same tissue, or serial sections of a tissue microarray (TMA), stained under different H&E staining conditions. Each TMA slide contained 46 different tissues from the human body, providing a diverse tissue collection. We precisely aligned these images from different domains at the patch level, which allows statistical analysis across imaging modalities and staining types. This dataset can help evaluate the robustness of an AI model across various domains, providing valuable insights into the impact of diverse imaging modalities and staining on algorithms. To the best of our knowledge, the PLISM dataset is the first of its kind to encompass a diverse collection of H&E-stained images captured using multiple imaging modalities, including smartphones, and stained using various hematoxylin solvents.

Fig. 1
figure 1

(a) PathoLogy Images of Scanners and Mobile phones (PLISM) workflow, and (b) comparison of HSV color wheels between PLISM (left) and MIDOG++ (right) images. (a) Tissue microarray slides containing 46 tissues stained under thirteen different staining conditions were scanned with six smartphones and seven slide scanners (objective lens at 40×). To ensure that the same field of view of the tissue was captured by all smartphones, each smartphone was attached to an eyepiece of the discussion microscope. Image registration was performed across all imaging modalities using smartphone images as the query (PLISM-sm subset) and across WSIs only (PLISM-wsi subset). (b) In the left and center figures, kernel density estimation plots represent the data distribution of 512 × 512 px RGB images resized to 50 × 50 px (n = 10,000), randomly extracted from each dataset. The right figure shows a plot of hue and saturation of pixels from regions with an extremely strong red hue.

Methods

Data collection

All histopathological specimens used in creating the PLISM dataset were sourced from patients who were diagnosed and underwent surgery at the University of Tokyo Hospital between 1955 and 2018. This study was approved by the Institutional Review Board of the University of Tokyo (approval number: 2381). Each TMA slide consisted of 46 different tissues extracted from formalin-fixed, paraffin-embedded human tissues, as shown in Fig. 2.

Fig. 2
figure 2

(a) Angles formed by hematoxylin and eosin (H&E) component vectors for each of the 64 staining conditions, and (b) tissue types included in tissue microarray (TMA) specimens. (a) Angles are categorized by the type of hematoxylin solvent group. Red and blue points in the boxplot indicate the maximum and minimum angles, respectively, with corresponding images shown on the right. (b) a. Adenocarcinoma. b. Neuroendocrine carcinoma. c. Squamous cell carcinoma. d. Mucinous carcinoma. e. Gastrointestinal stromal tumor. f. Liver cancer. g. Epstein-Barr virus-positive gastric cancer. h. Salivary gland. i. Clear cell carcinoma. j. Hepatocellular carcinoma. k. Dedifferentiated liposarcoma. *This tissue was lost due to detachment from the slide during slicing.

From a pool of 64 H&E staining conditions, we selected two staining conditions for each H solution category, along with the MY staining condition routinely used in our laboratory. The selection criteria were based on the color similarity between H and E: for each category, we selected the H solution with the highest color similarity to E and the one with the lowest color similarity to E. This approach ensured color diversity. Specifically, stain deconvolution was used to decompose the RGB color of the histology image into H and E color vectors. Subsequently, for each H solution category, the H solutions with the minimum and maximum angles to the E color vector were selected. A greater contrast was achieved when the angle between the two vectors was larger, resulting in a vivid color appearance with a clear distinction between the H and E components (e.g., GIVH, GMH, GVH, HRH, KRH, and LMH in Table 2 Abbrev.). Conversely, a smaller angle produced a darker tone (e.g., GIV, GM, GV, HR, KR, and LM in Table 2 Abbrev.) (Fig. 2a). Therefore, these criteria contributed to the diversity of H&E staining in the dataset. In two of these staining conditions, the sections were exposed to H for approximately 24 hours, which is considered impractical in clinical settings; these conditions were included to provide an extreme reference H color distribution. A total of 13 slides were stained using the 13 selected H&E staining conditions (Table 2).
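The angle criterion can be illustrated with a short, Macenko-style sketch: pixel optical densities are decomposed by SVD, the extreme directions in the resulting stain plane are taken as the two stain color vectors, and the angle between them is measured. This is a simplified stand-in for the deconvolution pipeline used here; the threshold and percentile values are illustrative only.

```python
# Sketch of estimating two stain colour vectors from an RGB patch and measuring
# the angle between them (Macenko-style SVD on optical density). Not the authors'
# exact pipeline; threshold and percentile values are illustrative.
import numpy as np

def stain_vector_angle(rgb_patch, od_threshold=0.15):
    """rgb_patch: (H, W, 3) uint8 array. Returns the H-E angle in degrees."""
    pixels = np.clip(rgb_patch.reshape(-1, 3).astype(float), 1, 255)
    od = -np.log(pixels / 255.0)                     # optical density per pixel
    od = od[(od > od_threshold).any(axis=1)]         # drop near-white background
    # The two dominant right-singular vectors span the plane of the two stains.
    _, _, v = np.linalg.svd(od, full_matrices=False)
    proj = od @ v[:2].T                              # project pixels onto the stain plane
    phi = np.arctan2(proj[:, 1], proj[:, 0])
    lo, hi = np.percentile(phi, (1, 99))             # robust extreme directions
    vec_a = np.array([np.cos(lo), np.sin(lo)]) @ v[:2]
    vec_b = np.array([np.cos(hi), np.sin(hi)]) @ v[:2]
    cos_angle = vec_a @ vec_b / (np.linalg.norm(vec_a) * np.linalg.norm(vec_b))
    return np.degrees(np.arccos(np.clip(cos_angle, -1.0, 1.0)))  # angle is symmetric in H/E
```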

Table 2 Hematoxylin and eosin (H&E) staining conditions used in PLISM.

Slide digitization, image registration, and image tiling

The workflow, from slide digitization to image registration, is presented in Fig. 1. Once the slides were stained with H&E, they were digitized using seven slide scanners and six smartphones to capture scan variability across devices, in addition to the variability introduced by the H&E staining procedure. The H solvents used were produced by either Sakura Finetek Japan Co., Ltd. or Muto Pure Chemicals Co., Ltd. (Tokyo, Japan). The scanners used in PLISM are listed in Table 3. All slides were scanned at the maximum resolution of each scanner, which ranged from 0.22 to 0.26 µm/pixel.

Table 3 Whole-slide image (WSI) scanners used in the dataset.

We used discussion heads attached to an Olympus BX-53 microscope (EVIDENT Co. Ltd., Tokyo, Japan) to capture the same microscopic field with various smartphones (Fig. 1a). Each discussion head was attached to a different smartphone. The smartphones used in PLISM are listed in Table 4. All images were captured at 400× magnification. The Open Camera20 tool was used for the Android smartphones, whereas the Camera+ 2 tool21 was employed for the iPhones. Images were captured by manually pressing a Bluetooth switch (ELECOM, P-SRBBK 4953103305977). This setup allowed images of the same tissue structure to be captured simultaneously on all devices without motion blur from the capturing process. The camera specifications for each smartphone and the imaging settings for each tool are presented in Tables 4 and 5, respectively.

Table 4 Specifications of smartphone cameras.
Table 5 Imaging settings for the camera applications.

All slides were initially stored in their original vendor formats. Subsequently, we used VALIS22, an open-source image registration package, to perform both rigid and non-rigid registration among the WSIs, designating the slide stained with Mayer stain (MY) and scanned by the Hamamatsu NanoZoomer S60 as the reference slide and setting the ‘align_to_reference’ parameter to True.
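A minimal sketch of this registration step, following the VALIS quickstart API, is shown below; the directory paths and the reference slide filename are placeholders.

```python
# Sketch of the rigid + non-rigid WSI registration step with VALIS.
# Directory and file names are placeholders, not the actual dataset layout.
from valis import registration

registrar = registration.Valis(
    "path/to/wsi_slides",            # source directory containing the WSIs
    "path/to/registration_results",  # output directory
    reference_img_f="MY_S60.ndpi",   # hypothetical filename of the reference slide
    align_to_reference=True,         # align every slide directly to the reference
)
rigid_registrar, non_rigid_registrar, error_df = registrar.register()
registration.kill_jvm()              # VALIS uses a JVM (Bio-Formats); shut it down
```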

We created two PLISM subsets from the images:

1. PLISM-wsi contains only WSI images. Registration was performed across all scanners and staining conditions. There were 3,417 aligned image groups, with a total of 310,947 (3,417 groups × 91 WSIs) image patches.

2. PLISM-sm includes both smartphone and WSI images. Registration was performed across all scanners and smartphones under each staining condition. There were a total of 4,454 aligned image groups containing 57,902 (4,454 groups × 13 devices) images.

For the PLISM-sm subset, smartphone images were used as queries, and OpenCV’s AKAZE23 key-point matching algorithm was employed to extract the corresponding tissue regions from each WSI with a matching stain type. In PLISM-sm, the size of the reference WSI images varied for each smartphone image. Rigid registration was then performed twice to align the WSI and smartphone images using ‘cv2.findHomography’ and ‘cv2.warpPerspective’ with RANSAC24 filtering. After alignment, each image was center-cropped to 512 × 512 pixels because we observed large registration errors around the edges of the images. For the PLISM-wsi subset, we tiled all 91 WSIs into 1024 × 1024 pixel patches without overlap, regardless of staining. We then again performed rigid registration on the AT2, GT450, and P scanner images using OpenCV’s AKAZE23 key-point matching algorithm to align them with the S60 scanner images because of their relatively large visible misalignment. Each image was center-cropped to 512 × 512 pixels. Figure 3 shows the number of images classified by tissue type, staining type, and imaging device for both subsets. Example images from both subsets are presented in Fig. 4.
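The key-point-based rigid alignment described above can be sketched with standard OpenCV calls (AKAZE features, ratio-test matching, RANSAC homography, perspective warp, and a center crop); the function below is an illustrative reconstruction rather than the exact script used.

```python
# Sketch of rigid alignment of a moving image to a reference image using AKAZE
# key points, RANSAC-filtered homography, and a 512 x 512 centre crop.
import cv2
import numpy as np

def align_to_reference(moving_bgr, reference_bgr, crop=512):
    akaze = cv2.AKAZE_create()
    kp_m, des_m = akaze.detectAndCompute(cv2.cvtColor(moving_bgr, cv2.COLOR_BGR2GRAY), None)
    kp_r, des_r = akaze.detectAndCompute(cv2.cvtColor(reference_bgr, cv2.COLOR_BGR2GRAY), None)
    matches = cv2.BFMatcher(cv2.NORM_HAMMING).knnMatch(des_m, des_r, k=2)
    good = []
    for pair in matches:                              # Lowe ratio test
        if len(pair) == 2 and pair[0].distance < 0.75 * pair[1].distance:
            good.append(pair[0])
    src = np.float32([kp_m[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
    dst = np.float32([kp_r[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)
    H, _ = cv2.findHomography(src, dst, cv2.RANSAC, ransacReprojThreshold=5.0)
    h, w = reference_bgr.shape[:2]
    warped = cv2.warpPerspective(moving_bgr, H, (w, h))
    y0, x0 = (h - crop) // 2, (w - crop) // 2          # centre crop to avoid edge errors
    return warped[y0:y0 + crop, x0:x0 + crop]
```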

Fig. 3
figure 3

The number of images per tissue, staining condition, and imaging device in (a) PLISM-sm and (b) PLISM-wsi.

Fig. 4
figure 4

Example images of (a) PLISM-sm and (b) PLISM-wsi. Because the same section cannot be stained under different conditions, serial sections were stained, resulting in slightly different tissue appearances between staining conditions while preserving the tissue components.

Data validation and quality control

All glass slides included in the PLISM dataset were manually stained and scanned by M. O. and experienced technical staff. The staining quality was visually confirmed by a board-certified pathologist (M. O.), and all images were manually inspected after tiling the WSIs. The quality of the tiled images was assessed by technical staff (H. E. and K. S.), and tile-cropped images with missing parts or significant focus problems were excluded from the analysis.

The registration quality was also evaluated. The target registration error (TRE), defined as the median distance between the registered target features in an image and the corresponding matched features22 in the reference image, was 43 μm for registration between serial sections. This value is within the expected range for serial sections, in which target points can shift between sections; as Gatenbee et al. demonstrated in the original VALIS study22, the TRE between serial WSIs after both rigid and non-rigid registration is approximately 20–100 μm. Furthermore, to evaluate the quality of registration between WSIs and smartphone images, we manually refined the position of each landmark so that it was located on the same cell or prominent tissue landmark in the corresponding WSI–smartphone group images. In total, we created and manually checked 325 registration points on 65 patch images, which yielded a TRE of 1.0 μm. All groupwise images were manually checked by H.E., K.S., and M.O., and misaligned image groups were removed.

Evaluation methods

To test whether PLISM can improve the robustness of convolutional neural networks (CNNs) on out-of-domain datasets and exceed conventional color augmentation, we pretrained two ResNet18 models using SimCLR25, a self-supervised learning method. The PLISM-full model was pretrained on the PLISM-sm subset, whereas the PLISM-WSIonly model was pretrained on the PLISM-wsi subset. Both models were pretrained with 224 × 224 pixel images from their respective subsets for 1,000 epochs using the same augmentation method as Ciga et al.26, with a batch size of 256 and a learning rate of 0.3 × (batch size)/256. For comparative evaluation, we also assessed CNN models trained on two other datasets: one featuring over one million general images from ImageNet, with and without the HED-light color augmentation method, which demonstrated the best performance across various histology datasets1, and the other comprising 57 histology datasets from the study by Ciga et al.26. The latter included images from various organs, captured at magnifications ranging from 10× to 100×, and predominantly stained with H&E.

We evaluated the pretrained model on two multiclass classification datasets for colorectal adenocarcinoma: Kather1927 and CRC-TP28, and a binary classification dataset for the presence or absence of breast cancer metastasis in sentinel lymph nodes: Camelyon1718. For the colorectal adenocarcinoma datasets, we modified the datasets to focus exclusively on the six common classes in both the Kather19 and CRC-TP datasets: debris (DEB), lymphocytes (LYM), muscle (MUS), normal glands (NORM), simple stroma (STR), and tumor epithelium (TUM).

For training on the downstream tasks, only the linear classification layer was trained, whereas the remaining layers were frozen to rigorously evaluate the performance of the pretrained model. We then evaluated the predictions against the ground-truth labels and computed the mean F1 score across all test patch images of each tissue type. For the colorectal adenocarcinoma datasets, Kather19 was used for training and CRC-TP for testing, and then CRC-TP was used for training and Kather19 for testing. The Camelyon17 dataset included cases from five institutions, and we tested 10 combinations of institutions for the training and testing datasets. We trained the linear layer 10 times with random initial weights for each training-testing combination for both the colorectal adenocarcinoma and breast cancer metastasis datasets.
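A minimal sketch of this linear-probe protocol is shown below: the SimCLR-pretrained ResNet18 backbone is frozen, a single linear layer is trained, and the macro F1 score is computed on the test set. The checkpoint filename, data loaders, and optimizer settings are assumptions, not the study's configuration.

```python
# Sketch of a linear probe on a frozen, SimCLR-pretrained ResNet18 backbone.
import torch
import torch.nn as nn
from torchvision.models import resnet18
from sklearn.metrics import f1_score

backbone = resnet18(weights=None)
backbone.load_state_dict(torch.load("plism_simclr_resnet18.pth"))  # hypothetical checkpoint
backbone.fc = nn.Identity()                  # expose the 512-d feature vector
for p in backbone.parameters():
    p.requires_grad = False                  # freeze all backbone layers
backbone.eval()

classifier = nn.Linear(512, 6)               # six shared tissue classes
optimizer = torch.optim.SGD(classifier.parameters(), lr=0.01)  # illustrative settings
criterion = nn.CrossEntropyLoss()

for images, labels in train_loader:          # train_loader: assumed DataLoader
    with torch.no_grad():
        feats = backbone(images)
    loss = criterion(classifier(feats), labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

preds, gts = [], []
with torch.no_grad():
    for images, labels in test_loader:       # test_loader: assumed DataLoader
        preds.extend(classifier(backbone(images)).argmax(1).tolist())
        gts.extend(labels.tolist())
print("macro F1:", f1_score(gts, preds, average="macro"))
```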

Statistical analysis

Data were analyzed using Python version 3.8.5 and R version 4.3.1. Statistical significance was assessed using Welch’s t-test with Bonferroni correction and a two-way analysis of variance (ANOVA) with the “SciPy” library (see “Quality control for differences in color variation across staining conditions and imaging devices in identical tissue images”), and the Kruskal–Wallis test with the “ggpubr” library (Fig. 7). All statistical tests were two-sided, and a p value < 0.05 was considered statistically significant.

Data Records

Image data & lists of data

The entire PLISM dataset is publicly available on figshare plus under the CC-BY 4.0 license19. PLISM-wsi and PLISM-sm were deposited separately. The folder structure for each subset is as follows.

1. PLISM-wsi consists of image groups spanning all staining conditions and WSIs for each tile image. Image groups from the same field of view share common coordinates in their filenames.

├(stain_name)_(device_name)/

└(stain_name)_(device_name)_(top_left_x)_(top_left_y).png

A list of images selected through quality control by visual assessment is provided as a CSV file with the following columns:

  • Tissue Type: the tissue type, one of the 46 human tissue types.

  • Stain Type: the staining condition, one of the 13 conditions.

  • Device Type: the imaging device, one of the 13 devices.

  • Coordinate: the xy coordinates of the upper-left corner of each WSI image (e.g., 1000_500)*.

  • Image Path: the relative path to each image file.

2. PLISM-sm, in which smartphone images were used as queries to create image groups for each staining condition corresponding to each tile image. Image groups from the same field of view share common coordinates in their filenames, which correspond to the WSI coordinates captured using the AT2 device under the respective staining conditions.

├(stain_name)/

└(device_name)/

└(top_left_x)_(top_left_y)_(right_lower_x)_(right_lower_y).png

A list of images selected through quality control by visual assessment is provided as a CSV file with the following columns:

  • Tissue Type: the tissue type, one of the 46 human tissue types.

  • Stain Type: the staining condition, one of the 13 conditions.

  • Device Type: the imaging device, one of the 13 devices.

  • Coordinate: the xy coordinates of the upper-left and lower-right corners of each WSI image (e.g., 10000_5000 104000_9000)*.

  • Image Path: the relative path to each image file.

All images were saved in PNG format. The original 91 WSIs are also publicly available19. The asterisks (*) indicate that the coordinates refer to each image before center-cropping.
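As an example of how the released files can be consumed, the sketch below groups the PLISM-wsi quality-control CSV by the Coordinate column to recover aligned image groups across staining conditions and devices; the CSV filename and dataset root path are placeholders.

```python
# Sketch of loading aligned PLISM-wsi image groups from the quality-control CSV.
# The root directory and CSV filename are placeholders; column names follow the
# descriptions above (Tissue Type, Stain Type, Device Type, Coordinate, Image Path).
from pathlib import Path
import pandas as pd
from PIL import Image

root = Path("PLISM-wsi")                                      # dataset root (placeholder)
qc = pd.read_csv(root / "plism_wsi_quality_controlled.csv")   # hypothetical filename

for coordinate, group in qc.groupby("Coordinate"):
    images = {
        (row["Stain Type"], row["Device Type"]): Image.open(root / row["Image Path"])
        for _, row in group.iterrows()
    }
    # `images` now maps (stain, device) -> the 512 x 512 patch of the same tissue region
    break  # only the first group, for illustration
```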

Technical Validation

Quality control for differences in color variation across staining conditions and imaging devices in identical tissue images

For quality control, and to demonstrate the diversity of color and texture in the H&E-stained images of PLISM across staining conditions and imaging devices, we utilized the PLISM-sm subset, which includes all device types and captures the same tissue image within each group. We statistically tested the differences in color distribution for each of the Hue, Saturation, Value (HSV) components between different devices and between different staining conditions. Among the different devices, 216 (92.3%) of the 234 combinations (78 device pairs × 3 HSV components) were significantly different after Bonferroni correction. Similarly, among the different staining conditions, 215 (91.8%) of the 234 combinations were significantly different. These results suggest that almost all devices and staining types exhibit distinct color characteristics. The differences in HSV values according to device and staining type are presented in Fig. 5. There was a greater difference in hue between smartphones and WSIs than among the WSI combinations, as well as a slight difference in saturation between WSIs and smartphones. Across staining conditions, a similar trend of differences in hue and saturation between smartphones and WSIs was observed; however, saturation showed a greater difference. As expected, HR staining with overnight H exposure had a strong H component, resulting in substantially different saturation and value compared with the other staining types. In contrast, GIV staining, the other condition with overnight H exposure, showed a smaller difference in color.
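The pairwise testing described above can be sketched as follows: Welch's t-tests on per-image mean HSV values for every device pair, Bonferroni-corrected over all 234 tests. The `hsv_means` dictionary (device name mapped to an array of per-image mean H, S, and V) is an assumed input, not part of the released files.

```python
# Sketch of pairwise Welch's t-tests on mean HSV values with Bonferroni correction.
# hsv_means: assumed dict mapping device name -> (n_images, 3) array of mean H, S, V.
from itertools import combinations
from scipy.stats import ttest_ind

n_pairs = len(hsv_means) * (len(hsv_means) - 1) // 2
n_tests = n_pairs * 3                           # 78 device pairs x 3 channels = 234
significant = []
for dev_a, dev_b in combinations(hsv_means, 2):
    for ch, channel in enumerate(("hue", "saturation", "value")):
        t, p = ttest_ind(hsv_means[dev_a][:, ch], hsv_means[dev_b][:, ch],
                         equal_var=False)       # Welch's t-test
        if min(p * n_tests, 1.0) < 0.05:        # Bonferroni-corrected threshold
            significant.append((dev_a, dev_b, channel, t))
```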

Fig. 5
figure 5

T-values for mean Hue, Saturation, Value (HSV) values by (a) Device type, and (b) Staining condition. The color scale indicates the absolute t-value in unpaired t-tests for each device and staining condition regarding the mean HSV values in the PLISM-sm subset.

To determine whether the differences in the HSV components were attributable mainly to staining or to device type, we compared the sums of squares from a two-way ANOVA on staining and device type. The analysis indicated that device type contributed more to hue (sum of squares, device vs. stain: 947.5 vs. 332.7, p < 0.000001), whereas staining contributed more to saturation (stain vs. device: 360.3 vs. 165.7, p < 0.000001) and value (stain vs. device: 318.9 vs. 120.7, p < 0.000001). This finding suggests that, despite staining with various H solvents, device type had a greater influence than staining on image coloration in our dataset.
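The sum-of-squares comparison can be reproduced, for example, with a two-way ANOVA in statsmodels, shown here as an illustrative alternative to the SciPy-based analysis reported above; `df` is an assumed per-image data frame with 'hue', 'device', and 'stain' columns.

```python
# Sketch of a two-way ANOVA comparing the contributions of device and stain to hue.
# `df` is an assumed pandas DataFrame with one row per image.
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm

model = ols("hue ~ C(device) + C(stain)", data=df).fit()
table = anova_lm(model, typ=2)            # 'sum_sq' column: contribution of each factor
print(table[["sum_sq", "PR(>F)"]])
```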

Subsequently, we assessed the color and texture differences in the feature space produced by the CNN model (Fig. 6). Interestingly, the images captured by the smartphones and the WSIs were clearly divided into two distinct clusters (Fig. 6a,b). While HR staining with overnight hematoxylin exposure formed a separate cluster, no clear clusters were formed for the other staining types or devices (Fig. 6c,d). However, a closer examination of the WSI cluster in Fig. 6e reveals that the images are loosely grouped by tissue type. These results suggest that the broad categories of devices, such as WSIs and smartphones, as well as tissue types, have distinct color tones and textures in the images.

Fig. 6
figure 6

t-distributed stochastic neighbor embedding (t-SNE) plots for the PLISM-sm subset. Each image was encoded using the deep texture representation (DTR) method11 with a VGG16 model pre-trained on ImageNet. For the PLISM-sm subset, we calculated the 1024-dimensional DTR of each image, performed dimensionality reduction using t-SNE from the Python scikit-learn library with learning_rate = ‘auto’, and plotted the results in 2D. The figure shows the t-SNE plots for (a) WSI versus smartphones, (b) original images, (c) staining types, (d) device types, and (e) tissue types.
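The embedding step described in this caption corresponds to a call such as the one below, where `features` is an assumed array holding the 1024-dimensional DTRs; parameters other than learning_rate are illustrative.

```python
# Sketch of the dimensionality-reduction step: 2-D t-SNE of per-image feature vectors.
# `features` is an assumed (n_images, 1024) NumPy array of DTRs.
from sklearn.manifold import TSNE

embedding = TSNE(n_components=2, learning_rate="auto", init="pca",
                 random_state=0).fit_transform(features)
```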

Improving robustness to domain shift using PLISM-pretrained convolutional neural networks

We performed two sets of experiments: first, we used Kather19 for training and CRC-TP for testing (Fig. 7a); second, we reversed the roles, employing CRC-TP for training and Kather19 for testing (Fig. 7b).

Fig. 7
figure 7

Out-of-distribution performance of convolutional neural network (CNN) models trained on various image datasets. Macro F1 scores where (a) Kather19 was used as the training dataset and CRC-TP as the test dataset, and (b) CRC-TP was used as the training dataset and Kather19 as the test dataset. Training and inference were performed 10 times for both (a) and (b). For the Camelyon17 dataset, combinations of training and test datasets were created for 10 facility combinations; (c) shows the macro F1 scores across all combinations, and (d) the macro F1 scores for each combination of training and test datasets. Training and inference were performed 10 times for each combination. In the box plots, the lower and upper hinges correspond to the 25th and 75th percentiles, respectively; the upper whisker extends from the hinge to the largest value no further than 1.5 × the interquartile range (IQR) from the hinge, and the lower whisker extends from the hinge to the smallest value at most 1.5 × IQR from the hinge. ns: p > 0.05, *p ≤ 0.05, **p ≤ 0.01, ***p ≤ 0.001, ****p ≤ 0.0001.

As shown in Fig. 7a, the model pretrained on the PLISM-sm subset (PLISM-full) significantly outperformed the ImageNet models, both with and without the HED-light color augmentation method, as well as Ciga’s model in terms of macro F1 scores (p ≤ 0.0001). In contrast, no significant differences were observed when comparing the model pretrained on the PLISM-wsi subset (PLISM-WSIonly) with the model by Ciga et al. In Fig. 7b, the ImageNet model without HED-light augmentation and Ciga’s model showed results similar to those in Fig. 7a relative to the PLISM models; however, the ImageNet model without HED-light augmentation and the PLISM-full model demonstrated comparable F1 scores.

We also tested the models using Camelyon17. This dataset included cases from five institutions, and we tested 10 combinations of institutions for the training and test datasets. As shown in Fig. 7c, both PLISM-full and PLISM-WSIonly had significantly higher F1 scores than the ImageNet model without HED-light augmentation and Ciga’s model (p ≤ 0.001), and scores comparable to those of the ImageNet model with HED-light augmentation. For the individual combinations shown in Fig. 7d, the models pretrained on PLISM significantly outperformed the other models in terms of F1 scores in seven of the 10 combinations. These results suggest that PLISM effectively simulates pathological images across various domains, and that models pretrained on PLISM have the potential to outperform conventional color augmentation methods when used independently.