Artificial intelligence and computational pathology

Cui, Miao; Zhang, David Y.

doi:10.1038/s41374-020-00514-0

Download PDF

Review Article
Published: 16 January 2021

Artificial intelligence and computational pathology

Laboratory Investigation volume 101, pages 412–422 (2021)Cite this article

57k Accesses
220 Citations
73 Altmetric
Metrics details

Subjects

Abstract

Data processing and learning has become a spearhead for the advancement of medicine, with pathology and laboratory medicine has no exception. The incorporation of scientific research through clinical informatics, including genomics, proteomics, bioinformatics, and biostatistics, into clinical practice unlocks innovative approaches for patient care. Computational pathology is burgeoning subspecialty in pathology that promises a better-integrated solution to whole-slide images, multi-omics data, and clinical informatics. However, computational pathology faces several challenges, including the ability to integrate raw data from different sources, limitation of hardware processing capacity, and a lack of specific training programs, as well as issues on ethics and larger societal acceptable practices that are still solidifying. The establishment of the entire industry of computational pathology requires far-reaching changes of the three essential elements connecting patients and doctors: the local laboratory, the scan center, and the central cloud hub/portal for data processing and retrieval. Computational pathology, unlocked through information integration and advanced digital communication networks, has the potential to improve clinical workflow efficiency, diagnostic quality, and ultimately create personalized diagnosis and treatment plans for patients. This review describes clinical perspectives and discusses the statistical methods, clinical applications, potential obstacles, and future directions of computational pathology.

Digital pathology and artificial intelligence in translational medicine and clinical practice

Article Open access 05 October 2021

Artificial intelligence in histopathology: enhancing cancer research and clinical oncology

Article 22 September 2022

A new era: artificial intelligence and machine learning in prostate cancer

Article 15 May 2019

Introduction

Artificial intelligence (AI) refers to the simulation of the human mind in computer systems that are programmed to think like humans and mimic their actions such as learning and problem-solving. AI should be able to perform tasks that normally require human intelligence, such as visual perception, decision-making, and communication. AI-based computational pathology as an emerging discipline has recently shown great promise to increase both the accuracy and availability of high-quality health care to patients in many medical fields. The primary forces and limitations in this field are: (1) a shortage of experienced pathologists and the limitation of global health care resources [1]; (2) the ever increasing amount of health data available, including digital images, omics, clinical records, and patient demographic information, being generated through the process of patient care [2]; (3) the increased complexity that is created in managing and integrating the data across different sources in order to maximize patient care; and (4) machine learning-based algorithms need to be efficiently harnessed in order to process and understand the big data [3]. AI technologies have the ability to handle the gigantic quantity of data created throughout the patient care lifecycle to improve pathologic diagnosis, classification, prediction, and prognostication of diseases.

The most important advantage of the computational pathology is to reduce errors in diagnosis and classification. The Camelyon Grand Challenge 2016 (CAMELYON16 challenge), is a worldwide machine learning-based program to evaluate new algorithms for the automated detection of cancer in hematoxylin and eosin (H&E)-stained whole-slide imaging (WSI), has achieved encouraging results with a 92.4% sensitivity in tumor detection rate. In contrast, a pathologist could only achieve 73.2% sensitivity [4]. Computational pathology has the potential to transform the traditional core functions of pathology and not just growing sub-segments such as digital pathology, molecular pathology, and pathology informatics [5, 6]. Computational pathology aims to improve diagnostic accuracy, optimize patient care, and reduce costs by bringing global collaboration. As the rapid technological advancement drives individualized precision medicine [7], computational pathology is a critical factor in achieving this goal.

Digital pathology, machine learning, and computational pathology

The development of brightfield and fluorescent slide scanners made possible the virtualizing and digitalizing the whole glass slides [8]. Digital pathology includes the process of digitizing histopathology, immunohistochemistry or cytology slides using whole-slide scanners as well as the interpretation, management, and analysis of these digitized whole-slide images using computational approaches. The digital data of the slides can be stored in a central cloud-based space allowing for remote access to the information for manual review by a pathologist or automated review by a data algorithm. It makes AI, a branch of computational science which generates the data algorithms, to be applied in pathology possible [9]. Based on the degree of intelligence, AI can currently be divided into two major categories: weak AI and strong AI (Table 1). Weak AI, also known as artificial narrow intelligence, refers to the classification of data based on a well-established statistic model that has already been trained to perform specific tasks [10]. In contrast, strong AI, also known as artificial general intelligence (AGI), can create a system, which can function intelligently and independently by executing machine learning from any available normalized data.

Table 1 List of terms and abbreviations appearing in this paper.

Full size table

Generally, machine learning is an AI process to allow a computer system to automatically learn and improve from the data set by itself and to solve problems without being programmed during the process. Machine learning is an advanced branch of AGI using a large amount of initial data, training set, to build statistic algorithms to interpret and act on new data later on [11]. At present, various machine learning-based approaches have been developed and tested in pathology to assist pathologic diagnosis using the basic morphology pattern such as cancer cells, cell nuclei, cell divisions, ducts, blood vessels, etc. [12]. Deep learning (also known as deep structured learning) is a subfield of machine learning based on artificial neural networks (ANNs) in which the statistic models are established from input training data [13]. Deep neural networks provide architectures for deep learning. The ANNs can perform its own determination as to whether its interpretation or prediction is correct, resembling a biological complex neural network of the human brain [14]. ANNs are comprised of three functional layers of artificial neurons, known as “nodes”, which include an input layer, multiple hidden layers, and an output layer. The artificial neurons are connected to each other in the ANNs and the strength of their connections is known as “weights”. The connections between artificial neurons in the ANNs are assessed using statistic methods, including clustering algorithms, K-nearest neighbor, support vector machines (SVM), and logistic regressions [15]. The involved artificial neurons, which are related to the output event, and their associated connections, which bear different “weight”, need to be trained by qualified big data set to achieve an optimized algorithm for specific tasks (Fig. 1). The convolutional neural networks are a type of deep multilayer neural networks particularly designed for visual image. It employs convolutional kernels, a set of learnable filters, to build up a pooling layer that can effectively reduce the dimensions of the image data while still retaining its characteristics (Fig. 2). By flattening an image, removing or reducing the dimensions, convolutional kernels act as a preprocess treatment that then allows for computer vision and machine vision models to process, analyze, and classify the digital images, or parts of the image, into known categories.

**Fig. 1: The basic structure of a deep neural network.**

**Fig. 2: The principle of convolutional neural networks.**

With slide scanning technology getting faster and more reliable, a larger volume of WSI data becomes available to train and validate convolutional neural network models. In combination with clinical information, biomarkers, and multi-omics data, computational pathology will become part of the new standard of care [16]. Computational pathology not only facilitates a more efficient pathology workflow, but also provides a more comprehensive and personalized view, enabling pathologists to address the progress of complex diseases for better patient care [17].

Algorithm training progress

Case selection

Patients’ selection is the initial step to train the algorithm (Fig. 3). Both training set and validation set must include all sample types or variants, which are related to the subject of diseases including stages, grades, histologic classification, complication, etc. to eliminate false-negative and false-positive scenarios. Still very much a machine driven process, algorithms have no way to recognize the variants that has not been included in the training set. The criteria for the samples and subsequent slide selection for the learning set need to be established by experienced pathologists alongside a computational team. Confounding variables have to be isolated and removed. For example, the patients with other medical conditions who may interfere with the outcome. In addition, inadequate slide preparation including blurred vision, over- or under-staining, air-bubbles, and folded tissue can produce inaccurate resulting algorithms. The comprehensive initial and follow-up clinical information, as well as laboratory results, should be collected and included. The more relevant the information included, the more accurate the resulting algorithm.

**Fig. 3: Flow chart of algorithm training.**

Whole-slide imaging (WSI)

Several slide scanning systems for whole-slide imaging have been approved by the US Food and Drug Administration (FDA) to be used in clinical settings (Table 2) [18]. The first FDA-approved Ultra-Fast Scanner, the Philips IntelliSite Pathology Solution (PIPS), has a resolution of 0.25 μm/pixel, scanning speed of 60 s for a 15 × 15-mm scan area and scanning capacity of 300 slides in one load [19]. The Aperio AT2 DX System from Leica Biosystems has 400 slide capacity for brightfield and fluorescent slides [20]. File sizes of digital images at applicable resolution vary depending on the scan area on the glass slides. In general, pathology images are tremendously large, in the range of 1–3 GB per image. Therefore, it requires a high-capacity and fast digital working computer. Furthermore, the number of slides needed to achieve a clinically accepted algorithm may vary by tissue type and diagnosis. Campanella et al. showed that at least 10,000 slides are necessary for training to reach a good performance. The authors also observed the discrepancy of the prediction between Leica Aperio and PIPS and found that brightness, contrast, and sharpness affect the prediction performance [21].

Table 2 Whole-slide images systems.

Full size table

Image analysis and automation

For digital slide analysis, Senaras et al. [22] described a novel deep-learning framework, called DeepFocus, which enables the automatic identification of blurry regions in digital slides for immediate re-scan in order to improve image quality for pathologists and image analysis algorithms. Janowczyk et al. [23] presented an open-source tool called HistoQC to assess color histograms, brightness, and contrast of each slide and to identify cohort-level outliers (e.g., darker or lighter stain than other slides in the cohort). These methods play an essential role in the quality control of whole-slide images to standardize the quality of images in computational pathology. Due to improvements in various smart image-recognition algorithmic discriminators, based on high-capacity deep neural network models [24], the pathologist can be released from extensive manual annotations for each whole-slide images at the pixel level so that they can focus other parts of the clinical workflow. The patch-based whole-slide images (224 × 224 to 256 × 256) have been widely used in many machine learning domains to train classifiers for diagnostic or prognostic tasks. For example, Campanella et al. [21] employed multiple instance learning (MIL) approaches with “bag” and “instance” based on convolutional neural networks and recurrent neural networks to classify the prostate cancer images of H&E slides. Kapil et al. [25] applied deep semi-supervised architecture and auxiliary classifier generative adversarial networks, including one generator network and one discriminator network, to automatically analyze the PD-L1 expression in immunohistochemistry slide of late stage non-small cell lung cancer needle biopsies. Barker et al. [26] revealed an elastic net linear regression model and weighted voting system to differentiate glioblastoma multiforme and lower-grade glioma with an accuracy of 93.1%.

Pathologist-centered medical system

Although most AI research is still focused on the detection and grading of tumors in digital pathology and radiology, computational pathology is not limited to the detection of a morphological pattern. It can also contribute to the complex process of analysis and judgment using demographic information, digital pathology, -omics, and laboratory results [27]. Therefore, AI has the potential to contribute to nearly all aspects of the clinical workflow, from more accurate diagnosis to prognosis, and individualized treatment. Multiple sources of clinical data [28] are incorporated into mathematic models to generate diagnostic inferences and predictions, to enable physicians, patients, and laboratory personnel to make the best possible medical decisions [29]. For example, deep neural networks have been applied to automated biomarker assessment of breast tumor images, such as HER2, ER, and Ki67 [30]. Hamidinekoo et al. [31] created a novel convolutional neural network-based mammography–histology–phenotype–linking–model to connect and map the features and phenotypes between mammographic abnormalities and their histopathological representation. Mobadersany et al. [32] developed a genomic survival convolutional neural network model to integrate information from both histology images and genomic data to predict time-to-event outcomes and demonstrated the prediction accuracy surpassed the current clinical paradigm for predicting the overall survival of patients diagnosed with glioma.

As electronic health record (EHR) systems enable us to collect medical data such as age, race, gender, social history, and clinic history, applying these data as independent factors of a particular disease to an appropriate mathematic algorithm becomes feasible [33, 34]. These integrated data allow pathologists to gain deeper insights and to switch between different algorithms of treatment at different stages of the disease and/or for different statuses of the patient. As the health-related apps on mobile devices and smart personal trackers become popular, direct access to continuous real-time health information, such as temperature, heart rate, respiratory rate, electrocardiogram, body mass index, blood glucose, and blood oxygen content, can be recorded into individual health data. These data can then be incorporated into the EHR and laboratory information systems (LIS) [35] to reintegrate into a virtualized and digitalized person, which was not possible previously and was beyond what the human brain alone can accomplish [36] (Fig. 4). This new system of data-driven care requires the pathology, as a cornerstone of modern medicine, to integrate data, algorithms, and analytics to deliver high-quality and efficient care. The combination of computational pathology and big data mining offers the potential to create a revolutionary way of practicing evidence-based, personalized medicine.

**Fig. 4: Pathologist-centered medical system.**

Global pathology service model

Three essential advancements happened in recent years: the possibility to store a great amount of data from network-attached storage to cloud storage, the growing speed of network from WIFI-6 to 5 G, and high-performance central processing unit (CPU) and graphics processing unit. These technological improvements not only enhance people’s daily life, but also have a great impact on medicine, especially digital and computational pathology [37] (Fig. 5). Together with the surging development of network and information technology, these technologic improvements allow for the centralization of medical and computing resources—with the benefit of larger sample data volume for optimization of algorithms. Furthermore, the central cloud-based AI laboratory and data bank of digital and computational pathology make the global network of computational pathology possible. In local laboratories or centralized scanning centers, histology slides can be converted to whole-slide images and numerical data. These data can then be transferred to the central laboratory together with EHR data and multi-omics data for further analysis [38] (Fig. 6). Patients in different geographic areas around the world can benefit from more efficient and effective diagnosis, treatment, and follow-up. In the meantime, pathologists are able to access the information they need to care for patients or to collaborate with specialists anytime and anywhere. Deep-learning platforms have the potential to facilitate the discovery of more complicated or subtle connections and to help pathologists make the best clinical decisions to meet every patient’s needs.

**Fig. 5: The relation between different levels of artificial intelligence and the four bottlenecks that are facing currently.**

**Fig. 6: Global pathology service model.**

Examples of pathology AI

Increasingly, AI detection is being applied to different subspecialties with various sample types [39]. Early reports on accuracy have shown to be promising and that the AI-assisted systems have the potential to classify accurately at an unprecedented scale [40] and lay the foundation for the deployment of computational pathology in nearly all subspecialties [41].

Prostate cancer

Campanella et al. validated a high-capacity deep neural network-based algorithm to analyze image classification and categorization of 44,732 whole-slide images across three different cancer types, including prostate cancer, basal cell carcinoma, and breast cancer metastases to axillary lymph nodes. In terms of whole-slide images, they found that ×5 magnification has higher accuracy. They trained a statistic model with MIL-based tile classifier for each tissue type and achieved area under receiver operating curve (AUC) above 0.98 for all cancer types. Its clinical application would allow pathologists to exclude 65–75% of slides while retaining 100% sensitivity [21]. Wildeboer et al. discussed deep-learning techniques based on different imaging sources including magnetic resonance imaging, echogenicity in ultrasound imaging, and radio density in computed tomography as computer-aided diagnostic tools for prostate cancer. They found that the algorithm of convolutional neural network architecture performed equal or better than SVM or random forest classifiers in machine learning [42].

Colorectal cancer

Korbar et al. developed multiple deep-learning algorithms, modified version of a residual network architecture, which can accurately classify whole-slide images of five types of colorectal polyps, including hyperplastic, sessile serrated, traditional serrated, tubular, and tubulovillous/villous polyps. Among 2074 of images, 90% of them were used for model training and the remaining 10% of images were assigned to the validation set. The overall accuracy for classification of colorectal polyps was 93% (confidence interval (CI) 95%, 89.0–95.9%) [43]. Bychkov et al. combined convolutional neural networks and recurrent neural network architectures to predict colorectal cancer outcomes based on tissue microarray (TMA) samples from 420 colorectal cancer patients. Their results show that the AUC of deep neural network-based outcome prediction was 0.69 (hazard ratio, 2.3; CI 95%, 1.79–3.03). For comparison, pathology experts performed inferiorly on both TMA samples (HR, 1.67; CI 95%, 1.28–2.19; AUC, 0.58) and whole-slide level (HR, 1.65; CI 95%, 1.30–2.15; AUC, 0.57), which implied that deep neural networks could extract more prognostic information from the tissue morphology of colorectal cancer than an experienced pathologist [44].

Breast cancer

Wang et al., the team of winner of competitions in the CAMELYON16 challenge, used input 256 × 256 pixel patches from positive and negative regions of the whole-slide images of breast sentinel lymph nodes to train various classification models including GoogLeNet Patch, AlexNet, VGG16, and FaceNet. The patch classification accuracy is 98.4, 92.1, 97.9, and 96.8% separately. Among the algorithms, GoogLeNet has the best performance and is generally faster and more stable, which achieved AUC of 0.925 for whole-slide images classification. With the assistance of deep-learning system, the accuracy of pathologist’s diagnoses improved significantly as the AUC increased from 0.966 to 0.995, representing ~85% reduction of human error rate [45]. Furthermore, the open resource of a data set of annotated whole-slide images for CAMELYON16 and CAMELYON17 challenges enable testing of new machine learning and image analysis strategies for digital pathology [46].

Cytology

Martin et al. applied convolutional neural networks for classifying cervical cytology images into five diagnostic categories, including negative for intraepithelial lesion or malignancy, atypical squamous cells of undetermined significance, low-grade squamous intraepithelial lesion, atypical squamous cells cannot exclude how-grade squamous intraepithelial lesion and high-grade squamous intraepithelial lesion, and achieved accuracies of 56, 36, 72, 17, and 86% separately, which implies convolutional neural networks are able to learn cytological features [47]. In another cytopathology study, the authors used morphometric algorithm and semantic segmentation network based on VGG-19 to classify urine cytology whole-slide images according to Paris System for Urine Cytopathology and achieved a sensitivity of 77%, false-positive rate of 30% and AUC of 0.8 [48].

COVID-19

During the outbreak of COVID-19, telemedicine and computer-aided medicine are rapidly entering the market in many countries. Highly contagious nature, systemic risks, and social isolation brought unexpected challenges to traditional medicine. Applying AI-based computer-aided medicine along with clinical data from EHR, including individuals’ clinical risk factors of human-to-human interactions and a variety of diverse social data, may provide a quick control of this public health emergency with a better quality and safety [49].

Several AI companies have been working on products to address the COVID-19 pandemic. For example, JLK Inspection, Korea (http://www.jlk-inspection.com/#/medical/main) is integrating the reverse transcription-polymerase chain reaction (RT-PCR) results, imaging tests, and their universal AI platform, AIHuB, to provide COVID-19 diagnosis. Persivia, Massachusetts (https://persivia.com/covid-19-detection/) announced a new surveillance module based on their Soliton AI engine to identify and alert patients who are presumed positive for COVID-19. Biofourmis, Massachusetts (https://www.biofourmis.com/) developed an analytic platform called Biovitals Sentinel, which provides 24/7 remote monitor to identify early clinical deterioration and enable earlier interventions. Schaar et al. described that machine learning could significantly enhance both the efficiency and effectiveness of randomized clinical trials for COVID-19. It has the capability to speed up recruiting subjects from identifiable subgroups and assigning subjects to treatment or control groups as well as significantly reducing error and requiring many fewer patients [50].

Challenges

Although machine learning has produced promising results and provided many benefits in computational pathology, the following limitations need to be addressed before deep machine learning can be implemented in the clinical setting.

Standardization and normalization

The successful adaptation of whole-slide images in digital pathology heavily depends on each step of high-quality pathology slide preparation, including embedding, cutting, staining, and scanning. Folded tissue section during cutting, staining variation and the presence of air bubble during covering slide as well as different settings of brightness, intensity disparity, average color, and boundary intensity during scanning can cause unreliable raw data and produce inaccurate results [8, 51]. The protocols and systemic quality controls need to be standardized to reduce the system errors and random errors resulted from different instruments since a single noise in big data can cause misclassification and change the slide prediction, possibly resulting in a large number of false positives or negatives. The data used to generate algorithms are analyzed by different models by different developers. The larger the accumulated data, the more accurate algorithms, especially for rear diseases and specific small populations. A standard data format and normalization method of data analysis should be engaged to merge consecutive data sets from different resources and train them into one algorithm since different data resources may cause variation in classification accuracy in practice. The digital imaging and communications in medicine (DICOM) developed standards for medical images including radiology (www.dicomstandard.org). It defines the formats for medical images that can be exchanged with the data and quality necessary for clinical use. DICOM Standard now provides support for WSI, by incorporating a way to handle tiled large images as multi-frame images and multiple images at varying resolutions (http://dicom.nema.org/Dicom/DICOMWSI/).

The role in the computational pathology

Computational pathology is not only important in medical research, but also needed to address clinical questions in practice [52]. To achieve this goal, a team of experts in different fields is needed to participate in computational pathology projects, including data scientists such as statisticians and bio-informaticians for algorithm design and architect as well as engineer for the construction of physical environments and maintaining hardware (Fig. 7). Among them, the pathologists play a critical role to introduce medical questions and clinical applications to the developer team and to trigger the downstream industry development [53]. The new role of the pathologist in computational pathology not only requires solid clinical knowledge and experience, but also requires the knowledge of statistical analysis and data mining to bridge the gap between clinical medicine and AI so that when a disease suddenly happens, or a new biomarker is discovered, the pathologist can quickly react to the opportunity to either create a new algorithm or optimize an existing algorithm to assist the clinician [54]. Furthermore, pathologists who understand the potential problems that may occur during the data analysis can address the clinical problems clearly with computational thinking. Good communication among the team members can help to design a more efficient algorithm because the algorithms with different coding approaches will consume different amount of computing resources and time, especially for big data, although they generate the same end result. Furthermore, AI pathology provides excellent tools for experimental pathology by the integration of morphology at organ, histology, cell, and organelle levels with molecular details of omics data.

**Fig. 7: Computational pathology team.**

Hardware limitations

The accuracy of applications in the computational pathology heavily depends on large amounts of data, reliable hardware and software, and a supportive network environment. Large image file size (around 3 GB per slide scan) requires significant big storage space with backup capability in both local and cloud. In addition, deep machine learning solutions, especially when applying analysis of pathology images, heavily depend on graphics processing unit, which is a chip on the computer’s graphics card for rapidly manipulating graphics and processing images [55]. A powerful graphics processing unit can provide significant performance enhancement alongside the CPU to boost computing capacity and reduce turn-around time. For implementation of either data transmission or cloud-based image processing, data bandwidth of both intranet and internet becomes a bottleneck, which limits the speed of upload and download [56]. Only if all these related elements in a network have been developed and evolved into a robust system (Fig. 5), computational pathology can move forward to assist resolving more complex and multifaceted medical and clinical questions and research tasks. Finally, the clearance as medical devices (510k) by FDA is critical to ensure the clinical reliability and the acceptance by pathology community.

Ethics

In the new era of computation-driven decision-making processes based on AI and machine learning, computational pathology must involve in more complicated interactions of massive information from clinical history, omics data, living environment to social habits [57]. It is very likely that the experts involved in these decision-making processes will no longer be exclusively pathologists. Instead, the decision-making panel will include other experts such as data statisticians and bio-informaticians, which may raise ethical concerns [58]. A continuous massive, sensitive health data transfer among clinics, laboratories, and data banks can enable higher precision medicine, but at the same time increases the security vulnerability. Policies around the strict protection of patient privacy and personal data creates an obstacle for computational pathology to access the health databases need to create more comprehensive training data sets. General Data Protection Regulation was enacted in May 2018 in Europe to impose new responsibilities on organizations who process the data of European Union citizens for scientific research [59]. This concept highlights the proportionate approach to regulate computational pathology-related security and ethical issues while not limiting innovation unduly, which is difficult but critical.

Future prospects

Technological innovation in health care is growing at an increasingly fast pace and has been integrated into both our daily lives, such as smart healthy tracker, and diagnostic algorithm in medical practice [60]. With the rapid development of digital pathology, molecular pathology, and informatics pathology, computational pathology is increasingly involved in many subspecialties such as pulmonary, renal, gastrointestinal, neurology, and gynecology pathology. We believe the initial phase of AI will start with specific tasks such as the diagnosis of a particular cancer and classification of tissue types, which require limited and simple criteria [61]. For example, the common subtypes and variants of benign and malignant neoplasm in prostate should be included in the training and validation to ensure the feasibility of daily pathology practice. As a result of more data collection and more powerful computing capacity over time, the clinical applications of AI will be broader and the number of nonspecific cases in the gray zone or with red flags classified by AI for manual review will be decreased.

The growing medical data, including genomics, proteomics, informatics, and whole-slide images [62], is expected to integrate together to become a data-rich pathomics and lead to rapid development and prosperity of an AI-assisted computational pathology. Although many challenges remain, the computational pathology with the deployment of digital pathology technology and statistic algorithm will continue to improve clinical workflows and collaboration among pathologist and other members in the patient care team. The improved infrastructure of the network environment, the enhanced computing capacity, and broad integration of informatics have ushered in new horizons for both computational pathology and collaborative pattern, which make data travel and cloud-based central laboratory and data bank to deliver better care for patients at lower costs possible.

Conclusion

In the new era of deep learning-assisted pathology, the data banking, integration, and cloud laboratory are becoming an essential part of daily practice of pathology. Furthermore, the pathologists, data scientists, and industry are starting to incorporate the genomics, proteomics, bioinformatics, and computer algorithms into a large amount of complex clinical information. Through this process, the computational pathology can contribute valuable insights to the diagnosis, prognosis, and treatment of disease ultimately. Although many technical and ethical challenges need to be addressed, computational pathology as a synergistic system will lead to a boosting workflow, enabling clinical teams to share and analyze image data in a broader platform.

Currently, deep learning has been applied to solve more and more specialized tasks in medicine. Several studies discussed above showed that algorithm assistance has the potential to not only improve the sensitivity and accuracy of the diagnoses but also improve turn-around time. Moreover, according to Sarwar’s et al. [63] study, around 75% of pathologists across 59 countries in the world are interested and excited about using AI as a diagnostic tool. Finally, despite the challenges and obstacles, the potential of computational pathology which will change and improve the current health care system is promising and exciting.

References

George MR, Johnson KA, Gratzinger DA, Brissette MD, McCloskey CB, Conran RM, et al. Will I need to move to get my first job?: geographic relocation and other trends in the pathology job market. Arch Pathol Lab Med. 2020;144:427–34.
Article PubMed Google Scholar
Goldenberg SL, Nir G, Salcudean SE. A new era: artificial intelligence and machine learning in prostate cancer. Nat Rev Urol. 2019;16:391–403.
Article PubMed Google Scholar
Parwani AV. Next generation diagnostic pathology: use of digital pathology and artificial intelligence tools to augment a pathological diagnosis. Diagn Pathol. 2019;14:138.
Article CAS PubMed PubMed Central Google Scholar
Liu Y, Gadepalli K, Norouzi M, Dahl GH, Kohlberger T, Boyko A, et al. Detecting cancer metastases on gigapixel pathology images. arXiv. 2017. https://arxiv.org/abs/1703.02442.
McAlpine ED, Michelow P. The cytopathologist’s role in developing and evaluating artificial intelligence in cytopathology practice. Cytopathology. 2020;31:385–92.
Article PubMed Google Scholar
Huss R, Coupland SE. Software-assisted decision support in digital histopathology. J Pathol. 2020;250:685–92.
Article PubMed Google Scholar
Serag A, Ion-Margineanu A, Qureshi H, McMillan R, Saint Martin MJ, Diamond J, et al. Translational AI and deep learning in diagnostic pathology. Front Med. 2019;1:185.
Article Google Scholar
Hou L, Samaras D, Kurc TM, Gao Y, Davis JE, Saltz JH. Patch-based convolutional neural network for whole slide tissue image classification. Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2016;2016:2424–33.
PubMed PubMed Central Google Scholar
Chang HY, Jung CK, Woo JI, Lee S, Cho J, Kim SW, et al. Artificial intelligence in pathology. J Pathol Transl Med. 2019;53:1–12.
Article PubMed Google Scholar
Tizhoosh HR, Pantanowitz L. Artificial intelligence and digital pathology: challenges and opportunities. J Pathol Inform. 2018;14:38.
Article Google Scholar
Komura D, Ishikawa S. Machine learning methods for histopathological image analysis. Comput Struct Biotechnol J. 2018;9:34–42.
Article Google Scholar
Komura D, Ishikawa S. Machine learning approaches for pathologic diagnosis. Virchows Arch. 2019;475:131–8.
Article CAS PubMed Google Scholar
Levine AB, Schlosser C, Grewal J, Coope R, Jones SJM, Yip S. Rise of the machines: advances in deep learning for cancer diagnosis. Trends Cancer. 2019;5:157–69.
Article PubMed Google Scholar
Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, et al. Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface. 2018;15:20170387.
Article PubMed PubMed Central Google Scholar
Rashidi HH, Tran NK, Betts EV, Howell LP, Green R. Artificial intelligence and machine learning in pathology: the present landscape of supervised methods. Acad Pathol. 2019;6:2374289519873088.
Article PubMed PubMed Central Google Scholar
Krempel R, Kulkarni P, Yim A, Lang U, Habermann B, Frommolt P. Integrative analysis and machine learning on cancer genomics data using the Cancer Systems Biology Database (CancerSysDB). BMC Bioinform. 2018;24:156.
Article Google Scholar
Cong L, Feng W, Yao Z, Zhou X, Xiao W. Deep learning model as a new trend in computer-aided diagnosis of tumor pathology for lung cancer. J Cancer. 2020;26:3615–22.
Article Google Scholar
Evans AJ, Bauer TW, Bui MM, Cornish TC, Duncan H, Glassy EF, et al. US Food and Drug Administration approval of whole slide imaging for primary diagnosis: a key milestone is reached and new questions are raised. Arch Pathol Lab Med. 2018;142:1383–7.
Article PubMed Google Scholar
Bera K, Schalper KA, Rimm DL, Velcheti V, Madabhushi A. Artificial intelligence in digital pathology—new tools for diagnosis and precision oncology. Nat Rev Clin Oncol. 2019;16:703–15.
Article PubMed PubMed Central Google Scholar
Zhang DY. An outline of office-based bladder and prostate biopsy pathology. Nova Science Publishers Inc.; 2019.
Campanella G, Hanna MG, Geneslaw L, Miraflor A, Werneck Krauss Silva V, Busam KJ, et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat Med. 2019;25:1301–9.
Article CAS PubMed PubMed Central Google Scholar
Senaras C, Niazi MKK, Lozanski G, Gurcan MN. DeepFocus: detection of out-of-focus regions in whole slide digital images using deep learning. PLoS ONE. 2018;25:e0205387.
Article Google Scholar
Janowczyk A, Zuo R, Gilmore H, Feldman M, Madabhushi A. HistoQC: an open-source quality control tool for digital pathology slides. JCO Clin Cancer Inform. 2019;3:1–7.
Article PubMed Google Scholar
Nakata N. Recent technical development of artificial intelligence for diagnostic medical imaging. Jpn J Radiol. 2019;37:103–8.
Article PubMed Google Scholar
Kapil A, Meier A, Zuraw A, Steele KE, Rebelatto MC, Schmidt G, et al. Deep semi supervised generative learning for automated tumor proportion scoring on NSCLC tissue needle biopsies. Sci Rep. 2018;26:17343.
Article Google Scholar
Barker J, Hoogi A, Depeursinge A, Rubin DL. Automated classification of brain tumor type in whole-slide digital pathology images using local representative tiles. Med Image Anal. 2016;30:60–71.
Article PubMed Google Scholar
Saltz J, Gupta R, Hou L, Kurc T, Singh P, Nguyen V, et al. Spatial organization and molecular correlation of tumor-infiltrating lymphocytes using deep learning on pathology images. Cell Rep. 2018;3:181–93.e7.
Article Google Scholar
Harmon SA, Tuncer S, Sanford T, Choyke PL, Turkbey B. Artificial intelligence at the intersection of pathology and radiology in prostate cancer. Diagn Interv Radiol. 2019;25:183–8.
Article PubMed PubMed Central Google Scholar
Moscatelli M, Manconi A, Pessina M, Fellegara G, Rampoldi S, Milanesi L, et al. An infrastructure for precision medicine through analysis of big data. BMC Bioinform. 2018;15 Suppl 10:351.
Article Google Scholar
Robertson S, Azizpour H, Smith K, Hartman J. Digital image analysis in breast pathology-from image processing techniques to artificial intelligence. Transl Res. 2018;194:19–35.
Article PubMed Google Scholar
Hamidinekoo A, Denton E, Rampun A, Honnor K, Zwiggelaar R. Deep learning in mammography and breast histology, an overview and future trends. Med Image Anal. 2018;47:45–67.
Article PubMed Google Scholar
Mobadersany P, Yousefi S, Amgad M, Gutman DA, Barnholtz-Sloan JS, Velazquez Vega JE, et al. Predicting cancer outcomes from histology and genomics using convolutional networks. Proc Natl Acad Sci USA. 2018;27:E2970–9.
Google Scholar
Aeffner F, Zarella MD, Buchbinder N, Bui MM, Goodman MR, Hartman DJ, et al. Introduction to digital image analysis in whole-slide imaging: a white paper from the digital pathology association. J Pathol Inform. 2019;8:9.
Article Google Scholar
Miller DD, Brown EW. Artificial intelligence in medical practice: the question to the answer? Am J Med. 2018;131:129–33.
Article PubMed Google Scholar
Farnell DA, Huntsman D, Bashashati A. The coming 15 years in gynaecological pathology: digitisation, artificial intelligence, and new technologies. Histopathology. 2020;76:171–7.
Article PubMed Google Scholar
Granter SR, Beck AH, Papke DJ Jr. AlphaGo, deep learning, and the future of the human microscopist. Arch Pathol Lab Med. 2017;141:619–21.
Article PubMed Google Scholar
Mazzanti M, Shirka E, Gjergo H, Hasimi E. Imaging, health record, and artificial intelligence: hype or hope? Curr Cardiol Rep. 2018;10:48.
Article Google Scholar
Smail-Tabbone M, Rance B. Contributions from the 2018 literature on bioinformatics and translational informatics. Yearb Med Inform. 2019;28:190–3.
Article PubMed PubMed Central Google Scholar
Landau MS, Pantanowitz L. Artificial intelligence in cytopathology: a review of the literature and overview of commercial landscape. J Am Soc Cytopathol. 2019;8:230–41.
Article PubMed Google Scholar
Houssami N, Kirkpatrick-Jones G, Noguchi N, Lee CI. Artificial Intelligence (AI) for the early detection of breast cancer: a scoping review to assess AI’s potential in breast screening practice. Expert Rev Med Devices. 2019;16:351–62.
Article CAS PubMed Google Scholar
Jang HJ, Cho KO. Applications of deep learning for the analysis of medical data. Arch Pharm Res. 2019;42:492–504.
Article CAS PubMed Google Scholar
Wildeboer RR, van Sloun RJG, Wijkstra H, Mischi M. Artificial intelligence in multiparametric prostate cancer imaging with focus on deep-learning methods. Comput Methods Programs Biomed. 2020;189:105316.
Article PubMed Google Scholar
Korbar B, Olofson AM, Miraflor AP, Nicka CM, Suriawinata MA, Torresani L, et al. Deep learning for classification of colorectal polyps on whole-slide images. J Pathol Inform. 2017;8:30.
Article PubMed PubMed Central Google Scholar
Bychkov D, Linder N, Turkki R, Nordling S, Kovanen PE, Verrill C, et al. Deep learning based tissue analysis predicts outcome in colorectal cancer. Sci Rep. 2018;8:3395.
Article PubMed PubMed Central Google Scholar
Wang DY, Khosla A, Gargeya R, Irshad H, Beck AH. Deep learning for identifying metastatic breast cancer. arXiv. 2016. https://arxiv.org/abs/1606.05718.
Litjens G, Bandi P, Ehteshami Bejnordi B, Geessink O, Balkenhol M, Bult P, et al. 1399 H&E-stained sentinel lymph node sections of breast cancer patients: the CAMELYON dataset. Gigascience. 2018;7:giy065.
Article PubMed Central Google Scholar
Martin V, Kim TH, Kwon M, Kuko M, Pourhomayoun M, Martin S. A more comprehensive cervical cell classification using convolutional neural network. J Am Soc Cytopathol. 2018. https://doi.org/10.1016/j.jasc.2018.06.156.
Vaickus LJ, Suriawinata AA, Wei JW, Liu X. Automating the Paris System for urine cytopathology-A hybrid deep-learning and morphometric approach. Cancer Cytopathol. 2019;127:98–115.
Article CAS PubMed Google Scholar
Niazi MKK, Parwani AV, Gurcan MN. Digital pathology and artificial intelligence. Lancet Oncol. 2019;20:e253–61.
Article PubMed Google Scholar
Van der Schaar M, Alaa A. How artificial intelligence and machine learning can help healthcare systems respond to COVID-19. Mach Learn. 2020. https://doi.org/10.1007/s10994-020-05928-x.
Conceicao T, Braga C, Rosado L, Vasconcelos MJM. A review of computational methods for cervical cells segmentation and abnormality classification. Int J Mol Sci. 2019;20:5114.
Article PubMed Central Google Scholar
Kulkarni S, Seneviratne N, Baig MS, Khan AHA. Artificial intelligence in medicine: where are we now? Acad Radiol. 2020;27:62–70.
Article PubMed Google Scholar
Jha S, Topol EJ. Adapting to artificial intelligence: radiologists and pathologists as information specialists. JAMA. 2016;316:2353–4.
Article PubMed Google Scholar
Madabhushi A, Lee G. Image analysis and machine learning in digital pathology: challenges and opportunities. Med Image Anal. 2016;33:170–5.
Article PubMed PubMed Central Google Scholar
Chan HP, Samala RK, Hadjiiski LM, Zhou C. Deep learning in medical image analysis. Adv Exp Med Biol. 2020;1213:3–21.
Article PubMed PubMed Central Google Scholar
Abels E, Pantanowitz L, Aeffner F, Zarella MD, van der Laak J, Bui MM, et al. Computational pathology definitions, best practices, and recommendations for regulatory guidance: a white paper from the Digital Pathology Association. J Pathol. 2019;249:286–94.
Article PubMed PubMed Central Google Scholar
Lin JC, Fan CT, Liao CC, Chen YS. Taiwan Biobank: making cross-database convergence possible in the Big Data era. Gigascience. 2018;7:1–4.
Article PubMed Google Scholar
Stoekle HC, Mamzer-Bruneel MF, Frouart CH, Le Tourneau C, Laurent-Puig P, Vogt G, et al. Molecular tumor boards: ethical issues in the new era of data medicine. Sci Eng Ethics. 2018;24:307–22.
Article PubMed Google Scholar
Chico V. The impact of the General Data Protection Regulation on health research. Br Med Bull. 2018;128:109–18.
Article PubMed Google Scholar
Mintz Y, Brodie R. Introduction to artificial intelligence in medicine. Minim Invasive Ther Allied Technol. 2019;28:73–81.
Article PubMed Google Scholar
Allen TC. Regulating artificial intelligence for a successful pathology future. Arch Pathol Lab Med. 2019;143:1175–9.
Article PubMed Google Scholar
Wang S, Yang DM, Rong R, Zhan X, Fujimoto J, Liu H, et al. Artificial intelligence in lung cancer pathology image analysis. Cancers. 2019;11:1673.
Article PubMed Central Google Scholar
Sarwar S, Dent A, Faust K, Richer M, Djuric U, Van Ommeren R, et al. Physician perspectives on integration of artificial intelligence into diagnostic pathology. NPJ Digit Med. 2019;2:28.
Article PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

St. Luke’s Roosevelt Hospital Center, Icahn School of Medicine at Mount Sinai, New York, NY, 10025, USA
Miao Cui
Pathology and Laboratory Services, VA Medical Center, New York, NY, 10010, USA
David Y. Zhang

Authors

Miao Cui
View author publications
You can also search for this author in PubMed Google Scholar
David Y. Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to David Y. Zhang.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cui, M., Zhang, D.Y. Artificial intelligence and computational pathology. Lab Invest 101, 412–422 (2021). https://doi.org/10.1038/s41374-020-00514-0

Download citation

Received: 08 June 2020
Revised: 08 November 2020
Accepted: 10 November 2020
Published: 16 January 2021
Issue Date: April 2021
DOI: https://doi.org/10.1038/s41374-020-00514-0

This article is cited by

Artificial intelligence’s impact on breast cancer pathology: a literature review
- Amr Soliman
- Zaibo Li
- Anil V. Parwani
Diagnostic Pathology (2024)
Standardizing digital biobanks: integrating imaging, genomic, and clinical data for precision medicine
- Valentina Brancato
- Giuseppina Esposito
- Marco Aiello
Journal of Translational Medicine (2024)
Intraoperative margin assessment for basal cell carcinoma with deep learning and histologic tumor mapping to surgical site
- Joshua J Levy
- Matthew J Davis
- Matthew R LeBoeuf
npj Precision Oncology (2024)
NuInsSeg: A fully annotated dataset for nuclei instance segmentation in H&E-stained histological images
- Amirreza Mahbod
- Christine Polak
- Isabella Ellinger
Scientific Data (2024)
Deep learning infers clinically relevant protein levels and drug response in breast cancer from unannotated pathology images
- Hui Liu
- Xiaodong Xie
- Bin Wang
npj Breast Cancer (2024)

Subjects

Abstract

Similar content being viewed by others

Introduction

Digital pathology, machine learning, and computational pathology

Algorithm training progress

Case selection

Whole-slide imaging (WSI)

Image analysis and automation

Pathologist-centered medical system

Pathologist-centered medical system

Global pathology service model

Examples of pathology AI

Prostate cancer

Colorectal cancer

Breast cancer

Cytology

COVID-19

Challenges

Standardization and normalization

The role in the computational pathology

Hardware limitations

Ethics

Future prospects

Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links