Introduction

Artificial intelligence (AI) refers to any non-living entity that executes tasks typically requiring human intelligence1. Endocrinology stands to benefit greatly from the rise of AI, particularly in the realm of cancer diagnostics, where AI has the potential to facilitate enhanced diagnostic precision and improved workflows. Medical images are a mainstay of tumour diagnostics and they also serve as a reservoir of mineable pixel data that naturally lends itself to machine-based classification2,3. Computer vision (Box 1) applications are already leveraging this property to power robust diagnostic interpretations of endocrine neoplasms4,5,6,7. Although tissue pathology remains the gold standard in the diagnosis of many endocrine tumours, the macroscopic characterization of tissue in imaging studies can augment histological findings. Indeed, a biopsy sample might not always reflect the intratumoural heterogeneity across genomic subclones8,9. Additionally, a biopsy is invasive and subject to sampling error that can render it inconclusive. Computer vision can be leveraged in support of histological findings by inferring diagnosis from the structural heterogeneity observed within tumours on medical imaging3. Furthermore, given the high frequency with which medical imaging is performed in cancer management, archives of longitudinal medical imaging data can be used by computer vision applications to better characterize disease, predict progression at the time of diagnosis and monitor response to treatment2.

The correlation of AI-driven image analytics with other omics data and clinical expertise can also be used to enable integrative approaches to care10,11 (Fig. 1). Indeed, studies demonstrate that the mapping of genomics or pathomics (image features extracted from pathology studies) data with radiomics (image features extracted from radiology studies) data from medical imaging can illuminate conserved trends at different levels of human physiology, with implications for diagnosis and prognosis12,13,14. Furthermore, advances in machine intelligence have the potential to enable non-invasive endocrine cancer diagnostics that could preclude or limit the use of invasive biopsy15. Looking ahead, it will be important for both endocrinologists and radiologists to cultivate a working understanding of the utility and limitations of AI if the benefits of these technologies are to be realized.

Fig. 1: Integrative diagnostics.

The convergence of different omics data with clinical intuition. Endocrinologists communicate with patients and radiologists to gain a clinical overview of their patient. Four arms give a holistic overview of disease: radiomics (for example, CT or MRI), pathomics (for example, histology of tissue samples), genomics and phenomics (for example, digital health mobile phone applications and wearable trackers). An artificial intelligence algorithm (such as a deep neural network, seen in the centre) synthesizes all the information and provides a diagnostic classification.

In this Review, we highlight the ever-growing contributions of machine intelligence to the field of endocrine imaging diagnostics for tumours of the adrenal, pancreatic, pituitary and thyroid glands.

Understanding AI

The definition of AI is broad and encompasses a variety of approaches that bridge the natural, applied and social sciences. Examples of tasks in medicine that can leverage AI include image interpretation16,17, disease forecasting18, genomics19,20, natural language processing21,22 and therapeutic discovery23, among others. In this section, we review in more detail key concepts in machine learning and deep learning, both of which fall under the larger umbrella of AI.

Machine learning

Machine learning algorithms can be distinguished from conventional statistical models by their ability to learn without explicit pre-programming24. Therefore, machine learning has the potential to reduce coding effort by researchers. Furthermore, task performance can be improved using rules gleaned from examples in the data rather than from those in prewritten code. This data-driven process also confers an advantage to machine learning in terms of adaptability, whereby algorithms can be configured to update in real time to continuously reflect new data.

Machine learning algorithms are often used to assist with diagnostics in medical imaging, which provides an excellent source of large-volume, multidimensional data. Of note, medical image pixel data contain features that are not apparent to the human eye, which can be extracted using radiomics methods25. The intuitive coupling of machine learning to the field of radiomics has been used to enhance diagnostic performance and to automate workflow. The traditional radiomics workflow moves in a stepwise fashion from image acquisition, to segmentation (Box 1), feature extraction and feature analysis, which ultimately yields a radiomics signature (Fig. 2). Image acquisition begins the workflow, with image capture followed by file conversion to achieve digital workflow compatibility for subsequent data processing. Next, segmentation is performed to delineate tumour regions of interest (ROIs) (Box 1), after which feature extraction is used to harvest quantitative pixel features. Following this step, feature analysis is used to determine the most robust and generalizable features for inclusion in the final model. This selection process prevents overfitting (Box 1), a phenomenon that occurs when the model too closely maps to features in the training data, resulting in poor generalizability26 (Box 1). These steps can be performed by hand; however, the benefit of machine learning algorithms is that they can be used to semi-automate or fully automate this process for improved efficiency and detail. Some examples of machine learning algorithms include support vector machines, random forest and k-nearest neighbour (Table 1). Next, we cover three prominent training methodologies in machine intelligence: supervised, unsupervised and reinforcement learning.
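As a minimal sketch of the feature selection and classification steps of this workflow, the following Python example pairs a univariate statistical filter with a random forest classifier; the feature matrix and biopsy-confirmed labels are synthetic placeholders standing in for extracted radiomics features:

```python
# Sketch of radiomics feature selection and classification on placeholder data.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 50))    # 200 lesions, 50 extracted pixel features
y = rng.integers(0, 2, size=200)  # placeholder labels (0 = benign, 1 = malignant)

X_train, X_test, y_train, y_test = train_test_split(X, y, stratify=y, random_state=0)

# Feature analysis: keep only the most discriminative features to limit overfitting
selector = SelectKBest(f_classif, k=10).fit(X_train, y_train)

# Fit the final model on the selected features
clf = RandomForestClassifier(random_state=0).fit(selector.transform(X_train), y_train)
probs = clf.predict_proba(selector.transform(X_test))[:, 1]
print("test AUC:", roc_auc_score(y_test, probs))
```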

Fig. 2: Computer vision workflow.

The four main steps in the conventional machine learning workflow are image acquisition, segmentation, feature extraction and feature analysis (feature selection). Segmentation involves determining the region of interest of the image and feature extraction identifies pixel features that are then graphically analysed. A radiomics signature is the final output. One can use either machine learning or deep learning for feature extraction and engineering, including the identification of pixel intensity, lesion shape, texture feature matrices and wavelets. Conventional machine learning algorithms must respect this pathway of acquisition, segmentation, feature extraction and feature selection. By contrast, deep learning can circumvent this process altogether with end-to-end processing from inputs to outputs.

Table 1 Examples of artificial intelligence techniques in endocrine cancer imaging

Supervised

Supervised learning (Box 1) uses labelled inputs and asks an algorithm to identify how the relevant features from a dataset map to each respective label26. For example, let us say we are trying to differentiate between benign and malignant thyroid nodules using quantitative pixel features, extracted from an ultrasonography study, that represent nodule texture. Our labels here are ‘benign’ and ‘malignant’ and our inputs are texture representations, such as pixel correlation or entropy. During model training, the machine learning algorithm studies the texture features (Box 1) of benign and malignant images to develop and refine its decision-making process. Conceptually, the goal of supervised techniques is to correctly classify unlabelled data into the pre-defined categories used during model training. In this hypothetical example, we feed the algorithm feature data from unlabelled scans and we want it to tell us if the imaging findings are benign or malignant. The model is supervised in the sense that its programmer shows the algorithm correct examples to guide the learning process.
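A hedged sketch of this supervised example follows; the two texture features (pixel correlation and entropy) and their numerical values are illustrative assumptions rather than real ultrasonography measurements:

```python
# Supervised classification sketch: an SVM learns from labelled examples.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(1)
benign = rng.normal(loc=[0.8, 2.0], scale=0.1, size=(50, 2))     # [correlation, entropy]
malignant = rng.normal(loc=[0.5, 3.5], scale=0.1, size=(50, 2))
X = np.vstack([benign, malignant])
y = np.array([0] * 50 + [1] * 50)  # labels shown to the algorithm (0 = benign, 1 = malignant)

model = SVC(kernel="rbf").fit(X, y)  # training guided by correct examples

new_nodule = np.array([[0.55, 3.2]])  # feature data from an unlabelled scan
print("malignant" if model.predict(new_nodule)[0] == 1 else "benign")
```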

Unsupervised

In contrast to supervised learning, unsupervised techniques (Box 1) use unlabelled inputs and let the model adjudicate the data into groups. Revisiting our previous example of the thyroid nodule, we could build a model where pixel data of texture features from scans of patients with unconfirmed or borderline diagnoses are used as the unlabelled inputs. In what is an oversimplification, we could imagine the model ‘plotting’ the imaging data based on common features. Doing so enables the algorithm to identify clusters in the data, which might or might not translate to a substantive interpretation. Critically, the algorithm decides what is important when plotting the data. In the supervised learning example, we were looking to classify thyroid nodules as either benign or malignant. In this unsupervised scenario, the data could cluster any which way. For example, the data might triangulate to ‘coordinates’ or groups for different types of nodules as intended or it could group by background noise. In this way, the unsupervised learning model can potentially elucidate trends that the investigator had not originally set out to find, arguably the greatest strength and weakness of this technique. Unsupervised techniques can also be leveraged for augmenting imaging workflows in the annotation and pre-processing of unlabelled data27,28. Again, a critical conceptual distinction between supervised and unsupervised learning is that the output for the former will typically be a defined label or value, whereas the latter will be a cluster or association.
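A minimal sketch of this unsupervised scenario is shown below; the choice of two clusters and the synthetic feature values are assumptions for illustration, and the meaning of each cluster is left to the investigator:

```python
# Unsupervised clustering sketch: k-means groups unlabelled texture features.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(2)
X = np.vstack([
    rng.normal(loc=[0.8, 2.0], scale=0.1, size=(50, 2)),  # unlabelled scans
    rng.normal(loc=[0.5, 3.5], scale=0.1, size=(50, 2)),  # unlabelled scans
])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
# Cluster assignments only; whether they map to 'benign' and 'malignant'
# (or to background noise) must be interpreted after the fact.
print(kmeans.labels_[:10])
```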

Reinforcement

Reinforcement learning (Box 1) is a framework where the model interacts with its environment through actions that are each tied to a value reward29. In keeping with our thyroid nodule example, we could build a model that is fed pixel data from unlabelled scans of patients. The model is tasked with the identification of the malignant and benign target patterns. The model takes an action based on the data it encounters and then uses the reward information from its environment to find the path that maximizes the reward over time. We can think of this type of technique as learning by trial and error.
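As a toy illustration of this trial-and-error process, the following sketch uses a simple tabular value update in which an agent is rewarded for calling the correct diagnosis; the environment, states and reward scheme are entirely hypothetical:

```python
# Reinforcement learning sketch: learning a diagnostic 'policy' by trial and error.
import numpy as np

rng = np.random.default_rng(3)
n_states, n_actions, alpha, epsilon = 2, 2, 0.1, 0.1
Q = np.zeros((n_states, n_actions))  # value estimates for each state-action pair

for step in range(5000):
    state = rng.integers(n_states)  # pattern encountered (0 = benign-like, 1 = malignant-like)
    if rng.random() < epsilon:
        action = int(rng.integers(n_actions))  # explore a random call
    else:
        action = int(np.argmax(Q[state]))      # exploit current knowledge
    reward = 1.0 if action == state else -1.0  # environment feedback
    Q[state, action] += alpha * (reward - Q[state, action])

print(Q.round(2))  # the learned values favour the correct call in each state
```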

Deep learning

Deep learning (Box 1) is a subset of machine learning using algorithm architectures inspired by neural processing in humans that make classifications or predictions using layers of abstract data representations30. Deep learning models typically perform sequential operations that distort the data in each successive layer and this series of transformations enables the model to progressively deduce information relevant to the assigned task. Revisiting our hypothetical thyroid nodule example, the first layer of our deep learning model might assess groups of image pixels at different orientations to discern edge information31. The second layer might then compile the edges from the first layer to detect patterns of edges31. The next layer might assemble different edge motifs to detect hyperechoic or hypoechoic regions of the scan. Finally, subsequent layers might transform inputs from the previous layer to recognize complex image traits such as microcalcifications, cysts and necrosis.

Importantly, deep neural networks can be differentiated from shallow neural networks by their multiple (>1) ‘hidden’ layers, which contain complex, non-linear connections that can be difficult for humans to interpret (Fig. 3). Although these hidden layers are striking in their ability to enhance the complexity of features discernible by the model, deep learning algorithms require large amounts of data to avoid picking up noise specific to the training dataset (see ‘Overfitting’, Box 1). A key strength of deep learning is that the technique is less reliant on feature engineering (Box 1) when compared with classic machine learning models32.

Fig. 3: A convolutional neural network.

The input is a medical image to which an overlaying grid and a kernel matrix (for example, 3 × 3) are applied. The matrix feature maps to a smaller area on a stacked convolution layer. Another smaller kernel matrix (for example, 2 × 2) is pulled from a different area on that convolutional layer to a pooling layer. This pipeline then coalesces into a classification region with the ‘fully connected’ layers, which will yield an output.
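A minimal PyTorch sketch of the architecture described above might look as follows; the image size, channel counts and layer widths are illustrative assumptions:

```python
# Convolutional neural network sketch mirroring Fig. 3.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, padding=1),  # convolution layer (3 x 3 kernel)
    nn.ReLU(),
    nn.MaxPool2d(kernel_size=2),                # pooling layer (2 x 2 kernel)
    nn.Flatten(),
    nn.Linear(8 * 32 * 32, 64),                 # fully connected layers
    nn.ReLU(),
    nn.Linear(64, 2),                           # output: benign vs malignant logits
)

scan = torch.randn(1, 1, 64, 64)  # one 64 x 64 grey-scale 'medical image'
print(model(scan).shape)          # torch.Size([1, 2])
```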

Deep learning models can also make use of the aforementioned supervised, unsupervised and reinforcement learning techniques. Deep learning models (Table 1) can be used for specific tasks within the radiomics workflow, such as segmentation or feature extraction, often with improved performance compared with traditional machine learning methods such as single-layer ‘shallow’ neural networks. Mixed techniques are often employed in the feature extraction process, whereby ‘deep features’ mined using deep learning algorithms are syphoned into a second classifier algorithm, either in isolation or in some combination with other manually extracted or statistically derived features. However, deep learning can also be used in end-to-end processing, effectively obviating the need for human involvement in the segmentation, feature extraction and feature selection (Box 1) steps of the radiomics workflow33,34 (Fig. 2).
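The following sketch illustrates the mixed technique under stated assumptions: a truncated placeholder network mines ‘deep features’ that are then syphoned into a support vector machine for classification:

```python
# Mixed technique sketch: deep features feeding a second classifier.
import torch
import torch.nn as nn
from sklearn.svm import SVC

feature_extractor = nn.Sequential(  # truncated network without a classification head
    nn.Conv2d(1, 8, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(4),
    nn.Flatten(),                   # yields 8 * 4 * 4 = 128 deep features per scan
)

scans = torch.randn(100, 1, 64, 64)           # placeholder imaging data
labels = torch.randint(0, 2, (100,)).numpy()  # placeholder pathology labels

with torch.no_grad():
    deep_features = feature_extractor(scans).numpy()

svm = SVC().fit(deep_features, labels)  # second classifier adjudicates the labels
print(svm.score(deep_features, labels))
```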

Diagnostics

In this section, we review AI applications in endocrine cancer diagnostics by organ system, with an emphasis on clinical utility, technical limitations and areas for future research.

Adrenal gland

On abdominal imaging, approximately 5% of the general population have adrenal lesions, typically revealed as incidentally found asymptomatic tumours (incidentalomas)35,36. Clinical work-up for adrenal masses starts with assessing them for potential malignancy and functionality37. Early radiomics efforts to discriminate adrenal lesions on CT and MRI used mean frequency attenuation mapping with histogram analysis38. However, the replication of findings has been a challenge, possibly owing to variation in techniques to define the ROI39,40,41. Importantly, histogram analysis has paved the way for automated radiomics-based machine learning techniques with texture analysis, which can assess both low-order and higher-order features. Texture analysis explores hierarchical spatial relationships between pixels. First-order features describe distributions in grey-level pixel intensity, second-order features assess relationships between pixel pairs and higher-order features explore distributions in pixel neighbourhoods42,43,44.
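As a hedged illustration of these feature orders, the following sketch computes first-order intensity statistics and second-order grey-level co-occurrence matrix (GLCM) features with scikit-image; the synthetic array stands in for a segmented adrenal ROI:

```python
# Texture analysis sketch: first- and second-order features from a placeholder ROI.
import numpy as np
from skimage.feature import graycomatrix, graycoprops

rng = np.random.default_rng(4)
roi = rng.integers(0, 256, size=(64, 64), dtype=np.uint8)  # placeholder adrenal ROI

# First-order: distribution of grey-level pixel intensities
print("mean:", roi.mean(), "s.d.:", roi.std())

# Second-order: relationships between pixel pairs at a given offset
glcm = graycomatrix(roi, distances=[1], angles=[0], levels=256, normed=True)
print("contrast:", graycoprops(glcm, "contrast")[0, 0])
print("correlation:", graycoprops(glcm, "correlation")[0, 0])
```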

CT is the dominant imaging modality for evaluating the adrenal gland and can be performed with or without contrast enhancement for the visualization of adrenal tumours. In the evaluation of malignancy, a size of >4 cm is a concerning feature, often prompting resection45. However, this risk factor should not be taken in isolation as ~70% of these large adrenal tumours have been shown to be non-malignant lesions46,47. Machine learning has been used to differentiate large (>4 cm) adrenocortical carcinomas from other large cortical lesions on contrast-enhanced CT48. The radiomics signature obtained by machine learning had a diagnostic accuracy for malignant disease exceeding that of radiologists, although there was inter-observer variability on the radiologist evaluation (P < 0.0001)48. The performance of this machine learning-based texture analysis model further improved with the inclusion of pre-contrast mean attenuation, which is a parameter that is also used in established adrenal radiological criteria49.

In terms of functional adrenal lesions, machine intelligence has also been used to differentiate between lipid-poor adenoma and subclinical phaeochromocytoma (which might secrete catecholamines), where attenuation thresholds and washout characteristics might not always be reliable50,51. As subclinical phaeochromocytomas can sustain secretory function, biopsy or surgery could precipitate haemodynamic instability if a functional tumour goes undetected. Studies have yielded radiomics signatures for subclinical phaeochromocytoma via machine learning-driven texture analysis on non-contrast CT imaging with performance accuracy ranging from 85% to 89%52,53. However, the potential benefits of this approach over existing clinical criteria are hard to discern owing to considerable differences in baseline tumour characteristics, such as attenuation and size, and the lack of comparison between machine learning-driven analysis and expert radiologist evaluation52,53. Still, we can envision a future role for the enhanced detection of subclinical phaeochromocytomas with AI techniques to confidently and quickly prompt confirmatory biochemical testing.

Other groups have also leveraged the improved resolution of adrenal imaging on MRI to train their models. Indeed, one group developed a machine learning-based radiomics signature to differentiate adrenal adenomas from non-adenomatous lesions on MRI, with non-inferior performance in comparison with expert radiologists54. Other studies have explored neural networks for the differentiation of tumour subtypes on MRI (accuracy, 93%) and CT (accuracy, 82%), including adrenal adenomas, cysts, lipomas, lymphangiomas and metastases. However, these neural networks were trained with radiologist evaluation as the ground truth condition rather than with the gold standard of biopsy pathology55,56.

Looking ahead, we anticipate that the field of AI-powered adrenal tumour diagnostics will move towards robust automated detection and preoperative classification of incidentalomas. Future work is needed in the differentiation of small (<4 cm) adrenal masses, particularly in the case of malignancy, where early detection is linked with better outcomes57. The field will also benefit from more robust clinical evaluation and workarounds for small cohort sizes, possibly through increased data-sharing and/or pre-processing techniques to reduce overfitting.

Pancreas

The aberrant proliferation of endocrine islet cells leads to the development of pancreatic neuroendocrine tumours (NETs) and prognosis is overall favourable with complete resection58,59,60. A minority of these neoplasms retain the functional status of their original islet cell lineage, which can induce a clinical syndrome due to hormone production, often facilitating their detection61,62. Absent such biochemical indicators, the clinical management of pancreatic NETs is guided primarily by tumour grade, determined from the Ki67 index and mitotic count observed in tissue samples obtained by biopsy; however, imaging characteristics, such as tumour size, depth of invasion and presence of metastases, are also considered63,64,65,66. Pancreatic NETs classically present on CT imaging as contrast-enhancing masses that are best visualized on the arterial phase, often with a hypervascular appearance and washout on the delayed phase67,68.

Preoperatively, biopsy samples are typically obtained via fine-needle aspiration on endoscopic ultrasound, although the localization and yield can be complicated by lesion size and spatial orientation69. In light of these uncertainties, there is interest in developing a system for preoperative risk stratification of pancreatic NETs, which would help guide therapeutic directions in support of endocrine oncologists and surgeons70,71. Studies have utilized both conventional machine learning and deep learning on preoperative CT and MRI to classify pancreatic NET grade with robust accuracy in pathology-confirmed tumours4,72,73,74,75. Importantly, the development of classification boundaries for future studies requires consensus in the partitioning of tumour grades. For example, some studies differentiate grade G1 and G2 from G3 neoplasms, whereas others differentiate grade G1 from G2 and G3 neoplasms4,74,76. Given that pancreatic NETs are rare, one deep learning study using MRI applied data augmentation (Box 1) with a generative adversarial network to data from 96 patients with confirmed disease to improve the generalizability of their convolutional neural network on unseen data75. Beyond risk stratification, computer-aided diagnosis of pancreatic NETs could also benefit if AI efforts were expanded to functional imaging techniques with tracers, such as the octreotide scan77,78,79.

We also envision a role for machine intelligence in supporting radiologists in the differentiation of atypical pancreatic NETs from adenocarcinoma. Pancreatic adenocarcinoma is an exocrine malignancy of the epithelioid ductal cells that often confers a poor prognosis owing to delays in diagnosis80,81. Although pancreatic NETs are usually distinguishable from adenocarcinomas on CT by their vascularity pattern and absence of ductal dilation, a hypovascular enhancement pattern is not infrequent in atypical variants63,67,68. To date, statistical approaches utilizing histogram analysis on CT images have yielded conflicting findings in terms of the robustness of features used for differentiation, including entropy, kurtosis and skew82,83. Future AI studies could focus on combining imaging information with clinical data (such as laboratory tests) for increased accuracy.

Broadly, studies in the field of pancreatic imaging have utilized deep neural networks to improve workflow by carrying out automatic segmentation of pancreatic lesions, a process ordinarily complicated by the irregular contours and difficult anatomy of the pancreas84,85,86,87,88. In addition, several studies used advanced learning techniques for classification in exocrine pancreatic cancer and precursor lesions, with encouraging findings89,90. For example, one exploratory study with a small dataset used a mix of supervised and unsupervised learning techniques for the classification of pancreatic cystic neoplasms on MRI. We highlight the paper’s use of unsupervised methods, in which a k-means algorithm is trained to cluster pancreatic precursor lesions on unlabelled MRI scans. Following this step, the machine-annotated scans are fed into a novel proportioning type support vector machine for final label adjudication91 (Table 1). Potential also exists here to eventually adapt such unsupervised models for the automatic labelling of unstructured medical image data in order to reduce the pre-processing workload. This work is still exploratory, with only a modest (6–10%) improvement in diagnostic accuracy over prior unsupervised machine learning approaches; however, it nevertheless highlights the opportunity to improve on prior learning techniques in the field of pancreatic imaging to develop models that can be used clinically.
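A loose sketch of such a two-stage approach is shown below; a standard support vector machine stands in for the study’s novel proportioning classifier, and the imaging features are synthetic placeholders:

```python
# Two-stage sketch: unsupervised pre-labelling followed by supervised adjudication.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import SVC

rng = np.random.default_rng(5)
X = rng.normal(size=(120, 16))  # unlabelled MRI-derived features (placeholder)

# Stage 1: k-means annotates the unlabelled scans with cluster pseudo-labels
pseudo_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

# Stage 2: the machine-annotated data train a classifier for final adjudication
classifier = SVC().fit(X, pseudo_labels)
print(classifier.predict(X[:5]))
```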

Pituitary gland

Pituitary adenomas occur in approximately 10% of the population, although they are typically small and subclinical lesions that do not require treatment92,93. Clinical syndromes such as acromegaly or bitemporal hemianopia, for example, can result from tumour hormonal hypersecretion or tumour mass effect on surrounding structures94,95,96,97,98. In combination with clinical data, neuroimaging plays a vital role in informing pituitary tumour diagnosis, surgical planning and longitudinal monitoring99,100. MRI is generally the preferred imaging modality for the sellar region as it can provide exquisite detail of the neuroanatomy. A remarkable diversity of pathologies localize to the sellar region, including those of primary pituitary, local or distant origin.

Machine intelligence has been leveraged for a variety of diagnostic tasks that reflect the diversity of sellar lesions and have implications for treatment. For example, an early study utilized a three-layer feedforward artificial neural network (Table 1) with backpropagation (Box 1) for the differentiation of large suprasellar masses such as pituitary adenomas, craniopharyngiomas and Rathke cleft cysts101. This learning model used patient age together with MRI features to achieve excellent accuracy, which improved on the performance of both neuroradiologists and general radiologists101. Interestingly, upon assessment of expert confidence and misclassifications, the authors found that the AI model was most beneficial when used to identify cases where cystic degeneration occurred in pituitary adenomas101. Other models have been used for the differentiation of null cell adenomas from other non-functioning pituitary adenomas via machine learning-based radiomics signatures, albeit lacking expert radiologist comparison102. Accurate diagnosis of null cell adenomas is critical, as adjuvant radiotherapy has shown some benefit in this adenoma subtype but not in others, owing to an overriding risk of hypopituitarism. Deep learning is also gaining traction, with one study utilizing convolutional neural networks (Table 1) on multisequence MRI to differentiate pituitary adenomas from other sellar pathologies and healthy controls, with a performance accuracy of 97.0%, although this protocol is still in need of radiologist comparison5.

Robust pituitary tumour characterization at the time of diagnosis can also inform subsequent surgical planning. A variety of conventional machine learning and deep learning techniques have been used to evaluate macroadenoma consistency, with many models achieving good diagnostic performance on par with that of radiologists103,104,105. This preoperative finding can have surgical implications as soft adenomas are generally amenable to suction curettage upon a transsphenoidal approach, whereas the firm subtype is more difficult to resect and requires ultrasonic aspiration and often a staged transsphenoidal approach106,107. Other deep learning models have been used to preoperatively predict tumour invasion or cerebrospinal fluid leak, to inform surgical planning108,109.

Future machine learning directions should strive to enable the early detection of small pituitary lesions, possibly via automated lesion detection or improved diagnostic performance, as early clinical intervention can prevent the sequelae of worsening mass effect or protracted hormone hypersecretion. In terms of disease forecasting, we also see potential value in tools for the determination of appropriate patient follow-up periods for tumour surveillance, to reduce unnecessary scanning and promote efficient health-care utilization. To this end, studies could use longitudinal patient data gathered by automated segmentation and measurement of lesions over time and link those imaging features with clinical outcomes.

Thyroid gland

Thyroid cancer is the most common malignancy of the endocrine system, with an estimated 5-year prevalence of 4.6%110 (International Agency for Research on Cancer). Ultrasonography is the mainstay imaging modality for diagnosis, providing excellent visualization of nodules and guidance for potential biopsy acquisition. Many robust AI applications have emerged to characterize thyroid nodules owing, in part, to the ubiquity of data, as ultrasound scans are non-ionizing, fairly low-cost and increasingly portable110,111. Studies to date primarily explore the automatic segmentation and classification aspects of thyroid nodule diagnosis112,113,114,115,116,117. The primary utility of these models lies in their potential to inform decisions around whether to proceed with surveillance or fine-needle aspiration biopsy15. To date, many of the radiomics signatures for thyroid cancer developed by conventional machine learning approaches map to the five domains of the Thyroid Imaging, Reporting and Data System (TI-RADS, used by radiologists): echogenicity, echogenic foci, composition, shape and margin criteria118,119,120,121. These models support the robustness of the TI-RADS clinical imaging criteria; however, they also highlight a potential role for automated techniques in reducing inter-observer variability.

An abundance of deep learning models has also been developed to inform clinical decisions in patients with thyroid nodules, although a 2020 meta-analysis did not find a clear superiority over classic machine learning techniques or radiologists in terms of diagnostic accuracy122. Of course, interpretation of these pooled data is difficult as the deep learning models, sample sizes and clinical evaluation criteria vary substantially across studies. For example, one 2019 study with high volume data trained a convolutional neural network (Table 1) with images drawn from 312,399 ultrasound scans from 42,952 patients across multiple institutions; this model was found to outperform skilled radiologists (>6 years of experience) on external validation6. Although not all institutions will have access to high volume thyroid ultrasound scans, they can still implement a number of strategies to increase data availability. Here, we want to highlight one emerging strategy: the use of model pre-training with synthetic data creation via generative adversarial networks (Table 1). In fact, in the past year, the endocrine literature began to explore innovative ‘knowledge-guided’ approaches to data synthesis, using deep learning-extracted features from TI-RADS to assist the generative adversarial network in its generation of thyroid nodule images123.
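To convey the generative adversarial idea only, the following is a highly simplified sketch in which a generator and a discriminator compete on toy image patches; no TI-RADS knowledge guidance or real ultrasound data are involved:

```python
# Minimal generative adversarial network sketch on toy 28 x 28 'patches'.
import torch
import torch.nn as nn

G = nn.Sequential(  # generator: random noise -> synthetic patch
    nn.Linear(16, 128), nn.ReLU(),
    nn.Linear(128, 28 * 28), nn.Tanh(),
)
D = nn.Sequential(  # discriminator: patch -> real/fake logit
    nn.Linear(28 * 28, 128), nn.LeakyReLU(0.2),
    nn.Linear(128, 1),
)
bce = nn.BCEWithLogitsLoss()
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)

real = torch.rand(32, 28 * 28) * 2 - 1  # stand-in for real image patches

for step in range(100):
    # Discriminator step: tell real patches apart from generated ones
    fake = G(torch.randn(32, 16)).detach()
    loss_d = bce(D(real), torch.ones(32, 1)) + bce(D(fake), torch.zeros(32, 1))
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()

    # Generator step: produce patches that fool the discriminator
    fake = G(torch.randn(32, 16))
    loss_g = bce(D(fake), torch.ones(32, 1))
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()

print(G(torch.randn(1, 16)).shape)  # one synthetic (flattened) patch
```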

It is not clearly established how the benefits of machine intelligence systems for improving diagnostic accuracy will ultimately translate to the clinical setting. Overall, the literature suggests that these systems can achieve non-inferior performance to that of experienced radiologists (experience varies, typically 5–20 years)122,124. These algorithms do tend to outperform less-experienced radiologists and might therefore play a valuable supporting role, particularly in low-resource settings, where access to experts could be constrained125,126,127. Compared with the small cohort of models actively utilized in the clinical setting, radiologists seemingly retain a slight edge on various indicators of performance in individual studies, although pooled overall metrics are comparable122,128. A centralized inventory to actively track these diagnostic algorithms in clinical use would improve performance auditing and algorithm stewardship. Looking ahead, the field is already heading towards 3D detection and reconstruction in thyroid ultrasonography, which might power more robust analytics117. Another challenge moving forward will be mitigating the risks of excessive intervention in thyroid cancers as detection improves, given that many slow-progressing or early-stage cancers will remain subclinical. Possible solutions here include linking imaging algorithms to pathology reference standards as well as to longitudinal outcomes data for improved risk stratification129.

Facial recognition

Interestingly, a number of computer vision applications of facial recognition software have been developed to identify stereotyped facial features induced by hormonal excess130. A positive identification of characteristic facial features could indicate a number of pathologies, including an underlying endocrine tumour. The process is similar to the radiomics workflow, except that facial landmark tagging occurs in lieu of segmentation during image pre-processing.

Acromegaly can result in facial manifestations such as frontal bossing, sunken nasolabial folds, prominent zygomatic arch and enlarged jaw, often due to an underlying pituitary somatotrophic macroadenoma. Both machine learning and deep learning approaches have been used to craft models to identify these stereotyped facial features, with a performance comparable to that of acromegaly specialists and exceeding that of general internists131,132. Stereotyped features can also occur in Cushing syndrome, such as facial plethora, hirsutism, acne and cervical fat pad, owing to increased cortisol. Initial pilot studies using machine learning are limited by small cohorts and demonstrate variable performance on retrospective validation (accuracy range 62.8–85%)133,134. Limitations in models to date include poor visualization of facial features and potential entrenched bias due to racial and gender homogeneity in training data131,132,133,134. Diversifying training data, obtaining metrics of bias and documenting bias assessments in these facial recognition software applications are critically important to avoid replicating current racial and gender disparities in the care of Cushing syndrome and acromegaly, which manifest as poor outcomes and delays in diagnosis, respectively135,136.

Clinical evaluation

The metrics used for clinical assessment in AI currently lack standardization, which undermines the smooth integration of AI into the health-care system (Tables 2 and 3). Many computer vision studies in endocrine cancer imaging lack robust validation, which poses inherent limitations in terms of reproducibility. First, the lack of consistent reference standards (including biopsy, stable imaging and clinical criteria) for common clinical questions in machine intelligence for tumour imaging diagnostics can undermine the ability to establish a ground truth for comparison across studies. Furthermore, no consensus exists on definitions of high versus low experience levels in radiologists; however, the endocrine cancer computer vision literature generally trends towards a minimum of more than 5 years of clinical practice as indicative of a high level of expertise. Next, separation of training and testing datasets is critically important and cross-validation alone is not adequate for evaluating clinical performance. At a minimum, studies should be validated on external datasets, ideally with prospective studies, which are less prone to selection bias.
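The following sketch illustrates this validation principle on synthetic placeholder data: the model is fitted on an ‘internal’ dataset and its sensitivity, specificity and area under the curve (AUC) are reported only on a fully held-out ‘external’ set:

```python
# External validation sketch with synthetic placeholder data.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix, roc_auc_score

rng = np.random.default_rng(6)
X_internal, y_internal = rng.normal(size=(200, 8)), rng.integers(0, 2, 200)
X_external, y_external = rng.normal(size=(80, 8)), rng.integers(0, 2, 80)  # never used in training

model = LogisticRegression().fit(X_internal, y_internal)

pred = model.predict(X_external)
tn, fp, fn, tp = confusion_matrix(y_external, pred).ravel()
print("sensitivity:", tp / (tp + fn))
print("specificity:", tn / (tn + fp))
print("AUC:", roc_auc_score(y_external, model.predict_proba(X_external)[:, 1]))
```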

Table 2 Key evaluation metrics for computer vision applications in medicine
Table 3 Major studies in AI imaging for endocrine cancer diagnostics

To improve the quality of research, a number of guidelines for reporting in computer vision studies in medicine have been developed137,138,139. Moving forward, the development of performance profiles for any high-fidelity model classes or software packages for standard benchmarking might also be helpful, while at the same time acknowledging that, often, many ways exist to accomplish the same task with machine learning. Importantly, for those algorithms intended for autonomous clinical use, multicentre randomized trials might also be indicated to qualify their performance in integrated settings. Long-term monitoring of efficacy and bias across the algorithm life cycle is also indicated, particularly in cases of continuous learning (Box 1) where algorithms continually update to reflect new data.

Interpretability

Decoding AI for physicians can mitigate uncertainty that could undermine trust in machine intelligence140,141,142. Broadly speaking, interpretability strategies come in multiple flavours, being either specific or agnostic to a given model class and assessing function at either a global or local level143. Global interpretations seek to offer holistic depictions of model behaviour and focus on illuminating trends in the data that are most important to classification. Local interpretations focus on explaining individual model prediction instances. The intent of these strategies is to reassure endocrinologists and radiologists that the model is basing its decisions on what it should be looking at, often by way of visualizations or text. These explanations are generated using several techniques, including feature importance to highlight salient features, counterfactual examples of model predictions for a given input or decision rules that describe the logical flow of the model143,144.

Feature attribution strategies are quite popular and include colour mapping145, an interpretability technique that highlights regions of the medical image that influence the model decision. Other feature attribution methods include surrogate strategies146, which use simpler models to explain the behaviour of more complex models. In the oncologic endocrinology literature, one form of colour mapping, known as saliency mapping, has been demonstrated in thyroid nodule classification to illustrate model behaviour147. Other studies have utilized both gradient mapping and surrogate modelling techniques to highlight feature importance in the segmentation of brain tumours on MRI and abdominal CT, with the potential for future use in sellar, pancreatic and adrenal diagnostics148,149,150. Finally, image similarity feature attribution strategies have also been applied to computer vision models for thyroid cancer. This technique displays a similar image linked to a classification as an explanation for the user, often with a superimposed gradient mapping to illuminate any respective discrepancies in regions of importance151. Of note, textual explanations are less common; however, they have been utilized in breast MRI and pelvic X-ray imaging to generate descriptive semantic outputs152,153. Interestingly, a combined approach with both saliency maps and textual explanations was shown to be better received by a small group of physicians153. Future efforts should strive to develop standardized metrics for evaluating the performance of interpretability models to ensure their effective and reproducible knowledge translation to the clinical setting.
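As a minimal sketch of gradient-based saliency mapping, the following example computes the gradient of a placeholder model’s top class score with respect to the input pixels; the resulting map could then be overlaid on the scan as a colour map:

```python
# Gradient saliency sketch: which pixels most influence the model decision?
import torch
import torch.nn as nn

model = nn.Sequential(nn.Flatten(), nn.Linear(64 * 64, 2))  # stand-in classifier
model.eval()

scan = torch.randn(1, 1, 64, 64, requires_grad=True)  # placeholder medical image
score = model(scan)[0].max()                          # score of the predicted class
score.backward()                                      # back-propagate to the pixels

saliency = scan.grad.abs().squeeze()  # per-pixel importance map
print(saliency.shape)                 # overlay this map on the scan for review
```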

Data availability

Abundant medical imaging data are needed to develop clinically meaningful deep learning models for non-invasive endocrine cancer diagnostics that are capable of generalizing to a variety of clinical settings. In this section, we discuss techniques to increase the availability of data, which helps to prevent overfitting in AI models.

Open data curation

A lack of high volume, quality data impedes the development of robust AI in endocrine cancer diagnosis. One strategy to improve data for use by machine learning models is improved sharing of existing data via the creation of open databases. The ongoing coronavirus disease 2019 pandemic has highlighted the role of open science in enabling timely advances in medical research, a movement that we should strive to foster outside of crisis scenarios154,155,156. In terms of medical imaging specifically, open data can be used either for training and development of models or as external test sets. Examples include the US National Institutes of Health Cancer Imaging Archive and the UK Biobank, the latter of which expanded its archive in 2020 to include an imaging database with pan-MRI and DXA scans from >5,000 patients.

Automated workflow

AI integration can be targeted through automated pipelines (including XNAT or DICOM Image Analysis and Archive) that can reduce latency in data retrieval through improved integration with existing health-care infrastructure157 (Fig. 4). These tools can uncouple imaging data in Picture Archiving and Communications Systems from protected health information following image acquisition, for use by AI models in near-real-time processing. From there, imaging findings can be conveyed using automatic workflow interfaces connected with the electronic medical record as a central hub for coordination among endocrinologists, radiologists and other care team members. Looking ahead, we envision the deployment of these automated workflow pipelines to facilitate real-time analytics that endocrinologists can access rapidly at the bedside via smartphone-based image viewing platforms or portable imaging devices.
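As a toy sketch of one step in such a pipeline, the following example uses the open-source pydicom library to blank basic patient identifiers before passing pixel data onward; the file path is hypothetical and a compliant implementation would need to scrub far more metadata than shown:

```python
# Toy de-identification step before AI processing (not a complete PHI scrub).
import pydicom

ds = pydicom.dcmread("study.dcm")  # hypothetical DICOM file from the PACS
ds.PatientName = "ANONYMIZED"      # blank obvious protected health information
ds.PatientID = ""
ds.PatientBirthDate = ""

pixels = ds.pixel_array            # image matrix handed to the AI model
print(pixels.shape)
ds.save_as("study_deidentified.dcm")  # forward to the analytics queue
```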

Fig. 4: Real-time analytics with automatic picture archiving and communications systems integration.

The system named DICOM Image Analysis and Archive (DIANA) is an automated workflow solution developed by the authors’ group that provides a programming interface with the hospital picture archiving and communications systems (PACS) to streamline clinical artificial intelligence (AI) research176. DIANA has facilitated near-real-time monitoring of acquired images, large data queries and post-processing analyses. More importantly, DIANA is integrated with the machine learning algorithms developed for various applications. The future goal is to integrate AI endocrine cancer diagnostics (such as adrenal adenoma and pituitary adenoma) in this or other systems. HTTP, hypertext transfer protocol; PHI, protected health information. Figure 4 is adapted from ref.176, Springer Nature Limited.

Data augmentation and transfer learning

Data pre-processing and model pre-training techniques can also be used to engineer workaround solutions to limited imaging data in order to improve AI model generalizability32. Data augmentation is a process that distorts the training images, for example, via rotation or flipping, to generate synthetic data for oversampling158,159. Another popular option in computer vision for addressing small sample sizes involves pre-training the model with a large and diverse image set to transfer preliminary weights to nodes in the network, after which fine-tuning of the model is performed using the target data160,161. Although these augmentation and transfer learning (Box 1) methods are now becoming staples in medical image informatics research, they were not used in a number of the endocrine cancer studies that we reviewed. Looking ahead, we anticipate that improved uptake of these methods will promote deep learning breakthroughs, particularly in the case of rare neoplasms with limited availability of imaging data, such as those of the adrenal gland and endocrine pancreas.
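The following sketch combines both workarounds under illustrative assumptions: torchvision transformations generate augmented variants of each training image, and an ImageNet-pretrained network is fine-tuned by replacing only its final layer:

```python
# Data augmentation plus transfer learning sketch with torchvision.
import torch
import torch.nn as nn
from torchvision import models, transforms

augment = transforms.Compose([  # synthetic variants of each training image
    transforms.RandomHorizontalFlip(),
    transforms.RandomRotation(10),
    transforms.ToTensor(),
])

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)  # transferred weights
for param in model.parameters():
    param.requires_grad = False  # freeze the pretrained layers
model.fc = nn.Linear(model.fc.in_features, 2)  # new head for the target task

# Fine-tuning updates only the new head on the (augmented) target data
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
print(model.fc)
```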

Alternative computing platforms

The organization of network servers used to access, store and transfer data can influence AI model training and development (Fig. 5). In this section, we draw attention to how exploratory computing frameworks might be leveraged to improve the quality of AI applications for endocrine cancer diagnostics.

Fig. 5: Exploring alternative computing platforms.

Centralized, distributed, decentralized and quantum computing frameworks are shown. The centralized network panel has a node with spokes spreading outward that represents a single, consolidated platform such as a local (on-site data centre) or remote (Cloud) server. The distributed network panel shows a net-like pattern of equally spaced nodes and such a platform with multiple local servers or devices can be used for collaborative model training techniques like cyclic weight transfer. The decentralized network panel has multiple centralized nodes connected in a net-like pattern and federated learning is one training paradigm that uses this platform. Previous studies177,178 depicted quantum networks as two nodes with the cutout region between nodes illustrating the induction of dependent quantum states among two particles (A and B, where S refers to a shared source of squeezed light) and this particle ‘entanglement’ is at the crux of quantum communications. Adapted from ref.177, Springer Nature Limited. The quantum network is reprinted from ref.178, CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/).

Decentralized or distributed

Information technology infrastructures are trending towards Cloud computing (Box 1) solutions that consolidate data operations within a central server. However, computing platforms with diffuse servers are now being explored to circumvent data-sharing issues associated with centralized data that pose barriers to the multi-institutional, collaborative training of AI models. Distributed networks process data diffusely across local nodes, whereas decentralized platforms operate as collectives of nodal clusters (Fig. 5; Box 1).

We highlight the potential of an emerging decentralized training paradigm known as federated learning (Box 1), which has already been utilized to develop deep learning models for diabetic retinopathy and breast cancer diagnosis162. Federated learning uses distributed servers across multiple institutions for parallel model training, and model updates are subsequently loaded onto a central server to develop an ensemble model. Distributed learning techniques, such as cyclic weight transfer, can conduct this process across local servers in series, using one model passed from institution to institution over the course of training163. Importantly, these techniques do not require inter-institutional patient data transfer or co-location. We can similarly envision a role for decentralized and distributed techniques in bypassing current barriers to data sharing and availability to enable deep learning applications in oncologic endocrinology, particularly in rare cancers. However, a notable limitation of current federated learning techniques is that the diversity of the data is only as robust as that of the collaborating institutions. Still, past efforts have yielded deep learning models with impressive performance on par with those trained on shared multi-institutional datasets162,163,164.
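A minimal sketch of the federated averaging idea follows; the three ‘institutions’, their data and the single-layer model are toy placeholders, and only model weights, never patient data, reach the central server:

```python
# Federated averaging sketch: local training, central weight aggregation.
import copy
import torch
import torch.nn as nn

global_model = nn.Linear(8, 2)
institutions = [(torch.randn(50, 8), torch.randint(0, 2, (50,))) for _ in range(3)]

for rnd in range(5):
    local_states = []
    for X, y in institutions:  # parallel local training on each institution's data
        local = copy.deepcopy(global_model)
        opt = torch.optim.SGD(local.parameters(), lr=0.1)
        loss = nn.functional.cross_entropy(local(X), y)
        opt.zero_grad(); loss.backward(); opt.step()
        local_states.append(local.state_dict())
    # Central server averages the weight updates into an ensemble model
    avg = {k: torch.stack([s[k] for s in local_states]).mean(0) for k in local_states[0]}
    global_model.load_state_dict(avg)

print(global_model.weight.shape)
```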

Quantum

Other breakthroughs in machine intelligence in medicine will come with shifts in computing frameworks that can enhance model training and efficiency. Quantum computing (Box 1) represents one emerging prospect that would leverage the physical properties of atomic and subatomic particles to enhance processing power, algorithm performance and data transfer165. Quantum computers can theoretically support the simultaneous, parallel-path processing of data to create shortcuts that might outperform conventional computing165 (Fig. 5). Encouraging scientific breakthroughs over the past 5 years have demonstrated ‘quantum supremacy’ in terms of problem-solving capabilities over conventional computing, although these findings are still very much exploratory166. However, some experts anticipate that usable quantum computing could arrive as early as within the next few decades167.

Conclusions

Machine intelligence continues to gain traction in oncologic endocrinology for its potential to enable robust non-invasive diagnostics. However, for these technologies to take hold, adherence to both consensus reporting standards and evaluation criteria for AI image interpretation is required, which will enable meaningful cross-study comparisons. Although several such AI guidelines have been established137,138,139, a lack of harmonization impedes their widespread uptake. Another challenge will be facilitating the smooth movement of these technologies into the clinical setting so that physicians embrace their use. Clarity at the federal and institutional levels is urgently needed in terms of developing longitudinal performance auditing, medicolegal liability frameworks and guidance on reimbursements for clinical AI developers and medical institutions utilizing these technologies.

Another theme is how poor data availability continues to stymie the development of robust machine learning applications, particularly in rare endocrine cancers. Although access to medical imaging is improving through open data-sharing initiatives, we still note a relative paucity of endocrine cancer scans within these larger imaging databases. We encourage the creation of domain-specific imaging databases that can better enable AI for oncologic endocrinology purposes.

Collaborative learning strategies might also take centre stage, given their potential to circumvent data access issues without transferring personal health information. Future work on distributed computing paradigms also needs to consider how best to manage potential cyber risks and data security, as the surface area vulnerable to cyberattack increases with the number of participants. Digital health could also enable future breakthroughs, for example, via the correlation of radiomics findings with data from wearables or digital health applications168. Finally, the advent of smartphone image viewing platforms and automated workflows will bring the field closer to smooth, real-time analytics that can enable robust partnerships among endocrinologists, radiologists and AI.