Introduction

According to the American Cancer Society, an estimated 2 million new cancer cases will be diagnosed in the United States in 20231. Medical imaging in oncology is the reference standard for evaluating most cancers, in particular for lesion detection and staging, which underscores the need for general standards and guidelines in radiology to advance research in digital diagnosis. In radiomics, medical images play a key role not only in diagnosis, but also in monitoring tumor progression and development, as well as in assessing the response to therapy and the risk of relapse2,3. Throughout the present text, the term radiomics will be used to encompass both classic radiomics and advanced data analysis techniques based on Artificial Intelligence (AI), such as deep radiomics4,5.

Recent advances in medical imaging have shown the potential of quantitative image descriptors to emerge as noninvasive prognostic phenotypes and predictive biomarkers6,7. Medical imaging and virtual biopsy are noninvasive techniques in oncology that cover the whole tumor volume8, in contrast with genomics and proteomics, which rely on biopsies or invasive surgeries to analyze only a limited sample of tumor tissue that may not be representative of the whole lesion due to its heterogeneity6. Radiomics and radiogenomics have shown potential to overcome these limitations7,9,10, but several challenges must be addressed before they can be integrated into clinical workflows. Interestingly, radiomics is not exclusive to oncology and can be applied to a wide range of medical imaging modalities, including magnetic resonance imaging (MRI), computed tomography (CT), ultrasound, positron emission tomography (PET), and single-photon emission computed tomography (SPECT)5,11,12.

The promise of radiomics lies in its potential for noninvasive automated evaluation of medical images. The price is the standardization of the different workflows in image acquisition, preprocessing, annotation, anonymization, metadata handling, and storage. Here we present an overview of current methods in preprocessing and harmonization alongside the limitations of radiomics. Furthermore, we propose guidelines to facilitate standardization and outline future prospects in the field of medical imaging.

Medical imaging beyond the hospital’s four walls: limitations

The translation of computer vision advances into clinical practice is currently being delayed due to the lack of standardization and harmonization of radiology clinical protocols and workflows13, a well-known problem14 that calls for a unified approach with the engagement of the industrial sector in the field of radiomics. The potential of P5 medicine (predictive, preventive, personalized, participatory, psycho-cognitive)15 to revolutionize the state of the art in medical imaging requires a paradigm shift from individual to collective standards, particularly in data collection and preprocessing. This shift will also enable the transition of research from retrospective studies to clinical applications.

Several reviews of publications discussed by16 reveal that most current machine learning models are far from being ready for real-world clinical deployment. These models lack sufficient reproducibility, rigorous validation, generalizability to external datasets, and robustness to translate to clinical practice.

There is wide variability between manufacturers, which implement distinct reconstruction algorithms, and institutions, which utilize different reconstruction parameters that may also be customized for each patient17. The implementation of standard scanning protocols across institutions would satisfy the urgent need for consistency in acquisition parameters. Orlhac et al.18 showed in CT that scanner parameters such as reconstruction kernel or slice thickness influence radiomics texture features. Moreover, Son et al.19 showed that similar CT protocols and the same slice gaps in data from different hospitals lead to improved performance of machine learning algorithms. Rizzo et al.17 proposed identifying and excluding radiomic features highly influenced by the acquisition and reconstruction parameters; however, this solution may limit the power of radiomics analyses. Image quality is another factor that impacts the performance of radiomics systems, particularly if the equipment has become obsolete compared to modern devices16. If the images come from different sources (manufacturers, hospitals), a similar distribution of “positive” and “negative” cases needs to be ensured to train an AI algorithm16. Moreover, preprocessing steps like filtering, resampling and morphological image processing also have an impact on radiomic features, as depicted in Fig. 1, that remains to be further investigated20. Finally, for AI systems, data augmentation should not alter the images in a way that makes the underlying biological or tissue properties implausible16.

Fig. 1

Effect of different preprocessing steps on the same nodule and the corresponding histograms calculated for the nodule mask: (A) mediastinal window, (B) lung window (a.u. refers to arbitrary units).
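As an illustration of how such preprocessing choices can be made explicit and reproducible, the following is a minimal sketch, assuming the SimpleITK library, of resampling a volume to isotropic voxel spacing before feature extraction; the target spacing and interpolator shown here are illustrative choices rather than a recommended protocol.

```python
import SimpleITK as sitk

def resample_isotropic(image: sitk.Image, new_spacing=(1.0, 1.0, 1.0)) -> sitk.Image:
    """Resample a volume to a fixed voxel spacing so that texture features are
    computed on a comparable grid across scanners and protocols."""
    original_spacing = image.GetSpacing()
    original_size = image.GetSize()
    # New grid size that covers the same physical extent at the target spacing.
    new_size = [
        int(round(size * spacing / target))
        for size, spacing, target in zip(original_size, original_spacing, new_spacing)
    ]
    resampler = sitk.ResampleImageFilter()
    resampler.SetOutputSpacing(new_spacing)
    resampler.SetSize(new_size)
    resampler.SetOutputOrigin(image.GetOrigin())
    resampler.SetOutputDirection(image.GetDirection())
    resampler.SetInterpolator(sitk.sitkBSpline)  # a segmentation mask would use nearest neighbor
    return resampler.Execute(image)
```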

There have been some attempts in the literature to provide guidelines for preprocessing medical images. Van Timmeren et al.11 enumerate some of the necessary steps before radiomic feature extraction, such as interpolation, normalization and discretization. However, the authors highlight that many questions regarding these steps remain open. Aerts et al.6 performed radiomics analysis on the raw imaging data (before the images are reconstructed), without any preprocessing or normalization, yet a strong dependence of their radiomic signature on tumor volume was later revealed by21.
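As a concrete example of one such open question, the two common intensity discretization strategies can be sketched as follows; this is a minimal illustration in Python/NumPy, where the bin width and bin count are arbitrary example values and the choice between the two strategies depends on whether the modality has a calibrated intensity scale.

```python
import numpy as np

def discretize_fixed_bin_width(roi_values: np.ndarray, bin_width: float = 25.0) -> np.ndarray:
    """Fixed bin width: bins of constant physical size (e.g., in HU for CT),
    so bin edges remain comparable across patients and scanners."""
    return np.floor((roi_values - roi_values.min()) / bin_width).astype(int) + 1

def discretize_fixed_bin_count(roi_values: np.ndarray, n_bins: int = 64) -> np.ndarray:
    """Fixed bin number: the intensity range of each ROI is split into the same
    number of bins, often preferred for arbitrary-unit modalities such as MRI."""
    edges = np.linspace(roi_values.min(), roi_values.max(), n_bins + 1)
    return np.digitize(roi_values, edges[1:-1]) + 1
```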

Recently, the ComBat harmonization technique has been applied to several medical imaging modalities, such as lung cancer CT datasets22. ComBat harmonization is a batch-effect correction23 that aims to suppress batch effects by standardizing the means (location) and variances (scale) of each feature across batches24,25. The algorithm is based on an empirical Bayes approach, originally developed for genomics data26 and later applied to reducing radiomics variability in PET27 and CT18. There are other variations of the algorithm, such as longComBat24, developed for longitudinal data. Overall, ComBat is intended to harmonize radiomic features, thereby minimizing the impact of different acquisition protocols on radiomic feature extraction, which is particularly useful for retrospective studies, where it would be impractical (or even impossible) to re-image patients under a controlled imaging protocol22. Ligero et al.25 applied ComBat considering different sources of variance as batches: manufacturer-dependent convolution kernel, slice thickness, and the combination of both. Their results showed that ComBat correction minimized radiomics data variability regardless of differences in CT acquisition protocols25. In the study by22, ComBat harmonization proved effective in harmonizing radiomic features extracted from different imaging protocols, although the authors underline that its effect on imaging-feature-based predictive models requires further investigation22. In fact, research is underway to analyze the power of ComBat harmonization in multicenter studies across various imaging modalities; for example28 studied ComBat harmonization on PET/MRI and PET/CT for radiomics-based tissue classification. Furthermore, ComBat is generalizable to other imaging modalities as it makes no assumptions about the origin of the site effects23.
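To make the location/scale idea concrete, the following is a deliberately simplified sketch of batch-wise harmonization of a radiomic feature table. Unlike ComBat itself, it omits the empirical Bayes shrinkage of the batch estimates and any biological covariates of interest, so it should be read as an illustration of the principle rather than a substitute for the published algorithm.

```python
import pandas as pd

def simple_location_scale_harmonization(features: pd.DataFrame, batch: pd.Series) -> pd.DataFrame:
    """Center and scale each feature within each batch (e.g., scanner or protocol)
    and map it back to the pooled grand mean and standard deviation."""
    harmonized = features.copy()
    grand_mean = features.mean(axis=0)
    grand_std = features.std(axis=0)
    for b in batch.unique():
        idx = batch == b
        batch_mean = features.loc[idx].mean(axis=0)
        batch_std = features.loc[idx].std(axis=0).replace(0, 1.0)  # guard against constant features
        harmonized.loc[idx] = (features.loc[idx] - batch_mean) / batch_std * grand_std + grand_mean
    return harmonized
```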

The previous examples illustrate the need for general guidelines for medical image preprocessing in computer vision tasks.

The clinical utility of an algorithm relies heavily on the quality of the reference standard used in its training and evaluation16. Reference standards based on radiologists’ opinions are subjective, especially if established by a single expert, and should therefore be replaced whenever possible by objective reference standards, such as diagnostic tests and pathologic evaluation of biopsies or excised lesions, or patient survival and time-to-progression for shorter-term reference standards16.

There are several standardization initiatives and imaging protocols investigating the homogenization of image biomarkers and radiomic features, such as the Image Biomarker Standardization Initiative (IBSI)29, the Quantitative Imaging Network of the National Institutes of Health (QIN)30, the Quantitative Imaging Biomarkers Alliance (QIBA)31, and the European Imaging Biomarkers ALLiance (EIBALL)32, among others. Harmonization of the extraction and validation of robust radiomic features is essential to achieve reliable and reproducible results18,33,34,35, although it does not address systematic variations between patient subpopulations16. The range of different standardization initiatives shows the need for consensus among the radiomics research community on joint standards.

Radiomic signatures are intrinsically data driven, which poses several challenges, as the high volume of features is susceptible to overfitting and overinterpretation of the derived models10. The development of radiomic signatures is significantly affected by underlying dependencies between radiomic features, redundancies and multicollinearity, as outlined by33. Machine learning algorithms can be effective in identifying unexpected effects, such as volume-confounding features34,35. Lately, the lack of biological meaning of current high-throughput agnostic radiomic analyses has raised concerns. Tomaszewski and Gillies10 emphasize the need to support radiomics with biological validations to gain insight into the causal relationships between the features and the outcomes.
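A simple and widely used mitigation of such redundancy is to discard one member of each pair of highly correlated features before modeling. The sketch below illustrates this greedy filtering step; the correlation threshold is an illustrative value, and this step alone does not resolve every form of multicollinearity.

```python
import numpy as np
import pandas as pd

def drop_highly_correlated(features: pd.DataFrame, threshold: float = 0.95) -> pd.DataFrame:
    """Remove one feature from each pair whose absolute Pearson correlation
    exceeds the threshold, keeping a single representative."""
    corr = features.corr().abs()
    # Keep only the upper triangle (excluding the diagonal) to inspect each pair once.
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    to_drop = [col for col in upper.columns if (upper[col] > threshold).any()]
    return features.drop(columns=to_drop)
```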

Most published radiomics studies lack independent validations of their signatures beyond a single external test set10, which is insufficient for their deployment in clinical practice. Independent validations of radiomic signatures across different cohorts and multiple institutions are hindered by the lack of standardization in medical imaging, although36 have already proposed an approach for distributed radiomics. Therefore, to achieve generalization and robustness of radiomic signatures, further efforts are required to homogenize image acquisition and preprocessing18, in addition to controlling for the effect of potential confounders35.

Another aspect that hinders the translation of radiomics and AI tools to clinical practice is the black-box nature of most current deep learning systems. In Europe, the General Data Protection Regulation establishes that individuals have the right to receive a clear and understandable explanation of how artificial intelligence is being used to make decisions that directly affect them37. Explainable AI is essential to gain the trust of physicians and understand the reasons behind a prediction or decision16. Besides, interpretability can detect biases and problems such as unbalanced data, and explainable models are more robust against adversarial attacks38. Post-hoc explanations like saliency maps are insufficient to fully explain why and how the features are connected and weighted to identify the target lesion. The explanations provided should align with medical knowledge or be supported by clinical evidence16. In this regard,39 introduced the Co-12 properties, a high-level decomposition of explanation quality into aspects such as completeness, correctness, and compactness. A promising alternative to black-box AI algorithms are part-prototype models, which are explainable by design. In this field, PIP-Net (Patch-based Intuitive Prototypes Network), proposed by40, opens up a new field of research for explainable AI in medical imaging.
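For reference, a vanilla gradient saliency map, one of the simplest post-hoc explanations criticized above, can be computed in a few lines. The sketch assumes a PyTorch classifier that takes a single CHW image, and is meant only to illustrate how little such a map says about how features are actually combined and weighted.

```python
import torch

def vanilla_gradient_saliency(model: torch.nn.Module, image: torch.Tensor, target_class: int) -> torch.Tensor:
    """Per-pixel saliency as the absolute gradient of the target class score
    with respect to the input image."""
    model.eval()
    image = image.clone().requires_grad_(True)
    score = model(image.unsqueeze(0))[0, target_class]  # scalar class score
    score.backward()
    return image.grad.abs().max(dim=0).values  # collapse the channel dimension
```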

The shortage of datasets large enough to train and externally validate radiomic signatures in prospective multi-center studies also affects medical AI devices41. Several of the devices approved by the FDA for diagnostic use were trained on small datasets from a single center or from only two centers42. These algorithms are prone to biases and lack generalizability outside the site where they were trained.

Public databases provide free validation datasets to the medical imaging community; however, as argued by16, the QA process for data in a public database is often overlooked. For example, the well-known LIDC-IDRI dataset43 includes the manufacturer in the DICOM metadata, but not demographic information such as patient age or gender44, which can lead to unexpected biases when developing radiomics and machine learning models.
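A lightweight QA step for such public or in-house collections is to audit which acquisition and demographic fields are actually populated before the data are used. A minimal sketch with the pydicom library is shown below; the particular set of fields checked is illustrative.

```python
import pydicom

def audit_dicom_metadata(path: str) -> dict:
    """Report which acquisition and demographic attributes are present in a
    DICOM file, returning None for any field that is missing."""
    ds = pydicom.dcmread(path, stop_before_pixels=True)
    fields = [
        "Manufacturer", "ManufacturerModelName", "SliceThickness",
        "ConvolutionKernel", "PatientAge", "PatientSex",
    ]
    return {name: getattr(ds, name, None) for name in fields}
```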

As outlined by16, even if a hospital could use a vendor computer-aided diagnosis (CAD) AI tool trained with multi-institutional data and approved for clinical use, its performance in the local population may not match the vendor’s specifications. Hence, the hospital would have to evaluate the performance of the CAD-AI tool on its own patients in an adjustment phase, gaining a deeper understanding of the CAD-AI performance in the local setting, while reducing unrealistic expectations and improper use of the CAD-AI tool16.

To ensure data availability, accessibility and reusability, radiomic signatures demand stability and reproducibility across different hospitals, scanners and acquisition protocols, that is, the adoption of the FAIR principles, as described by45, in a manner that preserves patient privacy13. Data collection must also conform to the ethical considerations and legal framework of the country in which the data were obtained16. Standardization extends to validation and evaluation criteria, providing guidelines and contrasted metrics to reduce bias and the overly optimistic results that hide the lack of generalization of models developed under highly restrictive data conditions and with insufficient reporting11. The Radiomic Ontology project46 provides a Python library for FAIR radiomics analysis, which aims to facilitate the transfer of research efforts to clinical practice.

Despite the mentioned efforts, it is important to note that a consolidated standard in the field of radiomics is still far from being established.

Towards standardization: guidelines

At the moment, there are several public databases available with medical images, such as The Cancer Imaging Archive47 or NeuroVault48. However, the absence of standardization in the format of these databases (i.e., interoperability) hinders the simultaneous use of different data sources in the same machine learning algorithm13. Thus, the change of paradigm from visual assessment of medical images to computer-aided evaluation demands methodological standardization of the workflows in medical imaging, as proposed in Fig. 2. This standardization should implement the FAIR principles to the extent that the requirements arising from the nature of medical images (de-identification, security) allow.

Fig. 2

The nine stages of reaching standardization and making medical imaging data as FAIR as possible.

Data collection is a crucial step in creating computer vision models and involves different agents within the hospital: radiologists, technicians, nurses, general practitioners, etc. Data interoperability is vital to facilitate research and multicenter studies; therefore, all agents involved in data collection should be made aware of methodological standards once these are adopted. We believe radiologists will play a key role in ensuring the correct application of standards and the effective adoption of protocols. There are two levels at which standardization of the workflows in image analysis should be implemented: software (consistency of technical implementation among scanners and manufacturers) and human interaction (coherence between different observers and practitioners)5.

At the human interaction level, we identify two levels at which radiological studies should be labeled: study level (e.g. brain MRI FLAIR sequence, chest radiography AP, etc.) and pathology level (e.g. tumor, benign nodule, etc.). Study-level labeling relies on the work of technicians and nurses, who are responsible for the correct categorization of the data according to the type of study modality they have performed. Hence, for study-level labeling, the Series Description parameter in DICOM should correctly state the type of study modality that was carried out. Ultimately, the labeling at study level should be incorporated into the DICOM Study Description and Series Description fields, according to the RadLex lexicon49 standard. It is therefore essential that these fields are homogenized for each DICOM file across all hospitals and scanners. In addition, the pathology-level labeling should be incorporated into the structured report50.
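As a sketch of how study-level labeling could be checked automatically, the snippet below validates the DICOM SeriesDescription against a controlled vocabulary; the vocabulary shown is hypothetical, and in practice it would be derived from the RadLex lexicon rather than hard-coded.

```python
import pydicom

# Hypothetical controlled vocabulary of study-level labels, for illustration only;
# a real deployment would map SeriesDescription values to RadLex terms.
ALLOWED_SERIES_DESCRIPTIONS = {"MR BRAIN FLAIR", "CR CHEST AP", "CT CHEST WO CONTRAST"}

def validate_study_level_label(path: str) -> bool:
    """Return True if the SeriesDescription matches the controlled vocabulary."""
    ds = pydicom.dcmread(path, stop_before_pixels=True)
    description = str(getattr(ds, "SeriesDescription", "")).strip().upper()
    return description in ALLOWED_SERIES_DESCRIPTIONS
```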

In software, we believe that manufacturers’ involvement in the standardization process is essential, as they are in charge of bringing the latest technology to the clinic. To ensure their engagement, we propose that all leading radiological societies join forces to request the implementation of the necessary technology from the manufacturers. In particular, we acknowledge that the standardization of MRI protocols for MRI-based radiomics is a challenge51, due to the inherent versatility of this imaging modality. The experience reported by52 started with a systematic inventory of MRI technology and personnel. They proposed the creation of a committee of stakeholders (radiologists, MRI physicists, technologists and scientists) committed to establishing and maintaining a standardized imaging strategy, with annual protocol reviews. In their conclusions52, they demanded better remote connectivity to MRI systems and automation of exam acquisition, from protocol selection and configuration to parameter modification. In other medical imaging modalities, such as radiography or CT53,54,55, the same process as in MRI could be followed, automating exam acquisition and parameter selection based on the patient’s characteristics.

We propose the following guidelines to ensure the generalization of radiomics systems (a minimal metadata record illustrating the first three items is sketched after the list):

  • Medical imaging datasets should always incorporate metadata information about the manufacturer and the acquisition protocol.

  • Datasets’ anonymization processes should retain demographic information (e.g., age, gender, comorbidities, ethnicity) to avoid biases, as long as the patient cohort is large enough to ensure de-identification.

  • Datasets that include segmentations should provide metadata stating whether the segmentation was performed manually; otherwise, the automatic or semiautomatic method used should be described, including the values of internal parameters if the algorithm was fine-tuned.

  • Reference standards should be as objective as possible; otherwise, independent evaluations should be obtained from several experts, together with an assessment of inter-reader variability.

  • Hospitals should appoint a stakeholder committee within their staff to guide and monitor the standardization strategy, through a QA/QC process.

  • All hospitals should adopt the same standards and guidelines to ensure interoperability.

  • Radiomics and AI systems should include interpretable explanations in human-understandable terms, similar to medical standards, on how and why they perform predictions or decisions to assist physicians.

  • Datasets, along with their metadata and code (if any), should be made publicly available to allow reusability and reproducibility.
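As a sketch of the first three guidelines, the following hypothetical per-series metadata record bundles acquisition, demographic and segmentation-provenance information in a machine-readable form; all field names and values are illustrative assumptions, not a proposed standard.

```python
import json
from dataclasses import dataclass, asdict
from typing import Optional

@dataclass
class SeriesMetadataRecord:
    """Hypothetical minimal metadata record accompanying each image series."""
    manufacturer: str
    model_name: str
    modality: str
    slice_thickness_mm: float
    convolution_kernel: Optional[str]
    patient_age_years: Optional[int]
    patient_sex: Optional[str]
    segmentation_method: str                       # "manual", "semiautomatic" or "automatic"
    segmentation_parameters: Optional[dict] = None  # internal parameters, if fine-tuned

record = SeriesMetadataRecord(
    manufacturer="VendorA", model_name="ScannerX", modality="CT",
    slice_thickness_mm=1.0, convolution_kernel="B30f",
    patient_age_years=67, patient_sex="F",
    segmentation_method="semiautomatic",
    segmentation_parameters={"seed_threshold_hu": -400},
)
print(json.dumps(asdict(record), indent=2))
```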

Standardization of computational statistics for radiomics-based systems should consider data balancing, a patient population of sufficient size and diversity to prevent potential biases, interpretability, biological validation (relation of the radiomic signature to cell morphology, density, distribution pattern, etc.5), generalization and the suitability of performance metrics to the use case, among other aspects. Ultimately, it is critical to continuously monitor the performance of radiomics systems to ensure that it does not degrade over time, the so-called data drift56, as clinical practices, protocols and patient demographics may change, with a corresponding impact on performance.
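As a minimal sketch of such monitoring, the distribution of each radiomic feature observed at deployment time can be compared against the training-time reference, for instance with a two-sample Kolmogorov-Smirnov test; the significance threshold below is an illustrative choice.

```python
import numpy as np
from scipy.stats import ks_2samp

def detect_feature_drift(reference: np.ndarray, current: np.ndarray, alpha: float = 0.01) -> bool:
    """Flag drift in a single radiomic feature by comparing its deployment-time
    distribution against the training-time reference with a two-sample KS test."""
    statistic, p_value = ks_2samp(reference, current)
    return p_value < alpha
```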

Potential and scope of radiomics

The potential of radiomics is currently hindered by the absence of standardization in the medical imaging workflow. Two factors hamper standardization. On the one hand, there was no homogeneity in the mathematical definition of radiomic features. This point has already been addressed by the IBSI, whose definitions should be adopted by all institutions. On the other hand, radiological images, despite being based on physical metrics, capture the same phenomenon (disease) differently depending on the machine and the patient. Although this issue cannot be totally solved, it can be alleviated by homogenizing the machines to the same standard, which would be achieved by configuring the same acquisition parameters according to standard protocols, as previously explained. In addition, the establishment of standard protocols would also help to reduce the radiation dose53,54. The recommendations proposed here require the engagement of all agents within hospitals across the world, which may seem unrealistic given the large number of entities that would have to get involved. Ultimately, we argue that the progressive adoption of these guidelines, under the auspices of radiological societies, will encourage new institutions to adhere to them, and, thus, radiomic signatures will progressively start the transition from research to clinical practice.

Apart from the standardization requirements, the translation of radiomics analyses to clinical practice should be relatively effortless and inexpensive. Firstly, radiomics research is usually based on studies that are routinely performed on patients and does not require additional diagnostic techniques. Secondly, radiomics studies do not need expensive or complex equipment, since the biomarkers can be extracted with the aid of a conventional computer with a Graphics Processing Unit (GPU), and the validation of radiomic signatures can be performed in a distributed manner to preserve patient privacy, following federated learning approaches57. To the best of our knowledge, one of the first studies to assess the economic impact of AI as an assistive tool was58, who conducted a cost-minimization analysis in diabetic retinopathy screening to evaluate the potential savings of two deep learning approaches compared with current human assessment, concluding that the semi-automated screening model was the least expensive. For this reason, the field of radiomics in medical imaging has the potential to become a powerful tool in providing universal, high-quality and affordable health care to all, including those in low- and middle-income countries (LMICs) where resources and expertise are limited59, with the caveat that biases must be carefully considered in this deployment60. On a final note, once standardized protocols are established, technicians will be able to focus more effectively on patient care and image quality52.

Paving the way for future medicine: conclusions

The safe adoption of radiomics and computer-aided diagnosis systems poses the standardization of protocols and workflows in medical imaging as a critical requirement. We have presented guidelines to standardize the workflows in medical imaging, with references to the different levels at which homogenization is required and the hospital personnel involved in each phase. The clinical deployment of radiomics will promote more adapted and personalized treatments for the patient, which will ultimately translate into a more efficient management and distribution of the available resources, likely resulting in cost reductions for health systems. Radiomics-based systems have shown potential to analyze patient data and predict future needs, which will allow healthcare providers to plan and allocate resources more efficiently. For this reason, it is necessary to standardize medical imaging workflows as soon as possible, to enable the progressive clinical implementation of radiomics and machine learning tools, and to bring precision medicine to the patient.