Privacy-preserving distributed learning of radiomics to predict overall survival and HPV status in head and neck cancer

Bogowicz, Marta; Jochems, Arthur; Deist, Timo M.; Tanadini-Lang, Stephanie; Huang, Shao Hui; Chan, Biu; Waldron, John N.; Bratman, Scott; O’Sullivan, Brian; Riesterer, Oliver; Studer, Gabriela; Unkelbach, Jan; Barakat, Samir; Brakenhoff, Ruud H.; Nauta, Irene; Gazzani, Silvia E.; Calareso, Giuseppina; Scheckenbach, Kathrin; Hoebers, Frank; Wesseling, Frederik W. R.; Keek, Simon; Sanduleanu, Sebastian; Leijenaar, Ralph T. H.; Vergeer, Marije R.; Leemans, C. René; Terhaard, Chris H. J.; van den Brekel, Michiel W. M.; Hamming-Vrieze, Olga; van der Heijden, Martijn A.; Elhalawani, Hesham M.; Fuller, Clifton D.; Guckenberger, Matthias; Lambin, Philippe

doi:10.1038/s41598-020-61297-4

Open access
Published: 11 March 2020

Marta Bogowicz ORCID: orcid.org/0000-0002-4747-5375^1,2^na1,
Arthur Jochems²^na1,
Timo M. Deist²,
Stephanie Tanadini-Lang¹,
Shao Hui Huang ORCID: orcid.org/0000-0002-8072-4388³,
Biu Chan³,
John N. Waldron³,
Scott Bratman ORCID: orcid.org/0000-0001-8610-4908³,
Brian O’Sullivan³,
Oliver Riesterer^1,4,
Gabriela Studer^1,5,
Jan Unkelbach¹,
Samir Barakat²,
Ruud H. Brakenhoff⁶,
Irene Nauta⁶,
Silvia E. Gazzani⁷,
Giuseppina Calareso⁸,
Kathrin Scheckenbach⁹,
Frank Hoebers¹⁰,
Frederik W. R. Wesseling¹⁰,
Simon Keek²,
Sebastian Sanduleanu²,
Ralph T. H. Leijenaar²,
Marije R. Vergeer¹¹,
C. René Leemans⁶,
Chris H. J. Terhaard¹²,
Michiel W. M. van den Brekel¹³,
Olga Hamming-Vrieze¹⁴,
Martijn A. van der Heijden¹³,
Hesham M. Elhalawani ORCID: orcid.org/0000-0001-9848-2623¹⁵,
Clifton D. Fuller ORCID: orcid.org/0000-0002-5264-3994¹⁵,
Matthias Guckenberger¹ &
Philippe Lambin ORCID: orcid.org/0000-0002-9034-0177²

Scientific Reports volume 10, Article number: 4542 (2020)

Abstract

A major challenge in radiomics is assembling data from multiple centers. Sharing data between hospitals is restricted by legal and ethical regulations. Distributed learning is a technique that enables training models on multicenter data without the data leaving the hospitals (“privacy-preserving” distributed learning). This study tested the feasibility of distributed learning of radiomics data for prediction of two-year overall survival and HPV status in head and neck cancer (HNC) patients. Pretreatment CT images were collected from 1174 HNC patients in 6 different cohorts. In total, 981 radiomic features were extracted using the Z-Rad software implementation. Hierarchical clustering was performed to preselect features. Classification was done using logistic regression. In the validation dataset, the receiver operating characteristic (ROC) curves were compared between the models trained in the centralized and distributed manner. No difference in ROC was observed with respect to feature selection. The logistic regression coefficients were identical between the methods (absolute difference <10⁻⁷). In comparison of the full workflow (feature selection and classification), no significant difference in ROC was found between centralized and distributed models for both studied endpoints (DeLong p > 0.05). In conclusion, both feature selection and classification are feasible in a distributed manner using radiomics data, which opens new possibilities for training more reliable radiomics models.

Introduction

In recent years radiomics has been shown to be a promising tool in disease classification and prognostic modeling^1,2,3,4. One of the major challenges in radiomics is assembling a large cohort, which is essential for reliable model training. Training models on small cohorts without validation can result in model overfitting and lack of generalization^5,6. It is difficult to collect a sufficiently large amount of data in a single-institution setting. Single-institution data may also fail to represent variations in patient populations across the world. Moreover, such data may not represent global variations in image acquisition protocols, which further influence quantitative image analysis⁷. On the other hand, sharing data between hospitals is restricted by legal and ethical regulations^8,9. Patients signing an informed consent should have two options: participating in the study to the full extent or participating in the study without external data sharing¹⁰. Additionally, central collection of imaging data requires a large storage infrastructure.

Distributed learning in radiotherapy, introduced in 2013 and pioneered in the euroCAT network, is a promising technique to address these challenges¹¹. This methodology allows a model to be trained on data that do not leave a local repository, for example a hospital. Instead, the model parameters are sent between members of the network and the central server. These model parameters are aggregate values and cannot be reversed or linked back to individual data points. Hence, this approach has also been referred to as “privacy-preserving” distributed learning¹². Results from different members are compared in the central server and the updated results are sent back to the members. This procedure is repeated until an agreement is reached. The feasibility of distributed learning for training prognostic models in healthcare has already been shown for prediction of both normal tissue complications and overall survival following radiotherapy^12,13,14. The prognostic power of the models trained in the distributed fashion was as good as that of the models trained in the centralized manner.

In previously published works, only the process of model fitting and data privacy issues were investigated. However, training a radiomics-based model requires two additional steps: feature normalization and feature selection. Feature normalization can be done under the assumption that each single-hospital dataset is a random sample from a normal distribution (the overall population). Radiomic features are known to exhibit a high degree of correlation and thus dimensionality reduction is a crucial step of the radiomics workflow. Distributed feature selection algorithms for horizontal data partitioning have been investigated^15,16. In horizontal partitioning, the database is split based on rows, so that each smaller database has the same structure. This type of feature selection has not been tested on radiomics data. Therefore, this work aims at developing and testing a distributed learning workflow for model training on radiomics data. We hypothesize that distributed algorithms can be used to efficiently train robust radiomics models, achieving quality comparable with models trained in a centralized manner. We used data from six different head and neck cancer (HNC) cohorts (more than 1000 patients) to compare results from centralized and distributed workflows. The workflows were evaluated on two clinically relevant binary endpoints: tumor human papillomavirus (HPV) status and 2 year overall survival.

Material and methods

Analyzed cohorts

This retrospective analysis was based on 6 cohorts of patients, with a total enrollment of 1174 patients. The analysis was approved by local ethical commissions and was conducted according to their guidelines; for some cohorts the need for informed consent was waived (see details in the Supplement). The survival data were available for 1064 patients from 5 different cohorts. Similarly, HPV status was determined in biopsy analysis in 834 patients from 5 cohorts. Details on the studied cohorts can be found in Table 1 and imaging protocols are described in Table 1S. The HPV status was confirmed by immunohistochemical p16 staining in biopsy specimens. All patients were treated with definitive chemoradiotherapy, except in the VUmc and PMH cohorts, where definitive radiotherapy alone was allowed. The patients underwent contrast-enhanced CT imaging for the purpose of treatment planning, according to the local protocols.

Table 1 Characteristics of the studied cohorts.

Radiomics analysis

Radiomic features were extracted from the primary tumor region. The treatment defined gross tumor volume (GTV) was visually assessed for the presence of artifacts and slices with artifacts were manually removed from the contour. Images were resampled to 3.3 mm cubic voxels using linear interpolation. The Hounsfield unit range was set to (−20, 180) to limit the analysis to soft tissue. In total, 981 features were extracted with the Z-Rad radiomics software implementation¹⁷:

shape (n = 18).
intensity distribution (n = 17).
texture (n = 90): the Gray Level Co-occurrence Matrix (n = 26), the Neighborhood Gray Tone Difference Matrix (n = 4), the Gray Level Run Length Matrix (n = 14), the Gray Level Size Zone Matrix (n = 14), the Gray Level Distance Zone Matrix (n = 16) and the Neighboring Gray Level Dependence Matrix (n = 16).
wavelet transform (n = 856).

Distributed learning platform

The Oncoradiomics distributed learning solution DistriM was used. This software consists of a master script and a site script. The site script is executed at each medical institution, where the data is located, and waits for a learning call from the master script. The master script is run by the researcher and initiates the distributed learning procedure. This script also mediates the transmission of the model coefficients to and from the sites. When model learning is complete, the master script outputs the model coefficients of the learned model. In this experimental setting, all data was centralized and artificially distributed across laptops on a per-center basis. The site script was executed on each laptop. The laptops were located at Maastricht University.

Feature selection

First, a data quality check was performed. Missing values were assessed and features with more than 20% missing values were excluded. Similarly, to avoid outliers, features with a skewed distribution (skewness > 5) were excluded. The exclusion criteria were evaluated in the entire dataset for the centralized learning and per cohort for the distributed learning. In the distributed learning, the union of the features excluded per cohort was taken as the excluded subset.
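The exclusion step can be illustrated with a minimal sketch. It is not the code used in the study: the function names `excluded_features` and `distributed_exclusion` are hypothetical, missing values are assumed to be encoded as NaN, and the skewness criterion is assumed to apply to the absolute sample skewness.

```python
import numpy as np
from scipy.stats import skew

def excluded_features(X, max_missing=0.20, max_skew=5.0):
    """Return the column indices that fail the missing-value or skewness check."""
    X = np.asarray(X, dtype=float)
    excluded = set()
    for j in range(X.shape[1]):
        col = X[:, j]
        missing_fraction = np.mean(np.isnan(col))
        values = col[~np.isnan(col)]
        # Assumption: the criterion is applied to the absolute skewness.
        too_skewed = values.size > 2 and abs(skew(values)) > max_skew
        if missing_fraction > max_missing or too_skewed:
            excluded.add(j)
    return excluded

def distributed_exclusion(cohorts):
    """Union of the per-cohort exclusions, as described for distributed learning."""
    excluded = set()
    for X in cohorts:
        excluded |= excluded_features(X)
    return excluded
```

In the centralized setting the same `excluded_features` call would be applied once to the stacked data, which is why the two workflows can exclude slightly different feature sets.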

Next, inter-feature correlations were assessed (Fig. 1). Features were scaled with the z-score. In distributed learning, the global mean and standard deviation per feature were obtained by sharing local statistics on the mean, the dispersion from the mean and the number of patients in the cohort. The global correlations were estimated as the weighted average of Fisher-transformed local correlation coefficients. Average linkage hierarchical clustering (Python SciPy library v. 1.3.0) was performed on the set of inter-feature correlation coefficients with a 0.6 cutoff, separately for the centralized and distributed learning.
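A minimal sketch of how these aggregate statistics could be combined is given below. Several details are assumptions rather than facts from the study: the dispersion is taken to be the sum of squared deviations from the local mean, the Fisher-transformed correlations are weighted by cohort size, and the 0.6 cutoff is interpreted as cutting the dendrogram at a distance of 1 − 0.6 with distance defined as 1 − |r|. All function names are hypothetical.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage
from scipy.spatial.distance import squareform

def local_summary(X):
    """Aggregate statistics a site could share instead of patient-level data."""
    X = np.asarray(X, dtype=float)
    mean = X.mean(axis=0)
    return {"n": X.shape[0], "mean": mean,
            "ss": ((X - mean) ** 2).sum(axis=0),      # dispersion from the local mean
            "corr": np.corrcoef(X, rowvar=False)}     # local feature correlations

def pooled_mean_sd(summaries):
    """Global mean and sample standard deviation per feature from local summaries."""
    n = np.array([s["n"] for s in summaries], dtype=float)
    means = np.stack([s["mean"] for s in summaries])
    ss = np.stack([s["ss"] for s in summaries])
    total = n.sum()
    g_mean = (n[:, None] * means).sum(axis=0) / total
    # Within-site dispersion plus the shift of each site mean from the global mean.
    g_ss = (ss + n[:, None] * (means - g_mean) ** 2).sum(axis=0)
    return g_mean, np.sqrt(g_ss / (total - 1))

def pooled_correlation(summaries):
    """Cohort-size-weighted average of Fisher-transformed local correlations."""
    n = np.array([s["n"] for s in summaries], dtype=float)
    z = np.stack([np.arctanh(np.clip(s["corr"], -0.999999, 0.999999))
                  for s in summaries])
    r = np.tanh(np.tensordot(n, z, axes=1) / n.sum())
    np.fill_diagonal(r, 1.0)
    return r

def cluster_features(corr, cutoff=0.6):
    """Average-linkage clustering on 1 - |r|, cut at distance 1 - cutoff (assumption)."""
    dist = 1.0 - np.abs(corr)
    np.fill_diagonal(dist, 0.0)
    tree = linkage(squareform(dist, checks=False), method="average")
    return fcluster(tree, t=1.0 - cutoff, criterion="distance")
```

The pooled mean and standard deviation reproduce the centralized values exactly, whereas the pooled correlation is only an estimate, which is one reason centralized and distributed clustering can differ.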

Finally, to select a representative feature per cluster, a univariate logistic regression was performed on the entire dataset (centralized learning) as well as on the separate cohorts (distributed learning). In the centralized learning, per cluster, the feature with the highest area under the receiver operating characteristic curve (AUC) was chosen if the false discovery rate was <0.05. In the distributed learning, per cohort and per cluster, the feature with the highest AUC was chosen to represent each cluster. In the central server the cohort-specific sets were compared and weighted by the number of patients in the cohort. The final distributed feature selection comprised features with at least an 80% selection rate, based on cohort sizes as weights.
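The weighted vote on the central server can be sketched as follows. This is an illustration under the assumption that each cohort reports the set of features it selected and that a feature's selection rate is the summed size of the cohorts selecting it divided by the total number of patients; the function name and feature labels are hypothetical.

```python
def select_by_weighted_vote(cohort_choices, cohort_sizes, min_rate=0.8):
    """Keep features whose cohort-size-weighted selection rate reaches min_rate."""
    total = float(sum(cohort_sizes))
    votes = {}
    for chosen, size in zip(cohort_choices, cohort_sizes):
        for feature in set(chosen):
            # Each cohort votes with a weight equal to its number of patients.
            votes[feature] = votes.get(feature, 0.0) + size
    return sorted(f for f, weight in votes.items() if weight / total >= min_rate)
```

For example, with three cohorts of 100, 50 and 350 patients, a feature chosen only by the first and third cohorts has a selection rate of 450/500 = 0.9 and is retained.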

Classification

A multivariate logistic regression model was trained for both outcomes, HPV and 2 year overall survival (2yOS). In the centralized learning, the model was fitted with the glm (generalized linear models) function in R (version 3.2.3). In the distributed learning, the grid binary logistic regression (GLORE) method was used to fit the coefficients¹⁸. It is based on aggregating the intermediate results of the Newton-Raphson iterations across sites. It has previously been shown to estimate the coefficients well in horizontally partitioned datasets¹⁸.
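The idea behind this type of fitting can be sketched in a few lines. The code below is a simplified illustration of Newton-Raphson logistic regression with per-site aggregation, not the GLORE or DistriM implementation: in each iteration every site returns only its gradient and Hessian contribution for the current coefficients, and the server sums them before updating. The function names are hypothetical and the design matrices are assumed to already contain an intercept column.

```python
import numpy as np

def site_contribution(X, y, beta):
    """Gradient and Hessian of the log-likelihood computed locally at one site."""
    p = 1.0 / (1.0 + np.exp(-X @ beta))
    grad = X.T @ (y - p)
    hess = X.T @ (X * (p * (1.0 - p))[:, None])
    return grad, hess

def fit_distributed(sites, n_iter=25, tol=1e-10):
    """Server loop: sum the site contributions and take a Newton-Raphson step."""
    beta = np.zeros(sites[0][0].shape[1])
    for _ in range(n_iter):
        parts = [site_contribution(X, y, beta) for X, y in sites]
        grad = sum(g for g, _ in parts)
        hess = sum(h for _, h in parts)
        step = np.linalg.solve(hess, grad)
        beta = beta + step
        if np.max(np.abs(step)) < tol:   # stop once the coefficients have settled
            break
    return beta
```

Because the log-likelihood is a sum over patients, the summed contributions equal those of the pooled data, so the distributed and centralized fits agree up to numerical precision.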

Comparison of the models

Five models were created to predict HPV status and another five to predict 2yOS. For each of the models, four cohorts were used for training and one was left out for external validation (patients with unknown status were excluded from modeling of the respective outcome). The prognostic power of a model was evaluated in the validation cohort. Models were trained in a distributed and centralized manner for comparison.
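The leave-one-cohort-out design described above can be written as a short generator. This is only a sketch; the cohort labels used in the usage note are placeholders, not the cohort names of the study.

```python
def leave_one_cohort_out(cohorts):
    """Yield (training cohorts, validation cohort) pairs, one per held-out cohort."""
    for held_out in cohorts:
        training = [c for c in cohorts if c != held_out]
        yield training, held_out
```

With five cohorts labeled, for instance, "A" to "E", the generator yields five splits, each with four training cohorts and one external validation cohort.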

The comparison was divided into three parts. First, the feature selection was evaluated. The overlap in class assignments in hierarchical clustering was computed. Features were divided into subgroups based on the centralized clustering and then, on a cluster-by-cluster basis, the largest distributed subcluster was reported. The sum of features in the distributed subclusters divided by the total number of features was defined as the cluster overlap. To quantify the impact of feature selection on the prognostic power of the model, the glm function was used to fit the model based on the centralized and distributed feature selection. The areas under the receiver operating characteristic curve (AUC) of the resulting models were compared with a DeLong test (significance level p < 0.05). Additionally, the overlap between the selected features was reported. In the second step, model fitting was compared. The models based on distributed feature selection were created with glm and GLORE. The quality of fit (log-likelihood) was reported. The performance of the models was evaluated with the DeLong test. Finally, the full process (feature selection and classification) was compared. ROC curves were evaluated and model calibration in the validation cohort was checked. Calibration was estimated by fitting a logistic regression model in the validation cohort with one variable: the predictions based on the model from the training cohort. The model was considered well calibrated if the obtained coefficient was not significantly different from 1. The calibration on a per-feature basis was not analyzed.
The patients were split into two groups (HPV+/−, and OS risk groups) based on the median prediction in the training cohort, and group assignments between centralized and distributed models were compared (classification discrepancy = number of patients assigned to different classes / total number of patients in the validation cohort). Additionally, for the 2yOS model, the Kaplan-Meier curves were plotted, using a median split to divide patients into risk groups.
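The two agreement measures defined above, cluster overlap and classification discrepancy, can be sketched as follows. The function names are hypothetical, cluster assignments are assumed to be given as one label per feature, and the thresholds are assumed to be the median predictions obtained in the respective training cohorts.

```python
from collections import Counter

import numpy as np

def cluster_overlap(central, distributed):
    """Fraction of features that stay together in the largest distributed subcluster."""
    central = np.asarray(central)
    distributed = np.asarray(distributed)
    matched = 0
    for c in np.unique(central):
        members = distributed[central == c].tolist()
        # Size of the largest group of these features sharing one distributed label.
        matched += Counter(members).most_common(1)[0][1]
    return matched / central.size

def classification_discrepancy(pred_central, pred_distributed, thr_central, thr_distributed):
    """Fraction of validation patients assigned to different groups by the two models."""
    high_central = np.asarray(pred_central) > thr_central
    high_distributed = np.asarray(pred_distributed) > thr_distributed
    return float(np.mean(high_central != high_distributed))
```

Note that two models can have nearly identical AUCs and still show a non-zero discrepancy, because the discrepancy depends on where individual predictions fall relative to each model's own threshold.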

Results

Centralized vs distributed feature selection

Close to 20% of radiomic features were excluded in the data cleaning process due to missing values or a highly skewed distribution (details presented in Supplementary Tables S7 and S8), irrespective of the modeling endpoint and of centralized or distributed cleaning. The remaining features were independently clustered using centralized and distributed correlation coefficients. Here we present the respective values as a range, depending on the results from different training/validation cohorts. The centralized clustering resulted in a slightly higher number of clusters: 97–103 vs 90–95 for HPV and 105–113 vs 94–98 for 2yOS. Depending on the combination of studied cohorts, 94–97% of the features were clustered in the same groups in the centralized and distributed clustering.

For tumor HPV status prediction, 26–30 and 12–28 features were selected in the centralized and distributed way, respectively. The overlap of selected features between the methods was around 50%. Less variability in the number of selected features was observed in the case of 2yOS endpoint, with 10–21 and 7–23 features in the centralized and distributed selection, respectively. However, the overlap was lower, on average 40%. Detailed comparison is presented in Supplement Fig. 1S and Tables 2.1S–3.5S.

Figure 2 presents the summary of performance (AUC) of models trained on the feature subsets selected in the centralized and distributed workflows, for both HPV (a) and 2yOS (b). The model coefficients were trained with glm in both cases. No significant difference in AUC was observed (DeLong p-value > 0.05), indicating that a lower number of radiomic features in the distributed selection does not decrease model performance.

The 2yOS model validation failed in the DESIGN cohort. However, this is the only cohort with solely HPV negative patients. To further check the influence of HPV on our 2yOS models, we validated the 2yOS models in the oropharyngeal carcinoma cohorts for subgroups of HPV+ and HPV−. They showed good prognostic value in both subgroups, with AUC in a range of 0.61 to 1 (Supplementary Table 4S).

Centralized vs distributed logistic regression

The logistic regression fits were compared based on the subset of features selected in the distributed manner. The glm and GLORE algorithms reached identical log-likelihood for all combinations of training cohorts and both endpoints (Supplementary Tables 5S and 6S). The sum of absolute differences in the coefficients between the centralized and distributed solution was less than 10⁻⁷. Figure 3 presents an example of nomograms obtained using the centralized and distributed logistic regression for HPV prediction.

Centralized vs distributed models

In the final comparison, results from both centralized and distributed workflows were evaluated in the validation cohorts. The HPV prediction models performed equally well in terms of discriminatory power in the centralized and distributed learning (Fig. 4, Table 5S). However, an 18–28% classification discrepancy was observed between the centralized and distributed models when the median prediction in the training dataset was used as the threshold. Likewise, no significant difference in the discriminatory power was observed for models predicting 2yOS. Additionally, both centralized and distributed risk-group split thresholds were significant for all validation cohorts, except the DESIGN cohort (Figs. 5 and 2S, Table 6S). The resulting Kaplan-Meier curves followed the same trend. Similarly to the HPV models, a classification discrepancy of 13–21% was observed between the centralized and distributed models. In total, 12 out of 20 models (HPV and OS) would have required recalibration in the validation cohort (logistic regression coefficient significantly different from 1); however, this was not dependent on the training workflow (Table 6S). Recalibration was not performed as part of this study and the results of the split into risk groups for the 2yOS model were based on the original predictions.

Discussion

This study aimed at designing and testing a distributed learning workflow using radiomics data. CT images from more than 1000 HNC patients were analyzed with HPV status and 2 year overall survival prediction as endpoints. A combination of hierarchical clustering and univariate logistic regression was used for feature selection, and multivariate logistic regression was used for final classification. The resulting models obtained with distributed learning were compared to the centrally trained models. Models for both endpoints showed comparable results in the centralized and distributed training, at the level of feature selection, model fitting and the full workflow comparison.

Other studies have investigated horizontal data partitioning and distributed feature selection mostly with a focus on higher computational efficiency^15,16,19. Here we present a simple algorithm based on the assumption that the distribution of radiomic feature values is similar in all studied cohorts. Although this assumption may not always be correct due to different image acquisition protocols^7,20,21, we observed a good agreement between centralized and distributed clustering. Of note, the selection of features using majority voting among the cohorts may decrease the risk of selecting cohort-specific or scanner-specific biomarkers. The overlap between the final feature selections (distributed vs centralized) was not high, but this could be caused by strong inter-feature correlations or redundancy of the selected features, as no stepwise feature selection was included in the multivariate model training. No difference in model performance was observed depending on the manner of feature selection.

Several previous studies have investigated distributed classification algorithms on healthcare data, presenting satisfactory results in terms of model accuracy^12,13,14. The GLORE algorithm used in this study provided excellent results with fast convergence (less than 10 iterations). In the comparison of the entire workflows, the difference in the AUCs between the centralized and distributed models was smaller than the dispersion of AUCs resulting from different combinations of training data. In the HPV models the largest AUC difference between centralized and distributed learning was 0.07, whereas the observed range of AUCs depending on the training data was 0.69–0.82. We observed an 18–28% classification discrepancy between our centralized and distributed models. The median threshold was used to classify patients; other splits should be evaluated in the future.

CT radiomics has previously been evaluated for prediction of overall survival and HPV status^22,23,24,25. The performance of the distributed HPV models (AUC 0.73–0.80) is comparable with previously published results (AUC 0.70–0.80). In this study, the HPV prediction was performed for all patients with available data and was not limited to oropharyngeal cancer, although that restriction would be more relevant in clinical practice. In the context of overall survival, Parmar et al. reported an AUC of 0.61–0.67 depending on the classifier used²⁵. This study was able to achieve similar model performance in distributed learning even using a fixed classification method (AUC 0.64–0.77). One exception was observed for the model trained on a mixed cohort of head and neck cancer patients and validated on the HPV− cohort (DESIGN cohort), for both centralized and distributed learning. Recent literature provides extensive evidence on superior survival rates of HPV positive oropharyngeal cancer patients^26,27,28. We have shown in other combinations of training data that our overall survival models were prognostic in both HPV+ and HPV− oropharyngeal cancer (Table 4S). This would indicate that the models were not driven by HPV status and that radiomics can be used as a biomarker for both disease subtypes. However, to fully exploit the potential of radiomics, matched data should be used for model training, i.e. only HPV− patients. Access to large databases does not replace careful data curation.

Currently, implementation of distributed learning in healthcare is still at an early stage. There is a need to build trust between hospitals, IT departments and ethical committees to allow for integration of a distributed learning network into the clinical picture archiving and communication systems and reporting systems. From the technological perspective, integration of distributed learning is feasible: two commercial solutions supporting distributed learning infrastructure are available, DistriM from Oncoradiomics and Varian Learning Portal from Varian, as well as open source solutions²⁹. In the DistriM solution, which is compatible with the algorithms developed for this study, data are secured by storing them on computer systems within the firewalls of the hospital. Only model coefficients are transmitted, from which individual patient characteristics cannot be derived.

Our study is the first attempt to combine radiomics data and distributed learning. For the purpose of comparison, all data were collected at the same location, and data quality assurance as well as radiomic feature extraction were performed by one person. This experiment was a proof of concept that radiomics-based models can be trained in a distributed fashion. All the algorithms developed in this work are compatible with the DistriM framework. However, due to the experiment design we were not able to evaluate important aspects of a real-life distributed learning scenario, such as speed, security and network issues. Moreover, in the multicenter setting, simple data quality checks should be implemented, for example reporting of the maximum and minimum intensity in the region of interest to avoid major contour shifts. The standardization of radiomic feature extraction is currently ongoing. If future studies decide to use mixed software implementations (a separate implementation in each of the learning sites), an ontology for radiomics has to be defined and the implementations have to be benchmarked, for example in the Imaging Biomarker Standardization Initiative^30,31,32,33. Additionally, multicenter data analysis requires efforts in establishing post-processing steps for data standardization, for example contrast-enhancement normalization³⁴ or robustness studies on contouring variability^7,17,20,33,35. Our models showed good discrimination, but in 12/20 cases would require recalibration. This is a challenge in the transfer of the trained models to a new institution or scanner. For quality assurance, such a model should first be validated on a sample of data in the new institution/scanner (and recalibrated if needed) and only then used in a prospective setting. Despite feature preselection, the final models consisted of 7–28 features, which might have resulted in the inclusion of redundant features in the multivariate model.
The next step in the development of the distributed radiomics workflow could be the integration of stepwise regression. Additionally, in the future, easy access to radiomics data via distributed learning will allow for regular updates (e.g. yearly) of the studied signatures to further test whether they are independent of the study time and whether they are applicable to new treatment modalities³⁶. Finally, we would like to apply distributed learning to various clinically relevant outcomes, such as treatment failure, early death and hypoxia status^37,38,39, and compare distributed learning radiomics to results from distributed deep learning⁴⁰.

In conclusion, this study describes the first workflow for radiomics analysis in a distributed setting. Centralized and distributed learning results for prediction of HPV status and 2 year overall survival in HNSCC patients treated with radical chemoradiotherapy or radiotherapy were similar. This methodology will allow for easier access to radiomics data from large cohorts and thus development of more robust and reliable models. This approach will also facilitate regular updates of radiomics signatures when new treatment or imaging modalities are implemented.

References

Lambin, P. et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 14, 749–762 (2017).
Lee, G. et al. Radiomics and its emerging role in lung cancer research, imaging biomarkers and clinical management: State of the art. Eur. J. Radiol. 86, 297–307 (2017).
Lambin, P. et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur. J. Cancer 48, 441–446 (2012).
Morin, O. et al. A deep look into the future of quantitative imaging in oncology: a statement of working principles and proposal for change. Int. J. Radiat. Oncol. Biol. Phys. (2018).
Alyass, A., Turcotte, M. & Meyre, D. From big data analysis to personalized medicine for all: challenges and opportunities. BMC Med. Genomics 8, 33 (2015).
Collins, G. S. et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMC Med. 13, 1 (2015).
Larue, R. T. et al. Influence of gray level discretization on radiomic feature stability for different CT scanners, tube currents and slice thicknesses: a comprehensive phantom study. Acta Oncologica, 1–10 (2017).
Knoppers, B. M. & Thorogood, A. M. Ethics and big data in health. Curr. Opin. Syst. Biol. 4, 53–57 (2017).
Hollis, K. F. To Share or Not to Share: Ethical Acquisition and Use of Medical Data. AMIA Summits Transl. Sci. Proc. 2016, 420 (2016).
Bauchner, H., Golub, R. M. & Fontanarosa, P. B. Data sharing: an ethical and scientific imperative. Jama 315, 1238–1240 (2016).
Lambin, P. et al. ‘Rapid Learning health care in oncology’ – an approach towards decision support systems enabling customised radiotherapy. Radiother. Oncol. 109, 159–164 (2013).
Deist, T. M. et al. Infrastructure and distributed learning methodology for privacy-preserving multi-centric rapid learning health care: euroCAT. Clin. Transl. Radiat. Oncol. 4, 24–31 (2017).
Jochems, A. et al. Developing and validating a survival prediction model for NSCLC patients through distributed learning across 3 countries. Int. J. Radiat. Oncol. Biol. Phys. 99, 344–352 (2017).
Jochems, A. et al. Distributed learning: developing a predictive model based on data from multiple hospitals without data leaving the hospital–a real life proof of concept. Radiother. Oncol. 121, 459–467 (2016).
Bolon-Canedo, V., Sanchez-Marono, N. & Alonso-Betanzos, A. A distributed wrapper approach for feature selection. In ESANN. Citeseer. (2013).
Bolón-Canedo, V., Sánchez-Maroño, N. & Alonso-Betanzos, A. A distributed feature selection approach based on a complexity measure. in International Work-Conference on Artificial Neural Networks. Springer (2015).
Pavic, M. et al. Influence of inter-observer delineation variability on radiomics stability in different tumor sites. Acta Oncologica, 1–5 (2018).
Wu, Y. et al. Grid Binary LOgistic REgression (GLORE): building shared models without sharing data. J. Am. Med. Inform. Assoc. 19, 758–764 (2012).
Bolón-Canedo, V., Sánchez-Maroño, N. & Alonso-Betanzos, A. Recent advances and emerging challenges of feature selection in the context of big data. Knowl. Syst. 86, 33–45 (2015).
Larue, R. T. et al. Quantitative radiomics studies for tissue characterization: a review of technology and methodological procedures. Br. J. radiology 90, 20160665 (2017).
Mackin, D. et al. Measuring CT scanner variability of radiomics features. Investig. Radiol. 50, 757 (2015).
Bogowicz, M. et al. CT radiomics predicts HPV status and local tumor control after definitive radiochemotherapy in head and neck squamous cell carcinoma. Int. J. Radiat. Oncol. Biol. Phys. 99, 921–928 (2017).
Leijenaar, R. T. et al. Development and validation of a radiomic signature to predict HPV (p16) status from standard CT imaging: a multicenter study. Br. J. Radiol. 91, 20170498 (2018).
Aerts, H.J. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 5 (2014).
Parmar, C. et al. Radiomic machine-learning classifiers for prognostic biomarkers of head and neck cancer. Frontiers Oncol., 5 (2015).
Lassen, P. et al. Impact of HPV-associated p16-expression on radiotherapy outcome in advanced oropharynx and non-oropharynx cancer. Radiother. Oncol. 113, 310–316 (2014).
Article Google Scholar
Sørensen, B. S. et al. Radiosensitivity and effect of hypoxia in HPV positive head and neck cancer cells. Radiother. Oncol. 108, 500–505 (2013).
Article Google Scholar
O’rorke, M. et al. Human papillomavirus related head and neck cancer survival: a systematic review and meta-analysis. Oral. Oncol. 48, 1191–1201 (2012).
Article Google Scholar
ppDLI software solution, https://distributedlearning.ai/.
Vallières, M. et al. Responsible radiomics research for faster clinical translation. Soc. Nuclear Med. (2018).
Shi, Z. et al. O-RAW: Ontology-guided radiomics analysis workflow. Phys. Medica: Eur. J. Med. Phys. 52, 27–28 (2018).
Article Google Scholar
Zwanenburg, A. et al. The Image Biomarker Standardization Initiative: standardized quantitative radiomics for high throughput image-based phenotyping. Radiology (2020).
Bogowicz, M. et al. Post-radiochemotherapy PET radiomics in head and neck cancer - the influence of radiomics implementation on the reproducibility of local control tumor models. Radiother. Oncol. 125, 385–391 (2017).
Article Google Scholar
He, L. et al. Effects of contrast-enhancement, reconstruction slice thickness and convolution kernel on the diagnostic performance of radiomics signature in solitary pulmonary nodule. Sci. Rep. 6, 34921 (2016).
Article ADS CAS Google Scholar
Yip, S. S. & Aerts, H. J. Applications and limitations of radiomics. Phys. Med. Biol. 61, R150 (2016).
Article ADS CAS Google Scholar
Lambin, P. et al. Modern clinical research: How rapid learning health care and cohort multiple randomised clinical trials complement traditional evidence based medicine. Acta Oncologica 54, 1289–1300 (2015).
Article Google Scholar
Jochems, A. et al. A prediction model for early death in non-small cell lung cancer patients following curative-intent chemoradiotherapy. Acta Oncologica 57, 226–230 (2018).
Article CAS Google Scholar
Even, A. J. et al. Predicting tumor hypoxia in non-small cell lung cancer by combining CT, FDG PET and dynamic contrast-enhanced CT. Acta Oncologica 56, 1591–1596 (2017).
Article Google Scholar
Zindler, J. D. et al. Individualized early death and long-term survival prediction after stereotactic radiosurgery for brain metastases of non-small cell lung cancer: Two externally validated nomograms. Radiother. Oncol. 123, 189–194 (2017).
Article Google Scholar
Chang, K. et al. Distributed deep learning networks among institutions for medical imaging. J. Am. Med. Informat. Associ. (2018).

Download references

Acknowledgements

This project was supported by the Swiss National Science Foundation Sinergia grant (310030_173303) and Scientific Exchange grant (IZSEZ0_180524). The clinical study used as one of the cohorts was supported by a research grant from Merck (Schweiz) AG. This work was also supported by the Interreg grant EURADIOMICS and the Dutch Technology Foundation STW (grant n° 10696 DuCAT and n° P14-19 Radiomics STRaTegy), which is the applied science division of NWO, the Technology Program of the Ministry of Economic Affairs and the Manchester Cancer Research UK major centre grant. The authors also acknowledge financial support from the EU 7th framework program (ARTFORCE - n° 257144, REQUITE - n° 601826), CTMM-TraIT, EUROSTARS (E-DECIDE, DEEPMAM), Kankeronderzoekfonds Limburg from the Health Foundation Limburg, Alpe d’HuZes-KWF (DESIGN), The Dutch Cancer Society, the European Program H2020-2015-17 (ImmunoSABR - n° 733008 and BD2Decide - PHC30-689715), the ERC advanced grant (ERC-ADG-2015, n° 694812 - Hypoximmuno), and SME Phase 2 (EU proposal 673780 – RAIL). Dr. Elhalawani was supported in part by the philanthropic donations from the Family of Paul W. Beach to Dr. G. Brandon Gunn, MD. Drs. Elhalawani and Fuller receive funding and project-relevant salary support from NIH/NCI Head and Neck Specialized Programs of Research Excellence (SPORE) Developmental Research Program Award (P50 CA097007-10). This research is supported by the Andrew Sabin Family Foundation; Dr. Fuller is a Sabin Family Foundation Fellow. Dr. Fuller receives funding and project-relevant salary support from the National Institutes of Health (NIH), including: National Institute for Dental and Craniofacial Research Award (1R01DE025248-01/R56DE025248-01); National Cancer Institute (NCI) Early Phase Clinical Trials in Imaging and Image-Guided Interventions Program (1R01CA218148-01); National Science Foundation (NSF), Division of Mathematical Sciences; NIH Big Data to Knowledge (BD2K) Program of the National Cancer Institute Early Stage Development of Technologies in Biomedical Computing, Informatics, and Big Data Science Award (1R01CA214825-01); NIH/NCI Cancer Center Support Grant (CCSG) Pilot Research Program Award from the UT MD Anderson CCSG Radiation Oncology and Cancer Imaging Program (P30CA016672) and National Institute of Biomedical Imaging and Bioengineering (NIBIB) Research Education Program (R25EB025787). Dr. Fuller has received direct industry grant support and travel funding from Elekta AB. We thank Jessica van Rossum for language editing of this manuscript.

Author information

These authors contributed equally: Marta Bogowicz and Arthur Jochems.

Authors and Affiliations

University Hospital Zurich and University of Zurich, Department of Radiation Oncology, Zurich, Switzerland
Marta Bogowicz, Stephanie Tanadini-Lang, Oliver Riesterer, Gabriela Studer, Jan Unkelbach & Matthias Guckenberger
GROW–School for Oncology and Developmental Biology, Maastricht University Medical Centre, Department of Precision Medicine, The D Lab: Decision Support for Precision Medicine, Maastricht, The Netherlands
Marta Bogowicz, Arthur Jochems, Timo M. Deist, Samir Barakat, Simon Keek, Sebastian Sanduleanu, Ralph T. H. Leijenaar & Philippe Lambin
Princess Margaret Cancer Center, University of Toronto, Department of Radiation Oncology, Toronto, Ontario, Canada
Shao Hui Huang, Biu Chan, John N. Waldron, Scott Bratman & Brian O’Sullivan
Kantonsspital Aarau, Center for Radiation Oncology KSA-KSB, Aarau, Switzerland
Oliver Riesterer
Cantonal Hospital Lucerne, Radiation Oncology, Lucerne, Switzerland
Gabriela Studer
Amsterdam UMC, Vrije Universiteit Amsterdam, Department of Otolaryngology/Head and Neck Surgery, Amsterdam, The Netherlands
Ruud H. Brakenhoff, Irene Nauta & C. René Leemans
Parma University Hospital, Radiology Department, Parma, Italy
Silvia E. Gazzani
IRCCS Fondazione Istituto Nazionale dei Tumori, Radiology Department, Milan, Italy
Giuseppina Calareso
University Hospital Duesseldorf, Heinrich-Heine-University, Department of Otorhinolaryngology & Head/Neck Surgery, Duesseldorf, Germany
Kathrin Scheckenbach
Department of Radiation Oncology (MAASTRO), GROW-School for Oncology and Developmental Biology-Maastricht University Medical Centre, Department of Radiation Oncology, Maastricht, The Netherlands
Frank Hoebers & Frederik W. R. Wesseling
Amsterdam UMC, Vrije Universiteit Amsterdam, Department of Radiation Oncology, Amsterdam, The Netherlands
Marije R. Vergeer
University Medical Center Utrecht, Department of Radiotherapy, Utrecht, The Netherlands
Chris H. J. Terhaard
The Netherlands Cancer Institute, Department of Head and Neck Oncology and Surgery, Amsterdam, The Netherlands
Michiel W. M. van den Brekel & Martijn A. van der Heijden
The Netherlands Cancer Institute, Department of Radiation Oncology, Amsterdam, The Netherlands
Olga Hamming-Vrieze
Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Hesham M. Elhalawani & Clifton D. Fuller

Contributions

M.B. and A.J. – performed data analysis, designed distributed learning workflows, wrote the manuscript; T.M.D., S.Ba. – wrote parts of the distributed algorithms, reviewed the manuscript; S.T.L., M.G. and P.L. – provided expertise, contributed to study design and reviewed the manuscript; S.H.H., B.C., J.N.W., S.Br., B.O., O.R., G.S., J.U., R.H.B., I.N., S.E.G., G.C., K.S., F.H., F.W.R.W., S.K., S.S., R.T.H.L., M.R.V., R.C.L., C.H.J.T., M.W.M.B., O.H.V., M.A.H., H.M.E. and C.D.F. – provided data and expertise, reviewed the manuscript.

Corresponding author

Correspondence to Marta Bogowicz.

Ethics declarations

Competing interests

Dr. Lambin reports grants/sponsored research from Oncoradiomics SA and ptTheragnostic, and advisor (SAB)/presenter fees from Oncoradiomics SA. Dr. Lambin is an inventor of two patents on radiomics and one non-patentable invention (software), licensed to Oncoradiomics SA, and has (minority) shares in the company Oncoradiomics SA. Dr. Jochems has (minority) shares in the company Oncoradiomics SA. Dr. Barakat is an employee of ptTheragnostic and Oncoradiomics SA. Dr. Leijenaar has shares in, and is Chief Technology Officer of, the company Oncoradiomics SA. He is co-inventor of an issued patent with royalties related to radiomics (PCT/NL2014/050728) licensed to Oncoradiomics. Dr. Bogowicz, Dr. Deist, Dr. Tanadini-Lang, Dr. Huang, Dr. Chan, Dr. Waldron, Dr. Bratman, Dr. O’Sullivan, Dr. Riesterer, Dr. Studer, Dr. Unkelbach, Dr. Brakenhoff, Dr. Nauta, Dr. Gazzani, Dr. Calareso, Dr. Scheckenbach, Dr. Hoebers, Dr. Wesseling, Dr. Keek, Dr. Sanduleanu, Dr. Vergeer, Dr. Leemans, Dr. Terhaard, Dr. van den Brekel, Dr. Hamming-Vrieze, Dr. van der Heijden, Dr. Elhalawani, Dr. Fuller and Dr. Guckenberger declare no potential conflict of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary material.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Bogowicz, M., Jochems, A., Deist, T.M. et al. Privacy-preserving distributed learning of radiomics to predict overall survival and HPV status in head and neck cancer. Sci Rep 10, 4542 (2020). https://doi.org/10.1038/s41598-020-61297-4

Received: 03 April 2019
Accepted: 28 January 2020
Published: 11 March 2020
DOI: https://doi.org/10.1038/s41598-020-61297-4
