Introduction

Revoking personal private data is one of the basic human rights, and it has been protected by privacy-preserving regulations such as the General Data Protection Regulation (GDPR)1, the Health Insurance Portability and Accountability Act of 1996 (HIPAA)2, and the California Consumer Privacy Act3 since the late twentieth century. Under these regulations, users may request the deletion of their own data to protect their privacy and to secure their “right to be forgotten”. However, with the development of data science, machine learning (ML) and deep learning (DL) techniques, this basic right is often neglected or violated. For example, patients’ genetic markers have been leaked from ML methods for genetic data processing4,5 without the patients being aware of it. When users realize the existence of such risks, they may request that their data be deleted to protect their privacy6. The aforementioned regulations then require the involved third parties to act immediately: not only must the data previously authorized by individuals be deleted from the hosts’ storage systems, but the associated information must also be removed from DL models trained on those data, because DL models can memorize sensitive information from the training data and thus put individuals’ privacy at risk7,8,9,10,11.

Nowadays, healthcare is one of the most promising areas for the deployment of artificial intelligence (AI) systems, in what is often called intelligent healthcare. ML- and DL-based computer-aided diagnosis (CAD) systems accelerate the diagnosis of various diseases, such as tumor detection12,13, retinal fundus imaging14, and detection and segmentation of COVID-19 lung infections15,16, and can even achieve better results than doctors. However, as more and more patients’ data are collected and used for model training in intelligent healthcare, patients’ privacy is exposed to high risk. Intelligent healthcare is therefore a sector where technology must meet the law, regulations, and privacy principles to ensure that innovation serves the common good17. To comply with those privacy-preserving regulations, methods to revoke personal private data from pre-trained DL models are necessary.

Deleting stored personal data is simple, whereas forgetting individuals’ private information from pre-trained DL models can be difficult, because the contribution of individual data points to the training process cannot be fully measured due to the stochasticity of training18. Moreover, because training is incremental, the model update induced by one sample affects the model’s behavior on the samples that follow, which further complicates unlearning18. Finally, catastrophic unlearning might occur, in which the unlearned model performs worse than a model retrained on the remaining dataset19.

In general, the process of forgetting data from a pre-trained DL model can be divided into two steps. First, the unlearning process (forgetting) is performed on the pre-trained DL model to forget the target data, producing a new DL model. Second, the new model is evaluated (auditing) against different metrics to verify that it has forgotten the target data. These two steps are repeated until the new model passes the evaluation. In other words, there are two commonly acknowledged sub-tasks, auditing and forgetting, which interact like a two-player game. Auditing requires auditors to precisely evaluate whether the data of certain patients were used to train the target DL model. Once auditing confirms that the data of certain patients were used to train the target DL model, forgetting requires removing the learnt information of those patients’ data from the model, which is also called machine unlearning, while auditing can act as the verification of machine unlearning18.

Existing unlearning methods for forgetting can be classified into three major classes: model-agnostic methods, model-intrinsic methods and data-driven methods20. Model-agnostic methods refer to algorithms or frameworks that can be used for different DL models, including differential privacy18,21,22,23, certified removal24,25,26, statistical query learning6, decremental learning27, knowledge adaptation28,29 and parameter sampling30. Model-intrinsic approaches are designed for specific types of models, such as softmax classifiers31, linear models32, tree-based models33 and Bayesian models19. Data-driven approaches focus on the data itself, including data partitioning18, data augmentation34,35,36 and other unlearning strategies based on data influence37. However, most of these are theoretical studies and do not provide open-source code. Besides, most methods have specific application scenarios and acknowledged limitations, and few of them focus on real-world intelligent healthcare. Among the three classes, model-agnostic methods arguably have the strongest application prospects because they can be applied to different models, and SISA, short for Sharded, Isolated, Sliced, and Aggregated training, is the most classic and well-known method in the community38. As a state-of-the-art method, Goel et al.39 proposed Catastrophic Forgetting-k (CF-k) and Exact Unlearning-k (EU-k) to unlearn information from deep learning models. CF-k fine-tunes the last k layers of the original model on the retained dataset Dr (the training data excluding the data to be forgotten) while freezing the other layers, whereas EU-k retrains the last k layers from scratch on Dr while freezing the other layers.
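For readers unfamiliar with this layer-freezing scheme, a minimal PyTorch-style sketch is shown below; the sequential model structure, the way the last k layers are selected, and the reinitialization via reset_parameters are our illustrative assumptions, not the implementation of Goel et al.39.

```python
import torch.nn as nn

def prepare_cf_eu_k(model: nn.Sequential, k: int, exact_unlearning: bool = False):
    """Freeze all but the last k layers of `model`.

    CF-k: fine-tune the last k layers on the retained dataset Dr.
    EU-k: additionally reinitialize the last k layers so they are
    retrained from scratch on Dr.
    """
    layers = list(model.children())
    frozen, trainable = layers[:-k], layers[-k:]

    for layer in frozen:                      # keep the early layers fixed
        for p in layer.parameters():
            p.requires_grad = False

    if exact_unlearning:                      # EU-k: discard learnt weights
        for layer in trainable:
            if hasattr(layer, "reset_parameters"):
                layer.reset_parameters()

    # Only the unfrozen parameters are passed to the optimizer;
    # training then proceeds on Dr as usual.
    return [p for layer in trainable for p in layer.parameters()]
```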

Once forgetting is accomplished, auditing is the necessary next step to verify it. Different metrics have been proposed to audit the membership of a query dataset, including accuracy, completeness6, unlearn time, relearn time, retrain time, layer-wise distance, activation distance, JS-divergence, membership inference40,41, ZRF score28, epistemic uncertainty42 and model inversion attack7. In recent studies of intelligent healthcare, membership inference-based metrics have frequently been used to determine whether any information about the samples to be forgotten is retained in the model41. The membership inference attack (MIA) operates in a black-box setting and estimates the probability that a single data point is a member of the training dataset D. Building on this individual-level MIA, Liu et al.40 and Yangsibo et al.41 addressed a more challenging task: auditing the membership of a set of data points. Ensembled membership auditing (EMA)41 was proposed as the state-of-the-art method to verify whether a query dataset is memorized by a pre-trained DL model and serves as a benchmark metric in machine unlearning. However, due to the black-box nature of DL models, efficient and accurate auditing remains challenging and under-studied. Moreover, researchers have tended to treat auditing and forgetting as separate tasks, ignoring the fact that the two can be coupled to work as a self-consistent mechanism.

Here, we propose a solution that uses auditing to guide the forgetting process in a negative feedback manner. We unify the two tasks by introducing knowledge purification (KP), an approach that selectively transfers only the needed knowledge in order to forget the target information, instead of transferring all information as in knowledge distillation (KD)43. On the basis of KP, we developed a user-friendly and open-source audit-to-forget software (AFS) (Fig. 1), which can easily be used to revoke patients’ private data from DL models in intelligent healthcare (Fig. 2). To demonstrate the generality of AFS, we applied it to four tasks based on four datasets, namely the MNIST, PathMNIST, COVIDx, and ASD datasets, with different data sizes and various architectures of deep learning networks (Fig. 3). Our results demonstrate the usability of AFS and its application potential in real-world intelligent healthcare to enhance privacy protection and data revocation rights. AFS unifies auditing and forgetting and can effectively forget the information of the target query dataset from a pre-trained DL model under the guidance of auditing. With our KP approach, AFS can generate a smaller model, which requires much less time and GPU memory during inference, by training with only a partial training dataset (~50%).

Fig. 1: AFS is a unified method to revoke patients’ private data in intelligent healthcare.
figure 1

The left side of the figure illustrates the high-level iterative flow of AFS, while the right side shows how forgetting and auditing work together. As shown on the left, given a pre-trained DL model and a query dataset (patients’ private data), AFS can audit the query dataset and report the confidence that it has been used to train the target DL model. When a dataset has been used to train the target DL model, AFS can effectively forget the information about that dataset from the model under the guidance of auditing. To achieve this, we propose a method called knowledge purification, shown on the right, which uses the auditing results as a new term in the loss function to forget information. The brain icon was designed by macrovector/Freepik.

Fig. 2: Illustration of knowledge distillation and knowledge purification.
figure 2

Knowledge purification selectively transfers only the needed knowledge during knowledge distillation, so that the target information is forgotten instead of all information being transferred.

Fig. 3: Illustration of four datasets and DL models used to show the versatility of AFS.
figure 3

Two benchmark datasets (MNIST and PathMNIST) and two healthcare datasets (COVIDx and Autism spectrum disorder) were used in this work.

Results

AFS audits private datasets stably and robustly

To evaluate the robustness of auditing by AFS, we used it to audit query datasets of different sizes, varied purity (k percent of the query dataset overlapped with the training dataset) and different calibration dataset sizes (ranging from 100 to 5000) (“Methods” and Fig. 4a). For each sample in the query dataset, AFS calculates three metrics for membership inference: correctness, confidence and negative entropy (“Methods”). As shown in Figs. 4b and S1, all three metrics showed different distributions for QO (query dataset overlapped with the training dataset) and QNO (query dataset disjoint from the training dataset), indicating a dataset-wise divergence of the metrics between samples in the training dataset and samples outside it. Given these differences, we can distinguish between QO and QNO by calculating a p value and using it as the sole audit metric for forgetting (“Methods”). Finally, by integrating the three metrics, AFS reports a p value to evaluate whether a query dataset has been used to train the target DL model; a larger p value indicates a higher probability that the query dataset was used in training.

Fig. 4: Performance of auditing using AFS on the four datasets.
figure 4

a Demonstration of the training dataset, the test dataset, the calibration dataset, the query dataset overlapped with the training dataset (QO) and the query dataset disjoint from the training dataset (QNO). b Distribution of the three metrics for samples in QO and QNO. c The performance of auditing when varying the size of the calibration dataset and the size of the query dataset. d The p value of auditing on QO and QNO of the four datasets. p values were calculated using a two-tailed Student’s t test. e The p value of auditing when varying k of the query dataset of the four datasets. p values were calculated using a two-tailed Student’s t test.

When the size of the query dataset and the calibration dataset varied, AFS could still efficiently distinguish QO and QNO (Fig. 4c, d). Compared to QO, AFS reported a much smaller p value for QNO, indicating weak membership (a small probability that the query dataset was used to train the target DL model), thus allowing users to judge whether the query dataset was used for training. Meanwhile, when the size of the query dataset increased from 1 to 2000, AFS discriminated QO and QNO more confidently, as the divergence of the p values became more significant, and this was not affected by the size of the calibration dataset. To further understand the effect of the purity of the query dataset on auditing, we mixed samples from the training dataset into QNO; the resulting query dataset was labeled QM (partially overlapped with the training dataset). The fraction of data in QM overlapping with the training dataset is denoted by \(k=\frac{\text{number of data overlapped with the training dataset}}{\text{size of QM}}\). As shown in Fig. 4e, the p value reported by AFS decreased as k decreased, meaning that the query dataset was less likely to have been used to train the target DL model. The distinct behavior of the p value on the ASD dataset is due to the small size of that dataset. The case k = 0 is shown in Fig. 4d as QNO, with a query dataset size of 100 for ASD and 2000 for MNIST, PathMNIST, and COVIDx. Similarly, k = 1 is shown in Fig. 4d as QO, with a query dataset size of 100 for ASD and 2000 for MNIST, PathMNIST, and COVIDx. In conclusion, these results indicate the robustness of AFS in determining whether the query data has been used to train the target DL model.

AFS forgets the information of the query dataset, maintains model utility and generates a smaller model

Once auditing confirms the prior knowledge that a dataset has been used to train the target DL model, AFS can be used for forgetting, i.e., removing the information of that dataset from the pre-trained DL model. To comprehensively assess the ability of AFS to remove information while preserving model performance, we compared nine methods: (1) training the teacher model with the complete training dataset (Independent teacher), (2) retraining the student model with the complete training dataset (Independent student), (3) retraining the teacher model with a fraction k ∈ {0.25, 0.5, 0.75} of the complete training dataset excluding the data to be forgotten (Independent teacher with k ∈ {0.25, 0.5, 0.75}), (4) retraining the student model with a fraction k ∈ {0.25, 0.5, 0.75} of the complete training dataset excluding the data to be forgotten (Independent student with k ∈ {0.25, 0.5, 0.75}), (5) retraining the model of the corresponding shard with SISA, (6) fine-tuning the last layers of the model with CF-k, (7) retraining the last layers of the model from scratch with EU-k, (8) AFS, and (9) training the student model with AFS without the guidance of auditing (AFS w/o Audit), as an ablation study of AFS. Both AFS w/o Audit and AFS were also run with k ∈ {0.25, 0.5, 0.75}. For the Independent teacher and Independent student methods trained with the complete training dataset, QF100 and QF1000 were included in the training dataset, whereas these two query datasets were excluded from the training dataset when k ∈ {0.25, 0.5, 0.75}.

Taking the MNIST dataset as an example, for the models trained with each method, in addition to auditing QO and QNO, we audited the membership of two datasets designed to be forgotten (a small query dataset QF100 and a large query dataset QF1000) to assess the ability of the different methods to forget the query dataset. As shown in Table 1, regardless of which method the model was trained with, AFS could effectively distinguish between QO and QNO, and the divergence between the audits of the two query datasets grew as the size of the query dataset increased.

Table 1 Comparison of AFS with other methods on auditing QO and QNO from the MNIST dataset with a varied number of samples in the query dataset

As shown in Table 2, AFS perfectly predicted the membership of QF100 and QF1000 on the models from both the Independent teacher and Independent student methods, as both query datasets were included in the training dataset. Since both query datasets were disjoint from the partial training dataset when k ∈ {0.25, 0.5, 0.75}, auditing the models trained with the Independent teacher and Independent student with k ∈ {0.25, 0.5, 0.75} weakly denied the membership of QF100 (\(P_{\text{QF100},k=0.75}=1.57\mathrm{E}{-}1\), \(P_{\text{QF100},k=0.5}=1.56\mathrm{E}{-}1\), \(P_{\text{QF100},k=0.25}=8.17\mathrm{E}{-}2\) for the Independent teacher and \(P_{\text{QF100},k=0.75}=4.36\mathrm{E}{-}2\), \(P_{\text{QF100},k=0.5}=6.91\mathrm{E}{-}3\), \(P_{\text{QF100},k=0.25}=6.91\mathrm{E}{-}3\) for the Independent student) and of QF1000 (\(P_{\text{QF1000},k=0.75}=1.05\mathrm{E}{-}8\), \(P_{\text{QF1000},k=0.5}=2.71\mathrm{E}{-}11\), \(P_{\text{QF1000},k=0.25}=2.80\mathrm{E}{-}18\) for the Independent teacher and \(P_{\text{QF1000},k=0.75}=5.26\mathrm{E}{-}12\), \(P_{\text{QF1000},k=0.5}=2.34\mathrm{E}{-}15\), \(P_{\text{QF1000},k=0.25}=2.90\mathrm{E}{-}19\) for the Independent student). However, since only a partial training dataset was used when k ∈ {0.25, 0.5, 0.75}, the models retrained with the Independent teacher and Independent student learnt only the information in the partial training dataset and lost the information of the remaining data in the complete training dataset, resulting in a significant drop in model performance compared to either the Independent student or the Independent teacher trained with the complete training dataset. The same conclusion could be drawn with SISA (the training data were divided into 10 shards; QF was removed from the shard in which it was located, and the model of that shard was retrained before re-aggregating the final model). Meanwhile, due to this design, SISA requires storing the parameters of 10 models simultaneously, resulting in greater storage consumption than AFS. Although CF-k and EU-k achieved good accuracy and F1-scores, the audit results showed that fine-tuning or retraining only the last few layers of the original model is not enough to forget the information of the query data.
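To make the sharded-retraining comparison concrete, the core idea behind SISA can be sketched as below. This simplified illustration uses scikit-learn classifiers with majority-vote aggregation and omits SISA’s slicing/checkpointing within shards; the shard count, model class, and aggregation rule are our assumptions.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

class SisaEnsemble:
    """Simplified SISA: one model per data shard, majority-vote aggregation.

    Assumes non-negative integer class labels.
    """

    def __init__(self, n_shards: int = 10):
        self.n_shards = n_shards
        self.shards = []          # list of (X_shard, y_shard)
        self.models = []          # one stored model per shard (storage overhead)

    def fit(self, X, y):
        shard_idx = np.array_split(np.random.permutation(len(X)), self.n_shards)
        self.shards = [(X[i], y[i]) for i in shard_idx]
        self.models = [LogisticRegression(max_iter=1000).fit(Xs, ys)
                       for Xs, ys in self.shards]
        return self

    def forget(self, forget_idx_in_shard, shard_id):
        """Drop the requested samples from one shard and retrain only that shard."""
        Xs, ys = self.shards[shard_id]
        keep = np.setdiff1d(np.arange(len(Xs)), forget_idx_in_shard)
        self.shards[shard_id] = (Xs[keep], ys[keep])
        self.models[shard_id] = LogisticRegression(max_iter=1000).fit(Xs[keep], ys[keep])

    def predict(self, X):
        votes = np.stack([m.predict(X) for m in self.models])   # (n_shards, n_samples)
        return np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)
```

The `self.models` list makes explicit the storage overhead noted above: every shard model must be kept so that a single shard can be retrained on a forgetting request.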

Table 2 Comparison of AFS with other methods on forgetting QF and model performance with the MNIST dataset

To recover the information lost due to the use of partial training samples and to further increase the model performance, AFS can use only a partial training dataset (k ∈ {0.25, 0.5, 0.75}) to transfer the knowledge from the Independent teacher pre-trained with the complete training dataset. As shown in Table 2, the model trained with AFS provided higher accuracy and F1-score than the Independent student trained with the partial training dataset (k ∈ {0.25, 0.5, 0.75}), together with better forgetting performance (much smaller auditing scores on QF100 and QF1000), because AFS used auditing as feedback for forgetting and could forget not only the query samples but also other samples with similar features.

We also applied AFS to the 9-class classification of hematoxylin and eosin-stained histological images from the PathMNIST dataset with a CNN. As shown in Table 3, AFS could still distinguish QO and QNO from the PathMNIST dataset, and the divergence of auditing between QO and QNO was more significant than on the MNIST dataset. When required to forget both query datasets (QF100 and QF1000), the model trained with AFS performed better at forgetting information (\(P_{\text{QF100},k=0.75}=2.25\mathrm{E}{-}5\), \(P_{\text{QF100},k=0.5}=2.87\mathrm{E}{-}6\), \(P_{\text{QF100},k=0.25}=3.32\mathrm{E}{-}7\), \(P_{\text{QF1000},k=0.75}=2.05\mathrm{E}{-}41\), \(P_{\text{QF1000},k=0.5}=4.75\mathrm{E}{-}35\), \(P_{\text{QF1000},k=0.25}=1.84\mathrm{E}{-}56\)) while learning more information from the Independent teacher model trained with the complete training dataset.

Table 3 Comparison of AFS with other methods on auditing QO and QNO from the PathMNIST dataset with a varied number of samples in the query dataset

In summary, AFS can effectively forget the information of the query dataset from the target DL model. Because KP is integrated into AFS, it can generate a smaller DL model that masters the knowledge of the larger teacher model by using only a partial training dataset (k = 0.5 achieved a good balance between forgetting and model performance), without retraining the larger model on the complete training dataset. Compared to retraining the student model, the model trained with AFS showed even better forgetting of the information while maintaining better model performance (accuracy and F1-score), as it learnt the knowledge of the model trained with the complete training dataset. As shown by the ablation study in Tables 2 and 4, compared to AFS w/o Audit, the audit-guided AFS forgot the information more thoroughly at an acceptable cost in model performance (accuracy and F1-score).

Table 4 Comparison of AFS with other methods on forgetting QF and model performance with the PathMNIST dataset

Applying AFS to forget medical images

To show the versatility of AFS, we applied it to the classification of pneumonia versus normal chest X-ray images from the COVIDx dataset with ResNet, a classic task in medical image analysis. As shown in Fig. 5a, on both query datasets (QF100 and QF1000), AFS could effectively forget the information of the query dataset while generating a new model with far fewer parameters, as shown in Fig. 5b. Surprisingly, the model generated by AFS showed even better accuracy than the Independent teacher trained with the complete dataset and the Independent student trained with the partial training dataset. This result not only indicates that AFS can effectively transfer the knowledge from the teacher model to the student model, but also suggests that, in some real-world cases, a student model with a simpler architecture can perform even better than the teacher model with KP in AFS, owing to the reduction of model parameters and the purification of knowledge. The results also showed that the student model trained by AFS achieved better generalization than a model with exactly the same architecture and training data trained only with hard targets.

Fig. 5: Performance of forgetting using AFS on four datasets.
figure 5

a The p value of auditing on a small query dataset and a large query dataset (QF) and the accuracy of models trained with different methods, including Original (Independent teacher trained with the complete training dataset), SISA, CF-k, EU-k, Data Deletion (the Independent student model trained with the partial training dataset and k = 0.5), AFS (w/o Audit) and AFS. p values were calculated using a two-tailed Student’s t test. b The number of parameters of the original large model and the new small model generated by AFS. c Qualitative evaluation of six methods, including Original (Independent teacher trained with the complete training dataset), SISA, CF-k, EU-k, Data Deletion (the Independent student model trained with the partial training dataset and k = 0.5), and AFS, on five dimensions (ability to forget, accuracy, size of dataset needed for training, size of the generated model and training efficiency). A larger value means a stronger ability to forget, higher model accuracy, a smaller dataset needed for training, a smaller generated model, and better training efficiency. d Illustration of how to incorporate AFS into real-world applications. The brain icons in d were designed by macrovector/Freepik.

Applying AFS to forget electronic health records

To further demonstrate the generalizability of AFS in both auditing and forgetting, we applied it to predicting early autism spectrum disorder (ASD) traits of toddlers, a task involving sensitive patient information, such as age, gender and family genetic traits, stored as electronic health records (EHR). As shown in Fig. 5a, and consistent with the results on the other datasets, AFS effectively removed the information of both query datasets from the pre-trained DL model. Since the ASD dataset is quite small, we adopted two smaller query datasets (QF50 and QF100) to be forgotten. Compared to the models trained with the other methods, the model trained with AFS successfully forgot the information of both QF50 (\(P_{\text{QF50},k=0.75}=0.08\), \(P_{\text{QF50},k=0.5}=0.08\), \(P_{\text{QF50},k=0.25}=0.156\)) and QF100 (\(P_{\text{QF100},k=0.75}=0.004\), \(P_{\text{QF100},k=0.5}=0.007\), \(P_{\text{QF100},k=0.25}=0.007\)) without significantly affecting the model utility (\(\mathrm{Acc}_{\mathrm{AFS},k=0.75}=0.98\), \(\mathrm{Acc}_{\mathrm{AFS},k=0.5}=0.98\), \(\mathrm{Acc}_{\mathrm{AFS},k=0.25}=0.98\)).

Discussion

AFS is a unified method of auditing and forgetting that can effectively forget the information of the target query dataset from a pre-trained DL model under the guidance of auditing. We designed AFS as a model-agnostic, open-source method that is applicable to different models. As shown in Fig. 5c, AFS can generate a smaller model, which requires much less time and GPU memory during inference (Tables S1 and S2), by training with a partial training dataset (~50%) using our KP approach. Moreover, AFS forgets the information of the query dataset at the expense of an acceptable reduction in model performance.

Our experiments on four datasets showed that AFS generalizes to datasets of different sizes and modalities, including medical images and EHR. Since deep learning models with different architectures were applied to the four tasks, we further demonstrated the broad applicability of AFS to common deep learning models. In addition, our tasks include both binary and multi-class classification, which suggests that AFS is also applicable to tasks with multiple labels.

In practice, the size of the student model could be manually adjusted when applying AFS to meet specific requirements. In the initial stages, we could make the student model smaller than the teacher model to achieve model compression while forgetting private information. However, instead of continuously generating smaller and smaller student models as knowledge is forgotten, we could maintain the size of the student model once it reaches a certain threshold (e.g., small enough to meet our needs). In doing so, AFS would focus solely on forgetting knowledge and transferring the remaining knowledge, without the need for further model compression.

AFS could be incorporated into the workflow of institutions as shown in Fig. 5d. Patients’ requests for data forgetting may occur in two different phases. The first phase relates to requests made before model compression, where patients request the institution to forget their data from the initial dataset. The second phase pertains to requests made after model compression, where patients seek to have their data forgotten from the compressed model. In both cases, AFS could be employed to forget information from the model. In the first case, AFS facilitates model compression while performing forgetting, which ensures that the compressed model not only meets the requirements for deployment but also respects data privacy by removing sensitive information. In the second case, AFS could be utilized to forget information from the model without involving model compression. This allows the institution to respond to data forget requests while retaining the compressed model’s structure and size. The institution may stop forgetting and retrain a new model in two possible scenarios. The first scenario occurs when the number of forgetting requests becomes too high, leading to a significant degradation in the current model’s performance that exceeds the institution’s predefined budget. In such cases, it may be necessary to stop the forgetting process and initiate the retraining of a new model. The second scenario arises when the institution introduces new data into the system. In this case, instead of continuing to forget specific records from the existing model, the institution can incorporate the retained data along with the new data to train a new model.

With current laws guaranteeing people the right to revoke their own data, AFS can help institutions and companies efficiently iterate their models to forget individual information at the model level. However, the current version of AFS still has shortcomings in production environments, which point to the main directions for future research. First, the models and data tested in this study are still small compared to those in real production environments, so it is unknown whether scaling AFS to current large models (e.g., large language models such as ChatGPT, which we were unable to pursue further) will cause new problems. Second, there are different approaches to auditing, so more auditing metrics could be added to AFS to guide the forgetting process in a future version. Meanwhile, owing to the limitations of auditing, individual-level forgetting remains difficult because we need to compare differences in statistical distributions over a set of data points; this could be a major improvement for a future version of AFS. Although AFS is not applicable to individual-level forgetting due to limitations of the algorithm design, favorable forgetting outcomes can be achieved when operating on a batch of query data. In real-world scenarios, companies with millions of users can easily meet the required size of the query dataset. Furthermore, when hospitals or companies face continuous requests to forget individual patients’ data, they can collect and store these requests and perform the forgetting as a batch once the amount of data reaches the required threshold. Finally, the current version of AFS is not applicable to regression tasks because the design of the auditing metrics does not work for regression and KP is limited to classification tasks. Despite these limitations, we believe that AFS will make a valuable contribution toward better protection of people’s privacy and the right to revoke data with the rapid development of intelligent healthcare.

Methods

The overall framework of AFS

AFS is a unified method to revoke patients’ private data by using auditing to guide the forgetting process in a negative feedback manner (Fig. 1).

To audit the membership of the query dataset, AFS takes a pre-trained DL model and the query dataset as inputs and determines whether the query dataset has been used to train the target DL model. This function was re-implemented based on EMA41, a published MIA-based method for evaluating the membership of a query dataset. Our re-implementation makes auditing quicker and easier to use by introducing parallel computing in each epoch, which significantly accelerates a complete forgetting process and makes it more attractive for institutions to forget larger-scale data (Fig. S2).

To forget the query dataset from a DL model, AFS takes the pre-trained DL model and the query dataset to be forgotten as inputs, where the query dataset has been used to train the DL model. To effectively forget the information of the query dataset from the pre-trained DL model, one idea is to transfer the information of the remaining dataset, excluding the query dataset, from the pre-trained model to a new model. We therefore designed a mechanism called knowledge purification (KP), which uses auditing to guide the forgetting process: the auditing loss is incorporated into the training process so that the information of the query dataset is excluded while the remaining information is transferred (Fig. 2). With KP integrated, AFS generates a new model in which the information of the target dataset has been forgotten under the guidance of auditing.

To provide an applicable solution, we implemented AFS as open-source software that provides a user-friendly entry point allowing users to use both functions with only one command. To demonstrate the generality of AFS, we applied it to four tasks based on four datasets, including the MNIST dataset, the PathMNIST dataset, the COVIDx dataset and the ASD dataset, which have different data sizes (Fig. 3) and various architectures of deep learning networks.

Dataset preparation

We used four public datasets commonly acknowledged in the machine learning and intelligent healthcare fields to demonstrate the versatility of AFS. For the benchmark experiments, we applied AFS to MNIST44 and PathMNIST45 from the MedMNIST46 collection. The MNIST dataset contains 60,000 training images and 10,000 testing images of handwritten digits of size 28 × 28, labeled from 0 to 9. PathMNIST contains 100,000 non-overlapping image patches from hematoxylin and eosin-stained histological images and 7180 image patches from a different clinical center. In total, 9 types of tissue are involved in the PathMNIST dataset: adipose, background, debris, lymphocytes, mucus, smooth muscle, normal colon mucosa, cancer-associated stroma, and COAD epithelium. All images in PathMNIST were 224 × 224 (0.5 µm px⁻¹) and were normalized with the Macenko method47. For the application of AFS in intelligent healthcare, we used the COVIDx48 dataset, which contains 13,975 chest X-ray (CXR) images across 13,870 patient cases, and the autism spectrum disorder (ASD) dataset for toddlers49, which contains 20 features of 1054 samples, used for determining influential autistic traits and improving the classification of ASD cases.

For each dataset, we further sampled partial data as the training dataset, the testing dataset, and the calibration dataset as below:

MNIST

We randomly sampled 10,000 images as the training dataset and 10,000 images as the testing dataset. We also randomly sampled 100, 1000, 2000, and 5000 images that are disjoint with the training dataset as four calibration datasets to illustrate the effect of the calibration dataset of varied sizes on auditing and forgetting.

PathMNIST

We randomly sampled 10,000 images as the training dataset and 5000 images as the testing dataset. We also randomly sampled 1000 images that are disjoint with the training dataset as the calibration dataset.

COVIDx

We randomly sampled 5000 images as the training dataset and 1000 images as the testing dataset. We also randomly sampled 1000 images that are disjoint with the training dataset as the calibration dataset.

ASD

We randomly sampled 500 samples as the training dataset and 100 samples as the testing dataset. We also randomly sampled 100 samples disjoint from the training dataset as the calibration dataset.

For all four datasets, we randomly sampled partial data from the training dataset with fraction k ∈ {0.25, 0.5, 0.75} as the training dataset for knowledge distillation (KD) and AFS.

In addition, we prepared query datasets with different sizes N ∈ {1, 10, 100, 500, 1000, 2000}. A query dataset that completely overlapped with the training dataset is labeled QO, while a query dataset that is completely disjoint from the training dataset is labeled QNO. To further understand the effect of the purity of the query dataset, we also prepared a query dataset called QM, in which a fraction k of the samples overlapped with the training dataset. Finally, the query dataset designed to be forgotten is labeled QF. QO, QNO, QM, and QF were all sampled randomly from the complete dataset, and all reported values are the averages of 5 replicate experiments.
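The sampling of QO, QNO and QM can be sketched as follows; this is a minimal illustration of the index-sampling logic only, and the function and variable names are ours.

```python
import numpy as np

rng = np.random.default_rng(0)

def build_query_sets(train_idx, pool_idx, n_query, k):
    """Sample QO, QNO and QM index sets.

    train_idx : indices of samples used to train the model
    pool_idx  : indices of samples never used for training
    n_query   : query dataset size N
    k         : fraction of QM that overlaps with the training data
    """
    qo = rng.choice(train_idx, size=n_query, replace=False)    # fully overlapping
    qno = rng.choice(pool_idx, size=n_query, replace=False)    # fully disjoint
    n_in = int(round(k * n_query))                             # mixed-purity QM
    qm = np.concatenate([
        rng.choice(train_idx, size=n_in, replace=False),
        rng.choice(pool_idx, size=n_query - n_in, replace=False),
    ])
    return qo, qno, qm
```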

Deep learning models and experiment setup

To demonstrate the generalizability of AFS toward various DL models, we adopted a different architecture for each of the four tasks, drawing on the multilayer perceptron50 (MLP), the convolutional neural network (CNN)51 and ResNet52. For each task there was a large DL model and a small DL model, where the large model is the original pre-trained model and the small model is the new model generated by AFS.

For the MNIST dataset, we used MLP with 671,754 parameters as the teacher model and 155,658 parameters as the student model to achieve the 10-class classification task.

For the PathMNIST dataset, we adopted CNN with 21,285,698 parameters as the teacher model and 11,177,538 parameters as the student network for the 9-class classification task.

For the COVIDx dataset, we took ResNet34 with 21,285,698 parameters as the teacher model and ResNet18 with 11,177,538 parameters as the student network to achieve the binary classification of healthy people and patients.

For the ASD dataset, we used the MLP with 3586 parameters as the teacher model and the MLP with 898 parameters as the student model for the binary classification of autism in toddlers.

During model training, the number of epochs was fixed to 50, the learning rate was set to 1e−5 and the Adam optimizer was used. A workstation with 252 GB RAM, 112 CPU cores and 2 Nvidia V100 GPUs was used for all experiments. AFS was developed with Python 3.7, PyTorch 1.9.1 and CUDA 11.4; a detailed list of dependencies can be found in the Code availability section.

Audit the membership of query dataset

EMA41 is designed as a two-step process. In the first step, the best threshold for each metric is selected on the calibration dataset to maximize \((\mathrm{TPR}(t)+\mathrm{TNR}(t))/2\), as shown in Algorithm 1. Once the thresholds for all metrics are selected, a sample in the query dataset is inferred to be a member if at least one metric exceeds its corresponding threshold. In total, three metrics, namely correctness53, confidence54,55, and entropy56,57, are adopted and further aggregated into a p value, which is the key audit metric in AFS, as proposed in previous work41,58. The correctness, confidence, and entropy metrics are calculated as follows:
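A minimal sketch of this threshold-inference step is given below, assuming the per-sample metric values are already available for calibration samples with known membership; for metrics where membership corresponds to smaller values (e.g., entropy), the comparison direction would be flipped.

```python
import numpy as np

def infer_threshold(metric_members, metric_nonmembers):
    """Pick the threshold t maximizing (TPR(t) + TNR(t)) / 2.

    metric_members    : metric values for calibration samples known to be members
    metric_nonmembers : metric values for calibration samples known to be non-members
    A sample is predicted 'member' when its metric value is >= t.
    """
    candidates = np.unique(np.concatenate([metric_members, metric_nonmembers]))
    best_t, best_score = None, -np.inf
    for t in candidates:
        tpr = np.mean(metric_members >= t)      # members correctly flagged
        tnr = np.mean(metric_nonmembers < t)    # non-members correctly rejected
        score = (tpr + tnr) / 2
        if score > best_score:
            best_t, best_score = t, score
    return best_t
```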

Correctness: the target model is trained to predict correctly on training data and may not generalize well on test data. Thus, we can define the correctness as in Eq. (1):

$$I_{\text{correctness}}\left(F,\left(x,y\right)\right)=\mathbb{1}\left\{\mathrm{argmax}_i\,F\left(x\right)_i=y\right\}$$
(1)

where F is the deep learning model, x is the input data, F(x) is the vector of output logits, y is the label, and \(\mathbb{1}\) is the indicator function.

Confidence: the target model is usually more confident in predictions on training data, but less confident in test data. Thus, we can define confidence as in Eq. (2):

$$I_{\text{confidence}}\left(F,\left(\boldsymbol{x},y\right)\right)=\mathbb{1}\left\{F\left(\boldsymbol{x}\right)_y\ge \tau_y\right\}$$
(2)

where F is the deep learning model, x is the input data, F(x) is the vector of output logits, y is the label, \(F(\boldsymbol{x})_y\) is the output logit for label y, \(\tau_y\) is a threshold on the logit for label y, and \(\mathbb{1}\) is the indicator function.

Entropy: the target model is trained by minimizing the prediction loss over training data and usually has a larger prediction entropy on a test sample. Thus, we can define entropy as in Eq. (3):

$$I_{\text{entropy}}\left(F,\left(x,y\right)\right)=\mathbb{1}\left\{-\sum_{i}F\left(x\right)_i\log\left(F\left(x\right)_i\right)\le \hat{\tau}_y\right\}$$
(3)

where F is the deep learning model, x is the input data, F(x) is the vector of output logits, y is the label, \(F(x)_i\) is the output logit for class i, \(\hat{\tau}_y\) is a threshold, and \(\mathbb{1}\) is the indicator function.
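The three per-sample indicators of Eqs. (1)–(3) could be computed roughly as follows; this sketch takes F(x) to be the softmax output of the model and assumes class-wise threshold tensors `tau_conf` and `tau_ent` inferred as in Algorithm 1.

```python
import torch
import torch.nn.functional as F_

def membership_indicators(model, x, y, tau_conf, tau_ent):
    """Per-sample correctness, confidence and entropy indicators (Eqs. 1-3)."""
    with torch.no_grad():
        probs = F_.softmax(model(x), dim=1)                      # F(x), shape (B, C)

    correctness = probs.argmax(dim=1) == y                       # Eq. (1)
    confidence = probs[torch.arange(len(y)), y] >= tau_conf[y]   # Eq. (2)
    entropy = -(probs * probs.clamp_min(1e-12).log()).sum(dim=1)
    low_entropy = entropy <= tau_ent[y]                          # Eq. (3)

    # A sample is inferred to be a member if at least one indicator fires.
    return (correctness | confidence | low_entropy).float()
```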

Once the membership of all samples in the query dataset has been inferred in the previous step, the query dataset is evaluated as a whole to determine whether it has been used to train the target pre-trained DL model. A two-sample statistical test is applied between the sample-wise membership vector and an all-one vector, and its p value is used as the output of auditing. Given a user-defined threshold α, if p < α, users can conclude that the query dataset was not used to train the target DL model. EMA was re-implemented and integrated into AFS to allow easy and fast auditing.
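The dataset-level aggregation could then be sketched as below, using scipy’s two-sample t test against an all-one vector; the handling of the degenerate zero-variance cases is our convention, not part of EMA.

```python
import numpy as np
from scipy import stats

def audit_query_dataset(membership_vector, alpha=0.05):
    """Return the audit p value and the membership verdict for the whole query set.

    membership_vector : 0/1 per-sample membership inferred in the previous step
    """
    membership_vector = np.asarray(membership_vector, dtype=float)
    if membership_vector.std() == 0:
        # Degenerate cases (all samples flagged the same way): our convention.
        p_value = 1.0 if membership_vector.mean() == 1.0 else 0.0
    else:
        ones = np.ones_like(membership_vector)
        # A small p value means the query set differs from "all members",
        # i.e., it was likely NOT used for training.
        _, p_value = stats.ttest_ind(membership_vector, ones)
    return p_value, p_value >= alpha   # True -> likely used for training
```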

Algorithm 1

Infer thresholds

Audit-guided forgetting of query dataset with AFS

Forgetting aims to remove the remembered information of the query dataset from the target DL model. Similar to knowledge distillation (KD), a teacher-student paradigm is adopted in AFS, but with the additional requirement to selectively forget the information associated with the data to be forgotten. We therefore designed an approach called knowledge purification (KP): the knowledge in the teacher model (the original pre-trained model) is purified by discarding the information related to the data that need to be forgotten, and the purified information is transferred into the student model (the new model). AFS unifies auditing and forgetting into a circular process to effectively enhance unlearning in a negative feedback manner.

As shown in Fig. 1, during each epoch of training, the training data are fed into both the teacher model and the student model, while the data to be forgotten are audited on the student model. Our main goal is to transfer the knowledge from the teacher model to the student model while forcing the student model to reject the information associated with the data to be forgotten. To achieve this, we add the audit loss to the total loss, thus allowing the student model to accept partial knowledge from the teacher model and achieve KP, as shown in Algorithm 2.
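The precise audit term is specified in Algorithm 2. Purely as an illustration of how an audit-derived term could enter the total loss, the sketch below combines a standard distillation objective on the retained data with a penalty that pushes the student toward high-entropy predictions on the data to be forgotten; the weighting factors, the temperature, and the choice of entropy as the audit-derived penalty are our assumptions rather than the published formulation.

```python
import torch
import torch.nn.functional as F_

def kp_step(teacher, student, x_retain, y_retain, x_forget,
            T=4.0, alpha=0.5, beta=0.1):
    """One training step of audit-guided knowledge purification (illustrative)."""
    teacher.eval()
    with torch.no_grad():
        t_logits = teacher(x_retain)

    s_logits = student(x_retain)

    # Standard knowledge-distillation terms on the retained data.
    ce_loss = F_.cross_entropy(s_logits, y_retain)
    kd_loss = F_.kl_div(F_.log_softmax(s_logits / T, dim=1),
                        F_.softmax(t_logits / T, dim=1),
                        reduction="batchmean") * T * T

    # Audit-derived penalty: keep the student uncertain on the forget set,
    # which drives the membership-inference metrics toward "non-member".
    f_probs = F_.softmax(student(x_forget), dim=1)
    forget_entropy = -(f_probs * f_probs.clamp_min(1e-12).log()).sum(dim=1).mean()
    audit_loss = -forget_entropy          # minimizing this maximizes entropy

    return (1 - alpha) * ce_loss + alpha * kd_loss + beta * audit_loss
```

In this form, minimizing the audit term drives the membership-inference metrics on the forget set toward the non-member regime, closing the negative feedback loop between auditing and forgetting.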

Algorithm 2

AFS

Evaluation metrics

Since all four tasks are either multi-class or binary classification tasks, we adopted accuracy and F1-score as the evaluation metrics, as defined in Eqs. (4) and (5):

$$\text{Accuracy}=\frac{\text{TP}+\text{TN}}{\text{TP}+\text{TN}+\text{FP}+\text{FN}}$$
(4)
$$\text{F1-score}=\frac{2\,\text{TP}}{2\,\text{TP}+\text{FP}+\text{FN}}$$
(5)

where TP represents true positives, TN stands for true negatives, FN represents false negatives and FP stands for false positives.

To evaluate the membership of the query dataset, the p value of the two-sample statistical test was used as mentioned previously.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.