DASH-Lab/SAFER


Robust Continual Unlearning against Knowledge Erosion and Forgetting Reversal

This repository accompanies the paper "Robust Continual Unlearning against Knowledge Erosion and Forgetting Reversal."

Abstract

As a means to balance the growth of the AI industry with the need for privacy protection, machine unlearning plays a crucial role in realizing the "right to be forgotten" in artificial intelligence. This technique enables AI systems to remove the influence of specific data while preserving the rest of the learned knowledge. Although it has been actively studied, most existing unlearning methods assume that unlearning is performed only once. In this work, we evaluate existing unlearning algorithms in a more realistic scenario where unlearning is conducted repeatedly, and in this setting, we identify two critical phenomena: (1) Knowledge Erosion, where the accuracy on retain data progressively degrades over unlearning phases, and (2) Forgetting Reversal, where previously forgotten samples become recognizable again in later phases. To address these challenges, we propose SAFER (StAbility-preserving Forgetting with Effective Regularization), a continual unlearning framework that maintains representation stability for retain data while enforcing negative logit margins for forget data. Extensive experiments show that SAFER mitigates not only knowledge erosion but also forgetting reversal, achieving stable performance across multiple unlearning phases.
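
The abstract names two ingredients: representation stability on retain data and negative logit margins on forget data. This README does not reproduce the paper's objective, so the following is only a minimal sketch of what such terms could look like, written in plain Python. The function names, the hinge form of the margin penalty, the squared-distance stability term, and the parameters `margin` and `lam` are all assumptions made for illustration and are not the authors' implementation.

```python
def logit_margin(logits, label):
    """True-class logit minus the largest other-class logit."""
    others = [z for i, z in enumerate(logits) if i != label]
    return logits[label] - max(others)


def forget_margin_loss(logits, label, margin=1.0):
    """Hinge penalty (assumed form): zero once the logit margin is <= -margin."""
    return max(0.0, logit_margin(logits, label) + margin)


def stability_loss(features_new, features_old):
    """Mean squared distance between current and previous-phase features (assumed form)."""
    squared = [(a - b) ** 2 for a, b in zip(features_new, features_old)]
    return sum(squared) / len(squared)


def total_loss(forget_batch, retain_batch, margin=1.0, lam=1.0):
    """Combine both terms.

    forget_batch: list of (logits, label) pairs for forget samples.
    retain_batch: list of (features_new, features_old) pairs for retain samples.
    lam: hypothetical weight on the stability term.
    """
    forget = sum(forget_margin_loss(z, y, margin) for z, y in forget_batch) / len(forget_batch)
    retain = sum(stability_loss(a, b) for a, b in retain_batch) / len(retain_batch)
    return forget + lam * retain
```

In this sketch, a forget sample stops contributing to the loss once its true-class logit sits at least `margin` below the best competing logit, while the stability term penalizes any drift of retain-sample features away from the previous phase.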

Results

1. Comparison of methods for continual unlearning

  • Unlearning Efficacy (ToW)
    1. CIFAR100
    2. VGGFace2
    3. MUFAC
  • Knowledge Erosion
    [Figure: KE_grouped_bar_f16_g3]
  • Forgetting Reversal
    [Figure: FR_grouped_bar_f16_g3]
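
Knowledge Erosion and Forgetting Reversal are both defined across unlearning phases, so they can be tracked from per-phase evaluation logs. The sketch below shows one possible way to do this in plain Python. The metric definitions used in the paper are not given in this README, so both functions, their names, and their input layouts are assumptions for illustration only.

```python
def knowledge_erosion(retain_acc_by_phase):
    """Drop in retain accuracy at each phase, relative to the first phase."""
    base = retain_acc_by_phase[0]
    return [base - acc for acc in retain_acc_by_phase]


def forgetting_reversal_rate(correct_by_phase, forgotten_at):
    """Fraction of forgotten samples that become recognizable again later.

    correct_by_phase[t][i] is True if sample i is classified correctly
    after phase t. A sample counts as forgotten if it is misclassified at
    phase `forgotten_at`, and as reversed if any later phase classifies
    it correctly again.
    """
    forgotten = [i for i, ok in enumerate(correct_by_phase[forgotten_at]) if not ok]
    if not forgotten:
        return 0.0
    later = correct_by_phase[forgotten_at + 1:]
    reversed_ids = [i for i in forgotten if any(phase[i] for phase in later)]
    return len(reversed_ids) / len(forgotten)
```

A rising erosion value indicates that retain accuracy is degrading phase by phase, and a nonzero reversal rate indicates that some samples forgotten in an earlier phase are being classified correctly again.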

2. Membership inference attack (MIA) results across phases

  1. CIFAR100
     [Figure: cifar2_mia_v4]
  2. VGGFace2
     [Figure: vgg2_mia_v4]
  3. MUFAC
     [Figure: mufac_mia_v4]
