Description
This repo contains the code for Xu et al. "BAN: detecting backdoors activated by adversarial neuron noise." Advances in Neural Information Processing Systems 37 (2024): 114348-114373. This paper improves backdoor feature inversion for backdoor detection by incorporating extra neuron activation information. We adversarially increase the loss of backdoored models with respect to weights to activate the backdoor effect, based on which we can easily differentiate backdoored and clean models.
| Date made available | 24 Mar 2026 |
|---|---|
| Publisher | TU Delft - 4TU.ResearchData |
Research output
- 1 Conference contribution
-
BAN: Detecting Backdoors Activated by Adversarial Neuron Noise
Xu, X., Liu, Z., Koffas, S., Yu, S. & Picek, S., 2024, Proceedings of Advances in Neural Information Processing Systems. Vol. 37. p. 14348-114373 (Advances in Neural Information Processing Systems).Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review
Open AccessFile1 Downloads (Pure)
Cite this
- DataSetCite