Tolga Birdal

h-index: 27 3,538 citations 62 papers (total)

Papers in Database (1)

benchmark arXiv Oct 13, 2025 · Oct 2025

Adversarial Attacks Leverage Interference Between Features in Superposition

Edward Stevinson, Lucas Prieto, Melih Barsbey et al. · Imperial College London

Explains adversarial vulnerability mechanistically via superposition, showing feature interference predicts attack patterns and transferability

Input Manipulation Attack vision
3 citations PDF