FaRAccel: FPGA-Accelerated Defense Architecture for Efficient Bit-Flip Attack Resilience in Transformer Models
Najmeh Nazari 1, Banafsheh Saber Latibari 2, Elahe Hosseini 1, Fatemeh Movafagh 3, Chongzhou Fang 1, Hosein Mohammadi Makrani 1, Kevin Immanuel Gubbi 1, Abhijit Mahalanobis 2, Setareh Rafatirad 1, Hossein Sayadi 4, Houman Homayoun 1
Published on arXiv (arXiv:2510.24985)
Model Poisoning
OWASP ML Top 10 — ML10
Key Finding
FaRAccel achieves up to 15× speedup in FaR inference latency over software implementations while maintaining equivalent resilience against gradient-based Bit-Flip Attacks.
FaRAccel
Novel technique introduced
Forget and Rewire (FaR) has demonstrated strong resilience against Bit-Flip Attacks (BFAs) on Transformer-based models by obfuscating critical parameters through dynamic rewiring of linear layers. However, applying FaR introduces non-negligible performance and memory overheads, primarily due to the runtime modification of activation pathways and the lack of hardware-level optimization. To overcome these limitations, we propose FaRAccel, a novel hardware accelerator architecture implemented on FPGA and specifically designed to offload and optimize FaR operations. FaRAccel integrates reconfigurable logic for dynamic activation rerouting and lightweight storage of rewiring configurations, enabling low-latency inference with minimal energy overhead. We evaluate FaRAccel across a suite of Transformer models and demonstrate substantial reductions in FaR inference latency and improvements in energy efficiency, while maintaining the robustness gains of the original FaR methodology. To the best of our knowledge, this is the first hardware-accelerated defense against BFAs in Transformers, effectively bridging the gap between algorithmic resilience and efficient deployment on real-world AI platforms.
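The core idea of concealing which stored weight feeds which logical connection can be illustrated with a minimal NumPy sketch. Note this is a simplified, hypothetical stand-in for the paper's actual FaR mechanism (whose rewiring scheme is not detailed here): it permutes the columns of a toy linear layer so the stored parameter layout no longer matches the logical one, then reroutes activations through the same permutation at inference time.

```python
import numpy as np

rng = np.random.default_rng(0)

# A toy linear layer: y = W @ x  (shapes are illustrative).
in_dim, out_dim = 8, 4
W = rng.standard_normal((out_dim, in_dim))
x = rng.standard_normal(in_dim)

# "Rewire": store the weight matrix with its input columns
# permuted.  The stored layout no longer aligns with the logical
# parameter positions, so an attacker who targets a specific
# stored location hits a different logical weight.
perm = rng.permutation(in_dim)   # the rewiring configuration
W_stored = W[:, perm]            # obfuscated weight layout

# Inference reroutes the activations through the same permutation
# before the matrix multiply, recovering the original output.
y_original = W @ x
y_rewired = W_stored @ x[perm]

assert np.allclose(y_original, y_rewired)
```

The sketch also shows why a software implementation pays overhead on every forward pass (the extra gather `x[perm]`), which is the cost FaRAccel moves into reconfigurable FPGA logic alongside lightweight storage of the rewiring configuration.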
Key Contributions
- FaRAccel: first FPGA-based hardware accelerator for executing Forget-and-Rewire (FaR) BFA defense operations on Transformer models
- Reconfigurable datapath supporting dynamic neuron rerouting, activation modulation, and parameter concealment without model retraining
- Up to 15× inference latency speedup over software-based FaR while preserving equivalent robustness against Bit-Flip Attacks
🛡️ Threat Analysis
Bit-Flip Attacks corrupt model weights/parameters at the hardware level (via DRAM vulnerabilities) to degrade or redirect model behavior — a form of model parameter corruption. FaRAccel accelerates the FaR defense, which obfuscates critical weight parameters through dynamic neuron rewiring to resist these targeted weight-manipulation attacks.
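To see why a single flipped bit is so damaging, consider the IEEE-754 encoding of a float32 weight: flipping one high exponent bit turns a modest value into an astronomically large one, which is the kind of corruption a DRAM disturbance attack (e.g., RowHammer-style) can induce in stored parameters. The helper below is a generic illustration, not code from the paper.

```python
import struct

def flip_bit(value: float, bit: int) -> float:
    """Flip one bit of a value's IEEE-754 float32 representation."""
    (bits,) = struct.unpack("<I", struct.pack("<f", value))
    bits ^= 1 << bit
    (flipped,) = struct.unpack("<f", struct.pack("<I", bits))
    return flipped

w = 0.5
# Bit 30 is the most significant exponent bit of a float32.
# Flipping it in 0.5 (0x3F000000) yields 0x7F000000 = 2**127,
# a weight magnitude change of ~38 orders of magnitude from a
# single bit -- enough to derail a Transformer's predictions.
w_corrupt = flip_bit(w, 30)
print(w, "->", w_corrupt)  # 0.5 -> 1.7014118346046923e+38
```

Gradient-based BFAs search for exactly such high-impact bits; FaR's rewiring breaks the attacker's mapping from a chosen logical weight to its physical storage location, and FaRAccel makes that defense cheap enough to run at inference time.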