Latest papers

2 papers
defense arXiv Dec 23, 2025

Bridging Efficiency and Safety: Formal Verification of Neural Networks with Early Exits

Yizhak Yisrael Elboher, Avraham Raviv, Amihay Elboher et al. · The Hebrew University of Jerusalem · Bar Ilan University +2 more

Formal verification framework for early exit neural networks that certifies local robustness and improves verification efficiency

Input Manipulation Attack vision nlp
1 citation
attack arXiv Jan 3, 2025

Rerouting LLM Routers

Avital Shafran, Roei Schuster, Thomas Ristenpart et al. · The Hebrew University of Jerusalem · Wild Moose +1 more

Adversarially optimized token sequences (confounder gadgets) reliably manipulate LLM routers into routing any query to expensive models, evading perplexity defenses

Input Manipulation Attack nlp
7 citations