ML Security Papers

Latest papers

2 papers

defense arXiv Dec 23, 2025 · Dec 2025

Bridging Efficiency and Safety: Formal Verification of Neural Networks with Early Exits

Yizhak Yisrael Elboher, Avraham Raviv, Amihay Elboher et al. · The Hebrew University of Jerusalem · Bar Ilan University +2 more

Formal verification framework for early exit neural networks that certifies local robustness and improves verification efficiency

Input Manipulation Attack visionnlp

1 citations PDF

attack arXiv Jan 3, 2025 · Jan 2025

Rerouting LLM Routers

Avital Shafran, Roei Schuster, Thomas Ristenpart et al. · The Hebrew University of Jerusalem · Wild Moose +1 more

Adversarially optimized token sequences (confounder gadgets) reliably manipulate LLM routers into routing any query to expensive models, evading perplexity defenses

Input Manipulation Attack nlp

7 citations PDF

Latest papers

Bridging Efficiency and Safety: Formal Verification of Neural Networks with Early Exits

Rerouting LLM Routers

Filters

Time Period

Paper Type

OWASP ML Top 10

OWASP LLM Top 10

Institution

Venue