ML Security Papers

Latest papers

2 papers

tool arXiv Feb 17, 2026 · 6w ago

ExLipBaB: Exact Lipschitz Constant Computation for Piecewise Linear Neural Networks

Tom A. Splittgerber · University of Bremen

Exact Lipschitz constant computation tool extended to arbitrary piecewise linear networks for certified robustness guarantees

Input Manipulation Attack vision

PDF

defense arXiv Sep 30, 2025 · Sep 2025

SafeBehavior: Simulating Human-Like Multistage Reasoning to Mitigate Jailbreak Attacks in Large Language Models

Qinjian Zhao, Jiaqi Wang, Zhiqiang Gao et al. · Wenzhou-Kean University · University of Bremen +2 more

Three-stage LLM jailbreak defense using intention inference, self-introspection, and self-revision to counter optimization-based and prompt-based attacks

Input Manipulation Attack Prompt Injection nlp

PDF

Latest papers

ExLipBaB: Exact Lipschitz Constant Computation for Piecewise Linear Neural Networks

SafeBehavior: Simulating Human-Like Multistage Reasoning to Mitigate Jailbreak Attacks in Large Language Models

Filters

Time Period

Paper Type

OWASP ML Top 10

OWASP LLM Top 10

Institution

Venue