Latest papers

3 papers
attack arXiv Apr 18, 2026 · 4w ago

When Choices Become Risks: Safety Failures of Large Language Models under Multiple-Choice Constraints

Yuheng Chen, Zhiyu Wu, Bowen Cheng et al. · Kagoshima University · Fudan University +1 more

Bypasses LLM safety alignment by reformulating harmful prompts as forced-choice questions where all options violate policies

Prompt Injection nlp
PDF
defense IEEE Access Jan 1, 2026 · Jan 2026

Rectifying Adversarial Examples Using Their Vulnerabilities

Fumiya Morimoto, Ryuto Morita, Satoshi Ono · Kagoshima University

Defends against adversarial examples by re-attacking them across the decision boundary to recover correct original labels

Input Manipulation Attack vision
2 citations 1 influentialPDF
attack IEICE Transactions on Informat... Dec 31, 2025 · Dec 2025

Projection-based Adversarial Attack using Physics-in-the-Loop Optimization for Monocular Depth Estimation

Takeru Kusakabe, Yudai Hirose, Mashiho Mukaida et al. · Kagoshima University

Projects adversarial light patterns onto objects using physics-in-the-loop black-box optimization to fool monocular depth estimation DNNs

Input Manipulation Attack vision
PDF