Xiaolin Hu

Papers in Database (2)

defense arXiv Apr 12, 2026 · 5w ago

Defending against Patch-Based and Texture-Based Adversarial Attacks with Spectral Decomposition

Wei Zhang, Xinyu Chang, Xiao Li et al. · Tsinghua University · University of Science and Technology Beijing

Spectral defense using wavelet decomposition to detect and mitigate both patch-based and texture-based adversarial attacks on vision models

Input Manipulation Attack vision
PDF Code
defense arXiv Sep 4, 2025 · Sep 2025

Between a Rock and a Hard Place: The Tension Between Ethical Reasoning and Safety Alignment in LLMs

Shei Pern Chua, Zhen Leng Thai, Kai Jun Teh et al. · Tsinghua University · ByteDance +1 more

Multi-turn jailbreak embeds harmful requests in ethical dilemmas to bypass LLM safety; LoRA defense separates analytic from instrumental harmful responses

Prompt Injection nlp
PDF