Duen Horng Chau

h-index: 48 · 9,903 citations · 267 papers (total)

Papers in Database (2)

defense · arXiv · Oct 1, 2025

Large Reasoning Models Learn Better Alignment from Flawed Thinking

ShengYun Peng, Eric Smith, Ivan Evtimov et al. · Meta · Georgia Institute of Technology +1 more

Defends LLMs against chain-of-thought jailbreaks by RL-training models to self-correct injected flawed reasoning premises

Prompt Injection nlp
7 citations
tool · arXiv · Oct 19, 2025

UNDREAM: Bridging Differentiable Rendering and Photorealistic Simulation for End-to-end Adversarial Attacks

Mansi Phute, Matthew Hull, Haoran Wang et al.

Software framework bridging Unreal Engine and differentiable rendering for end-to-end physical adversarial texture optimization on 3D objects

Input Manipulation Attack vision