Latest papers

2 papers
attack · arXiv · Nov 12, 2025

Cost-Minimized Label-Flipping Poisoning Attack to LLM Alignment

Shigeki Kusaka, Keita Saito, Mikoto Kudo et al. · University of Tsukuba · RIKEN +2 more

Theoretically minimizes the cost of label-flipping attacks on RLHF/DPO alignment via convex-optimization post-processing

Data Poisoning Attack · Training Data Poisoning · nlp
1 citation · PDF · Code
defense · arXiv · Sep 10, 2025

Securing Private Federated Learning in a Malicious Setting: A Scalable TEE-Based Approach with Client Auditing

Shun Takagi, Satoshi Hasegawa · LY Corporation

A TEE-based trusted computing base, combined with client auditing, enforces DP-FTRL privacy guarantees against malicious servers in cross-device federated learning

Model Inversion Attack · federated-learning
PDF