Zeerak Talat

h-index: 23 · 8,285 citations · 65 papers (total)

Papers in Database (2)

defense · IJCNLP-AACL · Oct 19, 2025

Online Learning Defense against Iterative Jailbreak Attacks via Prompt Optimization

Masahiro Kaneko, Zeerak Talat, Timothy Baldwin · MBZUAI · University of Edinburgh

Online learning defense dynamically counters iterative LLM jailbreaks via RL prompt optimization and gradient damping

Prompt Injection · nlp
3 citations
attack · arXiv · Oct 15, 2025

Personal Attribute Leakage in Federated Speech Models

Hamdan Al-Ali, Ali Reza Ghavamipour, Tommaso Caselli et al. · Mohamed bin Zayed University of Artificial Intelligence · Maastricht University +2 more

Infers private personal attributes from federated ASR model weight differentials using shadow models and centroid classification

Model Inversion Attack · audio · federated-learning