Sergey D. Erokhin

Papers in Database (1)

attack arXiv Aug 22, 2025 · Aug 2025

HAMSA: Hijacking Aligned Compact Models via Stealthy Automation

Alexey Krylov, Iskander Vagizov, Dmitrii Korzh et al. · MIPT · Sberbank +4 more

Evolutionary search framework generates fluent, perplexity-evading jailbreak prompts against safety-aligned compact LLMs in English and Arabic

Prompt Injection nlp
PDF