Benyamin Tafreshian

h-index: 1 5 citations 2 papers (total)

Papers in Database (1)

attack arXiv Nov 24, 2025 · Nov 2025

RoguePrompt: Dual-Layer Ciphering for Self-Reconstruction to Circumvent LLM Moderation

Benyamin Tafreshian · Boston University

Automated cipher-encoding jailbreak using ROT-13 and Vigenère nesting to bypass LLM moderation and self-reconstruct forbidden prompts

Prompt Injection nlp
PDF