Jaejin Lee

Papers in Database (1)

attack arXiv Aug 2, 2025 · Aug 2025

PUZZLED: Jailbreaking LLMs through Word-Based Puzzles

Yelim Ahn, Jaejin Lee · Seoul National University

Jailbreaks LLMs by embedding harmful keywords as word search, anagram, and crossword puzzles, achieving 88.8% average ASR across five frontier models

Prompt Injection nlp
PDF