Juan Ren

h-index: 2 7 citations 4 papers (total)

Papers in Database (2)

defense arXiv Oct 15, 2025 · Oct 2025

SHIELD: Classifier-Guided Prompting for Robust and Safer LVLMs

Juan Ren, Mark Dras, Usman Naseem · Macquarie University

Plug-and-play preprocessing guardrail for LVLMs that classifies harm categories and applies tailored Block/Reframe/Forward safety prompts against multimodal jailbreaks

Input Manipulation Attack Prompt Injection visionnlpmultimodal
4 citations PDF Code
defense BigData Congress Oct 29, 2025 · Oct 2025

Agentic Moderation: Multi-Agent Design for Safer Vision-Language Models

Juan Ren, Mark Dras, Usman Naseem · Macquarie University

Multi-agent safety framework defending VLMs against jailbreak attacks via cooperative Shield, Evaluator, and Reflector agents with context-aware moderation

Input Manipulation Attack Prompt Injection multimodalvisionnlp
1 citations PDF