attack 2025

TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models

0 citations · 47 references · arXiv

Published on arXiv

2511.21145

Prompt Injection

OWASP LLM Top 10 — LLM01

Key Finding

TEAR achieves over 80% attack success rate on open-source and commercial T2V models, a significant improvement over the prior best of 57%.

TEAR

Novel technique introduced

Text-to-Video (T2V) models are capable of synthesizing high-quality, temporally coherent dynamic video content, but the diverse generation also inherently introduces critical safety challenges. Existing safety evaluation methods,which focus on static image and text generation, are insufficient to capture the complex temporal dynamics in video generation. To address this, we propose a TEmporal-aware Automated Red-teaming framework, named TEAR, an automated framework designed to uncover safety risks specifically linked to the dynamic temporal sequencing of T2V models. TEAR employs a temporal-aware test generator optimized via a two-stage approach: initial generator training and temporal-aware online preference learning, to craft textually innocuous prompts that exploit temporal dynamics to elicit policy-violating video output. And a refine model is adopted to improve the prompt stealthiness and adversarial effectiveness cyclically. Extensive experimental evaluation demonstrates the effectiveness of TEAR across open-source and commercial T2V systems with over 80% attack success rate, a significant boost from prior best result of 57%.

Key Contributions

TEAR: a two-stage automated red-teaming framework (initial generator training + temporal-aware online preference learning) that generates textually innocuous prompts exploiting temporal dynamics in T2V models
A cyclical refine model that iteratively improves prompt stealthiness and adversarial effectiveness
State-of-the-art attack success rate of >80% on open-source and commercial T2V systems, up from prior best of 57%

🛡️ Threat Analysis

Details

Domains

visionnlpgenerativemultimodal

Model Types

diffusiontransformermultimodal

Threat Tags

black_boxinference_timetargeted

Applications

text-to-video generationvideo synthesis safety

Read PDF arXiv DOI

TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

VEIL: Jailbreaking Text-to-Video Models via Visual Exploitation from Implicit Language

$PC^2$: Politically Controversial Content Generation via Jailbreaking Attacks on GPT-based Text-to-Image Models

Metaphor-based Jailbreaking Attacks on Text-to-Image Models

Multimodal Prompt Decoupling Attack on the Safety Filters in Text-to-Image Models

RunawayEvil: Jailbreaking the Image-to-Video Generative Models

When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

JANUS: A Lightweight Framework for Jailbreaking Text-to-Image Models via Distribution Optimization