Latest papers

5 papers
tool arXiv Apr 22, 2026 · 29d ago

AVISE: Framework for Evaluating the Security of AI Systems

Mikko Lempinen, Joni Kemppainen, Niklas Raesalmi · University of Oulu

Modular framework for automated LLM security testing, demonstrating multi-turn Red Queen jailbreak attacks across nine models

Prompt Injection nlp
PDF
benchmark arXiv Apr 9, 2026 · 6w ago

SciFigDetect: A Benchmark for AI-Generated Scientific Figure Detection

You Hu, Chenzhuo Zhao, Changfa Mo et al. · Zhejiang University · Independent Researcher +1 more

Benchmark dataset and evaluation framework for detecting AI-generated scientific figures across multiple generator sources and degradation scenarios

Output Integrity Attack visionmultimodalnlp
PDF Code
defense arXiv Dec 16, 2025 · Dec 2025

Semantic Mismatch and Perceptual Degradation: A New Perspective on Image Editing Immunity

Shuai Dong, Jie Zhang, Guoying Zhao et al. · China University of Geosciences · Chinese Academy of Sciences +2 more

Defends images from unauthorized diffusion model editing via adversarial intermediate feature perturbations that disrupt semantic and perceptual output quality

Output Integrity Attack visiongenerative
PDF
defense arXiv Nov 30, 2025 · Nov 2025

OmniFD: A Unified Model for Versatile Face Forgery Detection

Haotian Liu, Haoyu Chen, Chenhui Pan et al. · University of Oulu

Unified multi-task deepfake detection framework covering image/video classification and spatial/temporal localization in a single Swin Transformer model

Output Integrity Attack vision
PDF Code
defense IEEE Open Journal of the Commu... Sep 22, 2025 · Sep 2025

Hybrid Reputation Aggregation: A Robust Defense Mechanism for Adversarial Federated Learning in 5G and Edge Network Environments

Saeid Sheikhi, Panos Kostakos, Lauri Loven · University of Oulu

Defends federated learning against label flipping, backdoors, and Byzantine attacks via geometric anomaly detection plus reputation tracking

Data Poisoning Attack Model Poisoning federated-learningtabular
PDF