attack 2025

SegTrans: Transferable Adversarial Examples for Segmentation Models

Yufei Song 1, Ziqi Zhou 1, Qi Lu 1, Hangtao Zhang 1, Yifan Hu 1, Lulu Xue 1, Shengshan Hu 1, Minghui Li 1, Leo Yu Zhang 2

5 citations · 64 references · IEEE transactions on multimedi...

α

Published on arXiv

2510.08922

Input Manipulation Attack

OWASP ML Top 10 — ML01

Key Finding

SegTrans improves adversarial transfer success rate by an average of 8.55% over SOTA across four segmentation models and three backbone networks, with over 100% gain in computational efficiency.

SegTrans

Novel technique introduced


Segmentation models exhibit significant vulnerability to adversarial examples in white-box settings, but existing adversarial attack methods often show poor transferability across different segmentation models. While some researchers have explored transfer-based adversarial attack (i.e., transfer attack) methods for segmentation models, the complex contextual dependencies within these models and the feature distribution gaps between surrogate and target models result in unsatisfactory transfer success rates. To address these issues, we propose SegTrans, a novel transfer attack framework that divides the input sample into multiple local regions and remaps their semantic information to generate diverse enhanced samples. These enhanced samples replace the original ones for perturbation optimization, thereby improving the transferability of adversarial examples across different segmentation models. Unlike existing methods, SegTrans only retains local semantic information from the original input, rather than using global semantic information to optimize perturbations. Extensive experiments on two benchmark datasets, PASCAL VOC and Cityscapes, four different segmentation models, and three backbone networks show that SegTrans significantly improves adversarial transfer success rates without introducing additional computational overhead. Compared to the current state-of-the-art methods, SegTrans achieves an average increase of 8.55% in transfer attack success rate and improves computational efficiency by more than 100%.


Key Contributions

  • SegTrans framework that partitions input images into local regions and remaps semantic information to generate diverse enhanced samples for perturbation optimization, improving adversarial transferability across segmentation models.
  • Demonstrates that retaining only local (rather than global) semantic information during optimization is key to bridging the feature distribution gap between surrogate and target models.
  • Achieves an average 8.55% improvement in transfer attack success rate over SOTA methods while also improving computational efficiency by more than 100% on PASCAL VOC and Cityscapes.

🛡️ Threat Analysis

Input Manipulation Attack

SegTrans crafts adversarial perturbations on surrogate segmentation models that transfer to black-box target models at inference time — a classic input manipulation/evasion attack with a focus on cross-model transferability.


Details

Domains
vision
Model Types
cnntransformer
Threat Tags
white_boxblack_boxinference_timeuntargeteddigital
Datasets
PASCAL VOCCityscapes
Applications
semantic segmentation