attack 2025

RoBCtrl: Attacking GNN-Based Social Bot Detectors via Reinforced Manipulation of Bots Control Interaction

0 citations · 71 references · arXiv

Published on arXiv

2510.16035

Input Manipulation Attack

OWASP ML Top 10 — ML01

Key Finding

RoBCtrl effectively degrades GNN-based social bot detector performance across multiple datasets, with structural entropy-based state abstraction significantly reducing attack execution time.

RoBCtrl

Novel technique introduced

Social networks have become a crucial source of real-time information for individuals. The influence of social bots within these platforms has garnered considerable attention from researchers, leading to the development of numerous detection technologies. However, the vulnerability and robustness of these detection methods is still underexplored. Existing Graph Neural Network (GNN)-based methods cannot be directly applied due to the issues of limited control over social agents, the black-box nature of bot detectors, and the heterogeneity of bots. To address these challenges, this paper proposes the first adversarial multi-agent Reinforcement learning framework for social Bot control attacks (RoBCtrl) targeting GNN-based social bot detectors. Specifically, we use a diffusion model to generate high-fidelity bot accounts by reconstructing existing account data with minor modifications, thereby evading detection on social platforms. To the best of our knowledge, this is the first application of diffusion models to mimic the behavior of evolving social bots effectively. We then employ a Multi-Agent Reinforcement Learning (MARL) method to simulate bots adversarial behavior. We categorize social accounts based on their influence and budget. Different agents are then employed to control bot accounts across various categories, optimizing the attachment strategy through reinforcement learning. Additionally, a hierarchical state abstraction based on structural entropy is designed to accelerate the reinforcement learning. Extensive experiments on social bot detection datasets demonstrate that our framework can effectively undermine the performance of GNN-based detectors.

Key Contributions

First adversarial MARL framework (RoBCtrl) for black-box evasion attacks against GNN-based social bot detectors
First application of diffusion models to generate high-fidelity adversarial bot accounts by reconstructing real accounts with minimal modifications
Hierarchical state abstraction using structural entropy to accelerate MARL convergence during graph-based adversarial optimization

🛡️ Threat Analysis

Input Manipulation Attack

The framework crafts adversarial inputs — high-fidelity fake bot accounts (via diffusion model) and optimized graph interaction patterns (via MARL) — to evade GNN-based classifiers at inference time. This is a black-box evasion attack causing misclassification of malicious bots as legitimate accounts.

Details

Domains

graphreinforcement-learninggenerative

Model Types

gnnrldiffusion

Threat Tags

black_boxinference_timetargeteddigital

Datasets

social bot detection datasets (TwiBot-20, Cresci-2015 implied by experiments)

Applications

social bot detectionsocial network analysis

Read PDF arXiv DOI

RoBCtrl: Attacking GNN-Based Social Bot Detectors via Reinforced Manipulation of Bots Control Interaction

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Optimal Transport-Guided Adversarial Attacks on Graph Neural Network-Based Bot Detection

Evading Overlapping Community Detection via Proxy Node Injection

Vulnerability Analysis of Safe Reinforcement Learning via Inverse Constrained Reinforcement Learning

JANUS: A Dual-Constraint Generative Framework for Stealthy Node Injection Attacks

Diffusion Guided Adversarial State Perturbations in Reinforcement Learning

Learning to Attack: A Bandit Approach to Adversarial Context Poisoning

Leveraging Vulnerabilities in Temporal Graph Neural Networks via Strategic High-Impact Assaults

CAMA: Exploring Collusive Adversarial Attacks in c-MARL