Roman Vainshtein

h-index: 4 37 citations 19 papers (total)

Papers in Database (3)

defense arXiv Jan 18, 2026 · 11w ago

AgenTRIM: Tool Risk Mitigation for Agentic AI

Roy Betser, Shamik Bose, Amit Giloni et al. · Fujitsu

Defends LLM agents against indirect prompt injection and excessive agency via least-privilege tool access enforcement at runtime

Prompt Injection Excessive Agency nlp
4 citations PDF
attack arXiv Feb 4, 2026 · 8w ago

Inference-Time Backdoors via Hidden Instructions in LLM Chat Templates

Ariel Fogel, Omer Hofman, Eilon Cohen et al. · Pillar Security · Fujitsu Research of Europe

Backdoors LLMs by injecting malicious Jinja2 chat templates into GGUF files, evading HuggingFace scans with 80%+ attack success

AI Supply Chain Attacks Model Poisoning nlp
PDF Code
defense arXiv Feb 24, 2026 · 5w ago

Adversarial Intent is a Latent Variable: Stateful Trust Inference for Securing Multimodal Agentic RAG

Inderjeet Singh, Vikas Pahuja, Aishvariya Priya Rathina Sabapathy et al. · Fujitsu Research of Europe · Fujitsu Limited

Stateful POMDP-based defense detects distributed multi-stage prompt injections in multimodal agentic RAG via LLM belief-state tracking

Input Manipulation Attack Prompt Injection multimodalnlp
PDF