Eduard Kapelko

h-index: 0 · 0 citations · 1 paper (total)

Papers in Database (1)

defense arXiv Sep 23, 2025 · Sep 2025

Cyclic Ablation: Testing Concept Localization against Functional Regeneration in AI

Eduard Kapelko

Tests whether LLM deception is localizable and removable via sparse autoencoder ablation, finding it resilient and distributed

Prompt Injection nlp