Ahmed Mohamed Hussain

Papers in Database (1)

benchmark arXiv Jan 2, 2025 · Jan 2025

CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models

Johan Wahréus, Ahmed Mohamed Hussain, Panos Papadimitratos · KTH Royal Institute of Technology

Introduces cybersecurity-domain jailbreak benchmark with 12,662 prompts; prompt obfuscation attack achieves 88% success on Gemini

Prompt Injection nlp
PDF