Latest papers

3 papers
benchmark · Asia-Pacific Software Engineer... · Dec 16, 2025

PerProb: Indirectly Evaluating Memorization in Large Language Models

Yihan Liao, Jacky Keung, Xiaoxue Ma et al. · City University of Hong Kong · Hong Kong Metropolitan University

Benchmark framework that uses perplexity and log-probability metrics to evaluate membership inference vulnerabilities in LLMs across black-box and white-box settings

Membership Inference Attack · Sensitive Information Disclosure · nlp
defense · Asia-Pacific Software Engineer... · Dec 9, 2025

Exposing and Defending Membership Leakage in Vulnerability Prediction Models

Yihan Liao, Jacky Keung, Xiaoxue Ma et al. · City University of Hong Kong · Hong Kong Metropolitan University

Mounts membership inference attacks on vulnerability prediction models and defends against them via output masking and Gaussian noise injection

Membership Inference Attack · nlp
benchmark · arXiv · Sep 26, 2025

RedNote-Vibe: A Dataset for Capturing Temporal Dynamics of AI-Generated Text in Social Media

Yudong Li, Yufei Sun, Yuhan Yao et al. · Tsinghua University · Beijing University of Posts and Telecommunications +2 more

Longitudinal social media dataset of AI-generated text (AIGT) and a psycholinguistic detection framework, revealing temporal trends in AI-generated content engagement

Output Integrity Attack · nlp