Nov 18, 2025 • 1 min read T2I-Based Physical-World Appearance Attack against Traffic Sign Recognition Systems in Autonomous Driving arxiv papers
Nov 18, 2025 • 1 min read VEIL: Jailbreaking Text-to-Video Models via Visual Exploitation from Implicit Language arxiv papers
Nov 18, 2025 • 1 min read Whistledown: Combining User-Level Privacy with Conversational Coherence in LLMs arxiv papers
Nov 18, 2025 • 1 min read ForgeDAN: An Evolutionary Framework for Jailbreaking Aligned Large Language Models arxiv papers
Nov 11, 2025 • 1 min read Differentiated Directional Intervention: A Framework for Evading LLM Safety Alignment arxiv papers
Nov 11, 2025 • 1 min read EduGuardBench: A Holistic Benchmark for Evaluating the Pedagogical Fidelity and Adversarial Safety of LLMs as Simulated Teachers arxiv papers
Nov 11, 2025 • 1 min read FoCLIP: A Feature-Space Misalignment Framework for CLIP-Based Image Manipulation and Detection arxiv papers
Nov 11, 2025 • 1 min read JPRO: Automated Multimodal Jailbreaking via Multi-Agent Collaboration Framework arxiv papers
Nov 7, 2025 • 1 min read AdversariaLLM: A Unified and Modular Toolbox for LLM Robustness Research arxiv papers
Nov 6, 2025 • 1 min read Let the Bees Find the Weak Spots: A Path Planning Perspective on Multi-Turn Jailbreak Attacks against LLMs arxiv papers
Nov 5, 2025 • 1 min read An Automated Framework for Strategy Discovery, Retrieval, and Evolution in LLM Jailbreak Attacks arxiv papers