Nov 27, 2025 • 1 min read CAHS-Attack: CLIP-Aware Heuristic Search Attack Method for Stable Diffusion arxiv papers
Nov 27, 2025 • 1 min read Self-Guided Defense: Adaptive Safety Alignment for Reasoning Models via Synthesized Guidelines arxiv papers
Nov 27, 2025 • 1 min read Multimodal Robust Prompt Distillation for 3D Point Cloud Models arxiv papers
Nov 21, 2025 • 1 min read Multi-Faceted Attack: Exposing Cross-Model Vulnerabilities in Defense-Equipped Vision-Language Models arxiv papers
Nov 21, 2025 • 1 min read An Image Is Worth Ten Thousand Words: Verbose-Text Induction Attacks on VLMs arxiv papers
Nov 21, 2025 • 1 min read When Alignment Fails: Multimodal Adversarial Attacks on Vision-Language-Action Models arxiv papers
Nov 21, 2025 • 1 min read PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization arxiv papers
Nov 21, 2025 • 1 min read Q-MLLM: Vector Quantization for Robust Multimodal Large Language Model Security arxiv papers
Nov 21, 2025 • 1 min read "To Survive, I Must Defect": Jailbreaking LLMs via the Game-Theory Scenarios arxiv papers
Nov 21, 2025 • 1 min read The Shawshank Redemption of Embodied AI: Understanding and Benchmarking Indirect Environmental Jailbreaks arxiv papers
Nov 20, 2025 • 1 min read Effective Code Membership Inference for Code Completion Models via Adversarial Prompts arxiv papers
Nov 20, 2025 • 1 min read Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models arxiv papers