Dec 5, 2025 • 1 min read Chameleon: Adaptive Adversarial Agents for Scaling-Based Visual Prompt Injection in Multimodal AI Systems arxiv papers
Dec 4, 2025 • 1 min read Immunity memory-based jailbreak detection: multi-agent adaptive guard for large language models arxiv papers
Dec 3, 2025 • 1 min read COGNITION: From Evaluation to Defense against Multimodal LLM CAPTCHA Solvers arxiv papers
Dec 3, 2025 • 1 min read Characterizing Cyber Attacks against Space Infrastructures with Missing Data: Framework and Case Study arxiv papers
Dec 3, 2025 • 1 min read SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment arxiv papers
Dec 3, 2025 • 1 min read Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities arxiv papers
Dec 2, 2025 • 1 min read DefenSee: Dissecting Threat from Sight and Text - A Multi-View Defensive Pipeline for Multi-modal Jailbreaks arxiv papers
Dec 2, 2025 • 1 min read Securing Large Language Models (LLMs) from Prompt Injection Attacks arxiv papers
Dec 2, 2025 • 1 min read A Wolf in Sheep's Clothing: Bypassing Commercial LLM Guardrails via Harmless Prompt Weaving and Adaptive Tree Search arxiv papers
Nov 27, 2025 • 1 min read Towards Trustworthy Legal AI through LLM Agents and Formal Reasoning arxiv papers
Nov 27, 2025 • 1 min read TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models arxiv papers