arxiv papers

Dec 5, 2025 • 1 min read

Chameleon: Adaptive Adversarial Agents for Scaling-Based Visual Prompt Injection in Multimodal AI Systems

Dec 4, 2025 • 1 min read

Immunity memory-based jailbreak detection: multi-agent adaptive guard for large language models

Dec 3, 2025 • 1 min read

COGNITION: From Evaluation to Defense against Multimodal LLM CAPTCHA Solvers

Dec 3, 2025 • 1 min read

Characterizing Cyber Attacks against Space Infrastructures with Missing Data: Framework and Case Study

Dec 3, 2025 • 1 min read

SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment

Dec 3, 2025 • 1 min read

Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities

Dec 3, 2025 • 1 min read

Invasive Context Engineering to Control Large Language Models

Dec 2, 2025 • 1 min read

DefenSee: Dissecting Threat from Sight and Text - A Multi-View Defensive Pipeline for Multi-modal Jailbreaks

Dec 2, 2025 • 1 min read

Securing Large Language Models (LLMs) from Prompt Injection Attacks

Dec 2, 2025 • 1 min read

A Wolf in Sheep's Clothing: Bypassing Commercial LLM Guardrails via Harmless Prompt Weaving and Adaptive Tree Search

Nov 27, 2025 • 1 min read

Towards Trustworthy Legal AI through LLM Agents and Formal Reasoning

Nov 27, 2025 • 1 min read

TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models
