gptleaks (Page 2)

Dec 9, 2025 • 1 min read

Think-Reflect-Revise: A Policy-Guided Reflective Framework for Safety Alignment in Large Vision Language Models

arxiv papers

Dec 9, 2025 • 1 min read

RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models

arxiv papers

Dec 8, 2025 • 2 min read

Navigating Artificial Intelligence: Innovations, Ethics, and Regulations

weekly news about ai

Dec 8, 2025 • 1 min read

Maximizing Efficiency through Iterative Algorithm Management

weekly news about llm security

Dec 5, 2025 • 1 min read

Malicious Image Analysis via Vision-Language Segmentation Fusion: Detection, Element, and Location in One-shot

arxiv papers

Dec 5, 2025 • 1 min read

SoK: a Comprehensive Causality Analysis Framework for Large Language Model Security

arxiv papers

Dec 5, 2025 • 1 min read

Chameleon: Adaptive Adversarial Agents for Scaling-Based Visual Prompt Injection in Multimodal AI Systems

arxiv papers

Dec 4, 2025 • 1 min read

Immunity memory-based jailbreak detection: multi-agent adaptive guard for large language models

arxiv papers

Dec 3, 2025 • 1 min read

COGNITION: From Evaluation to Defense against Multimodal LLM CAPTCHA Solvers

arxiv papers

Dec 3, 2025 • 1 min read

Characterizing Cyber Attacks against Space Infrastructures with Missing Data: Framework and Case Study

arxiv papers

Dec 3, 2025 • 1 min read

SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment

arxiv papers

Dec 3, 2025 • 1 min read

Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities

arxiv papers