gptleaks

Nov 17, 2025 • 2 min read

Exploring the Impact of Artificial Intelligence Across Industries

weekly news about ai

Nov 17, 2025 • 1 min read

Securing Large Language Models: Exploring Vulnerabilities and Best Practices

weekly news about llm security

Nov 11, 2025 • 1 min read

Differentiated Directional Intervention A Framework for Evading LLM Safety Alignment

arxiv papers

Nov 11, 2025 • 1 min read

EduGuardBench: A Holistic Benchmark for Evaluating the Pedagogical Fidelity and Adversarial Safety of LLMs as Simulated Teachers

arxiv papers

Nov 11, 2025 • 1 min read

FoCLIP: A Feature-Space Misalignment Framework for CLIP-Based Image Manipulation and Detection

arxiv papers

Nov 11, 2025 • 1 min read

JPRO: Automated Multimodal Jailbreaking via Multi-Agent Collaboration Framework

arxiv papers

Nov 10, 2025 • 2 min read

AI: Shaping Society's Future

weekly news about ai

Nov 10, 2025 • 2 min read

Addressing the Security Concerns of Large Language Models

weekly news about llm security

Nov 7, 2025 • 1 min read

AdversariaLLM: A Unified and Modular Toolbox for LLM Robustness Research

arxiv papers

Nov 6, 2025 • 1 min read

Death by a Thousand Prompts: Open Model Vulnerability Analysis

arxiv papers

Nov 6, 2025 • 1 min read

Let the Bees Find the Weak Spots: A Path Planning Perspective on Multi-Turn Jailbreak Attacks against LLMs

arxiv papers

Nov 5, 2025 • 1 min read

An Automated Framework for Strategy Discovery, Retrieval, and Evolution in LLM Jailbreak Attacks

arxiv papers