Digests

June 16, 2025

Towards Secure AI Week 23 — From Zero-Click Exploits to Policy-Backed Guardrails: Where AI Security Stands Now

As AI systems transition from passive tools to autonomous agents, the risks surrounding them evolve just as fast. This week’s digest reveals how attackers are already exploiting agentic AI, how regulators are racing to keep up, and how industry is responding with new benchmarks and standards. From Microsoft’s EchoLeak zero-click ...

June 11, 2025

MCP Security + MCP Security Digest ADMIN

MCP Security Digest — June 2025

MCP Security is a top concern for anyone building Agentic AI systems. The Model Context Protocol (MCP) connects tools, agents, and actions. It plays a role similar to TCP/IP—but for autonomous workflows. If MCP is compromised, the entire agent stack is at risk. Attackers can inject prompts, hijack tools, and ...

June 9, 2025

Secure AI Weekly ADMIN

Towards Secure AI Week 22 — Testing the Limits of Guardrails and Autonomy

AI systems aren’t just generating answers—they’re taking action, reasoning independently, and connecting to real-world systems. This week’s stories highlight how current defenses fail to address these expanded capabilities, revealing critical blind spots in identity management, cross-agent communication, and cloud-based safety infrastructure. From one-shot jailbreaks and latent-level exploits to insecure identity ...

June 2, 2025

Secure AI Weekly ADMIN

Towards Secure AI Week 21 — From Reactive Defense to Capability-Aware AI Red Teaming

AI systems are no longer just responding to prompts — they’re acting, adapting, and making decisions. This week’s stories reveal how traditional security tools like SIEM, firewalls, and EDR fail to protect GenAI and Agentic AI systems, and why new approaches like continuous AI Red Teaming, identity enforcement, and jailbreak ...

May 26, 2025

Secure AI Weekly ADMIN

Towards Secure AI Week 20 — Identity, Jailbreaks, and the Future of Agentic AI Security

This week’s stories highlight the rapid emergence of new threats and defenses in the Agentic AI landscape. From OWASP’s DNS-inspired Agent Name Service (ANS) for verifying AI identities to real-world exploits like jailbreakable “dark LLMs” and prompt-injected assistants like GitLab Duo, the ecosystem is shifting toward identity-first architecture and layered ...

May 19, 2025

Secure AI Weekly ADMIN

Towards Secure AI Week 19 — AI Agents Under Attack, Evaluation Becomes Strategy

This week’s stories highlight a critical evolution in AI risk: the shift from isolated agent failures to system-level compromise in Agentic AI architectures and memory-based applications. From Princeton’s demonstration of cryptocurrency theft via false memory injection to Fortnite’s AI Darth Vader being manipulated into swearing within an hour of launch, ...

May 12, 2025

Secure AI Weekly ADMIN

Towards Secure AI Week 18 — LLM Jailbreaks Hit New Highs, AI Security Market Accelerates

As LLMs become embedded across enterprise applications, new red-teaming research shows jailbreak success rates surpassing 87% on models like GPT-4—even under safety-aligned settings. Techniques such as multi-turn roleplay, token-level obfuscation, and cross-model attacks continue to outpace current safeguards. Meanwhile, insider misuse and unfiltered GenAI outputs pose growing risks, prompting calls ...

May 7, 2025

MCP Security + MCP Security Digest ADMIN

MCP Security Digest — May 2025

MCP Security is a top concern for anyone building Agentic AI systems. The Model Context Protocol (MCP) connects tools, agents, and actions. It plays a role similar to TCP/IP—but for autonomous workflows. If MCP is compromised, the entire agent stack is at risk. Attackers can inject prompts, hijack tools, and ...

May 5, 2025

Secure AI Weekly ADMIN

Towards Secure AI Week 17 — AI Guardrails Under Pressure as Jailbreaking Techniques Advance

Enterprise use of generative AI is expanding, but so is the sophistication of attacks targeting these systems. New jailbreak methods are achieving nearly 100% success rates, even on well-aligned models like GPT-4 and Llama 3, while recent research exposes vulnerabilities in memory, prompt interpretation, and cross-tool coordination protocols like MCP. At ...
