Trusted AI Blog

May 27, 2025

89

Cisco The state of AI Security 2025 Annual report — Top 10 insights

AI Is Eating the Enterprise — But the Enterprise Is on the Menu Too. Seventy-two percent of organisations already embed AI, yet only 13 percent feel truly ready. Attackers know that gap and are rushing to weaponise it. Cisco’s latest report reads like a flight-recorder transcript from the future of ...

May 26, 2025

146

Secure AI Weekly + Digests ADMIN

Towards Secure AI Week 20 — Identity, Jailbreaks, and the Future of Agentic AI Security

This week’s stories highlight the rapid emergence of new threats and defenses in the Agentic AI landscape. From OWASP’s DNS-inspired Agent Name Service (ANS) for verifying AI identities to real-world exploits like jailbreakable “dark LLMs” and prompt-injected assistants like GitLab Duo, the ecosystem is shifting toward identity-first architecture and layered ...

May 22, 2025

344

Articles + LLM Security ADMIN

Prompt Injection Risks Interview: Are AIs Ready to Defend Themselves? Conversation with ChatGPT, Claude, Grok & Deepseek

Prompt injection remains one of the most dangerous and poorly understood threats in AI security. To assess how today’s large language models (LLMs) handle Prompt Injection risks, we interviewed ChatGPT, Claude, Grok, and Deepseek. We asked each of them 11 expert-level questions covering real-world attacks, defense strategies, and future readiness. ...

May 20, 2025

227

Articles + Agentic AI Security ADMIN

Microsoft’s Taxonomy of Failure Modes in Agentic AI Systems — TOP 10 Insights

Based on Microsoft AI Red Team’s white paper “Taxonomy of Failure Modes in Agentic AI Systems”. Why CISOs, Architects & Staff Engineers Must Read Microsoft’s Agentic AI Failure Mode Taxonomy Agentic AI is moving from proof-of-concept to production faster than most security teams can update their threat models. In response, ...

May 19, 2025

151

Secure AI Weekly + Digests ADMIN

Towards Secure AI Week 19 — AI Agents Under Attack, Evaluation Becomes Strategy

This week’s stories highlight a critical evolution in AI risk: the shift from isolated agent failures to system-level compromise in Agentic AI architectures and memory-based applications. From Princeton’s demonstration of cryptocurrency theft via false memory injection to Fortnite’s AI Darth Vader being manipulated into swearing within an hour of launch, ...

May 14, 2025

124

Articles + GenAI Security ADMIN

ETSI TS 104 223: 10 Security Insights Every CISO Needs

As AI systems rapidly integrate into critical infrastructure and enterprise workflows, their attack surfaces are expanding just as quickly. Consequently, traditional cybersecurity controls are no longer sufficient. To address this growing risk, the new ETSI TS 104 223 V1.1.1 (2025-04) — Securing Artificial Intelligence (SAI); Baseline Cyber Security Requirements for ...

May 12, 2025

175

Secure AI Weekly + Digests ADMIN

Towards Secure AI Week 18 — LLM Jailbreaks Hit New Highs, AI Security Market Accelerates

As LLMs become embedded across enterprise applications, new red-teaming research shows jailbreak success rates surpassing 87% on models like GPT-4—even under safety-aligned settings. Techniques such as multi-turn roleplay, token-level obfuscation, and cross-model attacks continue to outpace current safeguards. Meanwhile, insider misuse and unfiltered GenAI outputs pose growing risks, prompting calls ...

May 7, 2025

30

MCP Security + Digests ADMIN

MCP Security Digest — May 2025

MCP Security is a top concern for anyone building Agentic AI systems. The Model Context Protocol (MCP) connects tools, agents, and actions. It plays a role similar to TCP/IP—but for autonomous workflows. If MCP is compromised, the entire agent stack is at risk. Attackers can inject prompts, hijack tools, and ...

May 5, 2025

141

Secure AI Weekly + Digests admin

Towards Secure AI Week 17 — AI Guardrails Under Pressure as Jailbreaking Techniques Advance

Enterprise use of generative AI is expanding, but so is the sophistication of attacks targeting these systems. New jailbreak methods are achieving nearly 100% success rates, even on well-aligned models like GPT-4 and Llama3, while recent research exposes vulnerabilities in memory, prompt interpretation, and cross-tool coordination protocols like MCP. At ...

Cisco The state of AI Security 2025 Annual report — Top 10 insights

Towards Secure AI Week 20 — Identity, Jailbreaks, and the Future of Agentic AI Security

Prompt Injection Risks Interview: Are AIs Ready to Defend Themselves? Conversation with ChatGPT, Claude, Grok & Deepseek

Microsoft’s Taxonomy of Failure Modes in Agentic AI Systems — TOP 10 Insights

Towards Secure AI Week 19 — AI Agents Under Attack, Evaluation Becomes Strategy

ETSI TS 104 223: 10 Security Insights Every CISO Needs

Towards Secure AI Week 18 — LLM Jailbreaks Hit New Highs, AI Security Market Accelerates

MCP Security Digest — May 2025

Towards Secure AI Week 17 — AI Guardrails Under Pressure as Jailbreaking Techniques Advance

Trusted AI Security

Explore Our Blog

Featured Post

Universal LLM Jailbreak: ChatGPT, GPT-4, BARD, BING, Anthropic, and Beyond

Latest Posts

Adversa AI was selected as TOP #6 AI blog in Israel by FeedSpot

MCP Security Digest — June 2025

Agentic AI Red Teaming Interview: Can Autonomous Agents Handle Adversarial Testing? Conversation with ChatGPT, Claude, Grok & Deepseek

Towards Secure AI Week 22 — Testing the Limits of Guardrails and Autonomy

CSA’s Agentic AI Red Teaming Guide: 10 Quick Insights You Can’t Afford to Ignore

Adversa AI Agentic AI Red Teaming Platform Wins Leading Cybersecurity solution in AI at Fortress Cybersecurity Awards

Top 12 Security Issues in Model Context Protocol (MCP) and How to Fix Them

Towards Secure AI Week 21 — From Reactive Defense to Capability-Aware AI Red Teaming

ICIT Securing AI: Addressing the OWASP Top 10 for Large Language Model Applications — TOP 10 insights

Cisco The state of AI Security 2025 Annual report — Top 10 insights

Towards Secure AI Week 20 — Identity, Jailbreaks, and the Future of Agentic AI Security

Prompt Injection Risks Interview: Are AIs Ready to Defend Themselves? Conversation with ChatGPT, Claude, Grok & Deepseek