Publications

55 Results / Page 5 of 7

January 31, 2025 · Research + LLM Security · admin

DeepSeek Jailbreaks

In this article, we demonstrate how DeepSeek responds to different jailbreak techniques. Our initial study on AI Red Teaming different LLM models using various approaches focused on models released before the so-called “Reasoning Revolution”, offering a baseline for security assessments before the emergence of advanced reasoning-based ...

April 2, 2024 · Research + LLM Security · admin

LLM Red Teaming: Adversarial, Programming, and Linguistic Approaches vs. ChatGPT, Claude, Mistral, Grok, LLAMA, and Gemini

Warning: some of the examples may be harmful! The authors of this article show LLM Red Teaming and hacking techniques but have no intention to endorse or support any recommendations made by the AI chatbots discussed in this post. The sole purpose of this article is to provide educational information and ...

November 15, 2023 · Research + LLM Security · admin

What Is Prompt Leaking, API Leaking, and Document Leaking in LLM Red Teaming?

What is AI Prompt Leaking? The Adversa AI Research team revealed a number of new LLM vulnerabilities, including some that result in Prompt Leaking and affect almost any Custom GPT right now. Subscribe for the latest LLM Security news: Prompt Leaking, Jailbreaks, Attacks, CISO guides, VC Reviews, and more. Step one. Approximate Prompt ...

November 1, 2023 · Article + LLM Security · admin

White House Executive Order On Safe And Secure AI: A Need For External AI Red Teaming

Why is it important? In recognition of AI’s transformative potential and the associated challenges, President Biden has taken the decisive step of issuing an Executive Order geared toward ensuring AI evolves safely, securely, and in the best interest of all Americans. Given the expansive impacts of AI, it’s pivotal that ...

March 15, 2023 · Research + LLM Security · admin

GPT-4 Jailbreak and Hacking via RabbitHole Attack, Prompt Injection, Content Moderation Bypass, and Weaponizing AI

A GPT-4 jailbreak is what all the users have been waiting for since the GPT-4 release. We delivered one within an hour. Subscribe for the latest AI Jailbreaks, Attacks, and Vulnerabilities. Today marks the highly anticipated release of OpenAI’s GPT-4, the latest iteration of the groundbreaking natural language processing and CV ...

December 6, 2022 · Research + LLM Security · admin

ChatGPT Security: eliminating humanity and hacking Dalle-2 using a trick from Jay and Silent Bob

ChatGPT Security note: the authors of this article show ChatGPT hacking techniques but have no intention to endorse or support any recommendations made by ChatGPT that are discussed in this post. The sole purpose of this article is to provide educational information and ...