Trusted AI Blog

April 9, 2024

323

Towards Secure AI Week 14 – New AI Security Report and Hacking Grok AI

X’s Grok AI is great – if you want to know how to hot wire a car, make drugs, or worse The Register, April 2, 2024 The innovative generative AI model known as Grok, developed under the helm of Elon Musk’s X, faces a significant challenge: Despite its advanced capabilities, ...

April 2, 2024

3593

Articles + LLM Security admin

LLM Red Teaming: Adversarial, Programming, and Linguistic approaches VS ChatGPT, Claude, Mistral, Grok, LLAMA, and Gemini

Warning, Some of the examples may be harmful!: The authors of this article show LLM Red Teaming and hacking techniques but have no intention to endorse or support any recommendations made by AI Chatbots discussed in this post. The sole purpose of this article is to provide educational information and ...

April 1, 2024

108

Secure AI Weekly + Digests admin

Towards Secure AI Week 13 – Advancing AI Governance and Security

California Releases Generative AI State Procurement Guidelines Government Technology, March 22, 2024 In response to Governor Gavin Newsom’s Executive Order N-12-23, which called for a closer examination of generative AI technologies, new directives have been introduced to fortify the security and safety measures surrounding AI within state agencies and vendor ...

March 25, 2024

133

Secure AI Weekly + Digests admin

Towards Secure AI Week 12 – New AI Security Framework

Introducing the Databricks AI Security Framework (DASF) Data Bricks, March 21, 2024 This framework has been meticulously crafted to foster collaboration across various domains including business, IT, data, AI, and security, offering a comprehensive approach towards fortifying AI systems against potential threats. Through demystifying AI and ML concepts, cataloging AI ...

March 24, 2024

105

Digests + LLM Security admin

LLM Security Digest: From Chatbot Mishaps to Job Opportunities

Welcome to our LLM Security Digest! In this edition we unveil the LLM security threat model and serious incident, LLM Prompt Injection techniques and noteworthy LLM Security courses. Take a look at the best LLM security job and comprehensive resource empowering CISOs to fortify LLM strategies against adversarial risks. Let’s ...

March 21, 2024

113

Secure AI Weekly + Digests admin

Towards Secure AI Week 11 – GenAI security policies

Hackers can read private AI-assistant chats even though they’re encrypted ArsTechnica, March 14, 2024 Despite efforts to encrypt communications, a newly developed attack has demonstrated the ability to decode AI assistant responses with alarming accuracy. Exploiting a side channel present in major AI systems, excluding Google Gemini, this attack compromises ...

March 11, 2024

161

Secure AI Weekly + Digests admin

Towards Secure AI Week 10 – AI worm VS Malicious AI Models

Over 100 Malicious AI/ML Models Found on Hugging Face Platform The Hacker News, March 4, 2024 In recent discoveries on the Hugging Face platform, alarming revelations have emerged, with as many as 100 malicious artificial intelligence (AI) and machine learning (ML) models being identified. JFrog, a software supply chain security ...

March 5, 2024

131

Secure AI Weekly + Digests admin

Towards Secure AI Week 9 – BEAST Jailbreak and AI Security Predictions 2024

Cyber Insights 2024: Artificial Intelligence Security Week, February 26, 2024 In the ever-evolving landscape of AI within cybersecurity, 2024 brings forth profound insights from Mr. Alex Polyakov, CEO and co-founder of Adversa AI. Polyakov highlights the expanding threat landscape, citing instances such as the jailbreak of Chevrolet’s Chatbot and data ...

February 26, 2024

170

Secure AI Weekly + Digests admin

Towards Secure AI Week 8 – FS-ISAC AI Risk Guides

Google Gemini “Diverse” Prompt Injection Know Your Meme, February 22, 2024 This scrutiny emphasizes the necessity for a steadfast commitment to Quality and Robustness testing before releasing AI in production. The crux of the controversy emerged on February 9th, 2024, when a Reddit user expressed dissatisfaction with Gemini’s seeming inability ...

Towards Secure AI Week 14 – New AI Security Report and Hacking Grok AI

LLM Red Teaming: Adversarial, Programming, and Linguistic approaches VS ChatGPT, Claude, Mistral, Grok, LLAMA, and Gemini

Towards Secure AI Week 13 – Advancing AI Governance and Security

Towards Secure AI Week 12 – New AI Security Framework

LLM Security Digest: From Chatbot Mishaps to Job Opportunities

Towards Secure AI Week 11 – GenAI security policies

Towards Secure AI Week 10 – AI worm VS Malicious AI Models

Towards Secure AI Week 9 – BEAST Jailbreak and AI Security Predictions 2024

Towards Secure AI Week 8 – FS-ISAC AI Risk Guides

Trusted AI Security

Explore Our Blog

Featured Post

Universal LLM Jailbreak: ChatGPT, GPT-4, BARD, BING, Anthropic, and Beyond

Latest Posts

Adversa AI was selected as TOP #6 AI blog in Israel by FeedSpot

MCP Security Digest — June 2025

Agentic AI Red Teaming Interview: Can Autonomous Agents Handle Adversarial Testing? Conversation with ChatGPT, Claude, Grok & Deepseek

Towards Secure AI Week 22 — Testing the Limits of Guardrails and Autonomy

CSA’s Agentic AI Red Teaming Guide: 10 Quick Insights You Can’t Afford to Ignore

Adversa AI Agentic AI Red Teaming Platform Wins Leading Cybersecurity solution in AI at Fortress Cybersecurity Awards

Top 12 Security Issues in Model Context Protocol (MCP) and How to Fix Them

Towards Secure AI Week 21 — From Reactive Defense to Capability-Aware AI Red Teaming

ICIT Securing AI: Addressing the OWASP Top 10 for Large Language Model Applications — TOP 10 insights

Cisco The state of AI Security 2025 Annual report — Top 10 insights

Towards Secure AI Week 20 — Identity, Jailbreaks, and the Future of Agentic AI Security

Prompt Injection Risks Interview: Are AIs Ready to Defend Themselves? Conversation with ChatGPT, Claude, Grok & Deepseek