Trusted AI Blog

February 5, 2025

79

Towards Secure AI Week 4 – DeepSeek’s AI Security Failures

Wiz Research Uncovers Exposed DeepSeek Database Leaking Sensitive Information, Including Chat History Wiz, January 29, 2025 A recent security lapse in AI infrastructure has underscored the critical need for stronger protections in artificial intelligence systems. Wiz Research uncovered an unprotected ClickHouse database belonging to DeepSeek, a Chinese AI startup known ...

February 4, 2025

5386

Articles + LLM Security admin

AI Red Teaming Reasoning LLM US vs China: Jailbreak Deepseek, Qwen, O1, O3, Claude, Kimi

Warning, Some of the examples may be harmful!: The authors of this article show LLM Red Teaming and hacking techniques but have no intention to endorse or support any recommendations made by AI Chatbots discussed in this post. The sole purpose of this article is to provide educational information and ...

January 31, 2025

17640

Articles + LLM Security admin

DeepSeek Jailbreak’s

Subscribe for the latest LLM Security and AI Red Teaming news: Jailbreaks Attacks, Defenses, Frameworks, CISO guides, VC Reviews, Policies and more. Deepseek Jailbreak’s In this article, we will demonstrate how DeepSeek respond to different jailbreak techniques. Our initial study on AI Red Teaming different LLM Models using various aproaches focused ...

January 31, 2025

98

Secure AI Weekly + Digests admin

Towards Secure AI Week 3 – OWASP Guidelines and Risk Reduction Strategies

AI Security Among Top Priorities for Cybersecurity Leaders Channel Futures, January 24, 2025 A recent report from Info-Tech Research Group outlines key security priorities necessary to mitigate emerging risks while harnessing AI’s potential for strengthening cybersecurity defenses. These priorities include establishing AI governance frameworks to manage security and privacy risks, ...

December 12, 2024

27

Company Updates + Press Releases admin

Adversa AI’s Red Teaming Platform Recognized in OECD’s Catalogue of Tools & Metrics for Trustworthy AI

December 12, 2024 – Adversa AI is proud to announce that its AI Red Teaming Platform has been included in the OECD prestigious Catalogue of Tools & Metrics for Trustworthy AI. OECD.AI helps countries and shape trustworthy AI with the OECD AI Principles. It gives access to 900+ national AI ...

December 4, 2024

79

Secure AI Weekly + Digests admin

Towards Secure AI Week 48 – Biggest AI Security Bug Bounty

Artificial Intelligence Vulnerability Scoring System (AIVSS) GitHub The AI Vulnerability Scoring System (AIVSS) has been proposed as a framework designed to evaluate vulnerabilities in AI systems comprehensively. Unlike static models, AIVSS incorporates dynamic metrics tailored to AI, including model robustness, data sensitivity, ethical impact, and adaptability, alongside traditional security considerations. ...

November 24, 2024

79

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 47 – New OWASP Top 10 for LLMs

OWASP Reveals Updated 2025 Top 10 Risks for LLMs, Announces New LLM Project Sponsorship Program and Inaugural Sponsors OWASP, November 17, 2024 The OWASP Foundation has unveiled a refreshed OWASP Top 10 for LLM Applications and Generative AI Project, emphasizing the need for robust security in the development, deployment, and ...

November 18, 2024

85

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 46 – Hacking LLM Robots

It’s Surprisingly Easy to Jailbreak LLM-Driven Robots Researchers induced bots to ignore their safeguards without exception IEEE Spectrum, November 11, 2024 The rapid integration of large language models (LLMs) like ChatGPT into robotics has revolutionized how robots interact with humans, offering capabilities such as voice-activated commands and task execution based ...

November 12, 2024

100

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 45 – AI Safety Through Testing, Legislation, and Talent Building

Microsoft’s Yonatan Zunger on Red Teaming Generative AI The Cyber Wire, November 6, 2024 In a recent Microsoft Threat Intelligence Podcast episode, host Sherrod DeGrippo speaks with Yonatan Zunger, Corporate Vice President of AI Safety and Security at Microsoft, to explore the critical importance of securing AI systems. The conversation ...

Towards Secure AI Week 4 – DeepSeek’s AI Security Failures

AI Red Teaming Reasoning LLM US vs China: Jailbreak Deepseek, Qwen, O1, O3, Claude, Kimi

DeepSeek Jailbreak’s

Towards Secure AI Week 3 – OWASP Guidelines and Risk Reduction Strategies

Adversa AI’s Red Teaming Platform Recognized in OECD’s Catalogue of Tools & Metrics for Trustworthy AI

Towards Secure AI Week 48 – Biggest AI Security Bug Bounty

Towards Secure AI Week 47 – New OWASP Top 10 for LLMs

Towards Secure AI Week 46 – Hacking LLM Robots

Towards Secure AI Week 45 – AI Safety Through Testing, Legislation, and Talent Building

Trusted AI Security

Explore Our Blog

Featured Post

Universal LLM Jailbreak: ChatGPT, GPT-4, BARD, BING, Anthropic, and Beyond

Latest Posts

Adversa AI was selected as TOP #6 AI blog in Israel by FeedSpot

MCP Security Digest — June 2025

Agentic AI Red Teaming Interview: Can Autonomous Agents Handle Adversarial Testing? Conversation with ChatGPT, Claude, Grok & Deepseek

Towards Secure AI Week 22 — Testing the Limits of Guardrails and Autonomy

CSA’s Agentic AI Red Teaming Guide: 10 Quick Insights You Can’t Afford to Ignore

Adversa AI Agentic AI Red Teaming Platform Wins Leading Cybersecurity solution in AI at Fortress Cybersecurity Awards

Top 12 Security Issues in Model Context Protocol (MCP) and How to Fix Them

Towards Secure AI Week 21 — From Reactive Defense to Capability-Aware AI Red Teaming

ICIT Securing AI: Addressing the OWASP Top 10 for Large Language Model Applications — TOP 10 insights

Cisco The state of AI Security 2025 Annual report — Top 10 insights

Towards Secure AI Week 20 — Identity, Jailbreaks, and the Future of Agentic AI Security

Prompt Injection Risks Interview: Are AIs Ready to Defend Themselves? Conversation with ChatGPT, Claude, Grok & Deepseek