Digests

September 27, 2023

286

Towards Trusted AI Week 39 – Open AI Red Teaming & The rise of Secure AI Startups

OpenAI Red Teaming Network OpenAI, September 19, 2023 Finally, OpenAI launched the OpenAI Red Teaming Network, a pivotal initiative designed to bolster the safety and security of our AI models. This venture welcomes experts from a myriad of fields to collaborate, utilizing their diverse insights for the thorough evaluation and ...

September 21, 2023

51

Secure AI Weekly + Digests admin

Towards Trusted AI Week 38 – The Cybersecurity Dilemmas of AI

Comply or Die: The Rise of the AI Governance Stack Battery Ventures, September 13, 2023 While regulatory efforts are catching up, with the European Union leading the way and localized efforts in the United States filling the federal void, there is still much work to be done. As the race ...

September 18, 2023

196

Adversarial ML Digest admin

Secure AI Research papers: The Dark Corners of AI

With technology advances the ethical, security, and operational questions loom ever larger. From hijacked images that can control AI to camouflage techniques that can make vehicles invisible to sensors, the latest batch of research papers unveils some startling vulnerabilities in AI systems. Can anyone hack an AI model by just ...

September 14, 2023

139

Secure AI Weekly + Digests admin

Towards Trusted AI Week 37 – Why AI TRiSM is Essential

Tackling Trust, Risk and Security in AI Models Gartner, September 5, 2023 The surge of interest in generative AI technologies has led to a plethora of pilot projects, but what often falls by the wayside is a robust risk assessment. Organizations frequently don’t consider safety and security implications until their ...

September 5, 2023

90

Secure AI Weekly + Digests admin

Towards Trusted AI Week 36 – The Critical Quest for Secure and Reliable AI Systems

UK cybersecurity agency warns of chatbot ‘prompt injection’ attacks The Guardian, August 30, 2023 The United Kingdom’s National Cyber Security Centre (NCSC) has recently raised alarms about the escalating cybersecurity threats surrounding chatbots. These automated conversational agents, powered by large language models (LLMs) like OpenAI’s ChatGPT and Google’s Bard, are ...

September 4, 2023

194

LLM Security Digest admin

LLM Security and Prompt Engineering Digest: Top August events, guides, incidents, VC reviews and research papers

Welcome to a brief exploration into the fascinating world of AI security—a realm where innovation and danger intertwine like DNA strands. Dive in to learn how red teaming tests AI vulnerabilities, what Google recommends for AI security, the unforeseen risks of AI in everyday applications, and academic approaches to the ...

September 1, 2023

173

Secure AI Weekly + Digests admin

Towards Trusted AI Week 35 – The Achilles’ Heel of AI

Tricks for making AI chatbots break rules are freely available online NewScientist, August 21, 2023 Artificial intelligence chatbots like ChatGPT have become essential tools for various online activities, but their security loopholes present an emerging concern. Manipulative text prompts, often referred to as “jailbreak prompts,” can mislead these AI systems ...

August 25, 2023

173

Secure AI Weekly + Digests admin

Towards Trusted AI Week 34 – Defcon AI Red Teaming wrap-ups and the Quest for AI Security

Don’t expect quick fixes in ‘red-teaming’ of AI models. Security was an afterthought APNews, August 14, 2023 The recent DefCon hacker conference in Las Vegas served as a stark reminder of the pressing concerns around AI safety and security. The event saw 2,200 participants rigorously testing eight advanced language models, ...

August 13, 2023

150

Secure AI Weekly + Digests admin

Towards Trusted AI Week 33 – AI Security Takes Center Stage

Meet the hackers who are trying to make AI go rogue Washington Post, August 8, 2023 With the White House’s endorsement, the Generative Red Team Challenge aims to rigorously assess the reliability of AI. This includes examining the potential for political misinformation, inherent biases, and even defamatory outputs. Companies like ...

Towards Trusted AI Week 39 – Open AI Red Teaming & The rise of Secure AI Startups

Towards Trusted AI Week 38 – The Cybersecurity Dilemmas of AI

Secure AI Research papers: The Dark Corners of AI

Towards Trusted AI Week 37 – Why AI TRiSM is Essential

Towards Trusted AI Week 36 – The Critical Quest for Secure and Reliable AI Systems

LLM Security and Prompt Engineering Digest: Top August events, guides, incidents, VC reviews and research papers

Towards Trusted AI Week 35 – The Achilles’ Heel of AI

Towards Trusted AI Week 34 – Defcon AI Red Teaming wrap-ups and the Quest for AI Security

Towards Trusted AI Week 33 – AI Security Takes Center Stage

Trusted AI Security

Explore Our Blog

Featured Post

Universal LLM Jailbreak: ChatGPT, GPT-4, BARD, BING, Anthropic, and Beyond

Latest Posts

We built an AI agent that breaks AI defenses. It ranked top globally.

OpenClaw proved high-agency AI works. Now enterprises need a security strategy, not a ban

You have AI guardrails. Red teaming is how you know they’re working

The 9 attack surfaces your AI security vendor has never heard of

Top GenAI security resources — March 2026

Top MCP security resources — March 2026

Top Agentic AI security resources — March 2026

OpenClaw attacks: Seven real scenarios putting AI agents at risk

A practical guide to the OpenClaw threat model

From chatbots to digital workers: Managing the business risks of agentic AI

SecureClaw: How we mapped 5 AI security frameworks to protect OpenClaw and future autonomous agents in the enterprise

Adversa AI launches SecureClaw — a comprehensive open-source security solution for OpenClaw agents