Secure AI Weekly

September 14, 2023

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 37 – Why AI TRiSM is Essential

Tackling Trust, Risk and Security in AI Models (Gartner, September 5, 2023). The surge of interest in generative AI technologies has led to a plethora of pilot projects, but what often falls by the wayside is a robust risk assessment. Organizations frequently don’t consider safety and security implications until their ...

September 5, 2023

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 36 – The Critical Quest for Secure and Reliable AI Systems

UK cybersecurity agency warns of chatbot ‘prompt injection’ attacks (The Guardian, August 30, 2023). The United Kingdom’s National Cyber Security Centre (NCSC) has recently raised alarms about the escalating cybersecurity threats surrounding chatbots. These automated conversational agents, powered by large language models (LLMs) like OpenAI’s ChatGPT and Google’s Bard, are ...

September 1, 2023

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 35 – The Achilles’ Heel of AI

Tricks for making AI chatbots break rules are freely available online (New Scientist, August 21, 2023). Artificial intelligence chatbots like ChatGPT have become essential tools for various online activities, but their security loopholes present an emerging concern. Manipulative text prompts, often referred to as “jailbreak prompts,” can mislead these AI systems ...

August 25, 2023

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 34 – Defcon AI Red Teaming wrap-ups and the Quest for AI Security

Don’t expect quick fixes in ‘red-teaming’ of AI models. Security was an afterthought (AP News, August 14, 2023). The recent DefCon hacker conference in Las Vegas served as a stark reminder of the pressing concerns around AI safety and security. The event saw 2,200 participants rigorously testing eight advanced language models, ...

August 13, 2023

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 33 – AI Security Takes Center Stage

Meet the hackers who are trying to make AI go rogue (Washington Post, August 8, 2023). With the White House’s endorsement, the Generative Red Team Challenge aims to rigorously assess the reliability of AI. This includes examining the potential for political misinformation, inherent biases, and even defamatory outputs. Companies like ...

August 7, 2023

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 32 – Navigating the Future of Cyber Resilience

The generative A.I. battle between companies and hackers is starting (CNBC, August 2, 2023). Last month, tech titans like Amazon, Google, Meta, and Microsoft collaborated with President Joe Biden, emphasizing their commitment to ensure that AI technologies undergo rigorous safety checks before public deployment. The primary concern is the role ...

August 3, 2023

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 31 – New LLM Jailbreak, Plugin hacks and more

ChatGPT Has a Plugin Problem (Wired, July 25, 2023). Over the past eight months, OpenAI’s ChatGPT has dazzled millions with its ability to produce lifelike text, from stories to code. However, the development and rapid proliferation of plugins to extend ChatGPT’s capabilities have raised serious security concerns. The introduction of ...

July 29, 2023

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 30 – Global Initiatives to Enhance AI Cybersecurity

FACT SHEET: Biden-Harris Administration Secures Voluntary Commitments from Leading Artificial Intelligence Companies to Manage the Risks Posed by AI (White House, July 21, 2023). The Biden-Harris Administration has underscored its commitment to harness the transformative potential of Artificial Intelligence (AI), while simultaneously ensuring its responsible and secure use. Taking decisive ...

July 21, 2023

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 29 – Challenges of Enterprise LLM Adoption

An AI detector mislabeled nearly every essay written by a non-native English speaker as being written by a bot (Insider, July 13, 2023). Safety and security issues of AI systems are under increasing scrutiny, as Stanford University research reveals that AI detection tools are incorrectly identifying essays written by non-native ...
