Secure AI Weekly

235 Results / Page 9 of 27

todayOctober 20, 2023

  • 77
close

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 42 – Multi-modal prompt injections again!

AI safety guardrails easily thwarted, security study finds The Register, October 12, 2023 Models, such as OpenAI’s GPT-3.5 Turbo, were designed with built-in safety measures to prevent the generation of harmful or toxic content. However, recent research has shed light on the vulnerability of these safeguards, revealing that they may ...

todayOctober 5, 2023

  • 71
close

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 40 – Job of the Week: Head of Generative AI Security

Malicious Actors Exploiting AI Chatbot Jailbreaking Tips Security Boulevard, September 27, 2023 Recent developments in the world of AI have raised concerns about the security and safety of these advanced systems. Malicious actors have been collaborating to breach the ethical and safety boundaries placed around AI chatbots like ChatGPT. This ...

todaySeptember 27, 2023

  • 253
close

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 39 – Open AI Red Teaming & The rise of Secure AI Startups

OpenAI Red Teaming Network OpenAI, September 19, 2023 Finally, OpenAI launched the OpenAI Red Teaming Network, a pivotal initiative designed to bolster the safety and security of our AI models. This venture welcomes experts from a myriad of fields to collaborate, utilizing their diverse insights for the thorough evaluation and ...

todaySeptember 5, 2023

  • 86
close

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 36 – The Critical Quest for Secure and Reliable AI Systems

UK cybersecurity agency warns of chatbot ‘prompt injection’ attacks The Guardian, August 30, 2023 The United Kingdom’s National Cyber Security Centre (NCSC) has recently raised alarms about the escalating cybersecurity threats surrounding chatbots. These automated conversational agents, powered by large language models (LLMs) like OpenAI’s ChatGPT and Google’s Bard, are ...

todaySeptember 1, 2023

  • 137
close

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 35 – The Achilles’ Heel of AI

Tricks for making AI chatbots break rules are freely available online NewScientist, August 21, 2023 Artificial intelligence chatbots like ChatGPT have become essential tools for various online activities, but their security loopholes present an emerging concern. Manipulative text prompts, often referred to as “jailbreak prompts,” can mislead these AI systems ...

todayAugust 25, 2023

  • 155
close

Secure AI Weekly + Trusted AI Blog admin

Towards Trusted AI Week 34 – Defcon AI Red Teaming wrap-ups and the Quest for AI Security

Don’t expect quick fixes in ‘red-teaming’ of AI models. Security was an afterthought APNews, August 14, 2023 The recent DefCon hacker conference in Las Vegas served as a stark reminder of the pressing concerns around AI safety and security. The event saw 2,200 participants rigorously testing eight advanced language models, ...