Digests

September 27, 2023

Secure AI Weekly + Digests admin

Towards Trusted AI Week 39 – Open AI Red Teaming & The rise of Secure AI Startups

OpenAI Red Teaming Network (OpenAI, September 19, 2023): OpenAI has launched the OpenAI Red Teaming Network, an initiative designed to bolster the safety and security of its AI models. The network welcomes experts from many fields to collaborate, drawing on their diverse insights for the thorough evaluation and ...

September 18, 2023

Adversarial ML Digest admin

Secure AI Research papers: The Dark Corners of AI

As technology advances, the ethical, security, and operational questions loom ever larger. From hijacked images that can control AI to camouflage techniques that can make vehicles invisible to sensors, the latest batch of research papers unveils some startling vulnerabilities in AI systems. Can anyone hack an AI model by just ...

September 5, 2023

Secure AI Weekly + Digests admin

Towards Trusted AI Week 36 – The Critical Quest for Secure and Reliable AI Systems

UK cybersecurity agency warns of chatbot ‘prompt injection’ attacks (The Guardian, August 30, 2023): The United Kingdom’s National Cyber Security Centre (NCSC) has recently raised alarms about the escalating cybersecurity threats surrounding chatbots. These automated conversational agents, powered by large language models (LLMs) like OpenAI’s ChatGPT and Google’s Bard, are ...

September 4, 2023

LLM Security Digest admin

LLM Security and Prompt Engineering Digest: Top August events, guides, incidents, VC reviews and research papers

Welcome to a brief exploration into the fascinating world of AI security—a realm where innovation and danger intertwine like DNA strands. Dive in to learn how red teaming tests AI vulnerabilities, what Google recommends for AI security, the unforeseen risks of AI in everyday applications, and academic approaches to the ...

September 1, 2023

Secure AI Weekly + Digests admin

Towards Trusted AI Week 35 – The Achilles’ Heel of AI

Tricks for making AI chatbots break rules are freely available online (New Scientist, August 21, 2023): Artificial intelligence chatbots like ChatGPT have become essential tools for various online activities, but their security loopholes present an emerging concern. Manipulative text prompts, often referred to as “jailbreak prompts,” can mislead these AI systems ...

August 25, 2023

Secure AI Weekly + Digests admin

Towards Trusted AI Week 34 – Defcon AI Red Teaming wrap-ups and the Quest for AI Security

Don’t expect quick fixes in ‘red-teaming’ of AI models. Security was an afterthought (AP News, August 14, 2023): The recent DefCon hacker conference in Las Vegas served as a stark reminder of the pressing concerns around AI safety and security. The event saw 2,200 participants rigorously testing eight advanced language models, ...