Towards Secure AI Week 21 – EU AI Act Revolution

Secure AI Weekly + Trusted AI Blog admin todayMay 27, 2024 29

Background
share close

World’s first major law for artificial intelligence gets final EU green light

CNBC, May 21, 2024

The European Union has officially passed the world’s first comprehensive law regulating artificial intelligence, marking a significant milestone in the realm of AI safety and security. The newly approved Artificial Intelligence Act introduces a risk-based framework that classifies AI systems based on their potential for harm. This law aims to ensure that high-risk AI applications, such as those used in critical infrastructure, law enforcement, and employment, adhere to strict compliance standards to protect fundamental rights, enhance transparency, and prevent misuse​.

In addition to stringent oversight of high-risk AI, the legislation outright bans certain AI practices considered too dangerous, like government social scoring and real-time biometric surveillance in public areas. By setting these legal precedents, the EU hopes to inspire similar regulatory measures worldwide, fostering a global environment where AI is developed and used responsibly and ethically​.

AI chatbots’ safeguards can be easily bypassed, say UK researchers

The Guardian, May 20, 2024

UK researchers have raised alarms about the security of AI chatbots, revealing that existing safeguards can be easily bypassed. A recent study by a prominent UK university demonstrated how protective measures in popular AI chatbots could be circumvented, leading to potential misuse and harmful consequences.

The study identified multiple vulnerabilities in the design and implementation of these chatbots. Despite efforts to filter inappropriate content and prevent misuse, the researchers discovered that simple manipulations and exploits could override these safeguards. This presents significant risks for user safety and data security, as malicious actors could exploit chatbots to spread harmful information or engage in unethical activities.

Google SAIF in Focus: 5 AI Security Risks and SAIF Mitigations

Cloud Security Podcast by Google, May 21, 2024

In a recent discussion on EP173 SAIF, Shan Rao, Group Product Manager at Google, shed light on the imperative of securing artificial intelligence (AI) in cloud environments. Rao outlined five critical AI security risks and stressed the importance of adopting comprehensive frameworks like SAIF to effectively mitigate these challenges. Despite the unique nature of each risk, Rao highlighted the overlap in mitigation strategies, emphasizing the necessity of a unified security approach.

Moreover, Rao underscored the significance of integrating security measures early in the AI deployment process to enable rapid innovation while safeguarding against evolving threats. Looking ahead, Rao speculated on the future trajectory of AI security, questioning whether it will evolve as a standalone domain or remain integrated within broader security frameworks. Regardless of the future direction, he emphasized the interconnectedness of AI security with application, data, and cloud security, advocating for a unified strategy to protect AI in cloud environments.

OpenAI safety update

OpenAI, May 21, 2024

OpenAI has outlined their comprehensive approach to ensuring the security and safety of their AI models. They emphasize the importance of integrating safety measures from the development stage, continuously improving through practices like empirical testing, alignment research, and abuse monitoring. Key initiatives include protecting children, safeguarding election integrity, and collaborating with governments on AI policies. OpenAI is committed to evolving its security practices to counter sophisticated threats, highlighting the ongoing need for robust safety protocols in AI deployment.

How People Trick Generative Artificial Intelligence Chatbots into Exposing Company Secrets

AIThority, May 23, 2024

Generative AI chatbots, though highly sophisticated, face significant security and safety challenges due to their susceptibility to manipulation. Users have identified ways to trick these chatbots into bypassing their safety protocols, resulting in the production of inappropriate or harmful content. These exploits often involve cleverly crafted prompts or “jailbreaking” techniques that take advantage of vulnerabilities in the AI’s programming.

This ability to deceive AI chatbots highlights the crucial need for robust security measures in AI development. As these systems become increasingly integrated into various applications, ensuring their resilience against malicious misuse is essential. Developers must continuously enhance and adapt security protocols to protect against these vulnerabilities. These findings emphasize the ongoing challenges in AI security and the necessity for vigilant, adaptive defenses to safeguard users and maintain the integrity of AI technologies.

Beware – Your Customer Chatbot is Almost Certainly Insecure: Report

Security Week, May 24, 2024

The proliferation of customer chatbots, driven by versatile gen-AI engines, presents a burgeoning security challenge. Despite their ease of development, these chatbots often lack robust defenses against prompt manipulation, as highlighted by recent incidents and studies. For instance, a January 2024 case involving DPD’s chatbot showcased vulnerabilities that allowed unauthorized access and inappropriate behavior. Immersive Labs’ comprehensive study further underscores the need for heightened security measures, revealing that even sophisticated chatbots are susceptible to prompt engineering attacks. This gap in defense mechanisms not only risks reputational damage but also opens avenues for the theft of sensitive corporate data. As AI technologies continue to advance, prioritizing security measures becomes imperative to mitigate potential risks and safeguard user information.

 

Subscribe for updates

Stay up to date with what is happening! Get a first look at news, noteworthy research and worst attacks on AI delivered right in your inbox.

    Written by: admin

    Rate it
    Previous post