Trusted AI Blog

December 4, 2024

99

Towards Secure AI Week 48 – Biggest AI Security Bug Bounty

Artificial Intelligence Vulnerability Scoring System (AIVSS) GitHub The AI Vulnerability Scoring System (AIVSS) has been proposed as a framework designed to evaluate vulnerabilities in AI systems comprehensively. Unlike static models, AIVSS incorporates dynamic metrics tailored to AI, including model robustness, data sensitivity, ethical impact, and adaptability, alongside traditional security considerations. ...

November 24, 2024

87

Secure AI Weekly admin

Towards Secure AI Week 47 – New OWASP Top 10 for LLMs

OWASP Reveals Updated 2025 Top 10 Risks for LLMs, Announces New LLM Project Sponsorship Program and Inaugural Sponsors OWASP, November 17, 2024 The OWASP Foundation has unveiled a refreshed OWASP Top 10 for LLM Applications and Generative AI Project, emphasizing the need for robust security in the development, deployment, and ...

November 18, 2024

100

Secure AI Weekly admin

Towards Secure AI Week 46 – Hacking LLM Robots

It’s Surprisingly Easy to Jailbreak LLM-Driven Robots Researchers induced bots to ignore their safeguards without exception IEEE Spectrum, November 11, 2024 The rapid integration of large language models (LLMs) like ChatGPT into robotics has revolutionized how robots interact with humans, offering capabilities such as voice-activated commands and task execution based ...

November 12, 2024

115

Secure AI Weekly admin

Towards Secure AI Week 45 – AI Safety Through Testing, Legislation, and Talent Building

Microsoft’s Yonatan Zunger on Red Teaming Generative AI The Cyber Wire, November 6, 2024 In a recent Microsoft Threat Intelligence Podcast episode, host Sherrod DeGrippo speaks with Yonatan Zunger, Corporate Vice President of AI Safety and Security at Microsoft, to explore the critical importance of securing AI systems. The conversation ...

November 6, 2024

114

Secure AI Weekly admin

Towards Secure AI Week 44 – From Open-Source AI Risks to National Policies

Researchers Uncover Vulnerabilities in Open-Source AI and ML Models The Hacker News, October 29, 2024 Recent disclosures have highlighted over thirty security vulnerabilities within various open-source artificial intelligence (AI) and machine learning (ML) models, some of which could allow for remote code execution and unauthorized data access. Key flaws have ...

October 30, 2024

97

Secure AI Weekly admin

Towards Secure AI Week 43 – New Tools and AI incidents

SAIF Risk Assessment: A new tool to help secure AI systems across industry Google Blog, October 24, 2024 In recent years, the Secure AI Framework (SAIF) was developed to promote the safe and responsible deployment of AI models. Designed to support developers and security professionals, SAIF provides best practices and ...

October 23, 2024

125

Secure AI Weekly admin

Towards Secure AI Week 42 – New Jailbreaks and Incidents

LLMs are easier to jailbreak using keywords from marginalized groups, study finds The Decoder, October 20, 2024 A recent study highlights unintended vulnerabilities in the safety protocols of large language models (LLMs), revealing that well-meaning ethical measures can introduce security gaps. Researchers found that the ease with which these models ...

October 16, 2024

117

Secure AI Weekly admin

Towards Secure AI Week 41 – AI Security Skills Shortage

How to enable secure use of AI The Register, October 10, 2024 The UK National Cyber Security Centre (NCSC) highlights several areas where AI can be exploited, but organizations need practical solutions that enable them to adopt AI safely and responsibly. This is where the SANS AI Toolkit comes in, ...

October 9, 2024

94

Secure AI Weekly admin

Towards Secure AI Week 40 – What You Need to Know About the Risks

California Governor Vetoes AI Regulation Bill, Calls for More Targeted Approach Campus Technology, September 30, 2024 California Governor Gavin Newsom has vetoed Senate Bill 1047, a proposed law designed to regulate AI and prevent its misuse. The bill, which had received strong legislative support, aimed to establish some of the ...