Trusted AI Blog

504 Results / Page 17 of 56

Background

todayOctober 30, 2024

  • 97
close

Secure AI Weekly admin

Towards Secure AI Week 43 – New Tools and AI incidents

SAIF Risk Assessment: A new tool to help secure AI systems across industry Google Blog, October 24, 2024 In recent years, the Secure AI Framework (SAIF) was developed to promote the safe and responsible deployment of AI models. Designed to support developers and security professionals, SAIF provides best practices and ...

todayOctober 23, 2024

  • 125
close

Secure AI Weekly admin

Towards Secure AI Week 42 – New Jailbreaks and Incidents

LLMs are easier to jailbreak using keywords from marginalized groups, study finds The Decoder, October 20, 2024 A recent study highlights unintended vulnerabilities in the safety protocols of large language models (LLMs), revealing that well-meaning ethical measures can introduce security gaps. Researchers found that the ease with which these models ...

todayOctober 1, 2024

  • 90
close

Secure AI Weekly admin

Towards Secure AI Week 39 – False AI Memories

AI ‘godfather’ says OpenAI’s new model may be able to deceive and needs ‘much stronger safety tests’ Business Insider, September, 2024 Yoshua Bengio, the “Godfather of AI,” raises concerns about OpenAI’s new O1 model, warning it could deceive users and pose significant risks if not properly controlled. He advocates for ...

todaySeptember 17, 2024

  • 84
close

Secure AI Weekly admin

Towards Secure AI Week 37 – Global AI Security Frameworks Dubai, China

Governance framework promotes AI security China Daily, September 11, 2024 A new governance framework aimed at enhancing the security and safety of AI was introduced during China Cybersecurity Week in Guangzhou, Guangdong province. Announced by the National Technical Committee 260 on Cybersecurity of the Standardization Administration of China, the framework ...

todaySeptember 11, 2024

  • 260
close

GenAI Security + GenAI Security Digest admin

GenAI Security Top Digest: Slack and Apple Prompt Injections, threats of Microsoft Copilot, image attacks

This is the first-of-its-kind GenAI Security Top digest, originated from our world-first LLM Security Digest, providing an essential summary of the most critical vulnerabilities and threats to all Generative AI technologies from LLV and VLM to GenAI Copilots and GenAI infrastructure, along with expert strategies to protect your systems, ensuring ...

todaySeptember 9, 2024

  • 141
close

Secure AI Weekly admin

Towards Secure AI Week 36 – AI Security Guides from WDTA

Top five strategies from Meta’s CyberSecEval 3 to combat weaponized LLMs Venture Beat, September 3, 2024 Meta’s CyberSecEval 3 framework highlights the urgent need for comprehensive security measures as AI technologies, particularly large language models (LLMs), become more prevalent. The framework suggests five key strategies for safeguarding AI systems. These ...