Secure AI Weekly

224 Results / Page 2 of 25

todayOctober 23, 2024

  • 96
close

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 42 – New Jailbreaks and Incidents

LLMs are easier to jailbreak using keywords from marginalized groups, study finds The Decoder, October 20, 2024 A recent study highlights unintended vulnerabilities in the safety protocols of large language models (LLMs), revealing that well-meaning ethical measures can introduce security gaps. Researchers found that the ease with which these models ...

todaySeptember 17, 2024

  • 55
close

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 37 – Global AI Security Frameworks Dubai, China

Governance framework promotes AI security China Daily, September 11, 2024 A new governance framework aimed at enhancing the security and safety of AI was introduced during China Cybersecurity Week in Guangzhou, Guangdong province. Announced by the National Technical Committee 260 on Cybersecurity of the Standardization Administration of China, the framework ...

todaySeptember 9, 2024

  • 101
close

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 36 – AI Security Guides from WDTA

Top five strategies from Meta’s CyberSecEval 3 to combat weaponized LLMs Venture Beat, September 3, 2024 Meta’s CyberSecEval 3 framework highlights the urgent need for comprehensive security measures as AI technologies, particularly large language models (LLMs), become more prevalent. The framework suggests five key strategies for safeguarding AI systems. These ...

todaySeptember 3, 2024

  • 85
close

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 35 – Latest GenAI hacking incidents: Slack, Copilot, GPT’s etc..

Hundreds of LLM Servers Expose Corporate, Health & Other Online Data DarkReading, August 28, 2024 Recent discoveries have highlighted a troubling issue: hundreds of LLM servers are inadvertently exposing sensitive corporate, healthcare, and personal data online due to misconfigurations and insufficient security measures.  These servers, often left unprotected by adequate ...