Trusted AI Blog

280 Results / Page 2 of 32

todayMarch 5, 2024

  • 73
close

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 9 –  BEAST Jailbreak and AI Security Predictions 2024

Cyber Insights 2024: Artificial Intelligence Security Week, February 26, 2024 In the ever-evolving landscape of AI within cybersecurity, 2024 brings forth profound insights from Mr. Alex Polyakov, CEO and co-founder of Adversa AI. Polyakov highlights the expanding threat landscape, citing instances such as the jailbreak of Chevrolet’s Chatbot and data ...

todayFebruary 26, 2024

  • 106
close

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 8 –  FS-ISAC AI Risk Guides

Google Gemini “Diverse” Prompt Injection Know Your Meme, February 22, 2024 This scrutiny emphasizes the necessity for a steadfast commitment to Quality and Robustness testing before releasing AI in production. The crux of the controversy emerged on February 9th, 2024, when a Reddit user expressed dissatisfaction with Gemini’s seeming inability ...

todayFebruary 22, 2024

  • 101
close

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 7 –  New book in GenAI Security

DARPA and IBM are ensuring that anyone can protect their AI systems from hackers IBM, February 7, 2024 Collaborating with DARPA’s Guaranteeing AI Robustness Against Deception (GARD) project, IBM has been at the forefront of addressing these challenges, particularly through the development of the Adversarial Robustness Toolbox (ART). Beyond military ...

todayFebruary 8, 2024

  • 65
close

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 5 –  Threat of Prompt Injection Looms Large

How to detect poisoned data in machine learning datasets VentureBeat, February 4, 2024 Data poisoning in machine learning datasets poses a significant threat, allowing attackers to manipulate model behavior intentionally. Proactive detection efforts are crucial to safeguarding against this threat. Data poisoning involves maliciously tampering with datasets to mislead machine ...

todayFebruary 6, 2024

  • 211
close

Trusted AI Blog + LLM Security admin

LLM Security Digest: TOP Security Platforms, Incidents, Developer Guides, Threat Models and Hacking Games   

Welcome to the latest edition of our LLM Security Digest!  We explore the dynamic landscape of LLM Security Platforms, innovative real-world incidents, and cutting-edge research that shape the field of LLM security. From adversarial AI attacks to the challenges of securing foundational models, we bring you insights, debates, and practical ...

todayJanuary 31, 2024

  • 131
close

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 4 – Thousands ChatGPT jailbreaks for sale

Top 4 LLM threats to the enterprise CSO Online, January 22, 2024 The intersection of natural language prompts and training sources poses unique threats, including prompt injection, prompt extraction, phishing schemes, and the poisoning of models. Traditional security tools find it challenging to keep pace with these dynamic risks, necessitating ...

todayJanuary 25, 2024

  • 117
close

Trusted AI Blog + LLM Security admin

LLM Security Digest: Jailbreaks, Red Teaming, CISO Guides, Incidents and Jobs

Here’s the top LLM security publications collected in one place for you. This digest provides insights into various aspects of Large Language Model (LLM) security. It covers a range of topics, from checklists for LLM Security and incidents involving vulnerabilities in chatbots to real-world attacks and initiatives by the Cloud ...

todayJanuary 24, 2024

  • 89
close

Secure AI Weekly + Trusted AI Blog admin

Towards Secure AI Week 3 – DPD AI Chatbot incident

A CISO’s perspective on how to understand and address AI risk SCMedia, January 16, 2024 The adoption of AI in enterprises introduces significant risks that span technical, reputational, regulatory, and operational dimensions. From supply chain vulnerabilities to the potential theft of sensitive data, the stakes are high, demanding a proactive ...