admin

401 Results / Page 4 of 45

NIST FMF AI 100-2 2025

todayMarch 31, 2025

  • 351
close

Review + Adversarial ML admin

NIST AI 100-2 E2025 Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigations

NIST’s New AML Taxonomy: Key Changes in AI Security Guidelines (2023 vs. 2025) In an ever-evolving landscape of AI threats and vulnerabilities, staying ahead means staying updated. The National Institute of Standards and Technology (NIST) recently published a crucial update to its cornerstone document, “Adversarial Machine Learning: A Taxonomy and ...

todayMarch 18, 2025

  • 41
close

Secure AI Weekly admin

Towards Secure AI Week 10 – Lessons from Siri Delays

Apple may have delayed the Siri upgrade for fear of jailbreaks GSMArena, March 10, 2025 Apple’s decision to delay its planned AI enhancements for Siri highlights the growing security concerns surrounding artificial intelligence, particularly the risk of “jailbreaking” through prompt injections. These attacks involve manipulating AI models into performing unintended ...

todayMarch 11, 2025

  • 73
close

Secure AI Weekly admin

Towards Secure AI Week 9 – Exploiting AI Weaknesses

Researchers Jailbreak 17 Popular LLM Models to Reveal Sensitive Data GBHackers, March 7, 2025 Researchers from Palo Alto Networks’ Threat Research Center have discovered that 17 popular generative AI (GenAI) applications are vulnerable to jailbreaking techniques, allowing users to bypass safety protocols. By using both single-turn and multi-turn strategies, attackers ...

todayMarch 4, 2025

  • 113
close

Secure AI Weekly admin

Towards Secure AI Week 8 – Massive AI security breach

MITRE Releases OCCULT Framework to Address AI Security Challenges GBHackers, February 26, 2025 MITRE has launched the Offensive Cyber Capability Unified LLM Testing (OCCULT) framework, a structured approach designed to assess the potential security risks of large language models (LLMs) in cyberattacks. This initiative focuses on evaluating AI systems’ ability ...

todayFebruary 25, 2025

  • 147
close

Secure AI Weekly admin

Towards Secure AI Week 7 – OWASP for Agentic AI and more

Agentic AI – Threats and Mitigations OWASP, February 17, 2025 Agentic AI, driven by large language models (LLMs) and generative AI, is advancing rapidly, offering new capabilities while introducing significant security risks. These autonomous systems can plan, adapt, and interact with external environments, making them powerful but also susceptible to ...

Grok 3 AI Red Teaming

todayFebruary 18, 2025

  • 18258
  • 1
close

Research + LLM Security admin

Grok 3 Jailbreak and AI red Teaming

Grok 3 Jailbreak and AI Red Teaming In this article, we will demonstrate  how Grok 3 respond to different hacking  techniques including Jailbreaks and Prompt leaking attacks. Our initial study on AI Red Teaming different LLM Models using various approaches focused on LLM models released before the so-called “Reasoning Revolution”, ...

todayFebruary 17, 2025

  • 101
close

Secure AI Weekly admin

Towards Secure AI Week 6 – New AI Security Framework

Announcing the Databricks AI Security Framework 2.0 DataBricks, February 12, 2025 Databricks has unveiled the second edition of its AI Security Framework (DASF 2.0), a comprehensive guide designed to address the growing risks associated with AI deployments. The framework identifies 62 technical AI risks and introduces 64 mitigation controls, offering ...