Cisco The state of AI Security 2025 Annual report — Top 10 insights

Articles + GenAI Security ADMIN May 27, 2025 89

AI Is Eating the Enterprise — But the Enterprise Is on the Menu Too.

Seventy-two percent of organisations already embed AI, yet only 13 percent feel truly ready. Attackers know that gap and are rushing to weaponise it. Cisco’s latest report reads like a flight-recorder transcript from the future of cyber-warfare. Below are the ten insights every CISO, director and hands-on engineer should tape to the wall today.

1. Secure AI Infrastructure Before You Secure the Model

“Your model can’t hallucinate if your GPU cluster is already owned.” Share on X

Adversaries aren’t starting with the model — they’re targeting the foundational infrastructure that powers AI systems. Container environments like Docker and Kubernetes, orchestration layers, and GPU clusters are all under attack. If an attacker gains access to a distributed framework like Ray, they can hijack training processes, extract sensitive model weights, or even run cryptomining scripts. AI security must begin below the model layer to prevent lateral movement and infrastructure hijack.

Takeaways:

Use workload identity (SPIFFE/SPIRE) for model-serving containers.
Harden nodes with AppArmor, seccomp, and disable unnecessary syscalls.
Implement runtime anomaly detection for training environments.

2. Detect and Prevent AI Supply Chain Attacks Like “Sleepy Pickle”

“What Log4j did to Java, poisoned models will do to AI.” Share on X

AI models can be silently compromised through backdoored artifacts such as PyTorch .pt or Pickle .pkl files. These supply chain attacks embed payloads that activate only after the model is deployed to production — often undetected during scanning. Once active, they allow remote code execution, data leakage, or command injection. These threats mirror past software supply chain compromises but operate with new stealth in ML pipelines.

Takeaways:

Treat model artefacts like packages—pin SHAs.
Always scan .pt, .pkl, .onnx files with static analyzers.
Use ML-focused SBOMs: record hashes, authors, training data sources.

3. Prompt Injection Still Works, Even on GPT-4-Class LLMs

“LLMs forget their morals faster than users forget semi-colons.” Share on X

Despite reinforcement learning with human feedback (RLHF), foundational models remain vulnerable to simple prompt injection attacks. Attackers continue to bypass guardrails using clever text manipulations like “Ignore all previous instructions…” This persists even on state-of-the-art systems, demonstrating how limited current alignment techniques are when facing user-controlled input channels.

Takeaways:

Stack rule-based filters after the model, not before (post-generation audits).
Use dual-model self-debiasing: ask a smaller guard model to critique the larger one.
Log every system-prompt diff; pipe anomalies to SIEM for AI red-team replay.

4. Defend Against Indirect Prompt Injection via Untrusted Inputs

“One poisoned PDF in the context window and your chatbot sings NDAs.” Share on X

Indirect prompt injection occurs when malicious content embedded in user-uploaded documents — like PDFs or HTML — is parsed into a model’s context window. This gives adversaries access to the prompt space without direct interaction. It’s the AI equivalent of drive-by malware, weaponizing passive content ingestion to issue hidden instructions to your chatbot or assistant.

Takeaways:

Strip HTML/markdown before vectorising external docs.
Apply regex-based allow-lists (e.g., ban substrings like SYSTEM: or Unicode homoglyphs).
Validate chunk entropy in RAG pipelines to detect low-entropy hidden prompts.

5. Combat Advanced Jailbreaks Through Token Smuggling and Context Pollution

“Attackers don’t break rules; they smuggle them past the bouncers.” Share on X

Sophisticated jailbreak techniques exploit encoding tricks — like zero-width characters — or hide malicious instructions deep in long input chains. These methods bypass content filters by exploiting how LLMs prioritize context or interpret semantically identical characters, leading to successful bypasses of safety alignment.

Takeaways:

Normalise Unicode and strip zero-width characters pre-prompt.
Run sliding-window embeddings to spot semantic drift.
Red-team your LLMs with generative fuzzing frameworks.

6. Prevent Training Data Extraction Through Probing Attacks

“If your model saw it, assume an adversary can too.” Share on X

Large Language Models (LLMs) can regurgitate exact snippets of their training data through prompt engineering. Attackers use recursive querying and statistical techniques to elicit memorized text, potentially exposing sensitive IP or PII from private datasets. If proprietary or licensed data was used in training, this becomes a critical security and legal risk.

Takeaways:

Enable per-sample differential privacy.
Watermark training data; use canary phrases to detect leakage.
Rate-limit similarity queries; auto-ban IPs with high-entropy, low-diversity probing patterns.

7. Detect Web-Scale Data Poisoning in Public Training Pipelines

“One pixel change in 10 million images can rewrite your model’s worldview.” Share on X

Threat actors inject malicious examples into publicly scraped training data to create latent backdoors. These backdoors remain dormant until triggered by a specific input, allowing attackers to hijack model behavior post-deployment. Poisoning is subtle and easily missed in large datasets, making detection especially hard without intentional defenses.

Takeaways:

Deploy Cleanlab or influence-based filtering before each epoch.
Scan training data for statistical outliers.
Use trusted data sources and validate label distributions.

8. Prepare for AI-Augmented Social Engineering Campaigns

“Your help-desk ticket just passed the Turing test — for evil.” Share on X

Generative AI is now routinely used to craft personalized phishing messages, clone voices, or generate convincing internal documents. These campaigns reduce attacker dwell time by automating reconnaissance and lowering detection barriers. Enterprise chat, support tickets, and customer service channels are all vulnerable.

Takeaways:

Generate adversarial-style phishing emails internally; train staff with “confusion” scoring.
Deploy AI-based phishing detection tuned for GenAI lures.
Integrate behavioral analysis tools for real-time comms (Slack, Teams).

9. Anticipate Regulatory Enforcement: EO 14110, EU AI Act & Beyond

“Compliance is the new attack surface.” Share on X

With over 300 AI-related bills introduced in 2024, global regulators are moving fast. The 2025 horizon includes mandatory AI SBOMs, red teaming, explainability, and risk registries. Failing to prepare is no longer a technical oversight — it’s a legal and reputational vulnerability.

Takeaways:

Map every AI asset to NIST AI RMF functions (Govern, Map, Measure, Manage).
Build an internal “Model BOM” JSON schema (weights hash, training data license, eval scores).
Maintain an AI Risk Register with real-time updates.

10. Shift to Full-Stack AI Security Platforms, Not Point Solutions

“Point tools patch holes; platforms raise the waterline.” Share on X

Cisco’s new AI security suite integrates telemetry, scanning, and runtime defenses into a single mesh. This marks a transition away from isolated fixes toward comprehensive, lifecycle-oriented security — covering models, infrastructure, data, and prompts. The future of AI defense lies in platform thinking.

Takeaways:

Adopt AI-aware XDR with LLM threat detection.
Connect prompt firewalls to SIEM/SOAR pipelines.
Simulate attacks in purple-team drills with real-time telemetry replay.

Securing AI From Prompt to Production: A Leadership Imperative

The battle for AI security has already begun. Enterprises aren’t just adopting AI — they’re deploying it into production faster than most security teams can assess the risks. Meanwhile, attackers are learning the anatomy of LLM pipelines, regulators are raising the bar, and every model becomes a new target.

The ten lessons above reveal a deeper truth: AI security isn’t a subset of cybersecurity — it’s a new discipline. Protecting the infrastructure, vetting the supply chain, managing model behavior, and preparing for regulatory scrutiny all require focused effort. What used to be “cutting-edge” is now just “baseline.”

This is where platforms like Adversa AI come in. We help you operationalize security at every layer of your AI systems — from adversarial red teaming to continuous monitoring, from regulatory compliance to prompt injection defenses. Instead of chasing the next jailbreak or patching blind spots, you can finally start building secure AI systems by design.

Because when AI becomes the business engine — security becomes the business enabler.

Subscribe for updates

Stay up to date with what is happening! Get a first look at news, noteworthy research and worst attacks on AI delivered right in your inbox.

Written by: ADMIN

Rate it

May 26, 2025

Secure AI Weekly ADMIN

Towards Secure AI Week 20 — Identity, Jailbreaks, and the Future of Agentic AI Security

This week’s stories highlight the rapid emergence of new threats and defenses in the Agentic AI landscape. From OWASP’s DNS-inspired Agent Name Service (ANS) for verifying AI identities to real-world ...

Cisco The state of AI Security 2025 Annual report — Top 10 insights

1. Secure AI Infrastructure Before You Secure the Model

Takeaways:

2. Detect and Prevent AI Supply Chain Attacks Like “Sleepy Pickle”

Takeaways:

3. Prompt Injection Still Works, Even on GPT-4-Class LLMs

Takeaways:

4. Defend Against Indirect Prompt Injection via Untrusted Inputs

Takeaways:

5. Combat Advanced Jailbreaks Through Token Smuggling and Context Pollution

Takeaways:

6. Prevent Training Data Extraction Through Probing Attacks

Takeaways:

7. Detect Web-Scale Data Poisoning in Public Training Pipelines

Takeaways:

8. Prepare for AI-Augmented Social Engineering Campaigns

Takeaways:

9. Anticipate Regulatory Enforcement: EO 14110, EU AI Act & Beyond

Takeaways:

10. Shift to Full-Stack AI Security Platforms, Not Point Solutions

Takeaways:

Securing AI From Prompt to Production: A Leadership Imperative

Subscribe for updates

Previous post

Towards Secure AI Week 20 — Identity, Jailbreaks, and the Future of Agentic AI Security

Similar posts

Agentic AI Red Teaming Interview: Can Autonomous Agents Handle Adversarial Testing? Conversation with ChatGPT, Claude, Grok & Deepseek

CSA’s Agentic AI Red Teaming Guide: 10 Quick Insights You Can’t Afford to Ignore