Cybersecurity blog

OWASP Top 10 for LLMs Explained: Securing AI Applications

PCI SSC Qualified Security Assessor — CYBERSIGMA CONSULTING SERVICES LLP

QSA Authorized
CEMEA · Asia Pacific · USA

Our Offerings -PCI-DSS Audit,RBI/SEBI/IRDAI/Aadhar/NBFC & Housing Cybersecurity Audit,SOC1/2/3,GDPR,ISMS,ISO,Our Offerings -PCI-DSS Audit,RBI/SEBI/IRDAI/Aadhar/NBFC & Housing Cybersecurity Audit,SOC1/2/3,GDPR,ISMS,ISO,

OWASP Top 10 for LLMs Explained: Securing AI & GenAI Applications

Large Language Models (LLMs) and Generative AI are being built into products, customer support, internal tools and developer workflows at remarkable speed. But these systems introduce a new class of security risks that traditional application security testing was never designed to catch — a chatbot can be tricked into revealing confidential data, an AI agent can be manipulated into taking unauthorised actions, and a poisoned model from a public hub can quietly compromise an entire pipeline.

To help teams secure these systems, the Open Worldwide Application Security Project (OWASP) maintains the OWASP Top 10 for LLM Applications — the industry-standard list of the most critical security risks in LLM and Generative AI applications. This guide explains each of the 2025 risks in plain language, with practical defences you can apply today.

What is the OWASP Top 10 for LLM Applications?

The OWASP Top 10 for LLM Applications is a community-driven list of the ten most important security risks specific to applications that use Large Language Models. It gives developers, security teams and leaders a shared vocabulary for understanding how LLM-powered systems fail — and a prioritised starting point for testing and hardening them. It complements, rather than replaces, the classic OWASP Top 10 for web applications: you still need to secure the software around your AI, and now you also need to secure the model and its behaviour.

The OWASP Top 10 for LLMs (2025), Explained

LLM01: Prompt Injection

Prompt injection is the number-one LLM risk. An attacker crafts input that overrides the model's instructions — directly (typing a malicious prompt) or indirectly (hiding instructions inside a document, web page, email or tool output the model later reads). Successful prompt injection can make a model ignore its guardrails, leak data, or trigger harmful actions through connected tools.

LLM02: Sensitive Information Disclosure

LLMs can reveal information they should not — personal data (PII), credentials, proprietary business data, or other users' inputs. This happens through over-broad context, inadequate output filtering, or training on sensitive data. The result can be a privacy breach under DPDP, GDPR or sector regulation.

LLM03: Supply Chain

AI applications depend on third-party models, datasets, libraries and hosting platforms. A compromised or maliciously crafted model from a public hub, a tampered dataset, or a vulnerable dependency can backdoor your application. Provenance, integrity verification and vetting of AI components are essential.

LLM04: Data and Model Poisoning

If an attacker can influence training or fine-tuning data, they can introduce bias, backdoors or vulnerabilities into the model itself — a poisoning attack. This is especially dangerous when models are continuously fine-tuned on user-supplied or scraped data without validation.

LLM05: Improper Output Handling

Treating model output as trusted is dangerous. If an application passes LLM output straight into a browser, database, shell or downstream system without validation, it can lead to cross-site scripting (XSS), SQL injection, server-side request forgery (SSRF) or remote code execution. All model output should be treated as untrusted user input.

LLM06: Excessive Agency

As LLMs are given tools, plugins and the ability to act (AI agents), excessive agency becomes a serious risk: too much functionality, too many permissions, or too much autonomy means a manipulated model can take damaging, irreversible actions. Limit what agents can do, and require human approval for high-impact operations.

LLM07: System Prompt Leakage

The system prompt that configures an LLM often contains instructions, business logic or even secrets. If it can be extracted by an attacker, it exposes how the system works and may reveal credentials or bypass routes. Never rely on the system prompt to keep secrets, and test whether it can be leaked.

LLM08: Vector and Embedding Weaknesses

Retrieval-Augmented Generation (RAG) and other systems that use vectors and embeddings introduce their own risks — poisoned or manipulated embeddings, data leakage across tenants, and access-control failures in the vector store. As RAG becomes the default pattern, securing the retrieval layer is critical.

LLM09: Misinformation

LLMs can produce confident but false output (hallucinations). When applications and users over-rely on this output for decisions — legal, medical, financial or operational — the consequences can be severe. Mitigations include grounding responses in trusted sources, human oversight and clear communication of limitations.

LLM10: Unbounded Consumption

LLMs are resource-intensive. Without limits, attackers can drive excessive inference (denial-of-wallet), extract the model through repeated queries, or cause denial of service. Rate limiting, quotas, monitoring and spend caps protect both availability and cost.

OWASP LLM Top 10 at a Glance

RiskWhat it meansCore defence
LLM01 Prompt InjectionInput overrides the model's instructionsSeparate trusted vs untrusted input; constrain tools
LLM02 Sensitive Info DisclosureModel leaks PII, secrets or dataData minimisation, output filtering, least privilege
LLM03 Supply ChainCompromised model/dataset/dependencyVet, pin and verify AI components
LLM04 Data & Model PoisoningTampered training data backdoors the modelValidate and control training data
LLM05 Improper Output HandlingUnsafe use of model outputTreat output as untrusted; encode/sandbox
LLM06 Excessive AgencyAgents take unauthorised actionsLeast privilege; human approval for high impact
LLM07 System Prompt LeakageSystem prompt/secrets exposedNo secrets in prompts; test for leakage
LLM08 Vector & Embedding WeaknessesRAG/retrieval layer abusedSecure vector store; tenant isolation
LLM09 MisinformationConfident but false outputGround in sources; human oversight
LLM10 Unbounded ConsumptionDenial-of-wallet / model extractionRate limits, quotas, monitoring

How to Defend Your LLM Applications

Securing AI is not a one-time task — it spans the model, the data, the application and the agents around it. A practical programme combines hands-on testing with governance:

  • Red-team your LLM applications against all ten OWASP LLM risks and the MITRE ATLAS adversarial-ML tactics.
  • Separate trusted system instructions from untrusted user and retrieved content, and treat every model output as untrusted.
  • Apply least privilege to models, tools and agents, and require human approval for high-impact actions.
  • Vet and verify third-party models, datasets and dependencies; protect training data from poisoning.
  • Stand up AI governance aligned to the NIST AI RMF and ISO/IEC 42001, with a working AI risk register.
  • Monitor AI systems in production with rate limits, logging and an AI incident-response plan.

Map the OWASP LLM Top 10 to a Real Assessment

Knowing the risks is the first step; proving your AI is resilient against them is the next. The most effective way to do that is an independent AI/LLM security assessment that red-teams your application against the OWASP LLM Top 10 and MITRE ATLAS, then maps remediation to recognised AI governance frameworks. This pairs naturally with traditional penetration testing (VAPT), which secures the software and infrastructure around your AI — most AI products need both.

Conclusion

The OWASP Top 10 for LLM Applications gives every team building with AI a clear, prioritised map of what can go wrong — from prompt injection and data leakage to excessive agency and supply-chain risk. Treat it as a baseline: test against all ten, harden the model and the software around it, and back it with AI governance. Do that, and you can adopt AI quickly without taking on hidden risk.

Naveen Kumar

Naveen Kumar

CyberSigma is a CERT-In empanelled, PCI QSA authorized cybersecurity firm helping organisations secure AI and LLM applications with red-teaming, penetration testing and AI governance aligned to OWASP, NIST AI RMF and ISO/IEC 42001.

Leave A Comment

CyberSigma office locations across India, UAE, Egypt and Australia

Our Office

Locations we operate from

HQ, Noida, India

405, 4th Floor, Majestic Signia, Sector 62, Noida, Uttar Pradesh 201309

Pune, India

InCube Centre, Tejaswini Society, Lane 2, Aundh, PUNE, India, 411007

Mumbai, India

A802, Crescenzo, C /38-39, G-Block, Bandra Kurla Complex, Mumbai-400051, Maharashtra, India

Bengaluru, India

Maharaj, 152/4, 8th Cross, Chamrajpet, Bengaluru, Karnataka, India, 560018

UAE

Business Point Building - Office No. 702 - Dubai - United Arab Emirates

UAE

L.L.C Muna AlJaziri Building, Office No 303 Al Mararr Dubai, UAE

Egypt

19 Dr. Omar Dessouky Street, Cairo- Egypt 4271020

Australia

Level 4, 80 Market Street, South Melbourne 3205