Category Hub

AI Security Guides and Practical Reviews

Use this hub to navigate SecureCodeReviews coverage on LLM application risks, agent behavior, model integrations, tool use, and enterprise AI controls.

AI Security

Articles

Latest Update

May 9, 2026

Top Tags

Security guides backed by a real operating company.

These category hubs group original security content written for engineers, buyers, and reviewers. We keep author names visible on articles, maintain public policy pages, and update important guidance when the underlying risk picture changes.

Named authors

Each guide links to a visible author and publish history.

Update history

Important posts show when they were revised, not just published once.

Public trust pages

Editorial, company, contact, privacy, and terms pages stay easy to verify.

Verify the site context

Readers can review how we publish, who we are, and how to request corrections or help.

Editorial policy and content standards

Company and contact pages for accountability

Consistent legal and product navigation

AI Security

What Is Agentic AI? Security Risks, Use Cases, Challenges, and Future

A detailed guide to agentic AI for engineering and security teams. Learn what agentic AI is, how it works, where it creates business value, why it is harder to secure than a standard chatbot, and what the future of agentic AI security looks like.

SCR Security Research Team

AI Security Testing Tools: Garak, PyRIT, promptfoo, and the Controls They Actually Validate

A practical guide to AI security testing tools for LLM and agentic applications. Explains what Garak, PyRIT, and promptfoo are good at, where each tool falls short, and how to combine automated testing with human review for prompt injection, data leakage, and unsafe tool use.

SCR Security Research Team

AI Chatbot Security Best Practices: Production Checklist for 2026

A practical guide to securing AI chatbots and customer-facing assistants in production. Covers prompt injection, insecure output rendering, account abuse, data leakage, unsafe actions, and the controls teams need before exposing a chatbot to real users.

SCR Security Research Team

LLM Hallucinations: Detection, Mitigation, and Enterprise Risk Reduction

A practical guide to reducing LLM hallucinations in enterprise AI systems. Explains when hallucinations become security or compliance incidents, how to measure them, and what teams can do with grounding, validation, abstention, and workflow design.

SCR Security Research Team

AI Compliance Checklist: GDPR, HIPAA, SOC 2, and Data Retention for LLM Apps

A practical compliance guide for LLM applications handling customer, employee, health, or regulated data. Covers GDPR, HIPAA, SOC 2, retention controls, logging boundaries, vendor contracts, and the technical guardrails teams need before shipping AI features.

SCR Security Research Team

Third-Party AI Integration Security: Plugins, APIs, and Agent Tool Chains

A practical security guide for teams connecting LLMs to SaaS tools, internal APIs, and agent workflows. Explains the real risks in plugins, OAuth scopes, webhook trust, retrieved third-party content, and action execution across tool chains.

SCR Security Research Team

AI Agent Memory Security: Context Poisoning, Secret Retention, and Session Isolation

Agent memory is one of the fastest ways an AI assistant turns one bad interaction into a recurring security problem. Learn how context poisoning works, where secret retention happens, and how to design memory systems that do not become persistent attack surface.

LLM Guardrails in Production: Filters, Policy Engines, and Failure Modes

Guardrails are not a checkbox. This guide explains how real production guardrails work, where they fail, and how to combine prompt attack detection, output controls, and fallback behavior into something operators can actually trust.

Model Provenance Security: How to Verify Open-Weight Models Before Deployment

A model file is a software artifact, not a neutral blob. Learn how to verify open-weight models, reduce pickle risk, use safer weight formats, and build provenance checks into your AI deployment pipeline.

Secure Tool Calling for LLMs: Function Calling Risks and Runtime Controls

Tool calling is where an LLM application stops being a text system and starts becoming an action system. Learn the runtime controls, permission boundaries, and confirmation patterns that keep function calling from becoming an automation incident.

Multi-Tenant LLM Security: Preventing Cross-Tenant Data Leakage in Shared AI Apps

Shared AI platforms fail at the boundaries first. Learn how cross-tenant data leakage happens in prompts, caches, retrieval, and logs, and how to design tenant isolation that still holds when the AI features become more complex.

Self-Hosted LLM Security: Hardening vLLM, TGI, Ollama, and Inference APIs

Self-hosting an LLM gives you more control, but it also moves model, runtime, and network risk onto your team. This guide covers the hardening steps that matter for inference servers, private model pulls, prompt logs, and exposed GPU infrastructure.

AI Data Leakage Prevention: Prompts, Logs, Outputs, and Enterprise Controls

Sensitive data leaks in AI systems rarely come from one place. They move through prompts, retrieval context, outputs, logs, and evaluation traces. This guide shows how to build AI DLP controls that actually match how LLM apps are used in production.

Fine-Tuning Security: Poisoned Datasets, LoRA Risks, and Safer Training Pipelines

Fine-tuning moves AI risk into your own pipeline. Learn how dataset poisoning, unsafe adapters, and weak evaluation practices affect fine-tuned models, and how to secure training workflows without grinding delivery to a halt.

LLM Gateway Security: Model Routing, Budget Controls, and Abuse Detection

An LLM gateway is not just a cost-control layer. It is the place where authentication, model policy, rate limiting, prompt controls, and provider failover need to come together. Learn how to design gateway security that does more than forward requests.

AI Evals Security: How to Test LLM Applications Without Gaming Your Benchmarks

Evaluation pipelines decide what gets shipped, but they are often easier to game than teams admit. Learn how to secure AI evals against leakage, benchmark contamination, weak security coverage, and unsafe auto-promotion rules.

Prompt Injection Attacks: Complete Prevention Guide for 2026

The most comprehensive guide to prompt injection attacks — direct, indirect, and multi-turn. Covers real-world breaches, OWASP mitigations, and defense-in-depth strategies with code examples for securing LLM applications in production.

RAG Security: Vulnerabilities in Retrieval-Augmented Generation Systems (2026)

Deep dive into security vulnerabilities in RAG (Retrieval-Augmented Generation) pipelines — data poisoning, indirect prompt injection via retrieved context, embedding inversion attacks, and tenant isolation failures. Includes real-world breaches and production-ready defenses.

AI Supply Chain Security: Pre-trained Models, Datasets & ML Pipeline Risks (2026)

Your AI is only as secure as its supply chain. This guide covers backdoored model weights on Hugging Face, poisoned training datasets, compromised ML libraries, and the emerging AI SBOM standard — with real incidents and production defenses.

LLM Output Security: Preventing XSS, Code Injection & Data Leakage in AI Apps (2026)

LLM output is untrusted input. This guide covers how AI-generated responses can introduce XSS, SQL injection, command injection, and data leakage — with production code examples for output sanitization, CSP headers, and structured output schemas.

AI Red Teaming: How to Test LLM Applications for Security Vulnerabilities (2026)

A practical, step-by-step methodology for red teaming LLM applications — from reconnaissance and prompt injection testing to output abuse and agentic AI exploitation. Includes 30+ test cases, open-source tools (Garak, PyRIT), and a scoring framework.

AI Security: Complete Guide to LLM Vulnerabilities, Attacks & Defense Strategies 2025

Master AI and LLM security with comprehensive coverage of prompt injection, jailbreaks, adversarial attacks, data poisoning, model extraction, and enterprise-grade defense strategies for ChatGPT, Claude, and LLaMA.

OWASP Top 10 for Agentic AI 2026: Risks, Attack Paths, and Security Controls

A detailed guide to the OWASP Top 10 for Agentic AI Applications covering goal hijacking, tool manipulation, prompt injection, uncontrolled autonomy, and the security controls teams need for agentic AI deployments.

SCR Security Research Team

How to Secure AI Agents: Identity & Access Management for Agentic AI

Machine identities now outnumber human identities 45:1. Learn how to implement IAM for AI agents — authentication, authorization, credential management, and delegation chains in multi-agent systems.

SCR Security Research Team

AI-Powered Attacks in 2026: Deepfakes, Vibe Coding & Automated Exploits

AI is supercharging cyberattacks. From $25M deepfake fraud to insecure AI-generated 'vibe code' to fully automated exploit chains, this guide covers the threats defenders face in 2026 with real cases, statistics, and defensive strategies.

SCR Security Research Team

Securing Generative AI APIs: MCP Security & Shadow AI Risks in 2026

Model Context Protocol (MCP) is the emerging standard for connecting AI to tools and data. But MCP servers, shadow AI usage, and AI supply chain attacks introduce critical risks. Learn how to secure generative AI APIs.

SCR Security Research Team

AI Governance Framework 2026: Building Guardrails for Enterprise AI

94% of executives say AI is the biggest driver of change, but only 44% have AI governance policies. This guide provides a complete AI governance framework with policy templates, risk assessment matrices, EU AI Act compliance, and organizational structure.

SCR Security Research Team

Securing RAG Pipelines: Retrieval-Augmented Generation Threats & Defenses

RAG is the most popular LLM architecture pattern — and the most attacked. Learn about document poisoning, embedding manipulation, and how to build secure RAG systems.

SCR Security Research Team

AI Red Teaming: How to Break LLMs Before Attackers Do

A practical guide to AI red teaming — adversarial testing of LLMs, prompt injection techniques, jailbreaking methodologies, and building an AI security testing program.

SCR Security Research Team

AI Security & LLM Threats: Prompt Injection, Data Poisoning & Beyond

A comprehensive analysis of AI/ML security risks including prompt injection, training data poisoning, model theft, and the OWASP Top 10 for LLM Applications. With practical defenses and real-world examples.

SCR Security Research Team

Jun 10, 2025

20 min read

Read article

Need this category reviewed in your own stack?

The articles here are a good starting point. If you need a targeted review for a release, feature, or audit scope, we can assess the concrete implementation rather than the generic pattern.