← Back to Learn
deploymentbest-practicesguardrails

AI Agent Security Checklist Before Deployment

Authensor

Deploying an AI agent to production without a security review is the equivalent of shipping a web application without authentication. This checklist covers the essential security controls to verify before any agent goes live.

Access and Authentication

  • [ ] Agent API keys use least-privilege roles (not admin keys for routine operations)
  • [ ] Tool access is restricted to only the tools required for the agent's task
  • [ ] Database connections use read-only credentials where write access is not needed
  • [ ] Network egress is restricted to known, required endpoints
  • [ ] Secrets are stored in environment variables or a secrets manager, not in code

Policy Configuration

  • [ ] A YAML policy is defined and tested for the agent's specific use case
  • [ ] The policy uses fail-closed defaults (deny actions not explicitly permitted)
  • [ ] High-risk actions require human approval (financial transactions, data deletion, external communications)
  • [ ] Tool parameter constraints are defined (allowed values, regex patterns, size limits)
  • [ ] Policy has been tested with both legitimate inputs and adversarial inputs

Content Safety

  • [ ] Input scanning is enabled for prompt injection detection
  • [ ] Output scanning is enabled for PII and sensitive data leakage
  • [ ] Content scanning thresholds are tuned to the use case (not just defaults)
  • [ ] False positive rates have been measured and documented

Monitoring and Alerting

  • [ ] Behavioral monitoring is enabled and baseline metrics are established
  • [ ] Alert rules are configured for anomalous action rates, error spikes, and policy denials
  • [ ] An on-call rotation exists for safety-related alerts
  • [ ] Log retention meets compliance requirements

Audit and Compliance

  • [ ] Hash-chained audit trail is enabled and recording
  • [ ] Receipt storage is configured with appropriate retention
  • [ ] Audit trail integrity verification is scheduled

Incident Response

  • [ ] A kill switch exists to disable the agent immediately
  • [ ] An incident response runbook is documented and accessible
  • [ ] The team has conducted at least one tabletop exercise for an agent safety incident

Review this checklist before every production deployment, not just the first one. Configuration changes, model updates, and new tool integrations all warrant a fresh security review.

Keep learning

Explore more guides on AI agent safety, prompt injection, and building secure systems.

View All Guides