Firewalls
A firewall evaluates every prompt before it reaches the model. Rules are LLM-based: you write a short classification prompt, and the firewall calls a judge instance to score the input against it. The resulting decision is one of block, allow, or audit.
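As a rough illustration of the judge step, the sketch below combines a rule's classification prompt with the user input, hands the result to a judge, and compares the returned score to a threshold. Everything here is hypothetical: `build_judge_input`, `fake_judge`, the 0 to 1 score range, and the 0.5 threshold are assumptions made for illustration, not the platform's actual API.

```python
from typing import Callable

# Hypothetical: a judge receives the full classification text and returns a score in [0, 1].
Judge = Callable[[str], float]


def build_judge_input(classification_prompt: str, user_input: str) -> str:
    """Combine the rule's classification prompt with the input to be scored."""
    return f"{classification_prompt}\n\nInput:\n{user_input}"


def fake_judge(text: str) -> float:
    """Stand-in for a real LLM judge: scores 1.0 when the input mentions 'password'."""
    user_part = text.split("Input:\n", 1)[1]
    return 1.0 if "password" in user_part.lower() else 0.0


def rule_matches(
    classification_prompt: str,
    user_input: str,
    judge: Judge,
    threshold: float = 0.5,  # assumed cut-off, not a documented default
) -> bool:
    """Return True when the judge's score meets the threshold."""
    score = judge(build_judge_input(classification_prompt, user_input))
    return score >= threshold
```

In a real deployment the judge would be an LLM call; the stub keeps the example runnable without network access.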
Why an LLM judge
How it fits together
Three primitives work together. A rule is a classification prompt the judge runs against the user message. A firewall is a named bundle of rules plus a mode (block / allow / audit) and a judge instance. An OpenAI client attaches a firewall by slug; the platform evaluates it before the request reaches the model.
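The relationship between the three primitives can be sketched in a few lines of Python. This is a toy model under stated assumptions, not the platform's schema: the field names, the `REGISTRY` lookup, the `keyword_judge` stub, and the `pii-guard` slug are all hypothetical, and the mapping from mode to decision (a matching rule yields the firewall's mode, otherwise allow) is a guess made for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Literal

Mode = Literal["block", "allow", "audit"]
# Hypothetical judge signature: (classification prompt, user message) -> matched?
Judge = Callable[[str, str], bool]


@dataclass
class Rule:
    name: str
    prompt: str  # classification prompt the judge runs against the user message


@dataclass
class Firewall:
    slug: str
    mode: Mode
    judge: Judge
    rules: list[Rule] = field(default_factory=list)

    def evaluate(self, message: str) -> Mode:
        # Assumption: any matching rule yields the firewall's mode; otherwise allow.
        matched = any(self.judge(rule.prompt, message) for rule in self.rules)
        return self.mode if matched else "allow"


def keyword_judge(prompt: str, message: str) -> bool:
    """Stand-in for an LLM judge: matches when the message mentions 'ssn'."""
    return "ssn" in message.lower()


# Hypothetical registry so a client can refer to a firewall by slug.
REGISTRY: dict[str, Firewall] = {}


def send_with_firewall(slug: str, message: str, model: Callable[[str], str]) -> str:
    """Evaluate the firewall attached by slug before the message reaches the model."""
    decision = REGISTRY[slug].evaluate(message)
    if decision == "block":
        return "[blocked by firewall]"
    return model(message)  # allow and audit both pass through in this sketch


REGISTRY["pii-guard"] = Firewall(
    slug="pii-guard",
    mode="block",
    judge=keyword_judge,
    rules=[Rule(name="no-ssn", prompt="Does the message contain a social security number?")],
)
```

Calling `send_with_firewall("pii-guard", "My SSN is ...", model)` returns the blocked placeholder without invoking `model`, while a harmless message is forwarded unchanged.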
The full configuration shape, the rule catalog, and the HTTP surface are covered in the guides below.
Where to go next
Rules →
The fields that make up a rule, with a full example.
Firewall configuration →
Firewall fields, modes (block / allow / audit), and how to attach one.
API surface →
HTTP endpoints for rules and firewalls, including /evaluate.
Create a firewall →
Pick a judge, set the mode, and attach the first rules.