Prompt Firewall
A prompt firewall checks user or tool inputs for unsafe, disallowed, or suspicious instructions before they reach an AI model or agent.
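As a rough illustration of the definition above, the sketch below screens a piece of text against a small list of patterns before it would be forwarded to a model. The pattern list and the names `SUSPICIOUS_PATTERNS`, `FirewallResult`, and `check_input` are assumptions made for this example. Real prompt firewalls may rely on classifiers, allowlists, or other techniques rather than regular expressions.

```python
import re
from dataclasses import dataclass

# Hypothetical patterns chosen for illustration only; a real deployment
# would maintain and tune its own rules.
SUSPICIOUS_PATTERNS = [
    re.compile(r"ignore (all |any )?(previous|prior) instructions", re.IGNORECASE),
    re.compile(r"reveal (the |your )?system prompt", re.IGNORECASE),
]


@dataclass
class FirewallResult:
    allowed: bool  # True if the text may be forwarded to the model
    reason: str    # short explanation of the decision


def check_input(text: str) -> FirewallResult:
    """Screen user or tool input before it reaches the model."""
    for pattern in SUSPICIOUS_PATTERNS:
        if pattern.search(text):
            return FirewallResult(False, f"matched pattern: {pattern.pattern}")
    return FirewallResult(True, "no suspicious pattern matched")
```

A caller would forward the text to the model only when `check_input(text).allowed` is true, and otherwise reject or flag it.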
Category
These ideas describe how agent systems constrain behavior, route decisions through policy, and leave records for later review.
The category covers AI agent controls that help govern tool use and review actions.
In a daily board, this category groups terms by their shared role. Look for four cards that describe the same mechanism, risk area, or workflow rather than four words that merely sound similar.
These entries are vocabulary notes for learning. They are not project endorsements, token recommendations, exchange rankings, or trading signals.
A prompt firewall checks user or tool inputs for unsafe, disallowed, or suspicious instructions before they reach an AI model or agent.
A policy engine evaluates requests or planned actions against rules so a system can allow, block, modify, or escalate them.
Human approval is a control that requires a person to review and authorize a sensitive action before an automated system proceeds.
An audit log records important events, decisions, and changes so operators can review what happened and when.
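The last three entries can be sketched together as one small pipeline: a policy check decides what happens to a planned action, sensitive actions wait for a person, and every decision is recorded. All names here (`AuditLog`, `evaluate_policy`, `run_action`, and the two action sets) are hypothetical and exist only for this example. The sketch covers the allow, block, and escalate outcomes and leaves out the modify outcome for brevity.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Callable


@dataclass
class AuditLog:
    """In-memory audit log; a real system would use durable storage."""
    entries: list = field(default_factory=list)

    def record(self, event: str, detail: str) -> None:
        self.entries.append({
            "time": datetime.now(timezone.utc).isoformat(),
            "event": event,
            "detail": detail,
        })


# Hypothetical rule tables, assumed for illustration.
BLOCKED_ACTIONS = {"delete_database"}
SENSITIVE_ACTIONS = {"send_payment"}


def evaluate_policy(action: str) -> str:
    """Policy engine: map a planned action to allow, block, or escalate."""
    if action in BLOCKED_ACTIONS:
        return "block"
    if action in SENSITIVE_ACTIONS:
        return "escalate"
    return "allow"


def run_action(action: str, approver: Callable[[str], bool], log: AuditLog) -> bool:
    """Return True if the action may proceed, logging each decision."""
    decision = evaluate_policy(action)
    log.record("policy_decision", f"{action}: {decision}")
    if decision == "block":
        return False
    if decision == "escalate":
        # Human approval: a person must authorize the sensitive action.
        approved = approver(action)
        outcome = "approved" if approved else "rejected"
        log.record("human_approval", f"{action}: {outcome}")
        return approved
    return True
```

Here `approver` stands in for whatever review step a person performs. In practice it might open a ticket or prompt an operator rather than return immediately.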