Category

Agent Safety Controls

These ideas describe how agent systems constrain behavior, route decisions through policy, and leave records for later review.

How to recognize this theme

This theme covers controls that govern which tools an AI agent may use and how its actions are reviewed.

In a daily board, this category groups terms by their shared role. Look for four cards that describe the same mechanism, risk area, or workflow rather than four words that merely sound similar.

Educational context

These entries are vocabulary notes for learning. They are not project endorsements, token recommendations, exchange rankings, or trading signals.

Prompt Firewall

A prompt firewall checks user or tool inputs for unsafe, disallowed, or suspicious instructions before they reach an AI model or agent.
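As a rough illustration of the idea, the sketch below screens text against a short list of patterns before it would be passed on to a model. The pattern list and the function name screen_input are hypothetical and chosen only for this example. Real prompt firewalls typically rely on much broader detection than a few regular expressions.

```python
import re
from typing import Optional, Tuple

# Hypothetical patterns for illustration only; not a complete or recommended list.
SUSPICIOUS_PATTERNS = [
    re.compile(r"ignore (all |any )?(previous|prior) instructions", re.IGNORECASE),
    re.compile(r"reveal (your|the) system prompt", re.IGNORECASE),
]


def screen_input(text: str) -> Tuple[bool, Optional[str]]:
    """Return (allowed, matched_pattern) for a piece of user or tool input."""
    for pattern in SUSPICIOUS_PATTERNS:
        if pattern.search(text):
            # Block the input and report which rule matched.
            return False, pattern.pattern
    # No rule matched, so the input may continue to the model.
    return True, None
```

A caller would run screen_input on each incoming message and forward only the ones that come back allowed.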

Policy Engine

A policy engine evaluates requests or planned actions against rules so a system can allow, block, modify, or escalate them.
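The four outcomes named above (allow, block, modify, escalate) can be sketched as a small rule function. Everything here is an assumption made for illustration: the Action fields, the blocked tool name, and the numeric thresholds are hypothetical and not taken from any particular product.

```python
from dataclasses import dataclass, replace
from enum import Enum
from typing import Tuple


class Decision(Enum):
    ALLOW = "allow"
    BLOCK = "block"
    MODIFY = "modify"
    ESCALATE = "escalate"


@dataclass(frozen=True)
class Action:
    tool: str
    amount: float = 0.0


# Hypothetical rules for this sketch.
BLOCKED_TOOLS = {"delete_database"}
ESCALATE_ABOVE = 1000.0  # amounts above this go to a reviewer
CAP_ABOVE = 100.0        # amounts above this are reduced to the cap


def evaluate(action: Action) -> Tuple[Decision, Action]:
    """Check a planned action against the rules, most restrictive rule first."""
    if action.tool in BLOCKED_TOOLS:
        return Decision.BLOCK, action
    if action.amount > ESCALATE_ABOVE:
        return Decision.ESCALATE, action
    if action.amount > CAP_ABOVE:
        # Modify: return a copy of the action with the amount capped.
        return Decision.MODIFY, replace(action, amount=CAP_ABOVE)
    return Decision.ALLOW, action
```

Returning the possibly modified action alongside the decision lets the caller execute exactly what the rules approved rather than the original request.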

Human Approval

Human approval is a control that requires a person to review and authorize a sensitive action before an automated system proceeds.
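One way to picture this control is as a gate placed in front of sensitive actions. In the sketch below, the set of sensitive action names and the approver callback are hypothetical. In practice the approver might be a ticket, a chat prompt, or a review screen rather than a function call.

```python
from typing import Callable

# Hypothetical list of actions that need sign-off before running.
SENSITIVE_ACTIONS = {"send_payment", "delete_records"}


def run_with_approval(
    action: str,
    execute: Callable[[], str],
    approver: Callable[[str], bool],
) -> str:
    """Run execute() only if the action is non-sensitive or a person approves it."""
    if action in SENSITIVE_ACTIONS and not approver(action):
        # The reviewer declined, so the action is never executed.
        return "rejected"
    return execute()
```

The key property is that the sensitive action does not run until the approver returns a positive answer.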

Audit Log

An audit log records important events, decisions, and changes so operators can review what happened and when.
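A minimal sketch of such a record is an append-only list of timestamped entries. The class name AuditLog and the field names actor, event, and detail are assumptions for this example. A production log would normally be written to durable, tamper-resistant storage rather than kept in memory.

```python
import json
from datetime import datetime, timezone
from typing import Dict, List


class AuditLog:
    """In-memory, append-only log of events (illustrative only)."""

    def __init__(self) -> None:
        self._lines: List[str] = []

    def record(self, actor: str, event: str, detail: str) -> Dict[str, str]:
        """Append one entry with a UTC timestamp and return it."""
        entry = {
            "timestamp": datetime.now(timezone.utc).isoformat(),
            "actor": actor,
            "event": event,
            "detail": detail,
        }
        # Store each entry as one serialized JSON line; entries are never edited.
        self._lines.append(json.dumps(entry, sort_keys=True))
        return entry

    def entries(self) -> List[Dict[str, str]]:
        """Return all entries in the order they were recorded."""
        return [json.loads(line) for line in self._lines]
```

Recording who acted, what happened, and when is what lets an operator reconstruct a sequence of decisions afterward.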