Quality criteria
Define what a useful and correct result looks like.
Type help to explore.
Evaluation
A convincing AI project needs task-specific checks, failure analysis, and human review.
Define what a useful and correct result looks like.
Test ambiguity, outdated data, unsupported claims, and unsafe actions.
Evaluate whether the workflow saves time without reducing control.