Join
AI Evaluation Checklist

AI Evaluation Checklist

A compact checklist for deciding whether an AI output is actually usable.

Before keeping an AI output, check:

  • is it accurate enough to trust
  • is it structured in the right way for the task
  • does it sound like something you would actually publish or use
  • are there hidden assumptions that need to be made explicit
  • did it save time, or just create another layer to clean up

If the answer to the last question is no, the workflow still needs work.

What this checklist is for

This page is meant to stop "looks plausible" from becoming "good enough". Most AI failures are not dramatic hallucinations. They are subtler: weak structure, hidden assumptions, shallow verification, or outputs that technically exist but still increase cleanup work.

How to use it

Use the checklist near the end of the workflow, not at the prompt-writing stage. It works best when you already have a draft, summary, plan, or analysis in front of you and need to decide whether it should be trusted, revised, or discarded.