Tag: llm-evals
All the articles with the tag "llm-evals".
-
Eval-Driven Development: How I Actually Build LLM Features Now
My day-to-day loop for LLM features in 2026: write the eval first, then the prompt, then the code, and fold every production failure back in as a case.
-
Evals Are the New Unit Tests (And You're Not Writing Them)
Shipping an LLM feature with no evals is shipping with no tests, and almost everyone is doing it. A small, hand-written harness you run on every change, plus the honest limits of grading with another model.