Tag: llm-evals

All the articles with the tag "llm-evals".

Eval-Driven Development: How I Actually Build LLM Features Now

24 Mar, 2026

My day-to-day loop for LLM features in 2026: write the eval first, then the prompt, then the code, and fold every production failure back in as a case.
Evals Are the New Unit Tests (And You're Not Writing Them)

13 Feb, 2024

Shipping an LLM feature with no evals is shipping with no tests, and almost everyone is doing it. A small, hand-written harness you run on every change, plus the honest limits of grading with another model.

Eval-Driven Development: How I Actually Build LLM Features Now