Published onDecember 21, 2025Automatic Debugging and Failure Detection in AI Agent SystemsAI-AgentsLLMDebuggingObservabilityReliabilityA survey of DoVer and related work on failure attribution, intervention-based debugging, and observability tooling for LLM agent systems.
Published onDecember 10, 2025Why You Don’t Need AI Agent EvaluationsAI-AgentsEvaluationObservabilityLLMsStartupsA satirical look at why skipping AI agent evaluations makes perfect sense if you don't value maintainability, customers, or long-term sanity.