Scott Clark

Co-founder and CEO of Distributional, an AI reliability company building testing, evaluation, calibration, and monitoring tools for production AI systems. He previously co-founded SigOpt, an AI optimization platform acquired by Intel, where he later led AI and high-performance-computing software teams.

Production Analytics Finds Agent Failures That Standard Evals Miss

Scott Clark, co-founder and chief executive of Distributional, argues that teams running LLM agents need to look beyond pre-production evals and dashboards of known metrics. His case is that the most consequential failures often emerge only in production, where agents interact with users, tools and changing models in ways teams did not know to test. Clark proposes an observability stack in which telemetry records what happened, monitoring tracks known signals, and analytics clusters the behavior captured in traces to surface unknown failure modes, which can then become new evals, guardrails, prompts or system fixes.

The TWIML AI Podcast · May 7, 2026 · 20 min read