Vincent Koc

AI research engineer and developer-relations professional at Comet ML, and an OpenClaw maintainer focused on agents, telemetry, hooks, and security.

OpenClaw’s 3,000-Commit Day Shows Code Review Becoming the Bottleneck

Vincent Koc uses OpenClaw’s high-velocity refactor to argue that agentic software development is becoming an industrial management problem, not a prompting trick. In his account, a project that briefly touched 82% of its core codebase and produced thousands of commits exposed a new bottleneck: the human ability to supervise parallel agents, trust the test harness, reject bloat, and stop sessions that have lost the plot.

AI EngineerJun 5, 202611 min read

Fixed Evaluation Suites Go Stale as Agents Optimize Toward Intent

Vincent Koc of Comet ML argues that AI evaluation is being outpaced by the systems it is meant to measure. In a talk on adaptive evaluation for agents, Koc says static benchmarks and handcrafted test sets are poorly suited to applications that change with prompts, tools, production traces, user behavior and even their own harnesses. His proposed direction is to define the intended end state, use traces and telemetry to surface drift and edge cases, and treat evals as a continuously revised system rather than a one-time benchmark.

AI EngineerMay 12, 202611 min read