
Romain Huet
Head of Developer Experience at OpenAI, where he works on Codex, the API, and developer tools. He previously held product and developer-platform roles at Stripe and Twitter and co-founded Jolicloud.
OpenAI Folds Codex Into ChatGPT for a Unified Enterprise Workflow
OpenAI used its Intelligence at Work enterprise event to argue that workplace AI is moving from separate tools into a single operating workflow for companies. Sam Altman framed the roadmap as a response to customer demand to bring OpenAI’s products together, while executives pointed to ChatGPT and Codex integration, role-specific agents, annotations in existing tools, and deployment through Sites as the product layer for enterprise adoption. BNY chief executive Robin Vince supplied the customer case, saying the bank chooses AI optimism because it sees the technology as a capacity creator.
Codex Moves Builder Work From Coding to Specification
Matias Castello, product lead at Alchemy, argues that Codex is shifting software work from writing code toward specifying intent, constraints and preferences clearly enough for an agent to act. In a conversation with OpenAI’s Romain Huet, Castello describes using Codex for code review, product documents, backlog creation, feature experiments and personal projects, with human judgment reserved for deciding what should ship. His central claim is that the limiting factor is increasingly not implementation capacity but how well builders can communicate what they want.
Codex Can Now Operate Local Mac Apps Without Taking Over
OpenAI’s Ari Weinstein argues that computer use turns Codex from a coding agent into a system that can operate local Mac applications by seeing interfaces, clicking, typing and continuing work in the background. In a demonstration with Romain Huet, Weinstein presents the feature as distinct from a full-desktop takeover: Codex uses a separate cursor, combines screenshots with macOS accessibility data, and requires app-by-app permission before it can see or type into local software.
OpenAI Splits Audio API Into Translation, Transcription, and Voice-Agent Models
OpenAI is presenting three new API audio models as infrastructure for voice applications that can translate, transcribe, reason and act in real time. Romain Huet’s demonstration centered on GPT-Realtime-Translate, which keeps pace with multilingual speech, and GPT-Realtime-2, a voice-agent model that can follow turn-taking instructions, use business context and call tools while explaining its work. GPT-Realtime-Whisper completes the set as a streaming speech-to-text model for live transcription.