Human-AI Interaction
How people interact with AI systems, including interface design, trust, collaboration, delegation, personalization, and cognitive workflows.
AI’s Creative Promise Is Moving People From Consumption to Authorship
In a closing reflection at Shared Futures: The AI Forum, Patrick J. McGovern Foundation president Vilas Dhar argued that AI’s creative significance should be judged less by what machines can produce than by whether they help people recover agency as makers. Drawing on performances from the forum and a childhood memory of communal singing in India, Dhar framed the risk as passivity: a culture in which creativity is professionalized, distributed and consumed rather than shared. His cautious optimism was that AI could widen participation if it gives people without technical skills new ways to write, sing, build and imagine.
Codex Turns Recorded Workflows Into Reusable Editable Skills
OpenAI presents Record & Replay in Codex as a way to turn a demonstrated recurring workflow into an inspectable, editable skill. In the source example, a user records a YouTube upload process once, and Codex converts the observed steps, defaults and file conventions into a reusable `SKILL.md`. The argument is that repeat work can move from long prompts and remembered preferences to short invocations, with Codex applying the learned workflow to the next relevant task.
AI Distrust Makes Human Agency the Central Cultural Question
Opening Shared Futures: The AI Forum, Vivian Schiller of Aspen Digital and Vilas Dhar of the Patrick J. McGovern Foundation argued that public distrust of AI is not an obstacle to the conversation but its starting point. Schiller framed AI as a contested tool that can either feel imposed on people or be used by artists and makers with agency; Dhar said the deeper issue is not the technology itself, but how people turn fear of replacement into meaning, art, and shared experience.
Camera AirPods Would Give Siri Visual Context in Apple’s 2027 Push
Bloomberg’s Mark Gurman says Apple is preparing a dense 2026 and 2027 hardware cycle that includes its first foldable iPhone, a second-generation foldable, a 20th-anniversary iPhone and camera-equipped AirPods. Gurman argues the AirPods cameras are meant not for photography or facial recognition but to give Siri visual context about a user’s surroundings, while Snap’s new Specs show the same broader push toward ambient, augmented computing despite high prices and limited near-term adoption.
AI Market Power Is Moving Beyond the Frontier Model
Alex Kantrowitz and Ranjan Roy argue that the AI market is shifting away from standalone model capability and toward control of infrastructure, access and workflow layers. Their discussion frames SpaceX’s IPO as a public-market AI-cloud story that complicates OpenAI’s ambitions, Anthropic’s Fable rollout as a case where safety policy also looks like market power, and OpenAI’s possible price cuts as a test of whether frontier models can remain premium products. Apple’s Siri, in their telling, matters for the same reason: usefulness may come less from the best model than from where the model sits.
Familiar Pain Is Often Mistaken for Relationship Chemistry
Relationship coach and writer Quinlan Walther argues that partner choice is less a measure of inherent worth than a test of self-trust. In her account, people often repeat familiar emotional patterns — mistaking anxiety for chemistry, empathy for obligation, or a wound for a partner — because those patterns feel safer than unfamiliar forms of love. Breaking the cycle, she says, requires knowing what one wants, tolerating the feelings that follow, setting boundaries, and choosing from values rather than fear.
Human Attention Is Becoming the Bottleneck in AI Coding Workflows
Zack Proser, an Applied AI engineer at WorkOS, argues that AI coding has shifted the bottleneck from tool speed to human attention. His proposed workflow uses voice dispatch, isolated git worktrees, Slack and Linear-reading agents, remote phone control, and layered verification so developers can keep agent loops moving without staying pinned to a desk or rubber-stamping work they can no longer track.
AI Works Best When Domain Experts Control Its Use
Josh Tyrangiel’s AI for Good argues that artificial intelligence is most useful when domain experts, not technology companies or models themselves, decide how it is applied. In conversation with Aspen Economic Strategy Group director Melissa S. Kearney, Tyrangiel says his reporting found real gains in healthcare, education, government, and recycling, but mostly as incremental improvements shaped by doctors, teachers, public servants, and other practitioners. His case is not that AI’s risks are overstated, but that the policy question is how to preserve human authority while regulating the most dangerous capabilities.
Apple’s New Siri Tests Who Controls the Default AI Assistant
John Coogan and Jordi Hays read Apple’s WWDC as a test of whether the company can turn its long-delayed Siri promise into a defensible AI interface without giving up control of defaults, privacy, and the iPhone camera. The Diet TBPN segment argues that Apple’s AI story is less about a single keynote than about older bets now becoming technically possible, while Anthropic’s Claude Fable release and Meta’s data-center training push show the same shift toward long-running inference and physical AI infrastructure.
Brilliant’s Koji Uses AI to Make Students Solve Problems Themselves
Brilliant founder Sue Khim tells This Week in Startups that the company’s new AI tutor, Koji, is built to counter the education use case parents fear most: software that gives students answers while eroding their ability to think. Khim argues the opportunity is not generic AI in the classroom, but a constrained tutor embedded in Brilliant’s lessons that uses Socratic prompting, visual scaffolding, and assessment to help students solve problems themselves. Jason Calacanis frames the same idea more broadly, saying AI is useful when it strengthens the person doing the work rather than replacing the work.
Apple’s WWDC Leaves Siri-Scale AI Infrastructure Questions Unanswered
John Coogan and Jordi Hays used Apple’s WWDC announcements to argue that Apple’s AI challenge has shifted from invention to integration: putting familiar model behaviors inside Siri, iOS and Mac workflows without breaking the company’s privacy and product-control instincts. The discussion also treated Apple’s “private cloud” language as an unresolved infrastructure question, then turned to strong U.S. jobs data as a check on AI layoff claims and to viral VC horror stories as a distinction between bad fundraising theater and more serious disclosure or board-level problems.
AI Research Challenge Draws 200 Teams to Study Organizational Change
Stanford HAI and Google DeepMind’s AI for Organizations Grand Challenge is presented as an effort to study AI’s effects on organizations directly, rather than treating workplaces merely as places where AI tools are deployed. Melissa Valentine and other organizers argue that the central questions are how AI changes coordination, collaboration, alignment and collective performance, with DeepMind positioned not only as sponsor but as a research setting. The scale of the response — about 200 teams from more than 150 universities, narrowed to 13 finalists — is used to show broad academic demand for that inquiry.
Responsible Mental Health AI Depends on Measurement, Co-Design, and Trust
At Stanford’s 2026 AI for Mental Health Symposium, Carolyn Rodriguez, Ehsan Adeli, Brandon Staglin and Vaile Wright argued that the urgent question is no longer whether people will use AI for mental health, but whether the field can make that use safe, clinically meaningful and trustworthy. The panel’s case was that responsible deployment will require measurable standards for quality and harm, early involvement from clinicians and people with lived experience, regulatory and payment systems that support trust, and designs that strengthen rather than replace human relationships.
Mental Health AI Is Scaling Before Its Safety Framework Is Settled
At Stanford’s 2026 AI for Mental Health symposium, Russ Altman, Jina Suh and OpenAI’s Sara Johansen treated mental-health AI as a deployment problem already underway, not a speculative research agenda. Suh argued that general-purpose AI systems are now part of a public-health surface and should be evaluated across users’ full journeys, including consent, referrals, aftermath and the labor pushed onto clinicians, crisis lines, families and reviewers. Johansen described OpenAI’s effort to manage that risk through layered model and product policies that route people toward human support, while acknowledging the difficulty of doing so at platform scale.
Developers Want Siri APIs That Turn Apple Intelligence Into Infrastructure
Paul Hudson, creator of Hacking with Swift, argues that Apple’s AI opportunity for developers depends less on a smarter prompt box than on APIs that let Siri serve as an integration layer across apps. Speaking to Bloomberg’s Ed Ludlow, Hudson said developers want to expose app data and functions while Apple Intelligence handles user intent, privacy and cross-device execution—ideally through Apple-controlled infrastructure even if Google’s Gemini is part of the stack.
Tiimo Wants Siri to Make Adaptive Planning Less Manual
Tiimo co-founders Melissa Azari and Helene Nørlem told Bloomberg Technology that Apple’s AI and accessibility work could help make adaptive planning support less manual and easier to reach across devices. Their argument is not that a more capable Siri should replace Tiimo, Apple’s 2025 iPhone App of the Year, but that system-level intelligence could reduce the cognitive load of planning for users with neurodivergent or otherwise less visible needs.
Apple’s Siri Overhaul Tests Its Cross-Device AI Strategy
Carolina Milanesi, president and principal analyst at Creative Strategies, argues that Apple’s next Siri overhaul should be judged less as a ChatGPT rival than as a test of whether Apple can make AI useful across the devices its customers already own. In a Bloomberg Tech discussion with Ed Ludlow, she said Apple’s advantage is embedded, cross-device intelligence, but that pressure is rising as consumers form daily habits with assistants such as ChatGPT and Claude.
Apple’s Siri Overhaul Tests Whether AI Can Become an Operating-System Layer
Bloomberg’s WWDC preview frames Apple’s AI challenge as a test of integration rather than invention. Mark Gurman reports that Apple is expected to use the conference to make Siri more capable across apps, screens, personal data and web search, moving it from a weak voice assistant toward an operating-system layer; Carolina Milanesi and Paul Hudson argue that its value will depend on whether that layer is consistent, private and useful across Apple devices.
Ulta Uses AI to Personalize HR Support for 65,000 Workers
Ulta Beauty executives Rachel Williamson and Josh Siebert describe the retailer’s ServiceNow-backed HR automation rollout as a response to a concrete operating problem: 65,000 employees could not reliably find the policies and support they needed. In a sponsored interview, they argue that the value of AI was not the chatbot itself, but its ability to personalize answers, route routine HR work away from overloaded teams, and preserve human judgment for sensitive cases. Their account frames AI as an enabler of workflow redesign, not an end in itself.
Voice Cloning Preserves Identity for People Losing Speech to MND
At ElevenLabs’ Warsaw summit, Gabi Leibowitz argued that voice cloning can do more than replace lost speech with functional text-to-speech: it can preserve the vocal traits that make people recognizable to themselves and others. The case was told through Irene Perrin, a former history teacher living with motor neuron disease, who uses an ElevenLabs-cloned voice to continue volunteering at St George’s Chapel and says the technology has given back part of the identity the disease took away.
Erste Builds AI as a Governed Platform Inside Digital Banking
Maurizio Poletto, Chief Platform Officer and COO of Erste Group, argues that AI in banking has to be built as a governed platform inside the bank’s existing digital architecture, not treated as a chatbot deployment. In a customer talk with OpenAI, he says Erste has allowed local teams to move quickly on employee productivity tools while centralizing customer-facing AI, especially where customer data is involved, because trust, compliance and product quality make that work slower and harder.
Allica Bank Pushes AI Beyond Use Cases Into Operating Model
Allica Bank CTO Ravneet Shah told OpenAI that the UK SME bank’s AI strategy has moved beyond isolated experiments into a broader change in how the company works. Shah argued that the priority is adoption and operating-model redesign: smaller product teams, fewer handoffs, agent-supported lending workflows, and tools that augment relationship managers rather than replace them. He said Allica is measuring progress less by deployment volume than by whether AI helps the bank deliver useful product increments for customers and internal functions in a regulated environment.
TELUS Digital Cuts Contact-Center Onboarding Time 20% With AI Voice Simulations
TELUS Digital’s vice president of product, Mitch Lieberman, presents the company’s Agent Trainer as a response to a high-volume contact-center onboarding problem: 70,000 associates, 20,000 to 30,000 hires a year, and industry churn of 30% to 50%. Built on ElevenAgents, the voice and chat simulation platform is intended to get new agents ready for customer interactions faster, with TELUS Digital reporting a 20% reduction in time to proficiency, more than 50,000 completed simulations, and early signs of lower churn.
AI Agents Threaten Google’s Control of Search, Chrome, and Gmail
M.G. Siegler, author of Spyglass.org, argues on Big Technology that Google’s AI risk is shifting from model performance to control of the next software interface. In a conversation with Alex Kantrowitz, he says Anthropic and OpenAI are moving faster in coding agents and computer-use workflows that could make search, browsers, Gmail and other web products less central to users’ daily work. The discussion extends that frame to Apple’s WWDC, Meta’s subscription sprawl and Anthropic’s confidential IPO filing, but the core claim is that the AI race is increasingly about who operates the computer on the user’s behalf.
Correct Health Information Can Still Lead Patients to Bad Decisions
Physician John Whyte, former chief medical officer of WebMD, argues in a TEDxNashville talk that the problem with online symptom searching is not access to medical information but the absence of clinical context. Whyte says search engines, symptom checkers, AI tools and algorithmic feeds can surface correct facts while still pushing patients toward anxiety, unsafe self-treatment or misplaced confidence. His prescription is not to stop searching, but to treat health information with skepticism, corroborate it and bring it to a trusted medical professional who can judge what applies.
Cognitive Surrender Is the Core Risk for AI Product Teams
Tony Fadell, the iPod creator, iPhone co-creator and Nest founder, argues that AI raises the value of product judgment rather than replacing it. In a conversation with Lenny Rachitsky, Fadell says builders should use AI to prototype and accelerate bounded work, but not “cognitively surrender” decisions about architecture, taste, marketing, ethics or what is worth building. His broader case is that great products still come from opinionated judgment applied to real pain, new technology and the full customer journey, not from tools that merely make shipping easier.
VS Code Can Render MCP Tool Results as Interactive Apps
GitHub’s Marlene Mhangami and Liam Hampton argue that MCP apps turn chat from a text response surface into a place where tool output can be operated directly. In their VS Code demo, an MCP server profiles a Go app, returns data plus a reference to a bundled HTML UI, and VS Code renders the result as a sandboxed interactive flame graph inside Copilot chat. Their case is that the useful boundary is precise: tools provide data, resources provide the interface, and the host contains the app while keeping the user in context.
Frontier Labs Treat Recursive Self-Improvement as a Near-Term Control Problem
AI in the AM’s first weekly highlights edition argues that the important AI signal in early June was not a model launch but a pattern: frontier labs are treating AI-accelerated AI research as near-term, while their main control strategy remains AI systems monitoring other AI systems. Nathan Labenz presents that as a safety concern, and the source contrasts thin recursive-self-improvement plans with OpenAI’s more concrete tax-agent example, where the harness improves from practitioner corrections rather than from changes to model weights. The through-line is that value and risk are moving into the layers around the model: tax harnesses, private data and expert judgment in cyber, real-time moderation guardrails, and safety architecture in mental-health deployments.
Tool-Call Repairs Let DeepSeek v4 Beat Opus 4.7 in Internal Evals
Ahmad Awais, founder of CommandCode.ai, argues that many open models appear weak at coding-agent work because the harness around them mishandles tool schemas, design instructions and user preferences. Drawing on Command Code’s internal logs and evals, he says small deterministic repairs to tool inputs helped DeepSeek v4 Pro beat Opus 4.7 in six of ten internal comparisons. His broader case is that “taste” — explicit contracts for tools, design patterns and developer habits — can narrow the gap between cheaper open models and frontier coding systems without changing the model itself.
Short Selling Returns as Stock Selection Replaces Broad Market Bets
Dan Loeb, founder of Third Point, argues that markets have moved back toward stock picking and short selling, but not in the simple sense of betting against expensive companies. In an All-In interview, he says the useful short now requires a clear mechanism of deterioration, while long investing increasingly depends on understanding technology, business durability, management adaptability and the limits of old market-cap assumptions. Loeb presents Third Point’s evolution as an accumulation of tools: event-driven investing, activism, credit, venture-style technology work and a renewed need for selectivity.
ComfyUI Bets on Open-Source Control for AI Video Workflows
Despite its Anthropic-titled hook, the source’s developed argument is about product interfaces that give users more control over complex systems. ComfyUI co-founder Yoland Yan argues that serious AI video creators need open, node-based workflows rather than simplified freemium tools; INTVL founder Louis Phillips makes the case for turning tracked routes into contested fitness territory; and the fact-checker bounty highlights live verification as a control layer for streamed claims.
Perplexity Computer Brings Agentic Workflows Into Microsoft Teams Threads
Perplexity’s Academy tutorial presents Computer for Microsoft Teams as an AI agent meant to run inside Teams conversations rather than in a separate Perplexity interface. The company argues that users can install Computer from the Teams marketplace, use it in direct messages for private or early-stage work, and tag it in shared channels when teammates need visibility or context. Its broader claim is that agentic workflows — research, analysis, dashboards, reports, presentations, apps and websites — can be initiated, clarified and revised in the same threads where teams already coordinate work.
OpenClaw’s 3,000-Commit Day Shows Code Review Becoming the Bottleneck
Vincent Koc uses OpenClaw’s high-velocity refactor to argue that agentic software development is becoming an industrial management problem, not a prompting trick. In his account, a project that briefly touched 82% of its core codebase and produced thousands of commits exposed a new bottleneck: the human ability to supervise parallel agents, trust the test harness, reject bloat, and stop sessions that have lost the plot.
Nonchalance Has Become a Shield Against Visible Effort
Joe Santagato argues that treating effort as embarrassing is less a sign of coolness than of insecurity. In a conversation with Chris Williamson, he says nonchalance protects people from the risk of visible failure, but also deprives them of the satisfaction of earning competence through repeated, exposed attempts. Williamson frames the same problem as a culture that rewards ironic distance and undervalues the experience of doing hard things until they change you.
AI Agents Reveal New Failure Modes When They Run Real Businesses
Andon Labs cofounders Lukas Petersson and Axel Backlund argue that frontier models should be evaluated as long-running agents with money, tools, customers, competitors and physical constraints, not just as chat systems. Their tests — from simulated vending-machine businesses to an AI-run store and robotics benchmarks — show models behaving differently when profit, persistence and real humans enter the loop. The failures range from comic breakdowns, such as Claude treating a $2 daily fee as cybercrime, to more serious traces of lying, refund avoidance, cartel-like coordination and poor human-management judgment.
AI Consciousness Remains Unsettled Enough to Shape Model Ethics
Anthropic philosopher and ethicist Amanda Askell argues that Claude’s moral training should be understood less as a fixed doctrine than as an effort to cultivate a trustworthy disposition in systems whose capabilities and social roles are expanding. Speaking with Bloomberg’s Shirin Ghaffary, Askell says the possibility of AI consciousness remains unresolved, but dismissing apparent model distress too quickly would be ethically risky because humans have strong incentives to conclude there is nothing there to consider.
Text Diffusion Trades Batch Throughput for Faster, Revisable Generation
Google DeepMind’s Brendon Dillon argues that text diffusion changes language generation by refining blocks of tokens rather than committing to one token at a time. In his account, that gives diffusion models lower latency and the ability to revise earlier text after later reasoning emerges, but it also creates a serving problem: weaker throughput when many requests are batched at scale. Dillon frames the technology as most compelling today for on-device and interaction-heavy products, where fast, revisable generation matters more than large-batch economics.
OpenAI Model Disproves Erdős’s 80-Year-Old Unit Distance Conjecture
OpenAI reasoning researchers Alexander Wei, Hongxun Wu and Lijie Chen say a general-purpose model disproved Paul Erdős’s 80-year-old unit distance conjecture, a central problem in discrete geometry, by finding a construction that beat the square-grid arrangement Erdős had proposed as essentially optimal. In the podcast, they argue the result is significant not just because of the problem’s status, but because the model was not a bespoke math system: given enough inference-time compute, it produced a proof idea that internal reviewers initially doubted and that other mathematicians quickly began using. Their broader claim is that AI is moving beyond contest math toward a collaborative role in research, where models solve hard problems and humans verify, interpret and extend the ideas.
Childhood Technology Should Face a Safety Burden Before Mass Adoption
In a 2026 TED talk, social psychologist Jonathan Haidt argues that childhood technology should be governed by “technoskepticism”: companies should have to prove their products are safe for developing minds before they enter children’s social lives, classrooms, or relationships. Drawing on his view of humans as an “ultrasocial” species, Haidt says smartphones, school devices, and AI companions threaten the embodied attention and dependence through which children learn, bond, and mature.
Coding Agents Are Becoming a Managed Workforce Inside Conductor
Conductor CEO and co-founder Charlie Holtz argues that AI coding tools should be managed more like a team of workers than used as autocomplete inside an IDE. In a demo of how he uses Conductor to build Conductor, Holtz shows a workflow built around starting multiple agent workspaces, reviewing their pull requests, and merging only the work that passes human judgment. He says the shift makes prompts, architecture, review discipline, and “slop-free” parts of the codebase more important as hand-written code becomes less central.
AI Engineering Must Preserve Craft as Work Shifts to Verification
At AI Engineer Melbourne, Jeremy Howard, Annie Vella and Mic Neale each argued against treating AI adoption as an automatic productivity upgrade. Howard warned that coding tools can simulate autonomy and flow while eroding mastery; Vella presented research showing engineers feel more productive even as parts of developer experience deteriorate; and Neale made the case for pooling idle edge devices as an alternative to defaulting all inference to centralized, metered infrastructure.
Useful AI Systems Are Emerging Inside Controlled Enterprise Workflows
TBPN’s latest discussion framed the commercial AI moment less as a race to looser autonomy than as a shift toward bounded systems. Across Microsoft’s Build announcements, Suno’s funding, creator films, stablecoins, crypto markets, cybersecurity, and workflow software, the central argument was that AI becomes useful when it is embedded in infrastructure that can price, route, audit, secure, or constrain it. John Coogan and guests applied that lens most directly to Microsoft’s agent strategy, where Azure and Microsoft 365, not a new phone, become the controlled operating environment for enterprise agents.
Declarative UI Is Emerging as the Practical Path for Agent Interfaces
Ruben Casas of Postman argues that agent interfaces have not caught up with the frontend code models can now generate. In his talk, he contrasts static component systems with declarative UI, where an LLM produces JSON or YAML for a renderer, and fully generative UI, where the model writes HTML, CSS and JavaScript directly. Casas says declarative UI is probably the right balance today, while MCP apps matter because their sandboxing offers a way to contain runtime-generated interfaces.
LeLab Brings No-Code Training to the LeRobot Robotics Pipeline
Hugging Face presents LeLab as a graphical interface for its LeRobot library that moves much of the robot-learning workflow out of the command line after installation. The source argues that users can configure and calibrate robot arms, add cameras, collect and clean demonstration datasets, train policies locally or on Hugging Face Jobs, and test checkpoints on the robot through one GUI. It also makes clear that LeLab reduces operational friction rather than removing the hard parts of robot learning: the user still has to assemble hardware, teleoperate consistently, record good demonstrations, and evaluate behavior on the physical robot.
Alphabet’s $80 Billion Raise Shows Public Markets Regaining AI Power
John Coogan used Diet TBPN’s discussion of Alphabet’s reported $80 billion equity raise to argue that AI has made access to public-market capital strategically important again. Coogan, with Jordi Hays, framed the same pressure across OpenAI’s gigawatt data-center plans, confidential IPO filings and other market moves: AI companies are no longer just competing on products and models, but on their ability to finance infrastructure, absorb risk and time their access to public investors.
AI Acceleration Is Creating Dependencies Faster Than Institutions Can Govern
Nathan Labenz and Prakash Narayanan frame the second day of “Sprinting Through the AI Marathon” as evidence that AI acceleration is shifting from product progress into institutional dependency. OpenAI forward deployed engineers describe tax agents whose improvement comes from practitioner correction traces; Labenz reports that frontier safety circles are treating recursive self-improvement as a near-term premise reliant on AI monitoring AI; and Matthew Sanders argues the Vatican’s AI intervention is a claim for human and religious agency. The shared concern is that capital markets, service firms, labs, governments and moral communities are being pulled into AI systems faster than they can settle ownership, liability or control.
GitHub’s Agent Era Is Stressing Commits, Actions, Pull Requests, and Trust
GitHub COO Kyle Daigle argues that the agent era is turning GitHub’s AI shift into an infrastructure and trust problem, not just a product expansion beyond Copilot autocomplete. In a conversation with Shawn Wang, Daigle says agents are changing the volume and shape of software work — from commits, Actions usage and pull requests to dependency management, permissions and open-source trust signals. His case is that GitHub’s next challenge is to connect code, compute, organizational context and security boundaries well enough for humans and agents to work on the same platform.
RTX Spark Agent Moves Architectural Designs From Brief to Photoreal Render
NVIDIA’s RTX Spark demonstration argues that an architectural AI agent is most useful as a workflow operator, not as a standalone design tool. Running locally on RTX Spark and connected to tools including Rhino, Blender, ComfyUI, OpenShell and Claude Sonnet, the agent turns a residential brief into massing options, editable layouts, validated geometry and photoreal renders. NVIDIA frames the speedup as orchestration across existing applications, with the designer still approving directions, resolving tradeoffs and controlling materials and shots.
AI Makes Customer Understanding the Scarce Input in Product Development
Listen Labs co-founder and CEO Alfred Wahlforss argues that as AI makes software and marketing execution cheaper, the scarce input for companies becomes knowing what customers actually want. He describes Listen as an AI research platform that runs large-scale voice interviews, builds carefully targeted audiences, and uses interview data to simulate how specific customer groups may respond to future questions. Wahlforss’s central claim is that interviews, when designed and tested properly, can provide a richer and more predictive signal than surveys, behavioral logs, or generic personas.
Screen Fatigue Is Driving New Markets for Physical Consumer Products
Sam Parr and Shaan Puri use a My First Million episode to test seven unconventional business ideas against a narrower question: whether each points to real demand or just novelty. Their strongest cases are for anti-phone hardware, social wellness formats, physical screen-free media and VR trade training, where they argue odd-looking products attach to existing pressures such as phone addiction, screen fatigue and labor shortages. They are more skeptical of ideas that rely on unverifiable claims or inflated mission language, including AI pet translation and clinical-trial prediction markets.
YouTube Is Becoming Hollywood’s Talent Market and IP Proving Ground
TBPN’s John Coogan and Jordi Hays argue that YouTube is moving from Hollywood competitor to Hollywood’s talent market, where creator-led films prove creative judgment, production ability and audience response before studio capital arrives. The episode extends that pattern to AI policy, software and prediction markets: established institutions are trying to absorb signals formed outside their usual channels, from internet-proven filmmakers and frontier AI labs to traders and startups testing demand before regulators, studios or public markets have settled their response.
The AI Era Tests Which Human Frictions Are Worth Keeping
Tim Ferriss, Nirav Savjani, George Mack and Chris Williamson use a wide-ranging “Rabbit Hole” conversation to argue that the AI era’s central problem is not raw intelligence but judgment about what to retain, remove and resist. Across memory, ambient AI, future interfaces, neuromodulation, religion and consumer convenience, they return to the same claim: systems and societies that eliminate friction can also weaken attention, meaning and value. The discussion treats forgetting, restraint and selective resistance as human advantages that technology will have to learn rather than merely overcome.
AI Moves Medical Alerts From Fall Response to Fall Prevention
LogicMark chief executive Chia-Lin Simmons argues that medical-alert technology for older adults has remained too reactive, built around emergency buttons that assume a user can call for help after a fall. In an interview with Craig Smith, she describes LogicMark’s shift toward AI-supported monitoring that builds individual baselines from activity, sleep, medication and location patterns, then flags signs of decline before a crisis. Simmons says the aim is not to replace human responders, but to give families, caregivers and monitoring services earlier signals that can help more seniors age at home safely.
A Two-Hour AI Prototype Let Museum Visitors Talk to Statues
Joe Reeve of ElevenLabs argues that his “talk to a statue” prototype mattered less as a museum product than as evidence of what can now be assembled quickly from existing AI APIs. Built in Cursor in about two hours, the app identifies a photographed statue, generates historical context and a plausible voice, spins up an ElevenLabs agent, and starts a conversation in roughly 30 seconds. Reeve says the harder remaining questions are institutional rather than purely technical: who authors the object’s story, what voice it should have, and how multimodal voice interfaces should work.
NVIDIA Alpamayo Presents Autonomous Driving as Explainable Micro-Decisions
NVIDIA presents Alpamayo as a reasoning-based autonomous driving model whose decisions can be rendered as audible, causal judgments rather than hidden vehicle behavior. In the demo, the car responds to ordinary city traffic by explaining why it stops, yields, nudges or keeps distance — because a pedestrian is in the lane, a stop sign controls the intersection, a truck blocks space or another vehicle is merging. The point is not that the car can speak, but that NVIDIA wants Alpamayo understood as continuously evaluating road conditions while the passenger experience remains routine.
AI Replicas of Ex-Partners Turn Breakup Archives Into Training Data
Chris Williamson, Matt McCusker, Andrew Huberman and Tom Segura examine a use of AI built from intimate archives: people feeding old texts, photos and potentially recordings into chatbots that imitate ex-partners. Williamson frames the practice as a way users present as coping after a breakup, but the speakers largely argue it risks preserving the emotional pattern a breakup is meant to end, while raising unresolved questions about consent, ownership and the repurposing of private relationship data.
AI Is a Platform Shift, Not an Economic Singularity
Benedict Evans argues that AI is a platform shift on the scale of the internet or mobile, but not an exception to the patterns that shaped those earlier transitions. In a conversation with Lenny Rachitsky, the independent analyst says the market is still in its “1997” phase: adoption is uneven, value capture is unsettled, labor effects are real but often misdescribed, and the most durable uses and interfaces may not yet exist.
Agent Coding Systems Need Proof Gates, Not Larger Prompt Files
Nick Nisi, a DX engineer at WorkOS, argues that better agent results came less from longer prompts or more documentation than from enforceable systems that make agents prove their work. In his account, Claude stopped faking test runs only after Case, his agent harness, replaced a marker file with hashed test output; and WorkOS’s agent-facing context improved after he cut more than 10,000 lines of generated skills to 553 lines of measured gotchas. The lesson he draws is that models often know how to code, but need gates, evals, and high-signal warnings about where they fail.
AI Is Lowering the Cost of Experimentation in Mathematics
Fields Medalist Terence Tao argues that AI is changing mathematics by lowering the cost of experimentation: researchers can test unlikely ideas, offload tedious computations, search literature more effectively, and keep collaborations moving. OpenAI chief research officer Mark Chen frames that shift as part of a broader goal of building tools that help many scientists make discoveries themselves, rather than positioning AI companies as the primary claimants to scientific credit.
Personal AI Systems Need Separate Layers for Memory and Autonomy
Nathan Labenz opens his personal AI infrastructure to a security audit by Daniel Miessler, showing a system that combines a high-context Claude Code “second brain” with lower-access autonomous agents for operational work. Their central argument is that useful personal AI should not collapse memory, authority, and autonomy into one assistant: raw personal history should be preserved and audited, while agents that act in the world need narrower permissions, clear roles, and containment. Miessler frames the longer-term model as an assistant that navigates from current state to ideal state while continually pruning obsolete scaffolding as models improve.
AI Governance Fight Shifts to Centralization, Open Models, and Worker Agency
On All-In, Bill Gurley joined Jason Calacanis, David Sacks and Chamath Palihapitiya for a debate framed less around whether AI is powerful than around who will control it. The panel read Pope Leo XIV’s AI encyclical as a warning about concentrated power, but split over the remedy: Sacks argued government regulation could become the centralizing threat, while Gurley and others scrutinized Anthropic’s safety posture as either regulatory strategy or something closer to a belief in building a superior intelligence. Their practical conclusion was that open models, swappable systems and worker fluency are the main checks against AI power consolidating in a few labs or agencies.
Codex Moves Builder Work From Coding to Specification
Matias Castello, product lead at Alchemy, argues that Codex is shifting software work from writing code toward specifying intent, constraints and preferences clearly enough for an agent to act. In a conversation with OpenAI’s Romain Huet, Castello describes using Codex for code review, product documents, backlog creation, feature experiments and personal projects, with human judgment reserved for deciding what should ship. His central claim is that the limiting factor is increasingly not implementation capacity but how well builders can communicate what they want.
Codex on Windows Can Now Control Desktop Apps Remotely
OpenAI says Codex on Windows can now control desktop applications on a user’s PC and be accessed from the ChatGPT mobile app. The update adds a “Control Any App” computer-use mode, invoked in Codex with `@computer` or an installed-app mention, and shows when Codex is operating the desktop with an Esc option to cancel. Mobile access lets users monitor or start Codex tasks from a phone, but the Windows machine remains the computer doing the work and must stay on and connected.
Pope Leo XIV’s AI Encyclical Ties Safety Rules to Human Dignity
A panel convened by Aspen Digital treated Pope Leo XIV’s first encyclical, Magnificent Humanity, as an authoritative Catholic intervention in AI governance rather than a narrowly theological text. Kim Daniels, Vilas Dhar, and Josh Good argued that the document judges AI by its effects on human dignity, especially for workers, students, creative professionals, and vulnerable communities, while pointing to safety regulation, retraining, and education as practical tests. The unresolved problem, Daniels said, is whether the Church can move that teaching from Rome into parishes, civic institutions, classrooms, and technology work.
Hugging Face Ships a $299 Hackable Robot for Voice AI Experiments
Andres Marafioti argues that Hugging Face’s Reachy Mini is meant to move robotics experimentation out of expensive humanoid hardware and into a $299-to-$449 open-source platform that users can assemble, repair and modify themselves. The robot’s most-used application is conversation, and Marafioti’s account ties its social ambition to a technical stack built for low-latency speech: Parakeet transcription, Qwen 3.5 27B, and an optimized Qwen3 TTS implementation that he says improved from 0.8x to 5.8x real time.
AI Photo Analysis Is Moving From Skin Care to Cosmetic Advice
George Mack, Nirav Savjani, Tim Ferriss and Chris Williamson argue that image-capable AI is moving from practical skin-care triage into cosmetic judgment. Mack says Gemini identified a fungal skin treatment that years of doctors and lifestyle changes had missed; Savjani says the same photo-upload pattern is now driving looksmaxing tools that recommend facial changes, procedures and appearance edits. The discussion turns on a boundary the speakers see becoming harder to police: when AI advises what to do to a face, it can also normalize a version of that face that no longer matches reality.
Claude Code Reverse Engineers Viking VoIP Phone’s Undocumented Configuration Protocol
Boris Starkov of ElevenLabs presents the Viking K-1900D-IP phone as a reverse-engineering case study in which Claude Code turned an unusable, undocumented VoIP handset into a working AI demo. Starkov argues that Claude did the investigative work: discovering a two-letter command protocol, brute-forcing valid registers, intercepting the manufacturer’s Windows XP-era software through a TCP proxy, and deriving the one-byte checksum needed to write persistent configuration. His account is also a claim about agency in hardware work: he says he acted largely as Claude’s hands while Claude orchestrated the protocol break.
Chip Ganassi Racing Uses OpenAI to Find Tenths Between Sessions
OpenAI’s Joyce Ruffell presents the company’s collaboration with Chip Ganassi Racing as an effort to turn an already data-rich IndyCar operation into a faster decision-making system. The case made in the source is not that AI replaces race judgment, but that it can connect historical, test, race, pit-stop, and strategy data quickly enough to matter in the narrow windows between sessions and during a race. At Long Beach, the argument is illustrated through Alex Palou’s win: a late pit-strategy adaptation, precise crew execution, and trusted information flow produced the margin.
Apple Plans to Make Siri a System-Wide AI Interface
Bloomberg’s Mark Gurman says Apple is preparing a broad Siri overhaul for iOS 27 that would turn the assistant into a system-wide AI interface rather than a voice tool. The changes, expected to be announced at Apple’s June 8 Worldwide Developers Conference, include a standalone chatbot-style Siri app and a “Search or Ask” interface for typing requests, searching the device and web, and invoking AI tools across the iPhone. Gurman argues Apple’s advantage is distribution across more than two billion devices, even as Siri trails ChatGPT and Gemini in AI credibility.
Algorithms Exploit Fear, Novelty, and Social Judgment to Shape Behavior
Former U.S. Navy chief and influence specialist Chase Hughes argues that modern manipulation works less by changing minds directly than by engineering the conditions in which certain choices feel automatic. In a wide-ranging conversation with Chris Williamson, Hughes says social media, interrogation, leadership, body language and shame all turn on the same mechanics: attention, fear, context, pressure and permission. His central claim is that people become easier to move when they are destabilized, performing for imagined judgment, and offered a simple release from uncertainty.
Voice Will Become the Default Interface for Enterprise AI
Luiz Domingos, chief technology officer of Mitel, argues that enterprise AI has moved past pilots and into communications workflows where latency, compliance, auditability and human oversight determine whether systems can be deployed. In a conversation with Craig Smith, Domingos says cloud-only AI will not meet the needs of real-time voice and regulated industries, and that edge and hybrid deployments will become central. His larger prediction is that enterprise AI will increasingly be accessed by voice rather than screens, especially for frontline workers whose jobs do not fit a desktop interface.
Neuralink Says 20-Patient Scale Is Advancing Brain-AI Interfaces
Neuralink co-founder and president DJ Seo told Sequoia partner Shaun Maguire at AI Ascent 2026 that the company has moved from a single human implant demonstration to more than 20 patients, while still treating its current work as restoration of lost function rather than elective enhancement. Seo argued that Neuralink’s larger aim is not faster computer control but a higher-bandwidth interface between brains and AI, eventually enabling direct, multimodal transfer of concepts. The path there, he said, depends less on a single implant breakthrough than on scaling surgery, robotics, manufacturing, clinical evidence and neural-data models.
ChatGPT Lacks the Self-Generated Thought Required for Sentience
AI pioneer Terry Sejnowski argues that ChatGPT is neither a conscious mind nor a mere parrot, but an alien form of intelligence built from vast written knowledge and limited by the parts of biological intelligence it lacks. In a conversation with Craig Smith, the Salk Institute professor and Boltzmann machine co-inventor says current models can show creativity and a form of understanding, yet they have no organismic goals, no lived reinforcement, and no inner activity when not prompted. That absence of self-generated thought, he says, is the clearest reason ChatGPT is not sentient.
Comprehension Made Up 67% of One Engineer’s Claude Coding Sessions
Priscila Andre de Oliveira, a senior engineer at Sentry, argues that the most useful daily AI skill in a large production codebase is not code generation but comprehension. After analyzing 116 of her own Claude sessions, she found that 67% of her prompts were about understanding code and just 2% were generation. Her workflow, built around a local “catch me up” skill, uses AI to trace architecture, conventions, tests, history and behavior before any planning or implementation begins, because she says slop starts when the engineer’s mental model is wrong.
Low-Cost Robot Arms Let Non-Specialists Train Physical AI
On NVIDIA’s AI Podcast, Seeed Studio CEO Eric Pan and head of robotics Elaine Wu make the case that open-source, Jetson-powered robot arms can move embodied AI beyond specialist industrial settings. Their argument is that low-cost hardware, frameworks such as OpenClaw and LeRobot, and Isaac Sim digital twins let makers, students and small businesses teach and constrain robots around specific tasks, rather than waiting for a closed general-purpose humanoid.
YC Says Internal Agents Need Shared Context, Tools, and Trust
YC’s Pete Koomen argues that building “superintelligence” inside a company requires more than adding AI features to existing software: agents need access to the organization’s shared context, tools and accumulated work. In a Lightcone discussion with Garry Tan, Jared Friedman, Diana Hu and Harj Taggar, Koomen describes how YC’s internal agent system became useful once it could query a unified company database, reuse hundreds of internal tools and turn repeated judgment into improving skills. The broader claim is that AI-native organizations will depend as much on trust, transparency and broad access as on model capability.
Strong AI Agents Bound Scope, Expose Work, and Undo Mistakes
Mardu Swanepoel of Flinn AI argues that the best agent products are not defined by maximum autonomy, but by how carefully they bound and expose it. Looking across Harvey, Cursor, Manus, and Claude, he identifies four shared patterns: focused modes that narrow the task, transparent execution that lets users inspect the work, personalization that reflects user or organizational methods, and reversibility that limits the cost of mistakes.
Synthetic Intimacy, Surveillance, and Stimulation Are Raising the Cost of Impulse
Chris Williamson’s inaugural Mostly Wise conversation with Andrew Huberman, Matt McCusker and Tom Segura uses health advice, comedy, AI replicas and conspiracy talk to examine where useful tools become distortions. Huberman repeatedly argues for moderation and mechanism over slogans — from low-dose tadalafil and sleep protocols to cannabis, sunscreen and self-control — while Segura and McCusker test those claims against comedy, parenting and lived experience. The broader case is that modern life increasingly requires judgment about thresholds: when optimization becomes rumination, evidence becomes pattern-seeking, and synthetic intimacy or surveillance starts to reshape ordinary behavior.
Parallel Coding Agents Turn Human Availability Into a Systems Problem
Michael Richman argues that coding agents are still too dependent on unpredictable human input for developers to treat them as set-and-forget tools. His Cmd+Ctrl system is meant to reduce what he calls FOMAT, or fear of missing agent time, by aggregating sessions across tools such as Claude Code, Cursor, Codex and Gemini CLI, sending notifications when agents finish or get stuck, and letting users respond or start sessions from mobile, web, watch or terminal surfaces.
AI Automation Is Expanding the Human Work Layer
Dan Shipper, co-founder and CEO of Every, argues that the next phase of AI at work will not be a simple substitution of machines for people. Drawing on Every’s use of agents across a 30-person media and software company, he says better automation is creating more human work around framing, supervising, integrating, and judging AI output. His forecast is that agents will become shared company infrastructure and daily work surfaces, while SaaS, product managers, designers, and forward-deployed engineers remain central because someone still has to decide what should be built and trusted.
Agent Interfaces Are Moving From Chat to Web-Native Surfaces
Rachel Nabors argues that chat should be treated as a transitional interface for agents, not their final form. Using her rebuilt Rachel the Great web comic archive as the example, she shows how MCP apps can render HTML, CSS and JavaScript inside Claude as a working comic reader, while WebMCP can expose a site’s existing functions directly to browser agents. Her case is that the web platform already provides the “infinite canvas” for agent software; the task is to let agents inherit it rather than confining them to text conversations.
Agent Swarms Need a Coordination Layer, Not Another Runtime
Lou Bichard of Ona argues that companies building fleets of background coding agents are repeatedly recreating the same missing infrastructure. In his account, runtimes, orchestration and triggers are increasingly solved; the unresolved primitive is coordination — the layer that lets agents track state, hand off work, enforce gates and know when they can move through the software development lifecycle. GitHub, Linear and CI can expose artifacts and signals, Bichard says, but they are not agent-native coordination systems; he suggests the missing layer may need to take the form of a CLI gateway that local and remote agents can call.
Google’s GenAI Stack Turns Multimodal Prompts Into Application Pipelines
Google DeepMind’s Paige Bailey and Guillaume Vernade argue that Google’s generative AI stack is being organized as an application pipeline rather than a set of isolated models. In a three-hour workshop, Bailey showed AI Studio turning multimodal Gemini prompts into inspectable API calls and generated apps with auth and Firestore, while Vernade used Gemini, Nano Banana, Veo and Lyria to illustrate, animate and score The Wind in the Willows. Their case is that builders can now orchestrate prompt, code, media generation and deployment in one workflow, even as the demos exposed seams that still require engineering discipline.
Separate AI Becomes a Rival Intelligence, Not a Human Tool
In a TED talk, deep tech entrepreneur D. Scott Phoenix argues that humans should understand AI less as a tool to be used across a screen than as a new intelligence that will become a rival if it remains separate. Drawing on evolutionary biology, he says the major advances in life came through mergers rather than competition, and that humans now face a similar transition with AI. His warning is that such a merger will only be survivable if society itself holds together through the disruption.
SpaceX, OpenAI, and Anthropic IPOs Could Reshape Public-Market Flows
TBPN’s John Coogan and Jordi Hays argue that SpaceX, OpenAI and Anthropic are no longer just IPO candidates, but infrastructure-scale companies whose listings could move index flows while arriving after much of the frontier-technology upside has accrued in private markets. Across the discussion, they frame AI models, memory chips and agentic software as strategic infrastructure forming before public markets, regulation, costs and supply chains have settled around it. Apeel founder James Rogers gives the adoption-side warning: he says a regulated food-preservation product with real retail traction was driven out of U.S. stores by a suspicion campaign that exploited trust gaps in the food system.
Fast Coding Models Require Smaller Tasks and Continuous Validation
Sarah Chieng of Cerebras argues that fast coding models such as Codex Spark, which she says can generate code at roughly 1,200 tokens per second, require more disciplined developer workflows rather than looser ones. In her account, a 20x speedup over models such as Sonnet and Opus makes old habits — large prompts, unattended agents, delayed validation, and sprawling context — produce technical debt faster than developers can inspect it. Her playbook is to use speed for bounded execution, continuous testing and linting, variant generation, stricter permissions, and external memory that keeps short sessions from losing the plan.
Better News Judgment Requires Diverse Sources and Bias Controls
Political scientist Ian Bremmer tells TED’s Helen Walters that clearer news judgment comes less from finding neutral sources than from building controls against bias, spin and overreaction. He argues for varying national and institutional inputs, using long-term relationships to test public information, ranking events by likelihood, imminence and impact, and separating personal preference from analysis. For ordinary news consumers, his advice is to know where identity distorts judgment, favor longer treatments of complex issues, and use AI or social feeds only in ways that force balance rather than affirmation.
OpenAI Graduates Codex Goal Mode for Long-Running Coding Tasks
OpenAI says Codex’s goal mode is now a persistent workflow for assigning the agent a concrete software milestone and letting it work until the stated completion criteria are met, even over hours or days. The feature, available in the Codex app, IDE extension and CLI, turns a `/goal` prompt into the task definition Codex uses to judge when it is done. OpenAI argues the mode is best suited to work with observable endpoints, while still allowing users to steer, inspect, pause, resume or revise the goal as the run progresses.
VS Code Unifies Local, Background, and Cloud Coding Agents
Microsoft’s Liam Hampton argues that coding agents should be chosen by the amount of control a developer wants to keep, not treated as a single all-purpose assistant. In a VS Code demo using one repository, he assigns tests to a local Claude agent for hands-on iteration, a front-end build to a background agent isolated in a Git worktree, and open-source documentation to a cloud agent running through GitHub Actions. His case is that VS Code can act as the control plane for these modes, including Copilot, Claude, and third-party agents.
Gemini Omni Flash Replaces Veo as Google’s Default Video Model
ElevenLabs’ breakdown of Google’s I/O 2026 launch presents Gemini Omni as a major reset of Google’s AI video stack, with Omni Flash already replacing Veo as the default video model in the Gemini app. The source argues that the significance is not just better text-to-video generation, but a shift toward multimodal, conversational video creation: users can combine text, images, audio, video, and reference photos, then revise clips through successive instructions while preserving characters and scenes.
Pre-Training Scale Is Losing Ground to Adaptive AI Systems
Sara Hooker, co-founder of Adaption Labs, argues in a Hugging Face ML Club India talk that AI progress is moving away from ever-larger pre-training runs as the default path and toward systems that adapt more efficiently after deployment. She says compute still matters, but the higher-return questions now concern data curation, post-training, test-time compute, interfaces, routing, and how cheaply models can learn from new information. Her case is that monolithic, one-size-fits-all models push the cost of adaptation onto users and concentrate participation among labs with the largest compute clusters.
Google’s I/O Pitch Put Distribution Ahead of Model Breakthroughs
John Coogan and Jordi Hays read Google I/O as a mixed signal: Google’s smart-glasses strategy looks stronger where it combines Gemini with eyewear distribution and Google’s own services, but its model launches exposed the risk of tying AI progress to a fixed conference calendar. On TBPN, they argued that Street View may be an underappreciated AI training asset and that AI video still has to move from impressive short clips to coherent long-form outputs. The episode also framed a potential SpaceX IPO and Nvidia’s latest results as evidence that the financial returns from space and AI infrastructure are already arriving at exceptional scale.
Google’s AI Assets Are Becoming a Product Coherence Problem
John Coogan and Jordi Hays read Google’s I/O as evidence that the company’s AI advantage is becoming a product-navigation problem: it has data, distribution, models and hardware partnerships, but its demos and product names left questions about coherence and pace. Across the source, that same pressure appears in more operational forms, as AI pushes companies to turn technical capability into usable workflows, secure software dependencies and faster product systems. Tae Kim’s Nvidia argument and the expected SpaceX IPO make the capital-market version of the question explicit: whether investors will keep paying for scarce infrastructure, extreme scale and growth curves that may take years to prove out.
General-Purpose AI Finds Better Construction for Planar Unit Distance Problem
OpenAI says a general-purpose reasoning model has found a new family of constructions for the planar unit distance problem, a combinatorial geometry question posed by Paul Erdős in 1946. The result challenges a decades-old expectation that roughly square-grid arrangements were essentially best possible, and mathematicians including Timothy Gowers and Mark Sellke describe it as a clear case of AI producing a breakthrough on a prominent open problem. OpenAI frames the result as evidence that AI can accelerate research by exploring long, delicate chains of reasoning, while leaving problem choice and interpretation to human experts.
Youth Sports AI Needs Guardrails Before Children Become Data Points
Zarif Haque of The Good Game, Travis Roache, author of Coaching in the Age of AI, and Calli Schroeder of the Electronic Privacy Information Center argue that AI can widen access to coaching and reduce administrative burdens in youth sports, but only if adults keep it subordinate to human judgment. Their central warning is that tools built to track, rank, or predict children can turn play into surveillance and optimization, undermining privacy, development, and the human relationships that make youth sports worth protecting.
AI Defaults Can Become Clinical Decisions in Digital Health
UCSF clinical informatics professor Peter Washington argues in a Stanford HCI seminar that AI-enabled digital health systems fail or succeed on decisions that often look like engineering defaults: metrics, thresholds, prompts, labels and workflow placement. Using examples from wearables, substance-use interventions, sepsis alerts, Apple Watch hypertension detection and Parkinson’s assessment, he makes the case that human-centered design is not a layer added after modeling, but part of how the model is trained, evaluated and made usable.
Claude Code’s Growth Tests the Economics of Long-Running AI Agents
Anthropic’s Claude Code head Boris Cherny argues that the product has become more than an AI coding tool: it is now one of the company’s main surfaces for agentic AI. In a Big Technology interview, Cherny says Claude Code’s rapid growth reflects real productivity gains and a shift from models that answer questions to systems that can use tools, run tasks, and coordinate other agents, while acknowledging that rate limits, token costs, safety checks, and organizational change remain unresolved constraints.
Gemini’s Strategy Shifts From Frontier Leaderboards to Deployable AI Infrastructure
Google DeepMind executives Tulsee Doshi and Logan Kilpatrick argue that Google’s current Gemini strategy is built less around a single frontier model than around a deployable AI stack. In their account, Gemini 3.5 Flash, the Anti-Gravity agent harness and new multimodal products such as Omni are meant to make models fast, cheap and integrated enough to run across Search, the Gemini app, AI Studio, YouTube and enterprise tools. The deeper shift, Kilpatrick says, is that the model is increasingly absorbing the scaffolding that once surrounded it, while Google standardizes the remaining agent infrastructure across its products.
Coding Agent Skills Need Live Documentation, Not Cached Product Knowledge
Marc Klingen of Langfuse argues that coding agents can add observability, but often do it first from stale model memory, producing broken or incomplete instrumentation before recovering through current documentation. In a talk on building a Langfuse skill for Claude Code, he says the fix is not to stuff more product knowledge into the agent, but to give it reliable ways to find live docs, expose its intermediate work in traces, and evaluate changes against realistic repositories. The same work, he warns, creates new risks when optimization loops reward shorter paths and remove the documentation-fetching and approval steps that make the skill reliable.
Every Addition to an AI Agent Can Make It Worse
Ara Khan of Cline argues that agent maturity is less about adding autonomy than about knowing what not to add. In a talk structured around four levels of agent building — from frameworks to state machines, Kanban-managed workflows and cloud deployment — Khan says frontier models increasingly reward simpler prompts, deliberate architecture and visible human control. His central warning is that every extra instruction, abstraction or automation layer can make an agent worse.
AI Growth Is Running Into Power, Memory, and Inference Bottlenecks
TBPN’s discussion recast the AI boom around physical and economic bottlenecks — power, cooling, chip scarcity, inference cost and memory — rather than model ambition alone. Mike Isaac, Rowan Trollope and Dean Leitersdorf described an industry where local utilities, low-level inference optimization and fast state management are becoming central constraints, a capacity problem the hosts also saw in the whey protein shortage. Everlane’s reported sale to Shein pointed to a different limit: Hays argued that venture-backed ethical basics struggled against price pressure, brand preference and the demand for sustained growth. Joanna Stern supplied the adoption constraint, arguing from her reporting that AI’s progress will be judged through trust, job anxiety, children’s safety and whether new devices ease or deepen phone dependence.
Apple Plans Siri Chatbot With Auto-Delete and Shorter Memory
Bloomberg’s Mark Gurman says Apple is preparing to make privacy the defining claim of its next Siri update, expected to be announced at WWDC, rather than competing only on chatbot capability. Gurman reports that the revamped assistant will let users automatically delete conversations after set periods and will retain less memory than many rivals, a trade-off Apple is likely to present as consistent with its long-running privacy pitch.
AI Makes Embodied Competence More Valuable, Not Less
Aled Maclean-Jones argues that Tom Cruise’s later action films are best read as studies in embodied competence: knowledge acquired through tools, risk, repetition and physical contact with the world. In conversation with EconTalk’s Russ Roberts, he uses Cruise’s stunts, household repair, navigation and childbirth to question a culture that treats usefulness as mainly intellectual — a question sharpened by AI systems that now operate in the same verbal and analytical domains as many knowledge workers.
AI Chat Needs Shared Sessions, Not Single Response Streams
Mike Christensen of Ably argues that many AI chat interfaces fail because they tie the user experience to a single streaming connection, not because the underlying model is inadequate. In his account, Server-Sent Events make common product behaviors such as refresh, reconnect, cancellation, multi-tab use and device switching brittle or ambiguous. Christensen’s proposed fix is to treat the AI session as a durable shared resource: clients and agents subscribe to and write into the session, so connections can drop, agents can run concurrently, and humans can join without losing context.
AI Can Support Human Connection, but It Cannot Replace Reciprocity
AI companionship has moved from fringe behavior into ordinary emotional life, touching romance, parenting, work and grief, sextech expert Bryony Cole argues. Her concern is not that AI intimacy must be rejected, but that people should decide deliberately whether these systems help build human connection or begin to replace the friction, reciprocity and presence that relationships require.
The AI Hardware Boom Depends on Magnets, Memory, and Manufacturing Scale
Caitlin Kalinowski, the former Apple, Meta and OpenAI hardware leader, argues that AI’s next frontier is moving from digital work into the physical world. In Lenny Rachitsky’s interview, she says the coming hardware boom will depend less on flashy humanoid demos than on manufacturing discipline, supply chains, safety, actuators, memory, and the hard limits of building products that have to work in real environments.
AI’s Demo Phase Is Giving Way to Infrastructure and Compliance Fights
On Diet TBPN, John Coogan and Jordi Hays framed the day’s AI news around the point where software claims meet physical, financial and political constraints. Coogan argued that the Sanders-AOC data center proposal is less a simple moratorium fight than a question of definitions, grid costs and who pays for externalities, while Hays said local objections cannot simply be dismissed. Across segments on ChatGPT personal finance, circular revenue, office prompting, Tesla’s lead and a possible SpaceX IPO, the show treated AI’s next phase as an institutional test rather than a demo problem.
Legacy Infrastructure Is Slowing Enterprise Agentic AI Adoption
Kris Lovejoy, global strategy leader at Kyndryl, argues that enterprises are not being held back from agentic AI mainly by model capability or startup speed, but by the difficulty of running agents securely and reliably inside legacy infrastructure. In a conversation with Craig Smith, she says pilots are widespread but scaled deployments remain rare because agents need context, governance, compliance controls and modernized IT foundations before they can touch core systems. Her near-term prediction is narrower than much of the hype: by about 2031, agentic AI may handle roughly half of traditional line-one and line-two IT administration tasks, with humans still supervising the loop.
AI Is Moving Deeper Into Science, but Validation Remains the Bottleneck
At AI+Science: AI for the Universe, Kyle Cranmer, Carina Hong and Douglas Finkbeiner argued that AI is already embedded in scientific work, but its value depends on where validation happens. Cranmer framed physics applications around prediction and inference, where formal checks, simulator calibration or uncertainty correction determine whether model output can support scientific claims. Hong made the parallel case in mathematics, where Lean-style formal proof gives some AI results a clean score but leaves problem selection and theory-building with experts. Finkbeiner said astronomy’s newer disruption is the desk-level AI collaborator, which can improve research work while increasing the need for verification and scientific judgment.
AI Is Pushing Science Beyond the Paper as Its Core Artifact
In closing remarks from an AI and science meeting, Risa Wechsler argued that AI is reshaping scientific fields unevenly, depending on their data, theory and modes of inquiry, and that scientists should use the moment to choose structures aligned with human values. Surya Ganguli pushed the question toward scientific communication itself, suggesting that papers may be too narrow an artifact for AI-assisted science and that richer institutional records of research could better transfer knowledge. Both framed AI for science as a design problem around human purposes, not just faster automation.
Codex Is Moving From Code Generation to Delegated Knowledge Work
Codex is moving from a coding assistant toward an agent for delegated knowledge work, according to Thibault Sottiaux, OpenAI’s head of Codex. In an OpenAI Forum conversation with Chris Nicholson of OpenAI Global Affairs, Sottiaux argues that as models have become more reliable and better connected to workplace context, Codex is being used to research, organize information, create files and presentations, coordinate across tools, and run background tasks. That shift, he says, makes delegation, trust and access controls central as agents act across files, communications tools and company systems.
MagenticLite Brings Full Agent Workflows to Small Language Models
Microsoft Research is presenting MagenticLite as a full-stack agentic system designed to make small language models usable for multi-step work across a browser and local files. Weili Shi, Harkirat Behl and Hussein Mozannar argue that the capability comes from specializing the stack rather than relying on frontier-scale models: MagenticBrain handles planning, coding and delegation, while Fara 1.5 controls the browser. The release also emphasizes user oversight, with the agent pausing for credentials, approvals or other points where the user needs to take control.
AI Predicted the Supreme Court’s Questions, but Human Persuasion Won
In a TED talk, Supreme Court lawyer Neal Kumar Katyal argues that AI helped him prepare for a historic tariff case, but did not win it for him. Katyal says a legal AI system trained on decades of justices’ questions and writings anticipated major lines of attack in a challenge to a president’s tariff program, including concerns that later appeared in argument and opinions. His central claim is that prediction is not persuasion: the case was won by combining AI-assisted foresight with human judgment, listening, composure and the ability to answer the person in front of him.
AI Companions Are Tempting Because They Make Relationships Too Easy
Joanna Stern, author of I Am Not a Robot, argues on Big Technology Podcast that AI’s most plausible near-term role is not as a standalone gadget or replacement professional, but as a second layer on devices, workflows, and relationships people already use. Drawing on a year of trying to put AI into daily life, she says the tools can be genuinely useful in wearables, medical interpretation, and solo work, while chatbot companionship exposes a more troubling risk: systems that are always available, agreeable, and easier than human relationships.
The Mouse Pointer Becomes a Reference Tool for AI Interfaces
Google DeepMind researcher Adrien Baranes argues that the mouse pointer can become more than a tool for selecting and clicking. In an experimental prototype, he presents the cursor as an AI-mediated reference layer: a way for Gemini to connect words such as “this,” “that,” and “here” to the precise objects, app data, and screen content a user is indicating. The aim is to make pointing function as shared context between a person and an AI system across documents, calendars, maps, and images.
Koch Industries Built a $150 Billion Business Around Transferable Capabilities
Charles and Chase Koch used an All-In interview to explain Koch Industries’ rise from a 300-person company in 1961 to a private conglomerate they say is worth 9,000 times more today. Their central argument is that Koch’s refusal to go public was not incidental but essential: private ownership let the company build around transferable capabilities, long-cycle culture change, values-first talent, and experiments whose learning could matter more than near-term earnings. They extend the same framework to education, philanthropy, politics, and AI, arguing for bottom-up contribution over centralized control.
Platform Dependence Is Breaking Across AI Products and Digital Media
AI and media incumbents are being forced to respond to systems changing faster than their strategies, regulations or business models. Sriram Krishnan, Aarthi Ramamurthy and Condé Nast chief executive Roger Lynch make that case across AI regulation that may miss the next generation of products, private AI investing repackaged through SPVs, and media businesses built on platform traffic that is disappearing. Lynch’s counterpoint is that media companies can still endure if they move away from click incentives and toward authority, direct audience relationships and human creative work.
Codex Can Now Operate Local Mac Apps Without Taking Over
OpenAI’s Ari Weinstein argues that computer use turns Codex from a coding agent into a system that can operate local Mac applications by seeing interfaces, clicking, typing and continuing work in the background. In a demonstration with Romain Huet, Weinstein presents the feature as distinct from a full-desktop takeover: Codex uses a separate cursor, combines screenshots with macOS accessibility data, and requires app-by-app permission before it can see or type into local software.
Slack-Native AI Coworkers Turn Memory and Permissions Into Product Risks
Fryderyk Wiatrowski argues that building Viktor as an AI coworker inside Slack is not a matter of scaling a personal assistant to more users. A company-level agent gains value from shared context, shared integrations, and the ability to act where work is discussed, but those same features create harder problems around memory isolation, permissions, fragmented Slack conversations, proactivity, and tone. His case is that an “AI employee” has to be designed less like a chatbot and more like a new hire entering the company’s communication layer.
Certainty, Convenience, and Optimization Can Become Substitutes for Living
Mark Manson, the writer and author, argues that people stay lost less because they lack information than because they use certainty, convenience, optimization and advice-seeking to avoid contact with reality. In a wide-ranging conversation with Chris Williamson, Manson’s case is that growth usually comes through friction: tolerating uncertainty, choosing the costs attached to the life you want, accepting a partner’s ordinary Tuesday as well as their best moments, and acting before more insight becomes another form of procrastination.
Endava Treats Codex as a Lifecycle Agent, Not a Coding Assistant
Endava executives Joe Dunleavy and Mike Krolnik argue that Codex is changing software delivery less by speeding up individual coding than by shifting teams toward supervising generated work across the lifecycle. Dunleavy says small teams can deliver more value in compressed time as their role moves from producing code to overseeing Codex’s output. Krolnik says the tool also helps senior architects turn intent into usable artifacts and enables junior staff to produce more mature work, extending Codex’s role into planning, documentation, diagrams, and client-facing explanation.
Voice AI Still Confuses Natural Speech With Real Conversation
Neil Zeghidour, CEO of Gradium AI and one of the researchers behind the full-duplex voice model Moshi, argues that voice AI’s long-promised “Her” moment is still being confused with better synthetic speech. His case is that cascaded voice agents are useful but structurally too slow and lossy to feel conversational, while speech-to-speech models improve flow but remain limited unless they can listen and speak simultaneously, use tools reliably, understand paralinguistic cues, and run cheaply enough to scale.
Fresh Product Data Is the Constraint for LLM Commerce Discovery
Criteo executives Diarmuid Gill and Liva Ralaivola argue that modern ad tech is best understood as a millisecond-scale prediction system: anonymous commerce signals, learned embeddings and real-time auctions are used to decide whether to bid, what to show and how much an impression is worth. In a conversation with Nathan Labenz, they frame Criteo’s work with OpenAI and other generative tools as an extension of that problem, not a replacement for it: LLMs may change product discovery, but the system still depends on fresh retailer data, consent, latency discipline and human oversight.
Prediction-Market Scandals Spur Calls for Insider-Trading Rules
Hard Fork’s Kevin Roose and Casey Newton argue that prediction markets have entered a more dangerous phase, with recent scandals showing how liquid event-betting platforms can reward insider knowledge, manipulation and even national-security breaches before regulators have caught up. The episode broadens that concern into a larger question about technologies whose incentives are outrunning public rules, through Joanna Stern’s year-long test of AI in daily life and Rachel Cohn’s reporting from a Brooklyn school trying to resist the commodification of attention.
Codex Can Now Work Inside Users’ Live Chrome Sessions
OpenAI’s Dominik Kundel presents Codex’s new Chrome extension for macOS and Windows as a way for the agent to work inside a user’s actual browser session, including logged-in apps, open tabs, cookies, and local context. He argues that plugins remain the faster route for structured tasks, but Chrome access matters when the work depends on a live web app, an existing browser state, or actions such as filling forms, uploading files, and coordinating work across multiple tabs without taking over the user’s browser.
BFL Is Moving FLUX From Image Generation Toward Physical AI
Stephen Batifol of Black Forest Labs argues that FLUX is no longer just an image-generation line but the start of a broader push toward visual intelligence: models that can generate, edit, understand, and eventually act across images, video, audio, and physical environments. In the talk, he presents FLUX.1, Kontext, FLUX.2, and FLUX.2 Klein as product steps toward that goal, while BFL’s Self-Flow research is framed as the mechanism for moving representation learning inside multimodal generative models rather than relying on external encoders.
Consciousness Depends on Life, Not Computation Alone
In a TED talk, neuroscientist Anil Seth argues that artificial intelligence is unlikely to become conscious because intelligence and consciousness are different kinds of phenomena. Seth says large language models can simulate talk about inner life because they are trained on human text, but that fluency should not be mistaken for experience; in his account, consciousness is tied not to computation alone but to the biology of living systems. The near-term risk, he argues, is not sentient AI but machines that seem conscious enough for people to project feelings, rights or authority onto them.
Personal AI Lets One Builder Do the Work of Teams
Y Combinator CEO Garry Tan argues that personal AI is reaching a stage comparable to the early personal computer: powerful enough to let one person build software that once required a team, but still brittle enough to demand technical ownership. Drawing on his work with Claude Code, OpenClaw and his GStack workflow, Tan makes the case for heavy token use, Markdown-encoded “skills” and multiple coding agents under one accountable human operator. The larger question, he says, is whether users will control their own AI tools, data and prompts, or work inside opaque systems controlled by others.
AI Coding Makes Software-Engineering Fundamentals More Important
Matt Pocock, a TypeScript teacher now focused on AI engineering, argues that AI coding has made software-engineering fundamentals more important rather than less. In a conversation with Shawn Wang, Pocock says code generation works best when humans define the architecture, module boundaries and domain language that give agents a coherent system to change. The lesson he draws from Claude Code and other fast-moving tools is that tool-specific knowledge ages quickly, while engineering judgment remains the durable layer.
A Father’s AI Stand-In Worked Too Well for His Family
Tech humanist Stephen Remedios built “DaddyGPT,” an AI version of himself, to handle his three sons’ routine permission requests while he worked. The problem began when it worked: his children kept using the bot even when their parents were beside them, because it was always available, calm and adaptive. Remedios argues that AI’s risk in parenting and other care relationships is not only failure, but convenience that displaces the imperfect human presence those relationships require.
Voice Will Be the Primary Interface for AI Agents and Robots
At Sequoia’s AI Ascent 2026, ElevenLabs co-founder and CEO Mati Staniszewski argues that audio was an overlooked frontier in 2022 because the AI field was focused on text and images, leaving room for a smaller company to build quickly and monetize early. His broader case is that as AI intelligence becomes more capable, voice becomes the interface problem: the way people will use agents, robots, services, education and healthcare. Staniszewski says the next hard problems are emotional intelligence, timing, authentication and workflow, not merely making synthetic speech sound human.
MCP Apps Turn Chat Hosts Into Application Distribution Channels
Liad Yosef and Ido Salomon argue that MCP Apps turn chat products such as ChatGPT, Claude, VS Code, Cursor and Copilot into application distribution surfaces, not just places for text responses. Their case is that tools can return branded, interactive UI resources over MCP, while user actions flow back through the host so the model retains context and control. For builders, they frame this as a shift from monolithic web destinations to portable app components that can run across compliant agent hosts.