NVIDIA

NVIDIA is a computing company founded in 1993, known for GPUs and accelerated computing, with products spanning PC gaming, computer graphics, AI, industrial digitalization, and data-center-scale infrastructure.

NVIDIA Says Agentic AI Is Forcing a Redesign of Enterprise Computing

At GTC Taipei during COMPUTEX, NVIDIA founder and chief executive Jensen Huang argued that agentic AI and frontier models have already changed the computer industry. The company’s case was that enterprises now need full agent-building infrastructure, AI-capable PCs such as RTX Spark represent a break from the old laptop model, and production hardware including Vera Rubin will underpin the next phase of AI computing. NVIDIA framed that shift through Taiwan’s manufacturing ecosystem, presenting Taipei as both industrial partner and symbolic home.

Jensen Huang · Wayne ChiangJun 8, 20264 min read

AI Infrastructure Is Shifting From Accelerator Racks to Distributed Agent Systems

At Dell Technologies World, Nvidia chief Jensen Huang and Dell CEO Michael Dell argued that enterprise AI is moving from experimental promise to operational infrastructure, with agentic systems driving a sharp increase in compute demand. Huang said agents change the workload from single prompt-response transactions to long-running loops of reasoning, planning and tool use, while Dell framed the response as a pragmatic push toward distributed, “unmetered” intelligence across PCs, data centers and cloud-scale systems.

Michael Dell · Jensen HuangJun 5, 20267 min read

NVIDIA RTX Spark Recasts Windows PCs as Local AI Agent Machines

NVIDIA chief executive Jensen Huang used his GTC Taipei keynote to present RTX Spark as the basis for a new class of Windows PCs built around personal AI agents. His argument was that the PC needs an abstraction layer comparable to the one that made the original Windows ecosystem work: existing applications, CUDA workloads and games still run, but large language models and agent runtimes become part of the operating environment.

Jensen HuangJun 4, 202610 min read

Microsoft and NVIDIA Redesign PCs and Data Centers for Agentic AI

At Microsoft Build, NVIDIA chief executive Jensen Huang joined Microsoft chief executive Satya Nadella to frame their expanded partnership around a single premise: agents are becoming a primary computing workload. Huang argued that this shift requires redesigning PCs, data centers and software together, from RTX Spark devices that can run local autonomous assistants to Grace Blackwell and Vera Rubin systems built for large-scale reasoning and low-latency agent execution. Nadella positioned the work as an extension of Microsoft’s infrastructure and developer platform strategy across Windows, Azure, Fabric, Foundry and GitHub.

Jensen Huang · Satya NadellaJun 3, 20266 min read

NVIDIA Frames Cosmos 3 as Compute-Generated Data for Physical AI

NVIDIA presents Cosmos 3 as an open foundation model for physical AI, built to address what it frames as a data-scaling problem in robotics, autonomous vehicles and other systems that operate in the physical world. The company argues that real-world data cannot capture enough variability on its own, so compute must generate usable training and evaluation signals: synthetic video, predicted sensor outputs, simulation loops and action plans. Cosmos 3 is positioned as a post-trainable mixture-of-transformers system that combines multimodal reasoning with generation to support perception, prediction, simulation and action.

Jun 2, 20265 min read

NVIDIA Positions 1,000 CUDA-X Libraries as Physical AI Infrastructure

NVIDIA’s GTC Taipei and COMPUTEX 2026 montage presents CUDA-X as the software stack that extends CUDA from an accelerated-computing architecture into what the company calls the algorithmic foundation for physical AI. NVIDIA argues that more than 1,000 CUDA-X libraries now support simulation and engineering work across domains including molecular science, robotics, factory automation, autonomous systems and Earth-scale digital twins, with the visual evidence explicitly framed as computer graphics and simulation rather than generative AI.

Jun 2, 20267 min read

DSX MaxLPS Claims 45% More GPUs Inside a 1 GW Power Budget

NVIDIA is positioning DSX as a control stack for gigawatt-scale AI factories where the binding constraint is usable power rather than installed hardware. In its press release and technical blog, the company argues that DSX Sim, MaxLPS, Flex and OS let operators design, validate and run facilities as integrated power, cooling, compute and grid systems, increasing GPU capacity inside fixed power budgets. The central claim is that AI infrastructure economics will depend on maximizing reliable tokens per watt, not simply adding more racks.

Jun 2, 20265 min read

NVIDIA Says Vera Rubin Is in Full Production for Agentic AI

NVIDIA says its Vera Rubin platform is now in full production, positioning it as a pod-scale “AI factory” for agentic workloads rather than a conventional accelerator launch. The company argues that agents shift the bottleneck from model execution to full-system orchestration — reasoning, memory, tool use, low-latency token generation, storage, networking and power — and that Vera Rubin addresses this through five connected rack-scale systems. NVIDIA frames the milestone as both a technical and manufacturing claim, built on extreme co-design across chips, racks, data centers and Taiwan’s supply chain.

Jun 2, 20265 min read

RTX Spark Agent Moves Architectural Designs From Brief to Photoreal Render

NVIDIA’s RTX Spark demonstration argues that an architectural AI agent is most useful as a workflow operator, not as a standalone design tool. Running locally on RTX Spark and connected to tools including Rhino, Blender, ComfyUI, OpenShell and Claude Sonnet, the agent turns a residential brief into massing options, editable layouts, validated geometry and photoreal renders. NVIDIA frames the speedup as orchestration across existing applications, with the designer still approving directions, resolving tradeoffs and controlling materials and shots.

Jun 2, 20265 min read

NVIDIA Frames Tokens as the Industrial Output of AI Factories

NVIDIA’s GTC Taipei keynote intro presents tokens as the manufactured output of a new “AI factory,” turning data into knowledge, reason and action across scientific, medical, robotic and industrial systems. The company argues that its accelerated computing platform, built with partners in Taiwan, is the infrastructure behind that production model, with Taipei positioned as the starting point for an AI industry that extends from data centers to cities, healthcare, factories and space.

Jun 2, 20266 min read

NVIDIA Frames AI Agents as the Workload Driving Its Compute Stack

NVIDIA’s closing video for Jensen Huang’s GTC Taipei 2026 keynote recast the company’s announcements around a single claim: “useful AI” now means agents doing work. In the recap, NVIDIA ties that workload to demand for Vera Rubin inference performance, cheaper tokens, BlueField memory support, enterprise guardrails, Windows PCs, DGX infrastructure and robotics systems. The argument is that agents are no longer a novelty layer on top of computing, but the demand signal connecting NVIDIA’s silicon, software, cloud and physical AI stack.

Jun 2, 20265 min read

NVIDIA Says Vera Runs Agentic Tasks 80% Faster Than x86

NVIDIA is pitching Vera as a data center CPU built for the CPU-side work created by agentic AI, not as a conventional cloud processor optimized mainly for core count and virtualization. The company argues that as agents run Python code, tool calls, retrieval, sandboxed execution and data orchestration around GPUs, CPU delays become a constraint on GPU utilization, throughput and latency. Vera’s case rests on NVIDIA’s custom Olympus cores, LPDDR5X memory bandwidth, a coherent 88-core fabric and NVLink-C2C links into GPU systems, extending its AI platform from acceleration into orchestration.

Jun 1, 20265 min read

NVIDIA Says Isaac GR00T Cuts Humanoid Robotics Setup From Months to Hours

NVIDIA is making the case that humanoid robot development is being slowed less by model ambition than by the repeated work of assembling simulation, teleoperation, data, training and deployment infrastructure. Its Isaac GR00T platform is presented as an open, modular stack that can cut setup from months to hours by connecting Isaac Lab, Omniverse, Cosmos, Isaac ROS and Jetson Thor in one development path. The company also introduces a Jetson Thor-based reference humanoid robot meant to give research teams a starting hardware design for skill development and real-world validation.

Jun 1, 20265 min read

NVIDIA Positions RTX Spark as a 128 GB Local AI Workstation

NVIDIA’s Computex preview positioned RTX Spark as a compact Windows platform for local AI, creative production and RTX gaming, built around a new superchip pairing a Blackwell RTX GPU with a Grace CPU. Jacob Freeman and other NVIDIA presenters argued that its 128 GB of unified memory and RTX acceleration allow slim laptops and small desktops to run larger local agents, handle heavy creative scenes and support modern ray-traced games with DLSS 4.5.

Gerardo Delgado · Joel Pennington · Jacob FreemanJun 1, 20265 min read

NVIDIA Alpamayo Presents Autonomous Driving as Explainable Micro-Decisions

NVIDIA presents Alpamayo as a reasoning-based autonomous driving model whose decisions can be rendered as audible, causal judgments rather than hidden vehicle behavior. In the demo, the car responds to ordinary city traffic by explaining why it stops, yields, nudges or keeps distance — because a pedestrian is in the lane, a stop sign controls the intersection, a truck blocks space or another vehicle is merging. The point is not that the car can speak, but that NVIDIA wants Alpamayo understood as continuously evaluating road conditions while the passenger experience remains routine.

Jun 1, 20265 min read

Cadence and NVIDIA Claim 40x Faster RTL Verification With AI Agents

Cadence and NVIDIA say an autonomous verification stack built around Cadence ChipStack, Nemotron, Codex and NVIDIA OpenShell can reduce RTL verification cycles from weeks to hours by automating simulation, formal verification, debugging and code repair. The companies present the system as a way to compress one of chip development’s most time-consuming loops, while still escalating major design issues to human engineers.

Jun 1, 20265 min read

Sarvam and NVIDIA Build Full-Stack Sovereign AI Infrastructure for India

Sarvam co-founder Pratyush Kumar argues that India’s AI sovereignty cannot mean putting Indian-language interfaces on foreign-built systems. In a NVIDIA-backed account of Sarvam’s work, he describes a full-stack effort to build foundational models, data pipelines, inference systems and developer APIs inside India, using NVIDIA H100 clusters and NeMo tooling to process Indian-language data at scale. The case is that voice-first AI for India’s population requires domestic capability across data, models, applications and accelerated-compute expertise.

Pratyush KumarJun 1, 20265 min read

NVIDIA Positions RTX Spark as a Local AI Runtime for Windows PCs

NVIDIA is pitching RTX Spark as more than a faster Windows PC chip: it says the Blackwell-and-Grace “superchip” is the hardware basis for a new class of personal AI computers built around local agents. Developed in close collaboration with Microsoft, the platform is framed as a Windows architecture for agents that can run natively, use local or cloud models, remain sandboxed, and handle substantial on-device AI workloads alongside creation and gaming.

Jun 1, 20265 min read

AI Factories Are Turning Taiwan’s Supply Chain Into Strategic Infrastructure

NVIDIA’s GTC keynote pregame in Taipei presented Taiwan as more than a manufacturing base for the AI boom. Across interviews led by Bruce Lu of Goldman Sachs and Tracy Tsai of Gartner, Jensen Huang and Taiwanese technology executives argued that AI is becoming infrastructure, requiring chips, advanced packaging, racks, power, factories, robots, software, local compute and talent to work as one system. The case was optimistic but conditional: Taiwan’s strength is the density of its industrial stack, and its test is whether it can move up into systems, software and application leadership.

Jensen Huang · Simon Chang · Rick Tsai · Tracy Tsai · Bruce Lu · Alex Yeh · Barry Lam · Neo Yao · Jonney Shih · Haw Chen · Hung-yi Lee · Tzu-Hsien Tung · Simon Lin · Yuh-Jier Mii · Kathy YangJun 1, 202622 min read

Automated Cognitive Intelligence Can Sustain Decades of AI Growth

Asked about fears of an AI bubble during a TVBS exchange in Taiwan, Nvidia chief executive Jensen Huang argued that the durability of the industry rests on usefulness rather than market timing. Because AI can now automate cognitive intelligence, Huang said, demand for compute and AI capability should have “decades” of growth ahead, with Taiwan’s chip and packaging partners positioned inside that buildout. His advice to individuals was similarly practical: learn the technology and use it to improve their own work rather than stand aside.

Jensen Huang · Tingting LiuMay 30, 20262 min read

Low-Cost Robot Arms Let Non-Specialists Train Physical AI

On NVIDIA’s AI Podcast, Seeed Studio CEO Eric Pan and head of robotics Elaine Wu make the case that open-source, Jetson-powered robot arms can move embodied AI beyond specialist industrial settings. Their argument is that low-cost hardware, frameworks such as OpenClaw and LeRobot, and Isaac Sim digital twins let makers, students and small businesses teach and constrain robots around specific tasks, rather than waiting for a closed general-purpose humanoid.

Noah Kravitz · Elaine Wu · Eric PanMay 27, 202612 min read

AI Factory Digital Twins Link Facility Design to Tokens per Watt

Leaders from Jacobs, PTC and Phaidra argue that AI factories are becoming too complex and volatile to design, build and operate through siloed handoffs. In their account, NVIDIA’s DSX reference design and Omniverse DSX Blueprint provide a shared digital twin that carries design intent from planning into simulation and operations, allowing teams to test facility layouts before construction and train AI agents to manage cooling, power use and tokens per watt once the data center is running.

Jim Gao · Catherine Kniker · Dana TilleyMay 27, 20265 min read

Cost Per Token Is Replacing FLOPS as the AI Infrastructure Metric

Shruti Koparkar of NVIDIA’s Accelerated Computing team argues that AI infrastructure should be evaluated by token economics rather than by GPU-hour pricing or FLOPS per dollar. On NVIDIA’s AI Podcast, she lays out a four-part framework — token utility, supply, demand and monetization — in which cost per token becomes the central measure of business value. Koparkar says NVIDIA Blackwell’s system-level design delivers 50 times more tokens per watt than Hopper and 35 times lower token cost, while lower token costs will expand GPU demand by making more AI workloads economically viable.

Noah Kravitz · Shruti KoparkarMay 21, 202612 min read

Snap Cut Experimentation Job Costs 76% With GPU-Accelerated Spark

Prudhvi Vatala, Snap’s head of engineering platforms, argues that the company’s 10-plus-petabyte daily experimentation pipeline became a cost and scale problem that could not be solved by adding more CPUs. In an NVIDIA AI Podcast interview, he says Snap cut job costs by 76% by moving Spark workloads to NVIDIA GPU-accelerated infrastructure on Google Cloud, reusing idle inference GPUs overnight, and doing so without application code changes.

Noah Kravitz · Prudhvi VatalaMay 13, 20269 min read

Enterprise AI Agents Need Harnesses, Traces, and Controlled Runtimes

LangChain co-founder and CEO Harrison Chase argues that enterprise AI agents are becoming an architectural problem rather than a question of adding autonomy wherever possible. In an NVIDIA AI Podcast interview, he says systems such as Claude Code, Manus and Deep Research share a common “deep agent” pattern: an LLM in a tool-calling loop, supported by a reusable harness, workspace, subagents and planning. For enterprises, Chase says trust depends on choosing the right level of autonomy and surrounding agents with observability, evaluation, secure runtimes and continued iteration.

Harrison Chase · Noah KravitzMay 7, 202612 min read