
NVIDIA
NVIDIA is a computing company founded in 1993, known for GPUs and accelerated computing, with products spanning PC gaming, computer graphics, AI, industrial digitalization, and data-center-scale infrastructure.
NVIDIA Says Agentic AI Is Forcing a Redesign of Enterprise Computing
At GTC Taipei during COMPUTEX, NVIDIA founder and chief executive Jensen Huang argued that agentic AI and frontier models have already changed the computer industry. The company’s case was that enterprises now need full agent-building infrastructure, AI-capable PCs such as RTX Spark represent a break from the old laptop model, and production hardware including Vera Rubin will underpin the next phase of AI computing. NVIDIA framed that shift through Taiwan’s manufacturing ecosystem, presenting Taipei as both industrial partner and symbolic home.
AI Infrastructure Is Shifting From Accelerator Racks to Distributed Agent Systems
At Dell Technologies World, Nvidia chief Jensen Huang and Dell CEO Michael Dell argued that enterprise AI is moving from experimental promise to operational infrastructure, with agentic systems driving a sharp increase in compute demand. Huang said agents change the workload from single prompt-response transactions to long-running loops of reasoning, planning and tool use, while Dell framed the response as a pragmatic push toward distributed, “unmetered” intelligence across PCs, data centers and cloud-scale systems.
NVIDIA RTX Spark Recasts Windows PCs as Local AI Agent Machines
NVIDIA chief executive Jensen Huang used his GTC Taipei keynote to present RTX Spark as the basis for a new class of Windows PCs built around personal AI agents. His argument was that the PC needs an abstraction layer comparable to the one that made the original Windows ecosystem work: existing applications, CUDA workloads and games still run, but large language models and agent runtimes become part of the operating environment.
Microsoft and NVIDIA Redesign PCs and Data Centers for Agentic AI
At Microsoft Build, NVIDIA chief executive Jensen Huang joined Microsoft chief executive Satya Nadella to frame their expanded partnership around a single premise: agents are becoming a primary computing workload. Huang argued that this shift requires redesigning PCs, data centers and software together, from RTX Spark devices that can run local autonomous assistants to Grace Blackwell and Vera Rubin systems built for large-scale reasoning and low-latency agent execution. Nadella positioned the work as an extension of Microsoft’s infrastructure and developer platform strategy across Windows, Azure, Fabric, Foundry and GitHub.
NVIDIA Frames Cosmos 3 as Compute-Generated Data for Physical AI
NVIDIA presents Cosmos 3 as an open foundation model for physical AI, built to address what it frames as a data-scaling problem in robotics, autonomous vehicles and other systems that operate in the physical world. The company argues that real-world data cannot capture enough variability on its own, so compute must generate usable training and evaluation signals: synthetic video, predicted sensor outputs, simulation loops and action plans. Cosmos 3 is positioned as a post-trainable mixture-of-transformers system that combines multimodal reasoning with generation to support perception, prediction, simulation and action.
NVIDIA Positions 1,000 CUDA-X Libraries as Physical AI Infrastructure
NVIDIA’s GTC Taipei and COMPUTEX 2026 montage presents CUDA-X as the software stack that extends CUDA from an accelerated-computing architecture into what the company calls the algorithmic foundation for physical AI. NVIDIA argues that more than 1,000 CUDA-X libraries now support simulation and engineering work across domains including molecular science, robotics, factory automation, autonomous systems and Earth-scale digital twins, with the visual evidence explicitly framed as computer graphics and simulation rather than generative AI.
DSX MaxLPS Claims 45% More GPUs Inside a 1 GW Power Budget
NVIDIA is positioning DSX as a control stack for gigawatt-scale AI factories where the binding constraint is usable power rather than installed hardware. In its press release and technical blog, the company argues that DSX Sim, MaxLPS, Flex and OS let operators design, validate and run facilities as integrated power, cooling, compute and grid systems, increasing GPU capacity inside fixed power budgets. The central claim is that AI infrastructure economics will depend on maximizing reliable tokens per watt, not simply adding more racks.
NVIDIA Says Vera Rubin Is in Full Production for Agentic AI
NVIDIA says its Vera Rubin platform is now in full production, positioning it as a pod-scale “AI factory” for agentic workloads rather than a conventional accelerator launch. The company argues that agents shift the bottleneck from model execution to full-system orchestration — reasoning, memory, tool use, low-latency token generation, storage, networking and power — and that Vera Rubin addresses this through five connected rack-scale systems. NVIDIA frames the milestone as both a technical and manufacturing claim, built on extreme co-design across chips, racks, data centers and Taiwan’s supply chain.
RTX Spark Agent Moves Architectural Designs From Brief to Photoreal Render
NVIDIA’s RTX Spark demonstration argues that an architectural AI agent is most useful as a workflow operator, not as a standalone design tool. Running locally on RTX Spark and connected to tools including Rhino, Blender, ComfyUI, OpenShell and Claude Sonnet, the agent turns a residential brief into massing options, editable layouts, validated geometry and photoreal renders. NVIDIA frames the speedup as orchestration across existing applications, with the designer still approving directions, resolving tradeoffs and controlling materials and shots.
NVIDIA Frames Tokens as the Industrial Output of AI Factories
NVIDIA’s GTC Taipei keynote intro presents tokens as the manufactured output of a new “AI factory,” turning data into knowledge, reason and action across scientific, medical, robotic and industrial systems. The company argues that its accelerated computing platform, built with partners in Taiwan, is the infrastructure behind that production model, with Taipei positioned as the starting point for an AI industry that extends from data centers to cities, healthcare, factories and space.
NVIDIA Frames AI Agents as the Workload Driving Its Compute Stack
NVIDIA’s closing video for Jensen Huang’s GTC Taipei 2026 keynote recast the company’s announcements around a single claim: “useful AI” now means agents doing work. In the recap, NVIDIA ties that workload to demand for Vera Rubin inference performance, cheaper tokens, BlueField memory support, enterprise guardrails, Windows PCs, DGX infrastructure and robotics systems. The argument is that agents are no longer a novelty layer on top of computing, but the demand signal connecting NVIDIA’s silicon, software, cloud and physical AI stack.
NVIDIA Says Vera Runs Agentic Tasks 80% Faster Than x86
NVIDIA is pitching Vera as a data center CPU built for the CPU-side work created by agentic AI, not as a conventional cloud processor optimized mainly for core count and virtualization. The company argues that as agents run Python code, tool calls, retrieval, sandboxed execution and data orchestration around GPUs, CPU delays become a constraint on GPU utilization, throughput and latency. Vera’s case rests on NVIDIA’s custom Olympus cores, LPDDR5X memory bandwidth, a coherent 88-core fabric and NVLink-C2C links into GPU systems, extending its AI platform from acceleration into orchestration.
NVIDIA Says Isaac GR00T Cuts Humanoid Robotics Setup From Months to Hours
NVIDIA is making the case that humanoid robot development is being slowed less by model ambition than by the repeated work of assembling simulation, teleoperation, data, training and deployment infrastructure. Its Isaac GR00T platform is presented as an open, modular stack that can cut setup from months to hours by connecting Isaac Lab, Omniverse, Cosmos, Isaac ROS and Jetson Thor in one development path. The company also introduces a Jetson Thor-based reference humanoid robot meant to give research teams a starting hardware design for skill development and real-world validation.
NVIDIA Positions RTX Spark as a 128 GB Local AI Workstation
NVIDIA’s Computex preview positioned RTX Spark as a compact Windows platform for local AI, creative production and RTX gaming, built around a new superchip pairing a Blackwell RTX GPU with a Grace CPU. Jacob Freeman and other NVIDIA presenters argued that its 128 GB of unified memory and RTX acceleration allow slim laptops and small desktops to run larger local agents, handle heavy creative scenes and support modern ray-traced games with DLSS 4.5.
NVIDIA Alpamayo Presents Autonomous Driving as Explainable Micro-Decisions
NVIDIA presents Alpamayo as a reasoning-based autonomous driving model whose decisions can be rendered as audible, causal judgments rather than hidden vehicle behavior. In the demo, the car responds to ordinary city traffic by explaining why it stops, yields, nudges or keeps distance — because a pedestrian is in the lane, a stop sign controls the intersection, a truck blocks space or another vehicle is merging. The point is not that the car can speak, but that NVIDIA wants Alpamayo understood as continuously evaluating road conditions while the passenger experience remains routine.
Cadence and NVIDIA Claim 40x Faster RTL Verification With AI Agents
Cadence and NVIDIA say an autonomous verification stack built around Cadence ChipStack, Nemotron, Codex and NVIDIA OpenShell can reduce RTL verification cycles from weeks to hours by automating simulation, formal verification, debugging and code repair. The companies present the system as a way to compress one of chip development’s most time-consuming loops, while still escalating major design issues to human engineers.
Sarvam and NVIDIA Build Full-Stack Sovereign AI Infrastructure for India
Sarvam co-founder Pratyush Kumar argues that India’s AI sovereignty cannot mean putting Indian-language interfaces on foreign-built systems. In a NVIDIA-backed account of Sarvam’s work, he describes a full-stack effort to build foundational models, data pipelines, inference systems and developer APIs inside India, using NVIDIA H100 clusters and NeMo tooling to process Indian-language data at scale. The case is that voice-first AI for India’s population requires domestic capability across data, models, applications and accelerated-compute expertise.
NVIDIA Positions RTX Spark as a Local AI Runtime for Windows PCs
NVIDIA is pitching RTX Spark as more than a faster Windows PC chip: it says the Blackwell-and-Grace “superchip” is the hardware basis for a new class of personal AI computers built around local agents. Developed in close collaboration with Microsoft, the platform is framed as a Windows architecture for agents that can run natively, use local or cloud models, remain sandboxed, and handle substantial on-device AI workloads alongside creation and gaming.
AI Factories Are Turning Taiwan’s Supply Chain Into Strategic Infrastructure
NVIDIA’s GTC keynote pregame in Taipei presented Taiwan as more than a manufacturing base for the AI boom. Across interviews led by Bruce Lu of Goldman Sachs and Tracy Tsai of Gartner, Jensen Huang and Taiwanese technology executives argued that AI is becoming infrastructure, requiring chips, advanced packaging, racks, power, factories, robots, software, local compute and talent to work as one system. The case was optimistic but conditional: Taiwan’s strength is the density of its industrial stack, and its test is whether it can move up into systems, software and application leadership.
Automated Cognitive Intelligence Can Sustain Decades of AI Growth
Asked about fears of an AI bubble during a TVBS exchange in Taiwan, Nvidia chief executive Jensen Huang argued that the durability of the industry rests on usefulness rather than market timing. Because AI can now automate cognitive intelligence, Huang said, demand for compute and AI capability should have “decades” of growth ahead, with Taiwan’s chip and packaging partners positioned inside that buildout. His advice to individuals was similarly practical: learn the technology and use it to improve their own work rather than stand aside.
Low-Cost Robot Arms Let Non-Specialists Train Physical AI
On NVIDIA’s AI Podcast, Seeed Studio CEO Eric Pan and head of robotics Elaine Wu make the case that open-source, Jetson-powered robot arms can move embodied AI beyond specialist industrial settings. Their argument is that low-cost hardware, frameworks such as OpenClaw and LeRobot, and Isaac Sim digital twins let makers, students and small businesses teach and constrain robots around specific tasks, rather than waiting for a closed general-purpose humanoid.
AI Factory Digital Twins Link Facility Design to Tokens per Watt
Leaders from Jacobs, PTC and Phaidra argue that AI factories are becoming too complex and volatile to design, build and operate through siloed handoffs. In their account, NVIDIA’s DSX reference design and Omniverse DSX Blueprint provide a shared digital twin that carries design intent from planning into simulation and operations, allowing teams to test facility layouts before construction and train AI agents to manage cooling, power use and tokens per watt once the data center is running.
Cost Per Token Is Replacing FLOPS as the AI Infrastructure Metric
Shruti Koparkar of NVIDIA’s Accelerated Computing team argues that AI infrastructure should be evaluated by token economics rather than by GPU-hour pricing or FLOPS per dollar. On NVIDIA’s AI Podcast, she lays out a four-part framework — token utility, supply, demand and monetization — in which cost per token becomes the central measure of business value. Koparkar says NVIDIA Blackwell’s system-level design delivers 50 times more tokens per watt than Hopper and 35 times lower token cost, while lower token costs will expand GPU demand by making more AI workloads economically viable.
Snap Cut Experimentation Job Costs 76% With GPU-Accelerated Spark
Prudhvi Vatala, Snap’s head of engineering platforms, argues that the company’s 10-plus-petabyte daily experimentation pipeline became a cost and scale problem that could not be solved by adding more CPUs. In an NVIDIA AI Podcast interview, he says Snap cut job costs by 76% by moving Spark workloads to NVIDIA GPU-accelerated infrastructure on Google Cloud, reusing idle inference GPUs overnight, and doing so without application code changes.
Enterprise AI Agents Need Harnesses, Traces, and Controlled Runtimes
LangChain co-founder and CEO Harrison Chase argues that enterprise AI agents are becoming an architectural problem rather than a question of adding autonomy wherever possible. In an NVIDIA AI Podcast interview, he says systems such as Claude Code, Manus and Deep Research share a common “deep agent” pattern: an LLM in a tool-calling loop, supported by a reusable harness, workspace, subagents and planning. For enterprises, Chase says trust depends on choosing the right level of autonomy and surrounding agents with observability, evaluation, secure runtimes and continued iteration.