Orply.

NVIDIA Frames Tokens as the Industrial Output of AI Factories

NVIDIATuesday, June 2, 20266 min read

NVIDIA’s GTC Taipei keynote intro presents tokens as the manufactured output of a new “AI factory,” turning data into knowledge, reason and action across scientific, medical, robotic and industrial systems. The company argues that its accelerated computing platform, built with partners in Taiwan, is the infrastructure behind that production model, with Taipei positioned as the starting point for an AI industry that extends from data centers to cities, healthcare, factories and space.

Intelligence is framed as an industrial output

NVIDIA frames modern intelligence as something manufactured. The “new kind of factory” it names is an AI factory, and its output is tokens — described in the narration as “the building blocks of AI.”

The claim is an operating metaphor: tokens are the unit through which data becomes usable intelligence. In NVIDIA’s language, tokens turn data into “knowledge, reason, action.” A prompt-bar visual asks, “How do tokens turn knowledge into discovery?” and the sequence applies that question across science, cities, robotics, healthcare, industrial work, data centers, and space.

Tokens have opened a new frontier, turning data into knowledge, reason, action.

The framing shifts attention from AI as a chatbot interface to AI as production infrastructure. A factory generates tokens; those tokens, in NVIDIA’s telling, support discovery, interpretation, machine work, and human activity in environments and at scales that are difficult or unreachable.

The first promise is pattern-finding in complex scientific and industrial systems

The first concrete use case is scientific search. A visual attributed to NYB.ai shows research documents and papers connected by colored lines, followed by a prompt interface over chemical molecule structures. The visible prompt asks: “Help me find small molecules that can stabilize SOD1 and treat ALS.” The interface includes score metrics, and the sequence then shows a 3D molecular or protein structure.

NVIDIA’s narration says tokens “reveal patterns in complexity we could never see.” The example places AI in the middle of scientific work: reading across papers, representing molecules, scoring possibilities, and visualizing biological structures. The same claim is then applied to physical production systems. A factory conveyor system attributed to MetAI and KENMEC appears after the molecular sequence, placing scientific complexity and factory complexity under one industrial frame: both are domains where AI-generated representations can expose patterns and support action.

Healthcare returns to the same pattern from a more clinical angle. A screenshot attributed to Insilico Medicine shows a 3D gray ribbon protein structure, followed by a digital lung model and a gauge labeled “Fibrosis Relief.” The narration describes tokens as “closing the gap between hope and healing,” then adds, “So that we breathe easier.” A subsequent visual attributed to Children’s Hospital of Philadelphia shows a 3D heart simulation being fitted with different stents. Visible text reads: “Stent A Simulating fit...” and “Stent B Simulating fit...” The narration says, “And the smallest hearts beat stronger.”

Those images connect token generation to drug discovery, biological modeling, lung disease, and pediatric heart care. The substance is less a single medical claim than a map of where NVIDIA wants AI factories to matter: scientific and medical domains where complexity limits what people can see, simulate, and decide.

Cities and robots become environments AI can represent and act within

NVIDIA applies the same claim to urban infrastructure. Footage attributed to Linker Vision shows Taipei, then traffic in Kaohsiung on a wet, cloudy street. The narration says tokens “mirror our cities to keep us safe.”

That phrase carries a specific image of AI systems: they create representations of real environments that can support situational awareness. The city footage appears with the language of mirroring, making cities into systems that can be modeled and interpreted.

The next image extends urban mobility upward. Footage attributed to The ePlane Company shows an eVTOL aircraft hovering over a rooftop helipad. The narration says tokens “lift us high above them.” The city mirror and eVTOL sequence together place AI in the built environment: first as a way to represent the city, then as part of a future of movement above it.

Robotics turns that representational idea into embodied work. Footage attributed to Red Pill Lab shows a person making hand gestures beside a robot and a digital skeleton that mimics the movements. The narration says tokens help robots “learn from us.” The emphasis is imitation and transfer: human motion becomes information a robot can respond to.

A Foxconn-attributed visual then shows a robotic arm interacting with a wall of surgical instruments, with bounding boxes labeling objects. One visible label reads “U.S. Army Retractor Kelly Curved Smooth Forceps.” The narration says tokens help robots “work alongside us.” Here the example is perception in a specialized setting, where identifying precise tools appears to be part of the task.

The robotics sequence then shifts to a four-legged robot attributed to Diden Robotics crawling on a vertical metal surface. “They go where we cannot,” the narration says. Footage attributed to Multiply Labs shows robotic arms manipulating lab trays and scientific equipment, while the narration describes “helping hands.”

Across these examples, tokens are presented as a bridge between human behavior, machine perception, and robotic execution. The roles differ — learning from demonstration, working near people, entering difficult environments, handling lab equipment — but NVIDIA treats them as parts of the same continuum: AI-generated intelligence moving from screens into machines.

DomainAttribution shownWhat appears on screen
Scientific discoveryNYB.aiResearch papers, molecule structures, a prompt about stabilizing SOD1 and treating ALS, and a 3D molecular or protein structure
CitiesLinker VisionTaipei skyline footage and Kaohsiung traffic footage
RoboticsRed Pill Lab, Foxconn, Diden Robotics, Multiply LabsGesture imitation, surgical-instrument recognition, a wall-crawling quadruped, and lab-equipment handling
HealthcareInsilico Medicine, Children's Hospital of PhiladelphiaProtein and lung visualizations, a “Fibrosis Relief” gauge, and stent-fit simulations for a heart
InfrastructureKomatsu | Applied Intuition, NAVER Data Center, GMI Cloud, NTT DATA, SpaceXMining equipment, data-center construction, server racks, a rocket launch, and spacecraft separation
The intro uses attributed application visuals to connect token generation with physical-world domains.

The AI factory becomes physical infrastructure, with Taipei at the center

The final third widens the scale from applications to the infrastructure surrounding AI factories. Footage attributed to Komatsu and Applied Intuition shows a massive dump truck in a mining environment while the narration says tokens are “helping us break new ground.” A construction-site visual attributed to NAVER Data Center, labeled South Korea, follows the line “on a scale never attempted.” Then data-center imagery appears: a GMI Cloud facility labeled Taiwan, and an NTT DATA aisle of server racks labeled Japan and USA.

This sequence connects the opening metaphor back to physical plant. If tokens are the output of AI factories, the factory image is not only metaphorical: NVIDIA places token generation beside construction sites, data-center buildings, and server racks. Taiwan is central to that framing. The published description says NVIDIA’s accelerated computing platform is “built by our partners in Taiwan,” and the narration’s closing line locates the beginning of the story in the host city: “And here, in Taipei, is where it all begins.”

The claim then reaches beyond terrestrial infrastructure. SpaceX-attributed footage shows a rocket launch, followed by a spacecraft separating in orbit. A voice says, “Starcloud 1, separation confirmed,” before the narration concludes with “to infinity and beyond.” The final line broadens the register again: “Together, we take the next great leap into a bright new future, built for all mankind.”

The aspiration is expansive, but the internal logic is consistent with the opening. NVIDIA presents tokens as a common substrate connecting AI agents, robotics, healthcare, urban systems, industrial machinery, data centers, and space infrastructure. The through-line is the manufacturing paradigm: intelligence generated at scale by AI factories and applied across the physical world.

The frontier, in your inbox tomorrow at 08:00.

Sign up free. Pick the industry Briefs you want. Tomorrow morning, they land. No credit card.

Sign up free