{"schema_version":"onlylabs.public_analysis.v1","url":"https://onlylabs.fyi/analysis/nvidia","json_url":"https://onlylabs.fyi/analysis/nvidia/analysis.json","evidence_json_url":"https://onlylabs.fyi/analysis/nvidia/evidence.json","generated_at":"2026-06-11T15:09:54.307Z","analysis":{"org_slug":"nvidia","url":"https://onlylabs.fyi/analysis/nvidia","json_url":"https://onlylabs.fyi/analysis/nvidia/analysis.json","evidence_json_url":"https://onlylabs.fyi/analysis/nvidia/evidence.json","dossier_url":"https://onlylabs.fyi/labs/nvidia","org":{"slug":"nvidia","name":"NVIDIA","category":"frontier-lab","category_label":"Frontier lab","homepage_url":"https://www.nvidia.com"},"title":"NVIDIA analysis","summary":"NVIDIA is positioning itself as the full-stack supplier of the \"AI factory\" era — selling not just silicon but open models, agent runtimes, and physical-AI foundation models that run on its hardware. The current push centers on three fronts: long-running agents (the Nemotron 3 Ultra family and the NemoClaw agent blueprint), physical/world AI (Cosmos 3 and robotics), and local/personal agents on new hardware (RTX…","markdown":"## Thesis\n\nNVIDIA is positioning itself as the full-stack supplier of the \"AI factory\" era — selling not just silicon but open models, agent runtimes, and physical-AI foundation models that run on its hardware. The current push centers on three fronts: long-running agents (the [Nemotron 3 Ultra](https://blogs.nvidia.com/blog/nvidia-gtc-taipei-computex-2026-news/) family and the [NemoClaw](https://github.com/NVIDIA/NemoClaw) agent blueprint), physical/world AI ([Cosmos 3](https://blogs.nvidia.com/blog/cosmos-3-physical-ai-open-world-foundation-model/) and robotics), and local/personal agents on new hardware (RTX Spark, DGX Spark, Jetson). Nearly all first-party writing in the window is GTC Taipei / COMPUTEX launch and partnership coverage, framing NVIDIA as the infrastructure layer that converts \"energy into tokens.\"\n\n## Shipping\n\nThe flagship open release is **Nemotron 3 Ultra**, an open model built for long-running agents — the [`nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16`](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16) checkpoint (~560B params) leads the model footprint at **49,784** downloads / 158 likes, with companion Base ([1,059 downloads](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-Base-BF16)) and GenRM reward-model ([413 downloads](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-GenRM)) variants. The **Cosmos 3** world-model line ships [`Cosmos3-Super-Text2Image`](https://huggingface.co/nvidia/Cosmos3-Super-Text2Image) (5,075 dl), [`Cosmos3-Super-Image2Video`](https://huggingface.co/nvidia/Cosmos3-Super-Image2Video) (4,515 dl), and the robotics-policy [`Cosmos3-Nano-Policy-DROID`](https://huggingface.co/nvidia/Cosmos3-Nano-Policy-DROID) (4,153 dl). Smaller Nemotron-branded releases cover multimodal and speech — [`Nemotron-Labs-Diffusion-VLM-8B`](https://huggingface.co/nvidia/Nemotron-Labs-Diffusion-VLM-8B) (5,978 dl) and the streaming-ASR [`nemotron-3.5-asr-streaming-0.6b`](https://huggingface.co/nvidia/nemotron-3.5-asr-streaming-0.6b) (3,439 dl, the most-liked model at 264) — plus a safety classifier, [`Nemotron-3.5-Content-Safety`](https://huggingface.co/nvidia/Nemotron-3.5-Content-Safety) (494 dl).\n\nOn GitHub, the headline repo is [`NVIDIA/NemoClaw`](https://github.com/NVIDIA/NemoClaw) at **21,050 stars** — the open agent blueprint, described in posts as \"an open blueprint for building specialized, long-running agents with a secure runtime and frontier models.\" The training/inference stack remains heavily starred: [`Megatron-LM`](https://github.com/NVIDIA/Megatron-LM) (16,624), [`TensorRT-LLM`](https://github.com/NVIDIA/TensorRT-LLM) (13,825), [`cutlass`](https://github.com/NVIDIA/cutlass) (9,859), and [`nccl`](https://github.com/NVIDIA/nccl) (4,791). Physical-AI and tooling repos round it out: [`cosmos`](https://github.com/NVIDIA/cosmos) (9,677), [`Isaac-GR00T`](https://github.com/NVIDIA/Isaac-GR00T) (7,280), [`warp`](https://github.com/NVIDIA/warp) (6,736), and the LLM red-teaming tool [`garak`](https://github.com/NVIDIA/garak) (8,050). Recent releases are mostly infra/tooling: [`Model-Optimizer 0.45.0rc0`](https://github.com/NVIDIA/Model-Optimizer/releases/tag/0.45.0rc0), [`NeMo-text-processing r1.2.0`](https://github.com/NVIDIA/NeMo-text-processing/releases/tag/r1.2.0), and the front-end component library [`@nvidia-elements/core-v0.2.4`](https://github.com/NVIDIA/elements/releases/tag/%40nvidia-elements/core-v0.2.4).\n\n## Research themes\n\nFirst-party writing clusters into a few clear directions:\n\n- **AI factories as a unit of infrastructure** — the conceptual frame in [\"AI Factories: The New Infrastructure of Intelligence\"](https://blogs.nvidia.com/blog/ai-factories-the-new-infrastructure-of-intelligence/) (converting \"energy into tokens\"; economics defined by tokens/sec, tokens/watt) and the [Vera CPU](https://blogs.nvidia.com/blog/vera-cpu-phoronix/) post on agentic-workload silicon (88 Olympus cores, 1.2TB/s bandwidth).\n- **Long-running and agentic AI** — Nemotron 3 Ultra \"built for long-running agents,\" the [Microsoft unified-stack partnership](https://blogs.nvidia.com/blog/microsoft-build-windows-local-cloud-devices/), and [NemoClaw-based \"autonomous AI engineers\"](https://blogs.nvidia.com/blog/industrial-software-leaders-secure-autonomous-ai-engineers-nemoclaw/) for industrial software.\n- **Physical AI / sim-to-real robotics** — [\"How Cosmos 3 Helps Physical AI Think Before It Acts\"](https://blogs.nvidia.com/blog/cosmos-3-physical-ai-open-world-foundation-model/), the [ICRA sim-to-real paper round-up](https://blogs.nvidia.com/blog/icra-research-robotics-simulation-to-real-world/) (8 of 28 accepted papers), and [CVPR work on grasping, autonomous driving, and agent training at scale](https://blogs.nvidia.com/blog/cvpr-research-grasping-driving-agent-training/).\n- **Local / personal agents** — [RTX Spark and DGX Spark for local agents](https://blogs.nvidia.com/blog/rtx-ai-garage-computex-spark-local-agents/) and [Jetson + NemoClaw at the edge](https://blogs.nvidia.com/blog/jetson-agentic-ai-physical-world/).\n- **Domain foundation models** — the PRAGMA transaction foundation model with Revolut Research, captured both as an [arXiv paper (2604.08649)](https://arxiv.org/pdf/2604.08649) and a [blog explainer](https://blogs.nvidia.com/blog/financial-institutions-transaction-foundation-models/).\n\nA second strand is sovereign-AI / partnership PR — [UK sovereign AI](https://blogs.nvidia.com/blog/uk-sovereign-ai-advancements/), [LG](https://blogs.nvidia.com/blog/nvidia-and-lg-group-ai-factory/) and [Doosan](https://blogs.nvidia.com/blog/nvidia-and-doosan-group-physical-ai/) AI factories, and [Taiwan's Vera Rubin supply chain](https://blogs.nvidia.com/blog/taiwan-ecosystem-ai-infrastructure/) — which reads more as ecosystem/go-to-market than research.\n\n## Hiring & scaling\n\nNo careers data captured yet.\n\n## Traction highlights\n\nOn Hacker News, NVIDIA's open developer tools and agent stack drove the most discussion: [`NVIDIA/warp`](https://github.com/NVIDIA/warp) topped the list at **490 points / 136 comments**, followed by the [`NemoClaw`](https://github.com/NVIDIA/NemoClaw) agent blueprint at **385 points / 261 comments** (the most-commented thread), the [`garak`](https://github.com/NVIDIA/garak) LLM red-teaming tool at **211 points / 62 comments**, and [`NVIDIA/MatX`](https://github.com/NVIDIA/MatX) at **103 points / 79 comments**. The GTC Taipei live-updates post drew only minor HN attention (4 points).\n\nMost-starred repos: [`NemoClaw`](https://github.com/NVIDIA/NemoClaw) (21,050), [`Megatron-LM`](https://github.com/NVIDIA/Megatron-LM) (16,624), and [`TensorRT-LLM`](https://github.com/NVIDIA/TensorRT-LLM) (13,825). Most-downloaded models: [`Nemotron-3-Ultra-550B-A55B-BF16`](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16) (49,784), [`Nemotron-Labs-Diffusion-VLM-8B`](https://huggingface.co/nvidia/Nemotron-Labs-Diffusion-VLM-8B) (5,978), and [`Cosmos3-Super-Text2Image`](https://huggingface.co/nvidia/Cosmos3-Super-Text2Image) (5,075).\n\n## Sources\n\n- [Nemotron-3-Ultra-550B-A55B-BF16 (Hugging Face)](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16)\n- [Cosmos3-Super-Text2Image (Hugging Face)](https://huggingface.co/nvidia/Cosmos3-Super-Text2Image)\n- [nemotron-3.5-asr-streaming-0.6b (Hugging Face)](https://huggingface.co/nvidia/nemotron-3.5-asr-streaming-0.6b)\n- [NVIDIA/NemoClaw (GitHub)](https://github.com/NVIDIA/NemoClaw)\n- [NVIDIA/Megatron-LM (GitHub)](https://github.com/NVIDIA/Megatron-LM)\n- [NVIDIA/TensorRT-LLM (GitHub)](https://github.com/NVIDIA/TensorRT-LLM)\n- [NVIDIA/warp (GitHub)](https://github.com/NVIDIA/warp)\n- [NVIDIA/garak (GitHub)](https://github.com/NVIDIA/garak)\n- [NVIDIA GTC Taipei at COMPUTEX: Live Updates](https://blogs.nvidia.com/blog/nvidia-gtc-taipei-computex-2026-news/)\n- [How Cosmos 3 Helps Physical AI Think Before It Acts](https://blogs.nvidia.com/blog/cosmos-3-physical-ai-open-world-foundation-model/)\n- [AI Factories: The New Infrastructure of Intelligence](https://blogs.nvidia.com/blog/ai-factories-the-new-infrastructure-of-intelligence/)\n- [NVIDIA Vera CPU vs. Competition](https://blogs.nvidia.com/blog/vera-cpu-phoronix/)\n- [NVIDIA Levels Up Local AI Agents Across RTX PCs and DGX Spark](https://blogs.nvidia.com/blog/rtx-ai-garage-computex-spark-local-agents/)\n- [Industrial Software Leaders Build AI Engineers With NemoClaw](https://blogs.nvidia.com/blog/industrial-software-leaders-secure-autonomous-ai-engineers-nemoclaw/)\n- [NVIDIA Research Advances Robotics From Simulation to the Real World (ICRA)](https://blogs.nvidia.com/blog/icra-research-robotics-simulation-to-real-world/)\n- [PRAGMA: Revolut Foundation Model (arXiv 2604.08649)](https://arxiv.org/pdf/2604.08649)\n- [NVIDIA Partners With Microsoft on Unified Stack for Agentic AI](https://blogs.nvidia.com/blog/microsoft-build-windows-local-cloud-devices/)","generated_at":"2026-06-08T15:59:09.594+00:00","citations":[{"url":"https://blogs.nvidia.com/blog/nvidia-gtc-taipei-computex-2026-news/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://github.com/NVIDIA/NemoClaw","path":null,"label":"NVIDIA/NemoClaw","type":"external"},{"url":"https://blogs.nvidia.com/blog/cosmos-3-physical-ai-open-world-foundation-model/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16","path":null,"label":"nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16","type":"external"},{"url":"https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-Base-BF16","path":null,"label":"nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-Base-BF16","type":"external"},{"url":"https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-GenRM","path":null,"label":"nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-GenRM","type":"external"},{"url":"https://huggingface.co/nvidia/Cosmos3-Super-Text2Image","path":null,"label":"nvidia/Cosmos3-Super-Text2Image","type":"external"},{"url":"https://huggingface.co/nvidia/Cosmos3-Super-Image2Video","path":null,"label":"nvidia/Cosmos3-Super-Image2Video","type":"external"},{"url":"https://huggingface.co/nvidia/Cosmos3-Nano-Policy-DROID","path":null,"label":"nvidia/Cosmos3-Nano-Policy-DROID","type":"external"},{"url":"https://huggingface.co/nvidia/Nemotron-Labs-Diffusion-VLM-8B","path":null,"label":"nvidia/Nemotron-Labs-Diffusion-VLM-8B","type":"external"},{"url":"https://huggingface.co/nvidia/nemotron-3.5-asr-streaming-0.6b","path":null,"label":"nvidia/nemotron-3.5-asr-streaming-0.6b","type":"external"},{"url":"https://huggingface.co/nvidia/Nemotron-3.5-Content-Safety","path":null,"label":"nvidia/Nemotron-3.5-Content-Safety","type":"external"},{"url":"https://github.com/NVIDIA/Megatron-LM","path":null,"label":"NVIDIA/Megatron-LM","type":"external"},{"url":"https://github.com/NVIDIA/TensorRT-LLM","path":null,"label":"NVIDIA/TensorRT-LLM","type":"external"},{"url":"https://github.com/NVIDIA/cutlass","path":null,"label":"NVIDIA/cutlass","type":"external"},{"url":"https://github.com/NVIDIA/nccl","path":null,"label":"NVIDIA/nccl","type":"external"},{"url":"https://github.com/NVIDIA/cosmos","path":null,"label":"NVIDIA/cosmos","type":"external"},{"url":"https://github.com/NVIDIA/Isaac-GR00T","path":null,"label":"NVIDIA/Isaac-GR00T","type":"external"},{"url":"https://github.com/NVIDIA/warp","path":null,"label":"NVIDIA/warp","type":"external"},{"url":"https://github.com/NVIDIA/garak","path":null,"label":"NVIDIA/garak","type":"external"},{"url":"https://github.com/NVIDIA/Model-Optimizer/releases/tag/0.45.0rc0","path":null,"label":"NVIDIA/Model-Optimizer","type":"external"},{"url":"https://github.com/NVIDIA/NeMo-text-processing/releases/tag/r1.2.0","path":null,"label":"NVIDIA/NeMo-text-processing","type":"external"},{"url":"https://github.com/NVIDIA/elements/releases/tag/%40nvidia-elements/core-v0.2.4","path":null,"label":"NVIDIA/elements","type":"external"},{"url":"https://blogs.nvidia.com/blog/ai-factories-the-new-infrastructure-of-intelligence/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/vera-cpu-phoronix/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/microsoft-build-windows-local-cloud-devices/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/industrial-software-leaders-secure-autonomous-ai-engineers-nemoclaw/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/icra-research-robotics-simulation-to-real-world/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/cvpr-research-grasping-driving-agent-training/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/rtx-ai-garage-computex-spark-local-agents/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/jetson-agentic-ai-physical-world/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://arxiv.org/pdf/2604.08649","path":null,"label":"arxiv.org/pdf","type":"external"},{"url":"https://blogs.nvidia.com/blog/financial-institutions-transaction-foundation-models/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/uk-sovereign-ai-advancements/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/nvidia-and-lg-group-ai-factory/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/nvidia-and-doosan-group-physical-ai/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://blogs.nvidia.com/blog/taiwan-ecosystem-ai-infrastructure/","path":null,"label":"blogs.nvidia.com/blog","type":"external"},{"url":"https://github.com/NVIDIA/MatX","path":null,"label":"NVIDIA/MatX","type":"external"}],"provenance":{"provider":null,"model":null,"workflow":"synthesize-analyses","agent":null},"evidence":{"total":null,"pages":null,"events":null,"web":null,"signal_desks":null,"data_radar_lanes":null,"data_radar_matches":null}},"signal_counts":{"total":781,"model_released":30,"release":495,"repo_new":211,"repo_forked":20,"post_published":25,"job_opened":0}}