Nvidia announced Cosmos Reason 2 at CES 2026, a step toward more capable physical artificial intelligence (AI) agents. Cosmos Reason 2, the latest version of Nvidia’s vision-language model, is tailored for embodied reasoning: enterprises can customize it for their own applications, and physical agents can use it to plan their next actions. It builds on Cosmos Reason 1, which introduced a two-dimensional ontology for embodied reasoning and currently leads in physical reasoning for video tasks.
Nvidia also introduced a new version of Cosmos Transfer, a model that helps create training simulations for robots. Other vision-language models, such as Google’s PaliGemma and Mistral’s Pixtral Large, can process visual inputs, but not all of them support reasoning.
Kari Briski, Nvidia’s vice president for generative AI software, said Cosmos Reason 2 improves robots’ ability to reason their way through unpredictable physical environments. Briski described a shift in robotics from specialized, single-task robots to versatile systems that combine broad knowledge with specialized skills.
Nvidia’s roadmap includes a range of open models designed for physical AI applications, robotics, and agentic AI. By providing access to diverse datasets, compute resources, and training tools, Nvidia aims to foster the development and deployment of purpose-built AI systems for various applications in the digital and physical realms.
The company is also expanding its Nemotron family, which now includes Nemotron Speech, Nemotron RAG, and Nemotron Safety, as part of its effort to advance AI capabilities across different domains.
Source: VentureBeat