Glean Adds Support for NVIDIA Nemotron 3 Ultra to Expand Enterprise AI Options

The new model shows leaps in open-source agentic capabilities, delivering 91% of frontier-model performance on key metrics like completeness

Enterprise AI leader Glean announced support for NVIDIA Nemotron 3 Ultra, expanding the set of models available in Glean’s platform and giving customers a new open model option for cost-effective agentic work.

The Nemotron 3 Ultra open model gives customers strong agentic capabilities for everyday enterprise work, delivering 91% of frontier LLM completeness with the cost profile of an open model. With Nemotron 3 Ultra available in Glean, customers have more flexibility in how they deploy AI across the business. Rather than forcing every task through a single model family, Glean helps organizations choose the best model for the job and orchestrate it within a secure, context-aware enterprise platform.

“Enterprises are moving beyond the idea that one model should do everything,” said Emrecan Dogan, Chief Product Officer, Glean. “They want the ability to match the right model to the right task, and they need a cost-effective way to bring AI into everyday work. Our support for NVIDIA Nemotron 3 Ultra reflects that reality and gives customers a strong option as they scale AI across the enterprise.”

“Glean is bringing NVIDIA Nemotron 3 Ultra into enterprise AI workflows where model choice, cost, and performance are critical,” said Kari Briski, Vice President of Generative AI, NVIDIA. “Together, we’re helping companies deploy open models for everyday work at scale.”

Today’s announcement underscores Glean’s long-standing model-agnostic platform strategy: enterprises should be able to build across an ecosystem of models, not rely on a single provider. With access to 30+ models, including Nemotron 3 Ultra, Glean customers can take advantage of the latest open-source and proprietary advances, with stronger performance, lower cost, and the flexibility to avoid provider lock-in as AI evolves rapidly.

The work with NVIDIA continues Glean’s collaboration across the Nemotron family of models. Glean Waldo, an agentic search model, is post-trained on NVIDIA Nemotron 3 Nano and delivers 50% lower latency and 25% fewer tokens. Waldo takes on the search tasks that frontier models used to handle, preserving their reasoning and response capacity for work that actually requires it. This is a blueprint for how Glean approaches token economics: multiple models working together to deliver frontier-level intelligence with fewer tokens.
