    NVIDIA launches open model family for agentic AI

    By InfoForTech | January 21, 2026

    NVIDIA's Nemotron 3 lineup, comprising Nano, Super, and Ultra, targets multi-agent AI systems, combining advanced reasoning, conversational, and collaborative capabilities. The models use a hybrid Mamba-Transformer mixture-of-experts (MoE) architecture, which NVIDIA says provides best-in-class inference throughput while supporting context lengths of up to 1 million tokens.

    Nemotron 3 Nano, the smallest model, is optimized for cost-efficient inference and tasks such as software debugging, content summarization, AI assistant workflows, and information retrieval. Although it has 30 billion total parameters, it activates only about 3 billion per token. With its hybrid MoE design, Nano achieves up to 4× higher token throughput than its predecessor and generates 60% fewer reasoning tokens while maintaining accuracy. Early benchmarks show Nano outperforming comparable open models such as GPT-OSS-20B and Qwen3-30B on reasoning and long-context tasks.
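The gap between total and active parameters comes from sparse expert routing: a router scores every expert for each token, and only the top few experts run. The sketch below is a generic top-k router, not NVIDIA's implementation. The function names, the example logits, and the choice of `k=2` are illustrative assumptions; only the 30 billion and 3 billion figures come from the article.

```python
import math

def softmax(logits):
    """Convert raw router scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(router_logits, k):
    """Pick the k highest-scoring experts and renormalize their weights.

    Generic top-k MoE routing (an assumption for illustration; the article
    does not describe how Nemotron 3 routes tokens).
    """
    probs = softmax(router_logits)
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    chosen = ranked[:k]
    norm = sum(probs[i] for i in chosen)
    return [(i, probs[i] / norm) for i in chosen]

def active_fraction(total_params, active_params):
    """Share of the model's parameters used for a single token."""
    return active_params / total_params

# Hypothetical router scores for one token across four experts.
example_logits = [0.1, 2.0, -1.0, 1.5]
for expert, weight in route_top_k(example_logits, k=2):
    print(f"expert {expert} weight {weight:.3f}")

# Figures reported in the article for Nemotron 3 Nano.
share = active_fraction(30e9, 3e9)
print(f"active share per token: {share:.0%}")
```

Per-token compute in a design like this scales with the active parameters rather than the total, which is why a 30-billion-parameter model can have inference costs closer to those of a much smaller dense model.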

    Nemotron 3 Super and Ultra extend these capabilities to high-volume collaborative agents and complex AI applications. They add latent MoE, a hardware-aware expert design that increases model quality without sacrificing efficiency, and multi-token prediction (MTP), which improves long-form text generation and multi-step reasoning. Both larger models are trained using NVIDIA’s 4-bit NVFP4 format, enabling faster training and reduced memory requirements.
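To give a rough sense of why a 4-bit format reduces memory, the sketch below compares storage for the same number of values at 16 bits and at 4 bits with a shared per-block scale. It assumes 4 bits per value plus one 8-bit scale for every 16 values, and a hypothetical count of 100 billion values, since the article gives no sizes for Super or Ultra. It covers raw value storage only, not optimizer state, activations, or other training memory.

```python
def weight_bytes(n_values, bits_per_value, block_size=None, scale_bits=0):
    """Bytes needed to store n_values, optionally with one scale per block."""
    total_bits = n_values * bits_per_value
    if block_size:
        n_blocks = -(-n_values // block_size)  # ceiling division
        total_bits += n_blocks * scale_bits
    return total_bits / 8

# Hypothetical value count, chosen only for illustration.
HYPOTHETICAL_VALUES = 100_000_000_000

sixteen_bit = weight_bytes(HYPOTHETICAL_VALUES, 16)
# Assumed 4-bit layout: 4-bit values plus an 8-bit scale per 16-value block.
four_bit = weight_bytes(HYPOTHETICAL_VALUES, 4, block_size=16, scale_bits=8)

print(f"16-bit storage: {sixteen_bit / 1e9:.1f} GB")
print(f"4-bit storage:  {four_bit / 1e9:.1f} GB")
print(f"reduction:      {sixteen_bit / four_bit:.2f}x")
```

Under these assumptions the per-block scale adds half a bit per value, so the saving is somewhat less than the 4× that the raw bit widths alone would suggest.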

    All Nemotron 3 models are post-trained using multi-environment reinforcement learning (RL), enabling them to handle tasks spanning mathematical and scientific reasoning, competitive coding, instruction following, software engineering, chat, and multi-agent tool use. The models also support granular reasoning budget control at inference time, allowing developers to cap how much compute is spent on reasoning while maintaining accuracy.
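The idea of a reasoning budget can be illustrated with a small sketch that limits how many tokens a model may spend inside its reasoning segment. This is not the Nemotron 3 mechanism, which the article does not describe. The `<think>` and `</think>` delimiters, the function name `cap_reasoning`, and the token list are all assumptions, and a real serving stack would enforce the cap during generation rather than trimming a finished token list.

```python
def cap_reasoning(tokens, budget, open_tag="<think>", close_tag="</think>"):
    """Keep at most `budget` tokens inside each reasoning segment.

    Tokens outside the reasoning segment pass through unchanged. The tag
    names are hypothetical placeholders for whatever delimiters a model uses.
    """
    kept = []
    in_reasoning = False
    used = 0
    for tok in tokens:
        if tok == open_tag:
            in_reasoning = True
            used = 0
            kept.append(tok)
        elif tok == close_tag:
            in_reasoning = False
            kept.append(tok)
        elif in_reasoning:
            if used < budget:
                kept.append(tok)
                used += 1
            # Reasoning tokens beyond the budget are dropped.
        else:
            kept.append(tok)
    return kept

# Hypothetical token stream: three reasoning tokens followed by an answer.
stream = ["<think>", "step1", "step2", "step3", "</think>", "answer"]
print(cap_reasoning(stream, budget=2))
```

A smaller budget lowers latency and cost per request, and a larger one leaves more room for multi-step problems, which is the trade-off the budget control exposes to developers.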

    NVIDIA has also released a comprehensive suite of datasets, training libraries, and evaluation tools, including over three trillion tokens of pretraining and reinforcement learning data, the NeMo Gym and NeMo RL open-source libraries, and the Nemotron Agentic Safety Dataset for real-world safety evaluation.

    The Nemotron 3 family is aimed at developers, startups, and enterprises that want to build specialized AI agents transparently and efficiently. Nano is available today through Hugging Face, NVIDIA NIM microservices, and major cloud and AI platforms including AWS, Google Cloud, and Microsoft Foundry. Super and Ultra are expected to launch in the first half of 2026.

    Early adopters such as Accenture, ServiceNow, Perplexity, and Palantir are already integrating Nemotron 3 models into AI workflows for manufacturing, cybersecurity, software development, media, and enterprise operations.

    With Nemotron 3, NVIDIA aims to set a new standard for efficient, accurate, and open AI models, one that lets developers scale agentic AI applications from prototype to enterprise deployment while maintaining transparency, cost-efficiency, and state-of-the-art performance.
