
    Google launches speedy Gemini 3.1 Flash-Lite model in preview

    By InfoForTech, March 4, 2026



    Google LLC today debuted Gemini 3.1 Flash-Lite, the latest addition to its Gemini series of multimodal artificial intelligence models.

    The company’s engineers developed the model with cost-efficiency in mind. Gemini 3.1 Pro, Google’s most capable model, starts at $2 per million input tokens and $18 per million output tokens. Those rates increase significantly for demanding workloads. Gemini 3.1 Flash-Lite is priced at $0.25 per million input tokens and $1.50 per million output tokens.
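To make the price gap concrete, the following minimal sketch applies the per-million-token rates quoted above to a made-up workload. The dictionary keys are plain labels rather than official API model identifiers, the workload size is a hypothetical, and the Pro figures are the starting rates only.

```python
# Per-million-token rates quoted in the article, in US dollars.
# The keys are illustrative labels, not official API model identifiers.
RATES = {
    "gemini-3.1-pro": {"input": 2.00, "output": 18.00},
    "gemini-3.1-flash-lite": {"input": 0.25, "output": 1.50},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in dollars for one workload."""
    rate = RATES[model]
    total = input_tokens * rate["input"] + output_tokens * rate["output"]
    return total / 1_000_000


# Hypothetical workload: 10 million input tokens, 2 million output tokens.
pro_cost = estimate_cost("gemini-3.1-pro", 10_000_000, 2_000_000)
lite_cost = estimate_cost("gemini-3.1-flash-lite", 10_000_000, 2_000_000)
print(f"Pro: ${pro_cost:.2f}, Flash-Lite: ${lite_cost:.2f}")
```

Under these assumptions the same workload costs roughly a tenth as much on Flash-Lite, before any of the higher Pro rates for demanding workloads are taken into account.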

    Google says that the model is also faster than other Gemini models. In an internal test, the company compared it against Gemini 2.5 Flash, an earlier model that is likewise optimized for cost-efficiency. Gemini 3.1 Flash-Lite’s overall answer generation speed is 45% higher, and its time to first output token is 2.5 times shorter.
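As a rough illustration of how those two figures combine, the sketch below models total response time as time to first token plus generation time. The baseline latency, baseline throughput, and response length are invented for the example and are not Google's measurements. Only the 45% and 2.5x factors come from the text.

```python
def response_time(ttft_s: float, tokens_per_s: float, output_tokens: int) -> float:
    """Total wait: time to first token plus time to stream the rest."""
    return ttft_s + output_tokens / tokens_per_s


# Hypothetical baseline numbers for the older model (assumptions).
BASELINE_TTFT_S = 1.0
BASELINE_TOKENS_PER_S = 100.0
OUTPUT_TOKENS = 500

# Factors reported in the article.
SPEEDUP = 1.45          # 45% higher generation speed
TTFT_REDUCTION = 2.5    # time to first token is 2.5 times shorter

baseline = response_time(BASELINE_TTFT_S, BASELINE_TOKENS_PER_S, OUTPUT_TOKENS)
faster = response_time(
    BASELINE_TTFT_S / TTFT_REDUCTION,
    BASELINE_TOKENS_PER_S * SPEEDUP,
    OUTPUT_TOKENS,
)
print(f"baseline: {baseline:.2f}s, faster: {faster:.2f}s")
```

The shorter the response, the more the time-to-first-token improvement dominates. For long responses, the throughput gain matters more.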

    The model can process multimodal prompts with up to 1 million tokens worth of data. It generates responses with up to 64,000 tokens of text. That text can include software code, which enables Gemini 3.1 Flash-Lite to generate code-based visual assets such as business intelligence dashboards.
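A developer working within those limits might check request sizes up front and split a large corpus into batches. The sketch below is one way to do that, assuming token counts are already known for each document. It treats the limits as the round figures quoted above, and the helper names are hypothetical.

```python
# Round figures from the article; the exact API limits may differ slightly.
MAX_INPUT_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 64_000


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Check a single request against the quoted input and output limits."""
    return (
        prompt_tokens <= MAX_INPUT_TOKENS
        and requested_output_tokens <= MAX_OUTPUT_TOKENS
    )


def split_into_batches(doc_token_counts, budget=MAX_INPUT_TOKENS):
    """Greedily group documents, in order, so each batch fits the budget."""
    batches, current, used = [], [], 0
    for count in doc_token_counts:
        if count > budget:
            raise ValueError(f"document with {count} tokens exceeds the budget")
        if used + count > budget:
            batches.append(current)
            current, used = [], 0
        current.append(count)
        used += count
    if current:
        batches.append(current)
    return batches


batches = split_into_batches([600_000, 300_000, 200_000, 900_000])
print(batches)
```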

    Google ran 11 benchmark tests to evaluate the model’s output quality. Gemini 3.1 Flash-Lite achieved the top score in six of the tests, besting GPT-5 mini and Anthropic PBC’s Claude 4.5 Haiku. One of the benchmarks that the model completed more accurately is GPQA Diamond, which contains nearly 200 doctorate-level science questions.

    The model achieved a 16% score on Humanity’s Last Exam, or HLE, one of the world’s most difficult AI benchmarks. Google’s top-end Gemini 3.1 Pro scored 44.4%.

    The company sees developers using Gemini 3.1 Flash-Lite for high-volume tasks that don’t require extensive reasoning capabilities. An e-commerce marketplace operator, for example, could use it to translate third-party product listings and block items that breach its terms of service. 
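The marketplace scenario could be structured along the lines of the sketch below. It is not Google's API: the model call is passed in as a plain function (`generate`), the prompt wording and the `VERDICT:` convention are invented for illustration, and a stand-in function replaces the real model so the example runs offline.

```python
from typing import Callable, Dict, List


def build_prompt(listing: str, target_language: str) -> str:
    """Ask for a translation plus a one-line moderation verdict."""
    return (
        f"Translate the following product listing into {target_language}. "
        "Then, on a final line, write VERDICT: ALLOW or VERDICT: BLOCK "
        "depending on whether the item breaches the marketplace rules.\n\n"
        f"Listing: {listing}"
    )


def process_listings(
    listings: List[str],
    target_language: str,
    generate: Callable[[str], str],
) -> List[Dict[str, object]]:
    """Send each listing through the model and parse the verdict line."""
    results = []
    for listing in listings:
        reply = generate(build_prompt(listing, target_language))
        lines = reply.strip().splitlines()
        blocked = bool(lines) and lines[-1].strip().upper() == "VERDICT: BLOCK"
        results.append({"listing": listing, "reply": reply, "blocked": blocked})
    return results


def fake_generate(prompt: str) -> str:
    """Stand-in for a real model call (an assumption, for offline use)."""
    verdict = "BLOCK" if "counterfeit" in prompt.lower() else "ALLOW"
    return f"(translated text)\nVERDICT: {verdict}"


results = process_listings(
    ["Handmade ceramic mug", "Counterfeit designer watch"], "German", fake_generate
)
print([r["blocked"] for r in results])
```

Injecting the model call keeps the parsing logic testable and lets the stand-in be swapped for a real client later.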

    The model also lends itself to certain other tasks. A demo video posted by Google shows a developer using Gemini 3.1 Flash-Lite to generate a weather tracking dashboard with natural language prompts. In another demo, the model added hundreds of illustrative product listings to an e-commerce website prototype. 

    The new model is based on Gemini 3 Pro, which was until recently Google’s flagship reasoning model. The latter algorithm features a mixture-of-experts architecture, which means that it only activates some of its parameters to answer prompts. That approach helps reduce inference costs.
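The idea of activating only some parameters can be shown with a toy top-k routing sketch. This is a generic illustration of mixture-of-experts routing, not a description of how Gemini is built: the experts are trivial functions, and the router scores are fixed by hand here, whereas a real model computes them from the input with a learned gating network.

```python
import math


def softmax(scores):
    """Convert raw scores into weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def moe_forward(x, router_scores, experts, top_k=2):
    """Run only the top_k highest-scoring experts and mix their outputs."""
    ranked = sorted(
        range(len(experts)), key=lambda i: router_scores[i], reverse=True
    )
    active = ranked[:top_k]
    weights = softmax([router_scores[i] for i in active])
    output = sum(w * experts[i](x) for w, i in zip(weights, active))
    return output, active


# Four toy "experts"; only two of them are evaluated per input.
experts = [lambda x: x + 1, lambda x: x * 2, lambda x: x - 3, lambda x: x * x]
router_scores = [0.1, 2.0, -1.0, 2.0]  # hypothetical, hand-picked scores

output, active = moe_forward(3.0, router_scores, experts, top_k=2)
print(f"active experts: {active}, output: {output}")
```

Because the two inactive experts are never called, the compute per input scales with `top_k` rather than with the total number of experts, which is the source of the inference savings.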

    Gemini 3.1 Flash-Lite is available in preview through Google Cloud’s Vertex AI suite of AI services. It’s also accessible via the Google AI Studio code generation tool, which enables developers to build simple applications with natural language prompts.

    Image: Google

