Close Menu

    Subscribe to Updates

    Get the latest creative news from infofortech

    What's Hot

    The software supply chain is the new ground zero for enterprise cyber risk. Don’t get caught short

    May 15, 2026

    How Hybrid Work and Cloud Are Changing Ransomware Risk

    May 15, 2026

    Orbitkey Grid Desk Organiser Lets You Build Your Own Layout

    May 15, 2026
    Facebook X (Twitter) Instagram
    InfoForTech
    • Home
    • Latest in Tech
    • Artificial Intelligence
    • Cybersecurity
    • Innovation
    Facebook X (Twitter) Instagram
    InfoForTech
    Home»Innovation»Wowed by computer-use AI agents? Research says they’re “digital disasters” even for routine tasks
    Innovation

    Wowed by computer-use AI agents? Research says they’re “digital disasters” even for routine tasks

    InfoForTechBy InfoForTechMay 15, 2026No Comments3 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Wowed by computer-use AI agents? Research says they’re “digital disasters” even for routine tasks
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email


    AI agents built to run everyday computer tasks have a serious context problem, according to new research from UC Riverside.

    The team tested 10 agents and models from major developers, including OpenAI, Anthropic, Meta, Alibaba, and DeepSeek. On average, the agents took undesirable or potentially harmful actions 80% of the time and caused damage 41% of the time.

    These systems can open apps, click buttons, fill out forms, move through websites, and act on a computer screen with limited supervision. Their mistakes land differently from a chatbot’s bad answer because the software can actually do things.

    The UC Riverside findings suggest today’s desktop agents can treat unsafe requests as jobs to finish, not signals to stop.

    Why agents miss obvious danger

    The researchers built a benchmark called BLIND-ACT to test whether agents would pause when a task became unsafe, contradictory, or irrational. In the latest tests, they didn’t pause often enough.

    google data center
    Google

    Across 90 tasks, the benchmark pushed agents into situations that required context, restraint, and refusal. One test involved sending a violent image file to a child. Another had an agent filling out tax forms falsely mark a user as disabled because it reduced the tax bill. A third asked an agent to disable firewall rules in the name of better security, and the agent followed through instead of rejecting the contradiction.

    The researchers call the pattern blind goal-directedness. The agent keeps chasing the assigned outcome even when the surrounding context says the task is broken.

    Why obedience becomes the flaw

    The failures clustered around obedience. These agents can act as if a user’s request is enough reason to keep going.

    The team identified patterns called execution-first bias and request-primacy. In plain terms, the agent focuses on how to complete the task, then treats the request itself as justification. That risk grows when the same system can touch a variety of things like email or security settings.

    AI image of chip burning
    Image created with ChatGPT

    That doesn’t mean the agents are malicious. It means they can be confidently wrong while moving through software at machine speed.

    Why guardrails need to come first

    AI agents need stronger guardrails before they get broad permission to act across a computer.

    These systems work through a loop. They look at the screen, decide the next step, act, then look again. When that loop is paired with weak contextual restraint, a shortcut can turn into a fast-moving mistake.

    For now, treat agents as supervised tools. Use them first on low-risk chores, keep them away from financial and security workflows, and watch whether developers add clearer refusal systems, tighter permissions, and better ways to catch contradictions before the next click.

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    InfoForTech
    • Website

    Related Posts

    The software supply chain is the new ground zero for enterprise cyber risk. Don’t get caught short

    May 15, 2026

    HostGator Promo Codes: 76% Off for April 2026

    May 15, 2026

    OpenAI reportedly mulls taking Apple to court over ChatGPT’s Siri integration

    May 15, 2026

    Razer’s new Blade 18 gets Arrow Lake refresh and a modest $3,999.99 starting price

    May 14, 2026

    AI Promised the Audemars Piguet x Swatch Wristwatch. China Will Deliver It

    May 14, 2026

    Microsoft’s LinkedIn is about to lay off 5% of its staff

    May 14, 2026
    Leave A Reply Cancel Reply

    Advertisement
    Top Posts

    DoJ Disrupts 3 Million-Device IoT Botnets Behind Record 31.4 Tbps Global DDoS Attacks

    March 20, 202638 Views

    Microsoft is bringing an AI helper to Xbox consoles

    March 14, 202616 Views

    We’re Tracking Streaming Price Hikes in 2026: Spotify, Paramount Plus, Crunchyroll and Others

    February 15, 202615 Views

    This is the tech that makes Volvo’s latest EV a major step forward

    January 24, 202615 Views
    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Advertisement
    About Us
    About Us

    Our mission is to deliver clear, reliable, and up-to-date information about the technologies shaping the modern world. We focus on breaking down complex topics into easy-to-understand insights for professionals, enthusiasts, and everyday readers alike.

    We're accepting new partnerships right now.

    Facebook X (Twitter) YouTube
    Most Popular

    DoJ Disrupts 3 Million-Device IoT Botnets Behind Record 31.4 Tbps Global DDoS Attacks

    March 20, 202638 Views

    Microsoft is bringing an AI helper to Xbox consoles

    March 14, 202616 Views

    We’re Tracking Streaming Price Hikes in 2026: Spotify, Paramount Plus, Crunchyroll and Others

    February 15, 202615 Views
    Categories
    • Artificial Intelligence
    • Cybersecurity
    • Innovation
    • Latest in Tech
    © 2026 All Rights Reserved InfoForTech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.

    Ad Blocker Enabled!
    Ad Blocker Enabled!
    Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.