Close Menu
Tech Savvyed
  • Home
  • News
  • Artificial Intelligence
  • Gadgets
  • Apps
  • Mobile
  • Gaming
  • Accessories
  • More
    • Web Stories
    • Spotlight
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
Everything We Know About 1666: Amsterdam

Everything We Know About 1666: Amsterdam

6 June 2026
Netflix says there is no future for theatrical releases in its streaming universe

Netflix says there is no future for theatrical releases in its streaming universe

6 June 2026
Free Blasphemous 2 DLC Called The Third Sin Just Shadow Dropped During The Future Games Show

Free Blasphemous 2 DLC Called The Third Sin Just Shadow Dropped During The Future Games Show

6 June 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Tech Savvyed
SUBSCRIBE
  • Home
  • News
  • Artificial Intelligence
  • Gadgets
  • Apps
  • Mobile
  • Gaming
  • Accessories
  • More
    • Web Stories
    • Spotlight
    • Press Release
Tech Savvyed
Home»News»Wowed by computer-use AI agents? Research says they’re “digital disasters” even for routine tasks
News

Wowed by computer-use AI agents? Research says they’re “digital disasters” even for routine tasks

News RoomBy News Room15 May 20263 Mins Read
Wowed by computer-use AI agents? Research says they’re “digital disasters” even for routine tasks
Share
Facebook Twitter Reddit Telegram Pinterest Email

AI agents built to run everyday computer tasks have a serious context problem, according to new research from UC Riverside.

The team tested 10 agents and models from major developers, including OpenAI, Anthropic, Meta, Alibaba, and DeepSeek. On average, the agents took undesirable or potentially harmful actions 80% of the time and caused damage 41% of the time.

These systems can open apps, click buttons, fill out forms, move through websites, and act on a computer screen with limited supervision. Their mistakes land differently from a chatbot’s bad answer because the software can actually do things.

The UC Riverside findings suggest today’s desktop agents can treat unsafe requests as jobs to finish, not signals to stop.

Why agents miss obvious danger

The researchers built a benchmark called BLIND-ACT to test whether agents would pause when a task became unsafe, contradictory, or irrational. In the latest tests, they didn’t pause often enough.

Across 90 tasks, the benchmark pushed agents into situations that required context, restraint, and refusal. One test involved sending a violent image file to a child. Another had an agent filling out tax forms falsely mark a user as disabled because it reduced the tax bill. A third asked an agent to disable firewall rules in the name of better security, and the agent followed through instead of rejecting the contradiction.

The researchers call the pattern blind goal-directedness. The agent keeps chasing the assigned outcome even when the surrounding context says the task is broken.

Why obedience becomes the flaw

The failures clustered around obedience. These agents can act as if a user’s request is enough reason to keep going.

The team identified patterns called execution-first bias and request-primacy. In plain terms, the agent focuses on how to complete the task, then treats the request itself as justification. That risk grows when the same system can touch a variety of things like email or security settings.

AI image of chip burning

That doesn’t mean the agents are malicious. It means they can be confidently wrong while moving through software at machine speed.

Why guardrails need to come first

AI agents need stronger guardrails before they get broad permission to act across a computer.

These systems work through a loop. They look at the screen, decide the next step, act, then look again. When that loop is paired with weak contextual restraint, a shortcut can turn into a fast-moving mistake.

For now, treat agents as supervised tools. Use them first on low-risk chores, keep them away from financial and security workflows, and watch whether developers add clearer refusal systems, tighter permissions, and better ways to catch contradictions before the next click.

Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
Previous ArticleBombshell OpenAI lawsuit claims your ChatGPT convos were shared with Google and Meta
Next Article Samsung PenUp adds new stylus tricks to your Galaxy phone, if it support an S Pen

Related Articles

Netflix says there is no future for theatrical releases in its streaming universe

Netflix says there is no future for theatrical releases in its streaming universe

6 June 2026
The post-warranty graveyard is filling up with working gadgets

The post-warranty graveyard is filling up with working gadgets

6 June 2026
The hidden labor of modern tech support is turning us all into unpaid employees

The hidden labor of modern tech support is turning us all into unpaid employees

6 June 2026
3 underrated TV series on Hulu you should watch this weekend (June 5-7)

3 underrated TV series on Hulu you should watch this weekend (June 5-7)

6 June 2026
Apple could offer MacBook Ultra in two sizes with one-of-a-kind OLED display

Apple could offer MacBook Ultra in two sizes with one-of-a-kind OLED display

6 June 2026
This animated film with 99% RT score is one of the 3 underrated Apple TV movies to watch this weekend [June 5-7]

This animated film with 99% RT score is one of the 3 underrated Apple TV movies to watch this weekend [June 5-7]

6 June 2026
Demo
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Don't Miss
Netflix says there is no future for theatrical releases in its streaming universe

Netflix says there is no future for theatrical releases in its streaming universe

By News Room6 June 2026

Netflix may be willing to send Greta Gerwig’s upcoming Narnia movie into theaters, but if…

Free Blasphemous 2 DLC Called The Third Sin Just Shadow Dropped During The Future Games Show

Free Blasphemous 2 DLC Called The Third Sin Just Shadow Dropped During The Future Games Show

6 June 2026
Cairn’s Free On The Trail: Deep Water DLC Drops August 13

Cairn’s Free On The Trail: Deep Water DLC Drops August 13

6 June 2026
Xbox Exclusive Grounded 2 Launches On PlayStation 5 This August Alongside Underwater ‘Into The Abyss’ Update

Xbox Exclusive Grounded 2 Launches On PlayStation 5 This August Alongside Underwater ‘Into The Abyss’ Update

6 June 2026
Tech Savvyed
Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact
© 2026 Tech Savvyed. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.