• Topics
  • Artificial wisdom
  • Autopilot
  • network
  • Processor
  • 手機
  • exhibition activities
    • CES
      • CES 2014
      • CES 2015
      • CES 2016
      • CES 2017
      • CES 2018
      • CES 2019
      • CES 2020
    • MWC
      • MWC 2014
      • MWC 2015
      • MWC 2016
      • MWC 2017
      • MWC 2018
      • MWC 2019
    • Computex
      • Computex 2014
      • Computex 2015
      • Computex 2016
      • Computex 2017
      • Computex 2018
      • Computex 2019
    • E3
      • E3 2014
      • E3 2015
      • E3 2016
      • E3 2017
    • IFA
      • IFA 2014
      • IFA 2015
      • IFA 2016
      • IFA 2017
    • TGS
      • TGS 2016
  • About us
    • About mashdigi
    • mashdigi website contact details
2026/01/20 06:11 Tuesday
  • Login
mashdigi-Technology, new products, interesting news, trends
  • Topics
  • Artificial wisdom
  • Autopilot
  • network
  • Processor
  • 手機
  • exhibition activities
    • CES
      • CES 2014
      • CES 2015
      • CES 2016
      • CES 2017
      • CES 2018
      • CES 2019
      • CES 2020
    • MWC
      • MWC 2014
      • MWC 2015
      • MWC 2016
      • MWC 2017
      • MWC 2018
      • MWC 2019
    • Computex
      • Computex 2014
      • Computex 2015
      • Computex 2016
      • Computex 2017
      • Computex 2018
      • Computex 2019
    • E3
      • E3 2014
      • E3 2015
      • E3 2016
      • E3 2017
    • IFA
      • IFA 2014
      • IFA 2015
      • IFA 2016
      • IFA 2017
    • TGS
      • TGS 2016
  • About us
    • About mashdigi
    • mashdigi website contact details
No Result
View All Result
  • Topics
  • Artificial wisdom
  • Autopilot
  • network
  • Processor
  • 手機
  • exhibition activities
    • CES
      • CES 2014
      • CES 2015
      • CES 2016
      • CES 2017
      • CES 2018
      • CES 2019
      • CES 2020
    • MWC
      • MWC 2014
      • MWC 2015
      • MWC 2016
      • MWC 2017
      • MWC 2018
      • MWC 2019
    • Computex
      • Computex 2014
      • Computex 2015
      • Computex 2016
      • Computex 2017
      • Computex 2018
      • Computex 2019
    • E3
      • E3 2014
      • E3 2015
      • E3 2016
      • E3 2017
    • IFA
      • IFA 2014
      • IFA 2015
      • IFA 2016
      • IFA 2017
    • TGS
      • TGS 2016
  • About us
    • About mashdigi
    • mashdigi website contact details
No Result
View All Result
mashdigi-Technology, new products, interesting news, trends
No Result
View All Result
Home App

Google previews Gemini 2.5 Computer Use model, enabling AI models to open web pages, click a mouse, and fill out forms like humans
Can simulate human operation interface to complete tasks

Author: Mash Yang
2025-10-08
in App, Market dynamics, Life, network, software, Topics
A A
0
Share to FacebookShare on TwitterShare to LINE

Google announces a preview of its latest generation of AI models「Gemini 2.5 Computer Use」This model not only understands text and images but can also actually "operate" web interfaces like a human. Through actions like clicking, scrolling, typing, and dragging, the AI ​​can complete tasks without API connections, such as filling out forms, submitting data, or searching for information on the web. This technology allows AI to go beyond simply answering questions and directly "act."

Google previews Gemini 2.5 Computer Use model, enabling AI models to open web pages, click a mouse, and fill out forms like humans

AI simulates human operating interfaces, opening up new application scenarios

Google states that the Gemini 2.5 Computer Use model possesses "visual understanding and reasoning capabilities," enabling it to observe web content and perform actions based on user instructions. This allows AI to interact with web interfaces and other user interfaces without relying on API connections. Applications include UI testing, automated operations, data collection, and internal enterprise tool integration.

The model currently supports 13 types of operation commands, including opening web pages, entering text, clicking buttons, and dragging and dropping elements. Google pointed out that this feature does not yet support full desktop system control, but it performs better than its peers in multiple web and mobile operation benchmarks.

Extended from Project Mariner, it can automatically complete browsing tasks

Gemini 2.5 Computer Use is actually a previous research project of GoogleProject MarinerThis is an extension of the project, which has demonstrated that AI can autonomously complete complex tasks in the browser, such as automatically adding items to a shopping cart based on a list of ingredients.

This new version has been integrated into the Gemini platform and supports developers to access it on Google AI Studio and Vertex AI.

Google previews Gemini 2.5 Computer Use model, enabling AI models to open web pages, click a mouse, and fill out forms like humans

Competition heats up for AI "action agents" against OpenAI and Anthropic

Google's new announcement comes on the heels of OpenAI'sDev Day EventAnnounce onNew ChatGPT App and Agent FeaturesAfterwards, it was emphasized that AI can complete multi-step tasks autonomously. Anthropic also launched a computer-operatedClaude Computer Use Model.

Unlike competitors that allow AI to control the entire computer environment, Google emphasizes that Gemini is currently limited to browser-level operations, aiming to ensure both security and controllability. Even so, Google stated that the model still "surpasses other mainstream alternatives" in multiple real-world tests and will continue to be optimized to support more interactions and application scenarios.

AI moves from "talking" to "operating"

Gemini 2.5 Computer Use represents a new phase in generative AI's evolution from "language understanding" to "action capability." In the future, developers will be able to not only instruct AI to answer questions through commands, but also enable it to directly execute operational tasks.

Between human operation and automation, Google is clearly trying to create a new AI interaction model - making AI not just an assistant, but a virtual agent that can actually "get its hands dirty."

Tags: AIGeminiGemini 2.5 Computer UseGoogleProject MarinerArtificial wisdom
ShareTweetShare
Mash Yang

Mash Yang

Founder and editor of mashdigi.com, and student of technology journalism.

Leave a comment Cancel reply

Your e-mail address Will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Translation (Tanslate)

Recent updates:

Google Gemini's enterprise sales have skyrocketed! Improved model quality drives Google Cloud server revenue.

Google Gemini's enterprise sales have skyrocketed! Improved model quality drives Google Cloud server revenue.

2026-01-20
Xbox Game Pass gets a complete overhaul, restructured into three new plans, and its cloud streaming service officially leaves beta.

Microsoft is reportedly set to release an "ad-supported" version of Xbox Cloud Gaming, allowing users to play cloud-streamed games for free by watching ads.

2026-01-20
OpenAI launches "Deep Research" feature that can further deepen search and easily compile online information into comprehensive reports

OpenAI has revealed its trump card in a rare move: revenue is expected to exceed $200 billion by 2025!

2026-01-19
mashdigi-Technology, new products, interesting news, trends

Copyright © 2017 mashdigi.com

  • About mashdigi.com
  • Place ads
  • Contact mashdigi.com

Follow us

Welcome back!

Login to your account below

Forgotten Password?

Retrieve your password

Hãy nhập tên người dùng hoặc địa chỉ email để mở mật khẩu

Log In
No Result
View All Result
  • About mashdigi.com
  • Place ads
  • Contact mashdigi.com

Copyright © 2017 mashdigi.com