mashdigi-Technology, new products, interesting news, trends

Google announces "StreetViewAI" research, allowing visually impaired people to explore the world of Street View through conversations

Author: Mash Yang
2025-10-06
in Market dynamics, Life, network, software

Google Research and Google DeepMind recently published new research proposing a system called "StreetViewAI" that aims to address Street View's long-standing reliance on vision, which has limited its usefulness to visually impaired people. The system lets them explore Google Street View's database of more than 220 billion images, covering more than 100 countries, through conversation with an AI.


Traditional street view services, centered around immersive 360-degree images, can provide general users with intuitive environmental perception, but are not very user-friendly for the visually impaired who must rely on hearing or assistive tools.

StreetViewAI was designed to change this situation. Using a multimodal model based on Google's Gemini 2.0 Flash, the research team built three subsystems: "AI Describer," "AI Chat Agent," and "AI Tour Guide."

The AI Describer instantly converts objects, spatial relationships, and navigation clues in the image into concise voice descriptions. The AI Chat Agent allows users to freely ask questions such as "Is this sidewalk shaded?", "Is the cafe entrance wheelchair accessible?", and even "Are there any surprising attractions along this route?" The AI can provide answers based on previous perspectives and the context of the conversation.

As for the AI Tour Guide, it further provides guided tour information on history, culture and architectural background, making the exploration process more in-depth.

StreetViewAI function summary table:

Subsystem | Main function | Usage scenarios / examples
AI Describer | Real-time voice description of important objects, spatial relationships, and navigation clues in the image | Users hear information such as "There is a bus stop 10 meters ahead" and "There is a pedestrian crossing on the right"
AI Chat Agent | Natural conversational interaction that answers users' scenario-specific questions and preserves the conversation context | "Is this path shaded?", "Is the cafe entrance wheelchair accessible?", "Are there any surprises along this route?"
AI Tour Guide | Supplementary guide information, including historical background, cultural significance, and architectural style | Describing the history or architectural features of a building while exploring the streets of Paris

In testing, the research team recruited 11 visually impaired participants who regularly use white canes and screen readers, and gave them two tasks: finding a destination and free exploration. Participants used the AI Chat Agent 917 times, far more than the 136 times they used the AI Describer, suggesting that conversational interaction better met their needs.

Statistics show that the AI correctly answered 86.3% of questions, with an incorrect answer rate of only 3.9%. The most frequently asked topics were spatial relationships (27%), object presence confirmation (26.5%), and immediate scene description (18.4%).


Participants used voice as their primary mode of interaction, accounting for over 90% of interactions. One tester noted that previous navigation systems often only got them to within a few meters of a destination, whereas StreetViewAI not only led them to the door but also described the door's appearance and accessibility, providing more precise guidance.

This research highlights Google's ambitions in multimodal AI applications and demonstrates that AI is more than just a tool for entertainment or productivity; it can also serve as a vital bridge to improving the quality of life for vulnerable groups. With continued improvements to its accuracy and support, StreetViewAI may not only transform the digital experience for the visually impaired, but also expand into broader application scenarios such as education, tourism, and smart city navigation.

Tags: AI, Gemini, Gemini Flash, Google, Google DeepMind, Google Maps, Google Research, StreetView, Artificial intelligence, Street View
Mash Yang

Founder and editor of mashdigi.com, and student of technology journalism.


Copyright © 2017 mashdigi.com
