• Topics
  • Artificial wisdom
  • Autopilot
  • network
  • Processor
  • 手機
  • exhibition activities
    • CES
      • CES 2014
      • CES 2015
      • CES 2016
      • CES 2017
      • CES 2018
      • CES 2019
      • CES 2020
    • MWC
      • MWC 2014
      • MWC 2015
      • MWC 2016
      • MWC 2017
      • MWC 2018
      • MWC 2019
    • Computex
      • Computex 2014
      • Computex 2015
      • Computex 2016
      • Computex 2017
      • Computex 2018
      • Computex 2019
    • E3
      • E3 2014
      • E3 2015
      • E3 2016
      • E3 2017
    • IFA
      • IFA 2014
      • IFA 2015
      • IFA 2016
      • IFA 2017
    • TGS
      • TGS 2016
  • About us
    • About mashdigi
    • mashdigi website contact details
2025/12/11 10:50 Thursday
  • Login
mashdigi-Technology, new products, interesting news, trends
  • Topics
  • Artificial wisdom
  • Autopilot
  • network
  • Processor
  • 手機
  • exhibition activities
    • CES
      • CES 2014
      • CES 2015
      • CES 2016
      • CES 2017
      • CES 2018
      • CES 2019
      • CES 2020
    • MWC
      • MWC 2014
      • MWC 2015
      • MWC 2016
      • MWC 2017
      • MWC 2018
      • MWC 2019
    • Computex
      • Computex 2014
      • Computex 2015
      • Computex 2016
      • Computex 2017
      • Computex 2018
      • Computex 2019
    • E3
      • E3 2014
      • E3 2015
      • E3 2016
      • E3 2017
    • IFA
      • IFA 2014
      • IFA 2015
      • IFA 2016
      • IFA 2017
    • TGS
      • TGS 2016
  • About us
    • About mashdigi
    • mashdigi website contact details
No Result
View All Result
  • Topics
  • Artificial wisdom
  • Autopilot
  • network
  • Processor
  • 手機
  • exhibition activities
    • CES
      • CES 2014
      • CES 2015
      • CES 2016
      • CES 2017
      • CES 2018
      • CES 2019
      • CES 2020
    • MWC
      • MWC 2014
      • MWC 2015
      • MWC 2016
      • MWC 2017
      • MWC 2018
      • MWC 2019
    • Computex
      • Computex 2014
      • Computex 2015
      • Computex 2016
      • Computex 2017
      • Computex 2018
      • Computex 2019
    • E3
      • E3 2014
      • E3 2015
      • E3 2016
      • E3 2017
    • IFA
      • IFA 2014
      • IFA 2015
      • IFA 2016
      • IFA 2017
    • TGS
      • TGS 2016
  • About us
    • About mashdigi
    • mashdigi website contact details
No Result
View All Result
mashdigi-Technology, new products, interesting news, trends
No Result
View All Result
Home Life

Amazon launches new Nova Sonic model that can deeply understand human conversations and capture tone and intonation
Unifying speech understanding and speech generation into a single model makes voice conversations in AI applications more like real-life interactions.

Author: Mash Yang
2025-04-09
in Life, network, software
A A
0
Share to FacebookShare on TwitterShare to LINE

Amazon announces new base modelAmazon Nova Sonic, unifying speech understanding and speech generation into a single model, making the voice conversation performance of artificial intelligence application services closer to that of real people. It can be called through Amazon Bedrock in the form of an API and can be used for service call automation services or cross-industry artificial intelligence agent services covering fields such as tourism, education, medical care, and entertainment.

advertisement

Traditional voice application development requires coordinating multiple models simultaneously, such as a speech recognition model that converts speech into text, a large language model that understands and generates responses, and a text-to-speech model that converts text into audio presentation. This not only increases development complexity, but also makes it difficult to preserve the vocal context and nuances that are crucial in natural conversations, such as tone, intonation, and speaking style.

Nova Sonic, on the other hand, abandons the previous design of using multiple different models and unifies the understanding and generation functions into a single model, allowing the model to adjust the generated voice responses based on the sound context such as tone and style, as well as the spoken input, to make the performance closer to the intonation of natural conversation.

Nova Sonic can even understand the subtle nuances of human conversation, including natural pauses and hesitations, enabling it to respond appropriately and gracefully handle interruptions. The model also generates text from the spoken content, allowing developers to use this text to call specific tools and APIs, thereby building richer voice AI agent services.

You can experience the natural intonation performance generated by Nova Sonic through the following link:

• AI agent for travel built on Amazon Nova Sonic

• Enterprise AI assistant built on Amazon Nova Sonic

Tags: AIAmazonAWSNovaNova SonicAmazonArtificial wisdom
ShareTweetShare
Mash Yang

Mash Yang

Founder and editor of mashdigi.com, and student of technology journalism.

Leave a comment Cancel reply

Your e-mail address Will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

mashdigi-Technology, new products, interesting news, trends

Copyright © 2017 mashdigi.com

  • About mashdigi.com
  • Place ads
  • Contact mashdigi.com

Follow us

Welcome back!

Login to your account below

Forgotten Password?

Retrieve your password

Hãy nhập tên người dùng hoặc địa chỉ email để mở mật khẩu

Log In
No Result
View All Result
  • About mashdigi.com
  • Place ads
  • Contact mashdigi.com

Copyright © 2017 mashdigi.com

Go to Mobile Version