Azure AI milestone: New Neural Text-to-Speech models more closely mirror natural speech - Microsoft Research

Imported December 17, 2021

Published December 17, 2021

Sheng Zhao

Partner Group Engineering Manager

Research Area

Artificial intelligence

Neural Text-to-Speech, along with recent milestones in computer vision and question answering, is part of a larger Azure AI mission to provide relevant, meaningful AI solutions and services. These services work better for people because they better capture how people learn and work, with improved vision, knowledge understanding, and speech capabilities. At the center of these efforts is XYZ-code, a joint representation of three cognitive attributes: monolingual text (X), audio or visual sensory signals (Y), and multilingual (Z). For more information about these efforts, read the XYZ-code blog post.

Neural Text-to-Speech (Neural TTS), a speech synthesis capability of Azure Cognitive Services, enables developers to convert text to lifelike speech. It is used in voice assistants, read-aloud features for content, accessibility tools, and more. Neural TTS has now reached a significant milestone [...]
