From the course: Microsoft Azure AI Essentials: Workloads and Machine Learning on Azure
Unlock this course with a free trial
Join today to access over 24,700 courses taught by industry experts.
Understanding speech recognition and synthesis
From the course: Microsoft Azure AI Essentials: Workloads and Machine Learning on Azure
Understanding speech recognition and synthesis
- [Instructor] Speech recognition converts spoken words into text. It processes input from audio files or live microphone input by analyzing speech patterns and mapping them to words. This process typically involves two key models. The acoustic model converts audio signals into phonemes, the basic sounds of speech. The language model maps these phonemes to words, predicting the most likely word sequences. Speech recognition has many applications, including generating captions for videos, transcribing phone calls or meetings, automating note taking, and interpreting user input for further actions. Now speech synthesis is the reverse process, turning text into speech. A speech synthesis system requires the text to be spoken and a voice to vocalize the text. This technology is useful in phone apps that respond with voice, navigation systems providing directions, reading messages, emails, or books aloud, and broadcasting public announcements, such as in airports or train stations. These…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
-
(Locked)
Overview of natural language processing2m 24s
-
(Locked)
Introduction to Azure AI Language3m 40s
-
(Locked)
Introduction to Azure AI Translator2m 7s
-
(Locked)
Understanding speech recognition and synthesis1m 38s
-
(Locked)
Introduction to Azure AI Speech3m 20s
-
(Locked)
Practical application of natural language processing in business2m 52s
-
(Locked)
Creating an Azure AI Language and Azure AI Speech resource2m 1s
-
(Locked)
Azure AI Language demo3m 31s
-
(Locked)
Azure AI Speech demo4m 6s
-
(Locked)
-
-
-
-
-