Data Processed by Speech to Text
Azure Custom Speech Service processes various types of data, including audio input or voice audio, input transcription text, and transcriptions for speech translation. It accepts voice audio as input and uses it for transcription services. The service also assesses pronunciations based on transcribed text in pronunciation assessment tasks. Additionally, speech translation involves transcribing text and translating it into a specified language through the Translator service.
Data Processing by Speech to Text
In real-time speech to text scenarios, audio input is processed by the speech recognition engine on Azure's server memory without storing data at rest. All data in transit are encrypted for protection. For batch transcription, customers specify storage locations for audio input and output transcription text files. Customers control data storage and retention, including setting retention times for generated transcription files.
Speaker Diarization/Separation
Azure Custom Speech Service offers speaker separation (diarization) for both real-time and batch APIs. When enabled, the engine analyzes audio input to differentiate between speakers. Unique voice characteristics signals are used temporarily to annotate the transcription output with speaker markers. Signal data for speaker separation is discarded post-process and supports multiple speakers within a single audio file.
Language Detection and Translation
Language detection in Azure Custom Speech Service calculates probabilities of mapping between phonemes and languages to identify spoken languages in audio input. Speech translation involves machine transcription followed by text translation services for language conversion. Translated text can also be converted into audio format if needed.
Stay Ahead in Today’s Competitive Market!
Unlock your company’s full potential with a Virtual Delivery Center (VDC). Gain specialized expertise, drive
seamless operations, and scale effortlessly for long-term success.
Book a Meeting to Avail the Services of Azure Custom Speech Service