Google Releases Largest Overhaul For Cloud Speech To Text Engine

Late last month, Google released its Cloud Text-to-speech engine to developers worldwide which featured 32 different voices spanning across 12 languages and variants. Now, the company has released a major update for another product from its Cloud AI speech lineup- the Cloud Speech-to-text engine (formerly known as the Cloud Speech API).

At least a few of these could have real world consumer applications – such as using the engine for transcribing voice recordings.

The API can support up to 4 speakers for phone calls and over 4 speakers on video calls, while seamlessly accounting for background noise, static from the phone line, and other agents.