News

Google’s Cloud Speed-to-Text API can be used to transcribe short and long-form audio in 120 languages and dialects in near real-time.
Google Cloud on Tuesday announced the general availability of its Cloud Text-to-Speech API, which lets developers add natural-sounding speech to their devices or applications.
The Google Cloud Speech-to-Text API got a series of upgrades today, including the ability to speak 7 new languages, as well as more automatic punctuation.
According to Google, Cloud Text-to-Speech can be used to power call center voice response systems, enabling Internet of things devices to talk and converting text-based media into spoken formats.
Following last month’s Cloud Text-to-Speech update that added more natural voices through DeepMind WaveNet models, Google is now revamping the inverse of that API. Cloud Speech-to-Text is today ...
The Cloud Speech API, in a nutshell, allows third-party developers and companies to integrate Google’s speech recognition smarts into their own products.
With the service, Google is targeting three main markets, including voice response systems for call centers, for which Cloud Text-to-Speech can provide real-time, natural-language conversation.
With the release of Audio Profiles, you can optimize Cloud Text-to-Speech for playback on different types of hardware. There are also updates to the counterpart Cloud Speech-to-Text API.
Google’s WaveNet technology in Cloud Text-to-Speech is getting better at sounding like a real person, according to a panel of real people. (Google Photo) ...