Google speech API

I’m now working with my project and I’m about to build a Siri-like application for the desktop computer. I am thinking if Google Speech API is reliable and accurate for speech recognition? Can you suggest to me what speech API is the most accurate in terms of speech recognition? Most preferably a free API. Thank you.

Google Speech Recognition API

I’m trying to use the Google Speech API v2 (at address https://www.google.com/speech-api/v2/recognize?…) I need to use my Api Key, but when I use it I get error 403 Forbidden When I use an API key t

Using Google Speech API

What is the code for implementing the Google Speech API in my C# based application? I found out that it is possible to create an audio file and sent it to http://slides.html5rocks.com/#speech-input an

Using Google Speech API

What is the code for implementing the Google Speech API in my C# based application? I found out that it is possible to create an audio file and sent it to http://slides.html5rocks.com/#speech-input an



Grammar in Google speech API

Can I improve google speech API recognition by give him a words list (in my case the request of user is very predictable) to make recognition more accurate? Thanks.

Decoding Google Speech API response in python

I’m trying to use the Google Speech API in Python. I load a .flac file like this: url = https://www.google.com/speech-api/v1/recognize?xjerr=1&client=chromium&lang=en-US audio = open(‘temp_v

API limit for Google Android Speech recognition

Android has a speech recognition API well documented over here. Now, if I plan to integrate this in my application, I want to know what is limit to the API calls/requests that I can make to the Google

Java Google Speech Recognition API

I developed a Java Application that used Google Speech Recognition, but from few days/weeks ago it doesn’t work, i’m receiving this message. java.io.IOException: Server returned HTTP response code: 4

Does Anyone Uses Google Speech API in Production?

I have found some articles (http://mikepultz.com/2011/03/accessing-google-speech-api-chrome-11/) describing how to use the Google speech API. However, is this reliable? Does anyone uses it in producti

Google Text-To-Speech API

I want to know how can I use google text to speech API in my .net project. I think I need to call a URL to use the web service, but the idea for me is not clear. can anyone help

JAVA using google speech recognition API

I’m trying to use google speech recognition API. Here’s the code i’ve written: http://pastebin.com/zJEhnJ74 It works. I get an answer from the server: {status:5,id:8803471b14a2310dfcf917754e8bd4a

Answers

While the Google speech API is free it is not an official public API. Some people have reverse engineered it, as is discussed in this blog. If you are planning on accessing the API directly for a commercial product I would not recommend it because they can drop it or change it without warning, breaking your product. This recently happened to developers that used the Google Weather API. If you are accessing it through a Chrome browser using x-webkit-speech on the other hand you are probably safe since it is supported by Google. Google’s speech recognition is right up there with a lot of the more popular commercial solutions. They have a lot of experience with it in other projects like Google Voice and the now defunct Google 411. They have some of the top speech scientists working for them. The only other free alternative I can think of is Sphinx which is an open source project out of Carnegie Mellon University. Steep learning curve using this solution and if you want it to be setup as a service you will have to develop that yourself. Nuance is the other big player in the speech recognition market (I believe that is what Siri uses) and they do have solutions that offer speech recognition as a service. But they are pricey.

Update on Answer From Comments on Language Support

Windows Speech Recognition supports other languages, as does most speech recognition systems. But the caveat is that you have to tell the system what language to use and it has to support the language in question. Each vendor has a list of languages it supports and they are specific to a region. For example a vendor may support Mexican Spanish, American Spanish and Spain Spanish; which all have slightly different dialects. But the speech recognition engine can only support one language/dialect at a timer per user. A user cannot speak multiple languages to a speech recognition system without first requesting it to change to that language.

Updated 3/17/2014

The x-webkit-speech input field is being deprecated due to lack of support in other browsers. This will be replaced with the Web Speech API, which is a javascript API. You can find an example on how to use it here.

I’ve got an updated full-duplex PHP example here:

http://mikepultz.com/2013/07/google-speech-api-full-duplex-php-version/