iPhone App › Add voice recognition? [closed]

Asked 2/6, 2009 at 22:50 Answered 8/1, 2013 at 11:36

iphone speech-recognition voice-recording speech-to-text

I'd like to build an app that uses voice recognition. I've seen big companies like Google etc implement this feature, but I'm curious about doing it on a start-up level. Anyone looked into this? Are there any tools out there for us to do this?

Chiffon answered 2/6, 2009 at 22:50 Comment(2)

i think you need to provide more details - what you want the app to do, etc... – Plaything 2/6, 2009 at 22:53

If you are looking to ad Voice to Text control to your iPhone then read this thread surreystore.com/cms/technology/7-apple/… – Hunkydory 10/2, 2011 at 2:37

OpenEars looks promising... http://www.politepix.com/openears/

Based on Pocket Sphinx.

Shabby answered 28/12, 2010 at 18:48 Comment(0)

If you start here at wikipedia, you'll get a good list engines (http://en.wikipedia.org/wiki/Speech_recognition#Commercial_software.2Fmiddleware)

As I write this (June 24, 2009) it looks to me that are two viable open source solutions

Pocket Sphinx (http://www.speech.cs.cmu.edu/pocketsphinx)
Julius (http://en.wikipedia.org/wiki/Julius_(software))

Both have been used in iphone apps, but the iphone friendly source isn't readily available.

As I edit this (8 July, 2009) I recently learned that Loquendo (http://www.loquendo.com/en/) has voice recognition and speech synthesis (ASR & TTS) for the iphone.

Garnet answered 24/6, 2009 at 20:4 Comment(2)

@Rohrer, Will Apple approve this if we add any external engines for the voice recognition – Petronille 19/8, 2010 at 8:29

@Shibin - I've never heard of such apps being rejected, and I wouldn't expect them to be, either, but your mileage may vary. You can always search around for users of a particular sdk and make sure their apps are actually being published. This would be particularly easy with the commercial sdks. – Garnet 23/8, 2010 at 13:14

The best approach will probably be to:

Record the voice on the phone
Send the recording to a server that runs the speech recognition software
Then return something to the phone to indicate what it should do

Ibert answered 2/6, 2009 at 23:26 Comment(4)

That's a lot of data to send. I might try it on the iPhone itself. After all, PCs could do a fair job of this 10 years ago, so perhaps iPhones should be able to now. – Congratulatory 2/6, 2009 at 23:28

This is actually the technique the Google Search app uses – Undercoating 8/7, 2009 at 20:28

Google encodes the voice in a special way, they don't just send the raw audio data for exactly the reason Nosredna gave. – Emaciation 7/7, 2010 at 11:48

There's nothing stopping step 2 from including compression. – Embryonic 10/2, 2011 at 3:5

The Dragon Mobile SDK from Nuance does what is asked for. You need an internet connection to be able to send the audio to Nuance's server and you get a list of text responses. You can then decide what to do with the text responses (e.g. ask your user to choose the one he meant or perform some action). Here is the link:

http://dragonmobile.nuancemobiledeveloper.com/

Osteoclasis answered 8/1, 2013 at 11:36 Comment(0)

Recommended topics

Hot tags