cmusphinx Questions

1

Is there some command line utility of pocket sphinx or cmu sphinx to convert a .wav file to text? pocketsphinx_continuous -hmm -lm -dict will do. But I don't want to keep speaking the same sentenc...
Schmuck asked 30/11, 2013 at 17:7

2

Currently we are investigating into using sphinx4 for speech recognition. We are trying to achieve a good response for a dictation type application. The input is a wav file and we wish to transcrib...
Aluminate asked 14/9, 2011 at 5:16

2

I am studying the source code of TestPocketSphinxAndAndroidASR.java and the first thing that is not so clear to me is how the system knows which recognizer (i.e. Google or CMUSphinx) to invoke. I ...
Erinn asked 4/9, 2013 at 1:42

1

Solved

Overview: So this is a follow up to my last problem (here). I will be posting a full answer on that very soon. I'm able to get pocketsphinx to recognize audio input from my PS3 Eye in Python via Gs...

1

Solved

So python is acting like acting like it can't hear ANYTHING from my microphone at all. Here's the problem. I have a Python ( 2.7 ) script that is suppose to be using Gstreamer to access my microph...
Singles asked 6/8, 2013 at 18:25

1

I need to train CMU Sphinx offline in my android app. I downloaded this project that use this parameter but when I start it I have this error: Fatal signal 11 (SIGSEGV) at 0x0000001c (code=1). This...
Milena asked 2/2, 2013 at 20:18

1

Solved

I am using CMU sphinx library, but after successfully running one iteration, this error appears "Can't open microphone line with format PCM_SIGNED 16000.0 Hz, 16 bit, mono, 2 bytes/frame, big-endia...
Spoonbill asked 1/4, 2013 at 19:44

1

Solved

I'm looking for a way to match a known data set, let's say a list of MP3s or wav files, each which is a sample of someone speaking. At this point I know file ABC is of Person X speaking. I would t...

1

Solved

I have a big batch of files I'd like to run recognition on using CMU Sphinx 4. Sphinx requires the following format: 16 khz 16 bit mono little-endian My files are something like 44100 khz, 32 b...
Subdeacon asked 3/12, 2012 at 22:36

1

Solved

I am running Enthought Python 2.7 as well as default Python 2.7, Xcode 4.5.1 in Mac OS 10.8.2. I am trying to develop a speech to text converter in Python. I use Enthought Python as it allows me to...
Slake asked 8/11, 2012 at 5:37

1

Solved

I need to train my own acoustic model. On the official wiki, this process is described for ngramm models in DMP format, but I need to use a (grammar) language model provided in JSGF format. Is ther...
Strictly asked 29/6, 2012 at 10:28

3

I want to speech recognition for burmese language. I refer following links : http://cmusphinx.sourceforge.net/2011/05/building-pocketsphinx-on-android/ http://cmusphinx.sourceforge.net/sph...
Rondelet asked 10/4, 2012 at 10:24

2

Solved

I am trying to compile PocketSphinxAndroidDemo, which provides an example implementation of the CMU pocketsphinx speech recognizer on Android. I first received an error similar to the discussion he...
Danyluk asked 2/3, 2011 at 15:30

1

I have two .wav files that I need to compare and decide if they contain the same words (same order too). I have been searching for the best method for a while now. I can't figure out how to have ...
Scattering asked 22/2, 2012 at 22:24

1

Solved

My requirements are similar to this question since the question is now 3 years old I am re-posting the question with information specific to mine, I want to create an application which takes a .wav...
Echevarria asked 4/1, 2012 at 13:7

1

I'm trying to figure out how to use sphinx4 or pocketsphinx with the english voxforge model but I can't get it working. I have tried to read doc pages (like this one http://cmusphinx.sourceforge.ne...
Whinstone asked 31/12, 2011 at 0:20

2

Solved

I want to create an ARPA language model file with nearly 50,000 words. I can't generate the language model by passing my text file to the CMU Language Tool. Is any other link available where I can ...
Foucault asked 21/4, 2011 at 11:24

2

Solved

I would like to build a language model for CMU Sphinx, but my corpus has more than 1000 words so I cannot use the online tool. How do I use (the scripts in cmuclmtk?) to build my language model?
Volteface asked 24/1, 2011 at 14:49

1

Solved

Can anyone recommend reliable open source software for transcribing English speech in wav files? The two main programs I've researched are Sphinx and Julius, but I've never been able to get e...
Saboteur asked 30/9, 2011 at 16:6

1

Solved

As part of my application I'm looking to add speech recognition, but not really in the traditional sense. I have a bunch of lyrics (divided into verses) that are sung by someone, and the idea is to...
Josie asked 23/8, 2011 at 13:20

1

Solved

Can OpenEars do Chinese speech recognition? See here: http://www.politepix.com/openears
Forsythia asked 1/8, 2011 at 6:13

© 2022 - 2024 — McMap. All rights reserved.