Voice to Text conversion R

Asked 10/5, 2017 at 15:48 Answered 19/7, 2023 at 20:4

Is there any way to convert user speech to text in realtime using R ? Just curious. Also it will be great if anybody could share some examples regarding what they have done in this domain.

Bitstock answered 10/5, 2017 at 15:48 Comment(0)

I'm just working on googleLanguageR that includes speech-to-text via the Google Cloud Speech API

Jermayne answered 18/6, 2017 at 20:37 Comment(5)

this looks like a pre-recorded wave fine needs to be fed into the function. Can speech to text work in real time ? – Bitstock 2/7, 2017 at 6:28

Depends how real time you need it, an audio file can be recorded right before the API call. There is a Shiny app in the works that will work via a push button, recording the voice via JavaScript from your browser. – Jermayne 2/7, 2017 at 6:36

@Jermayne do I need a Google cloud platform account in advance to use this API? – Wealth 27/6, 2018 at 13:17

Yes, it uses your authentication you get from there – Jermayne 28/6, 2018 at 14:56

Any non-commercial alternatives? – Pneumonia 7/7, 2018 at 15:50

As of 2023, it is possible to get speech-to-text transcription (and translation) using the "Whisper" Automatic Speech Recognition model.

The R package audio.whisper wraps the whisper.cpp C++ library, and basically makes it possible to transcribe text from within R. Once the model has been downloaded, the whole process can be conducted offline, without the need to call any external API.

The quality of the transcription is surprisingly good, including for major languages other than English. This is however not meant for "real time" transcriptions, as mentioned in the question, even if it probably can be adapted to work this way using one of the smaller models.

At the time of writing, one issue in particular should be mentioned for anyone who intends to try out audio.whisper:

as mentioned in the Readme, you should really consider installing (or reinstalling) the package using some of the suggested flags, as this dramatically improves performance

Searching on GitHub for "whisper language:R" shows other R packages that rely on Whisper, but they mostly expect you to install whisper separately.

More complete, refined, or better documented R packages may appear, but these suggestions should put you on the right track to find a meaningful solution.

Shaftesbury answered 19/7, 2023 at 20:4 Comment(0)

Recommended topics

Hot tags