I would like to use the Bluemix Conversation sample application
and add speech input and output to it. There are other sample applications for Bluemix TTS and STT available.
What are options to integrate these 3 functions and which of them are recommended for beginners?
There are no immediate plans to provide a 'simple' sample app which demonstrates combining Watson STT (Speech to Text), Conversation, and TTS (Text to Speech). Longer term it is definitely on the radar.
In the immediate term, to get an idea as to how to do this, please take a look at the car-dashboard app code:
- https://github.com/watson-developer-cloud/car-dashboard/blob/master/ui/index.html#L85
- https://github.com/watson-developer-cloud/car-dashboard/tree/master/ui/ibm
- https://github.com/watson-developer-cloud/car-dashboard/tree/master/speech
- https://github.com/watson-developer-cloud/car-dashboard/blob/master/ui/ibm/stream_speech_to_text.js#L34
The car dashboard app uses the IBM Watson Speech JS SDK: https://github.com/watson-developer-cloud/speech-javascript-sdk
Hopefully this helps.
I did something along these lines with the Dialog service demo app and the Speech JS SDK a few months ago:
http://speech-dialog.mybluemix.net/
Full code is on github but almost all of the changes were in this commit.
Note that it was built on an older beta of the SDK. You can get the latest release from github releases or npm (for use with webpack/browserify/etc.) and there are lots of examples.
This is an old question, but IBM Watson is still evolving so this may be a more up-to-date answer.
You have 2 options.
You can simply have your app submit an HTTP REST request (either GET or POST) by following this tutorial
Or you can leverage a language-specific SDK.
If you're using nodejs, then check out this example.
For java, see this example.
Edit
Here's an example git project I created to integrate text-to-speech to the conversation-simple sample app: conversation-simple-with-text-to-speech
Here's the specific commit where the integration was added: commit 3564aeb
© 2022 - 2024 — McMap. All rights reserved.