Enhancement: Support Mozilla Deepspeech #340

Open
ghost opened this issue Mar 26, 2018 · 7 comments

Comments

@ghost

ghost commented Mar 26, 2018

I'm surprised I'm the first person to raise this one. :)

Mozilla have been building datasets and software for privacy-respecting speech recognition, over at DeepSpeech. It's cool that speech_recognition will allow abstraction over so many APIs, so that software can be built to treat the SR backend as a utility. It would be especially cool if this let users of all those programs written for Google Voice or whatever switch over to DeepSpeech, and remove another surveillance device from the world.

DeepSpeech now ship models in their releases channel, and Mycroft are switching their backend to DeepSpeech shortly, so it's getting traction and the kind of support that should rapidly improve it.

Thanks!

@Matthias84

Besides the API call, the lib needs a public API endpoint for different languages.

@bredmor

bredmor commented Feb 4, 2019

> Besides the API call, the lib needs a public API endpoint for different languages.

DeepSpeech is an offline library (like Sphinx), not a web-based service.

@Matthias84

There are API endpoints that offer pretrained models and voice detection "as a service", for example mycroft.ai.

@MikeyBeez

They have a websocket.

@reuben

reuben commented Nov 27, 2019

FWIW DeepSpeech's focus is on client-side, offline recognition. If this package works offline with PocketSphinx there's no reason it shouldn't be able to work with DeepSpeech. Similar constraints apply, namely needing to download models for languages you want to transcribe.
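As a rough illustration of the offline path described above, here is a minimal sketch of a whole-utterance call. It assumes an already-loaded `deepspeech.Model` and a speech_recognition `AudioData` object, and that the model expects 16 kHz, 16-bit mono audio (true of the released English models, but check the model you downloaded). The name `recognize_deepspeech` and the `to_buffer` parameter are hypothetical and are not part of either library.

```python
def recognize_deepspeech(audio_data, model, sample_rate=16000, to_buffer=None):
    """Hypothetical helper: transcribe one utterance offline with DeepSpeech.

    audio_data:  a speech_recognition AudioData instance.
    model:       an already-loaded deepspeech.Model.
    sample_rate: assumed to be 16 kHz; adjust to whatever your model expects.
    to_buffer:   optional converter from raw PCM bytes to a sample buffer.
                 Defaults to a NumPy int16 array, which Model.stt() accepts.
    """
    # Ask speech_recognition to resample and force 16-bit samples.
    raw = audio_data.get_raw_data(convert_rate=sample_rate, convert_width=2)

    if to_buffer is None:
        import numpy as np  # imported lazily; only the default path needs it
        to_buffer = lambda pcm: np.frombuffer(pcm, dtype=np.int16)

    # Model.stt() runs the full recognition and returns the transcript string.
    return model.stt(to_buffer(raw))
```

As with PocketSphinx, the caller has to download a model file for the language first and load it once; the helper only takes care of the audio format conversion and the recognition call.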

@norbusan

Hi all,
here is my shot at supporting deepspeech in the same vein as pocketsphinx.
#454
This can definitely be improved by using DeepSpeech's streaming API, but there is hardly any documentation on how to use it, so I am hesitating.
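For anyone picking up the streaming idea, a minimal sketch of how it could look, assuming DeepSpeech 0.7 or later, where `Model.createStream()` returns a stream object with `feedAudioContent()`, `intermediateDecode()` and `finishStream()`. The function name `transcribe_stream` and the `on_partial` callback are hypothetical, and this is not what #454 does.

```python
def transcribe_stream(model, chunks, on_partial=None):
    """Hypothetical helper: feed audio to DeepSpeech chunk by chunk.

    model:      an already-loaded deepspeech.Model (0.7+ API assumed).
    chunks:     iterable of audio chunks; each is expected to be a NumPy
                int16 array of mono samples at the model's sample rate.
    on_partial: optional callback that receives the interim transcript
                after each chunk has been fed.
    """
    stream = model.createStream()
    for chunk in chunks:
        stream.feedAudioContent(chunk)
        if on_partial is not None:
            # intermediateDecode() decodes what has been fed so far
            # without closing the stream.
            on_partial(stream.intermediateDecode())
    # finishStream() closes the stream and returns the final transcript.
    return stream.finishStream()
```

The chunks could come from a microphone callback or from slicing a recorded buffer; calling `intermediateDecode()` after every chunk is the simplest option here and may be more often than is useful in practice.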

@MikeyBeez

I've given up on DeepSpeech. It isn't accurate enough. I'm using Julius for a wake word and Google for everything else. Thank you all.
