PocketSphinx+PG+JavaScript Voice/Text Experiment

vmorgulys.github.io

3 points by vmorgulis 9 years ago · 3 comments

vmorgulisOP 9 years ago

Original video: https://www.youtube.com/watch?v=0KR2MSFROLI

CMUSphinx: http://cmusphinx.sourceforge.net/

  • detaro 9 years ago

    So, what am I looking at? It seems like you fed the audio into PocketSphinx to get time-tagged text, and the site basically shows that text as subtitles for what was said. Is that the gist of it?

    • vmorgulisOP 9 years ago

      > ... is that the gist of it?

      Yes, it is.

      I'd like to improve the speech recognition and was hoping for some advice on that.

      Another possibility is to add a semantic level with NLP or use another library like Kaldi (http://kaldi-asr.org/).

      One other quirk: the WAV file is serialized to JSON as an array.
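
      To illustrate the JSON point, here is a minimal sketch of round-tripping audio through a JSON array. It assumes the array holds 16-bit PCM samples and that the sample rate is stored alongside it; the experiment may store raw file bytes instead. The names samplesToJson and jsonToSamples are hypothetical and do not come from the project.

```javascript
// Hypothetical helpers for illustration only; not taken from the experiment.
// Assumption: the audio is 16-bit signed PCM, one number per sample.

// Serialize samples (any array-like of integers) plus the sample rate to JSON.
function samplesToJson(samples, sampleRate) {
  return JSON.stringify({ sampleRate, samples: Array.from(samples) });
}

// Parse the JSON text back into a typed array suitable for audio processing.
function jsonToSamples(text) {
  const parsed = JSON.parse(text);
  return {
    sampleRate: parsed.sampleRate,
    samples: Int16Array.from(parsed.samples),
  };
}

// Round trip with a few made-up sample values.
const pcm = Int16Array.from([0, 1200, -1200, 32767, -32768]);
const json = samplesToJson(pcm, 16000);
const decoded = jsonToSamples(json);
```

      A plain JSON array is easy to load in the browser without a WAV parser, but each sample costs several characters of text, so the payload is considerably larger than the binary file.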
