Settings

Theme

Ask HN: Most accurate ML speech-to-text API?

2 points by lumens 5 years ago · 2 comments · 1 min read


I'm building a project that relies on at least pretty-good transcription with timestamps for each word and ideally speaker diarization.

Right now I'm using Google Cloud's Speech-to-Text, but the accuracy is underwhelming when transcribing a Zoom call (50%ish).

Am I likely to fare much better with Azure/AWS? What about Symbl.ai?

taf2 5 years ago

Which model are you using on the zoom calls? Also are you used enhanced or just default? There a lot of factors with any engine.

mdrabla 5 years ago

While sometimes more expensive, I've found GCP the best option (from an accuracy standpoint) for STT diarization

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection