Settings

Theme

Microsoft releases VibeVoice-ASR, an open speech-to-text model

github.com

3 points by putlake 22 days ago · 1 comment

Reader

putlakeOP 22 days ago

VibeVoice-ASR is a unified speech-to-text model designed to handle 60-minute long-form audio in a single pass, generating structured transcriptions containing Who (Speaker), When (Timestamps), and What (Content), with support for Customized Hotwords.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection