Settings

Theme

Show HN: Voice to Text– Free browser-based speech-to-text with local projects

voicetotextonline.com

1 points by digi_wares 5 months ago · 0 comments · 3 min read

Reader

Hi HN,

I built a voice-to-text tool that runs entirely in your browser. No account required for the free tier, no data sent to my servers.

Try it: https://voicetotextonline.com

Why I built this:

- Existing tools require signups, have minute limits, or cost money - Google Docs voice typing requires a Google account - Dragon costs $150-500 - Otter.ai has free tier limits

(A) Free Features (no account required):

1/ Core Transcription:

- Real-time voice-to-text using Web Speech API - 55+ languages supported - Auto-punctuation & sentence case options - Works offline after first load (PWA)

2/ AI Enhance (added based on user survey – 80% voted yes):

- Auto-fix grammar, punctuation & formatting - One-click cleanup of transcripts

3/ My Projects (local storage):

- Save transcripts to browser localStorage - Organize with folders (Notes, Work, Personal, etc.) - Custom folders & tags - Search across all transcripts - Edit, copy, download as TXT - 100% private – never leaves your device

- Export:

- Copy to clipboard - Download as TXT or DOCX

(B) Pro Features ($10/month or $1/hour pay-per-use):

1/ File Upload & Transcription:

- Upload audio/video files (MP3, WAV, M4A, MP4, MOV, AVI, MKV) - Up to 500MB per file - Batch upload (10 files at once) - Powered by AssemblyAI (95%+ accuracy) - 150 hours/month transcription

2/ Advanced Features:

- Real-time progress with ETA - Speaker labels - In-browser audio recording (5 min with pause/resume) - Translation to 25+ languages (GPT-4o)

3/ Export Formats:

- TXT, SRT, VTT, JSON with timestamps - Segment-level timestamp precision

4/ Cloud Storage:

- Transcription history in the cloud - 10 GB storage, 1,000 files/month

(C) Data & Privacy:

Free tier:

- All transcripts stored in browser localStorage only - Never touches our servers - 100% private

Pro tier:

- Audio files stored in Supabase (encrypted) - Files retained for 30 days for re-download, then auto-deleted - Transcripts stored permanently in your account - You can delete any transcript or your entire account anytime - We don't use your data for training

Tech stack:

- Next.js 14 (App Router) - Web Speech API (free real-time transcription) - AssemblyAI (Pro file transcription, 95%+ accuracy) - OpenAI GPT-4o (AI Enhance & translation) - Supabase (auth & storage) - Stripe (payments) - Tailwind CSS - Hosted on Vercel

Limitations:

- Real-time transcription doesn't work in Firefox (Web Speech API not supported) - Free tier accuracy depends on Chrome's speech engine

Would love feedback on UX, pricing, or feature ideas. Considering open-sourcing the core transcription component.

No comments yet.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection