Settings

Theme

Show HN: Push-to-talk dictation for Android apps and terminal workflows

2 points by pol_avec a month ago · 0 comments · 1 min read


I built this because MacWhisper is not available on Android and voice typing on Android is pretty bad. Moreover Gemini does not allow you to edit transcripts before they are auto-sent.

I like my SwiftKey keyboard though, so I did not want to replace that. So the only way was to make a floating push-to-talk button on top of any app.

You tap the overlay, speak, tap again, transcribe, and insert text into the currently focused field.

It supports local on-device transcription, cloud transcription with your own OpenAI key, and optional post-processing/cleanup for punctuation, formatting, prompts, commands, etc.

A nice use case for me has been Termux / terminal workflows on Android. You have a "dev mode" where you can just say "command mode" and anything after it will be converted into a proper CLI command.

The app is open source. No backend — in cloud mode requests go directly from the phone to OpenAI using the user's own API key.

Repo: https://github.com/kafkasl/phone-whisper APK: https://github.com/kafkasl/phone-whisper/releases

No comments yet.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection