Show HN: Omi – watches your screen, hears conversations, tells you what to do
github.comSpent 4 months and built Omi for Desktop, your life architect: It sees your screen, hears your conversations and will advise you on what to do next
Basically Cluely + Rewind + Granola + Wisprflow + ChatGPT + Claude in one app
I talk to claude/chatgpt 24/7 but I find it frustrating that i have to capture/send screenshots of my screen and that it doesn't help proactively during my work
Whenever omi sees something wrong about my workflow, it will send me a proactive notification with advice. It will also point to something I'm missing.
The hardest part was to nail proactivity - after trying 20+ similar tools I didn't find a single one with smart proactive notifications based on content on your screen. I made it look at your screen every second with 4 main prompts:
1. Is the user productive or distracted?
2. Is there anything useful to say right now?
3. is there any task to add to do later?
4. is there anything important to remember about the user?
Full stack: - Swift - Rust backend - Deepgram transcription - Claude code for messaging - GPT 5.4 summaries - Gemini for embeddings and translation
Open source, stores screenshots locally, uses Claude Code for chat. Has cloud to sync with hardware or mobile app but can be disabled in settings >1. Is the user productive or distracted? Pomodoro and todo list apps are so yesterday. Now I can have my graphics card observe me as an ever-vigilant guardian of productivity. That might sound sarcastic, but moving context between prompts and just keeping the gears turning often isn't really that cognitively engaging these days. Thus, attention suffers. So, that's actually pretty useful. sudo humanctl status It’s funny how some people try so hard to protect their personal data, while others just give it all away. Oh nooo not my precious daaaata Case in point. You could pitch it as your "digital nagging housewife", or a "micromanager in a box". How about "your time wasting interrupt-otron" or just "flow-breaker"? Seriously why would you think AI could read my mind and tell me what to do next without knowing my goals? This sounds like the irritating tangential follow-on questions they ask on steroids. Generally irrelevant and take the conversation in a direction you don't want to go. Woah, this is super cool! I'd love to hear more~ What motivated you to build this? What did you learn? What is your favourite part? this sounds cool but on the website I saw the previous version where its more like a passive device to listen, transcribe and save. how does it record the screen and doens't capturing the screen and converting that into text takes a lot of time? That will make it super slow. isnt it? What kind of token usage you have with this setup? Also why both ChatGPT and Claude? this sounds cool but on the website I saw the previous version where its more like a passive device to listen, transcribe and save. Something, something, torment nexus. imagine getting micro managed by this omi lol I guess you have to turn it on. Might be handy for things like electrician courses, DIY jobs, maths homework etc. Maybe one day even surgery!