Settings

Theme

Show HN: Omi – watches your screen, hears conversations, tells you what to do

github.com

19 points by kodjima33 2 months ago · 28 comments · 2 min read

Reader

Spent 4 months and built Omi for Desktop, your life architect: It sees your screen, hears your conversations and will advise you on what to do next

Basically Cluely + Rewind + Granola + Wisprflow + ChatGPT + Claude in one app

I talk to claude/chatgpt 24/7 but I find it frustrating that i have to capture/send screenshots of my screen and that it doesn't help proactively during my work

Whenever omi sees something wrong about my workflow, it will send me a proactive notification with advice. It will also point to something I'm missing.

The hardest part was to nail proactivity - after trying 20+ similar tools I didn't find a single one with smart proactive notifications based on content on your screen. I made it look at your screen every second with 4 main prompts:

1. Is the user productive or distracted?

2. Is there anything useful to say right now?

3. is there any task to add to do later?

4. is there anything important to remember about the user?

Full stack: - Swift - Rust backend - Deepgram transcription - Claude code for messaging - GPT 5.4 summaries - Gemini for embeddings and translation

Open source, stores screenshots locally, uses Claude Code for chat. Has cloud to sync with hardware or mobile app but can be disabled in settings

rl3 2 months ago

>1. Is the user productive or distracted?

Pomodoro and todo list apps are so yesterday. Now I can have my graphics card observe me as an ever-vigilant guardian of productivity.

That might sound sarcastic, but moving context between prompts and just keeping the gears turning often isn't really that cognitively engaging these days. Thus, attention suffers.

So, that's actually pretty useful.

sudo humanctl status

hahooh 2 months ago

It’s funny how some people try so hard to protect their personal data, while others just give it all away.

nprateem 2 months ago

You could pitch it as your "digital nagging housewife", or a "micromanager in a box". How about "your time wasting interrupt-otron" or just "flow-breaker"?

Seriously why would you think AI could read my mind and tell me what to do next without knowing my goals?

This sounds like the irritating tangential follow-on questions they ask on steroids. Generally irrelevant and take the conversation in a direction you don't want to go.

naomi_lgbt 2 months ago

Woah, this is super cool! I'd love to hear more~

What motivated you to build this? What did you learn? What is your favourite part?

smartypant 2 months ago

this sounds cool but on the website I saw the previous version where its more like a passive device to listen, transcribe and save. how does it record the screen and doens't capturing the screen and converting that into text takes a lot of time? That will make it super slow. isnt it?

jusasiiv 2 months ago

What kind of token usage you have with this setup? Also why both ChatGPT and Claude?

smartypant 2 months ago

this sounds cool but on the website I saw the previous version where its more like a passive device to listen, transcribe and save.

_aavaa_ 2 months ago

Something, something, torment nexus.

bakaev 2 months ago

imagine getting micro managed by this omi lol

  • Biologist123 2 months ago

    I guess you have to turn it on. Might be handy for things like electrician courses, DIY jobs, maths homework etc. Maybe one day even surgery!

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection