Settings

Theme

Show HN: Nightwatch, The open-source, read-only AI SRE

github.com

27 points by egorferber a day ago · 10 comments · 2 min read

Reader

nightwatch is a local-first, read-only layer on top of your monitoring. it groups alert storm into incidents, flags noisy checks and has an agent that can investigate for you live systems. You can e.g. jump from the incident into the agent directly.

the reason for this weekend project is that we had a kubernetes upgrade that went wrong, and at some point a rollback wasn't possible anymore, so it had to be fixed live during the night while several problems came together. We run a lot of different systems, on-prem and several Kubernetes clusters, and in a situation like that you spend most of the time just figuring out what is actually broken and where.

So i thought that it would be pretty cool to have eyes in the dark in each system that can talk to your "brain".

so the idea is to put a baby owl into each environment. Each owl runs where the systems live, keeps that environment's credentials local, and only dials outbound to a central brain, so there is no inbound hole into prod. It exposes a set of read-only skills, and the agent uses them to gather evidence and form a root-cause hypothesis, so the on-call engineer starts with a head start instead of from zero.

read-only for now, i don't trust it near prod yet and honestly neither should you.

llocal-first for easy self-hosting and to keep credentials on your side. the clustering and recommendations run fully offline with no llm at all. the agent needs a tool-calling llm, you can point it at a remote one, or self-host one (ollama etc.) if you want to stay fully offline.

for non selfhosters: before every remote llm call, nightwatch strips real secrets (unrestorable) and swaps identifiers like ips, hostnames and paths for reversible placeholders, so the model only sees masked data while real values are restored only in the proposed commands and tool calls

Would love if you try it in your Systems

kapilvt 21 hours ago

Is the name on homage to James Mickey’s classic column article in usenix ?

https://www.usenix.org/system/files/1311_05-08_mickens.pdf

  • egorferberOP 14 hours ago

    Thats a cool find! No Not all i have never Heard of It :)

  • egorferberOP 13 hours ago

    i just have read through it is a funny article thanks for that, he has a quite interesting view on the system programmers

tam159 7 hours ago

LLM may not have enough context about the related services to investigate the errors, should we also include some info e.g projects/ services README.md, CLAUDE.md in the Nightwatch agent?

defied 13 hours ago

Same name as the test automation framework https://nightwatchjs.org/

egorferberOP 21 hours ago

Would be happy to hear some feedback!

pricewatcherco 8 hours ago

why not stormmwatch ?

  • egorferberOP 7 hours ago

    Well i thought owls would be quite funny, as ninox is a owl breed. So i kept with the logic and have learned about a group of owls which is called parliament of owls which is even more cool. so i kept it, owls see good in the night.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection