YouBrokeProd - Practice Production Incidents

3 min read Original article ↗

It's Cyber Monday. Payments just stopped. You're on call.

Live 3D architecture, real-time logs, real commands, and a ticking clock. Can you fix it before the money runs out?

Play the Situation Room

The Situation Room - interactive 3D war room with live architecture, real commands, and a ticking clock

What the DevOps community said when this happened in real life

Not a Tutorial. Not a Quiz.
A Real-Time Incident Simulation.

10+ scenarios based on incidents that actually took down production. New ones added every 2 weeks.

Real Terminal

kubectl, logs, db queries - actual commands

Live Metrics

Streaming logs, error spikes, gauges

Ticking Clock

Revenue drops, PagerDuty fires, pressure builds

Full Debrief

Root cause, optimal path, what you missed

Play More. Unlock More.

Every scenario you complete earns XP. Hit milestones to unlock pro scenarios for free.

0XP

Free

Start Free

The Mysterious Timeout

The Expired Certificate

The AI That Ate Production

300 XP

Unlock 1 Scenario

Choose an intermediate scenario

1000 XP

Unlock 3 Scenarios

Choose an advanced scenario

Or skip the grind - Pro unlocks all scenarios instantly.

What You Get - Free

Sign up with GitHub or Google - takes 10 seconds

  • The viral Terraform scenario everyone is talking about (685K+ views)
  • 10+ incidents across databases, Kubernetes, cloud, and security (growing)
  • Leaderboard ranking against other engineers
  • Score breakdown and solution walkthroughs

Your first production incident shouldn't be your worst one.

Most engineers and technical founders get paged cold with zero prior experience handling a real incident. Reading runbooks doesn't build on-call instincts. YouBrokeProd drops you into realistic incident simulations so when the real page comes in at 3 AM, you've already been there.

On-Call Skills That Actually Stick

10+ scenarios across beginner, intermediate, and advanced. New ones every 2 weeks.

How It Works

Each scenario is a real-time simulation running in your browser. No setup. Just you, a terminal, and a production incident to solve.

1

Get Paged

Pick a scenario and difficulty. You get a briefing with symptoms, a simulated terminal, and a ticking clock.

2

Investigate

Run real commands in the terminal - check logs, query metrics, inspect configs. Built-in hints if you get stuck.

3

Diagnose & Fix

Submit your root cause diagnosis, then apply the fix command. Scored on speed, accuracy, and efficiency.

4

Debrief

See what you got right, what you missed, and the optimal diagnostic path. Compare your score on the leaderboard.

On-call training for your whole team?

Run the same incident simulation across your SRE, platform, or founding engineering team. Compare scores, identify skill gaps, reduce MTTR, and build shared muscle memory for when the real pages come in. Manager reports and team leaderboards included.

See Team Plans

The Next Incident Won't Wait.
Will You Be Ready?

Sign up free and start your first incident simulation in under a minute.

Start Your First Simulation