GitHub - guidupuy/screenwright


🎬 Screenwright


Turn Playwright E2E tests into polished product demo videos.

Screenwright analyzes your existing Playwright tests, generates a cinematic "demo scenario" with human pacing and narration, records it with video capture, then composites cursor animation and voiceover into a final MP4.

Example

Generated from cli/scripts/showcase-instaclaw/scenario.js:

animation-instaclaw.mp4

Installation

CLI

npm install -D screenwright
npx screenwright init

screenwright init creates a config file, sets up your TTS provider (Piper for local/offline use, or OpenAI for cloud), and auto-installs the coding assistant skill for any assistants it detects (Claude Code, Codex).

Prerequisites: Node.js >= 20, Playwright browsers (npx playwright install chromium)

Claude Code Skill

screenwright init auto-detects Claude Code and offers to install the skill. You can also install it manually:

mkdir -p ~/.claude/skills/screenwright
npx screenwright skill > ~/.claude/skills/screenwright/SKILL.md

The postinstall hook tries to keep the skill in sync on upgrade, but if lifecycle scripts are disabled (--ignore-scripts), re-run the command above after upgrading.

Then use /screenwright in Claude Code to get started.

Quick Start

With Claude Code (recommended)

The skill walks you through test selection, scenario generation, and video composition.

With the CLI

# 1. Generate a demo scenario from a Playwright test
npx screenwright generate --test ./tests/checkout.spec.ts

# 2. Review and edit the generated scenario at ./demos/checkout-demo.ts

# 3. Compose the final video
npx screenwright compose ./demos/checkout-demo.ts

# 4. Or quickly preview without cursor/voiceover
npx screenwright preview ./demos/checkout-demo.ts

CLI Reference

screenwright init

Bootstrap config, set up TTS provider, and install coding assistant skills.

npx screenwright init [--tts piper|openai] [--piper-voice <model>] [--openai-voice <voice>] [--skip-voice-download] [--skip-skill-install]
| Flag | Default | Description |
| --- | --- | --- |
| --tts | (interactive) | TTS provider: openai (recommended) or piper (local/free, lower quality) |
| --piper-voice | en_US-amy-medium | Piper TTS voice model |
| --openai-voice | nova | OpenAI voice name |
| --skip-voice-download | false | Skip downloading the Piper voice model |
| --skip-skill-install | false | Skip coding assistant skill installation |

screenwright generate

Prepare LLM prompts for demo scenario generation, or validate an existing scenario. With --test, reads your Playwright test and prints a system/user prompt pair — pipe them to any LLM, or use the /screenwright skill in Claude Code which handles this automatically. With --validate, checks that a scenario file uses the sw.* API correctly.

npx screenwright generate --test <path> [--out <path>] [--narration-style brief|detailed] [--app-description <desc>]
npx screenwright generate --validate <path>
| Flag | Default | Description |
| --- | --- | --- |
| --test | (required) | Path to Playwright test file |
| --out | ./demos/<name>-demo.ts | Output path |
| --narration-style | detailed | brief or detailed narration |
| --app-description | - | Brief description of the app for context |
| --validate | - | Validate an existing scenario file |

screenwright compose

Record scenario and compose final MP4 with cursor overlay and voiceover.

npx screenwright compose <scenario> [--out <path>] [--resolution WxH]
| Flag | Default | Description |
| --- | --- | --- |
| --out | ./output/<name>.mp4 | Output path |
| --resolution | 1280x720 | Video resolution |
| --no-voiceover | false | Skip voiceover audio |
| --no-cursor | false | Skip cursor overlay |
| --keep-temp | false | Keep intermediate files |

screenwright preview

Quick preview (WebM) without cursor overlay or voiceover.

npx screenwright preview <scenario> [--out <path>]

Demo Scenario API

Generated scenarios use the sw helper API:

import type { ScreenwrightHelpers } from 'screenwright';

export default async function scenario(sw: ScreenwrightHelpers) {
  await sw.scene('Getting Started', { slide: {} });
  await sw.navigate('http://localhost:3000', {
    narration: "Let's open the app.",
  });
  await sw.click('[data-testid="login"]', {
    narration: 'Click the login button.',
  });
  await sw.fill('[data-testid="email"]', 'sarah@example.com');
  await sw.wait(2000);
}

Available Helpers

| Method | Description |
| --- | --- |
| sw.scene(title) | Mark a scene boundary (no slide) |
| sw.scene(title, { slide?: { duration?, brandColor?, textColor?, fontFamily?, titleFontSize?, narrate? } }) | Scene with an optional transition slide. narrate adds voiceover to the slide (auto-extends its duration to fit). Pass { slide: {} } for the defaults (2000ms duration, config branding) |
| sw.navigate(url, { narration? }) | Navigate to a URL |
| sw.click(selector, { narration? }) | Click an element |
| sw.dblclick(selector, { narration? }) | Double-click an element |
| sw.fill(selector, value, { narration? }) | Type into an input (character by character) |
| sw.hover(selector, { narration? }) | Hover over an element |
| sw.press(key, { narration? }) | Press a keyboard key |
| sw.wait(ms) | Pause for pacing |
| sw.narrate(text) | Speak narration without an action |
| sw.transition({ type, duration }) | Visual transition between any two states (see Transitions below) |
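As a concrete sketch of the helpers above, here is a minimal scenario with a narrated slide. The selectors and copy are hypothetical, and the small interface below is a trimmed stand-in for screenwright's ScreenwrightHelpers so the snippet is self-contained — in a real scenario you would import the type from 'screenwright' and let the CLI supply sw:

```typescript
// Trimmed stand-in for ScreenwrightHelpers (illustration only).
interface Sw {
  scene(title: string, opts?: { slide?: { duration?: number; brandColor?: string; narrate?: string } }): Promise<void>;
  click(selector: string, opts?: { narration?: string }): Promise<void>;
  narrate(text: string): Promise<void>;
  wait(ms: number): Promise<void>;
}

export default async function scenario(sw: Sw) {
  // Branded slide; the narration auto-extends the 2000ms duration if needed.
  await sw.scene('Checkout', {
    slide: { duration: 2000, brandColor: '#4F46E5', narrate: 'Time to check out.' },
  });
  await sw.click('[data-testid="cart"]', { narration: 'Open the cart.' });
  await sw.narrate('Everything looks good, so we confirm.');
  await sw.click('[data-testid="confirm"]');
  await sw.wait(1500); // pacing: let the confirmation animation settle
}
```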

Transitions

Add visual transitions anywhere — between slides, after actions, or any time the screen changes:

await sw.scene('First Scene', { slide: { duration: 1500 } });
await sw.transition({ type: 'cube', duration: 800 });
await sw.scene('Second Scene', { slide: { duration: 1500 } });

Transitions showcase

| Transition | Description |
| --- | --- |
| fade | Cross-dissolve between scenes |
| wipe | Horizontal wipe reveal |
| slide-up | Slide up with a push effect |
| slide-left | Slide left with a push effect |
| zoom | Zoom in/out with fade |
| doorway | Two halves split apart, revealing the next scene expanding from the center |
| swap | 3D horizontal swap with rotation |
| cube | 3D cube rotation between faces |

Transitions work between any two visual states — slides, frame-based captures, or a mix of both.

Transition Timing

When sw.transition() is followed by an action, the transition animation plays first, then the action executes in full view. The "after" frame of the transition is chosen to show the page before the action's visible effect begins:

| Next action after sw.transition() | Transition ends on | Then you see |
| --- | --- | --- |
| sw.click(sel) | Page before the cursor moves | Cursor moves to the element, click fires |
| sw.dblclick(sel) | Page before the cursor moves | Cursor moves to the element, double-click fires |
| sw.fill(sel, value) | Page before the cursor moves | Cursor moves to the input, text is typed character by character |
| sw.hover(sel) | Page before the cursor moves | Cursor moves to the element, hover triggers |
| sw.press(key) | Page before the keypress | Keypress fires |
| sw.navigate(url) | The loaded new page | Page is already visible (navigation happened during the transition) |
| sw.scene(_, { slide }) | The slide overlay | Slide is already visible |

This means the action's visual feedback (typing, hover effects, page changes from a click) always plays out naturally in the recorded frames rather than being hidden by the transition.
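For example, a transition leading straight into a click looks like this in a scenario (hypothetical selector; a minimal local type stands in for ScreenwrightHelpers so the sketch is self-contained):

```typescript
// The cube rotation ends on the page as it looks before the cursor moves;
// the cursor travel and the click's visual feedback then play out in the
// recorded frames instead of being hidden by the transition.
type Helpers = {
  transition(opts: { type: string; duration: number }): Promise<void>;
  click(selector: string, opts?: { narration?: string }): Promise<void>;
};

export default async function scenario(sw: Helpers) {
  await sw.transition({ type: 'cube', duration: 800 });
  await sw.click('[data-testid="submit"]', { narration: 'Submit the form.' });
}
```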

Resolution

Videos are rendered at 2x (Retina) resolution — a 1280×720 config produces a crisp 2560×1440 output.
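The scaling arithmetic can be sketched as follows (an illustration of the 2x rule above, not the actual rendering code):

```typescript
// Logical (config) resolution is doubled for the rendered output.
const RENDER_SCALE = 2;

function outputResolution(configured: { width: number; height: number }) {
  return {
    width: configured.width * RENDER_SCALE,
    height: configured.height * RENDER_SCALE,
  };
}

// The default 1280x720 config renders a 2560x1440 video.
console.log(outputResolution({ width: 1280, height: 720 }));
```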

Frame rate

Screenwright records at 30 fps by default. During recording, each captured screenshot advances a virtual clock by exactly 1000/30 ms, making the frame manifest authoritative for video timing.
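The virtual clock can be sketched like this (an illustration of the rule above, not the actual implementation): a frame's timestamp is derived from its index, never from wall time, which is what makes the frame manifest authoritative.

```typescript
const TARGET_FPS = 30;

// Each captured screenshot advances the virtual clock by exactly one frame
// (1000/30 ms), regardless of how long the screenshot actually took.
function virtualTimestampMs(frameIndex: number): number {
  return (frameIndex * 1000) / TARGET_FPS;
}

// After 90 captured frames the virtual clock reads exactly 3 seconds.
console.log(virtualTimestampMs(90)); // 3000
```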

Low-power machines

If your machine can't sustain 30 fps (i.e., each screenshot takes longer than ~33 ms), the virtual clock falls behind wall time. This can cause narration audio to overlap in the output because the virtual duration of each segment becomes shorter than the actual audio.

Screenwright detects this automatically. If the actual capture rate drops below 85% of the target, you'll see a warning:

⚠ Capture loop averaged 18.2fps (target 30fps). Video timing may be inaccurate.
  Consider setting fps: 18 in your screenwright config, or running on a faster machine.

Follow the suggestion to lower the frame rate to match your machine's capability. A lower frame rate with accurate timing produces better results than a higher frame rate with drift.
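The check can be sketched as follows (the 85% threshold comes from the behavior described above; the function shape is illustrative, not the real API):

```typescript
// Warn when the measured capture rate drops below 85% of the target.
function isCaptureRateOk(measuredFps: number, targetFps: number): boolean {
  return measuredFps >= targetFps * 0.85;
}

// 18.2fps against a 30fps target is below the 25.5fps threshold, so warn.
console.log(isCaptureRateOk(18.2, 30)); // false
console.log(isCaptureRateOk(29.0, 30)); // true
```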

Configuration

screenwright.config.ts (created by screenwright init):

const config = {
  // TTS
  ttsProvider: "openai",             // "openai" (recommended) or "piper" (local/free, lower quality)
  openaiVoice: "nova",               // OpenAI voice (when ttsProvider is "openai")
  piperVoice: "en_US-amy-medium",     // Piper voice model (when ttsProvider is "piper")
  openaiTtsInstructions: "...",      // Tone instructions for OpenAI TTS

  // Video
  resolution: { width: 1280, height: 720 },
  outputDir: "./output",

  // Browser
  locale: "en-US",
  colorScheme: "light",
  timezoneId: "America/New_York",

  // Default slide styling (used when sw.scene() is called with { slide: {} })
  branding: {
    brandColor: "#4F46E5",       // Default slide background color (hex)
    textColor: "#FFFFFF",        // Default slide text color (hex)
    fontFamily: "Inter",         // Default Google Fonts family (optional)
  },
};

export default config;

When using OpenAI TTS, set the OPENAI_API_KEY environment variable.

Troubleshooting

"Playwright browsers not installed"

npx playwright install chromium

"Could not connect to the app" Make sure your dev server is running before composing.

"Voiceover generation failed" Re-run npx screenwright init to download the Piper TTS binary. Or use --no-voiceover to skip.

"Out of memory during rendering" Try a lower resolution: --resolution 1280x720

"Timed out waiting for an element" Check that selectors in the scenario match your app's current DOM. The error message includes the exact sw.* call and selector that failed.

Architecture

cli/
  src/
    commands/       # CLI commands (init, generate, compose, preview)
    runtime/        # Playwright instrumentation (sw.* helpers, timeline collector)
    composition/    # Remotion components (DemoVideo, CursorOverlay, NarrationTrack)
    voiceover/      # Piper TTS engine, OpenAI TTS engine, narration timing
    generator/      # LLM prompt templates for scenario generation
    timeline/       # Timeline JSON types and Zod schema
    config/         # Configuration schema and defaults
skill/
  SKILL.md          # Claude Code skill definition

Releasing

Bump the version in cli/package.json, commit, tag, and push:

# edit cli/package.json version
git add cli/package.json && git commit -m "Release v0.X.Y"
git tag v0.X.Y && git push origin main --tags

GitHub Actions publishes to npm via trusted publishing and creates a GitHub Release automatically.

License

MIT