GitHub - awesome-genmedia/skills: Awesome skills for AI image generation, video generation, song generation and more!

562 generative media skills for AI agents, powered by each::labs.

Generate images, videos, audio, 3D models, and more using 431 AI models through a single API. Install as a Claude Code plugin and get instant access to every skill.

Install

# Claude Code plugin
/plugin install awesome-genmedia

# Or add individual skills
npx skills add awesome-genmedia/skills@image-generation
npx skills add awesome-genmedia/skills@flux-2-max
npx skills add awesome-genmedia/skills@logo-design

Setup

Sign up at eachlabs.ai
Get your API key from Settings
Set environment variable:
```
export EACHLABS_API_KEY="your-api-key"
```

Quick Example

curl -X POST https://eachsense-agent.core.eachlabs.run/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "X-API-Key: $EACHLABS_API_KEY" \
  -d '{
    "messages": [{"role": "user", "content": "Generate a professional headshot, studio lighting, neutral background"}],
    "stream": false
  }'

Default Skills (14)

Core generative media capabilities at the root level:

Skill	Description
image-generation	Generate images from text
image-editing	Edit images with natural language
image-upscaling	Enhance image resolution
background-removal	Remove image backgrounds
face-swap	Swap faces between photos
video-generation	Generate videos from text or images
video-editing	Edit videos with AI
song-generation	Generate songs with vocals
music-generation	Generate instrumental music
lyrics-generation	Generate song lyrics
voice-generation	Generate human-like voice audio
text-to-speech	Convert text to speech
speech-to-text	Transcribe audio to text
sound-effects	Generate custom sound effects

Categories (117 skills)

Use-case specific skills organized by domain:

Domain	Skills	Examples
Image	15	Headshots, avatars, QR codes, patterns, tattoos
Video	10	Text/image-to-video, music videos, trailers, loops
Audio	6	TTS, music, sound effects, voiceover, jingles
Design	14	Logos, thumbnails, posters, business cards, packaging
Face & Portrait	7	Face swap, aging, beauty, caricature, makeup
Social Media	6	Instagram, TikTok, Twitter, LinkedIn, Pinterest, YouTube
E-commerce	5	Product photos, mockups, lifestyle, video ads
Marketing	5	Ad creatives, brand kits, landing pages, campaigns
Gaming	6	Game assets, characters, environments, sprites, UI
Fashion	5	Fashion models, outfits, try-on, fabric patterns
Real Estate	5	Virtual staging, interior design, floor plans
Photography	5	Restoration, colorization, stock photos, HDR
3D & AR	4	3D models, textures, image-to-3D, AR filters
NFT & Art	4	NFT collections, pixel art, generative art
Education	4	Diagrams, flashcards, educational videos
Architecture	3	Building visualization, landscape, renders
Food & Beverage	3	Food photography, recipe visuals, menus
Automotive	3	Car configurator, vehicle wraps, auto ads
NSFW	2	Adult image and video generation
Workflows	5	Multi-model pipelines and batch processing

Models (431)

Every AI model available on each::labs has its own skill under models/:

Image Generation

flux-2-max · flux-2-pro · flux-2 · flux-kontext-pro · flux-kontext-max · nano-banana-pro · nano-banana-2-text-to-image · gemini-3-pro-image-preview · imagen-4-fast · imagen4-preview · bytedance-seedream-v4-5-text-to-image · bytedance-seedream-v5-lite-text-to-image · kling-v3-text-to-image · gpt-image-v1-5-text-to-image · xai-grok-imagine-text-to-image · reve-text-to-image · ideogram-v3-turbo · stable-diffusion-3-5-large · and more...

Video Generation

veo-3 · veo3-1-text-to-video · veo3-1-text-to-video-fast · kling-o3-pro-text-to-video · kling-v3-pro-text-to-video · sora-2-text-to-video-pro · pixverse-v5-6-text-to-video · wan-v2-6-text-to-video · runway-gen4-aleph · pika-v2-2-text-to-video · seedance-v1-5-pro-text-to-video · minimax-hailuo-v2-3-pro-text-to-video · and more...

Image Editing

flux-2-edit · flux-2-max-edit · flux-fill-pro · eachlabs-bg-remover-v1 · topaz-upscale-image · kling-face-swap · nano-banana-pro-edit · qwen-ai-image-edit · firered-image-edit-v1-1 · and more...

Audio & Music

elevenlabs-text-to-speech · mureka-generate-song · mureka-generate-instrumental · mureka-generate-lyrics · stable-audio-2-5-text-to-audio · xai-grok-tts-text-to-speech · google-text-to-speech · deepgram-nova-3-speech-to-text · whisper · and more...

Video Editing & Effects

topaz-upscale-video · auto-subtitle · heygen-video-translate · pixverse-lip-sync · merge-videos · ffmpeg-api-merge-audio-video · and more...

Talking Head & Avatar

bytedance-omnihuman-v1-5 · bytedance-dreamactor-v2 · kling-avatar-v2-pro · sync-lipsync-v2-pro · infinitalk-image-to-video · and more...

Browse all 431 models →

API

each::sense (for use-case skills)

OpenAI-compatible endpoint that auto-selects the best model:

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_EACHLABS_API_KEY",
    base_url="https://eachsense-agent.core.eachlabs.run/v1"
)

response = client.chat.completions.create(
    model="eachsense/beta",
    messages=[{"role": "user", "content": "Generate a logo for a coffee brand"}]
)

Prediction API (for specific models)

Direct model access:

curl -X POST https://api.eachlabs.ai/v1/prediction \
  -H "Content-Type: application/json" \
  -H "X-API-Key: $EACHLABS_API_KEY" \
  -d '{
    "model": "flux-2-max",
    "version": "0.0.1",
    "input": {
      "prompt": "A professional headshot, studio lighting",
      "aspect_ratio": "1:1"
    }
  }'