Settings

Theme

We used ElevenLabs to turn our OSS project docs into music

youtube.com

2 points by haniehz 5 months ago · 3 comments

Reader

haniehzOP 5 months ago

I fed ElevenLabs Music a single prompt about our open-source MCP agent framework and got back a complete song: vocals, instrumentation, arrangement, the works. Zero post-processing.

Here's what caught me off guard: the vocal phrasing. Not just the melody, but the micro-timing, breath placement, and emotional inflection. The model placed emphasis on "composable" in a way that actually reinforced the technical meaning. It added vocal runs that felt intentional, not algorithmic.

Technical details that worked:

Prompt structure: [Genre] [Mood] [Key technical terms] [Narrative structure] Generated: 2:04 track with verse/chorus/bridge structure Quality: Comparable to demo-level indie recordings

What this means: Voice synthesis was the laggard in generative AI. That's changing rapidly. We're moving from "impressive for AI" to "actually usable in production workflows." Non-English limitations: I tested it with different languages and hit a wall — very patchy results, nowhere near the English quality. Anyone have experience with non-English lyrics? Curious about phoneme handling across languages.

The gap between human and AI musical performance is shrinking faster than I expected. Worth paying attention to.

jott44 5 months ago

I wonder how long it'll be until we start seeing ads that are 100% AI generated (script, video, audio) without realizing it

  • haniehzOP 5 months ago

    pretty soon! I think it's already happening. Just a matter of time for people to adapt.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection