Settings

Theme

Elevenlabs Conversational AI 2.0

elevenlabs.io

32 points by killion 7 months ago · 12 comments

Reader

miles 7 months ago

Given their excellent record, it's strange that ElevenLabs would make a basic transcription mistake in the embedded introduction video[1].

At around 30 seconds in[2], an AI agent types and says, "Hi! Welcome to our design studio." to which a customer responds「すみません、日本語で話してもいいですか?」(Sorry, may I speak Japanese?).

The AI agent then says 「もちろんです。日本語に切り替えましょう。」(Of course. Let's switch to Japanese.) but types「ご心配なく。数量は柔軟に対応できます。」which means "No worries. We can be flexible with the quantity."

[1] https://www.youtube.com/watch?v=TlclS4wLWgY

[2] https://imgur.com/18U9uUU

  • giaccoangelo 7 months ago

    Yep that one's on me, we originally were doing an interpreter agent demo here, and then switched to the one in the video, but somehow this text from the interpreter agent was kept in the published video. something for us to learn from

  • rolph 7 months ago

    that could have serious consequences during a contract negotiation.

gorgoiler 7 months ago

The play button widget will read the article aloud. Alas, it called the product “conversational AI two dot zero comma” and the company “vertical bar elevenlabs”.

Their TTS is good but unfortunately the exemplar widget is not nearly up to the same quality.

giaccoangelo 7 months ago

a lot of awesome features in this ship, comment your feedback for me:)

  • mdrzn 7 months ago

    Any demo so we can try it?

    You should have learned something from the Sesame AI demo and the hype it generated for them.

  • flyingspaceship 7 months ago

    for elevenlabs in general, pricing too high. if we put the price to punch ratio of kokoro at 100, elevenlabs is probably a 35

  • piterrro 7 months ago

    who are you?

    • frankvdwaal 7 months ago

      He's a "Growth Engineer" from ElevenLabs. I'm not sure what that entails, but then I'm not familiar with that area of tech, so maybe someone else can explain it.

wild_egg 7 months ago

What does SOTA tool use with voice agents look like these days? Any providers have MCP support?

It'd be incredible to get to a point where we can have natural conversations and the AI is running tools in the background and keeping tabs on things.

ldenoue 7 months ago

How is the turn detection working? LLM prompting or a special AI audio plus text model?

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection