Elevenlabs Conversational AI 2.0

elevenlabs.io

32 points by killion 10 months ago · 12 comments

miles 10 months ago

Given their excellent record, it's strange that ElevenLabs would make a basic transcription mistake in the embedded introduction video[1].

At around 30 seconds in[2], an AI agent types and says, "Hi! Welcome to our design studio." to which a customer responds「すみません、日本語で話してもいいですか?」(Sorry, may I speak Japanese?).

The AI agent then says 「もちろんです。日本語に切り替えましょう。」(Of course. Let's switch to Japanese.) but types「ご心配なく。数量は柔軟に対応できます。」which means "No worries. We can be flexible with the quantity."

[1] https://www.youtube.com/watch?v=TlclS4wLWgY

[2] https://imgur.com/18U9uUU

  • giaccoangelo 10 months ago

    Yep, that one's on me. We were originally doing an interpreter agent demo here and then switched to the one in the video, but somehow this text from the interpreter agent was kept in the published video. Something for us to learn from.

  • rolph 10 months ago

    That could have serious consequences during a contract negotiation.

gorgoiler 10 months ago

The play button widget will read the article aloud. Alas, it called the product “conversational AI two dot zero comma” and the company “vertical bar elevenlabs”.

Their TTS is good but unfortunately the exemplar widget is not nearly up to the same quality.

giaccoangelo 10 months ago

A lot of awesome features in this ship. Comment your feedback for me :)

  • mdrzn 10 months ago

    Any demo so we can try it?

    You should have learned something from the Sesame AI demo and the hype it generated for them.

  • flyingspaceship 10 months ago

    For ElevenLabs in general, the pricing is too high. If we put the price-to-punch ratio of Kokoro at 100, ElevenLabs is probably a 35.

  • piterrro 10 months ago

    Who are you?

    • frankvdwaal 10 months ago

      He's a "Growth Engineer" from ElevenLabs. I'm not sure what that entails, but then I'm not familiar with that area of tech, so maybe someone else can explain it.

wild_egg 10 months ago

What does SOTA tool use with voice agents look like these days? Any providers have MCP support?

It'd be incredible to get to a point where we can have natural conversations and the AI is running tools in the background and keeping tabs on things.

ldenoue 10 months ago

How does the turn detection work? Is it LLM prompting or a special AI audio-plus-text model?
