Settings

Theme

Gemini Pro vs. GPT3.5

klu.ai

15 points by ssabev 2 years ago · 6 comments

Reader

jafitc 2 years ago

This "vibe" check that it's even better than GPT-4 Turbo is not what its Elo rating shows on the Chatbot Arena based on not 1 but thousands of user votes. GPT-4 (Turbo) is in a league of its own still.

  • npinsker 2 years ago

    By its nature, that site isn't very representative of how the models perform in real-world use.

    • Reubend 2 years ago

      That depends on what real world use you're targeting, but unfortunately I'm not aware of anything better than that leaderboard in terms of sample size and model coverage.

    • ssabevOP 2 years ago

      The ELO leaderboard you mean?

  • Racing0461 2 years ago

    The vibe check is for pro tho. I want to see how ultra is benchmarked.

ssabevOP 2 years ago

Spoiler: it's fast, cheap, overly protective, and has Kafkaesque DX

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection