Gemini Pro vs. GPT3.5

15 points by ssabev 3 years ago · 6 comments

Reader

jafitc 3 years ago

This "vibe" check that it's even better than GPT-4 Turbo is not what its Elo rating shows on the Chatbot Arena based on not 1 but thousands of user votes. GPT-4 (Turbo) is in a league of its own still.

npinsker 3 years ago

By its nature, that site isn't very representative of how the models perform in real-world use.
- Reubend 3 years ago
  
  That depends on what real world use you're targeting, but unfortunately I'm not aware of anything better than that leaderboard in terms of sample size and model coverage.
- ssabevOP 3 years ago
  
  The ELO leaderboard you mean?
Racing0461 3 years ago

The vibe check is for pro tho. I want to see how ultra is benchmarked.

ssabevOP 3 years ago

Spoiler: it's fast, cheap, overly protective, and has Kafkaesque DX

Settings

Gemini Pro vs. GPT3.5

Keyboard Shortcuts