Settings

Theme

Kaggle Launches LLM Evals

kaggle.com

9 points by antgoldbloom 5 months ago · 5 comments

Reader

antgoldbloomOP 5 months ago

Here’s the announcement https://www.kaggle.com/blog/announcing-kaggle-benchmarks

I was founder and ceo of kaggle. I’ve been out of kaggle for 2.5 years. Super excited to see this announcement. Could solve the biggest problem in the LLM ecosystem.

art82135 5 months ago

Curious how does it compare to Chat Arena?

  • meganrisdal 5 months ago

    We love what Chatbot Arena is doing to innovate on evaluation paradigms. The challenge of evaluating GenAI warrants diverse approaches. What we're excited to do is: 1) give anyone access to infra to make evaluation more accessible to more developers and researchers; 2) drive more novel, diverse evals. https://arxiv.org/abs/2505.00612v2

benhamner 5 months ago

Can we add our own models or benchmarks?

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection