Kaggle Launches LLM Evals

kaggle.com

9 points by antgoldbloom 10 months ago · 5 comments

antgoldbloom (OP) 10 months ago

Here’s the announcement: https://www.kaggle.com/blog/announcing-kaggle-benchmarks

I was the founder and CEO of Kaggle, and I’ve been out of Kaggle for 2.5 years. Super excited to see this announcement. It could solve the biggest problem in the LLM ecosystem.

art82135 10 months ago

Curious how this compares to Chatbot Arena?

  • meganrisdal 10 months ago

    We love what Chatbot Arena is doing to innovate on evaluation paradigms. The challenge of evaluating GenAI warrants diverse approaches. What we're excited to do is: 1) give anyone access to infra to make evaluation more accessible to more developers and researchers; 2) drive more novel, diverse evals. https://arxiv.org/abs/2505.00612v2

benhamner 10 months ago

Can we add our own models or benchmarks?
