Show HN: Relia – Build your own LLM benchmark

relia.dev

3 points by yz-yu 2 years ago · 0 comments · 1 min read

Relia is an end-to-end (E2E) testing framework for LLMs, designed to help you build AI benchmarks tailored to your specific use cases. It identifies the most suitable LLM for your needs and, through continuous testing, ensures that model upgrades do not cause performance regressions. It is built specifically for function calling (or "tool use") scenarios, which are at the core of agent-based AI applications.
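To make the idea concrete, here is a minimal sketch of what a function-calling benchmark with a regression check could look like. This is not Relia's API: every name below (`ToolCall`, `Case`, `pass_rate`, `best_model`, `has_regression`, the stub models and the two tools) is hypothetical, and the "models" are plain Python functions standing in for real LLM clients.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class ToolCall:
    """The tool a model chose to call, with its arguments."""
    name: str
    arguments: dict


@dataclass
class Case:
    """One benchmark case: a prompt and the tool call we expect for it."""
    prompt: str
    expected: ToolCall


# Hypothetical stand-in for an LLM client: maps a prompt to a tool call.
Model = Callable[[str], ToolCall]


def pass_rate(model: Model, cases: List[Case]) -> float:
    """Fraction of cases where the model produced exactly the expected call."""
    if not cases:
        return 0.0
    passed = sum(1 for case in cases if model(case.prompt) == case.expected)
    return passed / len(cases)


def best_model(models: Dict[str, Model], cases: List[Case]) -> str:
    """Name of the model with the highest pass rate on this benchmark."""
    return max(models, key=lambda name: pass_rate(models[name], cases))


def has_regression(old: Model, new: Model, cases: List[Case],
                   tolerance: float = 0.0) -> bool:
    """True if the new model scores worse than the old one beyond tolerance."""
    return pass_rate(new, cases) < pass_rate(old, cases) - tolerance


# Illustrative cases for two made-up tools.
CASES = [
    Case("What's the weather in Paris?",
         ToolCall("get_weather", {"city": "Paris"})),
    Case("Convert 10 USD to EUR",
         ToolCall("convert_currency",
                  {"amount": 10, "from": "USD", "to": "EUR"})),
]


def stub_model_v1(prompt: str) -> ToolCall:
    """Stub that picks the right tool for both cases."""
    if "weather" in prompt:
        return ToolCall("get_weather", {"city": "Paris"})
    return ToolCall("convert_currency",
                    {"amount": 10, "from": "USD", "to": "EUR"})


def stub_model_v2(prompt: str) -> ToolCall:
    """Stub simulating a regressed upgrade: always calls the weather tool."""
    return ToolCall("get_weather", {"city": "Paris"})


if __name__ == "__main__":
    models = {"v1": stub_model_v1, "v2": stub_model_v2}
    print("best:", best_model(models, CASES))
    print("regression v1 -> v2:",
          has_regression(stub_model_v1, stub_model_v2, CASES))
```

Exact matching on tool name and arguments is the simplest possible scoring rule; a real benchmark would likely need looser argument comparison and repeated runs to account for non-deterministic model output.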
