Settings

Theme

Ask HN: How are engineers evaluating non-deterministic ML/LLM based deployments?

2 points by zuck_vs_musk 9 months ago · 0 comments · 1 min read


So, we process data as well as documents from various sources, then,

  - convert all of its text (using different OCRs)
  - pass it to LLM models - depending on the customer, it can be a cheaper model, and we do have model fallbacks

How do engineers evaluate such systems?

  1. New models & new libraries are coming all the time
  2. Even a third-party's deployment model will change over time and might improve/regress our systems
Any good approach for writing evaluations for these?

No comments yet.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection