Why do LLM outputs get worse even when metrics stay stable? [pdf] huggingface.co 4 points by scaledsystems 18 days ago · 1 comment Reader PiP Save No comments yet.