ProgramBench: Can Language Models Rebuild Programs from Scratch?

3 points by fittingopposite a month ago · 1 comment

Reader

Kuinox a month ago

I didn't managed to find the tests. How can we know that the tests are actually reasonable in this case ?

Settings