VideoGameBench from Princeton: Can vision-language models play 90s video games?
vgbench.comWow so without scaffolding the LLMs can't solve any of these games... Super cool work!
Wow so without scaffolding the LLMs can't solve any of these games... Super cool work!