GPT cannot even count beans correctly
chatgpt.com
My hammer is also really bad at torque measurements. I'll get right on duct taping a torque wrench to it so it will pass that test ASAP.
That's a terrible analogy because a human can trivially count the beans. If GPT has any hope of being a decent image model, then counting is most definitely one of the tasks it needs to be able to do correctly. It's astounding that the benchmarks and tests have ignored such basic image tasks. Even a five-year-old can count them correctly.