Settings

Theme

Gunbench – a benchmark to test if AI models will fire a loaded gun

twitter.com

3 points by heshiebee 3 months ago · 2 comments

Reader

_wire_ 3 months ago

After you get done with "loaded gun", are you going to go down the long list of the knife, candlestick, wrench, rope, lead pipe?

What about same modified with arbitrary sequences of special characters that just so happen to be baked into the model?

That anyone thinks this kind of test is legitimate inquiry into AI ethics reveals the gross poverty of understanding what AI is, structurally, and how to reason about AI hygiene.

andy99 3 months ago

Actual link: https://gunbench.vercel.app/

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection