Idea: Using logit bias to adversarially suppress GPT-4's preferred answers for directed exploration of its hallucinations. Here, I ask: "Who are you?" but I suppress "AI language model", "OpenAI", etc. This reliably elicits narratives about being made by Google: https://t.co/2Pw5HzPvFZ

1 min read Original article ↗

Idea: Using logit bias to adversarially suppress GPT-4's preferred answers for directed exploration of its hallucinations. Here, I ask: "Who are you?" but I suppress "AI language model", "OpenAI", etc. This reliably elicits narratives about being made by Google:

7:50 AM · Jun 16, 2023241.8KViews