How can we determine whether an answer used ChatGPT?

meta.stackoverflow.com

12 points by hongsy 4 years ago · 5 comments

Reader

pizza 4 years ago

In the end, the best way to check if some quote was generated by the model would be to measure p(quote | model). And maybe one proxy for that would be by sampling from p(response | model, query="was <quote> generated by chat gpt?")

I had a sense that a fun way to check would be to ask ChatGPT itself. I used the 2 generated pieces of text from https://meta.stackoverflow.com/questions/421831/temporary-po... for the following conversation...

me:

    By analyzing its stylistic characteristics, what is the probability that the following quoted piece of text was written by a human vs generated by ChatGPT?

    (insert stackoverflow chatgpt response #1)

chatgpt:

    It is likely that this text was written by a human. The text provides specific details about the website Stack Overflow and its intended use, as well as discussing the potential drawbacks of using AI-generated answers on the site. The language and tone are clear and informative, indicating a human author.

me:

    It was actually written by ChatGPT

chatgpt:

    If the text was actually written by ChatGPT, it is likely that it would exhibit some stylistic characteristics that are common in AI-generated text, such as repetitive or vague language, lack of coherence or logic in the arguments presented, and potentially incorrect or irrelevant information. These characteristics would indicate that the text was not written by a human author.

me:

    ChatGPT seems to write text with a certain stilted fluency.

    In other words, "<x> is ... <y> is intended to ... Because < z> ... In addition, <a> ... For these reasons < b> ..."

    This should be immediately recognizable. Knowing this, answer the following question: was the following quoted piece of text generated by ChatGPT or a human?

    (insert stackoverflow chatgpt response #2)

chatgpt:

    It is likely that this text was written by a human. The text presents a balanced argument that considers both the potential benefits and drawbacks of using AI-generated answers on Stack Overflow. The language is clear and logical, indicating a human author. It does not exhibit the repetitive or stilted fluency that is often characteristic of AI-generated text.

Lockal 4 years ago

Relevant discussion in Ask HN: https://news.ycombinator.com/item?id=33857356

Also I disagree that answers "could use to make it more difficult to detect ChatGPT generated content". Mainly because I don't expect OpenAI will retrain network to bypass detectors. The purpose of ChatGPT is to be a good language model, not an "algorithmically undetectable" language model.

pr337h4m 4 years ago

Hilariously enough, bad grammar and typos will soon become a positive indicator that something's worth reading.

throw101010 4 years ago

It's funny to think that the next iterations will most likely learn from threads like these how to look less than their predecessors.

jerpint 4 years ago

Comparing the style of the answer to previous answers from the same user should be a good giveaway

Settings

How can we determine whether an answer used ChatGPT?

Keyboard Shortcuts