LLMs' "simulated reasoning" abilities are a "brittle mirage," researchers find
arstechnica.com
Preprint discussed in the article:
fyi, this is about chain-of-thought prompting, not <think>. is chain-of-thought still being used?
I haven't read the paper closely enough to comment on the methods.