Do reasoning models really “think” or not? Apple research sparks lively debate, response

Ultimately, the big takeaway for ML researchers is to make sure the test itself isn’t flawed before proclaiming an AI milestone, or an obituary