flatreader

Google Gemini unexpectedly surges to No. 1, over OpenAI, but benchmarks don’t tell the whole story

Google's Gemini-Exp-1114 AI model tops key benchmarks, but experts warn traditional testing methods may no longer accurately measure true AI capabilities or safety, raising concerns about the industry's current evaluation standards.