A recent controversy in AI benchmarking emerged over claims that Google’s Gemini model outperformed Anthropic’s Claude model in Pokémon gameplay. A viral post on X…