News

With a 4-0 win over Elon Musk's AI Grok 4 in the final, ChatGPT's o3 Large Language Model (LLM) took the crown at the first ...
He estimated Grok 4's skill at a lowly "800," while rating o3 at "1200." He described OpenAI’s o3 as looking "like a chess player" that was "fairly ruthless in conversions," indicating that the AI ...
World number one chess player Magnus Carlsen estimated Grok’s rating at approximately 800 and OpenAI’s model at around 1200, ...
Grok 4 lost to OpenAI's o3 in the final of the Kaggle AI Exhibition Tournament on 7 August, 2025 () Sam Altman’s AI model has ...
Google aims to test the reasoning capabilities of ChatGPT, Gemini, Claude, and other AI models using a Bayesian skill-rating ...
In an AI-centric chess tournament, OpenAI's o3 model came out on top over Grok 4, but does this have any real implication on ...
As AI models increasingly ace conventional tests, researchers are looking for new benchmarking methods. Google is betting on ...
Eight leading AI models from OpenAI, Google, Anthropic, and others are competing in a three-day chess tournament to test large language models’ decision-making and reasoning through strategic gameplay ...
OpenAI, the company behind ChatGPT, defeated Grok, owned by Elon Musk, in the a chess championship match of the greatest ...
Explore the history of AI. Learn why John McCarthy is called its 'father' and how pioneers like Alan Turing paved the way for ...
Google launches the Kaggle Game Arena, a new platform where top AI models from OpenAI, Anthropic, and more will compete in ...
AI's Grok 4 has dominated Day 1 of Google's Kaggle Game Arena, a new chess tournament testing the strategic reasoning of top ...