News
Checkmate: OpenAI's o3 swept Musk's Grok 4 in an AI chess showdown.
Having destroyed the previous record time for completing Pokémon Red, GPT-5 is now going to take on the sequel, Pokémon ...
GPT-5 Pro, GPT-5 Thinking, and o3 to analyze my code repository - and found surprising differences in detail, reasoning, and ...
ChatGPT-5 completed Pokémon Red in 6,470 steps, compared to 18,184 steps for GPT-o3 - but the best humans are still way ...
OpenAI promised a one-model-fits-all future for ChatGPT. Instead, it backtracked—and the model choices are more confusing ...
One of the few things many disliked about ChatGPT was the confusing number of models. OpenAI claimed GPT-5 would fix this, ...
Following this post’s publication, OpenAI co-founder and CEO Sam Altman announced the company would restore access to GPT-4o and other old m ...
Openai is making O3 level capabilities open source ahead of the release of GPT-5 in about two days. These open source models ...
OpenAI saved its biggest announcement for the last day of its 12-day "shipmas" event. On Friday, the company unveiled o3, the successor to the o1 "reasoning" model it released earlier in the year ...
With high effort, o3-mini achieves a performance comparable to o1. If you use o3-mini as an AI developer via an API, you get off cheaply: at 1.10 US dollars per million tokens, the model is ...
Chess is a window into how well an AI can plan, evaluate options, avoid catastrophic mistakes, and stay logically consistent.
Neither o3 nor o3-mini are widely available yet, but safety researchers can sign up for a preview for o3-mini starting today. An o3 preview will arrive sometime after; OpenAI didn’t specify when.
Results that may be inaccessible to you are currently showing.
Hide inaccessible results