This will be a quick post. I've run the recent OpenAI models through the LLM Chess eval:

  • o4-mini and o3 demonstrate solid chess performance and instruction following
  • GPT-4.1 didn't qualify due to multiple model errors
  • GPT-4.1 Mini is a solid increment over GPT-4o Mini; GPT-4.1 Nano didn't impress

Below is a matrix view of the models' performance, with the Y-axis showing chess proficiency and the X-axis showing instruction following:

[Image: LLM Chess matrix view]

P.S. The "Notes" section of the leaderboard website dives deeper into each model's performance.