| Rank | ||
|---|---|---|
| #1 | Gemini 3.5 Flash | 61.3% |
| #2 | Grok 4.1 Fast | 58.7% |
| #3 | Gemini 3.1 Pro Preview | 55.3% |
| #4 | Grok 4.20 | 54.7% |
| #5 | Grok 4.20 Beta | 53.3% |
| #6 | Grok 4.3 | 48.7% |
| #7 | Gemini 3 Flash Preview | 48.7% |
| #8 | Gemini 3.1 Flash Image Preview | 46.0% |
| #9 | Claude Fable 5 | 46.0% |
| #10 | GPT-5.4 | 32.7% |
| #11 | GPT-5.5 | 29.3% |
| #12 | Qwen 3.6 Plus | 28.0% |
| #13 | Claude Opus 4.8 | 24.0% |
| #14 | Qwen 3.6 Plus Preview | 22.0% |
| #15 | Claude Opus 4.7 | 18.7% |
| #16 | Claude Opus 4.6 | 16.7% |
| #17 | GLM 5.2 | 13.3% |
| #18 | GLM-5.1 | 12.7% |
| #19 | GLM-5 | 12.0% |
| #20 | Claude Sonnet 4.6 | 10.7% |
| #21 | Claude Haiku 4.5 | 8.7% |
| #22 | Gemini 2.5 Pro | 5.3% |
source & further reading
chess-bench.com — original article