导航菜单
切换主题

评测排行榜

各模型在不同评测基准上的表现排名

aider-polyglot

排名模型厂商得分
1GPT-5.5OpenAI90.00
2GPT-5.4 ProOpenAI89.50
3Claude Opus 4.7Anthropic88.50
4O4 MiniOpenAI88.50
5GPT-5OpenAI88.00
6GPT-5.5 ProOpenAI88.00
7Claude Sonnet 4.6Anthropic86.00
8DeepSeek V3.2DeepSeek85.00
9DeepSeek V4 ProDeepSeek85.00
10O1OpenAI84.20

aime

排名模型厂商得分
1GPT-5.4 ProOpenAI98.70
2Gemini 3.1 Pro PreviewGoogle98.20
3Kimi K2.6月之暗面96.40
4GLM-4.7智谱AI95.70
5Claude Opus 4.6Anthropic95.60
6GLM-5智谱AI95.40
7GLM-5.1智谱AI95.30
8Qwen3.6 Plus阿里巴巴95.10
9DeepSeek V3.2DeepSeek95.10
10Kimi K2.5月之暗面94.50

aime-2025

排名模型厂商得分
1MiniMax M2.5MiniMax86.30

amc

排名模型厂商得分
1O4 MiniOpenAI88.00
2GPT-5.5OpenAI85.00
3Claude Opus 4.7Anthropic82.00

arc-agi-2

排名模型厂商得分
1GPT-5.5OpenAI40.00
2Claude Opus 4.7Anthropic35.00
3GPT-5.4OpenAI32.00
4Gemini 2.5 ProGoogle30.00
5Gemini 3.1 Pro PreviewGoogle28.00
6O1OpenAI25.00
7DeepSeek V4 ProDeepSeek22.00