Benchmark profile

Artificial Analysis Intelligence Index

A display-only intelligence index published by Artificial Analysis that aggregates provider-reported and benchmark-derived signals into a single model-level score.

Data verified July 23, 2026

The public Artificial Analysis Intelligence Index snapshot ranks Claude Fable 5 first at 59.9%, ahead of GPT-5.6 Sol (58.9%) and Kimi K3 (57.1%) among 166 tested models. We mirror the table as display-only evidence; it does not affect overall rankings.

Benchmark score on Artificial Analysis Intelligence Index — July 23, 2026

BenchLM mirrors the published score view for Artificial Analysis Intelligence Index. Claude Fable 5 leads the public snapshot at 59.9%, followed by GPT-5.6 Sol (58.9%) and Kimi K3 (57.1%). BenchLM does not use these results to rank models overall.

1 · Claude Fable 5 · Anthropic · Closed · claude-fable-5 · 59.9% · Overall 83.68 · Context 1M+

2 · GPT-5.6 Sol · OpenAI · Closed · gpt-5-6-sol · 58.9% · Overall 81.96 · Context 1M

3 · Kimi K3 · Moonshot AI · Closed · kimi-3 · 57.1% · Overall 80.96 · Context 1.05M

166 models · Knowledge · Current · Display only · Updated July 23, 2026

Benchmark score table (166 models)

Rank · Model · Provider · License · Score
1 · Claude Fable 5 · Anthropic · Closed · 59.9%
2 · GPT-5.6 Sol · OpenAI · Closed · 58.9%
3 · Kimi K3 · Moonshot AI · Closed · 57.1%
4 · Claude Opus 4.8 · Anthropic · Closed · 55.7%
5 · GPT-5.6 Terra · OpenAI · Closed · 55.0%
6 · GPT-5.5 · OpenAI · Closed · 54.8%
7 · Grok 4.5 · xAI · Closed · 53.8%
8 · Claude Opus 4.7 (Adaptive) · Anthropic · Closed · 53.5%
9 · Claude Sonnet 5 · Anthropic · Closed · 53.4%
10 · GPT-5.4 · OpenAI · Closed · 51.4%
11 · GPT-5.6 Luna · OpenAI · Closed · 51.2%
12 · GLM-5.2 · Z.AI · Open weight · 51.1%
13 · Muse Spark 1.1 · Meta · Closed · 50.6%
14 · Gemini 3.5 Flash · Google · Closed · 50.2%
15 · Gemini 3.6 Flash · Google · Closed · 50.1%
16 · Gemini 3.1 Pro · Google · Closed · 46.5%
17 · Qwen3.7 Max · Alibaba · Closed · 46.0%
18 · MiniMax M3 · MiniMax · Open weight · 44.4%
19 · DeepSeek V4 Pro (Max) · DeepSeek · Open weight · 44.3%
20 · GPT-5.3 Codex · OpenAI · Closed · 44.3%
21 · GPT-5.3-Codex-Spark · OpenAI · Closed · 44.3%
22 · Kimi K2.6 · Moonshot AI · Open weight · 44.2%
23 · Claude Opus 4.6 (Adaptive) · Anthropic · Closed · 43.7%
24 · DeepSeek V4 Pro (High) · DeepSeek · Open weight · 43.1%
25 · Muse Spark · Meta · Closed · 43.1%
26 · Claude Opus 4.7 · Anthropic · Closed · 42.7%
27 · MiMo-V2.5-Pro · Xiaomi · Closed · 42.2%
28 · GPT-5.2 · OpenAI · Closed · 42.2%
29 · Kimi K2.7 Code · Moonshot AI · Open weight · 42.0%
30 · Hy3 Preview · Tencent · Open weight · 41.2%
31 · Hy3 · Tencent · Open weight · 41.2%
32 · Claude Opus 4.5 Thinking · Anthropic · Closed · 40.8%
33 · Inkling · Thinking Machines Lab · Open weight · 40.7%
34 · MiMo-V2-Pro · Xiaomi · Closed · 40.3%
35 · DeepSeek V4 Flash (Max) · DeepSeek · Open weight · 40.3%
36 · GLM-5.1 · Z.AI · Open weight · 40.2%
37 · GPT-5.2-Codex · OpenAI · Closed · 40.1%
38 · Qwen 3.6 Max (preview) · Alibaba · Closed · 40.0%
39 · GPT-5.4 mini · OpenAI · Closed · 40.0%
40 · Qwen3.6 Plus · Alibaba · Closed · 39.6%
41 · Gemini 3 Pro · Google · Closed · 39.5%
42 · GLM-5 · Z.AI · Open weight · 39.5%
43 · Qwen3.7 Plus · Alibaba · Closed · 39.0%
44 · GPT-5.4 nano · OpenAI · Closed · 38.2%
45 · MiniMax M2.7 · MiniMax · Open weight · 38.1%
46 · GLM-5-Turbo · Z.AI · Closed · 38.1%
47 · Claude Opus 4.6 · Anthropic · Closed · 37.8%
48 · Nemotron 3 Ultra · NVIDIA · Open weight · 37.8%
49 · Grok 4.3 · xAI · Closed · 37.6%
50 · DeepSeek V4 Flash (High) · DeepSeek · Open weight · 37.5%
51 · Qwen3.6-27B · Alibaba · Open weight · 37.0%
52 · GPT-5.1 · OpenAI · Closed · 36.9%
53 · Gemini 3.5 Flash-Lite · Google · Closed · 36.5%
54 · Claude Sonnet 4.6 · Anthropic · Closed · 35.9%
55 · Kimi K2.5 · Moonshot AI · Open weight · 35.4%
56 · Kimi K2.5 (Reasoning) · Moonshot AI · Closed · 35.4%
57 · MiMo-V2-Omni · Xiaomi · Closed · 35.0%
58 · GPT-5.1-Codex-Max · OpenAI · Closed · 34.7%
59 · GPT-5.1-Codex · OpenAI · Closed · 34.7%
60 · Claude Opus 4.5 · Anthropic · Closed · 34.7%
61 · GPT-5 (high) · OpenAI · Closed · 34.7%
62 · GLM-5V-Turbo · Z.AI · Closed · 34.5%
63 · Qwen3.5-27B · Alibaba · Open weight · 33.8%
64 · GPT-5 (medium) · OpenAI · Closed · 33.7%
65 · Claude 4.1 Opus Thinking · Anthropic · Closed · 33.7%
66 · GLM-4.7 · Z.AI · Open weight · 33.7%
67 · Qwen3.5 397B · Alibaba · Open weight · 33.7%
68 · Qwen3.5 397B (Reasoning) · Alibaba · Open weight · 33.7%
69 · MiniMax M2.5 · MiniMax · Closed · 33.6%
70 · Grok 4 · xAI · Closed · 33.3%
71 · o3-pro · OpenAI · Closed · 32.5%
72 · Qwen3.5-122B-A10B · Alibaba · Open weight · 32.3%
73 · Qwen3.6-35B-A3B · Alibaba · Open weight · 31.6%
74 · Grok 4.1 Fast (Reasoning) · xAI · Closed · 30.6%
75 · o3 · OpenAI · Closed · 30.4%
76 · Step 3.7 Flash · StepFun · Open weight · 30.3%
77 · Mistral Medium 3.5 128B · Mistral · Open weight · 29.9%
78 · Gemma 4 31B · Google · Open weight · 29.4%
79 · Qwen3.5-35B-A3B · Alibaba · Open weight · 29.3%
80 · Claude 4.1 Opus · Anthropic · Closed · 28.2%
81 · Gemini 3 Flash · Google · Closed · 27.4%
82 · Grok 4 Fast (Reasoning) · xAI · Closed · 27.4%
83 · Step 3.5 Flash · StepFun · Open weight · 26.0%
84 · Gemini 2.5 Pro · Google · Closed · 25.8%
85 · Gemma 4 26B A4B · Google · Open weight · 25.7%
86 · Claude 4 Sonnet · Anthropic · Closed · 25.5%
87 · Nemotron 3 Super 120B A12B · NVIDIA · Open weight · 25.4%
88 · GPT-5 mini · OpenAI · Closed · 25.3%
89 · Gemini 3.1 Flash-Lite · Google · Closed · 25.0%
90 · K-Exaone · LG AI Research · Closed · 24.7%
91 · MiMo-V2-Flash · Xiaomi · Open weight · 24.7%
92 · DeepSeek V3.2 · DeepSeek · Open weight · 24.7%
93 · Trinity-Large-Preview · Arcee AI · Open weight · 24.5%
94 · Trinity-Large-Thinking · Arcee AI · Open weight · 24.5%
95 · Qwen3 Max · Alibaba · Closed · 24.0%
96 · GPT-OSS 120B · OpenAI · Open weight · 23.8%
97 · o1 · OpenAI · Closed · 23.4%
98 · GLM-4.6 · Z.AI · Open weight · 23.0%
99 · GLM-4.7-Flash · Z.AI · Open weight · 22.9%
100 · Command A+ · Cohere · Open weight · 22.5%
101 · Gemma 4 12B · Google · Open weight · 22.0%
102 · Grok Code Fast 1 · xAI · Closed · 21.6%
103 · Mercury 2 · Inception · Closed · 21.4%
104 · DeepSeek V3.1 · DeepSeek · Open weight · 21.1%
105 · DeepSeek V3.1 (Reasoning) · DeepSeek · Open weight · 20.7%
106 · DeepSeek-R1 · DeepSeek · Open weight · 20.1%
107 · GPT-5 nano · OpenAI · Closed · 19.9%
108 · Mistral Small 4 · Mistral · Open weight · 19.6%
109 · Mistral Small 4 (Reasoning) · Mistral · Open weight · 19.6%
110 · Kimi K2 · Moonshot AI · Closed · 19.4%
111 · GPT-4.1 · OpenAI · Closed · 19.4%
112 · o3-mini · OpenAI · Closed · 19.0%
113 · o1-pro · OpenAI · Closed · 18.9%
114 · MiniMax M1 80k · MiniMax · Closed · 17.7%
115 · o1-preview · OpenAI · Closed · 17.0%
116 · Grok 4.1 Fast · xAI · Closed · 16.9%
117 · GLM-4.5-Air · Z.AI · Closed · 16.5%
118 · Mistral Large 3 · Mistral · Closed · 15.9%
119 · Nemotron 3 Nano Omni 30B A3B · NVIDIA · Open weight · 14.9%
120 · GPT-OSS 20B · OpenAI · Open weight · 14.9%
121 · GPT-4.1 mini · OpenAI · Closed · 14.8%
122 · Llama 4 Maverick · Meta · Open weight · 14.3%
123 · Nemotron 3 Nano 30B · NVIDIA · Open weight · 14.2%
124 · DeepSeek V3 · DeepSeek · Open weight · 14.2%
125 · Gemini 2.5 Flash · Google · Closed · 14.1%
126 · Ling 2.6 Flash · InclusionAI · Open weight · 14.1%
127 · Gemma 4 E4B · Google · Open weight · 12.5%
128 · Mistral Medium 3 · Mistral · Closed · 12.5%
129 · Sarvam 105B · Sarvam · Open weight · 11.9%
130 · Claude 3 Opus · Anthropic · Closed · 11.8%
131 · GPT-4o · OpenAI · Closed · 11.2%
132 · Ministral 3 14B (Reasoning) · Mistral · Open weight · 11.1%
133 · Ministral 3 14B · Mistral · Open weight · 11.1%
134 · DeepSeek R1 Distill Qwen 32B · DeepSeek · Open weight · 11.0%
135 · Llama 4 Scout · Meta · Open weight · 10.0%
136 · Gemini 1.5 Pro · Google · Closed · 10.0%
137 · GPT-4.1 nano · OpenAI · Closed · 9.6%
138 · Gemma 4 E2B · Google · Open weight · 9.3%
139 · Mistral Large 2 · Mistral · Closed · 9.2%
140 · Nemotron Ultra 253B · NVIDIA · Open weight · 9.1%
141 · Ministral 3 8B (Reasoning) · Mistral · Open weight · 9.0%
142 · Ministral 3 8B · Mistral · Open weight · 9.0%
143 · Llama 3.1 405B · Meta · Open weight · 8.5%
144 · LFM2.5-8B-A1B · LiquidAI · Open weight · 8.3%
145 · GPT-4 Turbo · OpenAI · Closed · 7.9%
146 · Solar Pro 2 · Upstage · Closed · 7.8%
147 · Nova Pro · Amazon · Closed · 7.7%
148 · Gemma 3 27B · Google · Open weight · 7.4%
149 · Qwen2.5 Coder 32B Instruct · Alibaba · Open weight · 7.1%
150 · GPT-4o mini · OpenAI · Closed · 6.9%
151 · Ministral 3 3B (Reasoning) · Mistral · Open weight · 6.8%
152 · Ministral 3 3B · Mistral · Open weight · 6.8%
153 · Sarvam 30B · Sarvam · Open weight · 6.6%
154 · Exaone 4.0 32B · LG AI Research · Open weight · 6.0%
155 · LFM2-24B-A2B · LiquidAI · Closed · 5.0%
156 · Phi-4 · Microsoft · Open weight · 4.9%
157 · Claude 3 Haiku · Anthropic · Closed · 3.9%
158 · Gemini 1.0 Pro · Google · Closed · 3.1%
159 · Exaone 4.0 1.2B · LG AI Research · Open weight · 2.8%
160 · LFM2.5-1.2B-Thinking · LiquidAI · Closed · 2.8%
161 · LFM2.5-1.2B-Instruct · LiquidAI · Closed · 2.7%
162 · Granite-4.0-H-1B · IBM · Open weight · 2.7%
163 · Granite-4.0-1B · IBM · Open weight · 2.1%
164 · LFM2.5-VL-1.6B-Extract · LiquidAI · Open weight · 1.0%
165 · Granite-4.0-350M · IBM · Open weight · 1.0%
166 · Granite-4.0-H-350M · IBM · Open weight · 1.0%

The published Artificial Analysis Intelligence Index snapshot places Claude Fable 5 first at 59.9%. Third-place Kimi K3 (57.1%) is 2.8 points behind. The top 10 span 8.5 points, from 59.9% down to GPT-5.4 at 51.4%, so the leading results sit in a relatively narrow band.
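The two gaps quoted above can be reproduced directly from the published scores. The following is a minimal sketch, not part of BenchLM's tooling; the helper name `gap` is hypothetical, and the scores are the top-10 values copied from the table on this page.

```python
# Top-10 scores (percent) copied from the score table on this page.
top10 = [59.9, 58.9, 57.1, 55.7, 55.0, 54.8, 53.8, 53.5, 53.4, 51.4]


def gap(scores, i, j):
    """Return the difference in percentage points between rows i and j.

    Rows are 0-indexed; the result is rounded to one decimal place to
    absorb floating-point noise.
    """
    return round(scores[i] - scores[j], 1)


# Lead of the first row over the third row.
lead_over_third = gap(top10, 0, 2)

# Spread between the first and tenth rows.
top10_range = gap(top10, 0, len(top10) - 1)

print(f"Lead over third place: {lead_over_third} points")
print(f"Top-10 range: {top10_range} points")
```

Both values match the figures in the paragraph above: 2.8 points to third place and 8.5 points across the top 10.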

166 models have been evaluated on Artificial Analysis Intelligence Index. The benchmark falls in the Knowledge category. This category carries a 12% weight in BenchLM.ai's overall scoring system. Artificial Analysis Intelligence Index is currently displayed for reference but excluded from the scoring formula, so it does not directly affect overall rankings.
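To illustrate what "displayed for reference but excluded from the scoring formula" could mean in practice, here is a hedged sketch. Only the 12% Knowledge weight and the display-only exclusion come from this page; BenchLM's actual formula is not described here. The plain-mean aggregation, the `BenchmarkResult` class, the `category_contribution` function, and the second benchmark with its score of 80.0 are all assumptions made up for illustration.

```python
from dataclasses import dataclass

# From this page: the Knowledge category carries a 12% weight.
KNOWLEDGE_WEIGHT = 0.12


@dataclass
class BenchmarkResult:
    name: str
    score: float        # percent, 0-100
    display_only: bool  # True: shown for reference, excluded from scoring


def category_contribution(results, weight):
    """Weighted contribution of one category to an overall score.

    ASSUMPTION: the category score is the plain mean of the benchmarks
    that are not display-only. The real aggregation may differ.
    """
    scored = [r.score for r in results if not r.display_only]
    if not scored:
        return 0.0
    return weight * sum(scored) / len(scored)


results = [
    # Real score from this page, flagged display-only.
    BenchmarkResult("Artificial Analysis Intelligence Index", 59.9, display_only=True),
    # Hypothetical benchmark and score, for illustration only.
    BenchmarkResult("hypothetical-knowledge-benchmark", 80.0, display_only=False),
]

with_index = category_contribution(results, KNOWLEDGE_WEIGHT)
without_index = category_contribution(results[1:], KNOWLEDGE_WEIGHT)
print(with_index == without_index)
```

Under these assumptions, adding or removing the display-only row leaves the category contribution unchanged, which is the behavior the paragraph above describes.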

About Artificial Analysis Intelligence Index

Year: 2026

Tasks: Cross-benchmark intelligence index

Format: Aggregated model score

Difficulty: Display-only external reference

BenchLM tracks Artificial Analysis as a display-only external reference rather than a weighted benchmark. It is useful as a market snapshot, but it is not a benchmark-native row with a single public task set, scoring harness, or exact-source methodology aligned to BenchLM's core benchmark pages.

Artificial Analysis

BenchLM freshness & provenance

Version: Artificial Analysis Intelligence Index 2026

Refresh cadence: Quarterly

Staleness state: Current

Question availability: Public benchmark set

Current · Display only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

FAQ

What does Artificial Analysis Intelligence Index measure?

A display-only intelligence index published by Artificial Analysis that aggregates provider-reported and benchmark-derived signals into a single model-level score.

Which model scores highest on Artificial Analysis Intelligence Index?

Claude Fable 5 by Anthropic currently leads with a score of 59.9% on Artificial Analysis Intelligence Index.

How many models are evaluated on Artificial Analysis Intelligence Index?

BenchLM lists Artificial Analysis Intelligence Index scores for 166 AI models.

Compare Top Models on Artificial Analysis Intelligence Index

Claude Fable 5 vs GPT-5.6 Sol · GPT-5.6 Sol vs Kimi K3 · Kimi K3 vs Claude Opus 4.8 · Claude Opus 4.8 vs GPT-5.6 Terra

Last updated: July 23, 2026 · BenchLM version Artificial Analysis Intelligence Index 2026
