Providers
Provider leaderboard surfaces
BenchLM groups canonical model families by creator so you can compare labs, not just single SKUs. Each provider page shows provisional-ranked depth, verified-ranked depth, current releases, and top-performing families.
Showing 46 of 46 providers
Anthropic
Top modelClaude Mythos Preview
Avg. top 3 score93.7
Provisional-ranked12
Verified-ranked4
Current releases4
OpenAI
Top modelGPT-5.5
Avg. top 3 score88.7
Provisional-ranked15
Verified-ranked2
Current releases5
Top modelGemini 3.1 Pro
Avg. top 3 score86.7
Provisional-ranked10
Verified-ranked1
Current releases4
Alibaba
Top modelQwen3.7 Max
Avg. top 3 score79
Provisional-ranked11
Verified-ranked8
Current releases12
xAI
Top modelGrok 4.1
Avg. top 3 score77
Provisional-ranked6
Verified-ranked0
Current releases3
Z.AI
Top modelGLM-5.1
Avg. top 3 score72.3
Provisional-ranked5
Verified-ranked2
Current releases1
DeepSeek
Top modelDeepSeek V4 Pro (Max)
Avg. top 3 score65
Provisional-ranked8
Verified-ranked1
Current releases1
MiniMax
Top modelMiniMax M3
Avg. top 3 score65
Provisional-ranked2
Verified-ranked1
Current releases1
Moonshot AI
Top modelKimi K2.6
Avg. top 3 score63
Provisional-ranked4
Verified-ranked2
Current releases1
Xiaomi
Top modelMiMo-V2-Flash
Avg. top 3 score59
Provisional-ranked1
Verified-ranked0
Current releases2
Sarvam
Top modelSarvam 105B
Avg. top 3 score39
Provisional-ranked1
Verified-ranked0
Current releases2
Mistral
Top modelMistral Large 3
Avg. top 3 score37
Provisional-ranked5
Verified-ranked0
Current releases4
OpenBMB
Top modelMiniCPM5-1B
Avg. top 3 score34
Provisional-ranked1
Verified-ranked1
Current releases1
Databricks
Top modelDBRX Instruct
Avg. top 3 score32
Provisional-ranked1
Verified-ranked0
Current releases0
NVIDIA
Top modelNemotron 3 Super 100B
Avg. top 3 score30
Provisional-ranked4
Verified-ranked0
Current releases6
Meta
Top modelLlama 3.1 405B
Avg. top 3 score29.7
Provisional-ranked5
Verified-ranked0
Current releases4
Microsoft
Top modelPhi-4
Avg. top 3 score28
Provisional-ranked1
Verified-ranked0
Current releases0
Z
Top modelZ-1
Avg. top 3 score24
Provisional-ranked1
Verified-ranked0
Current releases0
Amazon
Top modelNova Pro
Avg. top 3 score10
Provisional-ranked1
Verified-ranked0
Current releases0
H Company
Top modelHolo3-122B-A10B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
Cursor
Top modelComposer 2.5
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
Interfaze
Top modelInterfaze Beta
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
StepFun
Top modelStep 3.7 Flash
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
LG AI Research
Top modelExaone 4.0 32B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
Tencent
Top modelHy3 Preview
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
Zyphra
Top modelZAYA1-8B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases2
Poolside
Top modelLaguna M.1
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
LiquidAI
Top modelLFM2.5-8B-A1B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases5
Prism ML
Top modelTernary Bonsai 8B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases2
Cohere
Top modelCommand A+
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
InclusionAI
Top modelLing 2.6 Flash
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
Arcee AI
Top modelTrinity-Large-Thinking
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
Inception
Top modelMercury 2
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
IBM
Top modelGranite-4.0-H-1B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
ByteDance
Top modelSeed 1.6
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
Aion Labs
Top modelAion-2.0
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
Upstage
Top modelSolar Pro 2
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
Tencent Hunyuan
Top modelHy-MT1.5-1.8B-1.25bit
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1
SK Telecom
Top modelA.X series
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
Community
Top modelDNA 1.0 8B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
Naver Cloud
Top modelHyperClova X Think 32B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
Kakao
Top modelKanana Flag
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
LightOn
Top modelOriOn-Qwen-32B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
Aleph Alpha
Top modelPharia-1-LLM-7B-control
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
Academic
Top modelThunder-LLM 8B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0
NC AI
Top modelVarco
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0