Search Arena

View overall rankings across LLMs with integrated web search.

Jul 1, 2026
834,288 votes
32 models
Rank Spread
1
11
Anthropic
Anthropic · Proprietary
1253±5
98,018$5 / $251M
2
25
OpenAI · Proprietary
1240±5
52,235$5 / $301.1M
3
25
Anthropic
Anthropic · Proprietary
1237±11
4,964$10 / $501M
4
25
Anthropic
Anthropic · Proprietary
1233±5
54,290$5 / $251M
5
27
Baidu · Proprietary
1227±10
3,829N/AN/A
6
57
Anthropic
Anthropic · Proprietary
1220±5
97,337$3 / $151M
7
511
Google · Proprietary
1213±5
74,199N/AN/A
8
713
Google · Proprietary
1207±5
37,335$2 / $12N/A
9
713
1206±5
71,993$2 / $62M
10
714
OpenAI · Proprietary
1206±6
52,915$1.75 / $14400K
11
714
Anthropic
Anthropic · Proprietary
1204±6
34,370$5 / $251M
12
815
OpenAI · Proprietary
1199±5
60,211$1.25 / $10400K
13
815
Google · Proprietary
1199±5
110,304N/AN/A
14
1015
OpenAI · Proprietary
1195±6
71,778$2.50 / $151.1M
15
1217
xAI · Proprietary
1190±6
54,111N/AN/A
16
1520
Anthropic
Anthropic · Proprietary
1179±6
61,862$5 / $25200K
17
1521
Anthropic
Anthropic · Proprietary
1178±10
3,022$2 / $101M
18
1621
OpenAI · Proprietary
1172±5
76,037$1.75 / $14400K
19
1621
xAI · Proprietary
1171±5
82,168$0.20 / $0.502M
20
1621
xAI · Proprietary
1170±4
43,028$0.20 / $0.502M
21
1722
xAI · Proprietary
1165±5
52,469$1.25 / $2.501M
22
2123
Anthropic
Anthropic · Proprietary
1157±5
91,608$3 / $151M
23
2226
Anthropic
Anthropic · Proprietary
1148±5
77,413$15 / $75200K
24
2327
OpenAI · Proprietary
1144±5
20,787$2 / $8200K
25
2328
Google · Proprietary
1142±5
83,790$1.25 / $101M
26
2328
xAI · Proprietary
1141±6
19,389$3 / $15N/A
27
2429
Perplexity AI · Proprietary
1137±6
29,214$1 / $1127.1K
28
2530
OpenAI · Proprietary
1132±6
20,926$1.25 / $10400K
29
2730
Perplexity AI · Proprietary
1129±6
28,717$1 / $1127.1K
30
2830
Anthropic
Anthropic · Proprietary
1126±5
31,225$15 / $75200K
31
3132
Diffbot · Apache 2.0
1023±8
6,436N/AN/A
32
3132
OpenAI · Proprietary
1006±11
3,441$30 / $608.2K

Default Leaderboard Plots

Average Win Rate Against All Other Models (Uniform Sampling and No Ties)

Fraction of Model A Wins for All Non-tied A vs. B Battles

Confidence Intervals on Model Strength (via Bootstrapping)

Battle Count for Each Combination of Models (without Ties)