LLM Benchmarks

ModelCreated ByLicenseContext WindowIntelligence IndexMMLU-Pro (Reasoning & Knowledge)GPQA Diamond (Scientific Reasoning)Humanity's Last Exam (Reasoning & Knowledge)LiveCodeBench (Coding)SciCode (Coding)HumanEval (Coding)MATH-500 (Quantitative Reasoning)AIME 2024 (Competition Math)Multilingual Index (Artificial Analysis)Input Price (USD/1M Tokens)Output Price (USD/1M Tokens)Median Tokens/sP5 Tokens/sP25 Tokens/sP75 Tokens/sP95 Tokens/sMedian First Chunk (s)P5 First Chunk (s)P25 First Chunk (s)P75 First Chunk (s)P95 First Chunk (s)Total Response (s)Reasoning Time (s)
o4-mini (high)OpenAIProprietary200k7083%78%18%80%47%99%99%94%$1.93 $1.10 $4.40 128.674.7101.3145.1172.239.9639.9617.2326.4153.8472.6943.850
Gemini 2.5 ProGoogleProprietary1m6986%84%17%78%40%99%98%87%$3.44 $1.25 $10.00 148.4139.1143.4154.2160.440.5440.5429.8734.4244.6568.4943.910
o3OpenAIProprietary128k6785%83%20%53%41%99%99%90%$17.50 $10.00 $40.00 234145.9186.8287.7386.614.1914.197.5910.6116.5522.1316.330
Grok 3 mini Reasoning (high)xAIProprietary1m6783%79%11%70%41%98%99%93%$0.35 $0.30 $0.50 75.838.94999.1166.40.2826.650.190.260.5119.9833.2426.37
o3-mini (high)OpenAIProprietary200k6680%77%12%73%40%99%86%$1.93 $1.10 $4.40 156.6110135.8167.8197.956.6556.6528.3448.3563.8684.1159.850
o3-miniOpenAIProprietary200k6379%75%9%72%40%97%97%77%$1.93 $1.10 $4.40 149.988.9125.4177216.815.0115.019.7611.3618.3221.6618.350
Qwen3 235B A22B (Reasoning)AlibabaOpen128k6283%70%12%62%40%93%84%$0.30 $0.20 $0.60 41.31830.351.387.80.5548.990.260.450.632.5661.148.43
o1OpenAIProprietary200k6284%75%8%68%36%97%97%72%$26.25 $15.00 $60.00 129.499109.9158.3187.922.0922.0915.4720.1827.1836.9225.950
Llama 3.1 Nemotron Ultra 253B ReasoningNVIDIAOpen128k6183%73%8%64%35%95%75%$0.90 $0.60 $1.80 42.537.841.742.642.70.6447.690.610.640.660.759.4547.04
Gemini 2.5 Flash (Reasoning)GoogleProprietary1m6080%70%12%51%36%98%84%$0.99 $0.15 $3.50 351.6314.3332357.4372.49.029.027.37.9310.2515.0310.450
DeepSeek R1DeepSeekOpen128k6084%71%9%62%36%98%97%68%$0.96 $0.55 $2.19
Qwen3 32B (Reasoning)AlibabaOpen128k5980%67%8%55%35%96%81%$0.17 $0.10 $0.38 43.220.934.5330.5335.60.5546.80.230.410.713.0458.3746.25
QwQ-32BAlibabaOpen131k5876%59%8%63%36%98%96%78%$0.47 $0.40 $0.55 106.233.683.7360447.90.4423.890.20.340.722.9228.5923.45
Claude 3.7 Sonnet ThinkingAnthropicProprietary200k5784%77%10%47%40%98%95%49%$6.00 $3.00 $15.00
Qwen3 14B (Reasoning)AlibabaOpen128k5677%60%4%52%32%96%96%76%$0.12 $0.08 $0.24 76.742618888.80.5526.620.490.510.650.9133.1426.07
Qwen3 30B A3B (Reasoning)AlibabaOpen128k5678%62%7%51%28%96%75%$0.19 $0.10 $0.45 125.838.677.2137.1186.10.5516.450.180.50.610.8220.4215.9
o1-miniOpenAIProprietary128k5474%60%5%58%32%97%94%60%$1.93 $1.10 $4.40 219.4160.9195.7236.2259.19.99.948.712.991512.180
DeepSeek V3 (Mar' 25)DeepSeekOpen128k5382%66%5%41%36%92%94%52%$0.48 $0.27 $1.10
GPT-4.1 miniOpenAIProprietary1m5378%66%5%48%40%95%93%43%$0.70 $0.40 $1.60 73.139.656.281.7920.580.580.480.530.650.87.420
GPT-4.1OpenAIProprietary1m5381%67%5%46%38%96%91%44%$3.50 $2.00 $8.00 123.770.698158188.80.460.460.340.390.570.814.50
Gemini 2.0 Flash Thinking exp. (Jan '25)GoogleProprietary1m5280%70%7%32%33%94%50%$0.00 $0.00 $0.00
DeepSeek R1 Distill Qwen 32BDeepSeekOpen128k5274%62%6%27%38%95%94%69%$0.22 $0.21 $0.24 4512.42048.650.90.2744.760.180.241.226.0955.8844.49
Qwen3 8B (Reasoning)AlibabaOpen128k5174%59%4%41%23%90%75%$0.06 $0.04 $0.14 90.746.885.6100101.20.6922.740.630.680.720.9628.2622.05
Llama 3.3 Nemotron Super 49B ReasoningNVIDIAOpen128k5179%64%7%28%28%96%96%58%$0.00 $0.00 $0.00
Grok 3xAIProprietary1m5180%69%5%43%37%91%87%33%$6.00 $3.00 $15.00 7221.956.885.3940.420.420.310.340.571.257.370
Llama 4 MaverickMetaOpen1m5181%67%5%40%33%88%89%39%$0.35 $0.20 $0.85 137.460.3107.9244.77880.360.360.20.290.482.053.990
GPT-4o (March 2025)OpenAIProprietary128k5080%66%5%43%37%96%89%33%$7.50 $5.00 $15.00 145.9105.1135.4159195.40.30.30.260.270.340.363.730
Gemini 2.0 Pro ExperimentalGoogleProprietary2m4981%62%7%35%31%95%92%36%$0.00 $0.00 $0.00 21.41.84.933.637.517.2117.2116.2316.7617.9219.4940.550
DeepSeek R1 Distill Qwen 14BDeepSeekOpen128k4974%48%4%38%24%93%95%67%$0.88 $0.88 $0.88 169.744.245.6170.7172.60.4112.190.260.280.740.9215.1411.78
Mistral Medium 3MistralProprietary128k4976%58%4%40%33%90%91%44%$0.00 $0.00 $0.00
Gemini 2.5 FlashGoogleProprietary1m4978%59%5%41%23%93%43%$0.26 $0.15 $0.60 274.2234.3257.9291.5308.70.40.40.230.30.420.452.220
DeepSeek R1 Distill Llama 70BDeepSeekOpen128k4880%40%6%27%31%97%94%67%$0.60 $0.54 $0.87 12030.559.6353.425590.5417.20.190.30.942.7221.3616.66
Claude 3.7 SonnetAnthropicProprietary200k4880%66%5%39%38%95%85%22%$6.00 $3.00 $15.00 77.166.873.278.782.21.31.30.570.931.723.747.790
Gemini 2.0 FlashGoogleProprietary1m4878%62%5%33%31%90%93%33%$0.17 $0.10 $0.40 235.4176.9221.4243.8260.20.380.380.260.330.410.462.50
Qwen3 4B (Reasoning)AlibabaOpen32k4770%52%5%47%4%91%93%66%$0.12 $0.08 $0.24 157.169146.3158.1158.60.5313.270.470.490.620.6916.4512.73
Reka Flash 3Reka AIOpen128k4767%53%5%44%27%95%89%51%$0.35 $0.20 $0.80 56.45555.957.5580.9636.420.860.90.981.0645.2935.46
Qwen3 235B A22BAlibabaOpen128k4776%61%5%34%30%90%33%$0.00 $0.00 $0.00
Gemini 2.0 Flash (exp)GoogleProprietary1m4678%64%5%21%34%91%91%30%$0.00 $0.00 $0.00 237.6207.8226.7245.4259.90.260.260.210.240.280.342.370
DeepSeek V3 (Dec '24)DeepSeekOpen128k4675%56%4%36%35%91%89%25%$0.48 $0.27 $1.10
Qwen2.5 MaxAlibabaProprietary32k4576%59%5%36%34%93%84%23%$2.80 $1.60 $6.40 49.139.845.350.6531.261.261.161.21.431.5811.440
Gemini 1.5 Pro (Sep)GoogleProprietary2m4575%59%5%32%30%90%88%23%$2.19 $1.25 $5.00 91.886.487.694.9102.30.430.430.390.410.451.095.870
Claude 3.5 Sonnet (Oct)AnthropicProprietary200k4477%60%4%38%37%93%77%16%$6.00 $3.00 $15.00 74.766.570.878.382.31.611.610.591.082.715.638.310
Qwen3 32BAlibabaOpen128k4473%54%4%29%28%90%87%30%$0.00 $0.00 $0.00
SonarPerplexityProprietary127k4369%47%7%30%23%82%82%49%$1.00 $1.00 $1.00 94.636.379.9138.4161.61.751.751.421.661.872.257.040
Llama 4 ScoutMetaOpen10m4375%59%4%30%17%83%84%28%$0.27 $0.16 $0.54 126.934.787.2166.81974.70.40.40.20.30.662.234.340
Sonar ProPerplexityProprietary200k4376%58%8%28%23%85%75%29%$6.00 $3.00 $15.00 81.158.167.9131.8165.72.412.411.71.853.024.498.580
QwQ 32B-PreviewAlibabaOpen33k4365%56%5%34%4%87%91%45%$0.23 $0.17 $0.43 63.333.145.193.298.80.4320.190.280.520.6239.931.6
Nova PremierAmazonProprietary1m4373%57%5%32%28%91%84%17%$5.00 $2.50 $12.50 63.960.462.866.272.80.790.790.720.760.850.878.620
Qwen3 30B A3BAlibabaOpen128k4371%52%5%32%26%86%26%$0.00 $0.00 $0.00
GPT-4o (Nov '24)OpenAIProprietary128k4175%54%3%31%33%93%76%15%$4.38 $2.50 $10.00 136.7104.8120.3147.8159.30.540.540.460.520.560.624.20
Gemini 2.0 Flash-Lite (Feb '25)GoogleProprietary1m4172%54%4%19%25%88%87%28%$0.13 $0.07 $0.30 201.6182.1194.7209216.60.280.280.220.260.30.312.760
Llama 3.3 70BMetaOpen128k4171%50%4%29%26%86%77%30%$0.60 $0.59 $0.70 112.225.541.8165.819030.450.450.230.340.622.734.910
GPT-4.1 nanoOpenAIProprietary1m4166%51%4%33%26%88%85%24%$0.17 $0.10 $0.40 223.2181.5192.7238.6255.60.540.540.310.470.820.852.780
Qwen3 14BAlibabaOpen128k4168%47%4%28%27%87%28%$0.00 $0.00 $0.00
GPT-4o (May '24)OpenAIProprietary128k4174%53%3%33%31%94%79%11%$7.50 $5.00 $15.00 108.552.479.4174.22070.370.370.290.350.430.664.980
Llama 3.1 405BMetaOpen128k4073%52%4%31%30%85%70%21%$3.25 $3.25 $3.25 3726.331.589.2165.70.670.670.380.431.162.3814.190
Qwen2.5 72BAlibabaOpen131k4072%49%4%28%27%88%86%16%$0.00 $0.00 $0.00 49.342.747.553.254.61.051.050.970.991.141.2911.190
MiniMax-Text-01MiniMaxOpen4m4076%58%4%25%25%86%75%13%$0.42 $0.20 $1.10 33.826.930.638.946.50.790.790.630.6811.6315.580
Phi-4Microsoft AzureOpen16k4071%57%4%23%26%87%81%14%$0.22 $0.13 $0.50 33.512.72444.645.50.460.460.360.430.521.3515.370
Command ACohereOpen256k4071%53%5%29%28%82%82%10%$4.38 $2.50 $10.00 104.9100.4103.9108.8111.90.20.20.190.190.210.234.970
Tulu3 405BAllen Institute for AIOpen128k4072%52%4%29%30%89%78%13%$0.00 $0.00 $0.00
Llama 3.3 Nemotron Super 49B v1NVIDIAOpen128k3970%52%4%28%23%83%78%19%$0.00 $0.00 $0.00
Grok 2xAIProprietary131k3971%51%4%27%28%86%78%13%$0.00 $0.00 $0.00
Gemini 1.5 Flash (Sep)GoogleProprietary1m3968%46%4%27%27%84%83%18%$0.13 $0.07 $0.30 191.1179.9187.3195.2201.10.190.190.160.180.20.232.810
Mistral Large 2 (Nov '24)MistralOpen128k3870%49%4%29%29%90%74%11%$3.00 $2.00 $6.00 67.322.946.979.887.90.450.450.360.391.4615.817.880
Qwen3 1.7B (Reasoning)AlibabaOpen32k3857%36%5%31%4%85%89%51%$0.00 $0.00 $0.00 48.944.545.6137.9167.40.6341.50.560.610.690.7351.7240.88
Gemma 3 27BGoogleOpen128k3867%43%5%14%21%89%88%25%$0.00 $0.00 $0.00
Grok BetaxAIProprietary128k3870%47%5%24%30%87%74%10%$7.50 $5.00 $15.00 66.560.164.567.568.60.30.30.240.280.350.397.820
Pixtral LargeMistralOpen128k3770%51%4%26%29%85%71%7%$3.00 $2.00 $6.00 41.228.135.94242.90.380.380.340.360.6391.7712.530
Qwen2.5 Instruct 32BAlibabaOpen128k3770%47%4%25%23%90%81%11%$0.15 $0.10 $0.30
Llama 3.1 Nemotron 70BNVIDIAOpen128k3769%47%5%17%23%82%73%25%$0.24 $0.18 $0.40 48.121.439.163.472.30.580.580.280.350.672.5710.980
Nova ProAmazonProprietary300k3769%50%3%23%21%83%79%11%$1.40 $0.80 $3.20
Qwen3 8BAlibabaOpen128k3764%45%3%20%17%83%24%$0.00 $0.00 $0.00
Mistral Large 2 (Jul '24)MistralOpen128k3768%47%3%27%27%89%71%9%$3.00 $2.00 $6.00 39.21636.740.941.90.450.450.40.410.560.8713.20
Qwen2.5 Coder 32BAlibabaOpen131k3664%42%4%30%27%90%77%12%$0.15 $0.14 $0.19 52.741.946.979.788.80.540.540.230.320.972.9910.030
GPT-4o miniOpenAIProprietary128k3665%43%4%23%23%88%79%12%$0.26 $0.15 $0.60 71.334.549.775.693.80.630.630.320.480.650.837.640
Llama 3.1 70BMetaOpen128k3568%41%5%23%27%81%65%17%$0.56 $0.56 $0.73 64.930.241.3132.4171.60.510.510.240.320.671.928.220
Mistral Small 3.1MistralOpen128k3566%45%5%21%27%86%71%9%$0.15 $0.10 $0.30 151.772.7135.6166.6172.70.270.270.250.250.290.393.570
Mistral Small 3MistralOpen32k3565%46%4%25%24%85%72%8%$0.15 $0.10 $0.30 146.8112.7136.7157.2166.60.270.270.260.260.30.353.670
Qwen3 4BAlibabaOpen32k3559%40%4%23%17%84%21%$0.00 $0.00 $0.00
Claude 3 OpusAnthropicProprietary200k3570%49%3%28%23%85%64%3%$30.00 $15.00 $75.00 2824.42629.630.91.131.130.991.051.381.4818.960
Claude 3.5 HaikuAnthropicProprietary200k3563%41%4%31%27%86%72%3%$1.60 $0.80 $4.00 66.1576267.768.60.90.90.570.641.212.248.470
DeepSeek R1 Distill Llama 8BDeepSeekOpen128k3454%30%4%23%12%84%85%33%$0.04 $0.04 $0.04 43.534.53753.655.90.6746.650.660.6674.76378.8658.1445.98
Gemma 3 12BGoogleOpen128k3460%35%5%14%17%83%85%22%$0.06 $0.05 $0.10 23.813.218.635.650.70.610.610.330.560.791.0321.60
Gemini 1.5 Pro (May)GoogleProprietary2m3466%37%4%24%27%83%67%8%$2.19 $1.25 $5.00 66.863.865.467.769.30.350.350.340.350.3911.297.840
Qwen TurboAlibabaProprietary1m3463%41%4%16%15%85%81%12%$0.09 $0.05 $0.20 106.494.4103109.8112.31.011.010.960.971.091.35.710
Llama 3.2 90B (Vision)MetaOpen128k3367%43%5%21%24%82%63%5%$0.72 $0.72 $0.72 36.523.432.151.160.90.270.270.180.210.521.0613.960
Qwen2 72BAlibabaOpen131k3362%37%4%16%23%83%70%15%$0.00 $0.00 $0.00 3130.830.931.131.21.31.31.241.261.461.5917.430
Nova LiteAmazonProprietary300k3359%43%5%17%14%84%77%11%$0.10 $0.06 $0.24 287265274302.5319.20.320.320.30.30.330.352.060
Gemini 1.5 Flash-8BGoogleProprietary1m3157%36%5%22%23%12%69%3%$0.07 $0.04 $0.15 287.2277.8282.6293.3299.90.190.190.160.170.20.221.930
DeepHermes 3 - Mistral 24BNous ResearchOpen32k3058%38%4%20%23%75%60%5%$0.00 $0.00 $0.00
Jamba 1.5 LargeAI21 LabsOpen256k2957%43%4%14%16%24%61%5%$3.50 $2.00 $8.00
Hermes 3 - Llama-3.1 70BNous ResearchOpen128k2957%40%4%19%23%75%54%2%$0.00 $0.00 $0.00
Jamba 1.6 LargeAI21 LabsOpen256k2956%39%4%17%18%70%58%5%$3.50 $2.00 $8.00 57.911.245.763.467.50.560.560.350.450.791.239.20
Gemini 1.5 Flash (May)GoogleProprietary1m2857%32%4%20%18%72%55%9%$0.13 $0.07 $0.30 315.1294.9306.8330.8356.60.260.260.240.250.270.281.850
Nova MicroAmazonProprietary130k2853%36%5%14%9%80%70%8%$0.06 $0.04 $0.14 322.6284.5307334.5356.10.30.30.280.290.320.331.850
Yi-Large01.AIProprietary32k2859%36%3%11%19%74%56%7%$3.00 $3.00 $3.00 66.262.764.969.780.90.40.40.330.350.422.317.950
Claude 3 SonnetAnthropicProprietary200k2858%40%4%18%23%71%41%5%$6.00 $3.00 $15.00 60.552.458.662.463.50.810.810.540.61.161.649.080
Codestral (Jan '25)MistralProprietary256k2845%31%5%24%25%85%61%4%$0.45 $0.30 $0.90 108.667.396.9120137.40.280.280.260.270.320.484.890
Llama 3 70BMetaOpen8k2757%38%4%20%19%79%48%0%$0.84 $0.73 $0.84 47.417.919.5151.8335.90.450.450.220.360.881.7510.990
Mistral Small (Sep '24)MistralOpen33k2753%38%4%14%16%81%56%6%$0.30 $0.20 $0.60 85.570.778.491.4100.30.280.280.270.270.290.496.130
Phi-4 MultimodalMicrosoft AzureOpen128k2749%32%4%13%11%73%69%9%$0.00 $0.00 $0.00 2114.519.321.723.70.350.350.30.320.360.3824.150
Qwen2.5 Coder 7BAlibabaOpen131k2747%34%5%13%15%90%66%5%$0.03 $0.02 $0.06 199.9133.6191.5210.3212.50.480.480.460.470.530.652.980
Mistral Large (Feb '24)MistralProprietary33k2652%35%3%18%21%71%53%0%$6.00 $4.00 $12.00 30.729.43031.131.80.450.450.430.440.460.5616.730
Mixtral 8x22BMistralOpen65k2654%33%4%15%19%72%55%0%$3.00 $2.00 $6.00 57.142.748.269.286.40.330.330.290.310.360.479.080
Phi-4 MiniMicrosoft AzureOpen128k2647%33%4%13%11%74%70%3%$0.00 $0.00 $0.00 58.55356.159.761.20.330.330.280.310.350.378.880
Qwen3 1.7BAlibabaOpen32k2541%28%5%13%7%72%10%$0.00 $0.00 $0.00
Phi-3 Medium 14BMicrosoft AzureOpen128k2554%33%5%15%12%0%46%1%$0.30 $0.17 $0.68 5350.452.153.654.20.410.410.380.40.440.469.840
Gemma 3 4BGoogleOpen128k2442%29%5%7%6%72%77%5%$0.03 $0.02 $0.04 124.181.2108.7146159.80.220.220.150.170.240.264.240
Claude 2.1AnthropicProprietary200k2450%32%4%20%18%16%37%3%$12.00 $8.00 $24.00 1413.513.714.214.40.910.910.820.8811.3536.60
Llama 3.1 8BMetaOpen128k2448%26%5%12%13%67%52%8%$0.10 $0.10 $0.10 222.246.193.6457.917920.320.320.160.230.511.042.570
Pixtral 12BMistralOpen128k2347%34%5%12%14%78%46%0%$0.15 $0.15 $0.15 100.591.698.1103.6105.20.290.290.270.290.310.565.260
Qwen3 0.6B (Reasoning)AlibabaOpen32k2335%24%6%12%3%49%75%10%$0.00 $0.00 $0.00
Mistral Small (Feb '24)MistralProprietary33k2342%30%4%11%13%79%56%1%$1.50 $1.00 $3.00 138.575.9107.8163.8176.10.270.270.250.250.30.763.880
Mistral MediumMistralProprietary33k2349%35%3%10%12%41%4%$4.09 $2.75 $8.10 46.426.438.358.789.60.420.420.340.360.531.0611.20
Ministral 8BMistralOpen128k2239%28%5%111277%57%4%$0.10 $0.10 $0.10 134.8119.8126.3137.2139.20.270.270.260.260.280.363.980
Gemma 2 9BGoogleOpen8k2250%31%4%13165%52%0%$0.12 $0.12 $0.15
Phi-3 MiniMicrosoft AzureOpen4k2244%32%4%12925%46%4%$0.00 $0.00 $0.00
LFM 40BLiquid AIProprietary32k2243%33%5%10751%48%2%$0.15 $0.15 $0.15 169.1153.8164173.8200.20.210.210.140.160.950.993.170
Command-R+CohereOpen128k2143%34%5%111263%40%0%$4.38 $2.50 $10.00 48.243.147.249.450.80.260.260.240.250.270.310.640
Llama 3 8BMetaOpen8k2141%30%5%101271%50%0%$0.09 $0.06 $0.14 104.759.674.1187.41353.70.340.340.210.290.460.975.120
Gemini 1.0 ProGoogleProprietary33k2143%28%5%12122%40%1%$0.75 $0.50 $1.50
Codestral (May '24)MistralOpen33k2033%26%5%212280%35%0%$0.30 $0.20 $0.60 105.293.8102.2108.5110.10.30.30.280.290.320.445.050
Aya Expanse 32BCohereOpen128k2038%23%5%141568%45%0%$0.75 $0.50 $1.50 118.3114.8116.3123.6127.20.160.160.140.150.170.214.390
Llama 2 Chat 13BMetaOpen4k2041%32%5%101233%2%$0.00 $0.00 $0.00
Command-R+ (Apr '24)CohereOpen128k2043%32%5%121264%28%1%$6.00 $3.00 $15.00 7262.767.375.377.50.210.210.190.210.230.277.150
DBRXDatabricksOpen33k2040%33%7%91267%28%3%$0.00 $0.00 $0.00
Ministral 3BMistralProprietary128k2034%26%6%7974%54%0%$0.04 $0.04 $0.04 225.6159.8220.6229.7232.10.260.260.240.250.280.472.480
Mistral NeMoMistralOpen128k2040%31%4%61065%40%0%$0.15 $0.15 $0.15 140.891.1133147.7149.90.290.290.270.270.30.343.840
Llama 3.2 3BMetaOpen128k2035%26%5%8556%49%7%$0.04 $0.03 $0.05 107.47084.8222.81591.40.460.460.190.230.611.165.120

 

Models compared: OpenAI: GPT 4o Audio, GPT 4o Realtime, GPT 4o Speech Pipeline, GPT-3.5 Turbo, GPT-3.5 Turbo (0125), GPT-3.5 Turbo (0301), GPT-3.5 Turbo (0613), GPT-3.5 Turbo (1106), GPT-3.5 Turbo Instruct, GPT-4, GPT-4 Turbo, GPT-4 Turbo (0125), GPT-4 Turbo (1106), GPT-4 Vision, GPT-4.1, GPT-4.1 mini, GPT-4.1 nano, GPT-4.5 (Preview), GPT-4o (April 2025), GPT-4o (Aug ’24), GPT-4o (ChatGPT), GPT-4o (March 2025), GPT-4o (May ’24), GPT-4o (Nov ’24), GPT-4o Realtime (Dec ’24), GPT-4o mini, GPT-4o mini Realtime (Dec ’24), o1, o1-mini, o1-preview, o1-pro, o3, o3-mini, o3-mini (high), and o4-mini (high), Meta: Code Llama 70B, Llama 2 Chat 13B, Llama 2 Chat 70B, Llama 2 Chat 7B, Llama 3 70B, Llama 3 8B, Llama 3.1 405B, Llama 3.1 70B, Llama 3.1 8B, Llama 3.2 11B (Vision), Llama 3.2 1B, Llama 3.2 3B, Llama 3.2 90B (Vision), Llama 3.3 70B, Llama 4 Behemoth, Llama 4 Maverick, Llama 4 Scout, and Llama 65B, Google: Gemini 1.0 Pro, Gemini 1.0 Ultra, Gemini 1.5 Flash (May), Gemini 1.5 Flash (Sep), Gemini 1.5 Flash-8B, Gemini 1.5 Pro (May), Gemini 1.5 Pro (Sep), Gemini 2.0 Flash, Gemini 2.0 Flash (exp), Gemini 2.0 Flash Thinking exp. (Dec ’24), Gemini 2.0 Flash Thinking exp. (Jan ’25), Gemini 2.0 Flash-Lite (Feb ’25), Gemini 2.0 Flash-Lite (Preview), Gemini 2.0 Pro Experimental, Gemini 2.5 Flash, Gemini 2.5 Flash (May ’25), Gemini 2.5 Flash (May ’25) (Reasoning), Gemini 2.5 Flash (April ’25) (Reasoning), Gemini 2.5 Pro, Gemini 2.5 Pro Preview (May’ 25), Gemini Experimental (Nov), Gemma 2 27B, Gemma 2 9B, Gemma 3 12B, Gemma 3 1B, Gemma 3 27B, Gemma 3 4B, Gemma 3n E4B, Gemma 7B, and PALM-2, Anthropic: Claude 2.0, Claude 2.1, Claude 3 Haiku, Claude 3 Opus, Claude 3 Sonnet, Claude 3.5 Haiku, Claude 3.5 Sonnet (June), Claude 3.5 Sonnet (Oct), Claude 3.7 Sonnet Thinking, Claude 3.7 Sonnet, Claude 4 Opus, Claude 4 Opus Thinking, Claude 4 Sonnet, Claude 4 Sonnet Thinking, and Claude Instant, Mistral: Codestral (Jan ’25), Codestral (May ’24), Codestral-Mamba, Devstral, Ministral 3B, Ministral 8B, Mistral 7B, Mistral Large (Feb ’24), Mistral Large 2 (Jul ’24), Mistral Large 2 (Nov ’24), Mistral Medium, Mistral Medium 3, Mistral NeMo, Mistral Saba, Mistral Small (Feb ’24), Mistral Small (Sep ’24), Mistral Small 3, Mistral Small 3.1, Mixtral 8x22B, Mixtral 8x7B, Pixtral 12B, and Pixtral Large, DeepSeek: DeepSeek Coder V2 Lite, DeepSeek LLM 67B (V1), DeepSeek Prover V2 671B, DeepSeek R1, DeepSeek R1 (FP4), DeepSeek R1 Distill Llama 70B, DeepSeek R1 Distill Llama 8B, DeepSeek R1 Distill Qwen 1.5B, DeepSeek R1 Distill Qwen 14B, DeepSeek R1 Distill Qwen 32B, DeepSeek V3 (Dec ’24), DeepSeek V3 (Mar’ 25), DeepSeek-Coder-V2, DeepSeek-V2, DeepSeek-V2.5, DeepSeek-V2.5 (Dec ’24), DeepSeek-VL2, and Janus Pro 7B, Perplexity: PPLX-70B Online, PPLX-7B-Online, R1 1776, Sonar, Sonar 3.1 Huge, Sonar 3.1 Large, Sonar 3.1 Small , Sonar Large, Sonar Pro, Sonar Reasoning, Sonar Reasoning Pro, and Sonar Small, xAI: Grok 2, Grok 3, Grok 3 Reasoning Beta, Grok 3 mini, Grok 3 mini Reasoning (low), Grok 3 mini Reasoning (high), Grok Beta, and Grok-1, OpenChat: OpenChat 3.5, Amazon: Nova Lite, Nova Micro, Nova Premier, and Nova Pro, Microsoft Azure: Phi-3 Medium 14B, Phi-3 Mini, Phi-4, Phi-4 Mini, Phi-4 Multimodal, Phi-4 mini reasoning, Phi-4 reasoning, and Phi-4 reasoning plus, Liquid AI: LFM 1.3B, LFM 3B, and LFM 40B, Upstage: Solar Mini, Solar Pro, and Solar Pro (Nov ’24), Databricks: DBRX, MiniMax: MiniMax-Text-01, NVIDIA: Cosmos Nemotron 34B, Llama 3.1 Nemotron 70B, Llama 3.1 Nemotron Nano 8B, Llama 3.3 Nemotron Nano 8B v1 (Reasoning), Llama 3.1 Nemotron Ultra 253B Reasoning, Llama 3.3 Nemotron Super 49B v1, and Llama 3.3 Nemotron Super 49B Reasoning, IBM: Granite 3.0 2B, OpenVoice: Granite 3.0 8B, Inceptionlabs: Mercury Coder Mini, Mercury Coder Small, Mercury Instruct, and Mercury Small, Reka AI: Reka Core, Reka Edge, Reka Flash (Feb ’24), Reka Flash, and Reka Flash 3, Xiaomi: MiMo 7B RL, Baichuan: Baichuan 4 and Baichuan M1 (Preview), Other: LLaVA-v1.5-7B, Cohere: Aya Expanse 32B, Aya Expanse 8B, Command, Command A, Command Light, Command R7B, Command-R, Command-R (Mar ’24), Command-R+ (Apr ’24), and Command-R+, Bytedance: Duobao 1.5 Pro, Seed-Thinking-v1.5, Skylark Lite, and Skylark Pro, AI21 Labs: Jamba 1.5 Large, Jamba 1.5 Large (Feb ’25), Jamba 1.5 Mini, Jamba 1.5 Mini (Feb 2025), Jamba 1.6 Large, Jamba 1.6 Mini, and Jamba Instruct, Snowflake: Arctic and Snowflake Llama 3.3 70B, Alibaba: QwQ-32B, QwQ 32B-Preview, Qwen Chat 14B, Qwen Chat 72B, Qwen Chat 7B, Qwen1.5 Chat 110B, Qwen1.5 Chat 14B, Qwen1.5 Chat 32B, Qwen1.5 Chat 72B, Qwen1.5 Chat 7B, Qwen2 72B, Qwen2 Instruct 7B, Qwen2 Instruct A14B 57B, Qwen2-VL 72B, Qwen2.5 Coder 32B, Qwen2.5 Coder 7B , Qwen2.5 Instruct 14B, Qwen2.5 Instruct 32B, Qwen2.5 72B, Qwen2.5 Instruct 7B, Qwen2.5 Max, Qwen2.5 Max 01-29, Qwen2.5 Omni 7B, Qwen2.5 Plus, Qwen2.5 Turbo, Qwen2.5 VL 72B, Qwen2.5 VL 7B, Qwen3 0.6B, Qwen3 0.6B (Reasoning), Qwen3 1.7B, Qwen3 1.7B (Reasoning), Qwen3 14B, Qwen3 14B (Reasoning), Qwen3 235B, Qwen3 235B (Reasoning), Qwen3 30B A3B, Qwen3 30B A3B (Reasoning), Qwen3 32B, Qwen3 32B (Reasoning), Qwen3 4B, Qwen3 4B (Reasoning), Qwen3 8B, and Qwen3 8B (Reasoning), and 01.AI: Yi-Large and Yi-Lightning.

Compare  + LLM Banchmarks at RankLLMs.com