Qwen2.5 14B Instruct
Q4_K - Medium
14.8B params
COMPARE ACCELERATORS
183 accelerators tested
NVIDIA RTX PRO 6000 Blackwell Workstation Edition, 95GB
NVIDIA GeForce RTX 4090 D, 47GB
NVIDIA RTX 6000 Ada Generation, 47GB
NVIDIA GeForce RTX 5090, 31GB
NVIDIA H100 PCIe, 79GB
Qwen2.5 14B Instruct - Q4_K - Medium
LEADERBOARD
PROMPT 5099 tokens/s, GENERATION 80.4 tokens/s, TTFT 245 ms, LOCALSCORE 1187
GPU / 47GB
PROMPT 3552 tokens/s, GENERATION 70.0 tokens/s, TTFT 369 ms, LOCALSCORE 877
GPU / 48GB
PROMPT 2936 tokens/s, GENERATION 28.1 tokens/s, TTFT 471 ms, LOCALSCORE 560
GPU / 16GB
PROMPT 2317 tokens/s, GENERATION 42.7 tokens/s, TTFT 567 ms, LOCALSCORE 557
PROMPT 2839 tokens/s, GENERATION 23.2 tokens/s, TTFT 444 ms, LOCALSCORE 529
GPU / 16GB
PROMPT 2424 tokens/s, GENERATION 29.9 tokens/s, TTFT 555 ms, LOCALSCORE 507
GPU / 12GB
PROMPT 1701 tokens/s, GENERATION 24.9 tokens/s, TTFT 786 ms, LOCALSCORE 377
PROMPT 1337 tokens/s, GENERATION 32.8 tokens/s, TTFT 968 ms, LOCALSCORE 356
GPU / 20GB
PROMPT 1417 tokens/s, GENERATION 30.0 tokens/s, TTFT 944 ms, LOCALSCORE 356
GPU / 11GB
PROMPT 1168 tokens/s, GENERATION 39.7 tokens/s, TTFT 1.08 sec, LOCALSCORE 351
GPU / 16GB
PROMPT 1244 tokens/s, GENERATION 26.0 tokens/s, TTFT 1.10 sec, LOCALSCORE 309
GPU / 20GB
PROMPT 1037 tokens/s, GENERATION 24.9 tokens/s, TTFT 1.34 sec, LOCALSCORE 268
GPU / 16GB
PROMPT 858 tokens/s, GENERATION 28.2 tokens/s, TTFT 1.46 sec, LOCALSCORE 255
GPU / 16GB
PROMPT 839 tokens/s, GENERATION 27.6 tokens/s, TTFT 1.55 sec, LOCALSCORE 246
PROMPT 952 tokens/s, GENERATION 16.4 tokens/s, TTFT 1.44 sec, LOCALSCORE 221
GPU / 256GB
PROMPT 568 tokens/s, GENERATION 36.7 tokens/s, TTFT 2.08 sec, LOCALSCORE 216
GPU / 512GB
PROMPT 576 tokens/s, GENERATION 35.8 tokens/s, TTFT 2.05 sec, LOCALSCORE 216
GPU / 16GB
PROMPT 740 tokens/s, GENERATION 20.7 tokens/s, TTFT 1.81 sec, LOCALSCORE 204
GPU / 128GB
PROMPT 471 tokens/s, GENERATION 36.6 tokens/s, TTFT 2.48 sec, LOCALSCORE 191
GPU / 96GB
PROMPT 445 tokens/s, GENERATION 34.4 tokens/s, TTFT 2.67 sec, LOCALSCORE 179
GPU / 128GB
PROMPT 377 tokens/s, GENERATION 33.3 tokens/s, TTFT 3.10 sec, LOCALSCORE 159
GPU / 128GB
PROMPT 290 tokens/s, GENERATION 27.8 tokens/s, TTFT 4.04 sec, LOCALSCORE 126
PROMPT 230 tokens/s, GENERATION 2.1 tokens/s, TTFT 14.47 sec, LOCALSCORE 32
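
The page does not say how LOCALSCORE is derived. The listed values appear consistent with 10 times the geometric mean of prompt speed (tokens/s), generation speed (tokens/s), and 1000 divided by TTFT in milliseconds. The sketch below works on that assumption. The function name `localscore` is hypothetical, and because the displayed figures are rounded, the results match the listed scores only approximately.

```python
def localscore(prompt_tps: float, gen_tps: float, ttft_ms: float) -> float:
    """Assumed formula: 10 * geometric mean of prompt tokens/s,
    generation tokens/s, and 1000 / TTFT (ms)."""
    return 10 * (prompt_tps * gen_tps * (1000 / ttft_ms)) ** (1 / 3)


# (prompt tokens/s, generation tokens/s, TTFT in ms, listed LOCALSCORE)
# Rows are copied from the leaderboard; TTFT given in seconds is converted to ms.
rows = [
    (5099, 80.4, 245, 1187),
    (3552, 70.0, 369, 877),
    (230, 2.1, 14470, 32),
]

for prompt_tps, gen_tps, ttft_ms, listed in rows:
    estimate = localscore(prompt_tps, gen_tps, ttft_ms)
    print(f"estimated {estimate:7.1f}   listed {listed}")
```

Under this assumption a shorter TTFT raises the score just as higher throughput does, which would explain why two rows with the same LOCALSCORE can have noticeably different prompt and generation speeds.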