Qwen2.5 14B Instruct
Q4_K - Medium
14.8B params
COMPARE ACCELERATORS
259 accelerators tested
Apple M3 4P+4E+10GPU (16GB)
NVIDIA RTX PRO 6000 Blackwell Workstation Edition (95GB)
NVIDIA GeForce RTX 4090 (23GB)
NVIDIA GeForce RTX 4090 D (47GB)
NVIDIA RTX 6000 Ada Generation (47GB)
LEADERBOARD
| Type / Memory | Prompt (tokens/s) | Generation (tokens/s) | TTFT | LocalScore |
|---|---|---|---|---|
| - | 5255 | 81.9 | 240 ms | 1215 |
| GPU / 47GB | 3552 | 70.0 | 369 ms | 877 |
| GPU / 16GB | 2681 | 43.6 | 503 ms | 614 |
| GPU / 48GB | 2936 | 28.1 | 471 ms | 560 |
| - | 2913 | 24.1 | 434 ms | 545 |
| GPU / 16GB | 2271 | 41.0 | 580 ms | 543 |
| - | 2581 | 17.9 | 637 ms | 438 |
| GPU / 12GB | 1701 | 24.9 | 786 ms | 377 |
| GPU / 20GB | 1417 | 30.0 | 944 ms | 356 |
| GPU / 11GB | 1140 | 39.5 | 1.09 sec | 346 |
| GPU / 15GB | 1364 | 33.5 | 1.26 sec | 345 |
| - | 1298 | 25.1 | 1.02 sec | 315 |
| GPU / 16GB | 1244 | 26.0 | 1.10 sec | 309 |
| GPU / 16GB | 1284 | 22.6 | 1.03 sec | 304 |
| GPU / 20GB | 1051 | 24.7 | 1.30 sec | 271 |
| GPU / 16GB | 858 | 28.2 | 1.46 sec | 255 |
| GPU / 16GB | 839 | 27.6 | 1.55 sec | 246 |
| - | 952 | 16.4 | 1.44 sec | 221 |
| GPU / 512GB | 577 | 35.8 | 2.05 sec | 216 |
| GPU / 256GB | 568 | 36.7 | 2.08 sec | 216 |
| GPU / 16GB | 740 | 20.7 | 1.81 sec | 204 |
| GPU / 128GB | 471 | 36.6 | 2.48 sec | 191 |
| GPU / 96GB | 445 | 34.4 | 2.67 sec | 179 |
| GPU / 64GB | 381 | 34.2 | 3.06 sec | 162 |
| GPU / 128GB | 372 | 32.4 | 3.13 sec | 157 |
| GPU / 128GB | 290 | 27.8 | 4.04 sec | 126 |
| GPU / 8GB | 289 | 3.8 | 8.84 sec | 50 |
| - | 230 | 2.1 | 14.47 sec | 32 |
| GPU / 6GB | 83 | 1.1 | 24.98 sec | 15 |
