Llama 3.2 1B Instruct
Q4_K - Medium
1.5Bparams
COMPARE ACCELERATORS
446 accelerators tested
Select Accelerators
NVIDIA RTX 6000 Ada Generation
47GB
NVIDIA GeForce RTX 4090
23GB
NVIDIA GeForce RTX 4090 D
47GB
NVIDIA L40S
44GB
NVIDIA RTX PRO 6000 Blackwell Workstation Edition
95GB
Llama 3.2 1B Instruct - Q4_K - Medium
LEADERBOARD
GPU / 47GB
PROMPT
27281
tokens/s
GENERATION
411
tokens/s
TTFT
53
ms
LOCALSCORE
5948
PROMPT
25401
tokens/s
GENERATION
244
tokens/s
TTFT
55
ms
LOCALSCORE
4842
GPU / 48GB
PROMPT
19620
tokens/s
GENERATION
131
tokens/s
TTFT
68
ms
LOCALSCORE
3350
GPU / 16GB
PROMPT
15287
tokens/s
GENERATION
206
tokens/s
TTFT
93
ms
LOCALSCORE
3211
GPU / 16GB
PROMPT
16736
tokens/s
GENERATION
141
tokens/s
TTFT
81
ms
LOCALSCORE
3077
PROMPT
16184
tokens/s
GENERATION
106
tokens/s
TTFT
80
ms
LOCALSCORE
2780
GPU / 12GB
PROMPT
13221
tokens/s
GENERATION
171
tokens/s
TTFT
105
ms
LOCALSCORE
2761
GPU / 20GB
PROMPT
10448
tokens/s
GENERATION
210
tokens/s
TTFT
144
ms
LOCALSCORE
2477
PROMPT
8206
tokens/s
GENERATION
181
tokens/s
TTFT
161
ms
LOCALSCORE
2099
GPU / 20GB
PROMPT
8737
tokens/s
GENERATION
189
tokens/s
TTFT
179
ms
LOCALSCORE
2099
GPU / 16GB
PROMPT
7533
tokens/s
GENERATION
157
tokens/s
TTFT
183
ms
LOCALSCORE
1865
GPU / 8GB
PROMPT
6350
tokens/s
GENERATION
212
tokens/s
TTFT
216
ms
LOCALSCORE
1840
GPU / 512GB
PROMPT
5483
tokens/s
GENERATION
179
tokens/s
TTFT
209
ms
LOCALSCORE
1674
GPU / 16GB
PROMPT
6087
tokens/s
GENERATION
168
tokens/s
TTFT
251
ms
LOCALSCORE
1599
GPU / 256GB
PROMPT
4999
tokens/s
GENERATION
178
tokens/s
TTFT
227
ms
LOCALSCORE
1578
GPU / 8GB
PROMPT
5553
tokens/s
GENERATION
195
tokens/s
TTFT
287
ms
LOCALSCORE
1558
PROMPT
5884
tokens/s
GENERATION
131
tokens/s
TTFT
237
ms
LOCALSCORE
1481
GPU / 8GB
PROMPT
6763
tokens/s
GENERATION
90.1
tokens/s
TTFT
213
ms
LOCALSCORE
1419
GPU / 8GB
PROMPT
6225
tokens/s
GENERATION
96.4
tokens/s
TTFT
224
ms
LOCALSCORE
1388
GPU / 6GB
PROMPT
4985
tokens/s
GENERATION
139
tokens/s
TTFT
278
ms
LOCALSCORE
1356
GPU / 6GB
PROMPT
4979
tokens/s
GENERATION
114
tokens/s
TTFT
286
ms
LOCALSCORE
1255
GPU / 128GB
PROMPT
3296
tokens/s
GENERATION
176
tokens/s
TTFT
334
ms
LOCALSCORE
1203
GPU / 192GB
PROMPT
3272
tokens/s
GENERATION
170
tokens/s
TTFT
339
ms
LOCALSCORE
1179
PROMPT
4087
tokens/s
GENERATION
130
tokens/s
TTFT
354
ms
LOCALSCORE
1144
GPU / 128GB
PROMPT
2977
tokens/s
GENERATION
151
tokens/s
TTFT
368
ms
LOCALSCORE
1070
GPU / 8GB
PROMPT
3399
tokens/s
GENERATION
109
tokens/s
TTFT
416
ms
LOCALSCORE
962
PROMPT
3624
tokens/s
GENERATION
81.6
tokens/s
TTFT
416
ms
LOCALSCORE
893
GPU / 6GB
PROMPT
825
tokens/s
GENERATION
36.5
tokens/s
TTFT
1.75
sec
LOCALSCORE
258
PROMPT
814
tokens/s
GENERATION
24.4
tokens/s
TTFT
1.83
sec
LOCALSCORE
221
