Llama 3.2 1B Instruct
Q4_K - Medium
1.5Bparams
COMPARE ACCELERATORS
405 accelerators tested
Select Accelerators
NVIDIA RTX PRO 6000 Blackwell Workstation Edition
95GB
NVIDIA RTX 6000 Ada Generation
47GB
NVIDIA GeForce RTX 4090 D
47GB
NVIDIA H100 PCIe
79GB
NVIDIA GeForce RTX 3090 Ti
24GB
Llama 3.2 1B Instruct - Q4_K - Medium
LEADERBOARD
PROMPT
30896
tokens/s
GENERATION
374
tokens/s
TTFT
43
ms
LOCALSCORE
6431
GPU / 47GB
PROMPT
27892
tokens/s
GENERATION
416
tokens/s
TTFT
53
ms
LOCALSCORE
6030
GPU / 48GB
PROMPT
19620
tokens/s
GENERATION
131
tokens/s
TTFT
68
ms
LOCALSCORE
3350
GPU / 16GB
PROMPT
15287
tokens/s
GENERATION
206
tokens/s
TTFT
93
ms
LOCALSCORE
3211
GPU / 16GB
PROMPT
16736
tokens/s
GENERATION
141
tokens/s
TTFT
81
ms
LOCALSCORE
3077
PROMPT
16219
tokens/s
GENERATION
104
tokens/s
TTFT
80
ms
LOCALSCORE
2762
GPU / 12GB
PROMPT
13221
tokens/s
GENERATION
171
tokens/s
TTFT
105
ms
LOCALSCORE
2761
GPU / 20GB
PROMPT
10448
tokens/s
GENERATION
210
tokens/s
TTFT
144
ms
LOCALSCORE
2477
PROMPT
8206
tokens/s
GENERATION
181
tokens/s
TTFT
161
ms
LOCALSCORE
2099
GPU / 20GB
PROMPT
8737
tokens/s
GENERATION
189
tokens/s
TTFT
179
ms
LOCALSCORE
2099
GPU / 16GB
PROMPT
7533
tokens/s
GENERATION
157
tokens/s
TTFT
183
ms
LOCALSCORE
1865
GPU / 8GB
PROMPT
6350
tokens/s
GENERATION
212
tokens/s
TTFT
216
ms
LOCALSCORE
1840
GPU / 512GB
PROMPT
5483
tokens/s
GENERATION
179
tokens/s
TTFT
209
ms
LOCALSCORE
1674
GPU / 16GB
PROMPT
6087
tokens/s
GENERATION
168
tokens/s
TTFT
251
ms
LOCALSCORE
1599
GPU / 256GB
PROMPT
4999
tokens/s
GENERATION
178
tokens/s
TTFT
227
ms
LOCALSCORE
1578
GPU / 8GB
PROMPT
5553
tokens/s
GENERATION
195
tokens/s
TTFT
287
ms
LOCALSCORE
1558
PROMPT
5884
tokens/s
GENERATION
131
tokens/s
TTFT
237
ms
LOCALSCORE
1481
GPU / 8GB
PROMPT
6225
tokens/s
GENERATION
96.4
tokens/s
TTFT
224
ms
LOCALSCORE
1388
GPU / 6GB
PROMPT
4985
tokens/s
GENERATION
139
tokens/s
TTFT
278
ms
LOCALSCORE
1356
GPU / 6GB
PROMPT
4979
tokens/s
GENERATION
114
tokens/s
TTFT
286
ms
LOCALSCORE
1255
GPU / 128GB
PROMPT
3296
tokens/s
GENERATION
176
tokens/s
TTFT
334
ms
LOCALSCORE
1203
GPU / 192GB
PROMPT
3272
tokens/s
GENERATION
170
tokens/s
TTFT
339
ms
LOCALSCORE
1179
PROMPT
4087
tokens/s
GENERATION
130
tokens/s
TTFT
354
ms
LOCALSCORE
1144
GPU / 8GB
PROMPT
3399
tokens/s
GENERATION
109
tokens/s
TTFT
416
ms
LOCALSCORE
962
PROMPT
3624
tokens/s
GENERATION
81.6
tokens/s
TTFT
416
ms
LOCALSCORE
893
PROMPT
814
tokens/s
GENERATION
24.4
tokens/s
TTFT
1.83
sec
LOCALSCORE
221