TEST #1473 RESULTS
09/03/2025 - 6:32 PM
ACCELERATOR
MODEL
Generation: 14.5 tokens/s
Time to first token: 7.68 sec
Prompt: 149 tokens/s
LocalScore: 66
HOW YOU STACK UP
Qwen2.5 14B Instruct - Q4_K - Medium
SYSTEM
CPU: Apple M2 Max 8P+4E
RAM: 64GB
OS: Darwin
Kernel Release: 25.0.0
Architecture: arm64
Version: Cosmopolitan 3.9.7 MODE=aarch64; Darwin Kernel Version 25.0.0: Wed Aug 27 20:19:49 PDT 2025; root:xnu-12377.1.9~17/RELEASE_ARM64_T6020
RUNTIME
Name: llamafile
Version: 0.9.2
Commit Hash: a30b324
DETAILED RESULTS
TEST NAME        PROMPT (tokens/s)   GENERATION (tokens/s)   TTFT
pp1024+tg16      200                 21.5                    5.16 sec
pp4096+tg256     180                 14.1                    22.79 sec
pp2048+tg256     182                 15.2                    11.31 sec
pp2048+tg768     179                 14.9                    11.49 sec
pp1024+tg1024    172                 14.8                    6.01 sec
pp1280+tg3072    158                 11.8                    8.15 sec
pp384+tg1152     129                 12.8                    3.03 sec
pp64+tg1024      98                  12.7                    718 ms
pp16+tg1536      42                  12.5                    443 ms
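
The headline figures at the top of the page appear to be plain arithmetic means of the nine detailed rows, once the two millisecond TTFT values are converted to seconds. The sketch below checks that against the numbers shown here. The `local_score_guess` function is an assumption, not something this page states: it treats the score as 10 times the geometric mean of prompt speed, generation speed, and inverse TTFT in seconds. Applied to the rounded table values it lands near the reported 66, but the official calculation may differ. All names in the block are hypothetical.

```python
from statistics import mean

# Rows transcribed from the DETAILED RESULTS table:
# (test name, prompt tokens/s, generation tokens/s, TTFT in seconds).
# The last two TTFT values are shown in ms on the page (718 ms, 443 ms).
ROWS = [
    ("pp1024+tg16", 200, 21.5, 5.16),
    ("pp4096+tg256", 180, 14.1, 22.79),
    ("pp2048+tg256", 182, 15.2, 11.31),
    ("pp2048+tg768", 179, 14.9, 11.49),
    ("pp1024+tg1024", 172, 14.8, 6.01),
    ("pp1280+tg3072", 158, 11.8, 8.15),
    ("pp384+tg1152", 129, 12.8, 3.03),
    ("pp64+tg1024", 98, 12.7, 0.718),
    ("pp16+tg1536", 42, 12.5, 0.443),
]


def summarize(rows):
    """Return the arithmetic mean of prompt tps, generation tps, and TTFT (s)."""
    prompt_tps = mean(r[1] for r in rows)
    gen_tps = mean(r[2] for r in rows)
    ttft_sec = mean(r[3] for r in rows)
    return prompt_tps, gen_tps, ttft_sec


def local_score_guess(prompt_tps, gen_tps, ttft_sec):
    """ASSUMPTION: 10 x geometric mean of the three metrics, with TTFT
    inverted so that a shorter wait raises the score. Not confirmed by
    this page."""
    return 10 * (prompt_tps * gen_tps / ttft_sec) ** (1 / 3)


if __name__ == "__main__":
    prompt_tps, gen_tps, ttft_sec = summarize(ROWS)
    print(f"prompt:     {prompt_tps:.0f} tokens/s")
    print(f"generation: {gen_tps:.1f} tokens/s")
    print(f"TTFT:       {ttft_sec:.2f} sec")
    print(f"score guess from headline values: {local_score_guess(149, 14.5, 7.68):.1f}")
```

Because the table values are already rounded, the recomputed means can differ from the site's internal figures in the last digit, so treat this as a consistency check and not a reimplementation.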
