Apple M4 Max 12P+4E+40GPU Results

Home Latest Results Download About Blog

Apple M4 Max 12P+4E+40GPU

GPU

64

GB

PERFORMANCE OVERVIEW

Model

Llama 3.2 1B Instruct

Q4_K - Medium1.5B

Meta Llama 3.1 8B Instruct

Q4_K - Medium8.0B

Qwen2.5 14B Instruct

Q4_K - Medium14.8B

Prompt Speed

3817tokens/s

566tokens/s

289tokens/s

Generation Speed

180tokens/s

47.9tokens/s

26.3tokens/s

Time to First Token

307ms

2.11sec

4.44sec

LocalScore

1308

235

121

COMPARE MODELS

4 models tested

Select Models

Llama 3.2 1B Instruct

Q4_K - Medium

Meta Llama 3.1 8B Instruct

Q4_K - Medium

Qwen2.5 14B Instruct

Q4_K - Medium

LLaMA v2

Q4_0

Apple M4 Max 12P+4E+40GPU - 64GB