Apple M4 Max 10P+4E+32GPU Results

Home Latest Results Download About Blog

Apple M4 Max 10P+4E+32GPU

GPU

36

GB

PERFORMANCE OVERVIEW

Model

Llama 3.2 1B Instruct

Q4_K - Medium1.5B

Meta Llama 3.1 8B Instruct

Q4_K - Medium8.0B

Qwen2.5 14B Instruct

Q4_K - Medium14.8B

Prompt Speed

3252tokens/s

540tokens/s

282tokens/s

Generation Speed

166tokens/s

45.5tokens/s

25.3tokens/s

Time to First Token

368ms

2.25sec

4.36sec

LocalScore

1136

222

118

COMPARE MODELS

9 models tested

Select Models

Llama 3.2 1B Instruct

Q4_K - Medium

Meta Llama 3.1 8B Instruct

Q4_K - Medium

Qwen2.5 14B Instruct

Q4_K - Medium

Mistral Small 24B Instruct 2501

Q4_K - Small

DeepSeek R1 Distill Qwen 1.5B

Q4_K - Medium

Apple M4 Max 10P+4E+32GPU - 36GB