Apple M1 Max 8P+2E+24GPU Results


Apple M1 Max 8P+2E+24GPU (GPU, 64 GB)

PERFORMANCE OVERVIEW

Model                        Quantization    Size     Prompt Speed     Generation Speed   Time to First Token   LocalScore
Llama 3.2 1B Instruct        Q4_K - Medium   1.5B     1669 tokens/s    106 tokens/s       689 ms                635
Meta Llama 3.1 8B Instruct   Q4_K - Medium   8.0B     306 tokens/s     32.1 tokens/s      3.99 sec              135
Qwen2.5 14B Instruct         Q4_K - Medium   14.8B    140 tokens/s     15.1 tokens/s      8.83 sec              63

COMPARE MODELS

6 models tested


Llama 3.2 1B Instruct (Q4_K - Medium)
Meta Llama 3.1 8B Instruct (Q4_K - Medium)
Qwen2.5 14B Instruct (Q4_K - Medium)
Mistral Small 3.1 24B Instruct 2503 (Q6_K)
Phi 3 Mini 128k Instruct (Q4_K - Medium)
