Apple M3 4P+4E+8GPU Results

Home Latest Results Download About Blog

Apple M3 4P+4E+8GPU

GPU

16

GB

PERFORMANCE OVERVIEW

Model

Llama 3.2 1B Instruct

Q4_K - Medium1.5B

Meta Llama 3.1 8B Instruct

Q4_K - Medium8.0B

Prompt Speed

804tokens/s

125tokens/s

Generation Speed

60.3tokens/s

13.7tokens/s

Time to First Token

1.62sec

10.07sec

LocalScore

310

55

COMPARE MODELS

2 models tested

Select Models

Llama 3.2 1B Instruct

Q4_K - Medium

Meta Llama 3.1 8B Instruct

Q4_K - Medium

Apple M3 4P+4E+8GPU - 16GB