
You'd have to stack 16 of these to get 2TB of VRAM, the equivalent of four 512GB Mac Studios chained together.
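The device counts above follow from simple division against a 2TB target; a quick sketch, assuming 128GB of unified memory per DGX Spark and 512GB per top-spec Mac Studio:

```python
# Back-of-envelope device counts to reach 2 TB of unified memory.
# Assumed capacities: 128 GB per DGX Spark, 512 GB per Mac Studio (top config).
TARGET_GB = 2048

def devices_needed(per_device_gb: int, target_gb: int = TARGET_GB) -> int:
    # Ceiling division: you can't stack a fraction of a device.
    return -(-target_gb // per_device_gb)

sparks = devices_needed(128)    # -> 16
studios = devices_needed(512)   # -> 4
print(sparks, studios)
```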

16 devices compared to 4. Surely even the Spark's much faster networking would degrade with that many devices?

The biggest problem with Macs is that they lack dedicated tensor cores in the GPU, which makes prompt processing very slow compared to Nvidia and AMD.



N.b., there's been some speculation that Apple adding TensorOps to Metal 4 suggests the M5/M6 may get tensor cores.

https://x.com/liuliu/status/1932158994698932505

https://developer.apple.com/metal/Metal-Shading-Language-Spe...


Nice. I hope so. That would make Macs the best local LLM machines for the masses by far.


It’s $12k for each Mac Studio, and the networking makes them effective only individually (it’s less than 15 tokens/s with EXO), while NVLink is very effective. The Spark is definitely more scalable, but the MLX and Metal teams are cooking, so honestly either way is still winning.



