
It’s functional if your goal is to run models that won’t fit into RAM on a single machine. Functional.

The slow interconnects (yes, even 40 Gbps Thunderbolt) severely limit both time to first token (TtFT) and tokens/second.

I tried it extensively for a few days, and ended up getting a single M3 Ultra Mac Studio, and am loving life.


