the GH/GB compute has LPDDR5X - a single or dual GPU shares 480GB, depending on whether it's GH or GB, in addition to the HBM, all connected over NVLink-C2C - it's not bad!
Essentially, the Grace CPU is a memory and IO expander that happens to have a bunch of ARM CPU cores filling in the interior of the die, while the perimeter is all PHYs for LPDDR5 and NVLink and PCIe.
Sure, but 72x Neoverse V2 (roughly Cortex-X3 class) is a choice that seems driven more by convenience than by any real need for an AI server to have tons of somewhat slow CPU cores.
If someone gave me one for free, I'd totally make it my daily driver. I don't do much AI, but I always wanted to have a machine with lots of puny cores since the Xeon Phi appeared.
The justification is that processor cores aren't getting much faster, but they are getting more numerous - even entry-level machines now have between 4 and 8 cores - and adapting code to run across multiple cores is important if we want to utilise them all.