FLOPS are one thing, but 192 GB of unified memory that can be used as VRAM is something else. That could be a big win for inference, where even an RTX 4090 is limited to 24 GB.
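To put rough numbers on that, here is a back-of-the-envelope sketch of how much memory model weights alone would need. The 70B parameter count and the precisions are hypothetical examples, not figures from any benchmark. It also ignores KV cache, activations, and the fact that not all unified memory is necessarily available to the GPU, so treat the results as a lower bound.

```python
def weights_gb(n_params: float, bits_per_param: int) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


def fits(n_params: float, bits_per_param: int, capacity_gb: float) -> bool:
    """True if the weights alone fit in the given memory capacity."""
    return weights_gb(n_params, bits_per_param) <= capacity_gb


if __name__ == "__main__":
    n_params = 70e9  # hypothetical 70B-parameter model (assumption)
    for bits in (16, 8, 4):
        size = weights_gb(n_params, bits)
        print(
            f"{bits:>2}-bit: {size:6.1f} GB | "
            f"fits in 24 GB: {fits(n_params, bits, 24)} | "
            f"fits in 192 GB: {fits(n_params, bits, 192)}"
        )
```

Under these assumptions, the model's weights exceed 24 GB at every precision listed, while all three stay under 192 GB.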
Then there's the power consumption difference to consider. This seems like one of those cases where benchmarks reveal only a fraction of the larger picture.