Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

AMD's MI325 is slower (maybe 2x slower) than Nvidia B100. Sure it's cheaper and maybe consumes less power but you need more racks, more networking, and more labor to get the same performance.


I can't find any information that show a difference as large as 2x. Do you have a specific comparison point in mind?

From Nvidia and AMD, I read sparse fp8 at 7 PFLOPs for B100 [0] vs 5.22 PFLOPs for mi325x [1]

Nvidia doesn't give the dense fp8 so that's the easiest comparison I could get.

[0] https://resources.nvidia.com/en-us-blackwell-architecture [1] https://www.amd.com/en/products/accelerators/instinct/mi300/...


The B100 is only 20% faster than the H200 on average within the same target accuracy. There's nothing revolutionary about this architecture, that's why it's 4nm.

Nvidia drops precision and adjusts via software to sell a new gen.


If they doubled the area to get 20% that's a massive failure.


H100 sparse fp8 FLOPs are 4 PFLOPs, so it's more like 75% increase for B100 v H100.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: