>>Isn't Phi better at dealing with branch-heavy code? This maybe true. Generally...

astrodust · on Oct 14, 2016

If you mean branch prediction, then yes, it will end up inadvertently executing the wrong instructions and discard them, but it halts execution of the wrong branch at the first opportunity and reschedules to correct its mistake.

A GPU will execute both branches to completion.

jasonwatkinspdx · on Oct 14, 2016

No, he means the SSE predication instructions.

The comparison instructions let you build a mask vector, and instructions like blendv use that mask vector to only impact a subset of elements.

It's been a common feature on many RISC and VLIW cpu's over the years. It is in no way unique to GPU's.

astrodust · on Oct 14, 2016

Ah, thanks for clarifying.

gpderetta · on Oct 14, 2016

> GPU's use SIMT

I know very little of GPU architectures, but I tought that the last few generations of GPUs were straightforward SIMDs (i.e. all lanes are run in lockstep and divergence is handled at an higher level)