FSIN *only* works on x87 registers which you will rarely use on AMD64 systems --...

stephencanon · on March 29, 2022

Moving between x87 and xmm registers is actually fairly cheap (it's through memory, so it's not free, but it's also not _that_ bad). FSIN itself is catastrophically slow.

jhgb · on March 29, 2022

Fair enough, and I imagine there may even be some forwarding going on? There often is when a load follows a store, if I remember correctly. (Of course this will be microarchitecture-dependent.)