It is not. It's much slower. But it doesn't require you to use a canary browser with a command-line flag, so that's that.
EDIT: I guess I should give credit here... 100-200 ms/token (roughly 5-10 tokens/s) with a 1.7B model is not slow. I'd love to run a benchmark against WebGPU and see how it compares.