You can indeed, and should, assume there is a heavy JIT component to it.
At the same time, it is important to note that this is geared for already highly parallel code.
In other words, while the JIT can in principle be applied to all code, the nature of accelerated HW is that it makes sense mainly where embarrassingly parallel workloads are involved.
Having said that, NextSilicon != GPU, so it takes a different approach to accelerating said parallel code.
Yeah, it's an unfortunate overlap.
The Mill-Core, in NextSilicon terminology, is the software-defined "configuration" of the chip, so to speak: it represents the swaths of the application deemed worthy of acceleration, as expressed on the custom HW.
So really the Mill-Core is, in a way, the expression of the customer's code.
So, a Systolic Array[1] spiced up with a pinch of control flow and a side of compiler cleverness? At least that's the impression I get from the servethehome article linked upthread. I wasn't able to find non-marketing, better-than-sliced-bread technical details from 3 minutes of poking at your website.
I can see why systolic arrays come to mind, but this is different.
While there are indeed many ALUs connected to each other in a systolic array and in a data-flow chip, data-flow is usually more flexible (at a cost of complexity) and the ALUs can be thought of as residing on some shared fabric.
Systolic arrays often (always?) have a predefined communication pattern and are often used in problems where data that passes through them is also retained in some shape or form.
For NextSilicon, the ALUs are reconfigured and rewired to express the application (or parts of it) on the parallel data-flow accelerator.
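To make the contrast concrete, here's a toy Python sketch of the fixed, wavefront-style communication pattern of a systolic-array matrix multiply (purely illustrative of the classic design, not NextSilicon's hardware): each PE holds one output cell and only ever sees operands passed by its neighbours, whereas a data-flow fabric would rewire those connections per application.

```python
# Toy output-stationary systolic array: each PE(i, j) holds one output
# accumulator, and one "wavefront" of operands streams past per cycle.
def systolic_matmul(a, b):
    n, m, p = len(a), len(b), len(b[0])
    acc = [[0] * p for _ in range(n)]           # one accumulator per PE
    for k in range(m):                          # k-th wavefront (cycle)
        for i in range(n):                      # row operands flow "right"
            for j in range(p):                  # column operands flow "down"
                acc[i][j] += a[i][k] * b[k][j]  # fixed neighbour-to-neighbour hop
    return acc

print(systolic_matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
# → [[19, 22], [43, 50]]
```

The point of the sketch: the loop structure (who talks to whom, and when) is baked in, which is exactly the "predefined communication pattern" mentioned above.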
My understanding is no, if I understand what people mean by systolic arrays.
GreenArray processors are complete computers with their own memory and running their own software. The GA144 chip has 144 independently programmable computers with 64 words of memory each. You program each of them, including external I/O and routing between them, and then you run the chip as a cluster of computers.
Text on the front page of the NS website* leads me to think you have a fancy compiler: "Intelligent software-defined hardware acceleration". Sounds like Cerebras to my non-expert ears.
Thank you for writing the obvious.
Instruction byte count is 100% the wrong metric here.
Instruction count (given reasonable decoding/timing constraints) is the thing to optimize for, and indeed variable-length encoding is very bad.
Instruction byte count matters quite a lot when you're buying ROM in volume. And today, the main commercial battleground for RISC-V is in the microcontroller space, where people care about these things.
For those of us without the expertise, could you elaborate on why that is?
On the one hand we have byte count, with its obvious effect on cache space used.
But to those of us who don't know, why is instruction count so important?
There's macro-op fusion, which admittedly would burn transistors that could be used for other things. Could you elaborate why it's not sufficient?
And then there's the fact that modern x86 does the opposite of macro-op fusion, by actually splitting CISC instructions up into micro-ops. Why would it be so bad if they were more micro-ops to start with, given that Intel chooses to do this anyway?
For those not understanding the context of the parent's comment, this HN post originally linked to @damageboy's https://twitter.com/damageboy/status/1194751035136450560 tweet showing a 20% performance hit, but was later changed by mods to link to the phoronix.com article.
Fault? He is getting free publicity to the point that he is even on the front page of HN (not that he cares about this specifically). Show me the last time that happened with a French book up for a prize.
Would be interesting to see how xsv compares to miller (https://johnkerl.org/miller/doc/index.html) in terms of perf; this tool arrives exactly as I am about to munge 1TB of gzipped CSV files.
Unfortunately, the main operation I need is not supported by xsv...