I ported pigz from Unix to Windows

igrunert · 2025-06-23T22:40:15 1750718415

I recently ported WebKit's libpas memory allocator[1] to Windows, which used pthreads on the Linux and Darwin ports. Depending on what pthreads features you're using it's not that much code to shim to Windows APIs. It's around ~200 LOC[2] for WebKit's usage, which a lot smaller than pthread-win32.

[1] https://github.com/WebKit/WebKit/pull/41945 [2] https://github.com/WebKit/WebKit/blob/main/Source/bmalloc/li...

kjksf · 2025-06-24T10:00:55 1750759255

At the time (11 years ago) I wanted this to run on Windows XP.

The APIs you use there (e.g. SleepConditionVariableSRW()) were only added in Vista.

I assume a big chunk of pthread emulation code at that time was implementing things like that.

malkia · 2025-06-23T23:37:27 1750721847

These VirtualAlloc's may intermittently fail if the pagefile is growing...

igrunert · 2025-06-24T00:04:23 1750723463

Ah yeah, I see Firefox ran into that and added retries:

https://hacks.mozilla.org/2022/11/improving-firefox-stabilit...

Seems like a worthwhile change, though I'm not sure when I'll get around to it.

account42 · 2025-06-24T08:26:32 1750753592

This is something you also need to do for other Win32 APIs, e.g. file write access may be temporarily blocked by anti-virus programs or whatever and not handling that makes unhappy users.

adzm · 2025-06-23T22:42:07 1750718527

Never knew about the destructor feature for fiber local allocations!

andy99 · 2025-06-23T23:08:40 1750720120

I'm a big fan of pigz, I discovered it 6 years ago when I had some massive files I needed to zip and and 48 core server I was underutilizing. It was very satisfying to open htop and watch all the cores max out.

Edit: found the screenshot https://imgur.com/a/w5fnXKS

itsthecourier · 2025-06-24T00:02:09 1750723329

that was a big big file indeed

haunter · 2025-06-23T23:32:15 1750721535

Very old post, needs 2013 in the title

https://web.archive.org/web/20130407195442/https://blog.kowa...

frainfreeze · 2025-06-24T00:35:55 1750725355

Seems to be updated, no?

jwilk · 2025-06-24T09:28:31 1750757311

Not much. The only non-cosmetic difference is:

  -Premake supports Visual Studio 2008 and 2010 (and 2012 supports 2010 project files via conversion).
  +Premake supports latest Visual Studio 2018 and 2022 project files via conversion).

kjksf · 2025-06-20T13:00:04 1750424404

Worth mentioning that this is only of interest as technical info on porting process.

The port itself is very old and therefore very outdated.

ZoomZoomZoom · 2025-06-23T22:11:15 1750716675

Perhaps it's worth it adding this as a note at the top of the post, maybe mentioning alternatives, such as an Actually Portable™ build of `pigz`[1] or just a windows build of zstd[2].

[1] https://cosmo.zip/pub/cosmos/tiny/pigz

[2] https://github.com/facebook/zstd/releases/latest/

lelandbatey · 2025-06-23T22:52:10 1750719130

I don't think the port itself is very old. The latest version of original pigz seems to have been released in 2023 [1], and the port seems to be of pigz from around that time[2]

[1] - https://zlib.net/pigz/

[2] - https://github.com/kjk/pigz/commits/master/

mid-kid · 2025-06-24T07:56:54 1750751814

I'm not sure how willing I'd be to trust a pthread library fork from a single no-name github person. The mingw-w64 project provides libwinpthread, which you can download as source from their sourceforge, or as a binary+headers from a well-known repository like msys2.

account42 · 2025-06-24T08:24:07 1750753447

> Porting pthreads code to Windows would be a nightmare.

Porting one application using pthreads to use the Win32 API directly is however a lot more reasonable and provides you more opportunity to deal with impedance mismatches than a full API shim has. Same goes for dirent and other things as well as for the reverse direction. Some slightly higher level abstraction for the thnings your program actually needs is usually a better solution for cross-platform applications than using one OS API and emulating it on other systems.

themadsens · 2025-06-23T21:03:44 1750712624

I wish premake could gain more traction. It is the comprehensible alternative to Cmake etc.

account42 · 2025-06-24T08:30:11 1750753811

I'd rather everyone use CMake than have to deal with yet another build system. Wouldn't be so bad if build systems could at least agree on the user interface and package registry format.

beagle3 · 2025-06-23T21:32:06 1750714326

Xmake[0] is as-simple-as-premake and does IIRC everything Premake does and a whole lot more.

[0] https://xmake.io/

PeakKS · 2025-06-23T23:26:29 1750721189

It's 2025, just use meson

nly · 2025-06-24T00:24:32 1750724672

Completely useless in an airgapped environment

throwaway2046 · 2025-06-24T05:02:39 1750741359

Could you elaborate on that?

carlmr · 2025-06-24T05:20:44 1750742444

I'm guessing it needs internet for everything and can't work with local repositories.

account42 · 2025-06-24T08:31:30 1750753890

Not really a fan of Meson but I doubt that that's the case as it is very popular in big OSS projects and distributions aren't throwing a fit.

PeakKS · 2025-06-25T02:47:04 1750819624

kristianp · 2025-06-23T21:17:52 1750713472

Repository link: https://github.com/kjk/pigz

nialv7 · 2025-06-24T01:28:57 1750728537

The best kind of porting - other people have already done most of the work for you!

anilakar · 2025-06-24T06:54:15 1750748055

Pigz? Good old Pigzip? :)

https://pc-freak.net/files/hackles.org/cgi-bin/archives.pl%3...

ObscureScience · 2025-06-24T07:01:49 1750748509

I don't see any relation. Pigz is a multithreaded reimplenentation of gzip (drop in replacement)

jqpabc123 · 2025-06-20T13:03:36 1750424616

This is clearly aimed at faster results in a single user desktop environment.

In a threaded server type app where available processor cores are already being utilized, I don't see much real advantage in this --- if any.

GuinansEyebrows · 2025-06-23T20:56:21 1750712181

depends on the current load. i've worked places where we would create nightly postgres dumps via pg_dumpall, then pipe through pigz to compress. it's great if you run it when load is otherwise low and you want to squeeze every bit of performance out of the box during that quiet window.

this predates the maturation of pg_dump/pg_restore concurrency features :)

ggm · 2025-06-23T23:32:34 1750721554

Not to over state it, embedding the parallelism into the application drives to the logic "the application is where we know we can do it" but embedding the parallelism into a discrete lower layer and using pipes drives to "this is a generic UNIX model of how to process data"

The thing with "and pipe to <thing>" is that you then reduce to a serial buffer delay decoding the pipe input. I do this, because often its both logically simple and the component of serial->parallel delay deblocking on a pipe is low.

Which is where xargs and the prefork model comes in, because instead you segment/shard the process, and either don't have a re-unification burden or its a simple serialise over the outputs.

When I know I can shard, and I don't know how to tell the appication to be parallel, this is my path out.