Parallel STL

askee · on May 9, 2015

GCC already has something similar, the parallel mode [1]. It is based on the Multi-Core STL (MCSTL) developed at Karlsruhe University. You can also find some publications at [2]. As far as I know, this already works quite well.

[1] https://gcc.gnu.org/onlinedocs/libstdc++/manual/parallel_mod...

[2] http://algo2.iti.kit.edu/singler/mcstl/

chrisseaton · on May 8, 2015

Yet another parallel for-loop.

If you have a finite number of totally independent, load-balanced operations to run in parallel, then you don't really have a parallelism problem in the first place.

We need libraries to help us where the operations are not independent and not balanced.
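To illustrate just the "not balanced" half of that, here is a rough sketch of dynamic scheduling: instead of giving each thread a fixed chunk, the threads pull the next index from a shared atomic counter, so one slow item doesn't hold up a whole chunk. `dynamic_parallel_for` is a hypothetical helper name, not something from the submitted library or the proposal, and the sketch assumes `body` does not throw. It does nothing for operations that depend on each other.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <system_error>
#include <thread>
#include <vector>

// Hypothetical helper: calls body(i) once for each i in [0, n).
// Indices are handed out one at a time from a shared counter, so a thread
// that finishes a cheap item immediately picks up the next one.
// Assumption: body must not throw and must be safe to call concurrently.
template <typename Body>
void dynamic_parallel_for(std::size_t n, unsigned workers, Body body) {
    assert(workers >= 1);
    std::atomic<std::size_t> next{0};

    // Each participant repeatedly claims the next unclaimed index.
    auto drain = [&] {
        for (std::size_t i = next.fetch_add(1); i < n; i = next.fetch_add(1)) {
            body(i);
        }
    };

    std::vector<std::thread> pool;
    pool.reserve(workers - 1);
    try {
        for (unsigned w = 1; w < workers; ++w) pool.emplace_back(drain);
    } catch (const std::system_error&) {
        // Could not start more threads; the ones already running still help.
    }

    drain();  // the calling thread participates too
    for (auto& t : pool) t.join();
}
```

Handing out one index at a time costs an atomic operation per item, so for very cheap items you would claim small batches instead.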

humanrebar · on May 9, 2015

Except this is an implementation of a proposed standard parallel for loop. The main purpose of a standard is to codify old-hat technologies so people don't have to reinvent the wheel anymore.

lorenzhs · on May 9, 2015

But this is not another #pragma omp parallel for. It's a parallel implementation of STL algorithms such as std::sort and std::nth_element, so you can easily replace your sequential calls with multithreaded versions. Like the top post, I would recommend having a look at GNU's parallel mode (aka MCSTL, Multi-Core STL). Peter Sanders recognized the potential of such an implementation very early; his group published the first version of MCSTL in 2006: http://algo2.iti.kit.edu/singler/mcstl/

Sadly, it looks like the GNU folks didn't continue developing it after they integrated it. It still exists, though.
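To make the drop-in idea concrete, here is a minimal sketch of what a parallel sort does behind the same kind of interface: sort two halves concurrently, then merge. `two_way_parallel_sort` is a hypothetical name and the cutoff is arbitrary; this is not how MCSTL or the submitted library implements it, since real implementations use many threads and much smarter splitting. As far as I know, the actual call in GNU's parallel mode is `__gnu_parallel::sort` from `<parallel/algorithm>`, built with `-fopenmp`.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <future>
#include <system_error>
#include <vector>

// Hypothetical helper, for illustration only: sorts v by sorting its two
// halves concurrently and merging them. Falls back to a plain std::sort for
// small inputs or when no extra thread can be started.
template <typename T>
void two_way_parallel_sort(std::vector<T>& v) {
    const std::size_t kSequentialCutoff = 1 << 12;  // arbitrary threshold
    if (v.size() < kSequentialCutoff) {
        std::sort(v.begin(), v.end());
        return;
    }

    auto mid = v.begin() + static_cast<std::ptrdiff_t>(v.size() / 2);
    try {
        // Sort the left half on another thread while this thread sorts the right half.
        auto left = std::async(std::launch::async,
                               [&v, mid] { std::sort(v.begin(), mid); });
        std::sort(mid, v.end());
        left.get();  // wait for the left half to finish
    } catch (const std::system_error&) {
        // Thread creation failed: do the whole job sequentially instead.
        std::sort(v.begin(), v.end());
        return;
    }

    // Both halves are sorted; merge them in place.
    std::inplace_merge(v.begin(), mid, v.end());
    assert(std::is_sorted(v.begin(), v.end()));
}
```

The caller changes one line, `std::sort(v.begin(), v.end())` to `two_way_parallel_sort(v)`, which is the whole appeal compared with restructuring a loop around a pragma.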

malkia · on May 9, 2015

Spot on. There needs to be either some kind of OS-level control (Grand Central) or tweaks through the environment, like OpenMP, where you set in advance how many threads are to be used by the process.
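The environment approach is roughly what OpenMP does with OMP_NUM_THREADS. A minimal sketch of the same idea for a library of your own, assuming a hypothetical helper `worker_count` and whatever variable name you pick (neither comes from OpenMP, PPL, or the submitted library):

```cpp
#include <cassert>
#include <cstdlib>
#include <thread>

// Hypothetical helper: returns the worker count requested through the
// environment variable `name`, or the hardware thread count if the variable
// is unset or is not a positive integer.
unsigned worker_count(const char* name) {
    assert(name != nullptr);

    unsigned fallback = std::thread::hardware_concurrency();
    if (fallback == 0) fallback = 1;  // hardware_concurrency may return 0 when unknown

    const char* raw = std::getenv(name);
    if (raw == nullptr) return fallback;

    char* end = nullptr;
    long parsed = std::strtol(raw, &end, 10);
    // Reject empty strings, trailing junk, and non-positive values.
    if (end == raw || *end != '\0' || parsed <= 0) return fallback;
    return static_cast<unsigned>(parsed);
}
```

This only fixes the count per process up front. It can't react to what other processes are doing, which is where OS-level scheduling comes in.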

I think Microsoft's PPL had something where it would have cooperated with the OS, but things did not work out as expected and it wasn't delivered. Or I could be completely wrong. Some links here:

https://msdn.microsoft.com/en-us/library/ee207192.aspx

https://msdn.microsoft.com/en-us/library/dd984036.aspx

jfbastien · on May 9, 2015

You're thinking of C++'s "executor", which is currently being discussed for addition to the standard. The parallel STL will use executors once they're added to a technical specification. There's agreement on how this integration will work, and that it's the right thing to do, but executors don't have a fully agreed-upon API yet.

We (the C++ standards committee) discussed these things further this week :-)

malkia · on May 9, 2015

Ah, thanks for correcting me. I read about it a year or more ago in a short article, and I couldn't find it again.

jfbastien · on May 9, 2015

There are a lot of papers to follow, unfortunately. Listed here: http://open-std.org/jtc1/sc22/wg21/docs/papers/2014/ http://open-std.org/jtc1/sc22/wg21/docs/papers/2015/

You'll want to follow mostly Mysen's and Kohlhoff's proposals on executors.

jevinskie · on May 9, 2015

I was going to suggest GCD as well. IIRC its FreeBSD implementation is great and its Linux port is adequate. But yes, queues, producers, consumers, and OS-level parallelism are the way to go!

on May 8, 2015

[deleted]

chrisseaton · on May 8, 2015

It's just that we see so many parallel collections, parallel streaming, parallel map, and parallel for-loop efforts, and they always rely on the problem being embarrassingly parallel in the first place!

_random_ · on May 9, 2015

> Yet another parallel for-loop.

I wonder what your opinion of JS MVC frameworks is :).

twotwotwo · on May 8, 2015

Can't help noting: I recently put up a parallel radix sort and quicksort for Go at https://github.com/twotwotwo/sorts if you're into such things.

polskibus · on May 9, 2015

How does it differ from Intel's Threading Building Blocks?

eps · on May 9, 2015

Good question.

You are clearly familiar with Intel's Threading Building Blocks. Why don't you read up on the submitted implementation and answer your own question here for everyone's benefit?

CyberDildonics · on May 10, 2015

Because someone might already know the answer.

lorenzhs · on May 9, 2015

TBB is much more low-level. This is a parallel version of the STL (including <algorithm>) and much easier to use as a drop-in replacement.

alamaison · on May 8, 2015

Isn't this from March 2014?