Hacker News

Yeah, reminds me of the ancient OkCupid data-analysis blogs, and not the creepy one by sleep8. The group I'm surprised not to see represented in their analysis is "personal", where people I know use ChatGPT as a therapist/life coach/SMS analyzer and editor. And of course they crucially but understandably left off the denominator: 35% of a million requests is different from 35% of a billion. Also, how many of the conversations had 1 message, indicating "just testing", vs. 10 or 100 messages?



> not the creepy one by sleep8

What are you referring to?


Oh I guess it was just a tweet, but still.

https://www.404media.co/ceo-reminds-everyone-eightsleep-pod-...


> 35% of a million requests is different than 35% of a billion.

Not statistically.


A mentor I respect memorably explained to young me that “it doesn’t matter how big the pot of soup, you can use the same size spoon to taste it.”


Sorry, but that mentor has a small practical imagination: a pot can be so large that the top three feet you can reach with that spoon could be all oil.


True! Consistency and representativeness matter, in soup samples as in social samples!

Is the soup smooth or lumpy? Striated or uniform? For that matter a soup could (and often does) involve huge soup bones that give it important parts of its flavor, but never show up directly in a spoonful. And you might need something different from a spoon to convincingly rule out some specific rare lumpy ingredient.

The didactic value of sampling the soup pot goes well beyond its basic function: correcting the beginner’s misperception that a sample’s statistical power is directly related to population size :)
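To push the analogy into code: here's a toy simulation (pot composition and sample sizes invented for illustration) showing that a biased spoon stays wrong no matter how many spoonfuls you take, while even a modest well-mixed sample lands near the pot's true composition.

```python
import random

random.seed(0)

# Hypothetical pot: 90% soup (1), with a 10% oil layer (0) floating on top.
pot = [1] * 900_000 + [0] * 100_000   # true soup fraction = 0.9

def spoon(pot, n, top_only=False):
    """Mean of n random spoonfuls; a short spoon only reaches the top 10%."""
    region = pot[-100_000:] if top_only else pot
    return sum(random.choice(region) for _ in range(n)) / n

print(f"well stirred,   n=2,000:   {spoon(pot, 2_000):.3f}")                  # close to 0.9
print(f"top layer only, n=100,000: {spoon(pot, 100_000, top_only=True):.3f}") # exactly 0.000
```

The second estimate is exactly wrong despite the 50x larger sample: more spoonfuls of the same oil layer don't help.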


To push this analogy too far: that's because you didn't stir it well, not because the spoon is too small.


Have to sample to see if it’s stirred well enough.


No, you can model whether stirring actions should create a representative sample


Not with immiscible layered stratified flow…

“You're gonna need a bigger spoon!”


35% of a million students in the USA is very different to 35% of a billion students across the USA, Europe and Africa.

Since there aren't a billion students in the USA, 35% of them is an impossibility.

If you scale your population above some recognized boundary, you aren't sampling in the same space any more. After all, the local star density out to 1 AU tends very strongly to 1; that's not indicative of the actual star density in the Milky Way.


Yes, statistically. What do you think "statistically" means?


What do you mean by “statistically”? The end results would be like three orders of magnitude apart. Wouldn’t the desired sample size depend on the size of the population itself?


> Wouldn’t the desired sample size depend on the size of the population itself?

No. The most important thing is that the sample is representative of the population: you have to make sure it isn't obviously biased in some way (e.g., surveying only university students and extrapolating to the entire population of the country). Beyond that, the desired sample size levels off quickly.

5,000 respondents (drawn the same way) won't be any more or less accurate for a population of 10M than for 1M.

Of course, if you just ask everyone, or almost everyone, then you no longer need to worry about distribution, but yeah.
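The "levels off quickly" claim is the finite-population correction at work. A quick sketch (sample size and population sizes chosen arbitrarily for illustration):

```python
import math

def margin_of_error(n, N, p=0.5, z=1.96):
    """95% margin of error for a proportion p estimated from a simple
    random sample of size n drawn from a population of size N."""
    se = math.sqrt(p * (1 - p) / n)        # standard error, infinite population
    fpc = math.sqrt((N - n) / (N - 1))     # finite-population correction
    return z * se * fpc

# The same 5,000-person sample against wildly different population sizes:
for N in (1_000_000, 10_000_000, 1_000_000_000):
    print(f"N={N:>13,}: +/-{margin_of_error(5_000, N):.2%}")
```

All three come out around +/-1.4%: once n is tiny relative to N, the population size barely moves the error bar.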




