Interesting that they mention the medical case, when there's some psychological work around the idea that we should present these cases in terms of natural frequencies instead of Bayes' theorem.
The natural frequencies approach is to say "if 10000 people take the test, 100 will have cancer. Of them, 99 will get an accurate positive test, and 1 will have a false negative test. Of the other 9900, 99 will receive a false positive, and 9801 will receive a correct negative. What are the odds that someone who has a positive test has cancer?"
It turns out that doctors and other professionals whose core professional competency doesn't concern probability do terribly when presented with percentages and Bayes' theorem, but can handle natural frequencies quite well (here's one quick summary: http://opinionator.blogs.nytimes.com/2010/04/25/chances-are/...).
As is obvious, this isn't an argument that Bayes' theorem is wrong--it's a theorem after all. It's an argument about which types of reasoning people can be easily taught.
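For anyone who wants to check the arithmetic, here's a rough sketch (plain Python, using the hypothetical 1% base rate and 99%-accurate test from the example above) showing that Bayes' theorem and the natural-frequency count give the same answer:

    # Hypothetical numbers from the example above: 1% prevalence,
    # 99% sensitivity, 1% false-positive rate.
    p_disease = 0.01
    p_pos_given_disease = 0.99
    p_pos_given_healthy = 0.01

    # Bayes' theorem on the percentages:
    p_pos = p_disease * p_pos_given_disease + (1 - p_disease) * p_pos_given_healthy
    p_disease_given_pos = p_disease * p_pos_given_disease / p_pos

    # Natural frequencies: count people out of 10,000.
    true_positives = 10_000 * 0.01 * 0.99   # 99 people
    false_positives = 10_000 * 0.99 * 0.01  # 99 people
    freq_answer = true_positives / (true_positives + false_positives)

    print(p_disease_given_pos, freq_answer)  # both 0.5

Same information either way; the frequency version just makes it hard to miss that the true and false positives are equally common.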
It's true that when the question is formed in frequentist terms, the answer is much more intuitive. But is that how the problem occurs in real life? The doctor doesn't see ten thousand people take a test; they see a person take a test and get either a positive or a negative result. The traditional way of forming the problem seems closer to actual experience: 'Your patient tested positive. You know how accurate the test is, and how common the disease is; how likely is it that the result is a false positive?'
I'm not quite sure what you're saying. Doctors don't observe probabilities or enormous frequencies. Either way, there are good odds that this is information that someone is communicating to them, not the result of their personal experience.
If I may rephrase (and steelman) the parent's point:
Reality does not neatly format itself for easy plug-and-chug into your formulas. To appropriately respond to reality, you must be good at recognizing when there's a mapping to a well-tested formula, technique, or phenomenon.
Therefore, if you require problems to be phrased in a way where that mapping is already done for you, you're not good at that domain; blaming the phrasing of the problem is missing the point.
Indeed, 90% of the mental work lies in recognizing such isomorphisms, not in cranking through the algorithm once it's recognized, and this is a hard skill to teach. (Schools that teach how to attack word problems have to rely on crude word-match techniques to identify e.g. when you want to subtract vs add vs divide.)
> The prescription put forward is simple. Essentially, we should all be using natural frequencies to express and think about uncertain events. Conditional probabilities are used in the first of the following statements; natural frequencies in the second (both are quoted from the book):
> The probability that one of these women [asymptomatic, aged 40 to 50, from a particular region, participating in mammography screening] has breast cancer is 0.8 percent. If a woman has breast cancer, the probability is 90 percent that she will have a positive mammogram. If a woman does not have breast cancer, the probability is 7 percent that she will still have a positive mammogram.
> Imagine a woman who has a positive mammogram. What is the probability that she actually has breast cancer?
> Eight out of every 1,000 women have breast cancer. Of these 8 women with breast cancer, 7 will have a positive mammogram. Of the remaining 992 women who don't have breast cancer, some 70 will still have a positive mammogram. Imagine a sample of women who have positive mammograms in screening. How many of these women actually have breast cancer?
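The second statement is just the first one with the percentages multiplied out and rounded; a quick sanity check of the book's numbers (my own sketch, not from the book):

    women = 1000
    with_cancer = women * 0.008                 # 8 women
    true_pos = with_cancer * 0.90               # 7.2, the book rounds to 7
    false_pos = (women - with_cancer) * 0.07    # 69.4, the book says "some 70"

    # Chance that a woman with a positive mammogram has cancer:
    print(true_pos / (true_pos + false_pos))    # ~0.094, i.e. roughly 1 in 11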
Doctors observe a result of the test, and know the basic probabilities (in the example, 99% test accuracy, 1% of population have the disease). The problem is that they [often] draw incorrect conclusions from those observations (99% test accuracy and you tested positive? well then you likely - 99% - have the disease, right?).
The question formed as 'your one patient tested positive' is more immediately relevant, I'd think. The correspondence with actual practice is obvious. The question formed as 'out of 10000 ...' could be remembered as a quirk of statistics, but not actually recalled when someone tests positive for cancer.
Of course, doctors do not randomly assign tests to patients. Their prior that a patient has a disease is a lot higher than the background frequency of it occurring.
Getting them to estimate their prior would be interesting.
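Just to illustrate how much it matters, here's a small sketch (made-up test characteristics: 99% sensitivity, 1% false-positive rate) of how the post-test probability moves with the doctor's pre-test suspicion:

    def post_test_prob(prior, sensitivity=0.99, false_pos_rate=0.01):
        """Probability of disease after a positive test, given a pre-test prior."""
        p_pos = prior * sensitivity + (1 - prior) * false_pos_rate
        return prior * sensitivity / p_pos

    for prior in (0.001, 0.01, 0.1, 0.3):
        print(f"pre-test {prior:.1%} -> post-test {post_test_prob(prior):.1%}")
    # pre-test 0.1% -> post-test 9.0%
    # pre-test 1.0% -> post-test 50.0%
    # pre-test 10.0% -> post-test 91.7%
    # pre-test 30.0% -> post-test 97.7%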
Except when they do. For example, when 100% of men above a certain age are screened for prostate cancer, or 100% of women above a certain age are screened for breast cancer. Both cases spawned major public health campaigns to encourage screening, followed years later by recommendations AGAINST 100% screening, based on the high degree of false positives and unnecessary treatment.
Other cases that come to mind:
-- doctors who offer "full body scans" as a part of an executive physical; you're pretty much guaranteed to turn up something that is 2 sigma away from the population norm, somewhere in the body, on such a scan
-- spinal x-rays for back pain. Doctors almost always find something abnormal, and use that to justify the back pain and treat aggressively. But, we don't really have a good prior; if you x-rayed 1000 people off the street, would we find similar abnormalities frequently?
It depends. Some tests are applied without prior suspicion, so you deal with exactly the background frequency. With others, the disease in question is so rare that false positives will dominate even if the doctor has serious suspicions. The second case is the reason for the "think horses not zebras" aphorism.
Doing a few explicit Bayesian calculations can help one internalize just how important that often-forgotten P(A) factor is.
The same applies, by the way, to the antiterrorism security theater - many support it just because they have no intuition (or idea) about base rates.
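To put rough (made-up) numbers on that intuition: even a very accurate screen is swamped by false positives when the base rate is tiny.

    def flags_per_true_case(base_rate, sensitivity=0.99, false_pos_rate=0.01):
        """How many people get flagged for every real case caught."""
        true_pos = base_rate * sensitivity
        false_pos = (1 - base_rate) * false_pos_rate
        return (true_pos + false_pos) / true_pos

    # Rare disease, decent test: roughly 11 flags per real case.
    print(flags_per_true_case(base_rate=1e-3))

    # Hypothetical 99%-accurate "terrorist detector", 1-in-a-million base rate:
    # roughly 10,000 innocent people flagged for every real hit.
    print(flags_per_true_case(base_rate=1e-6))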
I think GP's point is that in the case of interacting with an individual patient, the Bayesian conception of probability as a quantification of degree of belief is actually more intuitive than the frequentist conception of probability as a relative frequency of outcomes under repeated hypothetical experimentation.
I suspect that this is simply a sense of scale, with the freq-vs-bayes aspect playing only a minor role.
The normalization that happens when you use percentages and 0.0 -> 1.0 probabilities is often useful, but it can sometimes obscure the magnitude of some relationships. It's easier to get a sense of scale when you say "1 out of every 100,000 people" (which is easily extended to "a handful of people in a large city"). The same information presented as "0.001% of the population" requires the reader to do more mental math to understand how many people could be affected.
Knowing how to interpret false-positives and false-negatives is very important, but doctors are busy people who have to memorize a lot of data. It is probably better for patients if they can at least remember if something is "common" vs "rare" when they need to make a quick decision.
It seems likely that this will help some doctors in at least simple cases, but natural frequencies don't 'compose' well if you are doing multiple tests; you can't use them to compute posterior probabilities when combining a half-dozen different tests, at least not without involving impractically huge numbers or fractional people.
Moreover, the trend in modern medicine is towards combining binary/categorical tests with a range of distributions based on age, gender, race, genetic factors, vitals, continuous lab result values, etc. Yes, we can theoretically delegate some or all of that to software, but that is true of medical diagnosis in general; until we get there, doctors must understand and perform these calculations themselves.
So it seems fine to use natural frequencies to help out, but building Bayesian intuitions early and often seems a better path.
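For what it's worth, the composition problem is much more manageable in odds / likelihood-ratio form; here's a sketch (made-up test characteristics, and it assumes the tests are conditionally independent, which real panels often aren't):

    import math

    def combine_tests(prior, tests):
        """Posterior probability after several conditionally independent tests.

        tests: list of (sensitivity, false_positive_rate, result) tuples,
        where result is True for a positive finding.
        """
        log_odds = math.log(prior / (1 - prior))
        for sens, fpr, positive in tests:
            if positive:
                log_odds += math.log(sens / fpr)              # LR of a positive result
            else:
                log_odds += math.log((1 - sens) / (1 - fpr))  # LR of a negative result
        odds = math.exp(log_odds)
        return odds / (1 + odds)

    # Rare condition, two positives and one negative: posterior ~5%.
    print(combine_tests(0.008, [(0.90, 0.07, True), (0.80, 0.10, True), (0.95, 0.20, False)]))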
Maybe the natural frequency quoted for some people may be completely different from the actual frequency that applies to them, because the people who researched the frequency did their grouping badly.
I'm not sure if that's relevant. It seems like either you know the actual frequency for their subgroup, in which case you redo the numbers in the example, or you don't in which case you can't accommodate that regardless of how you calculate.
A natural frequency is a different way of expressing and reasoning about the same information as percentages.
Well, me neither. If you get the frequency wrong, you'll also get your priors wrong. Natural frequency does not compose well, but that is another problem.
Doctors and professionals are probably not as concerned about Bayes because cancer isn't identically or independently distributed across the population.
Also, endocrinology diagnostic platforms usually don't have identical accuracy rates for positive and negative results, varying by test, machine, controls used, etc.
The reason why doctors and other professionals find it difficult to understand is because they are paid for treating people. As Upton Sinclair observed “It is difficult to get a man to understand something, when his salary depends on his not understanding it.”
I've been saying this for years, and this is a large reason why I find the LessWrong folks to be almost entirely full of it. Their inability to come up with accurate priors is completely lost on many of the folks who follow this kind of thinking.
A couple of comments are saying, "no duh" to this article, but those folks likely don't realize quite how many other people are falling into this trap. "Garbage in, garbage out" is only good advice when the person you're saying it to realizes they're putting garbage in.
FWIW, it seems to me that a major benefit of the Bayesian approach is to make bad reasoning (in the form of, say, an unreasonable prior) transparent and obvious. I've never heard it claimed that the Bayesian approach was robust to sophisticated idiocy (neither on LessWrong nor mainstream writing on Bayesian methods), except in the narrow technical sense that the posterior asymptotically approximates the likelihood given infinite data (provably true under some assumptions but irrelevant to the objections in the OP).
Yup. Actually, a common theme on LessWrong was realizing that with better reasoning tools you're more able to bullshit yourself, and so you need to be extra-careful.
You can't protect yourself 100% - it would require developing ever more powerful reasoning tools in an infinite regress. But what you can do is use introspection, and triple-check your reasoning when it seems to defy common sense or leads you to weird (awful) conclusions.
That's why LW is so big on biases and heuristics by the way - you can treat them as a list of warning signs; if your reasoning seems to match some of them, it's time to take a closer look.
(A thing a lot of people missed since knowing logical fallacies started to become a mainstream thing - you should use the list to check your own reasoning, not your opponent's.)
The problem with trying to rely on heuristics to avoid biases is that people often ignore the biases in their heuristics of choice. To continue the example of LW, there are many people there who seem to think highly of IQ tests, and who ignore the many issues with them (the Flynn effect and the effect of incentives being a couple of examples of the flaws in IQ tests).
Trying to remove biases is great. But there is a problem when someone works to remove some biases, then believes that they are inherently more rational than the public at large, and then uncritically accepts their other biases ("Someone like me who's worked hard to remove these biases must be correct when compared to the biased masses.").
Yes, there is that risk, and no doubt many fall for it. Ego / self-esteem issues may be a big part of it. But then again, every worthy goal poses risks. When you fly a plane, there's a greater risk you'll kill yourself than when you stay on the ground, and yet airplanes are being flown and we're reaping great benefits from it.
RE IQ, personally, I'm 100% confused on the topic. I used to believe that the Flynn effect is basically people getting better at doing tests, but recently I heard that someone controlled for that and the effect remained. So I don't know. The topic is complicated and most of the studies I've heard of are the kind of psychology and social science I implicitly assume is mostly bullshit.
IQ is held in high esteem because research around g is very good and comprehensive, perhaps the crown jewel of psychology. Additionally, most people's knowledge of the Flynn effect is out of date -- recent studies (here's one: http://www.sciencedirect.com/science/article/pii/S0160289615... in fact here's a boatload of references http://www.iapsych.com/iqmr/fe/MasterFlynnEffectreferencelis...) show a rise, a leveling off, and then an overall fall in performance over the last 40 years, rather than the continuous rise (or at least non-decreasing) behavior most people would probably bet on from their layman understanding of the effect. (Additionally, ethnic gaps have remained despite controls for everything, and it is this unfortunate reality that I think is the reason for so much dismissal of IQ...)
When fairly minor monetary incentives (over $10) can lead to a 20 point increase in scores[1], I'd be wary of reading too much into the tests. As for the Flynn effect, I've read a number of different theories, but there doesn't seem to be any consensus. Given the other issues IQ tests have (like the one just mentioned), I'd be wary about assuming that it doesn't stem from underlying problems with the test itself. Some researchers seem to think it stems from familiarity with test taking in general, which seems to match the general understanding that you will do better on IQ tests if you repeatedly take them, referred to as the "practice effect" (IE, IQ tests at least in part measure familiarity with the test).
This study isn't surprising, I'm unsure what you think it implies. Indeed IQ researchers have been wary themselves and known about motivational effects for decades, along with many other objections to testing like cultural bias and the like that in good modern research are all accounted for -- obviously if I sleep during a test I'll score very low, but if I'm actually awake and care I can increase my score by a huge factor. IQ isn't a perfect correlate, but it's common to reason as if it alone has a predictive power of 0.4-0.6 for various important things, that is stronger than any other single factor we know about. When you add in Conscientiousness (the big five traits being the other contender for psychology's crown jewel, I think), which is about grit, intrinsic motivation, and the like, together with IQ (two factors now, not just one), you get predictive powers of 0.7 to educational success. I would wager that the differences seen in the paper cited by your article are almost entirely accounted for by Conscientiousness, but from the abstract (don't have the full paper) it looks like that was not controlled for at all.
The Flynn effect mystery is exactly the same as the mystery that in most countries the younger population is taller than the previous generation. The fact that the Flynn effect only occurs in the bottom half of the intelligence distribution is a bit of a clue as to what the cause is.
What is the cause of the population getting taller?
The brain is just another organ that is affected by nutrition in the same way height is. Improve nutrition and those individuals that are below their genetic potential due to poor nutrition will improve. Guess which half of the population has suffered from poor nutrition in the past?
You should probably know what James Flynn thinks of the Flynn effect. He doesn't try to escape the conclusions of an I.Q. test as much as extend them, despite some of the "paradoxes".
Fair enough. It's also worth pointing out that Alfred Binet, generally considered to be one of the fathers (if not THE father) of intelligence testing, felt that the idea of quantitative intelligence testing was severely flawed (and had fairly harsh words for people who believed that there was largely a single measure of intelligence).
You don't take anything hammered out by reason alone as sound information. You test reasonable-looking propositions against experience, and until they've stood up against that test by providing accurate predictions with substantial information-content, you take them as provisional.
Bayesian reasoning seems great when you first see it. It should be obvious how to apply it to something like a card game. The problem is: how long would it take you to realize a deck of cards was missing the 4 of diamonds? What if the card was lost halfway through the game? How about on the prior hand?
In the end it's stuck at one level of recursion and all facts are fuzzy.
It's stuck in the reality you define - you can add the last hand, all cards seen so far, and the color of people's jackets to the model if you so please.
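Concretely, "adding the cards seen so far to the model" could look something like this sketch (just two hypotheses, full deck vs. missing the 4 of diamonds, with a made-up 1% prior on the missing-card case):

    from math import prod

    def p_missing_4d(cards_seen, prior_missing=0.01):
        """Posterior probability the deck is missing the 4 of diamonds,
        given the distinct cards observed so far (drawn without replacement)."""
        if "4D" in cards_seen:
            return 0.0  # actually seeing the card settles it
        n = len(cards_seen)
        # Likelihood ratio P(draws | 51-card deck) / P(draws | 52-card deck),
        # which telescopes to 52 / (52 - n).
        lr = prod((52 - i) / (51 - i) for i in range(n))
        prior_odds = prior_missing / (1 - prior_missing)
        posterior_odds = prior_odds * lr
        return posterior_odds / (1 + posterior_odds)

    print(p_missing_4d(["AS", "KH", "7C"]))            # ~0.011: barely moves
    print(p_missing_4d([f"c{i}" for i in range(40)]))  # ~0.042 after 40 cards, none of them the 4D

The update itself is mechanical; deciding which hypotheses to put in the model in the first place is where the real work is.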
You're still stuck with: you observed X/Y. How accurate is your count? How accurate is your estimate of accuracy? How accurate is your estimate of the accuracy of your accuracy estimate? ... recursion to infinity.
Did you account for the possibility you're dreaming, or in a simulation? Or in a simulation of a simulation and so on? Yes, that's the curse of real numbers. In practice you set a limit on the precision required, and a threshold of evidence you need to tell you that the precision should be more precise. You can get quite far with Newtonian physics without considering the corrections of general relativity, you can get quite far in engineering by assuming linearity and expanding the Taylor series of a non-linear expression only a little instead of infinitely, and you can do a lot by assuming probabilistic independence of your card counting from what's going on on Pluto right now.
It is going to take me more than a few minutes to parse this. However, I am relatively certain there is a lot of sarcasm in this snippet and I like it.
Except that at least with Bayesian methods the prior is explicitly laid out.
Frequentist methods, when they are used to make predictions and get useful information out of experiments, have hidden implicit priors that bias inferences in opaque ways.
The very honest frequentists will admit that their procedures are only rejecting hypotheses so small as to have no practical utility. Others will use weird and dishonest doublespeak where they call rejecting an insignificantly small hypothesis "statistical significance".
But I suppose they do redeem themselves a bit with the wording "null hypothesis" which candidly conveys the sense of having significantly rejected _nothing_.
> I've been saying this for years, and this is a large reason why I find the LessWrong folks to be almost entirely full of it. Their inability to come up with accurate priors is completely lost on many of the folks who follow this kind of thinking.
I assume you're saying LessWrong folks are more prone to miscalculating priors than most. Could you give some examples of this?
The LessWrong folks aren’t obviously better or worse at calculating priors than anyone else. The “problem” is that their hobby is spending their free time considering outlandish scenarios, inventing arbitrary assumptions related to such scenarios, drawing questionable conclusions, and then convincing themselves that because they used logic and math, their analysis must be correct. Plenty of other folks who spend time on similar activities with a less pseudo-rigorous framing end up as conspiracy theorists or occultists; belief in AI overlords ruling humanity, the technological singularity, cryogenics, or impending 1000-year human lifespans is far from the kookiest thing people convince themselves about.
Oh come on. That's how you're supposed to use math. To aid your thinking. Without it, and considering "outlandish scenarios", we would not have any scientific progress.
Also, this is one strange thing - any time someone asks us (the STEM crowd), "what will I ever use math for in my life?", the default answer seems to be, "it's about having more tools for thinking, and greater clarity of thought; it'll make you smarter". But then, some of us turn around and refuse to acknowledge that people who actually learn math and try to apply it may be getting those promised results. Whether it's LW people, programmers, engineers or scientists, the moment it matters, the default conclusion is that math gives nothing.
What? Who said “math gives nothing”? I spend most of my day building things out of math. I think math and scientific inquiry are basically the most important tools invented/popularized in the past 1000 years.
The lapse here is not math, but rather spending lots of attention on abstract thought disconnected from any kind of reality check. Of course, there’s nothing inherently wrong with speculating sans evidence about the future, it’s generally a harmless hobby. In the best case it makes for fun SF novels. Convincing yourself, still without direct evidence, that your speculation reflects truth implies that something has gone off the rails in the reasoning process, however.
Using statistical analysis to understand real causal relationships in areas we have real data about is damn hard, and even plenty of people who are highly trained as statisticians screw up all the time. Academic fields like comparative politics (to take an example I spent a fair amount of time studying) are rife with poor conclusions drawn from bad analysis. The LessWrong folks are hardly unique in applying logic poorly. But they do tend to tackle more speculative questions and convince themselves more firmly of their conclusions (at least, such is my impression as an outsider).
I think this is a common problem when people working in fields that have somewhat accurate mathematical models look at fields that don't. They often don't realize how hard it is to create an accurate mathematical model for many situations, and assume that the other fields don't have them because the individuals who work in said fields aren't as good at math.
Which is why every so often you'll get things like a physicist spending a couple months studying economics in their free time and deciding that they can now unlock the secret to economics which has eluded economists.
The more you extrapolate the more frequently you will need to adjust your future predictions because of errors in your initial measurements. This goes for any kind of extrapolation (for instance: plotting a course on a map), but it goes even more for extrapolating the future from limited evidence present today. Your 'best guess' might be off by many orders of magnitude if the evidence you have today is only loosely related to the future in terms of importance and where evidence may not be nearly as independent in nature as you currently perceive.
This can lead to your best guess based on available evidence being about as good at predicting the future as randomness in spite of all the apparent effort at making the predictions mathematically sound.
> because they used logic and math, their analysis must be correct
> belief in AI overlords ruling humanity, the technological singularity, cryogenics, or impending 1000-year human lifespans
I don't think anyone on LW believes these are 'facts' that are 'correct'. LW commonly thinks of these ideas as risks / opportunities that might happen (except for the 1000-year human lifespans, which is a new idea to me), and that it's probably worth investing minor amounts of money in case they do. In case of cryonics, that's about $20/month for insurance that covers it; in case of AI safety, that's a couple of people doing research on the problem, and some amount of money sent their way.
The way you phrase it seems like LW people are certain that cryonics will definitely let them be revived after death. That's definitely not the case - in fact, IIRC, on a survey a year or two ago LWers subscribed to cryonics assigned lower probability of it working than ones not subscribed. It's not a cargo-cult.
Substitute “plausible” for “correct” if you want to give them the benefit of the doubt. Either way it all so speculative as to be basically pure fiction. It reads very similar to me to various “scientific” defenses of particular religious traditions.
Again, as I said, I don’t think there’s anything inherently wrong with this. Little communities of people should do whatever harmless hobby is fun for them.
I just don’t find it very interesting or insightful.
If people want to spend time and attention and resources on existential risks, how about the ones which are clear and imminent, like wealth inequality, the retreat of world democracy and increasing power of entirely unaccountable and amoral multinational corporations, or global climate change, e.g. http://www.esquire.com/news-politics/a36228/ballad-of-the-sa...
But these aren't examples so much as vague caricatures. The subject matter that LessWrong considers is certainly unusual, but that alone should not be enough to call it arbitrary, questionable or outlandish.
I don't think it would be fair to malign philosophers because they come up with outlandish scary scenarios that scare people with OCD sometimes. It's not like LW gives Roko significant air time or serious treatment (EY freaking out and deleting it was partially the principle of the thing, partially the fear that somebody might follow this road of thought to come up with something more terrifying, and he doesn't want to take the community there, etc); somebody doing this is generally taken as a sign of serious crankery.
(FWIW, I agree with the top parent post that the hyping of Bayes Theorem is one of the LW foibles. At least the presentation of it.)
It's less about the plausibility of the thought experiment, and more about typical online drama and hysteria that ensued, which sort of belies that LW is made up of mortals like you and me. They aren't hyper-rational machines, after all.
No one claims they are. In fact, if I was to name a single overarching theme of all lesswrong discussions, it would be the fallibility of human reasoning. How is having some reddit-like drama on an open internet forum even relevant?
ps. to belie is to contradict, whereas I think you meant the drama shows that LW is made up of mortals? Just making sure I understood you correctly.
"The subject matter that LessWrong considers is certainly unusual, but that alone should not be enough to call it arbitrary, questionable or outlandish."
"But let me tell you about the time LW experienced internet drama."
>I don't think it would be fair to malign philosophers because they come up with outlandish scary scenarios that scare people with OCD sometimes.
Actually, that sounds like a fine reason to malign philosophers. In order to consider "outlandish scary scenarios", you must first be quite sure that those scenarios are realistic. If they're not, then you're wasting everyone's time.
And yes, if your expected-utility expressions fail to converge because you believe in taking every Pascal's Wager/Mugging scenario into account, or because you don't believe in time, then you've attempted to take the limit of a nonconverging sequence and no amount of philosophizing will help.
This is still working backwards: a thought experiment that increases your chances of being tortured by a future AI? Surely outlandish! But why? Which premises are truly outlandish, arbitrary, etc? What I see given the premises is no more than what Yudkowsky's already said: ".. a Friendly AI torturing people who didn't help it exist has probability ~0, nor did I ever say otherwise."
(However, I agree that to be genuinely distressed by the thought experiment possibility suggests more is going on psychologically than a rational assessment of unknowns, but this seems to be a minority of the community)
So I'm a LessWronger and know a bit about the "movement", and think you are misunderstanding what "LessWrongers think". Obviously not all LessWrongers think the same thing at all, but I'm talking about the average position of the people who believe AI safety should be worked towards.
I'd love to explain the basic position; tell me where you disagree with it. This is the basic position:
1. Intelligence can be created, because there is nothing "special"/"magical" about humans, and our intelligence was eventually created.
2. At some point, humanity will create an "artificial general intelligence". (Since we'll just keep improving science and technology, and there's no fundamental reason why this won't eventually allow us to create an intelligence.)
3. "Artificial general intelligence" basically means a machine that is capable of achieving its goals, where the goals and methods it uses to achieve them are general. I.e. not "is able to play chess really well", but rather "is able to e.g. cure cancer".
4. For various reasons, once we have an artificial intelligence, it will likely become much smarter than us. (There are many reasons and debates about this, but let's just assume that since it's a computer, we can run it much faster than a human. If you dispute this point, we can talk about it more).
5. Something being much more intelligent than us means that, in effect, it has almost absolute power over what happens in the world (like we are basically all-powerful from the vantage point of monkeys, and their fate is totally in our hands).
6. (This is, I believe, the main point): Something being "intelligent" in the sense we're talking about doesn't say anything about what its goals are, or about how its mind works. We're used to everything that's intelligent being a human being, and the way our minds work is basically the same across every human. An artificial intelligence's mind will work completely differently from ours. So if we "tell it" something like "cure cancer", it won't have our intuition and background knowledge to understand that we mean "but don't turn half the world into a giant computer in order to cure it".
7. Combine the two points above, and you get the large idea - whatever the goals of the AI will be, it will achieve them. Its goals won't, by default, be ones that are good for humanity, if only because we have no idea how to program our "value system" into a computer.
8. Therefore, we need to start working on making sure that when AI does come, it's safe. Even if we create an AI, the "extra" problem of making it safe is both hard, and we have absolutely no idea how to do it right now. We have no idea how long AI will take, or how long figuring out safety will take, but since this is a humanity-threatening problem, we should devote at least some resources to working on it right now.
That's it, that's the basic idea. I'd love to hear which part you disagree with. I totally understand that not everyone will agree on some of the final details like, e.g., how many resources we should effectively devote right now (you might even claim it's 0 because anything we do now won't be useful).
But I think the overall reasoning is sound, and would love to hear an intelligent disagreement.
> 1. Intelligence can be created, because there is nothing "special"/"magical" about humans, and our intelligence was eventually created.
Human intelligence evolved through a (very long!) series of natural processes, to the best of my knowledge. To say it was "created" implies something closer to a religious or philosophical opinion, rather than something supported by science.
> 2. At some point, humanity will create an "artificial general intelligence". (Since we'll just keep improving science and technology, and there's no fundamental reason why this won't eventually allow us to create an intelligence.)
This is hugely debatable. Why is AGI inevitable? Even given great amounts of computing resources, an artificial general intelligence does not just automatically appear; it must somehow be designed and programmed. Fields like computer vision have grown tremendously using techniques like deep learning, but there really isn't any evidence that I know of that a general intelligence is any closer than it was 20 years ago.
Totally agree with your first point, I just didn't want to have too many caveats and nitpicking words. If it's not clear, then of course my argument in no way implies that human intelligence was "created" by an intelligence - it evolved. Poor wording aside, my statement remains the same.
"This is hugely debatable. Why is AGI inevitable? Even given great amounts of computing resources, a artificial general intelligence does not just automatically appear [...]"
Well no one thinks AGI will appear without anyone working on it, but lots and lots of people are working on it now. And since there are huge incentives to create one, the belief is that more people will work on it as time goes on.
"[...] there really isn't any evidence that I know of that a general intelligence is any closer than it was 20 years ago."
Well, in some sense I agree, in that we still have no idea how far off AGI is. If it's going to happen in 10 years, we should definitely prepare now. If it's 500 years away, maybe it's too early to think about it. But since neither of us knows, wouldn't you say it's worth putting some effort to working towards safety?
In another sense though, I disagree with you that we're not any closer to AGI. As you said just the sentence before, fields like computer vision have advanced tremendously. While this doesn't necessarily mean AGI is closer, it certainly seems that the fields are related, so advancement in one is a sign that advancement in the other is closer.
Yeah, you go off the rails around step 5. "Something being much more intelligent than us means that, in effect, it has almost absolute power over what happens in the world" makes no sense. Since when does intelligence get you power? Are the smartest people you know also in positions of power? Are the most powerful people all highly intelligent?
"whatever the goals of the AI will be, it will achieve them". Dude, if intelligence meant you could achieve your goals, Hacker News would be a much less whiny place.
"Since when does intelligence get you power?" You hit the nail on the head there. Its about I/O. (Just as its about I/O in the original article - garbage in, garbage out). Jaron Lanier makes this point in.
"This notion of attacking the problem on the level of some sort of autonomy algorithm, instead of on the actuator level is totally misdirected. This is where it becomes a policy issue. The sad fact is that, as a society, we have to do something to not have little killer drones proliferate. And maybe that problem will never take place anyway. What we don't have to worry about is the AI algorithm running them, because that's speculative. There isn't an AI algorithm that's good enough to do that for the time being. An equivalent problem can come about, whether or not the AI algorithm happens. In a sense, it's a massive misdirection."
As I've said before, the singularity theorists seem to be somewhere between computer scientists, who think in terms of software, and philosophers, who think in terms of mindware, and they seem to have a tendency to completely forget about hardware.
There seems to be this leap from 'superintelligent AI' to 'omnipotent omniscient deity', accepted as inevitable by (what for shorthand here is being called) the 'lesswrong' worldview, which seems to ignore the fact that there are limited resources, limited amounts of energy, and limitations imposed by the laws of physics and information that stand between a superintelligent AI and the ability to actuate changes in the world.
You're not engaging with the claim as it was meant. In context, no human being has ever been "much more intelligent" than me. Not in the same way that I am "much more intelligent" than the monkey von Neumann.
You might decide that this means edanm goes off the rails at step four, instead. But you should at least understand where you disagree.
I'm still not sure you could assume ultimate power and achieve everything you desired if you were the only hacker news reader on a planet of 8 billion monkeys.
> I'm still not sure you could assume ultimate power and achieve everything you desired if you were the only hacker news reader on a planet of 8 billion monkeys.
I would think it relatively easy for a human armed with today's knowledge and a reasonable yet limited resource infrastructure (for comparison to the situation of an unguarded AI) to quite easily engineer the demise of primate competitors in the neighborhood. Set some strategic fires, burn down jungles would be the first step. "Fire" might be a metaphor for some technology that an AI might master that humans don't quite have the hang of yet that can be used against them. For example, a significant portion of Americans seem way too easily manipulated by religion and fear, an AI-generated televangelist or Donald Trump figure might be a frightening thought.
Well "is able to e.g. cure cancer" is not actually very general. Which leads to the problem with 2) whats the economics behind creating a general intelligence when a specific intelligence will get you better results in a given industry. Even then specific intelligence is still going to be subject to the good-enough economic plateau that has killed so many future predictions.
Then the problems with 4 on up really concern the speed with which 4 can feasibly happen. The AI-goes-FOOM doomsayers seem to think that we'll end up with an AI which is so horribly inefficient that it will be able to rewrite itself to be super duper intelligent without leaving its machine (and won't accidentally nerf itself in the attempt), and then that super duper intelligent computer will trick several industries into building an even more powerful body for itself, etc... all of this happening before humans pull the plug. No step of this has anything beyond speculation to support it.
On a general note, the full employment theorems mean that even if general AI is economically incentivized, there are still going to be dozens/hundreds/thousands of different AIs carving out niches for themselves, which, given that the earth/universe has limited resources, handily prevents the paper clip maximizer problem. While the future may not need humans, it will still be a diverse future.
1) Define intelligence, knowledge, truth, proof (deductive and inductive)... how do concepts work?, etc. I am not being facetious here. AI is an epistemology problem not a technological one.
2) I agree but we have to solve the problem of induction first but LW/EY are certain that there is no problem of induction. How can one be certain in a Kantian/Popper framework where statements can be proved false but never true?
3a) Here is where we part ways. It is a common assumption that AI implies consciousness but I think that is an unwarranted assumption. Whatever the principles behind intelligence are, we know that conscious minds have found a way to (implicitly) enact them. It does not follow that consciousness is necessary for intelligence (just the biological manifestation of them), and I think good arguments exist to think that they are not correlatives. If they are correlatives, then it will be easier to genetically design better babies, now that evolution is in conscious control, than to start from scratch.
3b) Goals, values, aims, etc. are teleological concepts that apply to living things only because they face the alternative of life or death. Turning off your computer does not kill it in the same sense that a living thing that stops functioning dies forever. 3a) & 3b) diffuses all the scary AI scenarios about AI taking over the world. It does raise the issue of AI in the hands of bad people with evil goals and values, like the dictator of North Korea who now apparently has the H-Bomb. This is the real danger today.
4) I agree. Computer aided intelligence will allow us to accelerate the accumulation of knowledge (and its application to human life) in unimaginable ways. But it will be no more conscious than your (deductive) calculator.
5) Non Sequiturs. Possibly psychological projection of helplessness or hopelessness.
6) As the joke goes, we can always unplug it.
7) Granting your premises then the goal of LW/EY should not be AI but the scientific, rational proof and definition of ethics but their fundamental philosophic premises won't allow it.
8) For me the threat is bad, evil people in possession of powerful technologies.
>Granting your premises then the goal of LW/EY should not be AI but the scientific, rational proof and definition of ethics but their fundamental philosophic premises won't allow it.
That is the goal of MIRI, the organization that EY founded, and is a frequent topic of discussion on LW:
•highly reliable agent design: how can we design AI systems that reliably pursue the goals they are given?
•value learning: how can we design learning systems to learn goals that are aligned with human values?
Not exactly what I meant. What are these human values (for humans not robots) and how do you prove they are rational and scientific? Their goal is to design AI that will accept human goals/values without defining a rational basis for those human values.
I've been saying this forever. Thanks for putting it so succinctly. LessWrong is a cult of people who want to be smart, and they've essentially found a community in which certain assumptions and hypothetical scenarios combined with mathematical concepts make them think they've found the answer to everything in the Universe.
They're no better than any other cult in my book. The problem is that it's only going to get worse with the advances in AI that are going on. Yudkowsky has managed to convince some wealthy people to fund his so-called research, and we have OpenAI operating in the same waters, which somehow gives LW people more legitimacy.
No better than any other cult? How are you deciding that? The LW community hasn't killed people. Doesn't cut people off from their family. Doesn't emotionally/physically abuse people. Etc...
Even if they are a "cult", this puts them miles ahead of other cults, like say, Scientology which has done far, far more harm to people.
I struggle to think in what way LW has harmed anyone at all.
cult |kʌlt|
noun
1 a system of religious veneration and devotion directed towards a particular figure or object: the cult of St Olaf.
• a relatively small group of people having religious beliefs or practices regarded by others as strange or as imposing excessive control over members.
The veneration of Yudkowsky and others in the LW community is more than a bit "religious". So I'd say by definition it's a cult.
LW hasn't done harm to people physically, but what it's done is spawn some very questionable ideas, perpetuate pseudo-science and pseudo-mathematics. The cult leader has no formal training, zero research in peer-reviewed journals and still calls himself a "senior research fellow" in an institute he himself started.
Hell, he even has an introductory religious text - The Sequences and the Methods of Rationality fan fiction (which by the way, he wanted to monetise before the broader fan fiction community stopped him. A clear violation of copyright law).
I'll quote the section titled "More controversial positions"
Despite being viewed as the smartest two-legged being to ever walk this planet on LessWrong, Yudkowsky (and by consequence much of the LessWrong community) endorses positions as TruthTM that are actually controversial in their respective fields. Below is a partial list:
Transhumanism is correct. Cryonics might someday work. The Singularity is near![citation NOT needed]
Bayes' theorem and the scientific method don't always lead to the same conclusions (and therefore Bayes is better than science).[21]
Bayesian probability can be applied indiscriminately.[22]
Non-computable results, such as Kolmogorov complexity, are totally a reasonable basis for the entire epistemology. Solomonoff, baby!
Many Worlds Interpretation (MWI) of quantum physics is correct (a "slam dunk"), despite the lack of consensus among quantum physicists.[23]
Evolutionary psychology is well-established science.
Utilitarianism is a correct theory of morality. In particular, he proposes a framework by which an extremely, extremely huge number of people experiencing a speck of dust in their eyes for a moment could be worse than a man being tortured for 50 years.[24]
Also, while it is not very clear what his actual position is on this, he wrote a short sci-fi story where rape was briefly mentioned as legal.
TL;DR: If it's associated with LessWrong/Yudkowsky, it's probably bullshit.
I cannot judge to which degree these theses are bullshit, but I've found LW a tremendously rich source of thinking tools and I'm convinced that reading or skimming a lot of the sequences has improved my thinking.
Regarding the rape sequence in HPMOR: It's a terribly chosen trope to convey that the fictional society has very different values from ours. Apparently it ties into various parts of the story, so that EY didn't remove it and only toned it down after it was criticized.
> I cannot judge to which degree these theses are bullshit.
Go to the link, go to references, read about them. I'll outline the gist: Most of what Yudkowsky says is extremely sci-fi, no real basis in scientific fact, but stretching the current technological progress to the point where his opinions on things (stuff like transhumanism, singularity) can be justified.
What he's preaching isn't science. Certainly not rigorous experimental science. He (along with Bostrom) tends to engage in extreme hypotheticals. Which, sure, if you're a philosopher, is fine. But even then, wouldn't you want your work to be judged by like-minded peers? But alas, he has a convenient excuse of being "autodidactic" to fall back on, so he can sit in his armchair and critique traditional education, and excuse his lack of peer-reviewed material.
Not to mention, and this is a bit of a pet peeve, I find that most LW people are too self-absorbed, I've literally seen a blog where the person who runs it "warns" the readers that what he writes is too complicated for people to follow. This sort of narcissistic, self congratulatory thinking is what puts me off more than anything. Writing long form posts on the Internet which use complicated words don't make you smart.
> I've found LW a tremendously rich source of thinking tools and I'm convinced that reading or skimming a lot of the sequences have improved my thinking.
There are other ways to improve your thinking. Read books. Read different kind of books, that offer counter point of views. Farnham Street Blog is a good place to start for a list of resources for thinking tools/mental models btw. :)
I don't buy into the necessity that everything has to be peer-reviewed in the old-fashioned way. There is peer-review happening in the comments to some extent. I don't take a fancy to dismissing any radical ideas as pseudoscience. It's just the outer fringe of hypotheses that need to be tested against reality, and as long as they are approximately humanist, enlightened and don't contradict existing physics without depending on mathematics (or disclaimers), I cannot see anything wrong with it. As a naturalist, I pretty much agree with everything I've read on LW so far, except for the parts I cannot judge (like hypotheses about physics), which I allocate weaker priors to, and a few unconvincing pieces.
> Not to mention, and this is a bit of a pet peeve, I find that most LW people are too self-absorbed, I've literally seen a blog where the person who runs it "warns" the readers that what he writes is too complicated for people to follow.
I have not yet experienced that, but there are also a lot of people on reddit and HN that I don't like, yet I differentiate within these communities between what is valuable and what is not.
> Most of what Yudkowsky says is extremely sci-fi, no real basis in scientific fact, but stretching the current technological progress to the point where his opinions on things (stuff like transhumanism, singularity) can be justified.
At the risk of seeming indoctrinated to you, this is what I believe with high certainty: If Moore's law continues another one or two decades, I think the singularity is a very real possibility. The human brain seems to be nothing more than a learning and prediction machine, nothing that transcends what we can understand in principle. Evolution did come up with complex organisms, but the complexity is limited by biochemical mechanisms and availability of energy. In addition, nature often approximates very simple things in overly complicated ways because evolution is based on incremental changes, not on an ultimate goal that prescribes a design of low complexity. I also think that AI will very likely be superintelligent and that poses a tremendous risk in the 10-40 years to come (on the order of atomic warfare and runaway climate change). By the time someone implements an approximately human-level intelligence, we better have a good idea about how to control such a machine.
> There is peer-review happening in the comments to some extent.
Lol. I guess we don't need college education as well then, there's education happening in the comments to some extent. We don't need traditional means of news, there's news happening on Twitter to some extent. I could go on with analogous line of reasoning.
Don't get me wrong, I'm not 100% in favour of the traditional education model as well, but peer reviews exist for a reason. You and I are not experts in these fields. We rely on the expertise of people who have made it their business and life to study these fields based on a rigorous method. Would you try out homeopathy had it not been rejected completely by doctors and scientists but someone on a forum told you it worked for them? What if someone wrote a very long article with fancy words (like LW tends to do) explaining how and why it works (they exist, I assure you)? Would you try it then?
> I don't take a fancy to dismissing any radical ideas as pseudoscience.
Sure, I'm not saying we should be against radical ideas. That's how scientific progress happens. I'm against LW ideas, for which there is no basis in reality as far as we know based on our current understanding of science.
> I differentiate within these communities between what is valuable and what is not.
Indeed. But I'd rather the community's entire existence not depend on bullshit.
> At the risk of seeming indoctrinated to you, ...
a) Keywords: "If", "Seems"
b) Tons of assumptions in that scenario you laid out. If you can't see it, I'm sorry but you're already too far gone.
c) Watch some MIT lectures on computer architectures about how the trend of Moore's law has already radically shifted and is flatlining.
Basically, what you've done is precisely the kind of utter crap that LW perpetuates. "If x keeps happening" without providing any reason as to why that would be true. Make some ridiculous simplifications "complexity is limited by ___", nature often does __ because ___. You basically don't provide any rational reason for why you think AI will be super intelligent and even if it were, why that would be risky. You pick numbers out of a hat (10-40 years to come).
Yes, you look pretty well indoctrinated from where I'm sitting. But I hope you see the many (so many) flaws in that last paragraph of yours (it honestly made me laugh out loud :p)
Predicting the future is hard business -- be it the stock market predicting what happens tomorrow, or weather forecast for the next month. It's presumptuous and hella stupid if you think you can predict where Science and Technology will be x years from now.
> Lol. I guess we don't need college education as well then, there's education happening in the comments to some extent. We don't need traditional means of news, there's news happening on Twitter to some extent. I could go on with analogous line of reasoning.
That's a straw man. I did say it's the fringe and it needs to be tested. I didn't say one should replace the other. Peer-review is essentially just mutual corrections, and there are mutual corrections happening in the comments, just not as thoroughly as when it's institutionalized. Most of it is not new anyway, but just summarizes research results and draws logical conclusions from it (for example this [1]). If it wasn't all brought together on LW, I possibly wouldn't have found out about the wealth of knowledge for a long time.
> a) Keywords: "If", "Seems" b) Tons of assumptions in that scenario you laid out. If you can't see it, I'm sorry but you're already too far gone. c) Basically, what you've done is precisely the kind of utter crap that LW perpetuates. "If x keeps happening" without providing any reason as to why that would be true.
It's very logical. My certainty referred to the implication, but it is hard, of course, to come up with a prior for that 'if': Exponential progress could continue in various ways, e.g. by invention of more energy efficient chips and by scaling them up, by 3D circuitry, molecular assemblers, memristors, or perhaps quantum computing. There are contradicting studies, so one should put P(Moore's law continues for another 10-20 yrs) at perhaps 50%. So, of course, this is all hedged behind this prior (which I think many people get confused by). The discussion is always concerned with implications which can be made with fairly solid reasoning, by assuming that P(..) above to be 100%.
> Make some ridiculous simplifications "complexity is limited by ___", nature often does __ because ___. You basically don't provide any rational reason for why you think AI will be super intelligent and even if it were, why that would be risky.
That's just a basic assumption which I find plausible, and which some respectable and knowledgeable persons find plausible too (for example Stephen Wolfram and Max Tegmark; I am aware that appeal to authority is difficult to argue from, but both have publications which I could also refer to). I agree that mentioning the complexity limitations didn't provide any information, because they don't tell us whether it makes it simple enough for us to understand; it merely says that the complexity is not infinite, so I should have left it out entirely. But this is not at all representative of the best contents on LW; it was poor reasoning on my behalf. Bostrom's book Superintelligence gives a pretty good summary of why it is thought to be plausible.
> You pick numbers out of a hat (10-40 years to come).
That's based on estimates of the processing power required for brain simulations by IBM researchers and Ray Kurzweil. Simple extrapolation of Moore's law shows us that we will reach that point roughly between 2019 and 2025. 40 years is just my bet based on what I know about brain models and current obstacles in AI.
I don't understand how you can be so certain that the hypothetical scenarios they imagine can't possibly happen. Even if it really is laughable, we spend a lot of money on laughable research (homeopathy, anyone?), so why is this case so particularly bad?
Not everyone has to agree. That's why we have the scientific method and research institutes. If the current science shows that something is worth exploring in more detail, that some avenues are worth spending money on, spending money on them makes sense.
Bullshit ideas like the AI apocalypse and singularity, transhumanism, and downloading a brain into a computer do not fit those criteria.
So how do ideas get to the stage where the 'research institutes' agree they're worth investigating?
It seems to me like you're saying "I don't like these ideas, no one should be working on them". That seems a worse principle than "everyone should work on ideas they find worth investigating".
I also have no idea how you distinguish 'Bullshit Ideas' from non-bullshit ideas without investigating them. Your gut is not that good at distinguishing truth from a blunder.
Do priors just start you off closer to the truth? That is to say, if you start with any prior, will enough additional pieces of evidence always let you converge on the truth?
Does anyone commonly set their priors to be a distribution? Perhaps a range or actually a normal distribution to represent a prior with uncertainty?
> That is to say, if you start with any prior, will enough additional pieces of evidence always let you converge on the truth?
Yes, with two caveats. First, you can't have assigned zero probability to the right answer. Which means that if you have a probability distribution over hypotheses, and the right answer wasn't one of the hypotheses in the distribution, you're doomed from the start. The Solomonoff prior is a mathematical construct that gets around this problem by including every hypothesis that can be expressed as a Turing machine, but it gets used more as a philosophical token than an actual computing tool because it's unfeasible to use directly. The second issue is that if you have a bad enough prior, "enough additional pieces of evidence" can be an arbitrarily large amount, and there may only be a limited amount of evidence available to collect. In particular, rerunning an experiment over and over can only provide a limited amount of evidence, because of the possibility that some systematic error affects every instance of the experiment.
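For what it's worth, here's a toy simulation of that convergence (the Beta-Bernoulli setup and all numbers are my own illustration, not anything from the article): two very different priors over a coin's bias end up at nearly the same posterior after enough flips, provided neither prior puts zero mass on the truth.

    import numpy as np

    rng = np.random.default_rng(0)
    true_p = 0.3                        # the "truth" neither person knows
    flips = rng.random(2000) < true_p   # simulated evidence
    heads, tails = flips.sum(), (~flips).sum()

    # Two very different Beta priors over the coin's bias: one agnostic,
    # one confidently (and wrongly) sure the coin favours heads.
    priors = {"agnostic Beta(1,1)": (1, 1), "confidently wrong Beta(50,2)": (50, 2)}

    for name, (a, b) in priors.items():
        post_mean = (a + heads) / (a + b + heads + tails)   # Beta posterior mean
        print(name, "->", round(post_mean, 3))
    # Both land near 0.3; a prior assigning literally zero probability to the
    # truth is the one case that can never recover.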
Ah, I dimly remember that it also has an Occam's razor built in, so the prior probability falls off with the length of the strings that describe the Turing machines?
In my field (Epidemiology), when doing Bayesian analysis, it is very common to set one's priors to be a distribution. Sometimes the point estimate and spread of a previously conducted study or meta-analysis, sometimes merely a uniform distribution with upper and lower bounds ("It is extremely unlikely that the relative risk of disease for this exposure is below 0.01 or above 100...")
It's been argued that frequentist analysis is essentially a Bayesian analysis with a prior distribution centered on zero with bounds from positive to negative infinity.
It has to do with calculus. If the probability density of a result falls off quickly enough as the result goes to positive or negative infinity, the total area under the curve can still come out to 1.
Imagine 1/2 + 1/4 + 1/8 + ... continued to infinity. The sum approaches 1 as the number of terms grows without bound, and calculus gives us the tools to prove limits like that rigorously.
It's more of a philosophical statement than an actual implementation. Mainly that frequentist analysis begins with the prior "I dunno, could be anything..."
Essentially, two genuine Bayesian rationalists (with some hand-wavy preconditions) cannot agree to disagree; i.e., they will eventually converge on the same understanding of an event.
This isn't true, though; the problem is that there are generally more than two possible explanations. For instance, let's imagine both of us are using an experimental telescope to observe whether some event occurs. We are then looking at four possible scenarios: the telescope could work/not work correctly, and the event could happen/not happen. You are confident that the telescope works correctly and also confident that the event will not occur, so you give a high prior probability to the first and a low one to the second. I, on the other hand, think the telescope is rubbish and that the event will almost certainly occur, and do the opposite.
We sit down and wait and do not observe the event. You then come to the conclusion that the event did not occur and the telescope works correctly, while I come to the reverse conclusion.
One of the "hand-wavy preconditions" is that they share priors. This nearly never happens in the real world. Almost all disagreements can be traced to differing priors.
First Qn: yes. But the rabbit is hiding in the 'enough'. Most commonly the poor uses of BT end up with a narratively argued prior + a single suite of evidence.
For a finite set of evidence (particularly chosen by someone with bias), bias + evidence can be arbitrarily far from the truth.
> Does anyone commonly set their priors to be a distribution? Perhaps a range or actually a normal distribution to represent a prior with uncertainty?
Almost everyone does this, and the solution is a posterior probability distribution. Most uses of Bayesian techniques that I'm familiar with are based on Monte Carlo simulations, where priors are drawn from the specified prior distribution, processed in some fashion, and result in samples of the posterior distribution.
This is especially helpful when you don't view the problem as having a 'true' single answer with some 'uncertainty' but instead have actual variability in the system, which is described by the posterior distribution.
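A minimal sketch of that prior-draws-to-posterior-samples workflow (sampling importance resampling rather than a full MCMC run; the Beta prior and the binomial data are made up for illustration):

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(1)

    # Prior belief about an event rate, expressed as a distribution.
    prior_draws = rng.beta(2, 2, size=100_000)

    # Observed data: 7 events in 50 trials (illustrative numbers).
    k, n = 7, 50
    weights = stats.binom.pmf(k, n, prior_draws)    # likelihood of the data under each draw
    weights /= weights.sum()

    # Resample in proportion to the likelihood -> draws from the posterior distribution.
    posterior_draws = rng.choice(prior_draws, size=10_000, p=weights)
    print(np.percentile(posterior_draws, [2.5, 50, 97.5]))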
Thinking about it, I'm really not sure in what instance the prior wouldn't be a distribution. It is, by definition, a probability. I suppose you could have a singular value, where the pdf is a delta function, but what's the point of doing the Bayesian inversion or estimation then? If you have a single value with prior p(x) = 0 or 1, you should end up with the posterior p(x|d)=0 or 1.
So the simplest case is where the prior is a binary: Say p(x=yes) = 0.6, p(x=no)=0.4. Or something like that. It's not a continuous distribution but it's still a distribution. The sum/integral of all cases has to be 1.
The technical term for the condition you're interested in is consistency. There is a theorem (Doob) regarding the consistency of Bayes estimates. [1]
In simple cases it means that under any non-silly prior your posterior will converge to the truth. E.g. mean of a normal distribution with a normal prior.
The complex cases include things like models that expand when given more data.
> if you start with any prior, will enough additional pieces of evidence always let you converge on the truth?
That's the general trend: pooling data tends to make priors converge. However, converging priors isn't quite the same thing as everyone converging on the truth.
Statistical expressions by themselves are an incomplete explanation; we routinely use assumptions about the direction of causality that aren't captured in them. See Judea Pearl's "Why I Am Only a Half-Bayesian" paper for a discussion, as well as an intro to how his framework approaches such independence assumptions.
Bayesianism is a 'grand unified theory of reasoning' holding that all of science should be based on assigning (and updating) probabilities for a list of possible outcomes; the probabilities are supposed to indicate your subjective degree of confidence that a given outcome will occur.
Contrast this with an alternative conception of rationality as espoused by David Deutsch.
David Deutsch, in his superb books 'The Fabric of Reality' and 'The Beginning of Infinity', argued for a different theory of reasoning than Bayesianism. Deutsch (correctly in my view) pointed out that real science is not based on probabilistic predictions, but on explanations. So real science is better thought of as the growth or integration of knowledge, rather than probability calculations.
So what's wrong with Bayesianism?
Probability theory was designed for reasoning about external observations - sensory data. (for example, "a coin has a 50% chance of coming up heads"). In terms of predicting things in the external world, it works very well.
Where it breaks down is when you try to apply it to reasoning about your own internal thought processes. It was never intended to do this. As statistician Andrew Gelman correctly points out, it is simply invalid to try to assign probabilities to mathematical statements or theories, for instance.
Can an alternative mathematical framework be developed, one more in keeping with the ideas of David Deutsch and the coherence theory of knowledge?
I believe the answer is yes, and I am going to sketch the basic ideas for such a framework.
The basic idea is to separate out levels of abstraction when reasoning (or equivalently, levels of recursion). In my proposed framework, there are 3 levels, and each level gets its own measure of 'truth-value'. All reasoning must terminate in a Boolean truth value (True/False) at the base level but the idea is that different forms of reasoning correspond to different levels of abstraction.
For full reflection, you need three different numbers: a Boolean value (T/F) at the base, a probability value (0-1) at the next level of abstraction, and an entirely new measure called conceptual coherence at the highest level of abstraction.
As a rough working definition of conceptual coherence, I would define it thusly:
"The degree to which a concept coheres with (integrates with) the overall world-model."
It should now be clear what's wrong with Bayesianism! It only gets us to the 2nd level of abstraction! There is not just uncertainty about our own knowledge of the world (probability); there is another meta-level of uncertainty: uncertainty about our own reasoning processes, or logical uncertainty. Bayesianism can't help us here. Conceptual coherence can. Let's see how:
All statements of the form:
‘outcome x has probability y’
can be converted into statements about conceptual coherence, simply by redefining ‘x’ as a concept in a world-model. Then the correct form of logical expression is:
‘concept x has coherence value y’.
The idea is that probability values are just special cases of coherence (the notion of coherence is more general than the notion of probabilities).
To conclude, conceptual coherence is the degree to which a concept is integrated with the rest of your world-model, and I think it accurately captures in mathematical terms the ideas that Deutsch was trying to express, and is a more powerful method of reasoning than Bayesianism.
I'm only a bit disappointed that the author seems not to realize that Bayes' theorem is just a simple consequence of probability theory, and should be attractive not because "maybe the brain is Bayesian", but because it is based on sound set-theoretic and analytic principles. If Bayes' theorem is false, so is probability theory, and so is nearly everything we know about probability.
Just because a theorem is true doesn't mean you can't misuse it or that you don't need to do some work to map it to reality.
For example, the Banach-Tarski theorem is solid, but that doesn't mean you can start a business making golf balls by buying one and then endlessly replicating it.
Certainly. Just because you can name a theorem doesn't mean you can derive it either. The article had no actual computation or derivation of the theorem. Instead, it talked about beliefs and other things that don't really exist (in regards to the computation of a probability value).
I guess a hard-line frequentist (if such a person exists) would counter that you can't assign probabilities to hypotheses or fixed parameters. Then Bayes's theorem (and every other statement about probability) is true only when applied to statements about how often a certain event will occur.
But of course, most people do assign probabilities to hypotheses and fixed parameters, even if only informally. Bayesian probability theory is an attempt to formalize that kind of intuitive reasoning.
I have heard of people genuinely saying such a thing. Fortunately, it is nothing but an empty redefinition of the word "probability". In fact, rational degrees of belief in hypotheses do follow the Kolmogorov axioms (as shown by e.g. Cox's theorem or the VNM theorem), and Bayes' theorem does therefore apply. Whether or not someone refuses to call that "probability" makes no difference.
I think it's strange this sudden comeback of a theory that was dismissed more than 70 years ago by Fisher and many others, but no one, as far as I know, cares to explain why Fisher was wrong and why the theory is right. It makes me very suspicious, to be honest.
>I guess a hard-line frequentist (if such a person exists) would counter that you can't assign probabilities to hypotheses or fixed parameters. Then Bayes's theorem (and every other statement about probability) is true only when applied to statements about how often a certain event will occur.
If you model "thinking" and "believing" as sampling in probabilistic programs (which they do in some schools of cognitive science), then Bayes' Theorem becomes a theorem about how often certain execution traces occur when the sampling program is run with fresh randomness. You then need none of the weird metaphysics associated with "subjective Bayesianism".
All probabilistic tools are accurate in theory, otherwise we wouldn't use them. Bayes' theorem is no different from e.g. the t-test in that regard. The question of whether it's worth using Bayes explicitly rather than other tools is and should be a question of whether we find it aligns with our understanding and helps us think more clearly.
Many sincere and intelligent investigators in parapsychology appear to have based their careers on the incorrect use of frequentist statistics.
And it's not just them. Ernest Rutherford, who discovered the atomic nucleus, said, "If your experiment needs statistics, you ought to do a better experiment." In the 1990s I was a physics grad student, and I think none of the professors had ever heard of the idea of a parameter estimator, so we had a bunch of ad-hoc ways to fit power-law coefficients that gave different answers, and no well-thought-out way to judge goodness of fit.
One postdoc in my lab suffered through a difficult job market before finally, after a decade of anxiety and uncertainty, getting a tenure-track position, and he eventually wrote a paper on how to fit power-law curves... in a statistics journal.
And this was in a good department, with people of whom I was proud for both the teaching and the research going on.
I'm a little confused. Are you saying that a tenure-track prof wrote a paper on how to evaluate fitted power-law curves? Was it something else besides least squares? Because I can't possibly see this getting accepted to a statistics journal.
Fitting distributions is a little bit different than the usual model fitting scenario where least squares is appropriate. Sometimes people do things like construct a histogram and then do a least squares fit to the bin heights, but that procedure doesn't satisfy the usual assumptions that justify least squares (observations with independent, equal variance, Gaussian errors).
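As a rough illustration of the difference (my own toy example, not taken from any of the papers mentioned here): for a continuous power law with a known lower cutoff, the maximum-likelihood estimator of the exponent is a one-line formula, and it tends to behave much better than least squares on log-log histogram bin heights.

    import numpy as np

    rng = np.random.default_rng(2)
    alpha_true, xmin, n = 2.5, 1.0, 10_000

    # Draw samples from p(x) ~ x^(-alpha) for x >= xmin via inverse-CDF sampling.
    x = xmin * (1.0 - rng.random(n)) ** (-1.0 / (alpha_true - 1.0))

    # Maximum-likelihood estimate of the exponent (continuous case, known xmin).
    alpha_mle = 1.0 + n / np.sum(np.log(x / xmin))

    # The "naive" approach: least squares on log-log histogram bin heights.
    counts, edges = np.histogram(x, bins=np.logspace(0, 3, 30), density=True)
    centers = np.sqrt(edges[:-1] * edges[1:])
    keep = counts > 0
    slope, _ = np.polyfit(np.log(centers[keep]), np.log(counts[keep]), 1)

    print("MLE estimate of alpha:    ", round(alpha_mle, 3))
    print("Histogram + least squares:", round(-slope, 3))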
Cosma Shalizi has written some interesting posts on this subject, and also published papers in statistics journals:
It is far beyond least squares as it was and is practiced.
There was already stuff in the stats literature in the 1990s that was much better, but people in the physics community (such as myself) were not aware of that literature. On the other hand, stats people were not particularly aware of the way power laws were occurring in physics.
I saw things that did not add up ten years earlier, and Mark Newman did too, but we were both so caught up in the rat race, consensus reality, collective delusion, whatever you call it, that I left physics before I could address the problem, and he suffered through years of bullshit before he could find the time to do something about it.
Watching Mark write great papers, write great book chapters and suffer from tremendous anxiety over his career was a big reason why I left.
If you sample 100 values out of a very large pool, add them up, and then divide by 100, what you get is not the mean of the distribution but an estimate of the mean of the distribution, since you would get a slightly different answer if you picked a different 100.
Often estimators are simple formulas like that (they are for power laws), but there are subtle details; for instance, to estimate the standard deviation in that case you might think you divide by N (100), but you really should divide by N-1 (99).
Back in the 1990s, I and the people around me knew some popular statistics formulas but not the concept of estimating a parameter.
And of course it's not just physics. Social scientists and life science people tend to take a course on statistics, but it is fair to say that the median paper in those fields has some mistake in how it does statistics.
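A tiny illustration of that distinction, with a made-up population (nothing here beyond what the comment above already says): the number you compute from a sample is only an estimate, it changes from sample to sample, and the N vs N-1 choice shows up in the standard deviation.

    import numpy as np

    rng = np.random.default_rng(3)
    population = rng.normal(loc=10.0, scale=2.0, size=1_000_000)

    # Five different samples of 100 give five different estimates of the mean.
    means = [rng.choice(population, size=100, replace=False).mean() for _ in range(5)]
    print("five sample means:", np.round(means, 2))

    sample = rng.choice(population, size=100, replace=False)
    print("std dividing by N:  ", round(sample.std(ddof=0), 3))   # biased low
    print("std dividing by N-1:", round(sample.std(ddof=1), 3))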
> If you get tested again, you can reduce your uncertainty
I've always been bothered by statements like this about medical tests. This assumes that false positives are statistically independent. But isn't it more likely in general that false positives would be highly correlated within individuals, test administrators, or labs? E.g. if the same person takes the same test from the same doctor and sends it to the same lab, it seems extremely unlikely that the results will be independent. And to me at least, it seems highly likely that a false positive will correlate with some aspect of the individual's biology (e.g. some substance in the blood similar to what's being tested for), and as such even using a different doctor/lab would not be all that likely to ameliorate this issue.
That's a nitpick on a correct statement. Unless two tests are always perfectly correlated, you will reduce your uncertainty. They don't need to be independent.
Sorry, I should have provided a more complete quote:
> If you get tested again, you can reduce your uncertainty enormously, because your probability of having cancer, P(B), is now 50 percent rather than one percent. If your second test also comes up positive, Bayes’ theorem tells you that your probability of having cancer is now 99 percent, or .99.
This statement is incorrect if the results are correlated at all.
Exactly. Unless the false positive is somehow causal, testing again will reduce the uncertainty by some amount, though that amount may be less than it would be if the false positives were completely independent.
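One toy way to put numbers on that (my own model and figures, not the article's): suppose that with probability rho the second test simply repeats the first test's outcome (the same interfering factor is still present), and with probability 1 - rho it behaves like an independent retest.

    # Illustrative numbers chosen so that one positive gives ~50% and two
    # independent positives give ~99%, echoing the article's example.
    prior, sens, fpr = 0.01, 0.99, 0.01

    def posterior_after_two_positives(rho):
        # P(both tests positive | ...) under the "repeat with probability rho" toy model
        p_pp_cancer = sens * (rho + (1 - rho) * sens)
        p_pp_healthy = fpr * (rho + (1 - rho) * fpr)
        num = prior * p_pp_cancer
        return num / (num + (1 - prior) * p_pp_healthy)

    for rho in (0.0, 0.5, 1.0):
        print(f"rho = {rho}: P(cancer | two positives) = {posterior_after_two_positives(rho):.2f}")
    # rho = 0.0 -> 0.99 (independent retest); rho = 1.0 -> 0.50 (the retest adds nothing)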
I think this is a good question - I hope someone who actually knows about this stuff chimes in with some answers.
In the mean time as a thought experiment, going with a blood test example:
* Levels of the compound being tested may rise or fall naturally, and the test checks for a certain concentration or above.
* The patient may have the condition, but not have had it long enough for markers to have risen to "trigger" levels.
* The patient may not exactly follow pre-draw instructions on eating etc., skewing results.
* In the case of a false negative: the person's immune response may be temporarily suppressing the marker.
* In the case of something like cell counts: this sample could just be a random local variation.
* The tech or doctor or whoever could randomly make a mistake on one sample, but not on each sample.
And so on. My devil's advocate point here is that the patient, doctor, lab, and so on are not deterministic code - there are a lot of random inputs in the entire process chain.
I think the idea is that, if you retake the test, your system will be in a different state (our organism's biochemistry fluctuates naturally). So if something in your system caused a false positive, there's a chance it won't be there when you retake the test and you will get a true negative.
Basically, you're thinking of your blood composition as stateless, when it may also be counted as an "external factor".
Ya, that is definitely the idea. The question is whether that idea is correct. It seems almost certain that sometimes it's correct and sometimes it's not, but I'm not aware of any research into which cases are which, and I think that is a huge problem.
These are questions that themselves would need to be answered with scientific research. In the absence of real empirical data, I'd be uncomfortable with saying it is "more likely in general that false positives would be highly correlated." Without data, we don't know how likely it is.
You could also look at it the other way: Using the same doctor and lab and procedure is the best way to eliminate a false positive, because if the cause was external, then the cause may not be repeated. But if you went to a new lab/doctor/whatever, you've now introduced new variables that could cause a false positive on top of whatever already caused it.
Given that it could go either way, it makes sense to think of each one as independent.
Now, if you really wanted to take advantage of Bayes: if you got a positive test, then you should get two more tests, one with the same lab and one with a totally independent lab (or even if you got a negative test, assuming your first Bayes run gives 50% confidence).
More information allows for better reasoning. Repeating the same test and also doing a "more independent" test are the minimum next two things you should do given a positive result on a single test.
Referring to an appearance of Bayes' theorem on Sheldon's whiteboard:
> Bayes’ theorem has become so popular that it even made a guest appearance on the hit CBS show Big Bang Theory
That's not a sign that it has become popular, any more than the appearance of the standard human pedigree notation used by genetic counselors on the same whiteboard indicates that standard human pedigree notation has become popular among the general public. It's simply a sign that BBT takes care to make the whiteboards generally scientifically sensible.
They have a UCLA physicist [1] consultant who works with the producers, writers, prop people, set decorators and others to try to make things scientifically accurate.
In this particular episode, both Bayes' theorem and the genetic information are there because Sheldon was trying to figure out his chances of surviving until technology reaches the point that he can transfer his mind to an AI and what he would have to do to improve those chances. So both those things were on the whiteboard because they are things that would be perfectly reasonable to find on the whiteboard of someone doing that.
The consulting physicist had a blog [2] where he covered most episodes and talked about the whiteboards and other scientific content. Here is the entry for the aforementioned episode [3].
> In many cases, estimating the prior is just guesswork, allowing subjective factors to creep into your calculations. You might be guessing the probability of something that--unlike cancer—does not even exist, such as strings, multiverses, inflation or God. You might then cite dubious evidence to support your dubious belief. In this way, Bayes’ theorem can promote pseudoscience and superstition as well as reason.
Oh please. You can do plenty of pseudoscience and superstition with good old frequentist statistics. And of all the people you could pick to represent Bayesian statistics, instead of, I don't know, Andrew Gelman or someone, the author picks... Eliezer Yudkowsky? If nothing else, this provides inspiration for me to quit procrastinating on my "ASK ME ABOUT ROKO'S BASILISK" novelty t-shirt idea.
I suspect Eliezer is targeted specifically because of his tongue-in-cheek presentation of understanding Bayesian statistics as initiation into a cult, and because the author's familiarity with the topic likely comes specifically from Eliezer's efforts to popularize the subject, so the author associates it with him.
Sure, but who cares? There's an entire field of statistics called "Bayesian statistics", who do actual math and statistics and don't give a damn about novelty T-shirts.
> Oh please. You can do plenty of pseudoscience and superstition with good old frequentist statistics.
That's not really the point. The article is simply saying that Bayesian methods are not a silver bullet; it's not saying that other methods of statistics are free from problems.
> The article is simply saying that Bayesian methods are not a silver bullet
But it's not really saying it in passing; the headline is taken from that paragraph. At that point it almost feels like a takedown of a strawman Bayesian.
> If nothing else, this provides inspiration for me to quit procrastinating on my "ASK ME ABOUT ROKO'S BASILISK" novelty t-shirt idea.
Can you tell us the story about that party when you got so drunk and started yelling in anger at a poor random dude? Except you weren't drunk at all and the dude behaved like an asshole, but nobody cares about the truth since you're a nerd and you look cute when you're sad that we're telling stories about your drinking problems.
That's basically what happened there. I wish people stopped with this Roko's Basilisk nonsense.
There was a thread on LW about potential information hazards[0] - in particular, a kind created by the special flavor of decision theory which was being developed there. This was the kind of "peculiar, out there" stuff LW people like to play with. Here comes Roko, who says that he has created a potentially hazardous piece of information - a thing such that if you know it, you're fucked - that could lead to people suffering, and then posts it right there. Eliezer deleted the comment, claiming Roko was being irresponsible (and wrong, but still irresponsible if he believed that what he posted was a hazard), and the topic of what became known as Roko's basilisk was banned from discussion for some time.
Eliezer's argument was that Roko behaved unethically by posting something he claimed to believe will hurt people who will read it. That's irrelevant to whether the post itself was an information hazard or not. Eliezer actually later claimed it wasn't, that Roko was wrong. But the whole situation ended up being used to prove the point that LW people are loonies who invent and believe crazy things.
Hence my analogy, in which one person is rightfully angry at another, but ends up being painted as having issues instead.
Yudkowsky has been one of the most instrumental and highly regarded figures in promoting Bayesian epistemology, among other woo that engineers and programmers are oddly susceptible to. Nothing wrong with singling him out.
I agree with pretty much everything you say here, but then I think the article conflates Yudkowsky's Bayes-flavored woo with what most people mean when they talk about Bayesian statistics.
I think you're missing the broader argument, which is about using 'mathy' concepts to dress up poor reasoning. Obviously priors matter, but what matters most of all is how good/complete your evidence is. Using a mathematical formula to lend credence to weak evidence (through liberal use of assumptions) is a hallmark of pseudoscience. The same could be said of many abuses of statistics; Bayes' theorem is merely one good example of this.
Is using mathy concepts to dress up poor reasoning worse than not using anything to back up your reasoning? At least you can point out exactly what's wrong with the mathy reasoning.
A colleague of mine says 'Sometimes pulling numbers out of your arse and using them to make a decision is better than pulling a decision out of your arse'
> 'Sometimes pulling numbers out of your arse and using them to make a decision is better than pulling a decision out of your arse'
Agreed! Leaving the pseudoscience example aside - since there are strong emotions involved - we can clearly see that it is indeed useful and necessary to make decisions under uncertain/incomplete information. This is advantageous whenever the cost of inaction is expected to exceed the cost of backtracking a less-than-perfect decision, which is often the case.
Let's say... project management. If you take the time to find out that your project requires 100 tasks, 30 of which lie in your critical path, you can argue about whether each task will take one day or one week to complete, and you can debate whether adding a 3rd or 4th member to the team will significantly speed up the completion date or not. But you will definitely be in better shape than if your PM just cooked up some 5-page spec overnight and committed to having it running in beta test by the end of the month before even announcing it to the team...
Which itself will be better than having all your potential contracts snatched by competitors that never do any estimation at all but are very good at pulling themselves out of tarpits of their own making.
"Is using mathy concepts to dress up poor reasoning worse than not using anything to back up your reasoning?"
I believe so. If your belief is baseless, or based on flimsy evidence or simple bias, it's best if that's obvious. Dressing up weak reasoning to seem stronger is a form of lying. It's what we call sophistry. A big part of the problem is that a lot of people don't understand the math well enough to point out what's wrong with it, or they have a bias towards explanations that seem complex or sophisticated but really aren't.
It's true that sometimes we have to make a decision based on poor or no evidence but it should be clear that that is the case when that is the case. Dressing up the argument only obfuscates that.
Honesty is the ultimate issue here. If my reasoning is shoddy but I plug it into some mathematical apparatus, that will likely make its problems obvious. If my reasoning is very inaccurate and the data uncertain, being precise about it can at least make the results salvageable. Scott Alexander argues for this position quite well in [0].
Humans can lie with statistics well. But they can lie with plain language even better.
"If my reasoning is shoddy, but I plug it into some math apparatus, then it'll likely make my problems obviously wrong."
That's pretty clearly untrue. I remember reading a study recently where the p-value was less than .01 or something like that, but where the experimental design was clearly flawed: the correlation wasn't the correlation they thought they had. But because the math looked good and it was easier than actually reviewing the experiment, it was tempting to take the study at face value.
I've read Scott's essay before and I understand his argument, but I don't think it works. While you might be able to avoid some bad reasoning simply by being more systematic, you can also strengthen bad arguments with a faulty application of statistics. What Scott doesn't do is provide an analysis of how often each of these things happens. I'd argue that for each time a quick application of statistics saves someone from a bad intuitive judgment, a misapplication of statistics is used to encourage a bad judgment at least once, if not more often.
Understand that my argument here is not that one should never use statistics or even Bayes theorem, but that a naive or lazy application can be worse than no application.
For myself, I try to limit myself to the mathematical apparatus I feel comfortable with. I know that if I were to open a statistics textbook, I could find something to plug my estimates into and reach a conclusion, and I'm pretty sure the conclusion would be bullshit. I learned it the hard way in high school - I remember the poor results of trying to solve math and physics homework assignments on topics I didn't understand yet. The mistakes were often subtle, but devastating.
This is a general argument against statistics. Or math, in general. Yes, dressing your bullshit in math can make people believe you more, but it doesn't change the fact that you're lying. Are we supposed to stop using math for good because evil people are using it for evil?
Then you should take the Bayesian side, because Bayesians look at the data first, and they take their data as given rather than taking a null hypothesis as given. They don't just blindly go off and run a test (which assumes a particular prior implicitly that may be wildly inappropriate) and see what it says about the likelihood of their already observed data being generated by the test's assumed data generator.
But being a good Bayesian makes you do exactly this. The process of describing priors makes it obvious that you need to do a sensitivity analysis to check how much the prior is influencing the conclusions...
People need to get over it. The LW crowd is a group of people studying a pretty specific set of subjects, focused around a single website. It's typical for such a group to develop its own jargon and insider jokes, which may look weird from the outside. It's normal.
"Good Bayesian" in that context just means being an able user of Bayesian statistics, not necessarily holding any particular philosophical belief about what they mean.
How can you evaluate the strength of your data without using statistics? You've created a catch-22.
I'll speculate you have some sort of meta-heuristic and only apply this catch-22 under those circumstances? E.g. this catch-22 only applies to weird and socially disapproved topics?
On the other hand, one could argue that whenever the Church of Scientism sees someone using one of their favorite tools to argue in favor of a subject considered taboo, said church declares the use of said tool to be "invalid" or "out of scope".
I prefer to think of it in terms of the statistical inversion problem. That is, we have an event (or events) that occurs, which we may imperfectly understand. We take noisy measurements of that event. Clearly, the causal relationship is that the events cause the measurements - a bad measurement does not cause the event to move.
But, in practice all we have are measurements, and from that we want to find an optimal (or good) estimate for what the events were. Hence, inversion.
Bayes' formula expresses P(x|y) in terms of P(y|x), so you can perform the inversion using Bayes.
In many fields establishing the prior is difficult, hence frequentist methods are popular.
There are many techniques for the statistical inversion problem. Trying to track a ballistic object in a vacuum? Fit the measurements to a second-order polynomial (a parabola) and you are done (well, you have to decide least squares vs robust methods, but it is not such a hard problem in the scheme of things). Trying to track a maneuvering jet, stock prices, or disease incidence rates? Now your model of the problem is much less clear.
We model lack of information as random variables. It isn't "random" in the sense that the underlying process is nondeterministic, just in the sense that we don't know. Establish a good probabilistic description of that lack of knowledge in your prior, and you are probably going to get a good result: this jet fighter is probabilistically either moving straight, performing a coordinated turn, or performing an uncoordinated turn. Use a Markov chain to model those likelihoods (for example), and you may end up with good results. But if your modeling of the prior is poor, well, good luck to you; your output is probably nonsense.
I am only just learning about this stuff, but there are several things in this article that seem incorrectly explained. Conceptual clarity is paramount to me, so it drove me a little crazy!
> Bayes’ theorem is a method for calculating the validity of beliefs (hypotheses, claims, propositions) based on the best available evidence (observations, data, information).
Bayes theorem is a statement about probability, not "validity." This description makes it sound like Bayes theorem is a function BT(belief, evidence) = validity of belief. But it's not like that at all.
Probability is a way of measuring uncertainty. Things are uncertain for two main reasons: either we can't observe them directly ("do I have this disease or not?") or they haven't happened yet ("what side will this coin flip land on?"). (If you believe in a deterministic universe, the second is just a special case of the first.)
The "beliefs" (aka priors/posteriors) in Bayes theorem are statements of probability. To use the article's example, if it is claimed that 1% of the population has a certain disease, your "belief" or "prior" is that P(I have the disease) = 0.01. The article seems to get confused and think that the "belief" here is "I have the disease." Bayes theorem doesn't tell you about "the probability that a belief is true" like the article says, the belief is a probability. It also doesn't tell you if your belief is "valid."
Bayes theorem takes your existing belief about the probability of something and gives you a new probability that incorporates some evidence you observed.
Here's a proposal: Bayesian scientists shouldn't select their own prior. Instead publish how your results would update any prior, including the one picked by me, the reader.
I certainly haven't thought this through, but maybe this would make science more modular: combine the updates from M studies and calculate the new, combined update. Statisticians, does this work?
Typically, if you are practitioner in the field, it is not too difficult to identify instances where the result is highly dependent on the choice of prior.
Yes - Laplace originally proposed this, it's a good approach (and incidentally the basis for Bayesian meta-analysis). Google "skeptical prior" for more.
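In the simplest two-hypothesis case, "publish the update and let readers supply the prior" can look roughly like this (the Bayes factor below is a made-up number standing in for a study's reported likelihood ratio):

    # Posterior odds = prior odds * Bayes factor (likelihood ratio of the data
    # under H1 vs H0). The study reports the Bayes factor; each reader plugs in
    # whatever prior they find defensible.
    bayes_factor = 12.0   # hypothetical reported strength of evidence

    for prior in (0.001, 0.01, 0.1, 0.5):
        prior_odds = prior / (1 - prior)
        posterior_odds = prior_odds * bayes_factor
        posterior = posterior_odds / (1 + posterior_odds)
        print(f"reader prior {prior:>5}: posterior {posterior:.3f}")

Pooling independent studies then amounts to multiplying their Bayes factors, which is roughly the modular picture the grandparent comment is asking about.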
I had a professor in graduate school who suggested exactly this - each study should conduct a meta-analysis of all previous studies on the subject, use that estimate as their prior, update, and publish for the next study...
The problem is, having tried it, this is much more difficult to do in practice.
On the second point. It's called meta-analysis and is an important area of research. However it's very far from operating automagically and requires substantial manual input.
On the first point this is equivalent to contracting a builder to build a house in any style.
Choosing a prior isn't always just about picking a wide band around 3 or a narrow band around 2. This is like the builder offering a choice of curtain colours from a swatch.
Making a commitment to implement any prior at all could be totally unworkable. Like the overcommitted builder being asked to reconstruct R'lyeh.
BT inverts conditional probabilities. If you can estimate P(E), P(H) and P(E|H) better than P(H|E), it will give you a better result. It is one of many probability identities. But somehow it has become 'the one', as if, say, P(H|E) = P(H&E)/P(E) isn't much use, but put two of those together: world-changing.
I've seen so much crap come out of this fad. My particular favourite is in theology. William Lane Craig has demonstrated that Jesus rose from the dead, to a high probability. Richard Carrier has shown that there was no historical Jesus. Funny how few people ever run BT and find it contradicts their views.
I think part of the problem comes from a lack of understanding of the difference between frequentist and bayesian interpretations of probability. I've yet to see these folks show BT working in anything but frequentist data. And then they'll switch and use it to demonstrate why their Bayesian situation is correct.
Bayes does more than just invert the conditional. Of P(x), P(y), P(x|y), and P(y|x), if you know any three, then Bayes will give you the fourth.
It's just an equation. Garbage inputs will yield garbage outputs. In the realm of theology, it is of no more use than Pascal's Wager as an expected value calculation. All the input values are made up, so the output value is equally fabricated.
If you're using it on real, verifiable statistics, such as verified spam in an e-mail corpus, you can use Bayes to make a classifier that automatically identifies spam to a high degree of accuracy. But if you are estimating any of the three numbers you need to know, the fourth that you calculate will also be suspect.
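A toy version of that spam example (tiny made-up corpus, Laplace smoothing, and the usual "naive" assumption that words are independent given the class):

    from collections import Counter

    spam_docs = ["win money now", "free money offer", "win a free prize"]
    ham_docs = ["meeting schedule attached", "lunch money tomorrow", "project schedule update"]
    vocab = {w for d in spam_docs + ham_docs for w in d.split()}

    def word_probs(docs):
        counts = Counter(w for d in docs for w in d.split())
        total = sum(counts.values())
        # Laplace smoothing so unseen words don't zero out the product.
        return {w: (counts[w] + 1) / (total + len(vocab)) for w in vocab}

    p_w_spam, p_w_ham = word_probs(spam_docs), word_probs(ham_docs)
    p_spam = len(spam_docs) / (len(spam_docs) + len(ham_docs))

    def p_spam_given(msg):
        ps, ph = p_spam, 1 - p_spam
        for w in msg.split():
            if w in vocab:              # ignore words never seen in training
                ps *= p_w_spam[w]
                ph *= p_w_ham[w]
        return ps / (ps + ph)           # Bayes: P(spam | words)

    print(p_spam_given("free money"))       # high
    print(p_spam_given("project meeting"))  # low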
Of course two people can come to different conclusions based on a Bayesian analysis of the same question, if their priors are different. The benefit of Bayes' Theorem is to make explicit the dependency of the result on the prior.
I just love how some guy, who by his own admission read a little Wikipedia on the topic, is critiquing a statistical method.
It would make for a much stronger argument if he actually showed some numbers where people are getting the priors wrong. That is, how often people get the priors wrong, and the probability of a mistake if they instead used a different strategy more commonly used in the field.
That was my thought as well. Garbage in = garbage out; that's pretty standard in most fields. I really didn't like how the author treated the theorem as if it's some sort of magic, rather than something everyone who's taken a college prob/stat class has derived from first principles.
The problem is that advocates do treat it as a magical thing. They extrapolate from the fact it is proven to the claim that all knowledge is Bayesian, to the implication that all Bayesian reasoning is knowledge.
This fashion is why, for example, BT has been used to both prove the resurrection of Jesus, and to prove that Jesus didn't exist: both to a very high probability.
I'm almost sad I've never met one of these people in the wild. I'd really like to sit down and watch someone, with a straight face, try to say they both have a set probability for their belief on Jesus AND the probability that some vague ~evidence~ would exist given that belief.
This was Dawkins' assessment of the use of Bayesian probabilities to "prove" the god hypothesis... basically the numbers being entered were complete and utter fabrications, making the mathematics pointless except that they lent an air of quantitative rigor.
This headline does not reflect the article and is needlessly inflammatory. This article is an explanation of Bayes theorem and overall very positive of it. The "used wrongly" quote is just stating that it's not immune to biases and error. Pretty much any tool "used wrongly" can cause errors.
(In case it's changed, the headline currently reads "Bayes theorem used wrongly, can promote superstition and pseudoscience")
I think the title isn't quite so off. The article is trying to point out that people should be wary of using it wrongly.
Like other commentators have pointed out, the saying "garbage in garbage out" only helps when you stop to think whether or not you're putting garbage in.
They were doing great until they got up to the re-test, claiming that the second positive gives you 99% certainty you have cancer. That only works if the second test is completely independent from the first. If you repeat the first test a second time, only for those who get a positive result on the first, the same condition that caused a false positive on the first can cause a false positive on the second.
In reality, a cheap blood or urine test is likely to be followed by a more expensive test on a second portion of the same sample, then by an even more expensive tissue biopsy. Redoing an identical test only reduces random errors. It does not address the diagnostic bias of the test itself.
For instance, a pregnancy test detects hCG, from the placenta of a developing fetus. A man with various types of cancer or liver disease may also produce hCG, and can therefore produce a false positive for every test of that type, no matter how many repetitions. This does not give him greater confidence that he is pregnant!
Understanding Bayes also requires an understanding of event independence! For truly independent events, P(x|y) = P(x) and P(y|x) = P(y).
> Bayesians claim that their methods can help scientists overcome confirmation bias
The claim isn't that Bayesianism somehow prevents biases. Using a Bayesian approach is important in science because frequentism answers the wrong question[1].
> Many scientists operate as if the confidence interval is a Bayesian credible region, but it demonstrably is not ... I think the reason this mistake is so common is that in many simple cases ... the confidence interval and the credible region happen to coincide. Frequentism, in this case, correctly answers the question you ask, but only because of the happy accident that Bayesianism gives the same result for that problem.
To me, the single most easy to understand form of Bayes' Theorem is:
P(A|B) * P(B) = P(A∩B) = P(B|A) * P(A)
An intuitive explanation to the equation:
The (probability of events A and B both happening) equals the (probability of A happening when B is happening) * the (probability of B happening), which equals the (probability of B happening when A is happening) * the (probability of A happening).
Combining that with a Venn diagram:
{A (A}∩{B) B}
Since P(B|A) means the chance of event B happening given that A is happening, it's really just the area of (A∩B) divided by the area of (A), which can be translated to (P(A|B) * P(B)) / P(A).
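A quick numeric sanity check of that identity, on a made-up joint table (the numbers are arbitrary; the point is just that both sides agree):

    # Joint probabilities over two binary events A and B (they must sum to 1).
    p_a_and_b, p_a_only, p_b_only, p_neither = 0.10, 0.20, 0.30, 0.40

    p_a = p_a_and_b + p_a_only        # 0.30
    p_b = p_a_and_b + p_b_only        # 0.40

    p_a_given_b = p_a_and_b / p_b     # 0.25
    p_b_given_a = p_a_and_b / p_a     # 0.333...

    assert abs(p_a_given_b * p_b - p_b_given_a * p_a) < 1e-12    # both equal P(A∩B)
    print(p_b_given_a, (p_a_given_b * p_b) / p_a)                # Bayes: P(B|A) = P(A|B)P(B)/P(A)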
I've had people seriously claim to me that using Bayes' theorem to evaluate the beliefs one deals with in everyday life, using the evidence one comes across in everyday life, was likely a good idea and would reduce bias. I wish I'd had the presence of mind to point out that this does nothing to eliminate the selection bias of one's own experience. No mathematical formula can draw meaning out of weak or flawed evidence.
Trying to do so is like trying to 'enhance' a blurry photo so that you can see details in the photo that didn't exist.
Even if you only get weak or flawed evidence, you do what you can to make the best decisions given that evidence. No one suggests you do actual Bayesian calculations on everything you know, for the simple reason that it's not computationally viable.
But if your beliefs directly contradict Bayes, then you're doing something wrong - there's an inconsistency that's likely worth investigating, unless the matter is really minor. It's a sanity check for your decision making, not a constructive algorithm for the best decision.
> Cognitive scientists conjecture that our brains incorporate Bayesian algorithms as they perceive, deliberate, decide.
As a cognitive scientist myself, this amused me, because in the 1950s cognitive scientists thought that the brain worked like telephone switching equipment.
Basically, we fit our current model of cognition to the most popular model of computing at the time. Looks like the trend hasn't stopped (although to be fair, we were talking about the Bayesian model of cognition 20 years ago, so at least that one has lasted a while).
Bayesian probability is catching on because the technology has finally caught up to the point where it's feasible. Thanks to the web and the proliferation of big data, we now have enough observations that Bayesian models can be trained (this was always the hard part). This doesn't mean we don't need smart people figuring out what things the model should and shouldn't look at; but it's at least possible to do today.
It also doesn't hurt that Bayes' theorem is at its heart a map-reduce problem. Where once Bayes' theorem was considered a cumbersome artifact for "brute forcing" probability, it's now likely faster than competing methods of statistical analysis.
We almost seem to be getting to the point in ad targeting where demographics are expressed in terms of a set of bayesian properties. You don't even care what the properties are; just that they're potentially more willing than average to use your product.
Since we're on the subject, can anyone point me in the direction of how to account for correlated inputs? Without adjusting them, it can possibly give nonsensical probabilities (>1) but my math isn't good enough to decipher the few academic texts I've seen regarding this situation.
A (very simple) example: I trade stocks. My starting point is that I think a stock has a 50% chance of rising next year. Then I want to do a Bayesian iteration with the stock's P/E ratio based on historical data for stocks with similar P/E ratios. Then I want to also incorporate the P/E ratio of the industry the stock is in. Obviously these two inputs are correlated and if you have enough correlated variables, the whole thing breaks down because the simple theorem only works if all the inputs are independent of each other.
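I don't have a proper answer, but here's a toy illustration (entirely made-up numbers) of why naively chaining correlated updates overstates confidence: if the industry P/E carries mostly the same information as the stock's own P/E, treating them as two independent pieces of evidence double-counts it.

    def update(prior, likelihood_ratio):
        """One Bayes update: posterior odds = prior odds * likelihood ratio."""
        odds = prior / (1 - prior) * likelihood_ratio
        return odds / (1 + odds)

    prior = 0.5            # starting belief that the stock rises next year
    lr_stock_pe = 1.5      # hypothetical evidence strength from the stock's P/E
    lr_industry_pe = 1.5   # hypothetical evidence strength from the industry P/E

    naive = update(update(prior, lr_stock_pe), lr_industry_pe)
    print("chained as if independent:", round(naive, 3))                        # ~0.69

    # If the two signals are largely redundant, the honest combined likelihood
    # ratio is closer to a single 1.5 than to 1.5 * 1.5, so the naive chain
    # is overconfident (and with many such inputs, badly so).
    print("treated as one signal:    ", round(update(prior, lr_stock_pe), 3))   # 0.6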
The fundamental insight of bayesian reasoning is that prior probabilities can matter a lot in some situations, even when the evidence seems fairly strong.
Look at scientific papers. Usually there is a p-value attached, often around 0.05. Roughly, this means that if there were no real effect, there would be only a 5% chance of producing a result at least this extreme by random chance.
How strong is that evidence, really? What if you started with a prior probability of 1 in 1,000 that the hypothesis is actually true? That is, before you saw the paper, you would estimate there's only a 0.1% chance it's true. Out of 1,000 similar studies, roughly 50 of the 999 false ones will nonetheless return a p-value below 0.05, while at most 1 will be reporting a genuinely true result.
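Putting rough numbers on that (the 80% power figure is my own assumption; the rest follows the setup above):

    n_studies = 1000
    prior_true = 1 / 1000        # only 1 of the 1000 hypotheses is actually true
    alpha, power = 0.05, 0.80    # significance threshold and assumed statistical power

    true_hits = n_studies * prior_true * power           # ~0.8 expected true positives
    false_hits = n_studies * (1 - prior_true) * alpha    # ~50 expected false positives

    print(round(true_hits / (true_hits + false_hits), 3))   # P(true | p < 0.05) ~ 0.016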
If you have some time to spare, here's a great paper on a similar topic: the (mis)use of mathematics and in particular equilibrium methods in economics after Samuelson and Hicks ("What Went Wrong with Economics?", Peter J. Boettke, 1997): http://www.the-dissident.com/Boettke_CR.pdf
Pretty much every debate on economic policy between the left and right is full of fallacies emanating from this episode in history.
I find Bayesian subjective probability even more interesting. It has been successfully applied in unusual situations; for instance, during the Cold War a panel of "experts" in the US used it to locate Russian test-fired rockets in the ocean.
Modal logic is an equally valid way to get at a lot of the quandaries associated with Bayesian ideas.
Bayes' theorem by itself is totally not a "big deal." The idea that every probability has an associated prior, even if it's not explicitly written in the notation (so think "prior of prior"), is an interesting attempt to cope with uncertainty in a rigorous way.
I do agree that in some places in the sciences, while "the numbers don't lie," the stats can be misleading. Still, I can understand why it's useful to make some statements with statistics in order to quickly arrive at some first-order approximations.
Bayesianism is a 'grand unified theory of reasoning' holding that all of science should be based on assigning (and updating) probabilities for a list of possible outcomes; the probabilities are supposed to indicate your subjective degree of confidence that a given outcome will occur.
Yudkowsky's 'Less Wrong' group of 'rationality' followers aimed to try to force-fit all of science into the Bayesian framework. And of course it doesn't work at all.
I think Andrew Gelman's criticisms are right on the mark.
Probability theory was designed for reasoning about external observations - sensory data. (for example, "a coin has a 50% chance of coming up heads"). In terms of predicting things in the external world, it works very well.
Where it breaks down is when you try to apply it to reasoning about your own internal thought processes. It was never intended to do this. As Gelman correctly points out, it is simply invalid to try to assign probabilities to mathematical statements or theories, for instance.
You see 'Less Wrong' followers wasting years of their lives engaging in the most unbelievable and ludicrous intellectual contortions to try to force-fit all of science into Bayesianism.
Go to the 'Less Wrong' blog and you can read reams and reams of these massively complicated and contorted ideas, including such hilarious nonsense as 'Updateless decision theory' and 'Timeless decision theory'.
---
David Deutsch, in his superb books 'The Fabric of Reality' and 'The Beginning of Infinity', argued for a different theory of reasoning than Bayesianism. Deutsch (correctly in my view) pointed out that real science is not based on probabilistic predictions, but on explanations. So real science is better thought of as the growth or integration of knowledge, rather than probability calculations.
In terms of dealing with internal models or hypotheses, I think the correct solution is not to assign probabilities, but rather to assign a 'conceptual coherence' value, so for instance rather than say
'outcome x has probability y' (where x is a hypothesis)
you should say
'concept x has conceptual coherence value y'
Conceptual coherence is the degree to which a hypothesis is integrated with the rest of your world-model, and I think it accurately captures in mathematical terms the ideas that Deutsch was trying to express.
Probabilities should be viewed as just special cases of conceptual coherence (in the cases of outcomes where you are dealing with external observations or sensory data, Bayesianism is perfectly valid).
Then all of the problems with probability go away, and none of the massively complicated theories expounded on 'Less Wrong' are necessary ;)
I think Stephen Bond did some excellent takedowns of the identity politics that has arisen around Bayes' Theorem back in the day. I wonder where he's at these days.
> One of Yudkowsky's constant refrains, appropriating language from Frank Herbert's Dune, is "Politics is the Mind-killer". Under this rallying cry, Lesswrong insiders attempt to purge discussions of any political opinions they disagree with. They strive to convince themselves and their followers that they are dealing in questions of pure, refined "rationality" with no political content. However, the version of "rationality" they preach is expressly politicised.
I've seen this type of writing before. It's a kind of twisted pseudo-criticism you write against a group you dislike. You can compose stuff like this against any group. It sounds believable from the outside, especially if you start out sceptical to begin with. But take a closer look - it's actually full of ad hominems, cherry-picked facts, and facts presented in the worst light possible. I've been a part of several groups that were targeted by such prose - first the religious group I grew up in, which is a minority in my country; then the school I went to. My university year used (a very lite version of) such criticism against another, so I've seen it from the other side as well. Hell, people write shit like this about HN!
It's hard to defend against such criticism. You'll get bogged down in refuting specific accusations, but this is something you can never win. The only winning move is to ignore it completely. Personally, I shun and shame people who write such stuff, regardless of whether I agree or disagree with their victims. Dishonesty is a poison that destroys societies.
TL;DR: this text is harmful, malicious bullshit. If it at least offended people with style, there would be something to save it.
I know you're a huge advocate for Lesswrong, but not "everybody" or "anyone" has quotes ripe for picking like Yudkowsky. Stephen Bond is not just throwing some opinion out there, he's backing it up with first-hand sources:
Yudkowsky on his simplified views of why race gets brought up:
> "Race adds extra controversy to everything; in that sense, it's obvious what difference skin colour makes politically".
> "Group injustice has no existence apart from injustice to individuals. It's individuals who have brains to experience suffering. It's individuals who deserve, and often don't get, a fair chance at life. [...] Skin colour has nothing to do with it, nothing at all."
Yudkowsky on our current societal structure, adulating the people who give him funding:
> One of the major surprises I received when I moved out of childhood into the real world, was the degree to which the world is stratified by genuine competence.
Yudkowsky writing short stories about a society where rape is legal, leaving himself ample room for plausible deniability, but putting it up "for debate":
>> "No, us. The ones who remembered the ancient world. Back then we still had our hands on a large share of the capital and tremendous influence in the grant committees. When our children legalized rape, we thought that the Future had gone wrong."
I think some well-meaning LessWrongers get caught in the crossfire, but I think the essay makes a very well-grounded case for the blind spots "rationalists" have towards politics, blind spots that suit the identity of people like Yudkowsky.
This is an article asking, to quote: "But why is it that the rest of the world seems to think that individual genetic differences are okay, whereas racial genetic differences in intelligence are not?". Yudkowsky seems to argue that making big controversies around whether there are, or are not, differences between "races" is missing the point; it's just one of many variables and we should focus on fixing intelligence disparities for everyone.
This blogpost is marked as controversial from the get-go. It covers a quite interesting theory IMO - that the very common meme, which says that elites are stupid and evil, is in fact wrong. Eliezer argues towards a more socially uncomfortable opinion - that elites are, in fact, smarter and better at organizing things. Given that this is a belief most people are heavily biased against, it actually may be the "thing you can't say", per PG's essay at http://www.paulgraham.com/say.html.
Story:
Geez. This is a sci-fi story. Moreover, it's explicitly designed to fuck with reader's moral intuitions. That's its entire point. Personally, I find it fun and insightful, but it is heavy. You can read the whole work here:
And to be clear: I'm not idolizing Eliezer. He's just a man who thought a lot about some stuff and wanted to share it. He sometimes gets it wrong (and, as opposed to many, at least has the courage to admit he was wrong). But I absolutely hate this kind of bullshit pseudocriticism when it's directed against anyone - be it my friend or enemy, be it someone I admire or despise. Eliezer is not beyond criticism, but we can do better than this.
I guess I just disagree with you that they're taken out of context?
1. Bond posits Yudkowsky thinks that racism is mostly due to genetic difference, and not about the deliberate, mostly political disenfranchisement of minorities.
You say this is wrong because his appraisal of the situation as being caused by genetic disparities is benevolent, and because he wants to work to "fix" it.
These aren't in contradiction: to say that it's unfortunate that society is racialized, but that this is due to traits rather than politics, is better than being wantonly racist, but it is still a well-known form of fallacious racism. Some would say this latter kind is actually worse in terms of perpetuating the situation.
2. Of course it's "controversial from the get-go". He looks at the disproportionate representation of certain people in, e.g., tech, and concludes that it's because they're "more competent". Nothing here seems to be a counter, you're just basically saying "it's politically incorrect so it's probably right".
3. Clearly it's just a story. This point eludes exactly nobody. The whole point is that these kind of people find these questions ("is rape really bad?") really intriguing rather than obvious. It makes you wonder about the power of Bayesian reasoning - the exact point of the essay.
I think you're too uncharitable to critical writers like Bond (it took me a while to acquire a taste for this vicious kind of writing), underestimating their and their audiences' understanding. As a result, you think this context adds way more than it does.
"1. Bond posits Yudkowski thinks that racism is mostly due to genetic difference, and not about the deliberate, mostly political disenfranchisement of minorities."
You don't need to take the author's word for it. Look at the article itself.
The article is not about the question of whether race affects intelligence. It's saying that that question is much less important than the fact that individuals have different IQ levels regardless of race. At least in the context of this article, which is asking whether "God is fair".
To make it crystal clear - Yudkowsky is saying:
1. Forget race. There are clear, obvious, and gigantic genetic differences in individuals which cause differences in intelligence.
2. This is "unjust".
3. We should be upset about this, and try to fix it.
That's all he's saying in that article - it is not at all racist.
If you think otherwise, please - quote the relevant part of the article and explain yourself.
I'm not gonna do a good job at explaining this, because it's far from my subject area, but the problem is that your demand for "the relevant part" asks for blunt evidence of a subtle phenomenon.
When we talk about politics, context matters. Someone like Yudkowsky doesn't believe stupidly racist things, like a KKK member or a turn-of-the-century politician, or even modern anti-immigration people do. Bond is targeting something much more subtle.
The point is that here you have a man who naively keeps trying to push the dialogue "beyond race" ("forget race"), in a forum where if you scroll down to the comment section, you'll see Jeff H. with 5 upvotes defending Watson's racist remarks, with Epiphany at 0 upvotes talking about the cultural reasons why IQ tests fail.
It's about the framework, and what it allows, and about what allows the people championing it to be naive or indifferent about what it allows. Someone discussing the pros and cons of rape coldly and clinically isn't a rapist, but it says something else about a person that they can have that discussion coldly and clinically. Some people take pride in their level-headedness about tough topics; others would take it as a signal of a lack of empathy with the victims, and worry about the signal it sends to people who do feel passionately about the issue.
And so Yudkowsky will discuss race in this goofy, aloof kind of way - "maybe black people are intellectually inferior, let's talk about it, does it even matter in the end?" - and some people will believe that this is the right kind of coolness to produce neat solutions, and others will believe it takes the steam out of the emotional connection that would engage people in political action for change.
You have to suss this kind of thing out, and if you plain don't want to believe me that there's something enabling towards racism and sexism and other various supremacist ideologies in the LessWrong ecosystem, you don't really have to.
I think your premises are mistaken on several fronts, I'm annoyed by your (mis)characterizations of LW and Eliezer, and I disagree strongly with your conclusion...
but your comment helped me actually understand this position for the first time, so thank you for articulating it.
> You have to suss this kind of thing out, and if you plain don't want to believe me that there's something enabling towards racism and sexism and other various supremacist ideologies in the LessWrong ecosystem, you don't really have to.
There is definitely something enabling about it. A lot of modern politics wants you to NOT believe black people are stupid criminals.
Yudkowsky would almost certainly endorse "if black people are stupid criminals I want to believe black people are stupid criminals, if black people are not stupid criminals I want to believe they are not stupid criminals". (This is the Litany of Tarski, which he endorses, applied to a specific case.) One could certainly view this as "enabling" supremacist views under the circumstances that reality supports them.
Is a fair summary of your critique that Yudkowsky is rationally and unemotionally reasoning about problems which are normally addressed via appeals to emotion, and this subverts emotional appeals to the masses to take certain actions?
I've read your post several times, and this is the most charitable takeaway I can come up with. But this sounds insane so I feel like I must be misunderstanding something.
Someone who feels injustice, but hasn't had the time or resources to codify it into a calm rational treatise is said to be speaking from emotion. Someone who merely experiences base greed, but is able to justify it to themselves as righteous and deliberate and socially beneficial with a bunch of babble, is considered rational.
And so, the idea that the rigid set of discursive boundaries that LessWrongians impose upon themselves may favor a political outcome (the status quo) seems ridiculous to you, because you use "rational" as a compliment and "emotional" as an insult.
Can you find me an influential LessWrongian who describes making emotional decisions based on greed as "rational"?
Based on my reading, a profit seeker obeying his gut and losing money is irrational. In contrast, a social justice type observing that assortative mating causes inequality and therefore attempting to reduce educated women's marriage choices is rational. But maybe you've read more than me, and can figure out what I missed?
> And so, the idea that the rigid set of discursive boundaries that LessWrongians impose upon themselves may favor a political outcome (the status quo)...
This is a completely new critique of the Less Wrong crowd. The normal critique, and the specific one you cited, claim that Less Wrong is too far from the status quo and concerns itself with things like the singularity.
Adding to my confusion is that in the very post you criticize, Yudkowsky explicitly advocates fixing intelligence deficits with "sufficiently advanced technology, biotech or nanotech". How is fixing every person with an IQ < 150 using nanotechnology remotely preserving the status quo?
(Note also that Yudkowsky is explicitly advocating that if genius isn't uniformly distributed, as social justice types claim to believe, we should explicitly change the world to make it so.)
I can't seem to post a reply, so I'll wrap it up here.
I'm criticizing an ecosystem. Yudkowsky-types noodle with weird hypotheticals, while others with elitist views get validation. Fantasizing about fixing our current issues with futuristic tech, and using that as a yardstick to criticize, e.g., a collective black identity forming a political bloc, is not explicitly pro-status-quo, but it ends up being so in practice.
I'm not very familiar with the subject matter, but do you realize that this concluding argument is very weak?
Firstly, I don't think I've seen an instance mentioned where Yudkowsky seems to be trying to prevent a collective black identity from forming a political bloc.
Secondly, you've suggested that the mechanism by which Yudkowsky's material ends up as bullets for people who desire to perpetuate prejudice is one where the prejudiced party misrepresents Yudkowsky (hereafter, Y). Therefore, your reasoning goes, Y and the LW commenters are guilty of engaging in subject matter that is ripe for appropriation.
Unless I'm mistaken, your accusation is one that is pretty unjust in itself. You are accusing Y of a moral crime for Y&LW's association with (racially) prejudiced groups that have misrepresented Y as prejudiced. This is despite the fact that Y has neither affirmed this association nor mentioned anything prejudiced, his crime being entertaining "weird hypotheticals". Rather, if such a thing has occurred, isn't Y a victim of the prejudiced groups himself?
So the only criticism here is that the views of various people on the LW forum do not conform to the mainstream social justice outrage narrative, where everything needs to be politicized and the "status quo" must be fought?
That's sort of the entire point of the "Politics is the Mind-Killer" statement, a point which Bond also missed - the LW community wants to focus on effective ways to deal with actual problems, as opposed to doing politics. They're not criticizing "a collective black identity", or speaking against it "forming a political bloc". They're not talking about it at all. It's beside the point of that article and mostly beside the point of the entire community, which tends to focus on how to make things better for everyone.
Frankly, I find it funny to see accusations of racism aimed at people who are known to seriously, and not just as a figure of speech, consider humanity as one great family that is in it together. But then again, every one of us who is not outraged is secretly a racist and supports the enemy.
Also consider the core statement of the Mind-Killer article:
"Politics is an extension of war by other means. Arguments are soldiers. Once you know which side you're on, you must support all arguments of that side, and attack all arguments that appear to favor the enemy side; otherwise it's like stabbing your soldiers in the back—providing aid and comfort to the enemy. People who would be level-headed about evenhandedly weighing all sides of an issue in their professional life as scientists, can suddenly turn into slogan-chanting zombies when there's a Blue or Green position on an issue."
It's perfectly OK to avoid that kind of political discussion. I'd say it's weird to actually partake in it.
I'd love to reply to you all (i.e., the sibling comments), but this discussion is just sprawling (I'm not a fan of "tree-and-leaf" fanning arguments; I much prefer linear forums), and I just can't put in the time.
FWIW, you strike me as a good person, as do many LWers. I wish I could communicate to you the nuance of my issues with statements like "humanity as one great family", but I'm a newbie at the study of ideology myself, so I wouldn't do a good job.
The racism and feminism points Bond makes sound to me like typical social justice warrior bullshit - no matter what you do, you're a misogynist racist. Disagreeing with the modern trend of mindless outrage paints you as an enemy. So does ignoring the topic. So let me skip it, because I do disagree with Bond very seriously about that part of the text, and I don't want to start another SJW subthread here.
> Nothing here seems to be a counter, you're just basically saying "it's politically incorrect so it's probably right".
No; I'm saying that just because he's politically incorrect, doesn't mean he's wrong. And it most definitely does not mean Bond gets to assume all the bad things he did about the author. Eliezer is just toying with an idea based on some observations. Personally, I find his idea intriguing and worth considering, given that time and again I've learned that when the general population seems to believe something counterintuitive and self-serving, like "those rich people are stupid and evil", they're usually wrong.
> The whole point is that these kind of people find these questions ("is rape really bad?") really intriguing rather than obvious.
I think we disagree on the interpretation. Many stories raise weird questions about morality; that's the reason we have literature. Please find me a place on LW where people, and Eliezer in particular, were seriously advocating for rape. Also it's worth noting that the rape angle was just a passing remark in the story, a way to shake the audience a little bit - and otherwise in no way relevant to the plot.
> I think you're too uncharitable to critical writers like Bond (it took me a while to acquire a taste for this vicious kind of writing), underestimating their and their audiences' understanding. As a result, you think this context adds way more than it does.
Maybe I am, but it's because I've been a member of groups that were on the receiving end of such writings, and spent a lot of time debunking them point-by-point to concerned friends and colleagues who stumbled upon them. It's painful, and you realize just how helpless you are against people who argue dishonestly.
>The whole point is that these kind of people find these questions ("is rape really bad?") really intriguing rather than obvious.
That isn't the question the story is asking. For one thing, Eliezer says:
>This is a work of fiction. In real life, continuing to attempt to have sex with someone after they say 'no' and before they say 'yes', whether or not they offer forceful resistance and whether or not any visible injury occurs, is (in the USA) defined as rape and considered a federal felony. I agree with and support that this is the correct place for society to draw the line. Some people have worked out a safeword system in which they explicitly and verbally agree, with each other or on a signed form, that 'no' doesn't mean stop but e.g. 'red' or 'safeword' does mean stop. I agree with and support this as carving out a safe exception whose existence does not endanger innocent bystanders. If either of these statements come to you as a surprise then you should look stuff up. Thank you and remember, your safeword should be at least 10 characters and contain a mixture of letters and numbers. We now return you to your regularly scheduled reading. Yours, the author.
The point of the protagonists having legalized rape is to demonstrate that the protagonists' morals are just as far from ours as the baby eaters' or the super happies'. The story isn't about two alien societies, but three.
For another, what "these kind of people" find intriguing is that our morals are for the most part due to our circumstances. That in the future, people may very well accept what we find today to be abhorrent, and find abhorrent what we accept.
That is the point of "is rape bad?": it is simply an example of something that is obviously bad, just as, some time ago, there were other things considered obviously bad that today we accept (same-sex relationships, for example). And if we are aware that the future will look back and mock us for our obviously wrong morals, then maybe we will be faster to get on the ball with the next gay rights or whatever.
Are there moral beliefs you hold today, that you think the future might condemn you for? Or do you think in this time and place you've acquired a perfect set of morals?
> "Group injustice has no existence apart from injustice to individuals. It's individuals who have brains to experience suffering. It's individuals who deserve, and often don't get, a fair chance at life. [...] Skin colour has nothing to do with it, nothing at all."
That bracketed ellipsis obscures a significant point. The full paragraph is:
>So, in defiance of this psychological difference, and in defiance of politics, let me point out that a group injustice has no existence apart from injustice to individuals. It's individuals who have brains to experience suffering. It's individuals who deserve, and often don't get, a fair chance at life. If God has not given intelligence in equal measure to all his children, God stands convicted of a crime against humanity, period. Skin colour has nothing to do with it, nothing at all.
He's claiming that skin color has nothing to do with the (according to him) injustice of different people having different intelligences. He's not claiming that skin color has nothing to do with why people don't get a fair chance at life. "Skin colour has nothing to do with it" is making a moral claim, not an empirical one.
These sorts of quotes can reveal a lot about the source that quoted them, if you track them back to their original source and check for reasonableness there. I think you'll find that each of these quotes had its meaning or connotation dramatically altered by the removal of context.
If your ideas can stand on their own merit, you can defend them against criticism without resorting to saying things like "oh, it's pseudo-criticism and cherry-picking facts". You're the one calling it bullshit. You even admit you wouldn't want to refute specific accusations.
And then go on to call it "malicious bullshit".
So, basically what you're saying is that we shouldn't listen to this criticism (which provides very valid points btw) but listen to you, when all you've done is attack the critique in a classic ad-hominem fallacy.
Yes, I see the problem you mention. All I offer is, as a person who spent some time hanging around that group but not exactly an insider, my honest assessment: this criticism is very unfair, very hurtful, and mostly bullshit. You don't have to believe me - but you now have a second data point for your consideration.
I did engage with some points in comments below, though, to show examples of this cherry-picking of facts, ripping them out of context and twisting them to support something completely untrue.
But my general point is that this kind of criticism is impossible to defend against. To prove that it's mostly bullshit would take me literally writing a book - in which I would surely make mistakes, that then could be used to discredit it entirely in a single article. This is the asymmetry of dishonest arguing - you can get your results with 1/100 the effort if you're willing to deceive your reader.
When people read a powerful criticism "debunking" things about a group they vaguely know, the default for most is to agree with the criticism. For some reason it's natural to humans. All I want now is to make people stop for at least a moment, consider that the article may be unfair, and to not make judgments before double-checking the claims.
Maybe I could've written it in a more dispassionate way. It's hard for me - not because I like LW, but because I've been in other groups targeted by this kind of criticism, spent way too much time trying to defend the group from it, and I know the crushing feeling that no matter what the truth is, it'll be twisted and bent until it can be used as a weapon against you.
> this criticism is very unfair, very hurtful, and mostly bullshit. You don't have to believe me - but you now have a second data point for your consideration.
No I do not. All you've provided me is a point of view without anything resembling rational reasoning. I believe that trains can fly. You don't have to believe me, but now you have another data point for your consideration.
> this kind of criticism is impossible to defend against.
See my initial statement. All you have to do is point out flaws in the above link.
> To prove that it's mostly bullshit would take me literally writing a book - in which I would surely make mistakes,
If you're not confident in your ability to assess and refute an argument, then why bother making it?
> that then could be used to discredit it entirely in a single article.
Nobody is saying the article above is perfectly written, or that the critique is 100% on point, but that does not mean the broad point he makes has no merit. Same would be true for the hypothetical book you write.
> When people read a powerful criticism "debunking" things about a group they vaguely know, the default for most is to agree with the criticism. For some reason it's natural to humans.
Do you have any basis on which you're making that claim? No, because if you did, you'd have provided it.
> All I want now is to make people stop for at least a moment, consider that the article may be unfair, and to not make judgments before double-checking the claims.
But WHY? WHY? You've, objectively, given me ZERO reason besides "it's malicious and people are biased when reading debunking articles". You've clearly refused to engage with the content. You haven't even quoted a single line and shown that it's false. Whereas the article in question is an in-depth exploration of this community of people: their beliefs, the personalities involved, their educational backgrounds, and many more things. What have you done? Nada.
I can go on and tear down your whole reply, but I'll stop here. I come away with the impression that you're taking this personally and therefore, as a natural response, you're defending it, which is fine. I'm not trying to be a dick, but perhaps you should work on your argumentation and logic skills before trying to defend or take any side.
You're reading me very uncharitably. I believe that debunking this article properly, point-by-point and with overwhelming evidence, would take a stupidly large amount of time and is an almost impossible task - since no matter what I say, people with the same approach as the author can find something else to twist into criticism. On the other hand, Googling the quotes he uses and checking them against the source material is easy and provides ample proof that the article is dishonest. It's something a reader can do, and I want to clearly state that in this case, they absolutely should.
RodericDay asked about three concrete examples taken from the article; each of them is a perfect example of how utterly nonsensical this article is. I provided the relevant source material and explained the real meaning of the quotes Bond uses when read in the context of that source material. So did other HNers. Check out the responses in the parallel subthreads. I could do it for every one of his sentences, but frankly, I feel it's a waste of time. I think I've proven enough that this is not honest criticism, and fact-checking the rest of that article shall be the task of the reader, who is now properly warned.
Let's try. As far as I know, this bit: "Under this rallying cry, Lesswrong insiders attempt to purge discussions of any political opinions they disagree with."
Having hung around LessWrong for quite some time, I'm pretty sure this point is false. How am I to show it, however? Dump the whole site to show the absence of such purges? And what about the inevitable counter-example? Quite a lot has been said there, we're bound to find unacceptable behaviour somewhere. All I can provide is a testimony. In other words, anecdotal evidence, which we all know to be weak.
There's another way to say "pseudo-criticism and cherry-picking facts": it's the fully general counter-argument. If the criticism can indeed apply to all groups, then it effectively applies to none, because it doesn't provide a way to discriminate different groups.
> Having hung around LessWrong for quite some time, I'm pretty sure this point is false. How am I to show it, however? Dump the whole site to show the absence of such purges?
To refute the above statement, even a single counter-example would be a good start - it would provide me with enough evidence to consider your side of the story and think that perhaps the critique isn't completely fair.
> Quite a lot has been said there, we're bound to find unacceptable behaviour somewhere.
This can be said of any popular community. Does that mean we should not critique them? Or does it somehow invalidate any criticism against them? No.
The thing you're missing here is not the number of counter-examples, but a general cultural trend. HN, for example, is a tech-oriented community, but political discussion isn't anathema here. I can show this by reading a few posts and noticing how comfortable people are discussing differing political opinions here. It's not a perfect argument, but it's a good start, and it goes a long way towards making me appreciate the environment.
There is also an important role played by the so-called "leaders" in a community, and how tolerant they are, by the way. And I know for a fact that Eliezer isn't one of them. Then again, what else do you expect from a guy who tries to edit his own Wikipedia page?
> If the criticism can indeed apply to all groups, then it effectively applies to none, because it doesn't provide a way to discriminate different groups.
I'd happily accept this if you specifically mention which criticism in the parent link can be applied to all. The critique is very, very specific. It starts by defining what Bayes' Theorem is, what LW and their ilk think about it, more of their pseudo-intellectual talk, the cult of personality, the AI apocalypse doomsday scenarios, etc., etc.
I don't like the introduction, mostly because I believe, to a degree, the things he ridicules (AI is an existential risk, I have enjoyed HPMOR more than Rowling's work, and I'm not quite sure that making lots of money and giving it away is worse than working directly for whatever cause we want to support).
Bayesian grace
Bayes T-shirt means certain asshole. Well, at least it made me smile.
I agree that Bayes' theorem is not that notable. The product rule, which is more fundamental, is more representative of the laws of probability (the actual basis for Bayesianism).
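For what it's worth, here's a minimal Python sketch (with made-up numbers, nothing from the essay) of what I mean - Bayes' theorem is just the product rule P(A and B) = P(A|B)P(B) = P(B|A)P(A), rearranged:

    # Product rule written both ways:
    #   P(A and B) = P(A|B) * P(B) = P(B|A) * P(A)
    # Equating the two and dividing by P(B) gives Bayes' theorem:
    #   P(A|B) = P(B|A) * P(A) / P(B)

    p_a = 0.01              # made-up prior P(A)
    p_b_given_a = 0.90      # made-up likelihood P(B|A)
    p_b_given_not_a = 0.05  # made-up likelihood P(B|not A)

    # P(B), by summing the product rule over both cases:
    p_b = p_b_given_a * p_a + p_b_given_not_a * (1 - p_a)

    # Bayes' theorem, i.e. the product rule rearranged:
    p_a_given_b = p_b_given_a * p_a / p_b
    print(round(p_a_given_b, 3))  # 0.154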
Of course, the "formula for the perfect brain" is computationally intractable…
The association between Bayesianism and Neo-Liberalism looks like an ad hominem attack - it doesn't apply to me, at least.
I can't comment on this "Bayesian revolution". But I'm already suspicious of the whole essay at this point, and cannot trust this paragraph.
Amazing Bayes
What the author fails to acknowledge here is that, to the extent it can be applied, probability theory really is that amazing. There are proofs of it working very broadly, and it relies on very few axioms. (Jaynes' work in Probability Theory: The Logic of Science leaves little doubt about that.)
Absence of evidence is evidence of absence. Often very weak evidence, but evidence nonetheless. Whoever believes otherwise doesn't understand probability theory. (Maybe the author conflated "evidence" and "proof", or "evidence" and "strong evidence"?)
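In case the sign of the effect isn't obvious, a rough numerical illustration (made-up numbers): if E is more likely under H than under not-H, then failing to observe E has to push P(H) down, if only slightly:

    # Made-up numbers: evidence E is more likely if hypothesis H is true.
    p_h = 0.5
    p_e_given_h = 0.10
    p_e_given_not_h = 0.02

    # Probability of NOT observing E, summed over both cases:
    p_not_e = (1 - p_e_given_h) * p_h + (1 - p_e_given_not_h) * (1 - p_h)

    # Posterior after observing the absence of E:
    p_h_given_not_e = (1 - p_e_given_h) * p_h / p_not_e
    print(round(p_h_given_not_e, 3))  # 0.479 - slightly below the 0.5 prior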
The correct application of probability theory is computationally intractable in many cases. I can see how it would be unworkable for historians, who have to juggle many, many kinds of evidence. Not having read Richard Carrier, however, I'm not sure whether this objection is yet another strawman.
Accusing Bayesianism of being responsible for confirmation bias is ridiculous, however. Confirmation bias does not follow from probability theory; it often follows from an incorrect application of it.
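To make "incorrect application" concrete, here's a toy Python sketch (an entirely hypothetical setup, not taken from anyone's writing): an observer who only feeds confirming observations into the update ends up certain of a falsehood, while the same Bayes rule applied to all the data gets it right. The bias lives in the data handling, not in the theorem:

    import random
    random.seed(0)

    def bayes_update(prior, lik_pet, lik_fair):
        # One Bayes update for "coin favours heads (p=0.8)" vs "coin is fair".
        num = lik_pet * prior
        return num / (num + lik_fair * (1 - prior))

    p_correct = 0.5  # updates on every flip
    p_biased = 0.5   # only updates on flips that "confirm" the pet hypothesis

    for _ in range(1000):
        heads = random.random() < 0.5                 # the coin is actually fair
        lik_pet, lik_fair = (0.8, 0.5) if heads else (0.2, 0.5)
        p_correct = bayes_update(p_correct, lik_pet, lik_fair)
        if heads:                                     # confirmation bias: ignore the tails
            p_biased = bayes_update(p_biased, lik_pet, lik_fair)

    print(p_correct)  # ~0: honest updating rejects the pet hypothesis
    print(p_biased)   # ~1: cherry-picked updating "confirms" it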
Less Wrong
Okay, I'm out. The insults are too blunt and came too soon, without any justification so far. SIAI (now MIRI) as a doomsday cult is a strawman. As for LessWrong, Eliezer specifically warned about the dangers of using rationalist tools for rhetorical purposes - about how your own biases can increase when you know about biases.
I'm not going to make the effort required to search for and debunk any justification that might come later. He just used up my patience.
This was posted on the Less Wrong Facebook page recently, where the reaction was mostly "There's no substantive criticism in this article, only mudslinging".
Other reactions include "I was raised in an actual cult, and the differences I see include that LW encourages debate and EY is frequently criticized."
Some commenters did agree with at least parts of the article. Even the ones that agreed, though, said it seemed to be more "personal attacks" than "excellent takedowns".
I personally think it seems kind of... "smug", I think, is the word I want to use? "Smug" writing isn't necessarily wrong, but it generally seems to have other priorities than discovering the truth. I disagree that I'd eventually acquire a taste for this "vicious" kind of writing - I've read smug writing that I otherwise agree with, and it still makes me uncomfortable.
The point seems to be that while there's been a lot of criticism of snark (too much negativity), smarm (too much positivity) is also bad.
It seems to me that the way he describes them, snark and smarm are pretty similar: snark is being smug (acting unnecessarily superior) about someone else doing something wrong in judgment (like being ignorant or making a bad decision), while smarm is being smug about someone else doing something morally wrong.
I would probably mostly agree with the idea that smarm is bad, although I'm not sure I would have spent 30 pages on it.
He defends snark as criticism of smarm, but I think it's important to note that while criticism or negativity itself isn't necessarily wrong, it's the smugness that makes it bad. I'd probably describe smugness as criticism mainly for looking good to your ingroup, and for making your target feel bad.
For instance, I can imagine a good teacher criticizing me, but I can't imagine a good teacher being smug about it. Being smug seems correlated with impure motives, which decreases the accuracy and diminishes the trustworthiness of the criticism, to me.
The article is less about opting for snark over smarm than it is about how reading too much into the method of delivery (e.g. "I can't imagine a good teacher being smug about it") is itself politicized in a particular way.
It kinda runs counter to the position you're advancing, but I thought you'd find it interesting anyway.
Fair enough. I started skimming around halfway through so I missed that.
I'd agree that there are limits to what you can read into the method of delivery, but:
I think anger and indignation are valid attitudes to have, and that seems to be largely what he's defending. They're not like smugness, in that I can still imagine someone who sincerely cares about the truth having them.
I'd also point out that I react with discomfort when I see smugness from my side, and annoyance when I see smugness from the other side, and these are physiological reactions that can't exactly be controlled.
I guess I didn't see much actual content in the Bond article (ironically, most of _that_ article was criticizing method of delivery), so the method of delivery was all I had to comment on.
This is written by a person who doesn't know what he's talking about. Any time people diss Bayesian stats and name Yudkowsky in their argument (a controversial guy without a high school degree), they haven't got a clue what they are saying.
Bayesian stats is a solid mathematical theory. It is used everywhere in academia and industry, is the foundation of a lot of AI & Machine Learning research, powers many technologies that, among other things, allow you to locate this thread on the Internet and post that ignorant link here.
This is why people don't take philosophers seriously: They apparently can only attack strawmen, and badly at that, using emotion in place of reasoned arguments.
It seems that, when using Bayesian probability to estimate parameters from data, the upper bound overestimates by a growing factor as the size of the data grows.
That does not make sense to me; can someone explain it to me as if I were an idiot?
PS: one of the researchers in his lab used to say: if you try too hard to find a physical law, you might eventually find it.
(quote from the thesis below)
> Most of the results concerning the behavior of the mutual information, observed for this particular family, are "universal," in that they will be qualitatively the same for any problem that can be formulated as either a parameter estimation task or a neural coding task.
> [...]
> Besides the asymptotic regime p large, N arbitrary, we have also considered the case of large N at any given value of α = p/N. In this regime we have both replica calculations and exact bounds, in particular, an upper bound for the class information and explicit upper and lower bounds for the mutual information obtained with the techniques of [7]. The results suggest that the replica symmetry ansatz give the correct solution. The lower bound is then quite good whereas the upper bound overestimate the mutual information by a factor that keeps increasing with the data size.
So let me get this straight: statistics can be misused? I almost stopped reading as soon as I saw the image from The Big Bang Theory. (That show presents nerds the way other people want to see them, not the way nerds actually are.) It appears that the author of this article is just a journalist:
I don't see anything in there about him ever being a scientist, a statistician, or anyone who would actually use this theorem. In fact, he even went to school for journalism.
I really don't understand where this author is coming from when he says things like, "I conveniently decided that Bayes was a passing fad." Why should we care about his opinion on the matter? Is he reporting to us what Bayes' Theorem is and its significance, or is he giving us his uninformed opinion on the matter?