Heronian triangle: Difference between revisions

From formulasearchengine
Jump to navigation Jump to search
No edit summary
en>MartinThoma
Replaced PNG by SVG
 
Line 1: Line 1:
'''Base rate fallacy''', also called '''base rate neglect''' or '''base rate bias''', is an error in thinking. If presented with related [[base rate]] information (i.e. generic, general information) and specific information (information only pertaining to a certain case), the mind tends to ignore the former and focus on the latter. This is what the base rate fallacy refers to.<ref>{{cite web|url=http://www.fallacyfiles.org/baserate.html |title=Logical Fallacy: The Base Rate Fallacy |publisher=Fallacyfiles.org |date= |accessdate=2013-06-15}}</ref>
Hiya and welcome there, I am Adrianne and I totally dig that name. Vermont has always been my current home and I really love every day living this site. Gardening is what I do regular. I am a people manager but rather soon I'll be without any help. You can find my website here: http://prometeu.net<br><br>Here is my weblog: [http://prometeu.net clash of clans hack deutsch]
 
==Example 1==
: John is a man wearing outstanding [[Gothic fashion|gothic]] inspired clothing with long black hair who listens to [[death metal]]. How likely is it that he is a [[Christian]] and how likely is it that he is a [[Satanist]]?
 
If people were asked this question, they would likely underestimate the probability of him being a Christian, and overestimate the probability of him being a Satanist. This is because they would ignore that the base rate of being a Christian (there are about 2 billion in the world) is vastly higher than that of being a Satanist (estimated to be in the thousands).<ref>{{cite web|url=http://www.religioustolerance.org/satanism.htm|accessdate=March 24, 2013|date=March 2006|title=Religious Satanism, 16th century Satanism, Satanic Dabbling, etc|publisher=Ontario Consultants on Religious Tolerance|author=B.A. Robinson}}</ref>
 
==Example 2==
: A group of policemen have [[breathalyzer]]s displaying false drunkness in 5% of the cases. However, the breathalyzers never fail to detect a truly drunk person. 1/1000 of drivers are driving drunk. Suppose the policemen then stop a driver at random, and force them to take a breathalyzer test. It indicates that he or she is drunk. We assume you don't know anything else about him or her. How high is the probability he or she really is drunk?
 
Many would answer as high as 0.95, but the correct probability is about 0.02.
 
To find the correct answer, one should use [[Bayes' theorem]]. The goal is to find the probability that the driver is drunk given that the breathalyzer indicated he/she is drunk, which can be represented as
:<math>p(drunk|D)</math>
where "D" means that the breathalyzer indicates that the driver is drunk. Bayes' Theorem tells us that
:<math>p(drunk|D) = \frac{p(D | drunk)\, p(drunk)}{p(D)}</math>
We were told the following in the first paragraph:
:<math>p(drunk) = 0.001</math>
:<math>p(sober) = 0.999 </math>
:<math>p(D|drunk) = 1.00 </math>
:<math>p(D|sober) = 0.05</math>
As you can see from the formula, one needs p(D) for Bayes' Theorem, which one can compute from the preceding values using
:<math>p(D) = p(D | drunk)\,p(drunk)+p(D|sober)\,p(sober)</math>
which gives
:<math>p(D)=0.05095</math>
Plugging these numbers into Bayes' Theorem, one finds that
:<math>p(drunk|D) = 0.019627\cdot</math>
 
A more intuitive explanation: in average, for every 1000 drivers tested,
* 1 driver is drunk, and it is 100% certain that for that driver there is a ''true'' positive test result, so there is 1 ''true'' positive test result
* 999 drivers are not drunk, and among those drivers there are 5% ''false'' positive test results, so there are 49.95 ''false'' positive test results
therefore the probability that one of the drivers among the 1 + 49.95 = 50.95 positive test results really is drunk is <math>p(drunk|D) = 1/50.95 \approx 0.019627</math>.
 
==Example 3==
In a city of 1 million inhabitants let there be 100 terrorists and 999,900 non-terrorists. To simplify the example, it is assumed that all people present in the city are inhabitants. Thus, the base rate probability of a randomly selected inhabitant of the city being a terrorist is 0.0001, and the base rate probability of that same inhabitant being a non-terrorist is 0.9999. In an attempt to catch the terrorists, the city installs an alarm system with a surveillance camera and automatic facial recognition software.
 
The software has two failure rates of 1%:
* The false negative rate: If the camera scans a terrorist, a bell will ring 99% of the time, and it will fail to ring 1% of the time.
* The false positive rate: If the camera scans a non-terrorist, a bell will not ring 99% of the time, but it will ring 1% of the time.
 
Suppose now that an inhabitant triggers the alarm. What is the chance that the person is a terrorist? In other words, what is P(T | B), the probability that a terrorist has been detected given the ringing of the bell? Someone making the 'base rate fallacy' would infer that there is a 99% chance that the detected person is a terrorist. Although the inference seems to make sense, it is actually bad reasoning, and a calculation below will show that the chances they are a terrorist are actually near 1%, not near 99%.
 
The fallacy arises from confusing the natures of two different failure rates. The 'number of non-bells per 100 terrorists' and the 'number of non-terrorists per 100 bells' are unrelated quantities. One does not necessarily equal the other, and they don't even have to be almost equal. To show this, consider what happens if an identical alarm system were set up in a second city with no terrorists at all. As in the first city, the alarm sounds for 1 out of every 100 non-terrorist inhabitants detected, but unlike in the first city, the alarm never sounds for a terrorist. Therefore 100% of all occasions of the alarm sounding are for non-terrorists, but a false negative rate cannot even be calculated. The 'number of non-terrorists per 100 bells' in that city is 100, yet P(T | B) = 0%. There is zero chance that a terrorist has been detected given the ringing of the bell.
 
Imagine that the city's entire population of one million people pass in front of the camera. About 99 of the 100 terrorists will trigger the alarm — and so will about 9,999 of the 999,900 non-terrorists. Therefore, about 10,098 people will trigger the alarm, among which about 99 will be terrorists. So, the probability that a person triggering the alarm actually is a terrorist, is only about 99 in 10,098, which is less than 1%, and very, very far below our initial guess of 99%.
 
The base rate fallacy is so misleading in this example because there are many more non-terrorists than terrorists.
 
==Findings in psychology==
In experiments, people have been found to prefer individuating information over general information when the former is available.<ref>{{cite journal|last=Bar-Hillel|first=Maya|title=The base-rate fallacy in probability judgments|journal=Acta Psychologica|year=1980|volume=44|pages=211–233}}</ref><ref name="kv1"/><ref>{{cite book|last=Kahneman|first=Daniel|title=Judgment under uncertainty: Heuristics and biases|year=1985|pages=153–160|coauthors=Amos Tversky|editor=Daniel Kahneman, Paul Slovic & Amos Tversky (Eds.)|chapter=Evidential impact of base rates}}</ref>
 
In some experiments, students were asked to estimate the [[grade point average]]s (GPAs) of hypothetical students. When given relevant statistics about GPA distribution, students tended to ignore them if given descriptive information about the particular student, even if the new descriptive information was obviously of little or no relevance to school performance.<ref name="kv1"/> This finding has been used to argue that interviews are an unnecessary part of the [[college admissions]] process because interviewers are unable to pick successful candidates better than basic statistics.
 
[[Psychologist]]s [[Daniel Kahneman]] and [[Amos Tversky]] attempted to explain this finding in terms of a [[heuristics in judgment and decision making|simple rule or "heuristic"]] called [[representativeness heuristic|representativeness]]. They argued that many judgements relating to likelihood, or to cause and effect, are based on how representative one thing is of another, or of a category.<ref name="kv1">{{cite journal|last=Kahneman|first=Daniel|coauthors=Amos Tversky|title=On the psychology of prediction|journal=Psychological Review|year=1973|volume=80|pages=237–251|doi=10.1037/h0034747}}</ref> Kahneman considers base rate neglect to be a specific form of [[extension neglect]].<ref>{{cite book|last=Kahneman|first=Daniel|title=Choices, Values and Frames|year=2000|editor=Daniel Kahneman and Amos Tversky (Eds.)|chapter=Evaluation by moments, past and future}}</ref> [[Richard Nisbett]] has argued that some [[attributional bias]]es like the [[fundamental attribution error]] are instances of the base rate fallacy: people underutilize "consensus information" (the "base rate") about how others behaved in similar situations and instead prefer simpler [[dispositional attribution]]s.<ref>{{cite book|last=Nisbett|first=Richard E.|title=Cognition and social behavior|year=1976|coauthors=E. Borgida, R. Crandall & H. Reed|editor=J. S. Carroll & J. W. Payne (Eds.)|chapter=Popular induction: Information is not always informative|pages=227–236|volume=2}}</ref>
 
There is considerable debate in psychology on the conditions under which people do or do not appreciate base rate information.<ref name="Koehler1996">{{Cite doi|10.1017/S0140525X00041157}}</ref><ref name="BarbeySloman2007">{{Cite doi|10.1017/S0140525X07001653}}</ref>
Researchers in the heuristics-and-biases program have stressed empirical findings showing that people tend to ignore base rates and make inferences that violate certain norms of probabilistic reasoning, such as [[Bayes’ theorem]]. The conclusion drawn from this line of research was that human probabilistic thinking is fundamentally flawed and error-prone.
<ref name="TverskyKahneman1974">{{Cite doi|10.1126/science.185.4157.1124
}}</ref>
Other researchers have emphasized the link between cognitive processes and information formats, arguing that such conclusions are not generally warranted.<ref>{{cite journal|last=Cosmides|first=Leda| coauthors=John Tooby|title=Are humans good intuitive statisticians after all? Rethinking some conclusions of the literature on judgment under uncertainty|journal=Cognition|year=1996|volume=58|pages=1–73}}</ref><ref name="GigerenzerHoffrage1995">{{Cite doi|10.1037/0033-295X.102.4.684}}</ref>
 
Consider again Example 2 from above. The required inference is to estimate the (posterior) probability that a (randomly picked) driver is drunk, given that the breathalyzer test is positive. Formally, this probability can be calculated using [[Bayes’ theorem]], as shown above. However, there are different ways of presenting the relevant information. Consider the following, formally equivalent variant of the problem:
 
: &nbsp;1 out of 1000 drivers are driving drunk. The breathalyzers never fail to detect a truly drunk person. For 50 out of the 999 drivers who are not drunk the breathalyzer falsely displays drunkness. Suppose the policemen then stop a driver at random, and force them to take a breathalyzer test. It indicates that he or she is drunk. We assume you don't know anything else about him or her. How high is the probability he or she really is drunk?
 
In this case, the relevant numerical information—''p''(drunk), ''p''(D | drunk), ''p''(D | sober)—is presented in terms of natural frequencies with respect to a certain reference class (see [[reference class problem]]). Empirical studies show that people’s inferences correspond more closely to Bayes’ rule when information is presented this way, helping to overcome base-rate neglect in laypeople<ref name="GigerenzerHoffrage1995" /> and experts.<ref name="Hoffrage2000">{{Cite doi|10.1126/science.290.5500.2261}}</ref> As a consequence, organizations like the [[Cochrane Collaboration]] recommend using this kind of format for communicating health statistics.<ref name="Cochrane2011">{{Cite doi|10.1002/14651858.CD006776.pub2}}</ref> Teaching people to translate these kinds of Bayesian reasoning problems into natural frequency formats is more effective than merely teaching them to plug probabilities (or percentages) into [[Bayes’ theorem]].<ref name="SedlmeierGigerenzer2002">{{Cite doi|10.1037/0096-3445.130.3.380}}</ref> It has also been shown that graphical representations of natural frequencies (e.g., icon arrays) help people to make better inferences.<ref name="SedlmeierGigerenzer2002" /><ref name="Brase2008">{{Cite doi|10.1002/acp.1460}}</ref><ref name="Edwards2002">{{Cite doi|10.1136/bmj.324.7341.827}}</ref>
 
Why are natural frequency formats helpful? One important reason is that this information format facilitates the required inference because it simplifies the necessary calculations. This can be seen when using an alternative way of computing the required probability ''p''(drunk|D):
 
:<math>p(drunk| D) = \frac{N(drunk \cap D)}{N(D)} = \frac{1}{51} = 0.0196</math>
 
where ''N''(drunk &cap; D) denotes the number of drivers that are drunk and get a positive breathalyzer result, and ''N''(D) denotes the total number of cases with a positive breathalyzer result. The equivalence of this equation to the above one follows from the axioms of probability theory, according to which ''N''(drunk &cap; D) = ''p'' (D | drunk) × ''p'' (drunk). Importantly, although this equation is formally equivalent to Bayes’ rule, it is not psychologically equivalent. Using natural frequencies simplifies the inference because the required mathematical operation can be performed on natural numbers, instead of normalized fractions (i.e., probabilities), because it makes the high number of false positives more transparent, and because natural frequencies exhibit a “nested-set structure”.<ref name="Girotto2001">{{Cite doi|10.1016/S0010-0277(00)00133-5}}</ref><ref name="Hoffrage2002" />
 
It is important to note that not any kind of frequency format facilitates Bayesian reasoning.<ref name="Hoffrage2002">{{Cite doi|10.1016/S0010-0277(02)00050-1}}</ref><ref name="GigerenzerHoffrage1999">{{Cite doi|10.1037/0033-295X.106.2.425}}</ref>
Natural frequencies refer to frequency information that results from ''natural sampling'',<ref name="Kleiter1994">{{Cite doi|10.1007/978-1-4612-4308-3_27}}</ref> which preserves base rate information (e.g., number of drunken drivers when taking a random sample of drivers). This is different from ''systematic sampling'', in which base rates are fixed a priori (e.g., in scientific experiments). In the latter case it is not possible to infer the posterior probability ''p'' (drunk | positive test) from comparing the number of drivers who are drunk and test positive compared to the total number of people who get a positive breathalyzer result, because base rate information is not preserved and must be explicitly re-introduced using [[Bayes’ theorem]].
 
==See also==
* [[Bayesian probability]]
* [[Data dredging]]
* [[False positive paradox]]
* [[Inductive argument]]
* [[List of cognitive biases]]
* [[Misleading vividness]]
* [[Prosecutor's fallacy]]
 
==References list==
<references />
 
==External links==
* [http://www.fallacyfiles.org/baserate.html The Base Rate Fallacy] The Fallacy Files
* [https://www.cia.gov/library/center-for-the-study-of-intelligence/csi-publications/books-and-monographs/psychology-of-intelligence-analysis/art15.html#ft145 Psychology of Intelligence Analysis: Base Rate Fallacy]
* [http://www.youtube.com/watch?v=D8VZqxcu0I0 The base rate fallacy explained visually] (Video)
* [http://www.eeps.com/riskicon/ Interactive page for visualizing statistical information and Bayesian inference problems]
*  [http://ipdas.ohri.ca/IPDAS-Chapter-C.pdf Current ‘best practice’ for communicating probabilities in health according to the International Patient Decision Aid Standards (IPDAS) Collaboration]
 
{{Relevance fallacies}}
 
[[Category:Relevance fallacies]]
[[Category:Cognitive biases]]
[[Category:Behavioral finance]]
[[Category:Probability fallacies]]

Latest revision as of 12:17, 8 January 2015

Hiya and welcome there, I am Adrianne and I totally dig that name. Vermont has always been my current home and I really love every day living this site. Gardening is what I do regular. I am a people manager but rather soon I'll be without any help. You can find my website here: http://prometeu.net

Here is my weblog: clash of clans hack deutsch