Quartic: Difference between revisions

From formulasearchengine
Jump to navigation Jump to search
en>Schneelocke
en>Yobot
m WP:CHECKWIKI error fixes using AWB (10072)
 
Line 1: Line 1:
In [[statistics]], the '''correlation ratio''' is a measure of the relationship between the [[statistical dispersion]] within individual categories and the dispersion across the whole population or sample. The measure is defined as the ''ratio'' of two [[standard deviation]]s representing these types of variation. The context here is the same as that of the [[intraclass correlation coefficient]], whose value is the square of the correlation ratio.
Hi there, I am Adrianne and I totally get that name. I am a people manager only soon I'll be without any help. Gardening is what I do every week. Guam has always been my home. See what's new on this is my website here: http://prometeu.net<br><br>My page [http://prometeu.net how to hack clash of clans with cydia]
 
==Definition==
 
Suppose each observation is ''y<sub>xi</sub>'' where ''x'' indicates the category that observation is in and ''i'' is the label of the particular observation.  Let ''n<sub>x</sub>'' be the number of observations in category ''x'' and
:<math>\overline{y}_x=\frac{\sum_i y_{xi}}{n_x}</math>  and  <math>\overline{y}=\frac{\sum_x n_x \overline{y}_x}{\sum_x n_x},</math>
 
where <math>\overline{y}_x</math> is the mean of the category ''x'' and <math>\overline{y}</math> is the mean of the whole population. The correlation ratio η ([[eta (letter)|eta]]) is defined as to satisfy
 
:<math>\eta^2 = \frac{\sum_x n_x (\overline{y}_x-\overline{y})^2}{\sum_{x,i} (y_{xi}-\overline{y})^2}</math>
 
which can be written as
:<math>\eta^2 = \frac{{\sigma_{\overline{y}}}^2}{{\sigma_{y}}^2}, \text{ where }{\sigma_{\overline{y}}}^2 = \frac{\sum_x n_x (\overline{y}_x-\overline{y})^2}{\sum_x n_x} \text{ and } {\sigma_{y}}^2 = \frac{\sum_{x,i} (y_{xi}-\overline{y})^2}{n},</math>
i.e. the weighted variance of the category means divided by the variance of all samples.
It is worth noting that if the relationship between values of <math>x \;\ </math> and values of <math>\overline{y}_x</math> is linear (which is certainly true when there are only two possibilities for ''x'') this will give the same result as the square of Pearson's [[Pearson product-moment correlation coefficient|correlation coefficient]], otherwise the correlation ratio will be larger in magnitude. It can therefore be used for judging non-linear relationships.
 
==Range==
The correlation ratio <math>\eta</math> takes values between 0 and 1. The limit <math>\eta=0</math> represents the special case of no dispersion among the means of the different categories, while <math>\eta=1</math> refers to no dispersion within the respective categories. Note further, that <math>\eta</math> is undefined when all data points of the complete population take the same value.
 
==Example==
Suppose there is a distribution of test scores in three topics (categories):
*Algebra: 45, 70, 29, 15 and 21 (5 scores)
*Geometry: 40, 20, 30 and 42 (4 scores)
*Statistics: 65, 95, 80, 70, 85 and 73 (6 scores).
Then the subject averages are 36, 33 and 78, with an overall average of 52.
 
The sums of squares of the differences from the subject averages are 1952 for Algebra, 308 for Geometry and 600 for Statistics, adding to 2860. The overall sum of squares of the differences from the overall average is 9640. The difference of 6780 between these is also the weighted sum of the square of the differences between the subject averages and the overall average:
:<math>5 (36-52)^2 + 4 (33-52)^2 +6 (78-52)^2 = 6780</math>
This gives
:<math>\eta^2 = \frac{6780}{9640}=0.7033\ldots</math>
suggesting that most of the overall dispersion is a result of differences between topics, rather than within topics.  Taking the square root
:<math>\eta = \sqrt{\frac{6780}{9640}}=0.8386\ldots</math>   
Observe that for <math>\eta = 1</math> the overall sample dispersion is purely due to dispersion among the categories and not at all due to dispersion within the individual categories. For a quick comprehension simply imagine all Algebra, Geometry, and Statistics scores being the same respectively, e.g. 5 times 36, 4 times 33, 6 times 78.
 
The limit <math>\eta = 0</math> refers to the case without dispersion in the categories contributing to the overall dispersion. The trivial requirement for this extreme is that all category means are the same.
 
==Pearson v. Fisher==
The correlation ratio was introduced by [[Karl Pearson]] as part of [[analysis of variance]]. [[Ronald Fisher]] commented:
<blockquote>''As a descriptive statistic the utility of the correlation ratio is extremely limited. It will be noticed that the number of [[Degrees of freedom (statistics)|degrees of freedom]] in the numerator of <math>\eta^2</math> depends on the number of the arrays''<ref>[[Ronald Fisher]] (1926) ''[[Statistical Methods for Research Workers]]'', ISBN 0-05-002170-2 [http://psychclassics.yorku.ca/Fisher/Methods/chap8.htm (excerpt)]</ref></blockquote> to which [[Egon Pearson]] (Karl's son) responded by saying
<blockquote>''Again, a long-established method such as the use of the correlation ratio [§45 The "Correlation Ratio" η] is passed over in a few words without adequate description, which is perhaps hardly fair to the student who is given no opportunity of judging its scope for himself.''<ref>Pearson E.S. (1926) "Review of Statistical Methods for Research Workers (R. A. Fisher)", ''Science Progress'', 20, 733-734. [http://www.economics.soton.ac.uk/staff/aldrich/fisherguide/esp.htm#esp1 (excerpt)]</ref></blockquote>
 
{{refimprove|date=August 2011}}
{{inline|date=August 2011}}
==References==
<references/>
 
[[Category:Covariance and correlation]]
[[Category:Statistical ratios]]

Latest revision as of 12:52, 6 April 2014

Hi there, I am Adrianne and I totally get that name. I am a people manager only soon I'll be without any help. Gardening is what I do every week. Guam has always been my home. See what's new on this is my website here: http://prometeu.net

My page how to hack clash of clans with cydia