|
|
Line 1: |
Line 1: |
| In [[probability theory]] and [[statistics]], there are several relationships among [[probability distributions]]. These relations can be categorized in the following groups:
| | My name is Dominik (42 years old) and my hobbies are Taxidermy and Board sports.<br><br>My web site; [http://tinyurl.com/kecvhhb http://tinyurl.com/kecvhhb] |
| *One distribution is a special case of another with a broader parameter space
| |
| *Transforms (function of a random variable);
| |
| *Combinations (function of several variables);
| |
| *Approximation (limit) relationships;
| |
| *Compound relationships (useful for Bayesian inference);
| |
| *[[Duality (mathematics)|Duality]];
| |
| *[[Conjugate prior]]s.
| |
| | |
| [[File:Relationships among some of univariate probability distributions.jpg|thumb|700px|center|Relationships among some of univariate probability distributions are illustrated with connected lines. dashed lines means approximate relationship. more info:<ref>{{cite journal|last=LEEMIS|first=Lawrence M.|coauthors=Jacquelyn T. MCQUESTON|title=Univariate Distribution Relationships|journal=American Statistician|date=February 2008|volume=62|issue=1|pages=45–53|url=http://www.math.wm.edu/~leemis/2008amstat.pdf}}</ref>]]
| |
| | |
| ==Special case of distribution parametrization==
| |
| | |
| * A [[binomial distribution|binomial]] (n, p) random variable with n = 1, is a [[Bernoulli distribution|Bernoulli]] (p) random variable.
| |
| * A [[negative binomial distribution]] with r = 1 is a [[geometric distribution]].
| |
| * A [[gamma distribution]] with shape parameter α = 1 and scale parameter β is an [[exponential distribution|exponential]] (β) distribution.
| |
| * A [[gamma distribution|gamma]] (α, β) random variable with α = ν/2 and β = 2, is a [[chi-squared distribution|chi-squared]] random variable with ν [[degrees of freedom]].
| |
| * A [[chi-squared distribution]] with 2 degrees of freedom is an [[exponential distribution]] with mean 2 and vice versa.
| |
| * A [[Weibull distribution|Weibull]] (1, β) random variable is an [[exponential distribution|exponential]] random variable with mean β.
| |
| * A [[beta distribution|beta]] random variable with parameters α = β = 1 is a [[uniform distribution (continuous)|uniform]] random variable.
| |
| * A [[beta-binomial distribution|beta-binomial]] (n, 1, 1) random variable is a discrete uniform random variable over the values 0 ... n.
| |
| * A random variable with a [[Student's t-distribution|t distribution]] with one degree of freedom is a [[Cauchy distribution|Cauchy]](0,1) random variable.
| |
| | |
| == Transform of a variable ==
| |
| | |
| === Multiple of a random variable ===
| |
| | |
| Multiplying the variable by any positive real constant yields a '''scaling''' of the original distribution.
| |
| Some are self-replicating, meaning that the scaling yields the same family of distributions, albeit with a different parameter:
| |
| [[Normal distribution]], [[Gamma distribution]], [[Cauchy distribution]], [[Exponential distribution]], [[Erlang distribution]], [[Weibull distribution]], [[Logistic distribution]], [[Error distribution]], [[Power distribution]], [[Rayleigh distribution]].
| |
| | |
| '''Example: '''
| |
| * If X is a gamma random variable with parameters (r, λ), then Y=aX is a gamma random variable with parameters (r, aλ).
| |
| | |
| === Linear function of a random variable ===
| |
| | |
| The affine transfom ''ax + b'' yields a '''relocation and scaling''' of the original distribution. The following are self-replicating:
| |
| [[Normal distribution]], [[Cauchy distribution]], [[Logistic distribution]], [[Error distribution]], [[Power distribution]], [[Rayleigh distribution]].
| |
| | |
| '''Example: '''
| |
| * If Z is a normal random variable with parameters (μ=m, σ<sup>2</sup>=s<sup>2</sup>), then X=aZ+b is a normal random variable with parameters (μ=am+b, σ<sup>2</sup>=a<sup>2</sup>s<sup>2</sup>).
| |
| | |
| === Reciprocal of a random variable ===
| |
| | |
| The reciprocal ''1/X'' of a random variable ''X'', is a member of the same family of distribution as ''X'', in the following cases:
| |
| [[Cauchy distribution]], [[F distribution]], [[log logistic distribution]].
| |
| | |
| '''Examples: '''
| |
| * If X is a Cauchy (μ, σ) random variable, then 1/X is a Cauchy (μ/C, σ/C) random variable where C = μ<sup>2</sup> + σ<sup>2</sup>.
| |
| * If X is an F(ν<sub>1</sub>, ν<sub>2</sub>) random variable then 1/X is an F(ν<sub>2</sub>, ν<sub>1</sub>) random variable.
| |
| | |
| === Other cases ===
| |
| Some distributions are invariant under a specific transformation.
| |
| | |
| '''Example: '''
| |
| * If X is a '''beta''' (α, β) random variable then (1 - X) is a '''beta''' (β, α) random variable.
| |
| * If X is a '''binomial''' (n, p) random variable then (n - X) is a ''binomial''' (n, 1-p) random variable.
| |
| | |
| * If ''X'' has [[cumulative distribution function]] ''F''<sub>''X''</sub>, then ''F''<sub>''X''</sub>(''X'') is a standard '''uniform''' (0,1) random variable
| |
| * If ''X'' is a '''normal''' (μ, σ<sup>2</sup>) random variable then ''e''<sup>''X''</sup> is a '''lognormal''' (μ, σ<sup>2</sup>) random variable.
| |
| :Conversely, if ''X'' is a lognormal (μ, σ<sup>2</sup>) random variable then log ''X'' is a normal (μ, σ<sup>2</sup>) random variable.
| |
| * If ''X'' is an '''exponential''' random variable with mean β, then ''X''<sup>1/γ</sup> is a '''Weibull''' (γ, β) random variable.
| |
| * The square of a '''standard normal''' random variable has a '''chi-squared''' distribution with one degree of freedom.
| |
| * If ''X'' is a '''[[Student’s t-distribution|Student’s t]]''' random variable with ν degree of freedom, then ''X''<sup>2</sup> is an '''''F''''' (1,ν) random variable.
| |
| * If ''X'' is a '''double exponential''' random variable with mean 0 and scale λ, then |''X''| is an '''exponential''' random variable with mean λ.
| |
| * A '''geometric''' random variable is the [[Floor and ceiling functions|floor]] of an '''exponential''' random variable.
| |
| * A '''[[rectangular distribution|rectangular]]''' random variable is the floor of a '''uniform''' random variable.
| |
| | |
| == Functions of several variables ==
| |
| | |
| === Sum of variables ===
| |
| The distribution of the sum of independent random variables is called the [[Convolution of probability distributions|convolution]] of the primal distribution.
| |
| | |
| *If it has a distribution from the same family of distributions as the original variables, that family of distributions is said to be ''closed under convolution''.
| |
| | |
| Examples of such [[univariate distribution]]s are:
| |
| [[Normal distribution]], [[Poisson distribution]], [[Binomial distribution]] (with common success probability), , [[Negative binomial distribution]] (with common success probability), [[Gamma distribution]](with common rate parameter), [[Chi-squared distribution]], [[Cauchy distribution]], [[Hyper-exponential distribution]].
| |
| | |
| '''Examples:<ref>{{cite web|last=Cook|first=John D.|title=Diagram of distribution relationships|url=http://www.johndcook.com/distribution_chart.html}}</ref>'''{{citation needed|date=January 2013|reason=no proofs in Cook ref}}
| |
| **If X<sub>1</sub> and X<sub>2</sub> are '''Poisson''' random variables with means μ<sub>1</sub> and μ<sub>2</sub> respectively, then X<sub>1</sub> + X<sub>2</sub> is a '''Poisson''' random variable with mean μ<sub>1</sub> + μ<sub>2</sub>.
| |
| ** The sum of '''gamma''' (''n''<sub>i</sub> , β) random variables has a '''gamma '''(Σ''n''<sub>i</sub> , β) distribution.
| |
| **If X<sub>1</sub> is a '''Cauchy''' (μ<sub>1</sub>, σ<sub>1</sub>) random variable and X<sub>2</sub> is a Cauchy (μ<sub>2</sub>, σ<sub>2</sub>), then X<sub>1</sub> + X<sub>2</sub> is a '''Cauchy''' (μ<sub>1</sub> + μ<sub>2</sub>, σ<sub>1</sub> + σ<sub>2</sub>) random variable.
| |
| **If X<sub>1</sub> and X<sub>2</sub> are '''chi-squared''' random variables with ν<sub>1</sub> and ν<sub>2</sub> degrees of freedom respectively, then X<sub>1</sub> + X<sub>2</sub> is a chi-squared random variable with ν<sub>1</sub> + ν<sub>2</sub> degrees of freedom.
| |
| **If X<sub>1</sub> is a '''normal''' (μ<sub>1</sub>, σ<sub>1</sub><sup>2</sup>) random variable and X<sub>2</sub> is a normal (μ<sub>2</sub>, σ<sub>2</sub><sup>2</sup>) random variable, then X<sub>1</sub> + X<sub>2</sub> is a normal (μ<sub>1</sub> + μ<sub>2</sub>, σ<sub>1</sub><sup>2</sup> + σ<sub>2</sub><sup>2</sup>) random variable.
| |
| **The sum of N chi-squared (1) random variables has a chi-squared distribution with N degrees of freedom.
| |
| | |
| Other distributions are not closed under convolution, but their sum has a known distribution:
| |
| * The sum of ''n'' '''Bernoulli''' (p) random variables is a '''binomial''' (''n'', p) random variable.
| |
| * The sum of ''n'' '''geometric''' random variable with probability of success ''p'' is a '''negative binomial''' random variable with parameters ''n'' and ''p''.
| |
| * The sum of ''n'' '''exponential''' (β) random variables is a '''gamma''' (''n'', β) random variable.
| |
| *The sum of the squares of N '''standard normal''' random variables has a '''chi-squared''' distribution with N degrees of freedom.
| |
| | |
| === Product of variables ===
| |
| | |
| The product of independent random variables ''X'' and ''Y'' may belong to the same family of distribution as ''X'' and ''Y'':
| |
| [[Bernoulli distribution]] and [[Log-normal distribution]].
| |
| | |
| '''Example: '''
| |
| * If X<sub>1</sub> and X<sub>2</sub> are independent '''log-normal''' random variables with parameters (μ<sub>1</sub>, σ<sub>1</sub><sup>2</sup>) and (μ<sub>2</sub>, σ<sub>2</sub><sup>2</sup>) respectively, then X<sub>1</sub> X<sub>2</sub> is a '''log-normal''' random variable with parameters (μ<sub>1</sub> + μ<sub>2</sub>, σ<sub>1</sub><sup>2</sup> + σ<sub>2</sub><sup>2</sup>).
| |
| | |
| === Minimum and maximum of independent random variables ===
| |
| | |
| For some distributions, the '''minimum''' value of several independent random variables is a member of the same family, with different parameters:
| |
| [[Bernoulli distribution]], [[Geometric distribution]], [[Exponential distribution]], [[Extreme value distribution]], [[Pareto distribution]], [[Rayleigh distribution]], [[Weibull distribution]].
| |
| | |
| '''Examples: '''
| |
| * If X<sub>1</sub> and X<sub>2</sub> are independent '''geometric''' random variables with probability of success p<sub>1</sub> and p<sub>2</sub> respectively, then min(X<sub>1</sub>, X<sub>2</sub>) is a geometric random variable with probability of success p = p<sub>1</sub> + p<sub>2</sub> - p<sub>1</sub> p<sub>2</sub>. The relationship is simpler if expressed in terms probability of failure: q = q<sub>1</sub> q<sub>2</sub>.
| |
| * If X<sub>1</sub> and X<sub>2</sub> are independent '''exponential''' random variables with mean μ<sub>1</sub> and μ<sub>2</sub> respectively, then min(X<sub>1</sub>, X<sub>2</sub>) is an exponential random variable with mean μ<sub>1</sub> μ<sub>2</sub>/(μ<sub>1</sub> + μ<sub>2</sub>).
| |
| | |
| Similarly, distributions for which the '''maximum''' value of several independent random variables is a member of the same family of distribution include:
| |
| [[Bernoulli distribution]], [[Power distribution]].
| |
| | |
| === Other ===
| |
| | |
| * If ''X'' and ''Y'' are independent '''standard normal''' random variables, ''X''/''Y'' is a '''Cauchy''' (0,1) random variable.
| |
| * If X<sub>1</sub> and X<sub>2</sub> are '''chi-squared''' random variables with ν<sub>1</sub> and ν<sub>2</sub> degrees of freedom respectively, then (X<sub>1</sub>/ν<sub>1</sub>)/(X<sub>2</sub>/ν<sub>2</sub>) is an '''''F'''''(ν<sub>1</sub>, ν<sub>2</sub>) random variable.
| |
| * If X is a '''standard normal''' random variable and U is a '''chi-squared''' random variable with ν degrees of freedom, then <math>\frac{X}{\sqrt{(U/\nu)}} </math> is a '''Student's ''t''''' (ν) random variable.
| |
| * If X<sub>1</sub> is '''gamma''' (α<sub>1</sub>, 1) random variable and X<sub>2</sub> is a gamma (α<sub>2</sub>, 1) random variable then X<sub>1</sub>/(X<sub>1</sub> + X<sub>2</sub>) is a '''beta'''(α<sub>1</sub>, α<sub>2</sub>) random variable. More generally, if X<sub>1</sub>is gamma(α<sub>1</sub>, β<sub>1</sub>) random variable and X<sub>2</sub> is gamma(α<sub>2</sub>, β<sub>2</sub>) random variable then β<sub>2</sub> X<sub>1</sub>/(β<sub>2</sub> X<sub>1</sub> + β<sub>1</sub> X<sub>2</sub>) is a beta(α<sub>1</sub>, α<sub>2</sub>) random variable.
| |
| * If ''X'' and ''Y'' are '''exponential''' random variables with mean μ, then ''X''-''Y'' is a '''[[Gumbel distribution|double exponential]]''' random variable with mean 0 and scale μ.
| |
| | |
| == Approximate (limit) relationships ==
| |
| | |
| Approximate or limit relationship means
| |
| *either that the combination of an infinite number of ''iid'' random variables tends to some distribution,
| |
| *or that the limit when a parameter tends to some value approaches to a different distribution.
| |
| | |
| '''Combination of ''iid'' random variables: '''
| |
| | |
| * Given certain conditions, the sum (hence the average) of a sufficiently large number of iid random variables, each with finite mean and variance, will be approximately normally distributed.(This is [[central limit theorem]] (CLT)).
| |
| | |
| '''Special case of distribution parametrization: '''
| |
| | |
| * X is a '''Hypergeometric''' (m, N, n) random variable. If ''n'' and ''m'' are large compared to ''N'', and ''p = m / N'' is not close to 0 or 1, then X approximately has a '''Binomial'''(n, p) Distribution.
| |
| * X is a '''beta-binomial''' random variable with parameters (n, α, β). Let p = α/(α + β) and suppose α + β is large, then X approximately has a '''binomial'''(n, p) distribution.
| |
| * If X is a '''binomial''' (n, p) random variable and if n is large and np is small then X approximately has a '''Poisson'''(np) distribution.
| |
| * If X is a '''negative binomial''' random variable with r large, P near 1, and r(1-P) = λ, then X approximately has a '''Poisson''' distribution with mean λ.
| |
| | |
| Consequences of the CLT:
| |
| * If X is a '''Poisson''' random variable with large mean, then for integers j and k, P(j ≤ X ≤ k) approximately equals to P(j - 1/2 ≤ Y ≤ k + 1/2) where Y is a '''normal''' distribution with the same mean and variance as X.
| |
| * If X is a '''binomial'''(n, p) random variable with large n and np, then for integers j and k, P(j ≤ X ≤ k) approximately equals to P(j - 1/2 ≤ Y ≤ k + 1/2) where Y is a '''normal''' random variable with the same mean and variance as X, i. e. np and np(1-p).
| |
| * If X is a '''beta''' random variable with parameters α and β equal and large, then X approximately has a '''normal''' distribution with the same mean and variance, i. e. mean α/(α + β) and variance αβ/((α+β)<sup>2</sup>(α + β + 1)).
| |
| * If X is a '''gamma'''(α, β) random variable and the shape parameter α is large relative to the scale parameter β, then X approximately has a '''normal''' random variable with the same mean and variance.
| |
| * If X is a '''Student's ''t''''' random variable with a large number of degrees of freedom ν then X approximately has a '''standard normal''' distribution.
| |
| * If X is an '''''F'''''(ν, ω) random variable with ω large, then ν X is approximately distributed As a '''chi-squared''' random variable with ν degrees of freedom.
| |
| | |
| == Compound (or Bayesian) relationships ==
| |
| | |
| When one or more parameter(s) of a distribution are random variables, the [[Compound probability distribution|compound]] distribution is the marginal distribution of the variable.
| |
| | |
| '''Examples: '''
| |
| | |
| * If X|N is a '''binomial''' (N,p) random variable, where parameter N is a random variable with negative-binomial (m, r) distribution, then X is distributed as a '''negative-binomial''' (m, r/(p+qr)).
| |
| * If X|N is a '''binomial''' (N,p) random variable, where parameter N is a random variable with Poisson (μ) distribution, then X is distributed as a '''Poisson''' (μp).
| |
| * If X|μ is a '''Poisson''' (μ) random variable and parameter μ is random variable with gamma (m, β) distribution, then X is distributed as a '''negative-binomial''' (m, μβ/(μ+β)), sometimes called [[Gamma-Poisson distribution]] if ''m'' is not integer.
| |
| | |
| Some distributions have been specially named as compounds:
| |
| [[Beta-Binomial distribution]], [[Beta-Pascal distribution]], [[Gamma-Normal distribution]].
| |
| | |
| '''Examples: '''
| |
| | |
| * If X is a Binomial (n,p) random variable, and parameter p is a random variable with beta (α, β) distribution, then X is distributed as a Beta-Binomial(α, β,n).
| |
| * If X is a negative-binomial (m,p) random variable, and parameter p is a random variable with beta (α, β) distribution, then X is distributed as a Beta-Pascal(α, β,m).
| |
| | |
| ==See also==
| |
| | |
| *[[Central limit theorem]]
| |
| *[[Compound probability distribution]]
| |
| | |
| == References ==
| |
| | |
| {{Reflist}}
| |
| | |
| == External links ==
| |
| | |
| * Interactive graphic: [http://www.math.wm.edu/~leemis/chart/UDR/UDR.html Univariate Distribution Relationships]
| |
| | |
| {{Common univariate probability distributions}}
| |
| | |
| [[Category:Probability distributions]]
| |