In [[information theory]] and [[statistics]], '''Kullback's inequality''' is a lower bound on the [[Kullback–Leibler divergence]] expressed in terms of the [[large deviations theory|large deviations]] [[rate function]].<ref>Aimé Fuchs and Giorgio Letta, ''L'inégalité de Kullback. Application à la théorie de l'estimation.'' Séminaire de probabilités (Strasbourg), vol. 4, pp. 108–131, 1970. http://www.numdam.org/item?id=SPS_1970__4__108_0</ref> If ''P'' and ''Q'' are [[probability distribution]]s on the real line whose first moments exist, and ''P'' is [[Absolute continuity#Absolute continuity of measures|absolutely continuous]] with respect to ''Q'' (written ''P'' ≪ ''Q''), then
:<math>D_{KL}(P\|Q) \ge \Psi_Q^*(\mu'_1(P)),</math>
where <math>\Psi_Q^*</math> is the rate function of <math>Q</math>, i.e. the [[convex conjugate]] of its [[cumulant]]-generating function, and <math>\mu'_1(P)</math> is the first [[Moment (mathematics)|moment]] of <math>P.</math>
The [[Cramér–Rao bound]] is a corollary of this result.
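As an illustrative numerical check, not part of the original statement, the following sketch compares both sides of the inequality when ''P'' and ''Q'' are Gaussian; the closed forms for the divergence and for the rate function of the standard normal are assumed from standard results.
<syntaxhighlight lang="python">
import numpy as np

# Illustrative check of Kullback's inequality for P = N(m, s^2) and Q = N(0, 1).
# Assumed standard closed forms:
#   D_KL(P || Q)  = -log s + (s^2 + m^2)/2 - 1/2
#   Psi_Q(theta)  = theta^2/2, hence the convex conjugate Psi_Q*(x) = x^2/2.

def kl_gaussian_vs_std_normal(m, s):
    """D_KL(N(m, s^2) || N(0, 1))."""
    return -np.log(s) + 0.5 * (s**2 + m**2) - 0.5

def rate_std_normal(x):
    """Rate function of N(0, 1): convex conjugate of its cumulant-generating function."""
    return 0.5 * x**2

rng = np.random.default_rng(0)
for _ in range(5):
    m, s = rng.normal(), rng.uniform(0.2, 3.0)
    lhs = kl_gaussian_vs_std_normal(m, s)   # D_KL(P || Q)
    rhs = rate_std_normal(m)                # Psi_Q*(first moment of P)
    print(f"m={m:+.2f}, s={s:.2f}: D_KL={lhs:.4f} >= Psi*={rhs:.4f} -> {lhs >= rhs}")
</syntaxhighlight>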
==Proof==
Let ''P'' and ''Q'' be [[probability distribution]]s (measures) on the real line, whose first moments exist, and such that [[Absolutely continuous#Absolute continuity of measures|''P'' ≪ ''Q'']]. Consider the '''[[natural exponential family]]''' of ''Q'' given by
:<math>Q_\theta(A) = \frac{\int_A e^{\theta x}Q(dx)}{\int_{-\infty}^\infty e^{\theta x}Q(dx)}
= \frac{1}{M_Q(\theta)} \int_A e^{\theta x}Q(dx)</math>
for every measurable set ''A'', where <math>M_Q</math> is the '''[[moment-generating function]]''' of ''Q''. (Note that ''Q''<sub>0</sub> = ''Q''.)
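A minimal sketch of this construction, for a hypothetical discrete ''Q'' chosen only for illustration: tilting by <math>e^{\theta x}</math> and renormalising by <math>M_Q(\theta)</math> recovers ''Q'' itself at θ = 0.
<syntaxhighlight lang="python">
import numpy as np

# Sketch of the natural exponential family of a discrete base distribution Q.
# The support points and weights below are an arbitrary illustrative choice.
x = np.array([0.0, 1.0, 2.0, 3.0])
q = np.array([0.4, 0.3, 0.2, 0.1])

def tilted(theta):
    """Q_theta(x) proportional to exp(theta * x) * Q(x), normalised by M_Q(theta)."""
    w = np.exp(theta * x) * q
    return w / w.sum()        # w.sum() is M_Q(theta) for this discrete Q

print(tilted(0.0))   # equals q: Q_0 = Q
print(tilted(1.0))   # mass shifts toward larger x
</syntaxhighlight>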
Then
:<math>D_{KL}(P\|Q) = D_{KL}(P\|Q_\theta)
+ \int_{\operatorname{supp} P}\left(\log\frac{\mathrm dQ_\theta}{\mathrm dQ}\right)\mathrm dP.</math>
By [[Gibbs' inequality]] we have <math>D_{KL}(P\|Q_\theta) \ge 0,</math> so that
:<math>D_{KL}(P\|Q) \ge
\int_{\operatorname{supp} P}\left(\log\frac{\mathrm dQ_\theta}{\mathrm dQ}\right)\mathrm dP
= \int_{\operatorname{supp} P}\left(\log\frac{e^{\theta x}}{M_Q(\theta)}\right) P(dx).</math>
Simplifying the right side, we have, for every real θ where <math>M_Q(\theta) < \infty:</math>
:<math>D_{KL}(P\|Q) \ge \mu'_1(P) \theta - \Psi_Q(\theta),</math>
where <math>\mu'_1(P)</math> is the first moment, or mean, of ''P'', and <math>\Psi_Q = \log M_Q</math> is called the '''[[cumulant|cumulant-generating function]]'''. Taking the supremum completes the process of [[convex conjugate|convex conjugation]] and yields the [[rate function]]:
:<math>D_{KL}(P\|Q) \ge \sup_\theta \left\{ \mu'_1(P) \theta - \Psi_Q(\theta) \right\}
= \Psi_Q^*(\mu'_1(P)).</math>
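The final supremum can also be evaluated numerically. The sketch below, again for the standard normal (an illustrative assumption), computes <math>\Psi_Q^*</math> by maximising <math>x\theta - \Psi_Q(\theta)</math> over θ and compares it with the closed form <math>x^2/2</math>.
<syntaxhighlight lang="python">
import numpy as np
from scipy.optimize import minimize_scalar

# Sketch: evaluate the rate function Psi_Q*(x) = sup_theta {x*theta - Psi_Q(theta)}
# numerically, for Q = N(0, 1) with cumulant-generating function Psi_Q(theta) = theta^2/2.

def cgf(theta):
    return 0.5 * theta**2

def rate(x):
    res = minimize_scalar(lambda t: -(x * t - cgf(t)))  # maximise by minimising the negative
    return -res.fun

for x in (0.0, 0.5, 1.5):
    print(x, rate(x), 0.5 * x**2)   # numerical supremum vs. closed form
</syntaxhighlight>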
==Corollary: the Cramér–Rao bound==
{{main|Cramér–Rao bound}}
===Start with Kullback's inequality===
Let ''X''<sub>θ</sub> be a family of probability distributions on the real line indexed by the real parameter θ, and satisfying certain [[Cramér–Rao bound#Regularity conditions|regularity conditions]]. Then
:<math> \lim_{h\rightarrow 0} \frac {D_{KL}(X_{\theta+h}\|X_\theta)} {h^2}
\ge \lim_{h\rightarrow 0} \frac {\Psi^*_\theta (\mu_{\theta+h})}{h^2},
</math>
where <math>\Psi^*_\theta</math> is the [[convex conjugate]] of the [[cumulant|cumulant-generating function]] of <math>X_\theta</math> and <math>\mu_{\theta+h}</math> is the first moment of <math>X_{\theta+h}.</math>
===Left side===
The left side of this inequality can be simplified as follows:
:<math>\lim_{h\rightarrow 0}
\frac {D_{KL}(X_{\theta+h}\|X_\theta)} {h^2}
=\lim_{h\rightarrow 0}
\frac 1 {h^2}
\int_{-\infty}^\infty \left( \log\frac{\mathrm dX_{\theta+h}}{\mathrm dX_\theta} \right)
\mathrm dX_{\theta+h}
</math>
:<math> = \lim_{h\rightarrow 0} \frac 1 {h^2} \int_{-\infty}^\infty \left[
\left( 1 - \frac{\mathrm dX_\theta}{\mathrm dX_{\theta+h}} \right)
+\frac 1 2 \left( 1 - \frac{\mathrm dX_\theta}{\mathrm dX_{\theta+h}} \right) ^ 2
+ o \left( \left( 1 - \frac{\mathrm dX_\theta}{\mathrm dX_{\theta+h}} \right) ^ 2 \right)
\right]\mathrm dX_{\theta+h},
</math>
::where we have expanded the logarithm <math>\log x</math> in a [[Taylor series]] in <math>1-1/x</math>; the first-order term integrates to zero, since both measures have total mass 1, leaving
:<math> = \lim_{h\rightarrow 0} \frac 1 {h^2} \int_{-\infty}^\infty \left[
\frac 1 2 \left( 1 - \frac{\mathrm dX_\theta}{\mathrm dX_{\theta+h}} \right) ^ 2
\right]\mathrm dX_{\theta+h}
</math>
:<math>
= \lim_{h\rightarrow 0} \frac 1 {h^2} \int_{-\infty}^\infty \left[
\frac 1 2 \left( \frac{\mathrm dX_{\theta+h} - \mathrm dX_\theta}{\mathrm dX_{\theta+h}} \right) ^ 2
\right]\mathrm dX_{\theta+h}
= \frac 1 2 \mathcal I_X(\theta),</math>
which is half the [[Fisher information]] of the parameter θ.
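As a sanity check on this limit, one can take a concrete family. The sketch below uses exponential distributions with rate θ (an illustrative choice), for which the divergence has a standard closed form and the Fisher information is <math>1/\theta^2</math>.
<syntaxhighlight lang="python">
import numpy as np

# Sketch: for X_theta = Exp(rate theta), check that D_KL(X_{theta+h} || X_theta) / h^2
# tends to I(theta)/2, using the standard closed form
# D_KL(Exp(a) || Exp(b)) = log(a/b) + b/a - 1 and I(theta) = 1/theta^2.

def kl_exponential(a, b):
    return np.log(a / b) + b / a - 1.0

theta = 2.0
half_fisher = 0.5 / theta**2
for h in (0.1, 0.01, 0.001):
    print(h, kl_exponential(theta + h, theta) / h**2, "->", half_fisher)
</syntaxhighlight>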
===Right side===
The right side of the inequality can be developed as follows:
:<math>
\lim_{h\rightarrow 0} \frac {\Psi^*_\theta (\mu_{\theta+h})}{h^2}
= \lim_{h\rightarrow 0} \frac 1 {h^2} {\sup_t \{\mu_{\theta+h}t - \Psi_\theta(t)\} }.
</math>
This supremum is attained at a value of ''t'' = τ where the first derivative of the cumulant-generating function satisfies <math>\Psi'_\theta(\tau) = \mu_{\theta+h},</math> while <math>\Psi'_\theta(0) = \mu_\theta.</math> Since τ → 0 as h → 0, expanding <math>\Psi'_\theta</math> to first order about zero gives <math>\tau \Psi''_\theta(0) \approx \mu_{\theta+h} - \mu_\theta,</math> so that
:<math>\Psi''_\theta(0) = \frac{d\mu_\theta}{d\theta} \lim_{h \rightarrow 0} \frac h \tau.</math>
Moreover,
:<math>\lim_{h\rightarrow 0} \frac {\Psi^*_\theta (\mu_{\theta+h})}{h^2}
= \frac 1 {2\Psi''_\theta(0)}\left(\frac {d\mu_\theta}{d\theta}\right)^2
= \frac 1 {2\mathrm{Var}(X_\theta)}\left(\frac {d\mu_\theta}{d\theta}\right)^2.</math>
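The same exponential-rate family (again an illustrative assumption) can be used to check this limit numerically: for <math>X_\theta</math> exponential with rate θ, the convex conjugate of the cumulant-generating function is <math>\Psi_\theta^*(x) = \theta x - 1 - \log(\theta x)</math> for ''x'' > 0, and <math>\mu_{\theta+h} = 1/(\theta+h).</math>
<syntaxhighlight lang="python">
import numpy as np

# Sketch: for X_theta = Exp(rate theta), Psi_theta*(x) = theta*x - 1 - log(theta*x) (x > 0)
# and mu_{theta+h} = 1/(theta+h). The limit below should equal
# (d mu_theta / d theta)^2 / (2 Var(X_theta)), with Var(X_theta) = 1/theta^2.

def rate_exponential(theta, x):
    return theta * x - 1.0 - np.log(theta * x)

theta = 2.0
dmu = -1.0 / theta**2                    # derivative of mu_theta = 1/theta
target = dmu**2 / (2.0 / theta**2)       # = 1 / (2 theta^2)
for h in (0.1, 0.01, 0.001):
    print(h, rate_exponential(theta, 1.0 / (theta + h)) / h**2, "->", target)
</syntaxhighlight>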
===Putting both sides back together===
We have:
:<math>\frac 1 2 \mathcal I_X(\theta)
\ge \frac 1 {2\mathrm{Var}(X_\theta)}\left(\frac {d\mu_\theta}{d\theta}\right)^2,</math>
which can be rearranged as:
:<math>\mathrm{Var}(X_\theta) \ge \frac{(d\mu_\theta / d\theta)^2} {\mathcal I_X(\theta)}.</math>
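For concreteness, the sketch below plugs standard values for two location families, Gaussian and Laplace (both chosen only for illustration), into this bound: it is attained with equality in the first case and is strict in the second.
<syntaxhighlight lang="python">
# Sketch: Var(X_theta) >= (d mu_theta / d theta)^2 / I(theta) for two location families,
# using standard closed-form values (not derived here):
#   N(theta, 1):       Var = 1, I = 1, d mu/d theta = 1   -> equality
#   Laplace(theta, 1): Var = 2, I = 1, d mu/d theta = 1   -> strict inequality
cases = {
    "Gaussian N(theta, 1)": dict(var=1.0, fisher=1.0, dmu=1.0),
    "Laplace(theta, 1)":    dict(var=2.0, fisher=1.0, dmu=1.0),
}
for name, c in cases.items():
    bound = c["dmu"]**2 / c["fisher"]
    print(f"{name}: Var = {c['var']} >= bound = {bound}")
</syntaxhighlight>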
==See also==
* [[Kullback–Leibler divergence]]
* [[Cramér–Rao bound]]
* [[Fisher information]]
* [[Large deviations theory]]
* [[Convex conjugate]]
* [[Rate function]]
* [[Moment-generating function]]
==Notes and references==
<references/>
{{DEFAULTSORT:Kullback's Inequality}}
[[Category:Information theory]]
[[Category:Statistical inequalities]]
[[Category:Estimation theory]]