In [[computational complexity theory]], the '''Sipser–Lautemann theorem''' or '''Sipser–Gács–Lautemann theorem''' states that [[Bounded-error probabilistic polynomial|bounded-error probabilistic polynomial]] (BPP) time is contained in the [[Polynomial hierarchy|polynomial time hierarchy]], and more specifically in Σ<sub>2</sub> ∩ Π<sub>2</sub>.
In 1983, [[Michael Sipser]] showed that BPP is contained in the [[Polynomial hierarchy|polynomial time hierarchy]]. [[Péter Gács]] showed that BPP is actually contained in Σ<sub>2</sub> ∩ Π<sub>2</sub>. [[Clemens Lautemann]] contributed a simple proof of BPP's membership in Σ<sub>2</sub> ∩ Π<sub>2</sub>, also in 1983. It is conjectured that in fact BPP = [[P (complexity)|P]], which is a much stronger statement than the Sipser–Lautemann theorem.
== Proof ==
[[Michael Sipser|Michael Sipser's]] version of the proof works as follows. Without loss of generality, a machine ''M'' ∈ BPP with error ≤ 2<sup>-|''x''|</sup> can be chosen. (All BPP problems can be amplified to reduce the error probability exponentially.) The basic idea of the proof is to define a Σ<sub>2</sub> ∩ Π<sub>2</sub> sentence that is equivalent to stating that ''x'' is in the language ''L'' decided by ''M'', by using a set of transforms of the random inputs.
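One standard way to carry out this amplification (sketched here for concreteness): run a machine ''M′'' with error at most 1/3 independently ''k'' times on the same input and accept by majority vote. A Chernoff-type bound gives
:<math>\Pr[\text{the majority vote is wrong}] \le \bigl(4 \cdot \tfrac{1}{3} \cdot \tfrac{2}{3}\bigr)^{k/2} = \left(\tfrac{8}{9}\right)^{k/2},</math>
which is at most 2<sup>-|''x''|</sup> once ''k'' = Θ(|''x''|), while the running time remains polynomial.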
Since the output of ''M'' depends on its random input as well as on the input ''x'', it is useful to define the set of random strings on which ''M'' accepts: ''A''(''x'') = {''r'' | ''M''(''x'',''r'') accepts}. The key to the proof is to note that when ''x'' ∈ ''L'', ''A''(''x'') is very large, and when ''x'' ∉ ''L'', ''A''(''x'') is very small. By using [[Exclusive disjunction|bitwise parity]], ⊕, a set of transforms can be defined as ''A''(''x'') ⊕ ''t'' = {''r'' ⊕ ''t'' | ''r'' ∈ ''A''(''x'')}. The first main lemma of the proof shows that the union of a small finite number of these transforms will contain the entire space of random input strings. Using this fact, a Σ<sub>2</sub> sentence and a Π<sub>2</sub> sentence can be generated that are true if and only if ''x'' ∈ ''L'' (see corollary).
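As a toy illustration of these transforms (the particular strings are chosen only for the example), take |''r''| = 3, ''A''(''x'') = {000, 011, 101, 110, 111} and ''t'' = 010. Then
:<math>A(x) \oplus t = \{010, 001, 111, 100, 101\},</math>
a set of the same size as ''A''(''x''), obtained by flipping the middle bit of each member.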
=== Lemma 1 ===
The general idea of the lemma is to prove that if ''A''(''x'') covers a large part of the random space <math>R = \{ 0,1 \}^{|r|}</math>, then there exists a small set of translations that will cover the entire random space. In more mathematical language:
:''If'' <math>\frac{|A(x)|}{|R|} \ge 1 - \frac{1}{2^{|x|}}</math>, ''then'' <math>\exists t_1,t_2,\ldots,t_{|r|}</math>, ''where'' <math>t_i \in \{ 0,1 \} ^{|r|} \ </math> ''such that'' <math> \bigcup_i A(x) \oplus t_i = R.</math>
'''Proof.''' Randomly pick ''t''<sub>1</sub>, ''t''<sub>2</sub>, ..., ''t''<sub>|''r''|</sub>. Let <math>S=\bigcup_i A(x)\oplus t_i</math> (the union of all transforms of ''A''(''x'')).
Fix any ''r'' in ''R''. Since ''r'' ∈ ''A''(''x'') ⊕ ''t''<sub>''i''</sub> exactly when ''r'' ⊕ ''t''<sub>''i''</sub> ∈ ''A''(''x''), and ''r'' ⊕ ''t''<sub>''i''</sub> is uniformly distributed over ''R'', each of the factors below is at most 2<sup>-|''x''|</sup>. Because the ''t''<sub>''i''</sub> are chosen independently,
:<math>\Pr [r \notin S] = \Pr [r \notin A(x) \oplus t_1] \cdot \Pr [r \notin A(x) \oplus t_2] \cdots \Pr [r \notin A(x) \oplus t_{|r|}] \le \frac{1}{2^{|x| \cdot |r|}}.</math>
By the union bound, the probability that at least one element of ''R'' is not in ''S'' satisfies
:<math>\Pr \Bigl[ \exists r \in R : r \notin S \Bigr] \le \sum_{r \in R} \frac{1}{2^{|x| \cdot |r|}} = \frac{2^{|r|}}{2^{|x| \cdot |r|}} = 2^{-(|x|-1) \cdot |r|} < 1</math>
(assuming |''x''| ≥ 2; the finitely many shorter inputs can be handled directly). Therefore
:<math>\Pr [S = R] \ge 1 - 2^{-(|x|-1) \cdot |r|} > 0.</math>
Since this probability is positive, there must be a particular choice of <math>t_1,t_2,\ldots,t_{|r|}</math> such that
:<math> \bigcup_i A(x) \oplus t_i = R.</math>
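The covering phenomenon behind Lemma 1 can be checked experimentally on a small example. The following sketch is purely illustrative (the set <code>A</code>, the string length <code>n</code> and the number of trials are invented for the demonstration and are not part of the proof): it draws <code>n</code> random translates of a large subset of {0,1}<sup>''n''</sup> and reports how often their union is the whole space.
<syntaxhighlight lang="python">
import itertools
import random

# Toy check of Lemma 1: if A covers most of {0,1}^n, then n random
# XOR-translates of A almost always cover all of {0,1}^n.
n = 6
R = set(itertools.product([0, 1], repeat=n))
A = R - {(1,) * n}          # a "large" accepting set: everything except 111111

def xor(u, v):
    return tuple(a ^ b for a, b in zip(u, v))

def translates_cover(A, ts):
    """True if the union of A xor t, over t in ts, is all of {0,1}^n."""
    union = set()
    for t in ts:
        union |= {xor(r, t) for r in A}
    return union == R

rng = random.Random(0)
trials = 1000
hits = sum(
    translates_cover(A, [tuple(rng.randrange(2) for _ in range(n)) for _ in range(n)])
    for _ in range(trials)
)
print(f"{hits} of {trials} random choices of {n} translates covered all of {{0,1}}^{n}")
</syntaxhighlight>
In this regime the union bound from the proof predicts failure with probability at most 2<sup>''n''</sup> · (1/2<sup>''n''</sup>)<sup>''n''</sup>, so the printed count should be 1000 or very close to it.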
=== Lemma 2 ===
The previous lemma shows that ''A''(''x'') can cover every possible point in the space using a small set of translations. Complementary to this, for ''x'' ∉ ''L'' only a small fraction of the space is covered by ''A''(''x''), so no small set of translation vectors ''t<sub>i</sub>'' can make the transforms of ''A''(''x'') cover all of ''R'':
:<math>\frac{|A(x)|}{|R|} \le \frac{1}{2^{|x|}} \implies \neg\, \exists t_1,t_2,\dots,t_{|r|} : \bigcup_i A(x) \oplus t_i = R.</math>
Indeed, each transform satisfies |''A''(''x'') ⊕ ''t<sub>i</sub>''| = |''A''(''x'')|, so the union of |''r''| transforms contains at most |''r''| · |''A''(''x'')| ≤ |''r''| · |''R''| / 2<sup>|''x''|</sup> strings, which is strictly smaller than |''R''| whenever 2<sup>|''x''|</sup> > |''r''| (true for all sufficiently long ''x'', since |''r''| is polynomial in |''x''|).
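For a concrete sense of this gap (the numbers are illustrative only), take |''x''| = 10 and |''r''| = 100, so that |''A''(''x'')| ≤ |''R''|/2<sup>10</sup>. Since each transform has the same size as ''A''(''x''), any 100 transforms cover at most
:<math>100 \cdot \frac{|R|}{1024} < \frac{|R|}{10}</math>
strings, far too few to equal ''R'', no matter how the ''t<sub>i</sub>'' are chosen.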
=== Corollary ===
An important corollary of the lemmas shows that the result of the proof can be expressed as a Σ<sub>2</sub> expression, as follows.
:<math>x \in L \iff \exists t_1,t_2,\dots,t_{|r|}\, \forall r \in R \bigvee_{ 1 \le i \le |r|} (M(x, r \oplus t_i) \text{ accepts}).</math>
That is, ''x'' is in the language ''L'' if and only if there exist |''r''| binary vectors ''t<sub>i</sub>'' such that, for every random bit vector ''r'', the machine ''M'' accepts on at least one of the inputs (''x'', ''r'' ⊕ ''t<sub>i</sub>'').
The above expression is in [[Polynomial hierarchy|Σ<sub>2</sub>]] because it is first existentially and then universally quantified. Therefore BPP ⊆ Σ<sub>2</sub>. Because BPP is closed under [[Complement (complexity)|complement]], this proves BPP ⊆ Σ<sub>2</sub> ∩ Π<sub>2</sub>.
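The Π<sub>2</sub> sentence can also be written out explicitly. This reformulation is not spelled out in the sources cited below, but it follows directly from the closure of BPP under complement: applying the corollary to a BPP machine for the complement of ''L'' and negating the resulting Σ<sub>2</sub> sentence yields
:<math>x \in L \iff \forall t_1,t_2,\dots,t_{|r|}\, \exists r \in R \bigwedge_{1 \le i \le |r|} (M(x, r \oplus t_i) \text{ accepts}).</math>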
== Lautemann's proof ==
Here we present the proof (due to Lautemann) that BPP ⊆ Σ<sub>2</sub>. See Trevisan's notes for more information.
=== Lemma 3 ===
Based on the definition of BPP we show the following:
If ''L'' is in BPP then there is an algorithm ''A'' such that for every ''x'',
:<math>{\rm Pr}_r(A(x,r) = \mbox{right answer}) \ge 1 - \frac{1}{3m},</math>
where ''m'' is the number of random bits <math>|r| = m = |x|^{O(1)}</math> and ''A'' runs in time <math>|x|^{O(1)}</math>.
'''Proof:''' Let ''A{{'}}'' be a BPP algorithm for ''L''. For every ''x'', <math>\Pr_r(A'(x,r) = \mbox{wrong answer}) \le 1/3</math>. ''A{{'}}'' uses ''m{{'}}''(''n'') random bits, where ''n'' = |''x''|.
Do ''k''(''n'') repetitions of ''A{{'}}'' and accept if and only if at least ''k''(''n'')/2 executions of ''A{{'}}'' accept. Define this new algorithm as ''A''. So ''A'' uses ''k''(''n'')''m{{'}}''(''n'') random bits and, for some constant ''c'' > 0,
:<math>{\rm Pr}_r(A(x,r) = \mbox{wrong answer}) \le 2^{-ck(n)}.</math>
We can then find ''k''(''n'') with <math>k(n)=\Theta (\log m'(n))</math> such that
:<math>\frac{1}{2^{ck(n)}} \le \frac{1}{3k(n)m'(n)}.</math>
Since ''A'' uses ''m'' = ''k''(''n'')''m{{'}}''(''n'') random bits, this is exactly the required bound of 1/(3''m'') on the error probability.
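The repetition-and-majority construction in this lemma is straightforward to simulate. The sketch below is illustrative only: <code>base_decider</code> is a hypothetical stand-in for ''A{{'}}'' (it answers a toy question correctly with probability 2/3), and the empirical error of the majority vote drops rapidly as the number of repetitions ''k'' grows.
<syntaxhighlight lang="python">
import random

def base_decider(x: str, rng: random.Random) -> bool:
    """Toy A': decides whether len(x) is even, correct with probability 2/3."""
    correct = (len(x) % 2 == 0)
    return correct if rng.random() < 2 / 3 else not correct

def amplified(x: str, k: int, rng: random.Random) -> bool:
    """Toy A: majority vote over k independent runs of the base decider."""
    votes = sum(base_decider(x, rng) for _ in range(k))
    return votes > k / 2

rng = random.Random(0)
x = "0101"                      # a "yes" instance: len(x) = 4 is even
trials = 2000
for k in (1, 11, 51):
    wrong = sum(not amplified(x, k, rng) for _ in range(trials))
    print(f"k = {k:2d}: empirical error {wrong / trials:.4f}")
</syntaxhighlight>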
=== Theorem 1 ===
If ''L'' is in BPP, then ''L'' is in Σ<sub>2</sub>.
'''Proof:''' Let ''L'' be in BPP and ''A'' as in Lemma 3. We want to show
:<math>x \in L \iff \exists y_1,\dots,y_m \in \{0,1\}^m\, \forall z \in \{0,1\}^m \bigvee_{i=1}^m A(x,y_i \oplus z)=1,</math>
where ''m'' is the number of random bits used by ''A'' on input ''x''. First suppose <math>x \in L</math>. Then
:<math>\begin{align}
{\rm Pr}_{y_1,\dots,y_m}(\exists z& A(x,y_1 \oplus z)=\dots=A(x,y_m \oplus z)=0)\\
&\le \sum_{z \in \{0,1\}^m} {\rm Pr}_{y_1,\dots,y_m}(A(x,y_1 \oplus z) = \dots = A( x, y_m \oplus z) = 0)\\
&\le 2^m \frac{1}{(3m)^m}\\
&< 1.
\end{align}</math>
Thus
:<math>{\rm Pr}_{y_1,\dots,y_m}\Bigl( \forall z\, \bigvee_i A(x,y_i \oplus z)=1\Bigr)= 1 - {\rm Pr}_{y_1,\dots,y_m}(\exists z\, A(x,y_1 \oplus z)=\dots=A(x,y_m \oplus z)=0) > 0,</math>
so a suitable tuple <math>(y_1,\dots,y_m)</math> exists.
Conversely, suppose <math>x \notin L</math>. Then, for any fixed <math>y_1,\dots,y_m \in \{0,1\}^m</math>,
:<math>{\rm Pr}_z \Bigl( \bigvee_i A(x,y_i \oplus z)=1 \Bigr) \le \sum_i {\rm Pr}_z (A(x,y_i \oplus z)=1)\le m \cdot \frac{1}{3m}= \frac{1}{3}.</math>
Thus
:<math>{\rm Pr}_z(A(x,y_1 \oplus z)=\dots=A(x,y_m \oplus z)=0)= 1 - {\rm Pr}_z \Bigl( \bigvee_i A(x,y_i \oplus z)=1 \Bigr)\ge \frac{2}{3} > 0.</math>
Hence, for every choice of <math>y_1,\dots,y_m \in \{0,1\}^m</math>, there is a ''z'' such that <math> \bigvee_i A(x,y_i \oplus z)=0</math>, so the right-hand side of the claimed equivalence fails, as required.
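The equivalence just proved can also be verified by brute force on a very small instance. In the sketch below (purely illustrative; the two acceptance sets stand in for the behavior of ''A'' on a fixed input with ''m'' = 4 random bits, and all names are invented), the Σ<sub>2</sub> condition holds for a set that accepts all but one random string and fails for a set that accepts only one.
<syntaxhighlight lang="python">
from itertools import product

m = 4
R = [tuple(bits) for bits in product([0, 1], repeat=m)]

def xor(u, v):
    return tuple(a ^ b for a, b in zip(u, v))

def sigma2_holds(accepting):
    """exists y_1..y_m in R such that for every z in R some y_i xor z is accepted."""
    return any(
        all(any(xor(y, z) in accepting for y in ys) for z in R)
        for ys in product(R, repeat=m)
    )

yes_case = set(R) - {R[0]}   # "x in L": only one of the 16 random strings rejects
no_case = {R[0]}             # "x not in L": only one of the 16 random strings accepts

print(sigma2_holds(yes_case))   # True: a suitable tuple (y_1, ..., y_4) exists
print(sigma2_holds(no_case))    # False: four translates cannot cover all 16 strings
</syntaxhighlight>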
== Stronger version ==
The theorem can be strengthened to <math>\mathsf{BPP} \subseteq \mathsf{MA} \subseteq \mathsf{S}_2^P \subseteq \Sigma_2 \cap \Pi_2</math> (see [[MA (complexity)|MA]], [[S2P (complexity)|S{{su|p=P|b=2}}]]).
== References ==
* [[Michael Sipser|M. Sipser]]. ''A complexity theoretic approach to randomness''. In ''Proceedings of the 15th ACM Symposium on Theory of Computing'', pp. 330–335. ACM Press, 1983.
* C. Lautemann. ''[http://www.sciencedirect.com/science/article/pii/0020019083900443 BPP and the polynomial hierarchy]''. ''Information Processing Letters'' 17 (4): 215–217, 1983.
* [[Luca Trevisan]]'s [http://www.cs.berkeley.edu/~luca/notes/ Lecture Notes], University of California, Berkeley.
{{DEFAULTSORT:Sipser-Lautemann theorem}}
[[Category:Structural complexity theory]]
[[Category:Probabilistic complexity theory]]
[[Category:Theorems in computational complexity theory]]
[[Category:Articles containing proofs]]