6. Query Complexity II

We have seen in the last lecture that there are separations between deterministic and zero-error randomized query complexity for both partial and total Boolean functions. In this lecture, we continue our investigations into the power of randomness by examining whether there are any such gaps when we compare zero-error and bounded-error randomized query complexity.

Separations between $R_0$ and $R_{1/3}$

The question of whether allowing bounded error probability reduces query complexity is made precise by asking: is there any asymptotic separation between $R_0(f)$ and $R_\epsilon(f)$ for any partial or total function $f$ and constant $\epsilon$? As we have seen when considering success amplification and truncation, however, the average-case randomized query complexity with error $\epsilon$ satisfies

$$R_\epsilon(f) = \Theta\!\left(R_{1/3}(f)\right)$$

whenever $\epsilon$ is an absolute constant with $0 < \epsilon < 1/2$ (i.e., bounded away from both $0$ and $1/2$). So our question is equivalent to asking about asymptotic separations between $R_0(f)$ and $R_{1/3}(f)$.
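The amplification behind this equivalence is simple majority voting over independent repetitions. The following sketch (helper names are ours; it computes the exact binomial tail rather than simulating) illustrates how quickly the error of an error-$1/3$ algorithm decays, so that reaching any constant error $\epsilon$ costs only a constant-factor blowup in queries.

```python
from math import comb

def majority_error(p_err: float, t: int) -> float:
    """Exact probability that the majority vote of t independent runs,
    each erring with probability p_err, is wrong (t odd)."""
    return sum(comb(t, k) * p_err**k * (1 - p_err)**(t - k)
               for k in range((t + 1) // 2, t + 1))

# Majority over t runs drives the error down exponentially in t
# (a Chernoff bound makes this precise), so any constant target
# error eps is reached after O(1) repetitions.
for t in (1, 11, 21, 41):
    print(t, majority_error(1/3, t))
```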

For partial functions, the situation is similar to the one we observed when comparing deterministic and zero-error randomized query complexity: we can construct partial functions for which the maximum possible separation between our two randomized query complexity measures is achieved.

Theorem 1. There exists a partial function $f : X \to \{0,1\}$ with $X \subseteq \{0,1\}^n$ for which

$$R_0(f) = \Omega(n) \qquad \text{and} \qquad R_{1/3}(f) = O(1).$$

Proof. Consider the Gap Majority function where

$$X = \left\{ x \in \{0,1\}^n : \sum_{i=1}^n x_i \in \{0, 1, \ldots, n/3\} \cup \{2n/3, \ldots, n\} \right\}$$

and $\mathrm{GapMaj} : X \to \{0,1\}$ is defined by $\mathrm{GapMaj}(x) = \mathbb{1}\!\left[\sum_{i=1}^n x_i \ge n/2\right]$. (The Gap Majority function is identical to the standard Majority function, but has the additional promise that the inputs are always heavily biased towards the output value.)

Any zero-error randomized algorithm for the Gap Majority function must have query complexity at least $n/3 + 1$: an algorithm that makes at most $n/3$ queries may be extremely unlucky and query only minority bits, in which case it cannot output an answer with certainty. With bounded error, however, it suffices to query a single bit of the input uniformly at random and to output its value.
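To make the bounded-error claim concrete, here is a small exhaustive check (a sketch; the helper names are ours) that the one-query algorithm — output a uniformly random bit of the input — errs with probability at most $1/3$ on every input satisfying the Gap Majority promise:

```python
from itertools import product

def one_query_error(x):
    """Error probability of outputting a uniformly random bit of x
    as the answer to GapMaj(x): the fraction of minority bits."""
    n = len(x)
    answer = int(sum(x) >= n / 2)  # GapMaj(x)
    return sum(1 for b in x if b != answer) / n

n = 9  # promise: Hamming weight at most n/3 or at least 2n/3
promised = [x for x in product((0, 1), repeat=n)
            if sum(x) <= n // 3 or sum(x) >= 2 * n // 3]
assert all(one_query_error(x) <= 1/3 for x in promised)
print(len(promised), "promised inputs checked")
```

The worst case is an input with exactly $n/3$ minority bits, where the error is exactly $1/3$; further amplification brings it below any constant.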

Once again, the situation is quite different when we restrict our attention to total functions. In this setting, there is again essentially only a quadratic gap between zero-error and bounded-error randomized query complexity.

Theorem 2. For every Boolean function $f : \{0,1\}^n \to \{0,1\}$,

$$R_0(f) = O\!\left(R_{1/3}(f)^2 \log R_{1/3}(f)\right).$$

The upper bound was obtained by Kulkarni and Tal (2016), who provided a tight analysis of an earlier argument of Midrijānis (2005). One of the key aspects of this argument is success amplification: through a connection to (variants of) block sensitivity, the core of the argument is that the existence of a randomized algorithm with polynomially small error on each input already implies the existence of a zero-error randomized algorithm.

Shortcut functions, revisited

Unlike in the setting of deterministic vs. zero-error randomized query complexity, no natural total Boolean function (like the recursive NAND function in the former setting) was known to exhibit an asymptotic separation between the two randomized measures of query complexity. So in this case, until recently it was open whether any polynomial, or even just asymptotic, separation held between $R_0(f)$ and $R_{1/3}(f)$ for any total Boolean function.

Ambainis, Balodis, Belovs, Lee, Santha, and Smotrovs (2017), in the same paper in which they established the near-optimal separation between deterministic and zero-error randomized query complexity that we saw in the last lecture, also established a near-optimal separation between zero-error and bounded-error randomized query complexity.

Theorem 3. There is a total function $f : \{0,1\}^n \to \{0,1\}$ that satisfies

$$R_0(f) = \widetilde{\Omega}\!\left(R_{1/3}(f)^2\right).$$

The key to the proof of this theorem is again to design an appropriate “shortcut” in the function that can only be efficiently accessed under one of the models of computation. This is again done by considering an appropriate modification of the original pointer functions of Göös, Pitassi, and Watson (2015), but in this case to make the shortcuts useful only when the randomized algorithm is allowed to err with bounded probability.

Separations between $D(f)$ and $R_{1/3}(f)$

If we combine the upper bounds on the maximum separation between the various query complexity measures, we obtain the conclusion that for every total function f,

$$D(f) = \widetilde{O}\!\left(R_{1/3}(f)^4\right).$$

Is that bound tight? No!

Theorem 4. For every $f : \{0,1\}^n \to \{0,1\}$,

$$D(f) = O\!\left(R_{1/3}(f)^3\right).$$

This bound was obtained by Nisan (1991). We can prove it by combining certificate complexity and block sensitivity. Let us first introduce the latter complexity measure.

The sensitivity of an input $x \in \{0,1\}^n$ to a function $f : \{0,1\}^n \to \{0,1\}$, denoted $s(f,x)$, is the number of coordinates $i \in [n]$ for which the input $x^{(i)}$ obtained by flipping the $i$th coordinate of $x$ satisfies $f(x^{(i)}) \ne f(x)$. The (maximum) sensitivity complexity of a function is

$$s(f) = \max_{x \in \{0,1\}^n} s(f, x).$$

(Note that the average sensitivity complexity measure obtained by replacing the maximum with an expectation over the uniform distribution on $\{0,1\}^n$ is another fundamental complexity measure with useful properties that often shows up in the analysis of Boolean functions, but it is very different from the maximum sensitivity measure we consider here.)
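Since the definition is finite and explicit, sensitivity can be computed by brute force on small examples. The sketch below (helper names are ours) does exactly that and confirms the standard fact that $s(\mathrm{OR}_n) = n$, witnessed by the all-zeros input.

```python
from itertools import product

def sensitivity_at(f, x):
    """s(f, x): number of single-coordinate flips that change f(x)."""
    return sum(1 for i in range(len(x))
               if f(x[:i] + (1 - x[i],) + x[i + 1:]) != f(x))

def sensitivity(f, n):
    """s(f): maximum sensitivity over all n-bit inputs."""
    return max(sensitivity_at(f, x) for x in product((0, 1), repeat=n))

OR = lambda x: int(any(x))
# Flipping any single bit of the all-zeros input changes OR's value,
# so the all-zeros input is fully sensitive and s(OR_n) = n.
print(sensitivity(OR, 5))  # -> 5
```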

The block sensitivity of an input $x$ to the function $f$ is defined similarly, except that we now ask for the maximum number of disjoint blocks of indices $B_1, \ldots, B_k \subseteq [n]$ such that for each block $B_i$, the input $x^{(B_i)}$ obtained by flipping all of the bits in $B_i$ satisfies $f(x^{(B_i)}) \ne f(x)$. The block sensitivity of an input is denoted by $bs(f,x)$, and the block sensitivity complexity of $f$ is

$$bs(f) = \max_{x \in \{0,1\}^n} bs(f, x).$$

The sensitivity and block sensitivity complexity measures satisfy the obvious bound $s(f) \le bs(f)$, since the singleton blocks $\{i\}$ for the sensitive coordinates of any input are themselves disjoint sensitive blocks. These measures also relate to the other complexity measures we have already seen in various ways.
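Block sensitivity can likewise be brute-forced on small inputs. The sketch below (our own helper names) recursively tries every subset of the remaining coordinates as the next sensitive block; each block is flipped individually relative to the original input $x$, matching the definition.

```python
from itertools import product

def bs_at(f, x, coords):
    """bs(f, x) restricted to blocks drawn from `coords`: the maximum
    number of disjoint blocks whose individual flips change f(x)."""
    best = 0
    for mask in range(1, 1 << len(coords)):
        block = [c for j, c in enumerate(coords) if (mask >> j) & 1]
        y = list(x)
        for i in block:
            y[i] = 1 - y[i]
        if f(tuple(y)) != f(x):  # block is sensitive at x
            rest = tuple(c for c in coords if c not in block)
            best = max(best, 1 + bs_at(f, x, rest))
    return best

def block_sensitivity(f, n):
    return max(bs_at(f, x, tuple(range(n)))
               for x in product((0, 1), repeat=n))

MAJ3 = lambda x: int(sum(x) >= 2)
OR = lambda x: int(any(x))
print(block_sensitivity(MAJ3, 3))  # -> 2
print(block_sensitivity(OR, 4))    # -> 4 (singleton blocks at all-zeros)
```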

As a first step, we have the following upper bound originally due to Beals, Buhrman, Cleve, Mosca, and de Wolf (2001).

Lemma 5. For every $f : \{0,1\}^n \to \{0,1\}$,

$$D(f) \le C_1(f) \cdot bs(f).$$

The proof of this lemma can be completed with an argument that is very similar to the one for the $D(f) \le C_0(f) \cdot C_1(f)$ bound that we saw in the last lecture.

We can also bound the certificate complexity of a function by its block sensitivity in the following way.

Lemma 6. For every $f : \{0,1\}^n \to \{0,1\}$,

$$C(f) \le s(f) \cdot bs(f) \le bs(f)^2.$$
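For small $n$ the lemma can be verified exhaustively. The sketch below (all helper names are ours) brute-forces $C(f)$, $s(f)$, and $bs(f)$ for every Boolean function on 3 bits and checks $C(f) \le s(f) \cdot bs(f)$ for each one.

```python
from itertools import combinations, product

N = 3
INPUTS = list(product((0, 1), repeat=N))

def sens(f, x):
    return sum(1 for i in range(N)
               if f(x[:i] + (1 - x[i],) + x[i + 1:]) != f(x))

def bs_at(f, x, coords):
    best = 0
    for mask in range(1, 1 << len(coords)):
        block = [c for j, c in enumerate(coords) if (mask >> j) & 1]
        y = list(x)
        for i in block:
            y[i] = 1 - y[i]
        if f(tuple(y)) != f(x):
            rest = tuple(c for c in coords if c not in block)
            best = max(best, 1 + bs_at(f, x, rest))
    return best

def cert(f, x):
    """C(f, x): smallest set of coordinates of x that forces f's value."""
    for size in range(N + 1):
        for S in combinations(range(N), size):
            if all(f(y) == f(x) for y in INPUTS
                   if all(y[i] == x[i] for i in S)):
                return size

# Check C(f) <= s(f) * bs(f) for all 2^(2^3) = 256 functions on 3 bits.
for table in product((0, 1), repeat=2 ** N):
    f = dict(zip(INPUTS, table)).__getitem__
    C = max(cert(f, x) for x in INPUTS)
    s = max(sens(f, x) for x in INPUTS)
    bs = max(bs_at(f, x, tuple(range(N))) for x in INPUTS)
    assert C <= s * bs
print("verified for all 256 functions on 3 bits")
```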

And the final ingredient of the proof is the observation that block sensitivity provides a lower bound on bounded-error randomized query complexity.

Lemma 7. For every $f : \{0,1\}^n \to \{0,1\}$,

$$R_{1/3}(f) = \Omega(bs(f)).$$

The proof of Lemma 7 is obtained via Yao’s Minimax Principle: let $x$ be an input with maximal block sensitivity and let $B_1, \ldots, B_k \subseteq [n]$, with $k = bs(f)$, be disjoint sensitive blocks for $x$. Let $\mu$ be the distribution that returns $x$ itself with probability $1/2$ and otherwise returns $x^{(B_i)}$ for $i \in [k]$ chosen uniformly at random. Any deterministic algorithm that makes $o(k)$ queries can probe only a sublinear fraction of the sensitive blocks, so it does not correctly compute $f$ with bounded error under $\mu$.
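The error bound under $\mu$ can be made precise with a short calculation (a sketch of the standard argument):

```latex
% Let T be a deterministic tree making q queries, and let Q be the set of
% coordinates it queries on input x. Every block B_i disjoint from Q gives
% x^{(B_i)} the same transcript as x, and hence the same output. If T
% outputs f(x), it errs on each such x^{(B_i)}, which carry total mass at
% least (k - q)/(2k) under mu; if T outputs 1 - f(x), it errs on x itself,
% which has mass 1/2. In either case,
\Pr_{y \sim \mu}\left[ T(y) \ne f(y) \right]
  \;\ge\; \min\left\{ \frac{1}{2},\; \frac{k - q}{2k} \right\},
% which is below 1/3 only when q > k/3. Hence any algorithm with error at
% most 1/3 makes Omega(k) = Omega(bs(f)) queries.
```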

Combining Lemmas 5–7 completes the proof of Theorem 4. Note, however, that it is still open whether the cubic bound in that theorem can be achieved; the best separation we have between deterministic and bounded-error randomized query complexity is the nearly quadratic bound that we have already observed when comparing deterministic vs. zero-error or zero-error vs. bounded-error randomized complexity measures.
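For completeness, the chain of inequalities that assembles the three lemmas into Theorem 4 is:

```latex
D(f) \;\le\; C_1(f) \cdot bs(f)            % Lemma 5
     \;\le\; C(f) \cdot bs(f)              % C_1(f) \le C(f)
     \;\le\; bs(f)^2 \cdot bs(f)           % Lemma 6
     \;=\; bs(f)^3
     \;=\; O\!\left( R_{1/3}(f)^3 \right). % Lemma 7
```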

There are also a number of other complexity measures for which we still do not know the best possible separation achievable for total functions. For a good overview of these separation questions, as well as a summary of the recent state of the art and pointers to the relevant literature, see Table 1 in the recent article of Aaronson, Ben-David, Kothari, Rao, and Tal (2021).