11. Set Disjointness
The set disjointness function plays a central role in the study of communication complexity.
We saw in the last lecture that the randomized communication complexity of this function is bounded below by
Theorem 1.
We should remark that there are many other questions and results related to the communication complexity of the set disjointness function that are quite interesting but that we won’t be able to cover in the course. See for example the surveys of Chattopadhyay and Pitassi (2010) and Sherstov (2014) for more on this topic.
Applications
Before we dive into the task of proving Theorem 1, let us first take a detour to see how this communication complexity lower bound also implies lower bounds in other models of computation as well.
Streaming
A (one-pass) streaming algorithm is a randomized algorithm whose input consists of a sequence of elements
There are some functions that can easily be computed with a reasonably small amount of memory.
For example, we can consider the frequency moments estimation task.
Here, the elements
What if we consider the problem of approximating
Can we also approximate the other moments of a stream of numbers as efficiently?
This is the question that Alon, Matias, and Szegedy (1999) considered.
They showed that some frequency moments can be approximated with little space but others cannot.
In particular, the maximum frequency
Theorem 2. Any single-pass streaming algorithm that provides a
-approximation to on a stream of at most numbers in has space complexity .
Proof. Let
be a streaming algorithm that computes a approximation to and uses bits of memory. Alice and Bob can use to compute the set disjointness function in the following way. They simulate on the sequence that includes the elements in Alice’s set followed by the elements in Bob’s set. Alice can simulate the algorithm on her own for the first part of the sequence. Then she send the bits in memory after has finished processing her part of the sequence to Bob. Bob then finishes simulating on his part of the sequence. He then outputs if and only if the estimate to is at most . This protocol uses
bits of communication. When Alice and Bob’s sets are disjoint, the correct value of is 1. And when their sets are not disjoint, then at least one element appears twice so that is 2. In both cases, they output the correct value whenever outputs a approximation to .
Property testing
The Set Disjointness function can also be used to prove strong lower bounds for some problems in property testing.
In the
(i) the function
There is an algorithm that can test
Theorem 3. Any algorithms for testing
-juntas has query complexity .
Proof. Let
be a -junta tester with query complexity . We can design an efficient protocol for using in the following way. Let
and denote Alice and Bob’s inputs to the function. Alice and Bob use their public randomness to simulate on the function . For every query that makes on input , Alice and Bob exchange the values and and continue simulating when it receives the value to its query. Once terminates, Alice and Bob output 1 if and only if claims that is not a -junta. Each query of
is answered with 2 bits of communication, so the total communication cost of is . And the correctness of the protocol follows from two facts. First, when and are disjoint -subsets of , then and a parity function on coordinates is far from being a -junta. Second, when and intersect, then and the parity function is a -junta. Therefore, the argument above shows that
. And with Theorem 1 we can also show that . So any -junta tester must have query complexity at least .
One-sided Discrepancy
Recall that the matrix representation of the set disjointness function looks like this:
This matrix has a large 0-monochromatic rectangle.
But it does not appear to contain any large 1-monochromatic rectangle.
And indeed we can prove that even for larger values of
In the rest of this section, let
The one-sided discrepancy result at the heart of the argument is the following.
Lemma 4. Let
be any rectangle with for which at most an fraction of the pairs in are intersecting. Then there is a constant such that
Proof. Let
. Define to be the set of elements in that intersect with at most a fraction of the sets in . By Markov’s inequality, I claim that there are
sets in such that each set , satisfies Indeed, the size of the sets in
guarantees that . And the number of sets in for which is at most Since this number is less than
, there must exist a set in that satisfies the required condition. The condition on the sets
implies that their union has cardinality at least And the fact that they are all in
implies that they each intersect at most an fraction of the sets in . By Markov’s inequality, this means that at least half of the sets in intersect at most of the sets . But this means that the cardinality of is bounded above by
With Lemma 4, we can easily obtain our lower bound on the communication complexity of disjointness.
Theorem 5.
Proof. By the easy direction of Yao’s Minimax Principle, it suffices to show that for the distribution
defined above, . Fix any deterministic protocol
that computes with error at most on inputs with and drawn independently and uniformly at random from . (With constant probability, the leaf reached by must correspond to a nearly 1-monochromatic rectangle.) By Lemma 4, however, each such rectangle has measure at most with respect to . This means that there must be at least such leaves and so the depth of the communication tree for must be at least .
The bound of