Impagliazzo, Paturi and Zane (JCSS 2001) proved a sparsification lemma for k-CNFs:
every k-CNF is a sub-exponential size disjunction of k-CNFs with a linear
number of clauses. This lemma has subsequently played a key role in the study
of the exact complexity of the satisfiability problem. A natural question is
whether an analogous structural result holds for CNFs or even for broader
non-uniform classes such as constant-depth circuits or Boolean formulae.
We prove a very strong negative result in this connection: For every superlinear
function f(n), there are CNFs of size f(n) which cannot be written as
a disjunction of 2^{n- \varepsilon n} CNFs each having a linear number of
clauses for any \varepsilon > 0. We also give a hierarchy of such non-sparsifiable CNFs: For every
k, there is a k' for which there are CNFs of size n^{k'} which cannot
be written as a sub-exponential size disjunction of CNFs of size n^{k}.
Furthermore, our lower bounds hold not just against CNFs but against an {\it arbitrary}
family of functions as long as the cardinality of the family is appropriately
bounded.
As by-products of our result, we make progress both on questions about
circuit lower bounds for depth-3 circuits and satisfiability algorithms for
constant-depth circuits. Improving on a result of Impagliazzo, Paturi and
Zane, for any f(n) = \omega(n \log(n)), we define a pseudo-random function generator with seed length f(n)
such that with high probability, a function in the output of this generator
does not have depth-3 circuits of size 2^{n-o(n)} with bounded bottom fan-in.
We show that if we could decrease the seed length of our generator below
n, we would get an explicit function which does not have linear-size
logarithmic-depth series-parallel circuits, solving a long-standing open question.
Motivated by the question of whether CNFs sparsify into bounded-depth circuits,
we show a {\it simplification} result for bounded-depth circuits: any bounded-depth circuit of linear size can be written as a sub-exponential size disjunction of linear-size
constant-width CNFs. As a corollary, we show that if there is an algorithm for
CNF satisfiability which runs in time O(2^{\alpha n}) for some fixed
\alpha < 1 on CNFs of linear size, then there is an algorithm for
satisfiability of linear-size constant-depth circuits which runs in time
O(2^{(\alpha + o(1))n}).