Random multilinear maps and the Erdős box problem
- The Division of Physics, Mathematics and Astronomy, California Institute of Technology
- More about David Conlon
- Department of Mathematics, Yale University
- More about Cosmin Pohoata
- Moscow Institute of Physics and Technology
- More about Dmitriy Zakharov
Editorial introduction
Random multilinear maps and the Erdős box problem, Discrete Analysis 2021:17, 8 pp.
A major theme in extremal combinatorics is determining the maximum number of edges that a graph or hypergraph can have if it does not contain a certain fixed graph or hypergraph. In the case of graphs, Turán’s theorem gives an exact result when the forbidden subgraph is the complete graph (the maximum number of edges is attained when the graph is an -partite graph with vertex sets as equal as possible in size), and the Erdős-Stone theorem shows that if the forbidden subgraph has chromatic number then the same example is at least asymptotically best possible.
When , meaning that the forbidden subgraph is bipartite, the Erdős-Stone theorem merely tells us that the asymptotic density of the graph is zero, which can be proved by a relatively simple direct argument, first noted by Kővári, Sós and Turán. The Zarankiewicz problem, which has led to a large amount of research, is to obtain more precise bounds for the number of edges. Even for general complete bipartite graphs there is a significant gap between the best known upper and lower bounds.
One case of interest is that of 4-cycles. If is a graph with vertices and we write for the neighbourhood of a vertex , then the number of edges of is (ignoring a factor 2) and the number of labelled 4-cycles, including degenerate ones, is . By Cauchy-Schwarz, the latter quantity is at least . But counts the number of labelled and possibly degenerate paths of length 2, so it is equal to , which is up to a constant also the number of degenerate 4-cycles. Thus, there is a non-degenerate cycle provided that
or in other words provided that . But by Cauchy-Schwarz again, . Thus, if , then contains a non-degenerate 4-cycle.
One might expect an upper bound proved in as natural a way as this to be matched by a lower bound given by the standard technique of picking a random graph and removing a few edges. However, if we choose edges independently with probability , then the expected number of (labelled and possibly degenerate) 4-cycles is while the number of edges is (up to a constant), so for this to work we need , which results in a graph with around edges instead of . Nevertheless, the bound can be shown to be sharp using a construction due to Klein that appears in a paper of Erdős. Define a graph by taking the vertex set to be and join to if . If forms a 4-cycle in this graph, then setting (where is interpreted as 1 when ), we have that and . This implies that and , which implies that either or , and therefore that and . Thus, all 4-cycles are degenerate.
There are many situations like this in extremal graph theory and extremal combinatorics more generally, where random constructions work reasonably well, but are outperformed by explicit algebraic constructions. However, for many of these situations, even the best known algebraic constructions do not give bounds that match what is known in the other direction.
Recently, an interesting new method has emerged, which is a sort of hybrid of the random and algebraic approaches. Though the roots of the idea can be found earlier, it took off properly in 2014 with an influential paper by Boris Bukh entitled Random algebraic construction of extremal graphs. (A somewhat different mixture of randomness and algebra can also be found in Peter Keevash’s remarkable constructions of designs from the same year.)
Broadly speaking, Bukh’s insight was that if a random construction has algebraic aspects to it, then it is sometimes possible to show that if a forbidden configuration is present, then some much larger configuration must also be present. It is easier to rule out the larger configuration, and this ultimately leads to a better bound. Bukh’s example was a bipartite graph with vertex sets equal to , with joined to if and only if is a zero of a random polynomial in variables of degree at most in each variable (for suitably chosen ). The randomness of the polynomial is enough for the calculation of various moments, but the all-important non-randomness arises because the size of the zero set of a polynomial is tightly constrained: the zero set itself has a dimension and the size of the set must be approximately . In particular, if it is not bounded, then it must be at least comparable to .
This paper concerns the natural hypergraph generalization of the 4-cycle problem above. That is, one is looking for the maximum number of edges in a -partite -uniform hypergraph that contains no complete -partite subhypergraph with two vertices in each vertex class. If the vertex sets of the hypergraph are and one represents edges by points in , then this forbidden configuration is a set of the form where each is a set of size 2: that is, it is a discrete aligned cuboid, which explains the name “box problem”. Alternatively, if we think of the edges of the hypergraph as simplices, then the forbidden configuration is a -dimensional octahedron (often known as a cross-polytope).
One motivation for these extremal problems is that -dimensional octahedra are important in the theory of quasirandomness: just as a graph is quasirandom if it has as few 4-cycles as possible given its density, so a hypergraph is quasirandom (in many useful senses) if it has as few -dimensional octahedra as possible.
Cauchy-Schwarz type arguments show that if the vertex sets have size , then a box-free hypergraph has edges. However, as we have already seen when , the probabilistic method with deletions does not give a matching lower bound: instead it gives a hypergraph with edges.
Gunderson, Rödl and Sidorenko used a random algebraic construction to improve on this bound for all such that is coprime to . When , they took three copies and of as their vertex sets and for each and they took all triples such that belonged to a randomly chosen 3-dimensional affine subspace of . This construction yields a lower bound of , whereas the bound from the purely random method is , which is smaller (by 1/35).
In this paper the authors use random multilinear maps. That is, they let each be a copy of for a carefully chosen , and then for suitable they choose a random multilinear map and take as their edge set the set of all such that .
The key point about this construction is that if the resulting hypergraph contains a box , then in fact it contains the much larger set , where each is the line the contains . This enables the authors to obtain a non-trivial gain using a double counting argument and then to apply the deletion method more efficiently. Their method works for all and improves on the bound of Gunderson, Rödl and Sidorenko, when their method works, except if is a power of 2. This very interesting proof technique may well have further applications.
The bound they obtain is slightly complicated to specify, but the exponent is at least . Since cannot divide , this is bigger than the that is given by the probabilistic method. Though the gain is small, it turns out that by a result of Ferber, McKinley and Samotij, any power-type gain on the obvious probabilistic bound yields an optimal counting result for the number of box-free hypergraphs with vertices in each class – thus, the small gain has a very satisfying corollary.
More details can be found in the paper, or in the following talk given by one of the authors.