Random restrictions of high-rank tensors and polynomial maps
- Algorithms and complexity group, Centrum Wiskunde & Informatica
- More about Jop Briët
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge
- More about Davi Castro-Silva
Editorial introduction
Random restrictions of high-rank tensors and polynomial maps, Discrete Analysis 2024:9, 24 pp.
Let be an matrix of rank . The following argument shows that if we randomly restrict by picking each row and each column independently with positive probability , then the resulting submatrix will be likely to have rank at least for some that depends only on . We note first that has independent columns. By standard probabilistic estimates, there is an exponentially small (in ) probability that the number of columns chosen in our random selection is smaller than, say, . The restriction of the matrix to those columns has row-rank , so we can find linearly independent rows, and by standard arguments again there is an exponentially small probability that fewer than of those rows are selected. So the probability that we do not end up with a matrix of rank at least is exponentially small.
The aim of this paper is to obtain a similar conclusion for tensors. (A tensor can be thought of as a higher-dimensional matrix. For example, a 3-tensor over a field is a 3-dimensional array of elements of .) However, for tensors there are many different notions of rank, each one typically taking a property of matrix rank and generalizing it: it turns out that several properties that are equivalent for matrices have natural generalizations that are not equivalent for tensors.
There is another problem that turns out to be more serious. Suppose that we have fixed on a notion of tensor rank. The natural thing to try to do in order to generalize the result about matrices would be to prove a result similar to the result that the rank of a matrix equals its row rank and its column rank. However, this is false: for instance, there exists a 3-tensor with slice-rank 4 with no subtensor of slice-rank 4. (Slice rank is a notion introduced by Tao that played a central role in a reformulation of the solution to the cap-set problem by Ellenberg and Gijswijt, building on work of Croot, Lev and Pach.)
Let be a -tensor over a field , defined as a function from to , where is shorthand for . Given a subset define to be the restriction of to .
To obtain a generalization of the matrix statement, it would be sufficient to prove that if is a -tensor of rank , then for any we can find a set of size linear in such that the restriction has rank . In fact, a somewhat weaker property would also be sufficient, which the authors call the linear core property. This states that one can find a subtensor of of rank at least , where and for some constants .
Unfortunately, for many kinds of tensor rank, while it is conjectured that they have the linear core property, this has not been proved. The best result we have in that direction, due to Thomas Karam, is that we can at least obtain such a subtensor for values of and that depend only on the rank of (for many different definitions of rank), but the bounds he obtains are exponential rather than linear.
For this reason, the authors of the paper choose a different approach. To deal with the multiplicity of definitions of tensor ranks, they prove a more abstract result that applies to all “natural” definitions, which are ones that satisfy three obvious properties. The first is subadditivity – that . The second is monotonicity – that the rank of any restriction is at most the rank of . And the third is a Lipschitz condition: that if , then .
The problem is then reduced to one of proving a suitable concentration result for subadditive monotone Lipschitz functions on the collection of subsets of , or equivalently on with respect to the biased measure where each element is chosen independently with probability . If such a function has maximum value , then the concentration result the authors prove gives an exponential upper bound for the probability that the value of such a function is smaller than multiplied by a suitable constant.
There is a close connection between -tensors and multivariable polynomials of degree – indeed, associated with a -tensor is a -linear map of which the tensor provides the coefficients, and one can also obtain a (symmetric) -linear map from a polynomial by differencing times. It is thus natural to ask how well randomly restricting a multivariable polynomial to a dense set of variables preserves its rank (which can be defined as the rank of the associated -linear map or -tensor).
Sometimes it is possible to deduce results about polynomials directly from corresponding results about tensors and multilinear maps, but for this result it is more difficult. An issue is that there is no canonical way to represent even a homogeneous polynomial as a multilinear map. (Such a canonical representation does exist when the field has high characteristic, but the result proved here is field-independent.) One might instead consider trying to apply the abstract concentration inequality directly by looking at the function that takes a set to the rank of the restriction of a given polynomial to , but this function turns out not to be subadditive. Instead, the authors prove another abstract result, making use of ideas from the analysis of Boolean functions, this time concerning monotone Lipschitz functions that are not necessarily subadditive.
The original motivation for these results was a problem in complexity theory. Very roughly, the results here can be used to show that bounded-depth and bounded fanin circuits supplemented by mod-2 addition gates of arbitrary fanin are insufficiently powerful to decode corrupted error-correcting codes.