Geometric rank of tensors and subrank of matrix multiplication
- Department of Mathematics and Department of Computer Science, University of Toronto
- More about Swastik Kopparty
- Department of Mathematics, City University of New York
- More about Guy Moshkovitz
- Korteveg-de Vries Institute for Mathematics, University of Amsterdam
- More about Jeroen Zuiddam
Editorial introduction
Geometric rank of tensors and subrank of matrix multiplication, Discrete Analysis 2023:1, 25 pp.
The rank of a matrix is a parameter of obvious importance, so it is natural to wonder what the right definition is for the rank of a higher-dimensional matrix – that is, of a tensor (which can be thought of as a -dimensional array of elements of a field ). This question turns out not to have a straightforward answer: there are many different natural definitions for the rank of a tensor that specialize to matrix rank when the tensor has dimension 2. However, it also turns out that several of these definitions are useful and have important applications.
One approach to defining a notion of rank is first to decide what the rank-1 tensors should be and then to define the rank of a tensor to be the smallest number of rank-1 tensors needed to generate as a linear combination. If and we define a rank-1 tensor to be one of the form , then we obtain the notion of tensor rank. If instead we define a rank-1 tensor to be one of the form , or , then we obtain the notion of slice rank, which was introduced by Tao and played a key role in his reformulation of the famous solution to the cap-set problem.
A different approach, which works when is the finite field is to observe (following a straightforward calculation) that if is a matrix of rank , then . If we wanted, we could therefore define the rank of to be . This has an obvious generalization to higher-order tensors. For instance, if then we define to be and we can consider the quantity as a kind of rank of . This notion was introduced by Gowers and Wolf, who called it the analytic rank of the tensor.
However, in contrast with tensor rank and slice rank, the definition of the analytic rank depends on the field being finite, and it is not obvious what an analogous notion should be for fields of characteristic zero. This question was explicitly asked by Lovett, and is answered in this paper, where the authors introduce yet another notion of rank, which they call geometric rank.
For a 3-tensor , the geometric rank is defined as follows. Let have coefficients for , where denotes the set . Then for each let be the matrix . That is, the matrices are the “slices” of perpendicular to the third axis. Now define an algebraic variety in to be the set of all such that for every . The geometric rank of is defined to be the codimension of this variety, which can be shown to be independent of which of the three directions we use to define the slices.
To illustrate this with a very simple example, suppose that , and let if and 0 otherwise. Then the variety is the set of all such that for , which can be shown to have codimension (roughly speaking since if the coordinates are non-zero, then the set of such that is the set of such that , which has codimension ).
While the relationship between geometric rank and analytic rank is not completely straightforward, it is at least plausible, given basic results about the sizes of varieties in finite fields, that if we approximate a field by larger and larger finite fields and count the number of points in the corresponding variety , then it should grow like a power and that that power should be the codimension of . Indeed, the example just given illustrates this: if the coefficients belong to , say, then for large almost every will have all its first coordinates not equal to zero, so for almost every the number of with will be , so the number of points in will be well approximated by .
While the definition of geometric rank can be seen to be natural, what matters more is whether it is useful. In this paper the authors demonstrate its utility by first showing that it gives upper and lower bounds for various other notions of rank, which sometimes leads to easier ways of bounding other ranks of given tensors. Then they apply these results to analyse the important matrix multiplication tensor, which is the tensor that corresponds to the trilinear map
where , and are three matrices for which the right-hand side makes sense. (More precisely, for each choice of sizes for the matrices we obtain a separate matrix multiplication tensor.)
The famous and important problem of determining the number of scalar multiplications needed in order to calculate the product of two matrices turns out to be equivalent to determining the tensor rank of the matrix multiplication tensor. In 1987, Strassen (the first person to obtain a power saving on the trivial upper bound of ) obtained a lower bound for a notion of rank called the border subrank, and the results of this paper prove that this bound of Strassen is sharp.
Since this paper first appeared as a preprint, the notion of geometric rank has played an important role in a central problem of additive combinatorics, which is to relate analytic rank to partition rank (where the rank-1 tensors of degree are those of the form , where and depend on disjoint sets of variables). After a long series of developments, Alex Cohen and the second author made essential use of geometric rank to prove that for sufficiently large finite fields the partition rank is bounded above by a linear function of the analytic rank.