Today I Learned

November 12th

Today I learned the classification of nilpotent matrices with maximal index. Suppose $A$ is a nilpotent transformation of index $k$ sending, say, $F^n\to F^n.$ This means we can get a vector $v\in F^n$ such that $A^{k-1}v\ne0,$ but $A^k=0.$

We claim that the vectors $v,Av,A^2v,\ldots,A^{k-1}v$ are linearly independent. Indeed, fix a linear relation\[\sum_{\ell=0}^{k-1}c_\ell A^\ell v=0.\]We can inductively show that the $c_\ell$ are all $0.$ Indeed, if everybody below some $c_L$ ($L \lt k$) is equal to $0,$ these terms don't do anything for the relation, so may multiply both sides by $A^{k-L-1},$ which gives\[c_LA^{k-1}v=c_LA^{k-1}v+\sum_{\ell=L+1}^{k-1}c_\ell A^{\ell-L-1}A^kv=0.\]We assumed $A^{k-1}v\ne0,$ so we must have $c_L=0,$ which is what we wanted.

It follows that we can have no larger than index $n$ for $A,$ for this would make $v,Av,A^2v,\ldots,A^{k-1}v$ a full basis. I.e., $n$ is the maximal index. In fact, $n$ is achievable by\[N=\begin{bmatrix} 0 & 1 & 0 & 0 & \cdots & 0 & 0 \\ 0 & 0 & 1 & 0 & \cdots & 0 & 0 \\ 0 & 0 & 0 & 1 & \cdots & 0 & 0 \\ \vdots & \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & 0 & \cdots & 0 & 1 \\ 0 & 0 & 0 & 0 & \cdots & 0 & 0\end{bmatrix}.\]Quickly, $N$ sends $e_n\to e_{n-1}\to\cdots\to e_3\to e_2\to e_1\to0.$ So after $n$ iterations, all basis vectors will be sent to $0,$ but after $n-1$ iterations, we will have $e_n\to e_1\ne0.$ In fact, this is readily seen to be the only linear transformation of index $n,$ up to a change of basis. Indeed, if we refix our basis\[(e_1,e_2,\ldots,e_n)=\left(A^{k-1}v,A^{k-2}v,\ldots,A^2v,Av,v\right),\]then $A$ is sending $e_n\to e_{n-1}\to\cdots\to e_3\to e_2\to e_1\to0$ again. In other words, $A$ looks like $N$ according to this basis, as claimed.

It follows that all nilpotent matrices of maximal index (i.e., $n$), are $N$ up to a change of basis. If our change of basis matrix is $S,$ then we can classify these nilpotent matrices as having the form $S^{-1}NS$ for any invertible matrix $S.$ This classification is good enough for me, so we are done here.

What I like about this proof is that asking for nilpotent matrices of rank $n$ is an entirely natural (for some definitions of natural) question, one that could be asked as soon as a student learns about matrix multiplication. But in fact, the proof here requires somewhat deep technology (change of basis) to get our classification, and further, this deep technology makes the proof immediate. It's quite cute.