When is the Resolvent Like a Rank One Matrix?
Anne Greenbaum, Abbas Salemi, Faranges Kyanfar
Abstract
For a square matrix \(A\), the resolvent at a point \(z \in {\bf C}\) is defined as \((A-zI )^{-1}\). It was observed in [2] that for certain matrices \(A\) with ill-conditioned eigenvalues the resolvent is close to the rank one matrix \(\sigma _1 (z) u_1 (z) v_1 (z )^H\), for a wide range of \(z\) values, where \(\sigma _1 (z)\) is the largest singular value of \((A-zI )^{-1}\) and \(u_1 (z)\) and \(v_1 (z)\) are the corresponding left and right singular vectors. Moreover, for a slightly smaller range of \(z\) values, \(u_1 (z)\) and \(v_1 (z)\) are almost orthogonal to each other. Here we provide a partial explanation for this phenomenon.
The distance in 2-norm from \((A-zI )^{-1}\) to the nearest rank one matrix, \(\sigma _1 (z) u_1 (z) v_1 (z )^H\), is \(\sigma _2 (z)\), the second largest singular value of \((A - zI )^{-1}\), and one might define the relative distance as \(\sigma _2 (z) / \| (A-zI )^{-1} \|_2 = \sigma _2 (z) / \sigma _1 (z)\). Given \(\epsilon > 0\), we are interested in
\(\seteqnumber{0}{}{0}\)\begin{equation} \{ z \in {\bf C} : \sigma _2 (z) / \sigma _1 (z) < \epsilon \} . \label {newset} \end{equation}
Recall that the \(\epsilon \)-pseudospectrum of \(A\) can be defined as [3]:
\(\seteqnumber{0}{}{1}\)\begin{equation} \{ z \in {\bf C} : 1/ \sigma _1 (z) < \epsilon \} . \label {pseudospectrum} \end{equation}
If it turns out that \(\sigma _2 (z) \sim 1\) throughout the \(\epsilon \)-pseudospectrum, then these two sets may look very similar. Indeed, the plots in [2] look much like pseudospectra.
To study this phenomenon, we will work with the matrix \(A-zI\), whose singular values are the inverses of those of \((A - zI )^{-1}\) and whose right and left singular vectors are the left and right singular vectors of \((A - zI )^{-1}\). If \(s_n (z)\) and \(s_{n-1} (z)\) denote the smallest and second smallest singular values of \(A-zI\), then we are interested in the ratio \(s_n (z) / s_{n-1} (z)\).
The following theorem and corollary are proved in a paper currently in progress [1]:
Theorem. Let \(\lambda \) be a simple eigenvalue of \(A\) and let \(A_0 := A - \lambda I = U S V^{H}\) be a singular value decomposition of \(A_0\), where \(U := [ u_1 , \ldots , u_n ]\), \(V := [ v_1 , \ldots , v_n ]\), \(S := \mbox {diag} ( s_1 , \ldots , s_{n-1} , 0 )\), \(s_1 \geq \ldots \geq s_{n-1} > 0\). Let \(A_0^{\dag }\) denote the Moore-Penrose pseudoinverse of \(A_0\):
\(\seteqnumber{0}{}{2}\)\begin{equation} A_0^{\dag } := V_{n-1} S_{n-1}^{-1} U_{n-1}^H , \label {pseduoinverse} \end{equation}
where \(U_{n-1} := [ u_1 , \ldots , u_{n-1} ]\), \(V_{n-1} := [ v_1 , \ldots , v_{n-1} ]\), and \(S_{n-1} := \mbox {diag} ( s_1 , \ldots , s_{n-1} )\). For each \(k=1,2, \ldots \), the smallest singular value of \(A_0 - zI\) is less than or equal to
\(\seteqnumber{0}{}{3}\)\begin{equation} \sum _{j=1}^k |z |^j | u_n^H ( A_0^{\dag } )^{j-1} v_n | + | z |^{k+1} / s_{n-1}^k . \label {sigmanbound2} \end{equation}
Taking \(k=1\) in the theorem, we obtain the bound
\[ s_{n} ( A_0 -zI ) \leq | u_n^H v_n |~|z| + | z |^2 / s_{n-1} . \]
If \(\lambda \) is ill-conditioned, it means that the inner product of the left and right unit eigenvectors of \(A\) corresponding to eigenvalue \(\lambda \) is tiny, but these eigenvectors are the same as the left and right singular vectors \(u_n\) and \(v_n\) corresponding to the zero singular value of \(A_0\). In this case, if also \(s_{n-1} \sim 1\), then \(s_n ( A_0 - zI )\) grows more like \(| z |^2\) than like \(| z |\) for \(| u_n^H v_n | << | z | << 1\). If \(u_n\) is also nearly orthogonal to \(A_0^{\dag } v_n\), then taking \(k=2\) in the theorem suggests that the growth rate of \(s_n (A_0 -zI)\) with \(|z|\) may be more like \(| z |^3\), and the more powers \(j\) for which \(| u_n^H ( A_0^{\dag } )^j v_n |\) is small, the higher the power of \(|z|\) describing the growth of \(s_n ( A_0 - zI )\), for \(|z| << 1\). If the absolute value of \(z\) times each eigenvalue of \(A_0^{\dag }\) is less than one, then the first sum in (4) will converge to a finite value as \(k \rightarrow \infty \), and for \(|z| < s_{n-1}\), the second term in (4) will go to \(0\) as \(k \rightarrow \infty \). In this case, the smallest bound may be obtained by taking \(k = \infty \).
Although we are not yet sure how to interpret the conditions that \(| u_n^H ( A_0^{\dag } )^j v_n |\) be small, these conditions seem to be satisfied by many test problems with ill-conditioned eigenvalues, such as those available through the ’gallery’ command in MATLAB and many in [3].
Corollary. With the notation and assumptions of the previous theorem, let \(\epsilon \in (0,1)\) be given. The region where the ratio of the second largest to the largest singular value of the resolvent \(( A-zI )^{-1}\) is less than \(\epsilon \) contains the set of points \(z \in {\bf C}\) such that \(|z- \lambda | < s_{n-1}\) and
\(\seteqnumber{0}{}{4}\)\begin{equation} \min _{k=1,2, \ldots } \left [ \sum _{j=1}^k | z- \lambda |^j | u_n^H ( A_0^{\dag } )^{j-1} v_n | + |z - \lambda |^{k+1} / s_{n-1}^k \right ] / ( s_{n-1} - |z - \lambda | ) < \epsilon . \label {epsregion} \end{equation}
The \(\epsilon \)-pseudospectrum of \(A\) contains the set of points \(z \in {\bf C}\) such that
\(\seteqnumber{0}{}{5}\)\begin{equation} \min _{k=1,2, \ldots } \left [ \sum _{j=1}^k | z- \lambda |^j | u_n^H ( A_0^{\dag } )^{j-1} v_n | + |z - \lambda |^{k+1} / s_{n-1}^k \right ] < \epsilon . \label {epspseudo} \end{equation}
This corollary defines disks about each eigenvalue that are known to lie within the regions defined in (1) and (2). In our numerical tests, they are not far from the largest disks about the eigenvalues that are contained in these regions.
We will report on these results, as well as some results obtained by differentiating the singular values and vectors of \(A-zI\).
References
-
[1] A. Greenbaum, F. Kyanfar, and A. Salemi, When is the Resolvent Like a Rank One Matrix?, to appear.
-
[2] A. Greenbaum and N. Wellen, Comparison of K-Spectral Set Bounds on Norms of Functions of a Matrix or Operator, Lin. Alg. Appl. 694, pp. 52-77, 2024.
-
[3] L. N. Trefethen and M. Embree, Spectra and Pseudospectra: The Behavior of Nonnormal Matrices and Operators, Princeton University Press, 2005.