jupytext | kernelspec | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
In addition to what's in Anaconda, this lecture will need the following libraries:
:tags: [hide-output]
!pip install quantecon
In this lecture we will begin with the foundational concepts in spectral theory.
Then we will explore the Perron-Frobenius theorem and connect it to applications in Markov chains and networks.
We will use the following imports:
import numpy as np
from numpy.linalg import eig
import scipy as sp
import quantecon as qe
Often, in economics, the matrix that we are dealing with is nonnegative.
Nonnegative matrices have several special and useful properties.
In this section we will discuss some of them --- in particular, the connection between nonnegativity and eigenvalues.
An
We denote this as
(irreducible)=
We introduced irreducible matrices in the Markov chain lecture.
Here we generalize this concept:
Let
An
In other words, for each
:label: eigen2_ex_irr
Here are some examples to illustrate this further:
$$
A = \begin{bmatrix} 0.5 & 0.1 \\
0.2 & 0.2
\end{bmatrix}
$$
$A$ is irreducible since $a_{ij}>0$ for all $(i,j)$.
$$
B = \begin{bmatrix} 0 & 1 \\
1 & 0
\end{bmatrix}
, \quad
B^2 = \begin{bmatrix} 1 & 0 \\
0 & 1
\end{bmatrix}
$$
$B$ is irreducible since $B + B^2$ is a matrix of ones.
$$
C = \begin{bmatrix} 1 & 0 \\
0 & 1
\end{bmatrix}
$$
$C$ is not irreducible since $C^k = C$ for all $k \geq 0$ and thus
$c^{k}_{12},c^{k}_{21} = 0$ for all $k \geq 0$.
Recall that we previously discussed eigenvectors in {ref}Eigenvalues and Eigenvectors <la_eigenvalues>
.
In particular,
In this section we introduce left eigenvectors.
To avoid confusion, what we previously referred to as "eigenvectors" will be called "right eigenvectors".
Left eigenvectors will play important roles in what follows, including that of stochastic steady states for dynamic models under a Markov assumption.
A vector
In other words, if
This hints at how to compute left eigenvectors
A = np.array([[3, 2],
[1, 4]])
# Compute eigenvalues and right eigenvectors
λ, v = eig(A)
# Compute eigenvalues and left eigenvectors
λ, w = eig(A.T)
# Keep 5 decimals
np.set_printoptions(precision=5)
print(f"The eigenvalues of A are:\n {λ}\n")
print(f"The corresponding right eigenvectors are: \n {v[:,0]} and {-v[:,1]}\n")
print(f"The corresponding left eigenvectors are: \n {w[:,0]} and {-w[:,1]}\n")
We can also use scipy.linalg.eig
with argument left=True
to find left eigenvectors directly
eigenvals, ε, e = sp.linalg.eig(A, left=True)
print(f"The eigenvalues of A are:\n {eigenvals.real}\n")
print(f"The corresponding right eigenvectors are: \n {e[:,0]} and {-e[:,1]}\n")
print(f"The corresponding left eigenvectors are: \n {ε[:,0]} and {-ε[:,1]}\n")
The eigenvalues are the same while the eigenvectors themselves are different.
(Also note that we are taking the nonnegative value of the eigenvector of {ref}dominant eigenvalue <perron-frobe>
, this is because eig
automatically normalizes the eigenvectors.)
We can then take transpose to obtain
This is a more common expression and where the name left eigenvectors originates.
(perron-frobe)=
For a square nonnegative matrix
For any such matrix
:label: perron-frobenius
If a matrix $A \geq 0$ then,
1. the dominant eigenvalue of $A$, $r(A)$, is real-valued and nonnegative.
2. for any other eigenvalue (possibly complex) $\lambda$ of $A$, $|\lambda| \leq r(A)$.
3. we can find a nonnegative and nonzero eigenvector $v$ such that $Av = r(A)v$.
Moreover if $A$ is also irreducible then,
4. the eigenvector $v$ associated with the eigenvalue $r(A)$ is strictly positive.
5. there exists no other positive eigenvector $v$ (except scalar multiples of $v$) associated with $r(A)$.
(More of the Perron-Frobenius theorem about primitive matrices will be introduced {ref}`below <prim_matrices>`.)
(This is a relatively simple version of the theorem --- for more details see here).
We will see applications of the theorem below.
Let's build our intuition for the theorem using a simple example we have seen before.
Now let's consider examples for each case.
Consider the following irreducible matrix
A = np.array([[0, 1, 0],
[.5, 0, .5],
[0, 1, 0]])
We can compute the dominant eigenvalue and the corresponding eigenvector
eig(A)
Now we can see the claims of the Perron-Frobenius theorem holds for the irreducible matrix
- The dominant eigenvalue is real-valued and non-negative.
- All other eigenvalues have absolute values less than or equal to the dominant eigenvalue.
- A non-negative and nonzero eigenvector is associated with the dominant eigenvalue.
- As the matrix is irreducible, the eigenvector associated with the dominant eigenvalue is strictly positive.
- There exists no other positive eigenvector associated with the dominant eigenvalue.
(prim_matrices)=
We know that in real world situations it's hard for a matrix to be everywhere positive (although they have nice properties).
The primitive matrices, however, can still give us helpful properties with looser definitions.
Let
A matrix is called primitive if there exists a
:label: eigen2_ex_prim
Recall the examples given in irreducible matrices:
$$
A = \begin{bmatrix} 0.5 & 0.1 \\
0.2 & 0.2
\end{bmatrix}
$$
$A$ here is also a primitive matrix since $A^k$ is everywhere nonnegative for $k \in \mathbb{N}$.
$$
B = \begin{bmatrix} 0 & 1 \\
1 & 0
\end{bmatrix}
, \quad
B^2 = \begin{bmatrix} 1 & 0 \\
0 & 1
\end{bmatrix}
$$
$B$ is irreducible but not primitive since there are always zeros in either principal diagonal or secondary diagonal.
We can see that if a matrix is primitive, then it implies the matrix is irreducible but not vice versa.
Now let's step back to the primitive matrices part of the Perron-Frobenius theorem
:label: con-perron-frobenius
If $A$ is primitive then,
6. the inequality $|\lambda| \leq r(A)$ is **strict** for all eigenvalues $\lambda$ of $A$ distinct from $r(A)$, and
7. with $v$ and $w$ normalized so that the inner product of $w$ and $v = 1$, we have
$ r(A)^{-m} A^m$ converges to $v w^{\top}$ when $m \rightarrow \infty$. The matrix $v w^{\top}$ is called the **Perron projection** of $A$.
Consider the following primitive matrix
B = np.array([[0, 1, 1],
[1, 0, 1],
[1, 1, 0]])
np.linalg.matrix_power(B, 2)
We compute the dominant eigenvalue and the corresponding eigenvector
eig(B)
Now let's give some examples to see if the claims of the Perron-Frobenius theorem hold for the primitive matrix
- The dominant eigenvalue is real-valued and non-negative.
- All other eigenvalues have absolute values strictly less than the dominant eigenvalue.
- A non-negative and nonzero eigenvector is associated with the dominant eigenvalue.
- The eigenvector associated with the dominant eigenvalue is strictly positive.
- There exists no other positive eigenvector associated with the dominant eigenvalue.
- The inequality
$|\lambda| < r(B)$ holds for all eigenvalues$\lambda$ of$B$ distinct from the dominant eigenvalue.
Furthermore, we can verify the convergence property (7) of the theorem on the following examples:
def compute_perron_projection(M):
eigval, v = eig(M)
eigval, w = eig(M.T)
r = np.max(eigval)
# Find the index of the dominant (Perron) eigenvalue
i = np.argmax(eigval)
# Get the Perron eigenvectors
v_P = v[:, i].reshape(-1, 1)
w_P = w[:, i].reshape(-1, 1)
# Normalize the left and right eigenvectors
norm_factor = w_P.T @ v_P
v_norm = v_P / norm_factor
# Compute the Perron projection matrix
P = v_norm @ w_P.T
return P, r
def check_convergence(M):
P, r = compute_perron_projection(M)
print("Perron projection:")
print(P)
# Define a list of values for n
n_list = [1, 10, 100, 1000, 10000]
for n in n_list:
# Compute (A/r)^n
M_n = np.linalg.matrix_power(M/r, n)
# Compute the difference between A^n / r^n and the Perron projection
diff = np.abs(M_n - P)
# Calculate the norm of the difference matrix
diff_norm = np.linalg.norm(diff, 'fro')
print(f"n = {n}, error = {diff_norm:.10f}")
A1 = np.array([[1, 2],
[1, 4]])
A2 = np.array([[0, 1, 1],
[1, 0, 1],
[1, 1, 0]])
A3 = np.array([[0.971, 0.029, 0.1, 1],
[0.145, 0.778, 0.077, 0.59],
[0.1, 0.508, 0.492, 1.12],
[0.2, 0.8, 0.71, 0.95]])
for M in A1, A2, A3:
print("Matrix:")
print(M)
check_convergence(M)
print()
print("-"*36)
print()
The convergence is not observed in cases of non-primitive matrices.
Let's go through an example
B = np.array([[0, 1, 1],
[1, 0, 0],
[1, 0, 0]])
# This shows that the matrix is not primitive
print("Matrix:")
print(B)
print("100th power of matrix B:")
print(np.linalg.matrix_power(B, 100))
check_convergence(B)
The result shows that the matrix is not primitive as it is not everywhere positive.
These examples show how the Perron-Frobenius theorem relates to the eigenvalues and eigenvectors of positive matrices and the convergence of the power of matrices.
In fact we have already seen the theorem in action before in {ref}the Markov chain lecture <mc1_ex_1>
.
(spec_markov)=
We are now prepared to bridge the languages spoken in the two lectures.
A primitive matrix is both irreducible and aperiodic.
So Perron-Frobenius theorem explains why both {ref}Imam and Temple matrix <mc_eg3>
and Hamilton matrix converge to a stationary distribution, which is the Perron projection of the two matrices
P = np.array([[0.68, 0.12, 0.20],
[0.50, 0.24, 0.26],
[0.36, 0.18, 0.46]])
print(compute_perron_projection(P)[0])
mc = qe.MarkovChain(P)
ψ_star = mc.stationary_distributions[0]
ψ_star
P_hamilton = np.array([[0.971, 0.029, 0.000],
[0.145, 0.778, 0.077],
[0.000, 0.508, 0.492]])
print(compute_perron_projection(P_hamilton)[0])
mc = qe.MarkovChain(P_hamilton)
ψ_star = mc.stationary_distributions[0]
ψ_star
We can also verify other properties hinted by Perron-Frobenius in these stochastic matrices.
+++
Another example is the relationship between convergence gap and convergence rate.
In the {ref}exercise<mc1_ex_1>
, we stated that the convergence rate is determined by the spectral gap, the difference between the largest and the second largest eigenvalue.
This can be proven using what we have learned here.
Please note that we use
With Markov model
This is proven in {cite}sargent2023economic
and a nice discussion can be found here.
In this formula
Premultiplying
Recall that eigenvalues are ordered from smallest to largest from
As we have seen, the largest eigenvalue for a primitive stochastic matrix is one.
This can be proven using Gershgorin Circle Theorem, but it is out of the scope of this lecture.
So by the statement (6) of Perron-Frobenius theorem,
Hence, after taking the Euclidean norm deviation, we obtain
Thus, the rate of convergence is governed by the modulus of the second largest eigenvalue.
:label: eig_ex1
Wassily Leontief developed a model of an economy with
Under this model some of the output is consumed internally by the industries and the rest is consumed by external consumers.
We define a simple model with 3 sectors - agriculture, industry, and service.
The following table describes how output is distributed within the economy:
Total output | Agriculture | Industry | Service | Consumer | |
---|---|---|---|---|---|
Agriculture | 0.3$x_1$ | 0.2$x_2$ | 0.3$x_3$ | 4 | |
Industry | 0.2$x_1$ | 0.4$x_2$ | 0.3$x_3$ | 5 | |
Service | 0.2$x_1$ | 0.5$x_2$ | 0.1$x_3$ | 12 |
The first row depicts how agriculture's total output
-
$0.3x_1$ is used as inputs within agriculture itself, -
$0.2x_2$ is used as inputs by the industry sector to produce$x_2$ units, -
$0.3x_3$ is used as inputs by the service sector to produce$x_3$ units and - 4 units is the external demand by consumers.
We can transform this into a system of linear equations for the 3 sectors as given below:
This can be transformed into the matrix equation
The solution $x^{}$ is given by the equation $x^{} = (I-A)^{-1} d$
-
Since
$A$ is a nonnegative irreducible matrix, find the Perron-Frobenius eigenvalue of$A$ . -
Use the {ref}
Neumann Series Lemma <la_neumann>
to find the solution$x^{*}$ if it exists.
:class: dropdown
A = np.array([[0.3, 0.2, 0.3],
[0.2, 0.4, 0.3],
[0.2, 0.5, 0.1]])
evals, evecs = eig(A)
r = max(abs(λ) for λ in evals) #dominant eigenvalue/spectral radius
print(r)
Since we have
I = np.identity(3)
B = I - A
d = np.array([4, 5, 12])
d.shape = (3,1)
B_inv = np.linalg.inv(B)
x_star = B_inv @ d
print(x_star)