有理标准形

55 阅读7分钟

12.2 THE RATIONAL CANONICAL FORM

12.2 有理标准形

We now apply our results on finitely generated modules in the special case where the P.I.D. is the ring F[x]F\left\lbrack x\right\rbrack of polynomials in xx with coefficients in a field FF .

我们现在将我们对有限生成模的结果应用于特殊情况,其中P.I.D.是F[x]F\left\lbrack x\right\rbrack——带有在域FF中的系数的xx多项式环。

VV 是一个定义在 FF 上的有限维向量空间,其维度为 nn,并且令 TTVV 的一个固定线性变换(即从 VV 到其自身的变换)。正如我们在第10章中看到的,我们可以将 VV 视为一个 F[x]F\left\lbrack x\right\rbrack -模,其中元素 xx 作用于 VV 的方式是线性变换 TT(因此任何 xx 的多项式在 VV 上的作用都是相同的 TT 的多项式)。由于假设 VVFF 上具有有限维度,因此按照定义,它作为 FF -模是有限生成的,因此也必然作为 F[x]F\left\lbrack x\right\rbrack -模是有限生成的,所以上一节的分类定理适用。

任何非零自由 F[x]F\left\lbrack x\right\rbrack -模(同构于 F[x]F\left\lbrack x\right\rbrack 的副本的直接和)是 FF 上的无穷维向量空间,所以如果 VVFF 上有有限维数,那么它实际上必定是一个扭 F[x]F\left\lbrack x\right\rbrack -模(即,它的自由秩为 0)。由基本定理可知,VV 作为 F[x]F\left\lbrack x\right\rbrack -模同构于循环、扭 F[x]F\left\lbrack x\right\rbrack -模的直接和。我们将看到这种 VV 的分解将允许我们选择一个基,相对于该基,线性变换 TT 的矩阵表示形式是特定的简单形式。当我们使用 VV 的不变因子分解时,我们得到矩阵 TT 的有理标准形,我们在本节中分析它。当我们使用初等因子分解(并且 FF 包含 TT 的所有特征值时),我们得到约当标准形,这在下一节中讨论,并且之前提到过,它是尽可能接近对角矩阵的 TT 的矩阵表示。基本定理的唯一性部分确保了有理标准形和约当标准形的唯一性(这就是为什么它们被称为标准形)。

One important use of these canonical forms is to classify the distinct linear transformations of VV . In particular they allow us to determine when two matrices represent the same linear transformation,i.e.,when two given n×nn \times n matrices are similar.

这些标准形的一个重要应用是分类 VV 的不同线性变换。特别是它们允许我们确定两个矩阵是否代表相同的线性变换,即两个给定的 n×nn \times n 矩阵何时相似。

Note that this will be another instance where the structure of the space being acted upon (the invariant factor decomposition of VV for example) is used to obtain significant information on the algebraic objects (in this case the linear transformations) which are acting. This will be considered in the case of groups acting on vector spaces in Chapter 18 (and goes under the name of Representation Theory of Groups).

注意,这将是另一个例子,其中一个作用于其上的空间的结构(例如 VV 的不变因子分解)被用来获取关于代数对象(在这种情况下是线性变换)的显著信息。这将在第18章中考虑群作用于向量空间的情形(并以群表示论的名字出现)。

Before describing the rational canonical form in detail we first introduce some linear algebra.

在详细描述有理标准形之前,我们首先介绍一些线性代数知识。

Definition.

定义。

(1) An element λ\lambda of FF is called an eigenvalue of the linear transformation TT if there is a nonzero vector vVv \in V such that T(v)=λvT\left( v\right) = {\lambda v} . In this situation vv is called an eigenvector of TT with corresponding eigenvalue λ\lambda .

(1)如果存在一个非零向量 vVv \in V 使得 T(v)=λvT\left( v\right) = {\lambda v} ,则称 FF 中的元素 λ\lambda 为线性变换 TT 的特征值。在这种情况下,vv 被称为对应于特征值 λ\lambdaTT 的特征向量。

(2) If AA is an n×nn \times n matrix with coefficients in FF ,an element λ\lambda is called an eigenvalue of AA with corresponding eigenvector vv if vv is a nonzero n×1n \times 1 column vector such that Av=λv{Av} = {\lambda v} .

(2)如果 AA 是一个在 FF 中具有系数的 n×nn \times n 矩阵,那么如果 vv 是一个非零的 n×1n \times 1 列向量,且满足 Av=λv{Av} = {\lambda v} ,则称元素 λ\lambdaAA 的特征值,对应的特征向量为 vv

(3) If λ\lambda is an eigenvalue of the linear transformation TT ,the set {vVT(v)=λv}\{ v \in V \mid T\left( v\right) = {\lambda v}\} is called the eigenspace of TT corresponding to the eigenvalue λ\lambda . Similarly,if λ\lambda is an eigenvalue of the n×nn \times n matrix AA ,the set of n×1n \times 1 matrices vv with Av=λv{Av} = {\lambda v} is called the eigenspace of AA corresponding to the eigenvalue λ\lambda .

(3)如果 λ\lambda 是线性变换 TT 的特征值,那么集合 {vVT(v)=λv}\{ v \in V \mid T\left( v\right) = {\lambda v}\} 被称为对应于特征值 λ\lambdaTT 的特征空间。类似地,如果 λ\lambdan×nn \times n 矩阵 AA 的特征值,那么满足 Av=λv{Av} = {\lambda v}n×1n \times 1 矩阵的集合 vv 被称为对应于特征值 λ\lambdaAA 的特征空间。

Note that if we fix a basis B\mathcal{B} of VV then any linear transformation TT of VV has an associated n×nn \times n matrix AA . Conversely,if AA is any n×nn \times n matrix then the map TT defined by T(v)=AvT\left( v\right) = {Av} for vVv \in V ,where the vv on the right is the n×1n \times 1 vector consisting of the coordinates of vv with respect to the fixed basis B\mathcal{B} of VV ,is a linear transformation of VV . Then vv is an eigenvector of TT with corresponding eigenvalue λ\lambda if and only if the coordinate vector of vv with respect to B\mathcal{B} is an eigenvector of AA with eigenvalue λ\lambda . In other words,the eigenvalues for the linear transformation TT are the same as the eigenvalues for the matrix AA of TT with respect to any fixed basis for VV .

请注意,如果我们固定 B\mathcal{B} 的一个基 VV,那么任何 VV 的线性变换 TT 都有一个关联的 n×nn \times n 矩阵 AA。反之,如果 AA 是任意的 n×nn \times n 矩阵,那么由 T(v)=AvT\left( v\right) = {Av} 定义的映射 TT 对于 vVv \in V,其中右侧的 vv 是相对于固定基 B\mathcal{B}VVvv 坐标组成的 n×1n \times 1 向量,是一个 VV 的线性变换。当且仅当相对于 B\mathcal{B}vv 坐标向量是 AA 的特征向量,对应的特征值为 λ\lambda 时,vvTT 的特征向量。换句话说,线性变换 TT 的特征值与任何固定基对于 VVAA 矩阵的特征值相同。

Definition. The determinant of a linear transformation from VV to VV is the determinant of any matrix representing the linear transformation (note that this does not depend on the choice of the basis used).

定义。从 VVVV 的线性变换的行列式是表示该线性变换的任意矩阵的行列式(注意,这并不依赖于所使用的基的选择)。

Proposition 12. The following are equivalent:

命题 12。以下等价:

(1) λ\lambda is an eigenvalue of TT

(1) λ\lambdaTT 的特征值

(2) λIT{\lambda I} - T is a singular linear transformation of VV

(2) λIT{\lambda I} - TVV 的奇异线性变换

(3) det(λIT)=0\det \left( {{\lambda I} - T}\right) = 0 .

(3) det(λIT)=0\det \left( {{\lambda I} - T}\right) = 0

Proof: Since λ\lambda is an eigenvalue of TT with corresponding eigenvector vv if and only if vv is a nonzero vector in the kernel of λIT{\lambda I} - T ,it follows that (1) and (2) are equivalent. (2) and (3) are equivalent by our results on determinants.

证明:由于 λ\lambdaTT 的特征值,对应的特征向量为 vv,当且仅当 vvλIT{\lambda I} - T 核中的非零向量,因此 (1) 和 (2) 等价。根据我们对行列式的研究结果,(2) 和 (3) 也是等价的。

Definition. Let xx be an indeterminate over FF . The polynomial det(xIT)\det \left( {{xI} - T}\right) is called the characteristic polynomial of TT and will be denoted cT(x){c}_{T}\left( x\right) . If AA is an n×nn \times n matrix with coefficients in F,det(xIA)F,\det \left( {{xI} - A}\right) is called the characteristic polynomial of AA and will be denoted cA(x){c}_{A}\left( x\right) .

定义。设 xxFF 上的一个不定元。多项式 det(xIT)\det \left( {{xI} - T}\right) 被称为 TT 的特征多项式,并表示为 cT(x){c}_{T}\left( x\right)。如果 AA 是一个在 F,det(xIA)F,\det \left( {{xI} - A}\right) 中具有系数的 n×nn \times n 矩阵,则该矩阵的特征多项式被称为 AA 的特征多项式,并表示为 cA(x){c}_{A}\left( x\right)

It is easy to see by expanding the determinant that the characteristic polynomial of either TT or AA is a monic polynomial of degree n=dimVn = \dim V . Proposition 12 says that the set of eigenvalues of TT (or AA ) is precisely the set of roots of the characteristic polynomial of TT (of AA ,respectively). In particular, TT has at most nn distinct eigenvalues.

通过展开行列式可以容易地看出,TTAA 的特征多项式是次数为 n=dimVn = \dim V 的单变量多项式。命题 12 说明 TT(或 AA)的特征值集合恰好是 TT(或分别的 AA)特征多项式的根集合。特别是,TT 最多有 nn 个不同的特征值。

We have seen that VV considered as a module over F[x]F\left\lbrack x\right\rbrack via the linear transformation TT is a torsion F[x]F\left\lbrack x\right\rbrack -module. Let m(x)F[x]m\left( x\right) \in F\left\lbrack x\right\rbrack be the unique monic polynomial generating the annihilator of VV in F[x]F\left\lbrack x\right\rbrack . Equivalently, m(x)m\left( x\right) is the unique monic polynomial of minimal degree annihilating VV (i.e.,such that m(T)m\left( T\right) is the 0 linear transformation),and if f(x)F[x]f\left( x\right) \in F\left\lbrack x\right\rbrack is any polynomial annihilating V,m(x)V,m\left( x\right) divides f(x)f\left( x\right) . Since the ring of all n×nn \times n matrices over FF is isomorphic to the collection of all linear transformations of VV to itself (an isomorphism is obtained by choosing a basis for VV ),it follows that for any n×nn \times n matrix AA over FF there is similarly a unique monic polynomial of minimal degree with m(A)m\left( A\right) the zero matrix.

我们已经看到,通过线性变换 TTVV 视为 F[x]F\left\lbrack x\right\rbrack 的模是一个扭模。设 m(x)F[x]m\left( x\right) \in F\left\lbrack x\right\rbrack 是唯一生成 VVF[x]F\left\lbrack x\right\rbrack 中消灭子的单变量多项式。等价地,m(x)m\left( x\right) 是唯一消灭 VV 的最小次数的单变量多项式(即,使得 m(T)m\left( T\right) 是零线性变换),如果 f(x)F[x]f\left( x\right) \in F\left\lbrack x\right\rbrack 是任何消灭 V,m(x)V,m\left( x\right) 的多项式,则 V,m(x)V,m\left( x\right)f(x)f\left( x\right)。由于所有 n×nn \times n 矩阵在 FF 上的环同构于所有将 VV 映射到自身的线性变换的集合(通过为 VV 选择一个基可以得到同构),因此对于任何在 FF 上的 n×nn \times n 矩阵 AA,也存在一个具有 m(A)m\left( A\right) 零矩阵的唯一最小次数的单变量多项式。

Definition. The unique monic polynomial which generates the ideal Ann(V)\operatorname{Ann}\left( V\right) in F[x]F\left\lbrack x\right\rbrack is called the minimal polynomial of TT and will be denoted mT(x){m}_{T}\left( x\right) . The unique monic polynomial of smallest degree which when evaluated at the matrix AA is the zero matrix is called the minimal polynomial of AA and will be denoted mA(x){m}_{A}\left( x\right) .

定义。生成理想 Ann(V)\operatorname{Ann}\left( V\right)F[x]F\left\lbrack x\right\rbrack 中的唯一单变量多项式称为 TT 的最小多项式,并表示为 mT(x){m}_{T}\left( x\right) 。当在矩阵 AA 上求值时,唯一的最小次数的单变量多项式,其结果为零矩阵,称为 AA 的最小多项式,并表示为 mA(x){m}_{A}\left( x\right)

It is easy to see (cf. Exercise 5) that the degrees of these minimal polynomials are at most n2{n}^{2} where nn is the dimension of VV . We shall shortly prove that the minimal polynomial for TT is a divisor of the characteristic polynomial for TT (this is the Cayley-Hamilton Theorem),and similarly for AA ,so in fact the degrees of these polynomials are at most nn .

很容易看出(参见练习5),这些最小多项式的次数最多为 n2{n}^{2},其中 nnVV 的维度。我们很快将证明 TT 的最小多项式是 TT 的特征多项式的因子(这是凯莱-哈密顿定理),对于 AA 也是如此,因此实际上这些多项式的次数最多为 nn

We now describe the rational canonical form of the linear transformation TT (respectively,of the n×nn \times n matrix AA ). By Theorem 5 we have an isomorphism

我们现在描述线性变换 TT(分别是 n×nn \times n 矩阵 AA 的)的有理标准形。根据定理5,我们有一个同构

VF[x]/(a1(x))F[x]/(a2(x))F[x]/(am(x))(12.1)V \cong F\left\lbrack x\right\rbrack /\left( {{a}_{1}\left( x\right) }\right) \oplus F\left\lbrack x\right\rbrack /\left( {{a}_{2}\left( x\right) }\right) \oplus \cdots \oplus F\left\lbrack x\right\rbrack /\left( {{a}_{m}\left( x\right) }\right) \tag{12.1}

of F[x]F\left\lbrack x\right\rbrack -modules where a1(x),a2(x),,am(x){a}_{1}\left( x\right) ,{a}_{2}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) are polynomials in F[x]F\left\lbrack x\right\rbrack of degree at least one with the divisibility conditions

F[x]F\left\lbrack x\right\rbrack -模,其中 a1(x),a2(x),,am(x){a}_{1}\left( x\right) ,{a}_{2}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) 是次数至少为1的 F[x]F\left\lbrack x\right\rbrack 中的多项式,满足整除条件

a1(x)a2(x)am(x).{a}_{1}\left( x\right) \left| {{a}_{2}\left( x\right) }\right| \cdots \mid {a}_{m}\left( x\right) .

These invariant factors ai(x){a}_{i}\left( x\right) are only determined up to a unit in F[x]F\left\lbrack x\right\rbrack but since the units of F[x]F\left\lbrack x\right\rbrack are precisely the nonzero elements of FF (i.e., the nonzero constant polynomials), we may make these polynomials unique by stipulating that they be monic.

这些不变因子 ai(x){a}_{i}\left( x\right) 只在 F[x]F\left\lbrack x\right\rbrack 中的单位上确定,但由于 F[x]F\left\lbrack x\right\rbrack 的单位正是 FF 的非零元素(即非零常数多项式),我们可以通过指定它们为单变量多项式来使这些多项式唯一。

Since the annihilator of VV is the ideal (am(x))\left( {{a}_{m}\left( x\right) }\right) (part (3) of Theorem 5),we immediately obtain:

由于 VV 的消灭子是理想 (am(x))\left( {{a}_{m}\left( x\right) }\right)(定理5的第三部分),我们立即得到:

Proposition 13. The minimal polynomial mT(x){m}_{T}\left( x\right) is the largest invariant factor of VV . All the invariant factors of VV divide mT(x){m}_{T}\left( x\right) .

命题13。最小多项式 mT(x){m}_{T}\left( x\right)VV 的最大不变因子。 VV 的所有不变因子都整除 mT(x){m}_{T}\left( x\right)

We shall see below how to calculate not only the minimal polynomial for TT but also the other invariant factors.

我们将在下面看到如何计算 TT 的最小多项式,以及其他不变因子。

We now choose a basis for each of the direct summands for VV in the decomposition (1)above for which the matrix for TT is quite simple. Recall that the linear transformation TT acting on the left side of (1) is the element xx acting by multiplication on each of the factors on the right side of the isomorphism in (1).

我们现在为 VV 的每个直和项选择一个基,使得在分解 (1) 中的矩阵 TT 非常简单。回想一下,线性变换 TT 作用于 (1) 左侧的元素 xx 通过乘法作用于同构 (1) 右侧的每个因子。

We have seen in the example following Proposition 1 of Chapter 11 that the elements 1,xˉ,xˉ2,,xˉk11,\bar{x},{\bar{x}}^{2},\ldots ,{\bar{x}}^{k - 1} give a basis for the vector space F[x]/(a(x))F\left\lbrack x\right\rbrack /\left( {a\left( x\right) }\right) where a(x)=xk+a\left( x\right) = {x}^{k} + bk1xk1++b1x+b0{b}_{k - 1}{x}^{k - 1} + \cdots + {b}_{1}x + {b}_{0} is any monic polynomial in F[x]F\left\lbrack x\right\rbrack and xˉ=x  mod  (a(x))\bar{x} = x{\;\operatorname{mod}\;\left( {a\left( x\right) }\right) } . With respect to this basis the linear transformation of multiplication by xx acts in a simple manner:

我们在第11章命题1后的例子中看到,元素 1,xˉ,xˉ2,,xˉk11,\bar{x},{\bar{x}}^{2},\ldots ,{\bar{x}}^{k - 1} 构成向量空间 F[x]/(a(x))F\left\lbrack x\right\rbrack /\left( {a\left( x\right) }\right) 的一个基,其中 a(x)=xk+a\left( x\right) = {x}^{k} + bk1xk1++b1x+b0{b}_{k - 1}{x}^{k - 1} + \cdots + {b}_{1}x + {b}_{0}F[x]F\left\lbrack x\right\rbrack 中的任意首一多项式,xˉ=x  mod  (a(x))\bar{x} = x{\;\operatorname{mod}\;\left( {a\left( x\right) }\right) } 。对于这个基,乘以 xx 的线性变换作用方式很简单:

1  xˉ1\; \mapsto \bar{x}
xˉxˉ2\bar{x} \mapsto {\bar{x}}^{2}
xˉ2xˉ3{\bar{x}}^{2} \mapsto {\bar{x}}^{3}

x:x :

xˉk2xˉk1{\bar{x}}^{k - 2} \mapsto {\bar{x}}^{k - 1}
xˉk1xˉk=b0b1xˉbk1xˉk1{\bar{x}}^{k - 1} \mapsto {\bar{x}}^{k} = - {b}_{0} - {b}_{1}\bar{x} - \cdots - {b}_{k - 1}{\bar{x}}^{k - 1}

where the last equality is because xˉk+bk1xˉk1++b1xˉ+b0=0{\bar{x}}^{k} + {b}_{k - 1}{\bar{x}}^{k - 1} + \cdots + {b}_{1}\bar{x} + {b}_{0} = 0 since a(xˉ)=0a\left( \bar{x}\right) = 0 in F[x]/(a(x))F\left\lbrack x\right\rbrack /\left( {a\left( x\right) }\right) . With respect to this basis,the matrix formultiplication by xx is therefore

其中最后一个等式是因为 xˉk+bk1xˉk1++b1xˉ+b0=0{\bar{x}}^{k} + {b}_{k - 1}{\bar{x}}^{k - 1} + \cdots + {b}_{1}\bar{x} + {b}_{0} = 0 ,由于 a(xˉ)=0a\left( \bar{x}\right) = 0F[x]/(a(x))F\left\lbrack x\right\rbrack /\left( {a\left( x\right) }\right) 中。对于这个基,乘以 xx 的矩阵形式因此是

(00b010b101b200001bk1).\left( \begin{matrix} 0 & 0 & \ldots & \ldots & \ldots & - {b}_{0} \\ 1 & 0 & \ldots & \ldots & \ldots & - {b}_{1} \\ 0 & 1 & \ldots & \ldots & \ldots & - {b}_{2} \\ 0 & 0 & \ddots & & & \vdots \\ \vdots & \vdots & & \ddots & & \vdots \\ 0 & 0 & \ldots & \ldots & 1 & - {b}_{k - 1} \end{matrix}\right) .

Such matrices are given a name:

这样的矩阵有一个名称:

Definition. Let a(x)=xk+bk1xk1++b1x+b0a\left( x\right) = {x}^{k} + {b}_{k - 1}{x}^{k - 1} + \cdots + {b}_{1}x + {b}_{0} be any monic polynomial in F[x]F\left\lbrack x\right\rbrack . The companion matrix of a(x)a\left( x\right) is the k×kk \times k matrix with 1’s down the first subdiagonal, b0,b1,,bk1- {b}_{0}, - {b}_{1},\ldots , - {b}_{k - 1} down the last column and zeros elsewhere. The companion matrix of a(x)a\left( x\right) will be denoted by Ca(x){\mathcal{C}}_{a\left( x\right) } .

定义。设 a(x)=xk+bk1xk1++b1x+b0a\left( x\right) = {x}^{k} + {b}_{k - 1}{x}^{k - 1} + \cdots + {b}_{1}x + {b}_{0}F[x]F\left\lbrack x\right\rbrack 中的任意首一多项式。a(x)a\left( x\right) 的伴随矩阵是具有1沿着第一条次对角线下方,b0,b1,,bk1- {b}_{0}, - {b}_{1},\ldots , - {b}_{k - 1} 沿着最后一列,其余位置为0的 k×kk \times k 矩阵。a(x)a\left( x\right) 的伴随矩阵将表示为 Ca(x){\mathcal{C}}_{a\left( x\right) }

We apply this to each of the cyclic modules on the right side of (1) above and let Bi{\mathcal{B}}_{i} be the elements of VV corresponding to the basis chosen above for the cyclic factor F[x]/(ai(x))F\left\lbrack x\right\rbrack /\left( {{a}_{i}\left( x\right) }\right) under the isomorphism in (1). Then by definition the linear transformation TT acts on Bi{\mathcal{B}}_{i} by the companion matrix for ai(x){a}_{i}\left( x\right) since we have seen that this is how multiplication by xx acts. The union B\mathcal{B} of the Bi{\mathcal{B}}_{i} ’s gives a basis for VV since the sum on the right of (1) is direct and with respect to this basis the linear transformation TT has as matrix the direct sum of the companion matrices for the invariant factors, i.e.,

我们将此应用于上述(1)右侧的每个循环模块,并让 Bi{\mathcal{B}}_{i}VV 中与上述选择的循环因子 F[x]/(ai(x))F\left\lbrack x\right\rbrack /\left( {{a}_{i}\left( x\right) }\right) 在同构(1)下对应的元素。根据定义,线性变换 TT 通过 ai(x){a}_{i}\left( x\right) 的伴随矩阵作用于 Bi{\mathcal{B}}_{i},因为我们已经看到这是乘以 xx 的作用方式。所有 Bi{\mathcal{B}}_{i} 的并集给出了 VV 的一个基,因为(1)右侧的和是直接的,并且对于这个基,线性变换 TT 的矩阵是伴随矩阵的直接和,即,

(Ca1(x)Ca2(x)Cam(x)).\left( \begin{array}{llll} {\mathcal{C}}_{{a}_{1}\left( x\right) } & & & \\ & {\mathcal{C}}_{{a}_{2}\left( x\right) } & & \\ & & \ddots & \\ & & & {\mathcal{C}}_{{a}_{m}\left( x\right) } \end{array}\right) .

(12.2)

Notice that this matrix is uniquely determined from the invariant factors of the F[x]F\left\lbrack x\right\rbrack - module VV and,by Theorem 9,the list of invariant factors uniquely determines the module VV up to isomorphism as an F[x]F\left\lbrack x\right\rbrack -module.

注意,这个矩阵由 F[x]F\left\lbrack x\right\rbrack - 模块的不变因子唯一确定,并且,根据定理9,不变因子的列表唯一地确定了作为 F[x]F\left\lbrack x\right\rbrack - 模块的 VV 的同构。

Definition.

定义。

(1) A matrix is said to be in rational canonical form if it is the direct sum of companion matrices for monic polynomials a1(x),,am(x){a}_{1}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) of degree at least one with a1(x)a2(x)am(x){a}_{1}\left( x\right) \left| {{a}_{2}\left( x\right) }\right| \cdots \mid {a}_{m}\left( x\right) . The polynomials ai(x){a}_{i}\left( x\right) are called the invariant factors of the matrix. Such a matrix is also said to be a block diagonal matrix with blocks the companion matrices for the ai(x){a}_{i}\left( x\right) .

(1)如果一个矩阵是单变量多项式 a1(x),,am(x){a}_{1}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) 的伴随矩阵的直接和,且这些多项式的次数至少为1,并且具有 a1(x)a2(x)am(x){a}_{1}\left( x\right) \left| {{a}_{2}\left( x\right) }\right| \cdots \mid {a}_{m}\left( x\right) ,则称这个矩阵为有理标准形。这些多项式 ai(x){a}_{i}\left( x\right) 被称为矩阵的不变因子。这样的矩阵也被称为以 ai(x){a}_{i}\left( x\right) 的伴随矩阵为块的块对角矩阵。

(2) A rational canonical form for a linear transformation TT is a matrix representing TT which is in rational canonical form.

(2)一个线性变换 TT 的有理标准形是一个表示 TT 的矩阵,且该矩阵处于有理标准形。

We have seen that any linear transformation TT has a rational canonical form. We now see that this rational canonical form is unique (hence is called the rational canonical form for TT ). To see this note that the process we used to determine the matrix of TT from the direct sum decomposition is reversible. Suppose b1(x),b2(x),,bt(x){b}_{1}\left( x\right) ,{b}_{2}\left( x\right) ,\ldots ,{b}_{t}\left( x\right) are monic polynomials in F[x]F\left\lbrack x\right\rbrack of degree at least one such that bi(x)bi+1(x){b}_{i}\left( x\right) \mid {b}_{i + 1}\left( x\right) for all ii and suppose for some basis E\mathcal{E} of VV ,that the matrix of TT with respect to the basis E\mathcal{E} is the direct sum of the companion matrices of the bi(x){b}_{i}\left( x\right) . Then VV must be a direct sum of TT -stable subspaces Di{D}_{i} ,one for each bi(x){b}_{i}\left( x\right) in such a way that the matrix of TT on each Di{D}_{i} is the companion matrix of bi(x){b}_{i}\left( x\right) . Let Ei{\mathcal{E}}_{i} be the corresponding (ordered) basis of Di{D}_{i} (so E\mathcal{E} is the union of the Ei{\mathcal{E}}_{i} ) and let ei{e}_{i} be the first basis element in Ei{\mathcal{E}}_{i} . Then it is easy to see that Di{D}_{i} is a cyclic F[x]F\left\lbrack x\right\rbrack -module with generator ei{e}_{i} and that the annihilator of Di{D}_{i} is bi(x){b}_{i}\left( x\right) . Thus the torsion F[x]F\left\lbrack x\right\rbrack -module VV decomposes into a direct sum of cyclic F[x]F\left\lbrack x\right\rbrack -modules in two ways, both of which satisfy the conditions of Theorem 5, i.e., both of which give lists of invariant factors. Since the invariant factors are unique by Theorem 9, ai(x){a}_{i}\left( x\right) and bi(x){b}_{i}\left( x\right) must differ by a unit factor in F[x]F\left\lbrack x\right\rbrack and since the polynomials are monic by assumption,we must have ai(x)=bi(x){a}_{i}\left( x\right) = {b}_{i}\left( x\right) for all ii . This proves the following result:

我们已经看到,任何线性变换 TT 都有一个有理标准形。现在我们知道这个有理标准形是唯一的(因此被称为 TT 的有理标准形)。为了证明这一点,请注意我们用来从直接和分解确定 TT 的矩阵的过程是可逆的。假设 b1(x),b2(x),,bt(x){b}_{1}\left( x\right) ,{b}_{2}\left( x\right) ,\ldots ,{b}_{t}\left( x\right)F[x]F\left\lbrack x\right\rbrack 中的首一多项式,且次数至少为一个,使得对于所有 ii 都有 bi(x)bi+1(x){b}_{i}\left( x\right) \mid {b}_{i + 1}\left( x\right),并且假设对于某个 VV 的基 E\mathcal{E}TT 关于基 E\mathcal{E} 的矩阵是 bi(x){b}_{i}\left( x\right) 的伴随矩阵的直接和。那么 VV 必须是 TT -稳定子空间的直接和,每个 bi(x){b}_{i}\left( x\right) 有一个,使得 TT 在每个 Di{D}_{i} 上的矩阵是 bi(x){b}_{i}\left( x\right) 的伴随矩阵。设 Ei{\mathcal{E}}_{i}Di{D}_{i} 的相应(有序)基(因此 E\mathcal{E}Ei{\mathcal{E}}_{i} 的并集),设 ei{e}_{i}Ei{\mathcal{E}}_{i} 中的第一个基元素。那么很容易看出 Di{D}_{i} 是一个循环 F[x]F\left\lbrack x\right\rbrack -模,生成元为 ei{e}_{i},并且 Di{D}_{i} 的消元子为 bi(x){b}_{i}\left( x\right)。因此,挠 F[x]F\left\lbrack x\right\rbrack -模 VV 以两种方式分解为循环 F[x]F\left\lbrack x\right\rbrack -模的直接和,这两种方式都满足定理 5 的条件,即都给出不变因子的列表。由于定理 9 证明了不变因子是唯一的,ai(x){a}_{i}\left( x\right)bi(x){b}_{i}\left( x\right) 必须在 F[x]F\left\lbrack x\right\rbrack 中相差一个单位因子,并且由于假设多项式是首一的,我们必须对于所有 iiai(x)=bi(x){a}_{i}\left( x\right) = {b}_{i}\left( x\right)。这证明了以下结果:

Theorem 14. (Rational Canonical Form for Linear Transformations) Let VV be a finite dimensional vector space over the field FF and let TT be a linear transformation of VV .

定理 14.(线性变换的有理标准形)设 VV 是定义在域 FF 上的有限维向量空间,设 TTVV 的一个线性变换。

(1) There is a basis for VV with respect to which the matrix for TT is in rational canonical form, i.e., is a block diagonal matrix whose diagonal blocks are the companion matrices for monic polynomials a1(x),a2(x),,am(x){a}_{1}\left( x\right) ,{a}_{2}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) of degree at least one with a1(x)a2(x)am(x){a}_{1}\left( x\right) \left| {{a}_{2}\left( x\right) }\right| \cdots \mid {a}_{m}\left( x\right) .

(1)存在 VV 的一个基,对于该基,TT 的矩阵是有理标准形,即是一个对角块矩阵,其对角块是首一多项式 a1(x),a2(x),,am(x){a}_{1}\left( x\right) ,{a}_{2}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) 的伴随矩阵,其次数至少为 1 并且 a1(x)a2(x)am(x){a}_{1}\left( x\right) \left| {{a}_{2}\left( x\right) }\right| \cdots \mid {a}_{m}\left( x\right)

(2) The rational canonical form for TT is unique.

(2)TT 的有理标准形是唯一的。

The use of the word rational is to indicate that this canonical form is calculated entirely within the field FF and exists for any linear transformation TT . This is not the case for the Jordan canonical form (considered later),which only exists if the field FF contains the eigenvalues for TT (cf. also the remarks following Corollary 18).

使用“有理”一词是为了表明这种标准形完全在域 FF 内计算,并且对于任何线性变换 TT 都存在。这与约当标准形(稍后讨论)的情况不同,后者仅在域 FF 包含 TT 的特征值时存在(参见 also the remarks following Corollary 18)。

The following result translates the notion of similar linear transformations (i.e., the same linear transformation up to a change of basis) into the language of modules and relates this notion to rational canonical forms.

下面的结果将相似线性变换(即,在基变换下的相同线性变换)的概念转化为模块的语言,并将此概念与有理标准形联系起来。

Theorem 15. Let SS and TT be linear transformations of VV . Then the following are equivalent:

定理 15. 设 SSTTVV 的线性变换。那么以下条件是等价的:

(1) SS and TT are similar linear transformations

(1)SSTT 是相似的线性变换

(2) the F[x]F\left\lbrack x\right\rbrack -modules obtained from VV via SS and via TT are isomorphic F[x]F\left\lbrack x\right\rbrack - modules

(2)通过 VV 和通过 SSF[x]F\left\lbrack x\right\rbrack 获得的 F[x]F\left\lbrack x\right\rbrack -模块与 TT 是同构的 F[x]F\left\lbrack x\right\rbrack -模块

(3) SS and TT have the same rational canonical form.

(3)SSTT 具有相同的有理标准形。

Proof: [(1) implies (2)] Assume there is a nonsingular linear transformation UU such that S=UTU1S = {UT}{U}^{-1} . The vector space isomorphism U:VVU : V \rightarrow V is also an F[x]F\left\lbrack x\right\rbrack -module homomorphism,where xx acts on the first VV via TT and on the second via SS ,since for example U(xv)=U(Tv)=UT(v)=SU(v)=x(Uv)U\left( {xv}\right) = U\left( {Tv}\right) = {UT}\left( v\right) = {SU}\left( v\right) = x\left( {Uv}\right) . Hence this is an F[x]F\left\lbrack x\right\rbrack -module isomorphism of the two modules in (2).

证明:[(1) 蕴含 (2)] 假设存在一个非奇异线性变换 UU 使得 S=UTU1S = {UT}{U}^{-1} 。该向量空间同构 U:VVU : V \rightarrow V 同时也是一个 F[x]F\left\lbrack x\right\rbrack -模同态,其中 xx 通过 VV 作用于第一个分量,通过 TT 作用于第二个分量,因为例如 SS 。因此,这是 (2) 中两个模的同构。

[(2) implies (3)] Assume (2) holds and denote by V1{V}_{1} the vector space VV made into an F[x]F\left\lbrack x\right\rbrack -module via SS and denote by V2{V}_{2} the space VV made into an F[x]F\left\lbrack x\right\rbrack -module via TT . Since V1V2{V}_{1} \cong {V}_{2} as F[x]F\left\lbrack x\right\rbrack -modules they have the same list of invariant factors. Thus SS and TT have a common rational canonical form.

[(2) 蕴含 (3)] 假设 (2) 成立,并记 V1{V}_{1} 为通过 SS 使向量空间 VV 成为 F[x]F\left\lbrack x\right\rbrack -模,记 V2{V}_{2} 为通过 TT 使空间 VV 成为 F[x]F\left\lbrack x\right\rbrack -模。由于 V1V2{V}_{1} \cong {V}_{2} 作为 F[x]F\left\lbrack x\right\rbrack -模,它们具有相同的不变因子列表。因此 SSTT 有一个共同的理性标准形。

[(3) implies (1)] Assume (3) holds. Since SS and TT have the same matrix representation with respect to some choice of (possibly different) bases of VV by assumption, they are,up to a change of basis,the same linear transformation of VV ,hence are similar.

[(3) 蕴含 (1)] 假设 (3) 成立。由于根据假设,SSTT 在某些(可能是不同的)VV 基底下的矩阵表示相同,它们在基底变换下是相同的 VV 线性变换,因此是相似的。

Let AA be any n×nn \times n matrix with entries from FF . Let VV be an nn -dimensional vector space over FF . Recall we can then define a linear transformation TT on VV by choosing a basis for VV and setting T(v)=AvT\left( v\right) = {Av} where vv on the right hand side means the n×1n \times 1 column vector of coordinates of vv with respect to our chosen basis (this is just the usual identification of linear transformations with matrices). Then (of course) the matrix for this TT with respect to this basis is the given matrix AA . Put another way,any n×nn \times n matrix AA with entries from the field FF arises as the matrix for some linear transformation TT of an nn -dimensional vector space.

AA 为任意 n×nn \times n 矩阵,其元素来自 FF 。设 VVnn 维的 FF 上的向量空间。回顾我们可以在 VV 上定义一个线性变换 TT ,通过为 VV 选择一个基并设置 T(v)=AvT\left( v\right) = {Av} ,其中 vv 右侧的 n×1n \times 1 表示 vv 关于我们选择的基的坐标的 n×1n \times 1 列向量(这只是线性变换与矩阵的通常识别)。那么(当然)这个基下 TT 的矩阵就是给定的矩阵 AA 。换句话说,任何来自域 FFn×nn \times n 矩阵 AA 都可以作为某个 nn 维向量空间的线性变换 TT 的矩阵。

This dictionary between linear transformations of vector spaces and matrices allows us to state our previous two results in the language of matrices:

这个线性变换的向量空间和矩阵之间的字典允许我们用矩阵的语言陈述我们之前得到的结果:

Theorem 16. (Rational Canonical Form for Matrices) Let AA be an n×nn \times n matrix over the field FF .

定理 16。(矩阵的有理标准形)设 AAFF 上的 n×nn \times n 矩阵。

(1) The matrix AA is similar to a matrix in rational canonical form,i.e.,there is an invertible n×nn \times n matrix PP over FF such that P1AP{P}^{-1}{AP} is a block diagonal matrix whose diagonal blocks are the companion matrices for monic polynomials a1(x),a2(x),,am(x){a}_{1}\left( x\right) ,{a}_{2}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) of degree at least one with a1(x)a2(x)am(x).{a}_{1}\left( x\right) \left| {{a}_{2}\left( x\right) }\right| \cdots \mid {a}_{m}\left( x\right) .

(1)矩阵 AA 与有理标准形的一个矩阵相似,即存在一个可逆的 n×nn \times n 矩阵 PPFF 上,使得 P1AP{P}^{-1}{AP} 是一个块对角矩阵,其对角块是单变量多项式 a1(x),a2(x),,am(x){a}_{1}\left( x\right) ,{a}_{2}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) 的伴随矩阵,其次数至少为一个,并且 a1(x)a2(x)am(x).{a}_{1}\left( x\right) \left| {{a}_{2}\left( x\right) }\right| \cdots \mid {a}_{m}\left( x\right) .

(2) The rational canonical form for AA is unique.

(2)AA 的有理标准形是唯一的。

Definition. The invariant factors of an n×nn \times n matrix over a field FF are the invariant factors of its rational canonical form.

定义。一个 n×nn \times n 矩阵在域 FF 上的不变因子是其有理标准形的不变因子。

Theorem 17. Let AA and BB be n×nn \times n matrices over the field FF . Then AA and BB are similar if and only if AA and BB have the same rational canonical form.

定理 17。设 AABB 是定义在域 FF 上的 n×nn \times n 矩阵。那么 AABB 相似当且仅当它们具有相同的理性标准形。

If AA is a matrix with entries from a field FF and FF is a subfield of a larger field KK then we may also consider AA as a matrix over KK . The next result shows that the rational canonical form for AA and questions of similarity do not depend on which field contains the entries of AA .

如果 AA 是一个元素来自域 FF 的矩阵,且 FF 是更大域 KK 的子域,那么我们也可以将 AA 视为定义在 KK 上的矩阵。下一个结果说明,AA 的理性标准形以及相似性问题不依赖于包含 AA 元素的域。

Corollary 18. Let AA and BB be two n×nn \times n matrices over a field FF and suppose FF is a subfield of the field KK .

推论 18。设 AABB 是定义在域 FF 上的两个 n×nn \times n 矩阵,并假设 FF 是域 KK 的子域。

(1) The rational canonical form of AA is the same whether it is computed over KK or over FF . The minimal and characteristic polynomials and the invariant factors of AA are the same whether AA is considered as a matrix over FF or as a matrix over KK .

(1)无论在 KK 上还是在 FF 上计算,AA 的理性标准形都是相同的。最小多项式和特征多项式以及 AA 的不变因子,无论将 AA 视为在 FF 上的矩阵还是 KK 上的矩阵,都是相同的。

(2) The matrices AA and BB are similar over KK if and only if they are similar over FF ,i.e.,there exists an invertible n×nn \times n matrix PP with entries from KK such that B=P1APB = {P}^{-1}{AP} if and only if there exists an (in general different) invertible n×nn \times n matrix QQ with entries from FF such that B=Q1AQB = {Q}^{-1}{AQ} .

(2)如果矩阵 AABBKK 上相似,当且仅当它们在 FF 上相似,即存在一个可逆的 n×nn \times n 矩阵 PP 其元素来自 KK 使得 B=P1APB = {P}^{-1}{AP} 当且仅当存在一个(通常不同)的可逆的 n×nn \times n 矩阵 QQ 其元素来自 FF 使得 B=Q1AQB = {Q}^{-1}{AQ}

Proof: (1) Let MM be the rational canonical form of AA when computed over the smaller field FF . Since MM satisfies the conditions in the definition of the rational canonical form over KK ,the uniqueness of the rational canonical form implies that MM is also the rational canonical form of AA over KK . Hence the invariant factors of AA are the same whether AA is viewed over FF or over KK . In particular,since the minimal polynomial is the largest invariant factor of AA it also does not depend on the field over which AA is viewed. It is clear from the determinant definition of the characteristic polynomial of AA that this polynomial depends only on the entries of AA (we shall see shortly that the characteristic polynomial is the product of all the invariant factors for AA ,which will give an alternate proof of this result).

证明:(1) 设 MM 是当在较小的场 FF 上计算时 AA 的有理标准形。由于 MM 满足在有理标准形的定义中关于 KK 的条件,有理标准形的唯一性意味着 MM 也是 AAKK 上的有理标准形。因此,AA 的不变因子无论是在 FF 上还是在 KK 上都是相同的。特别是,由于最小多项式是 AA 的最大不变因子,它也不依赖于 AA 所在的场。从 AA 的特征多项式的行列式定义中可以清楚地看出,这个多项式只依赖于 AA 的元素(我们很快将看到特征多项式是所有不变因子的乘积,这将为这个结果提供一个替代的证明)。

(2) If AA and BB are similar over the smaller field FF they are clearly similar over KK . Conversely,if AA and BB are similar over KK ,they have the same rational canonical form over KK . By (1) they have the same rational canonical form over FF ,hence are similar over FF by Theorem 17.

(2) 如果 AABB 在较小的场 FF 上相似,那么它们显然在 KK 上也相似。反之,如果 AABBKK 上相似,那么它们在 KK 上具有相同的有理标准形。由 (1) 可知,它们在 FF 上也具有相同的有理标准形,因此根据定理 17,它们在 FF 上是相似的。

This corollary asserts in particular that the rational canonical form for an n×nn \times n matrix AA is an n×nn \times n matrix with entries in the smallest field containing the entries of AA . Further,this canonical form is the same matrix even if we allow conjugation of AA by nonsingular matrices whose entries come from larger fields. This explains the terminology of rational canonical form.

这个推论特别指出,对于 n×nn \times n 矩阵 AA 的有理标准形是一个元素在包含 AA 元素的最小场中的 n×nn \times n 矩阵。进一步,即使我们允许通过来自较大场的非奇异矩阵对 AA 进行共轭,这个标准形仍然是相同的矩阵。这就解释了为什么使用有理标准形这个术语。

The next proposition gives the connection between the characteristic polynomial of a matrix (or of a linear transformation) and its invariant factors and is quite useful for determining these invariant factors (particularly for matrices of small size).

下一个命题给出了矩阵(或线性变换)的特征多项式与其不变因子之间的联系,这对于确定这些不变因子(特别是对于小尺寸矩阵)非常有用。

Lemma 19. Let a(x)F[x]a\left( x\right) \in F\left\lbrack x\right\rbrack be any monic polynomial.

引理19. 设 a(x)F[x]a\left( x\right) \in F\left\lbrack x\right\rbrack 为任意首一多项式。

(1) The characteristic polynomial of the companion matrix of a(x)a\left( x\right) is a(x)a\left( x\right) .

(1) a(x)a\left( x\right) 的伴随矩阵的特征多项式是 a(x)a\left( x\right)

(2) If MM is the block diagonal matrix

(2) 如果 MM 是分块对角矩阵

M=(A1000A2000Ak),M = \left( \begin{matrix} {A}_{1} & 0 & \ldots & 0 \\ 0 & {A}_{2} & \ldots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \ldots & {A}_{k} \end{matrix}\right) ,

given by the direct sum of matrices A1,A2,,Ak{A}_{1},{A}_{2},\ldots ,{A}_{k} then the characteristic polynomial of MM is the product of the characteristic polynomials of A1,A2,,Ak{A}_{1},{A}_{2},\ldots ,{A}_{k} .

由矩阵 A1,A2,,Ak{A}_{1},{A}_{2},\ldots ,{A}_{k} 的直接和给定,那么 MM 的特征多项式是 A1,A2,,Ak{A}_{1},{A}_{2},\ldots ,{A}_{k} 的特征多项式的乘积。

Proof: These are both straightforward exercises.

证明:这两个都是直接的练习。

Proposition 20. Let AA be an n×nn \times n matrix over the field FF .

命题20. 设 AA 为定义在域 FF 上的 n×nn \times n 矩阵。

(1) The characteristic polynomial of AA is the product of all the invariant factors of A.

(1) AA 的特征多项式是 A 的所有不变因子的乘积。

(2) (The Cayley-Hamilton Theorem) The minimal polynomial of AA divides the characteristic polynomial of AA .

(2) (凯莱-哈密顿定理)AA 的最小多项式整除 AA 的特征多项式。

(3) The characteristic polynomial of AA divides some power of the minimal polynomial of AA . In particular these polynomials have the same roots,not counting multiplicities.

(3) AA 的特征多项式整除 AA 的最小多项式的某个幂。特别地,这些多项式具有相同的根,不计重数。

The same statements are true if the matrix AA is replaced by a linear transformation TT of an nn -dimensional vector space over FF .

如果将矩阵 AA 替换为 FFnn 维向量空间的线性变换 TT,上述结论同样成立。

Proof: Let BB be the rational canonical form of AA . By the previous lemma the block diagonal form of BB shows that the characteristic polynomial of BB is the product of the characteristic polynomials of the companion matrices of the invariant factors of AA . By the first part of the lemma above, the characteristic polynomial of the companion matrix Ca(x){\mathcal{C}}_{a\left( x\right) } for a(x)a\left( x\right) is just a(x)a\left( x\right) ,which implies that the characteristic polynomial for BB is the product of the invariant factors of AA . Since AA and BB are similar,they have the same characteristic polynomial, which proves (1). Assertion (2) is immediate from (1) since the minimal polynomial for AA is the largest invariant factor of AA . The fact that all the invariant factors divide the largest one immediately implies (3). The final assertion is clear from the dictionary between linear transformations of vector spaces and matrices.

证明:设 BBAA 的有理标准形。根据之前的引理,BB 的块对角形式表明 BB 的特征多项式是 AA 的不变因子的伴随矩阵的特征多项式的乘积。由上述引理的第一部分,a(x)a\left( x\right) 的伴随矩阵 Ca(x){\mathcal{C}}_{a\left( x\right) } 的特征多项式就是 a(x)a\left( x\right),这意味着 BB 的特征多项式是 AA 的不变因子的乘积。由于 AABB 相似,它们具有相同的特征多项式,这证明了 (1)。从 (1) 直接得出断言 (2),因为 AA 的最小多项式是 AA 的最大不变因子。所有不变因子都能整除最大不变因子这一事实立即意味着 (3)。最后的断言从线性变换的向量空间与矩阵之间的字典关系可以清楚地看出。

Note that part (2) of the proposition is the assertion that the matrix AA satisfies its own characteristic polynomial,i.e., cA(A)=0{c}_{A}\left( A\right) = 0 as matrices,which is the usual formulation for the Cayley-Hamilton Theorem. Note also that it implies the degree of the minimal polynomial for AA has degree at most nn ,a result mentioned before.

注意,命题的第 (2) 部分是断言矩阵 AA 满足其自身的特征多项式,即 cA(A)=0{c}_{A}\left( A\right) = 0 作为矩阵,这是凯莱-哈密顿定理的通常表述。还应注意,它暗示了 AA 的最小多项式的次数至多为 nn,这是一个之前提到过的结果。

The relations in Proposition 20 are frequently quite useful in the determination of the invariant factors for a matrix AA ,particularly for matrices of small degree (cf. Exercises 3 and 4 and the examples). The following result (which relies on Exercises 16 to 19 in the previous section and whose proof we outline in the exercises) computes the invariant factors in general.

命题 20 中的关系在确定矩阵 AA 的不变因子时经常非常有用,特别是对于小度数的矩阵(参见练习 3 和 4 以及示例)。以下结果(依赖于上一节的练习 16 到 19,并且其证明我们在练习中概述)计算了一般情况下的不变因子。

Let AA be an n×nn \times n matrix over the field FF . Then xIA{xI} - A is an n×nn \times n matrix with entries in F[x]F\left\lbrack x\right\rbrack . The three operations

AA 是定义在域 FF 上的一个 n×nn \times n 矩阵。那么 xIA{xI} - A 是一个其元素属于 F[x]F\left\lbrack x\right\rbrackn×nn \times n 矩阵。以下三种操作:

(a) interchanging two rows or columns

(a) 交换两行或两列

(b) adding a multiple (in F[x]F\left\lbrack x\right\rbrack ) of one row or column to another

(b) 将一行或一列的倍数(在 F[x]F\left\lbrack x\right\rbrack 中)加到另一行或另一列上

(c) multiplying any row or column by a unit in F[x]F\left\lbrack x\right\rbrack ,i.e.,by a nonzero element in FF , are called elementary row and column operations.

(c) 将任何一行或一列乘以 F[x]F\left\lbrack x\right\rbrack 中的单位元,即乘以 FF 中的非零元素,这些操作被称为基本的行和列变换。

Theorem 21. Let AA be an n×nn \times n matrix over the field FF . Using the three elementary row and column operations above,the n×nn \times n matrix xIA{xI} - A with entries from F[x]F\left\lbrack x\right\rbrack can be put into the diagonal form (called the Smith Normal Form for AA )

定理21。设 AA 是定义在域 FF 上的一个 n×nn \times n 矩阵。使用上述三种基本的行和列变换,可以将具有来自 F[x]F\left\lbrack x\right\rbrack 的元素的 n×nn \times n 矩阵 xIA{xI} - A 转化为对角形式(称为 AA 的 Smith 标准形)

(11a1(x)a2(x)an(x))\left( \begin{matrix} 1 & & & & & & \\ & \ddots & & & & & \\ & & 1 & & {a}_{1}\left( x\right) & & \\ & & & & {a}_{2}\left( x\right) & & \\ & & & & & \ddots & \\ & & & & & & {a}_{n}\left( x\right) \end{matrix}\right)

with monic nonzero elements a1(x),a2(x),,am(x){a}_{1}\left( x\right) ,{a}_{2}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) of F[x]F\left\lbrack x\right\rbrack with degrees at least one and satisfying a1(x)a2(x)am(x){a}_{1}\left( x\right) \left| {{a}_{2}\left( x\right) }\right| \cdots \mid {a}_{m}\left( x\right) . The elements a1(x),,am(x){a}_{1}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) are the invariant factors of AA .

其中 a1(x),a2(x),,am(x){a}_{1}\left( x\right) ,{a}_{2}\left( x\right) ,\ldots ,{a}_{m}\left( x\right)F[x]F\left\lbrack x\right\rbrack 的首一非零元素,其次数至少为1,并满足 a1(x)a2(x)am(x){a}_{1}\left( x\right) \left| {{a}_{2}\left( x\right) }\right| \cdots \mid {a}_{m}\left( x\right) 。元素 a1(x),,am(x){a}_{1}\left( x\right) ,\ldots ,{a}_{m}\left( x\right)AA 的不变因子。

Proof: cf. the exercises.

证明:参见练习。

Invariant Factor Decomposition Algorithm: Converting to Rational Canonical Form

不变因子分解算法:转换为有理标准形

As mentioned in the exercises near the end of the previous section, keeping track of the operations necessary to diagonalize xIA{xI} - A will explicitly give a matrix PP such that P1AP{P}^{-1}{AP} is in rational canonical form. Equivalently,if VV is a given F[x]F\left\lbrack x\right\rbrack -module with vector space basis [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack ,then PP defines the change of basis giving the Invariant Factor Decomposition of VV into a direct sum of cyclic F[x]F\left\lbrack x\right\rbrack -modules. In particular,if AA is the matrix of the linear transformation TT of the F[x]F\left\lbrack x\right\rbrack -module VV defined by xx (i.e., T(ej)=xej=i=1naijeiT\left( {e}_{j}\right) = x{e}_{j} = \mathop{\sum }\limits_{{i = 1}}^{n}{a}_{ij}{e}_{i} where A=(aij))\left. {A = \left( {a}_{ij}\right) }\right) ,then the matrix PP defines the change of basis for VV with respect to which the matrix for TT is in rational canonical form.

如前一部分末尾的练习中所述,跟踪对角化 xIA{xI} - A 所需的操作将明确给出一个矩阵 PP,使得 P1AP{P}^{-1}{AP} 处于有理标准形。等价地,如果 VV 是一个给定的 F[x]F\left\lbrack x\right\rbrack -模,其向量空间基为 [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack,那么 PP 定义了基变换,给出 VV 的不变因子分解,将其分解为循环 F[x]F\left\lbrack x\right\rbrack -模的直接和。特别是,如果 AA 是由 xx 定义的 F[x]F\left\lbrack x\right\rbrack -模 VV 的线性变换 TT 的矩阵(即 T(ej)=xej=i=1naijeiT\left( {e}_{j}\right) = x{e}_{j} = \mathop{\sum }\limits_{{i = 1}}^{n}{a}_{ij}{e}_{i} 其中 A=(aij))\left. {A = \left( {a}_{ij}\right) }\right)),那么矩阵 PP 定义了相对于 VV 的基变换,该变换使得 TT 的矩阵处于有理标准形。

We first describe the algorithm in the general context of determining the Invariant Factor Decomposition of a given F[x]F\left\lbrack x\right\rbrack -module VV with vector space basis [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack (the proof is outlined in the exercises). We then describe the algorithm to convert a given n×nn \times n matrix AA to rational canonical form (in which reference to an underlying vector space and associated linear transformation are suppressed).

我们首先在确定给定 F[x]F\left\lbrack x\right\rbrack -模 VV 的不变因子分解的一般背景下描述算法(证明大纲在练习中给出)。然后我们描述将给定的 n×nn \times n 矩阵 AA 转换为有理标准形的算法(其中省略了基础向量空间和关联线性变换的引用)。

Explicit numerical examples of this algorithm are given in Examples 2 and 3 following.

本算法的具体数值示例在接下来的示例 2 和 3 中给出。

Invariant Factor Decomposition Algorithm

不变因子分解算法

Let VV be an F[x]F\left\lbrack x\right\rbrack -module with vector space basis [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack (so in particular these elements are generators for VV as an F[x]F\left\lbrack x\right\rbrack -module). Let TT be the linear transformation of VV to itself defined by xx and let AA be the n×nn \times n matrix associated to TT and this choice of basis for VV ,i.e.,

VV 是一个 F[x]F\left\lbrack x\right\rbrack -模,其向量空间基为 [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack(特别地,这些元素作为 F[x]F\left\lbrack x\right\rbrack -模的生成元)。设 TT 是从 VV 到其自身的线性变换,定义为 xx,设 AA 是与 TT 相关的 n×nn \times n 矩阵,以及 VV 的基的选择,即,

T(ej)=xej=i=1naijei   where   A=(aij).T\left( {e}_{j}\right) = x{e}_{j} = \mathop{\sum }\limits_{{i = 1}}^{n}{a}_{ij}{e}_{i}\;\text{ where }\;A = \left( {a}_{ij}\right) .

(1) Use the following three elementary row and column operations to diagonalize the matrix xIA{xI} - A over F[x]F\left\lbrack x\right\rbrack ,keeping track of the row operations used:

(1)使用以下三个基本的行和列变换来对矩阵 xIA{xI} - A 进行对角化,保持对行变换的使用进行跟踪:

(a) interchange two rows or columns (which will be denoted by RiRj{R}_{i} \leftrightarrow {R}_{j} for the interchange of the ith {i}^{\text{th }} and jth {j}^{\text{th }} rows and similarly by CiCj{C}_{i} \leftrightarrow {C}_{j} for columns),

(a)交换两行或两列(交换第 ith {i}^{\text{th }} 行和第 jth {j}^{\text{th }} 行将用 RiRj{R}_{i} \leftrightarrow {R}_{j} 表示,类似地,交换列用 CiCj{C}_{i} \leftrightarrow {C}_{j} 表示),

(b) add a multiple (in F[x]F\left\lbrack x\right\rbrack ) of one row or column to another (which will be denoted by Ri+p(x)RjRi{R}_{i} + p\left( x\right) {R}_{j} \mapsto {R}_{i} if p(x)p\left( x\right) times the jth {j}^{\text{th }} row is added to the ith {i}^{\text{th }} row,and similarly by Ci+p(x)CjCi{C}_{i} + p\left( x\right) {C}_{j} \mapsto {C}_{i} for columns),

(b)将一行或一列的倍数(在 F[x]F\left\lbrack x\right\rbrack 中)加到另一行或另一列上(如果将 p(x)p\left( x\right) 倍的 jth {j}^{\text{th }} 行加到第 ith {i}^{\text{th }} 行上,则用 Ri+p(x)RjRi{R}_{i} + p\left( x\right) {R}_{j} \mapsto {R}_{i} 表示,类似地,对于列用 Ci+p(x)CjCi{C}_{i} + p\left( x\right) {C}_{j} \mapsto {C}_{i} 表示),

(c) multiply any row or column by a unit in F[x]F\left\lbrack x\right\rbrack ,i.e.,by a nonzero element in FF (which will be denoted by uRiu{R}_{i} if the ith {i}^{\text{th }} row is multiplied by uF×u \in {F}^{ \times } ,and similarly by uCiu{C}_{i} for columns).

(c)将任何行或列乘以 F[x]F\left\lbrack x\right\rbrack 中的单位元,即 FF 中的非零元素(如果第 ith {i}^{\text{th }} 行乘以 uF×u \in {F}^{ \times },则用 uRiu{R}_{i} 表示,类似地,对于列用 uCiu{C}_{i} 表示)。

(2) Beginning with the F[x]F\left\lbrack x\right\rbrack -module generators [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack ,for each row operation used in (1), change the set of generators by the following rules:

(2)从 F[x]F\left\lbrack x\right\rbrack -模生成元 [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack 开始,对于(1)中使用的每一行变换,按照以下规则改变生成元的集合:

(a) If the ith {i}^{\text{th }} row is interchanged with the jth {j}^{\text{th }} row then interchange the ith {i}^{\text{th }} and jth {j}^{\text{th }} generators.

(a)如果第 ith {i}^{\text{th }} 行与第 jth {j}^{\text{th }} 行交换,则交换 ith {i}^{\text{th }}jth {j}^{\text{th }} 生成元。

(b) If p(x)p\left( x\right) times the jth {j}^{\text{th }} row is added to the ith {i}^{\text{th }} row then subtract p(x)p\left( x\right) times the ith {i}^{\text{th }} generator from the jth {j}^{\text{th }} generator (note the indices). (c) If the ith {i}^{\text{th }} row is multiplied by the unit uFu \in F then divide the ith {i}^{\text{th }} generator by uu .

(b) 如果将 p(x)p\left( x\right) 乘以 jth {j}^{\text{th }} 行加到 ith {i}^{\text{th }} 行,然后从 jth {j}^{\text{th }} 生成器中减去 p(x)p\left( x\right) 乘以 ith {i}^{\text{th }} 生成器(注意索引)。(c) 如果 ith {i}^{\text{th }} 行乘以单位 uFu \in F,则将 ith {i}^{\text{th }} 生成器除以 uu

(3) When xIA{xI} - A has been diagonalized to the form in Theorem 21 the generators [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack for VV will be in the form of F[x]F\left\lbrack x\right\rbrack -linear combinations of e1,e2,,en{e}_{1},{e}_{2},\ldots ,{e}_{n} . Use xej=T(ej)=i=1naijeix{e}_{j} = T\left( {e}_{j}\right) = \mathop{\sum }\limits_{{i = 1}}^{n}{a}_{ij}{e}_{i} to write these elements as FF - linear combinations of e1,e2,,en{e}_{1},{e}_{2},\ldots ,{e}_{n} . When xIA{xI} - A has been diagonalized,the first nmn - m of these linear combinations are 0 (providing a useful numerical check on the computations) and the remaining mm linear combinations are nonzero,i.e., the generators for VV are in the form [0,,0,f1,,fm]\left\lbrack {0,\ldots ,0,{f}_{1},\ldots ,{f}_{m}}\right\rbrack corresponding precisely to the diagonal elements in Theorem 21. The elements f1,,fm{f}_{1},\ldots ,{f}_{m} are a set of F[x]F\left\lbrack x\right\rbrack -module generators for the cyclic factors in the invariant factor decomposition of VV (with annihilators (a1(x)),,(am(x))\left( {{a}_{1}\left( x\right) }\right) ,\ldots ,\left( {{a}_{m}\left( x\right) }\right) ,respectively):

(3) 当 xIA{xI} - A 已经对角化为定理21中的形式时,VV 的生成器 [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack 将是 F[x]F\left\lbrack x\right\rbrack -线性组合的形式,由 e1,e2,,en{e}_{1},{e}_{2},\ldots ,{e}_{n} 组成。使用 xej=T(ej)=i=1naijeix{e}_{j} = T\left( {e}_{j}\right) = \mathop{\sum }\limits_{{i = 1}}^{n}{a}_{ij}{e}_{i} 将这些元素写成 FF -线性组合的形式。当 xIA{xI} - A 已经对角化时,这些线性组合中的前 nmn - m 个为0(为计算提供了一个有用的数值校验),其余的 mm 线性组合不为零,即 VV 的生成器是 [0,,0,f1,,fm]\left\lbrack {0,\ldots ,0,{f}_{1},\ldots ,{f}_{m}}\right\rbrack 的形式,精确对应定理21中的对角元素。元素 f1,,fm{f}_{1},\ldots ,{f}_{m}F[x]F\left\lbrack x\right\rbrack -模生成器集合,对应于 VV 的不变因子分解中的循环因子(分别具有消灭子 (a1(x)),,(am(x))\left( {{a}_{1}\left( x\right) }\right) ,\ldots ,\left( {{a}_{m}\left( x\right) }\right)):

V=F[x]f1F[x]f2F[x]fmV = F\left\lbrack x\right\rbrack {f}_{1} \oplus F\left\lbrack x\right\rbrack {f}_{2} \oplus \ldots \oplus F\left\lbrack x\right\rbrack {f}_{m}
F[x]fiF[x]/(ai(x))  i=1,2,,m,F\left\lbrack x\right\rbrack {f}_{i} \cong F\left\lbrack x\right\rbrack /\left( {{a}_{i}\left( x\right) }\right) \;i = 1,2,\ldots ,m,

giving the Invariant Factor Decomposition of the F[x]F\left\lbrack x\right\rbrack -module VV .

给出了 F[x]F\left\lbrack x\right\rbrack -模 VV 的不变因子分解。

(4) The corresponding vector space basis for each cyclic factor of VV is then given by the elements fi,Tfi,T2fi,,Tdegai(x)1fi{f}_{i},T{f}_{i},{T}^{2}{f}_{i},\ldots ,{T}^{\deg {a}_{i}\left( x\right) - 1}{f}_{i} .

(4) 每个 VV 的循环因子的对应向量空间基由元素 fi,Tfi,T2fi,,Tdegai(x)1fi{f}_{i},T{f}_{i},{T}^{2}{f}_{i},\ldots ,{T}^{\deg {a}_{i}\left( x\right) - 1}{f}_{i} 给出。

(5) Write the kth {k}^{\text{th }} element of the vector space basis computed in (4) in terms of the original vector space basis [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack and use the coordinates for the kth{k}^{\text{th}} column of an n×nn \times n matrix PP . Then P1AP{P}^{-1}{AP} is in rational canonical form (with diagonal blocks the companion matrices for the ai(x){a}_{i}\left( x\right) ). This is the matrix for the linear transformation TT with respect to the vector space basis in (4).

(5) 将在(4)中计算得到的向量空间基的kth {k}^{\text{th }}元素表示为原始向量空间基[e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack的形式,并使用kth{k}^{\text{th}}列的坐标作为n×nn \times n矩阵PP的列。那么P1AP{P}^{-1}{AP}就是有理标准形(对角块为ai(x){a}_{i}\left( x\right)的伴随矩阵)。这是线性变换TT相对于(4)中的向量空间基的矩阵。

We now describe the algorithm to convert a given n×nn \times n matrix AA to rational canonical form,i.e.,to determine an n×nn \times n matrix PP so that P1AP{P}^{-1}{AP} is in rational canonical form. This is nothing more than the algorithm above applied to the vector space V=FnV = {F}^{n} of n×1n \times 1 column vectors with standard basis [e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack (where ei{e}_{i} is the column vector with 1 in the ith {i}^{\text{th }} position and 0 ’s elsewhere) and TT is the linear transformation defined by AA and this choice of basis. Explicit reference to this underlying vector space and associated linear transformation are suppressed, so the algorithm is purely matrix theoretic.

我们现在描述一个算法,将给定的n×nn \times n矩阵AA转换为有理标准形,即确定一个n×nn \times n矩阵PP,使得P1AP{P}^{-1}{AP}为有理标准形。这不过是上述算法应用于V=FnV = {F}^{n}的向量空间n×1n \times 1的标准基[e1,e2,,en]\left\lbrack {{e}_{1},{e}_{2},\ldots ,{e}_{n}}\right\rbrack(其中ei{e}_{i}是在ith {i}^{\text{th }}位置为1,其余位置为0的列向量)和由AA以及这个基的选择定义的线性变换TT。对这一底层向量空间和相关的线性变换的明确引用被省略,因此算法纯粹是矩阵理论上的。

Converting an n×nn \times n Matrix to Rational Canonical Form

n×nn \times n矩阵转换为有理标准形

Let AA be an n×nn \times n matrix with entries in the field FF .

AA是一个在字段FF中的n×nn \times n矩阵。

(1) Use the following three elementary row and column operations to diagonalize the matrix xIA{xI} - A over F[x]F\left\lbrack x\right\rbrack ,keeping track of the row operations used:

(1) 使用以下三个基本的行和列操作来对矩阵xIA{xI} - A进行对角化,同时跟踪所使用的行操作:

(a) interchange two rows or columns (which will be denoted by RiRj{R}_{i} \leftrightarrow {R}_{j} for the interchange of the ith {i}^{\text{th }} and jth {j}^{\text{th }} rows and similarly by CiCj{C}_{i} \leftrightarrow {C}_{j} for columns),

(a) 交换两行或两列(交换第ith {i}^{\text{th }}jth {j}^{\text{th }}行将表示为RiRj{R}_{i} \leftrightarrow {R}_{j},类似地,交换列表示为CiCj{C}_{i} \leftrightarrow {C}_{j}),

(b) add a multiple (in F[x]F\left\lbrack x\right\rbrack ) of one row or column to another (which will be denoted by Ri+p(x)RjRi{R}_{i} + p\left( x\right) {R}_{j} \mapsto {R}_{i} if p(x)p\left( x\right) times the jth {j}^{\text{th }} row is added to the ith {i}^{\text{th }} row,and similarly by Ci+p(x)CjCi{C}_{i} + p\left( x\right) {C}_{j} \mapsto {C}_{i} for columns),

(b) 将一行或一列的倍数(在 F[x]F\left\lbrack x\right\rbrack 中)加到另一行或列上(如果将 p(x)p\left( x\right) 倍的 jth {j}^{\text{th }} 行加到 ith {i}^{\text{th }} 行上,则用 Ri+p(x)RjRi{R}_{i} + p\left( x\right) {R}_{j} \mapsto {R}_{i} 表示,类似地,对于列则用 Ci+p(x)CjCi{C}_{i} + p\left( x\right) {C}_{j} \mapsto {C}_{i} 表示),

(c) multiply any row or column by a unit in F[x]F\left\lbrack x\right\rbrack ,i.e.,by a nonzero element in FF (which will be denoted by uRiu{R}_{i} if the ith {i}^{\text{th }} row is multiplied by uF×u \in {F}^{ \times } ,and similarly by uCiu{C}_{i} for columns).

(c) 将任何行或列乘以 F[x]F\left\lbrack x\right\rbrack 中的单位元素,即乘以 FF 中的非零元素(如果将 ith {i}^{\text{th }} 行乘以 uF×u \in {F}^{ \times } ,则用 uRiu{R}_{i} 表示,类似地,对于列则用 uCiu{C}_{i} 表示)。

Define d1,,dm{d}_{1},\ldots ,{d}_{m} to be the degrees of the monic nonconstant polynomials a1(x),,am(x){a}_{1}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) appearing on the diagonal,respectively.

定义 d1,,dm{d}_{1},\ldots ,{d}_{m} 为对角线上出现的首一非零多项式 a1(x),,am(x){a}_{1}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) 的次数。

(2) Beginning with the n×nn \times n identity matrix P{P}^{\prime } ,for each row operation used in (1), change the matrix P{P}^{\prime } by the following rules:

(2) 从 n×nn \times n 单位矩阵 P{P}^{\prime } 开始,对于在 (1) 中使用的每一行操作,按照以下规则改变矩阵 P{P}^{\prime }

(a) If RiRj{R}_{i} \leftrightarrow {R}_{j} then interchange the ith {i}^{\text{th }} and jth {j}^{\text{th }} columns of P{P}^{\prime } (i.e., CiCj{C}_{i} \leftrightarrow {C}_{j} for P)\left. {P}^{\prime }\right) .

(a) 如果 RiRj{R}_{i} \leftrightarrow {R}_{j} ,则交换 P{P}^{\prime } 中的 ith {i}^{\text{th }}jth {j}^{\text{th }} 列(即 CiCj{C}_{i} \leftrightarrow {C}_{j} 对于 P)\left. {P}^{\prime }\right))。

(b) If Ri+p(x)RjRi{R}_{i} + p\left( x\right) {R}_{j} \mapsto {R}_{i} then subtract the product of the matrix p(A)p\left( A\right) times the ith {i}^{\text{th }} column of P{P}^{\prime } from the jth {j}^{\text{th }} column of P{P}^{\prime } (i.e., Cjp(A)CiCj{C}_{j} - p\left( A\right) {C}_{i} \mapsto {C}_{j} for P{P}^{\prime } - note the indices).

(b) 如果 Ri+p(x)RjRi{R}_{i} + p\left( x\right) {R}_{j} \mapsto {R}_{i} ,则从 P{P}^{\prime }jth {j}^{\text{th }} 列中减去矩阵 p(A)p\left( A\right) 乘以 P{P}^{\prime }ith {i}^{\text{th }} 列的乘积(即 Cjp(A)CiCj{C}_{j} - p\left( A\right) {C}_{i} \mapsto {C}_{j} 对于 P{P}^{\prime } - 注意索引)。

(c) If uRiu{R}_{i} then divide the elements of the ith {i}^{\text{th }} column of P{P}^{\prime } by uu (i.e., u1Ci{u}^{-1}{C}_{i} for P)\left. {P}^{\prime }\right) .

(c) 如果 uRiu{R}_{i} ,则将 P{P}^{\prime }ith {i}^{\text{th }} 列元素除以 uu(即 u1Ci{u}^{-1}{C}_{i} 对于 P)\left. {P}^{\prime }\right))。

(3) When xIA{xI} - A has been diagonalized to the form in Theorem 21 the first nmn - m columns of the matrix P{P}^{\prime } are 0 (providing a useful numerical check on the computations) and the remaining mm columns of P{P}^{\prime } are nonzero. For each i=1,2,,mi = 1,2,\ldots ,m , multiply the ith{i}^{\mathrm{{th}}} nonzero column of P{P}^{\prime } successively by A0=I,A1,A2,,Adi1,{A}^{0} = I,{A}^{1},{A}^{2},\ldots ,{A}^{{d}_{i} - 1}, where di{d}_{i} is the integer in (1) above and use the resulting column vectors (in this order) as the next di{d}_{i} columns of an n×nn \times n matrix PP . Then P1AP{P}^{-1}{AP} is in rational canonical form (whose diagonal blocks are the companion matrices for the polynomials a1(x),,am(x){a}_{1}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) in (1)).

(3) 当 xIA{xI} - A 被对角化为定理21中的形式时,矩阵 P{P}^{\prime } 的前 nmn - m 列为0(为计算提供了一个有用的数值校验),而 P{P}^{\prime } 的剩余 mm 列不为零。对于每个 i=1,2,,mi = 1,2,\ldots ,m ,将 P{P}^{\prime }ith{i}^{\mathrm{{th}}} 不为零列依次乘以 A0=I,A1,A2,,Adi1,{A}^{0} = I,{A}^{1},{A}^{2},\ldots ,{A}^{{d}_{i} - 1}, ,其中 di{d}_{i} 是上文(1)中的整数,并使用按此顺序得到的列向量作为 n×nn \times n 矩阵 PP 的下一个 di{d}_{i} 列。然后 P1AP{P}^{-1}{AP} 处于有理标准形(其对角块是多项式 a1(x),,am(x){a}_{1}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) 在(1)中的伴随矩阵)。

In the theory of canonical forms for linear transformations (or matrices) the characteristic polynomial plays the role of the order of a finite abelian group and the minimal polynomial plays the role of the exponent (after all, they are the same invariants, one for modules over the Principal Ideal Domain Z\mathbb{Z} and the other for modules over the Principal Ideal Domain F[x]F\left\lbrack x\right\rbrack ) so we can solve problems directly analogous to those we considered for finite abelian groups in Chapter 5. In particular, this includes the following:

在线性变换(或矩阵)的标准形理论中,特征多项式扮演了有限阿贝尔群阶数的作用,而最小多项式扮演了指数的作用(毕竟,它们是相同的不变量,一个适用于主理想整环 Z\mathbb{Z} 上的模,另一个适用于主理想整环 F[x]F\left\lbrack x\right\rbrack 上的模),因此我们可以直接解决与第5章中我们考虑的有限阿贝尔群类似的问题。特别是,这包括以下内容:

(A)  \textbf{(A)}\; determine the rational canonical form of a given matrix (analogous to decomposing a finite abelian group as a direct product of cyclic groups)

(A)  \textbf{(A)}\; 确定给定矩阵的有理标准形(类似于将有限阿贝尔群分解为循环群的直积)

(B) determine whether two given matrices are similar (analogous to determining whether two given finite abelian groups are isomorphic)

(B) 判断给定的两个矩阵是否相似(类似于判断给定的两个有限阿贝尔群是否同构)

(C) determine all similarity classes of matrices over FF with a given characteristic polynomial (analogous to determining all abelian groups of a given order)

(C) 确定具有给定特征多项式的 FF 上所有矩阵的相似类(类似于确定给定阶数的所有阿贝尔群)

(D) determine all similarity classes of n×nn \times n matrices over FF with a given minimal polynomial (analogous to determining all abelian groups of rank at most nn of a given exponent).

(D) 确定 n×nn \times n 矩阵在给定最小多项式 FF 上的所有相似类(类似于确定秩至多 nn 的给定指数的所有阿贝尔群)。

Examples

示例

(1) We find the rational canonical forms of the following matrices over Q\mathbb{Q} and determine if they are similar:

(1) 我们找到以下 Q\mathbb{Q} 矩阵的有理标准形,并确定它们是否相似:

A=(2214037002)  B=(04851430003)  C=(221021003).A = \left( \begin{array}{rrr} 2 & - 2 & {14} \\ 0 & 3 & - 7 \\ 0 & 0 & 2 \end{array}\right) \;B = \left( \begin{array}{rrr} 0 & - 4 & {85} \\ 1 & 4 & - {30} \\ 0 & 0 & 3 \end{array}\right) \;C = \left( \begin{array}{rrr} 2 & 2 & 1 \\ 0 & 2 & - 1 \\ 0 & 0 & 3 \end{array}\right) .

A direct computation shows that all three of these matrices have the same characteristic polynomial: cA(x)=cB(x)=cC(x)=(x2)2(x3){c}_{A}\left( x\right) = {c}_{B}\left( x\right) = {c}_{C}\left( x\right) = {\left( x - 2\right) }^{2}\left( {x - 3}\right) . Since the minimal and characteristic polynomials have the same roots, the only possibilities for the minimal polynomials are (x2)(x3)\left( {x - 2}\right) \left( {x - 3}\right) or (x2)2(x3){\left( x - 2\right) }^{2}\left( {x - 3}\right) . We quickly find that (A2I)(A3I)=0\left( {A - {2I}}\right) \left( {A - {3I}}\right) = 0 , (B2I)(B3I)0\left( {B - {2I}}\right) \left( {B - {3I}}\right) \neq 0 (the 1,1-entry is nonzero) and (C2I)(C3I)0\left( {C - {2I}}\right) \left( {C - {3I}}\right) \neq 0 (the 1,2-entry is nonzero). It follows that

直接计算表明这三个矩阵都有相同的特征多项式:cA(x)=cB(x)=cC(x)=(x2)2(x3){c}_{A}\left( x\right) = {c}_{B}\left( x\right) = {c}_{C}\left( x\right) = {\left( x - 2\right) }^{2}\left( {x - 3}\right) 。由于最小多项式和特征多项式有相同的根,最小多项式的唯一可能形式是 (x2)(x3)\left( {x - 2}\right) \left( {x - 3}\right)(x2)2(x3){\left( x - 2\right) }^{2}\left( {x - 3}\right) 。我们很快发现 (A2I)(A3I)=0\left( {A - {2I}}\right) \left( {A - {3I}}\right) = 0(B2I)(B3I)0\left( {B - {2I}}\right) \left( {B - {3I}}\right) \neq 0 (1,1-项不为零)和 (C2I)(C3I)0\left( {C - {2I}}\right) \left( {C - {3I}}\right) \neq 0 (1,2-项不为零)。因此

mA(x)=(x2)(x3),  mB(x)=mC(x)=(x2)2(x3).{m}_{A}\left( x\right) = \left( {x - 2}\right) \left( {x - 3}\right) ,\;{m}_{B}\left( x\right) = {m}_{C}\left( x\right) = {\left( x - 2\right) }^{2}\left( {x - 3}\right) .

It follows immediately that there are no additional invariant factors for BB and CC . Since the invariant factors for AA divide the minimal polynomial and have product the characteristic polynomial,we see that AA has for invariant factors the polynomials x2,(x2)(x3)=x25x+6x - 2,\left( {x - 2}\right) \left( {x - 3}\right) = {x}^{2} - {5x} + 6 . (For 2×22 \times 2 and 3×33 \times 3 matrices the determination of the characteristic and minimal polynomials determines all the invariant factors, cf. Exercises 3 and 4.) We conclude that BB and CC are similar and neither is similar to AA . The rational canonical forms are (note (x2)2(x3)=x37x2+16x12{\left( x - 2\right) }^{2}\left( {x - 3}\right) = {x}^{3} - 7{x}^{2} + {16x} - {12} )

立刻得出 BBCC 没有额外的不变因子。由于 AA 的不变因子除以最小多项式并乘积等于特征多项式,我们发现 AA 的不变因子是多项式 x2,(x2)(x3)=x25x+6x - 2,\left( {x - 2}\right) \left( {x - 3}\right) = {x}^{2} - {5x} + 6 。(对于 2×22 \times 23×33 \times 3 矩阵,特征多项式和最小多项式的确定决定了所有的不变因子,参见练习3和4。)我们得出结论,BBCC 是相似的,且都不与 AA 相似。有理标准形是(注意 (x2)2(x3)=x37x2+16x12{\left( x - 2\right) }^{2}\left( {x - 3}\right) = {x}^{3} - 7{x}^{2} + {16x} - {12}

(200006015)  (00121016017)  (00121016017).\left( \begin{array}{rrr} 2 & 0 & 0 \\ 0 & 0 & - 6 \\ 0 & 1 & 5 \end{array}\right) \;\left( \begin{array}{rrr} 0 & 0 & {12} \\ 1 & 0 & - {16} \\ 0 & 1 & 7 \end{array}\right) \;\left( \begin{array}{rrr} 0 & 0 & {12} \\ 1 & 0 & - {16} \\ 0 & 1 & 7 \end{array}\right) .

(2) In the example above the rational canonical forms were obtained simply by determining the characteristic and minimal polynomials for the matrices. As mentioned, this is sufficient for 2×22 \times 2 and 3×33 \times 3 matrices since this information is sufficient to determine all of the invariant factors. For larger matrices, however, this is in general not sufficient (cf. the next example) and more work is required to determine the invariant factors. In this example we again compute the rational canonical form for the matrix AA in Example 1 following the two algorithms outlined above. While this is computationally more difficult for this small matrix (as will be apparent), it has the advantage even in this case that it also explicitly computes a matrix PP with P1AP{P}^{-1}{AP} in rational canonical form.

在上述示例中,通过简单地确定矩阵的特征多项式和最小多项式来获得有理标准形。如前所述,这对于 2×22 \times 23×33 \times 3 矩阵来说是足够的,因为这一信息足以确定所有的不变因子。然而,对于更大的矩阵,这通常是不够的(参见下一个示例),需要更多的工作来确定不变因子。在这个示例中,我们再次计算示例 1 中矩阵 AA 的有理标准形,遵循上述两种算法。虽然对于这个小矩阵来说计算上更加困难(正如将出现的),但即使在每种情况下,它也有优势,即它还显式地计算了一个 PP 矩阵,其 P1AP{P}^{-1}{AP} 在有理标准形中。

I. (Invariant Factor Decomposition) We use row and column operations (in Q[x]\mathbb{Q}\left\lbrack x\right\rbrack ) to reduce the matrix

I.(不变因子分解)我们使用行和列操作(在 Q[x]\mathbb{Q}\left\lbrack x\right\rbrack 中)来简化矩阵

xIA=(x22140x3700x2){xI} - A = \left( \begin{matrix} x - 2 & 2 & - {14} \\ 0 & x - 3 & 7 \\ 0 & 0 & x - 2 \end{matrix}\right)

to diagonal form. As in the invariant factor decomposition algorithm, we shall use the notation RiRj{R}_{i} \leftrightarrow {R}_{j} to denote the interchange of the ith {i}^{\text{th }} and jth {j}^{\text{th }} rows, Ri+aRjRi{R}_{i} + a{R}_{j} \mapsto {R}_{i} if aa times the jth {j}^{\text{th }} row is added to the ith {i}^{\text{th }} row,simply uRiu{R}_{i} if the ith {i}^{\text{th }} row is multiplied by uu (and similarly for columns,using CC instead of RR ). Note also that the first two operations we perform below are rather ad  hoc{ad}\;{hoc} and were chosen simply to have integers everywhere in the computation:

到对角形式。和在不变因子分解算法中一样,我们将使用记号 RiRj{R}_{i} \leftrightarrow {R}_{j} 来表示交换 ith {i}^{\text{th }}jth {j}^{\text{th }} 行,如果 aa 倍的 jth {j}^{\text{th }} 行加到 ith {i}^{\text{th }} 行上,则是 Ri+aRjRi{R}_{i} + a{R}_{j} \mapsto {R}_{i} ,如果 ith {i}^{\text{th }} 行乘以 uu ,则是简单地 uRiu{R}_{i}(对于列操作,使用 CC 而不是 RR)。请注意,下面我们执行的前两个操作相当 ad  hoc{ad}\;{hoc} ,我们选择它们仅仅是为了在计算中到处都有整数:

(x22140x3700x2)R1+R2(x2x170x3700x2)\left( \begin{matrix} x - 2 & 2 & - {14} \\ 0 & x - 3 & 7 \\ 0 & 0 & x - 2 \end{matrix}\right) \underset{{R}_{1} + {R}_{2}}{ \rightarrow }\left( \begin{matrix} x - 2 & x - 1 & - 7 \\ 0 & x - 3 & 7 \\ 0 & 0 & x - 2 \end{matrix}\right) \rightarrow
C1C2(1x17x+3x3700x2)R1(1x+17x+3x3700x2)\overset{ \rightarrow }{\underset{{C}_{1} - {C}_{2}}{ \rightarrow }}\left( \begin{matrix} - 1 & x - 1 & - 7 \\ - x + 3 & x - 3 & 7 \\ 0 & 0 & x - 2 \end{matrix}\right) \underset{-{R}_{1}}{ \rightarrow }\left( \begin{matrix} 1 & - x + 1 & 7 \\ - x + 3 & x - 3 & 7 \\ 0 & 0 & x - 2 \end{matrix}\right) \rightarrow
R2+(x3)R1(1x+170x2+5x67(x2)00x2)C2+(x1)C1(1070x2+5x67(x2)00x2)\underset{{R}_{2} + \left( {x - 3}\right) {R}_{1}}{ \rightarrow }\left( \begin{matrix} 1 & - x + 1 & 7 \\ 0 & - {x}^{2} + {5x} - 6 & 7\left( {x - 2}\right) \\ 0 & 0 & x - 2 \end{matrix}\right) \underset{{C}_{2} + \left( {x - 1}\right) {C}_{1}}{ \rightarrow }\left( \begin{matrix} 1 & 0 & 7 \\ 0 & - {x}^{2} + {5x} - 6 & 7\left( {x - 2}\right) \\ 0 & 0 & x - 2 \end{matrix}\right) \rightarrow
C37C1C3(1000x2+5x67(x2)00x2)C2(1000x25x+67(x2)00x2)\underset{ \mapsto {C}_{3}}{\overset{{C}_{3} - 7{C}_{1}}{ \rightarrow }}\left( \begin{matrix} 1 & 0 & 0 \\ 0 & - {x}^{2} + {5x} - 6 & 7\left( {x - 2}\right) \\ 0 & 0 & x - 2 \end{matrix}\right) \underset{-{C}_{2}}{ \rightarrow }\left( \begin{matrix} 1 & 0 & 0 \\ 0 & {x}^{2} - {5x} + 6 & 7\left( {x - 2}\right) \\ 0 & 0 & x - 2 \end{matrix}\right) \rightarrow
R27R3(1000x25x+6000x2)C2C3(1000x2000x25x+6).\underset{{R}_{2} - 7{R}_{3}}{ \rightarrow }\left( \begin{matrix} 1 & 0 & 0 \\ 0 & {x}^{2} - {5x} + 6 & 0 \\ 0 & 0 & x - 2 \end{matrix}\right) \underset{{C}_{2} \leftrightarrow {C}_{3}}{ \rightarrow }\left( \begin{matrix} 1 & 0 & 0 \\ 0 & x - 2 & 0 \\ 0 & 0 & {x}^{2} - {5x} + 6 \end{matrix}\right) .

This determines the invariant factors x2,x25x+6x - 2,{x}^{2} - {5x} + 6 for this matrix,which we determined in Example 1 above. Let now VV be a 3-dimensional vector space over Q\mathbb{Q} with basis e1,e2,e3{e}_{1},{e}_{2},{e}_{3} and let TT be the corresponding linear transformation (which defines the action of xx on VV ),i.e.,

这确定了该矩阵的不变因子 x2,x25x+6x - 2,{x}^{2} - {5x} + 6,我们在上面的示例 1 中确定了它们。现在让 VV 是一个 Q\mathbb{Q} 上的三维向量空间,其基为 e1,e2,e3{e}_{1},{e}_{2},{e}_{3},并让 TT 是相应的线性变换(它定义了 xxVV 上的作用),即,

xe1=T(e1)=2e1x{e}_{1} = T\left( {e}_{1}\right) = 2{e}_{1}
xe2=T(e2)=2e1+3e2x{e}_{2} = T\left( {e}_{2}\right) = - 2{e}_{1} + 3{e}_{2}
xe3=T(e3)=14e17e2+2e3.x{e}_{3} = T\left( {e}_{3}\right) = {14}{e}_{1} - 7{e}_{2} + 2{e}_{3}.

The row operations used in the reduction above were

在上述简化中使用的行操作是

R1+R2R1,R1,R2+(x3)R1R2,R27R3R2,R2R3.{R}_{1} + {R}_{2} \mapsto {R}_{1}, - {R}_{1},{R}_{2} + \left( {x - 3}\right) {R}_{1} \mapsto {R}_{2},{R}_{2} - 7{R}_{3} \mapsto {R}_{2},{R}_{2} \leftrightarrow {R}_{3}.

Starting with the basis [e1,e2,e3]\left\lbrack {{e}_{1},{e}_{2},{e}_{3}}\right\rbrack for VV and changing it according to the rules given in the text, we obtain

Starting with the basis [e1,e2,e3]\left\lbrack {{e}_{1},{e}_{2},{e}_{3}}\right\rbrack for VV and changing it according to the rules given in the text, we obtain

[e1,e2,e3][e1,e2e1,e3][e1,e2e1,e3]\left\lbrack {{e}_{1},{e}_{2},{e}_{3}}\right\rbrack \rightarrow \left\lbrack {{e}_{1},{e}_{2} - {e}_{1},{e}_{3}}\right\rbrack \rightarrow \left\lbrack {-{e}_{1},{e}_{2} - {e}_{1},{e}_{3}}\right\rbrack
[e1(x3)(e2e1),e2e1,e3]\rightarrow \left\lbrack {-{e}_{1} - \left( {x - 3}\right) \left( {{e}_{2} - {e}_{1}}\right) ,{e}_{2} - {e}_{1},{e}_{3}}\right\rbrack
[e1(x3)(e2e1),e2e1,e3+7(e2e1)]\rightarrow \left\lbrack {-{e}_{1} - \left( {x - 3}\right) \left( {{e}_{2} - {e}_{1}}\right) ,{e}_{2} - {e}_{1},{e}_{3} + 7\left( {{e}_{2} - {e}_{1}}\right) }\right\rbrack
[e1(x3)(e2e1),e3+7(e2e1),e2e1].\rightarrow \left\lbrack {-{e}_{1} - \left( {x - 3}\right) \left( {{e}_{2} - {e}_{1}}\right) ,{e}_{3} + 7\left( {{e}_{2} - {e}_{1}}\right) ,{e}_{2} - {e}_{1}}\right\rbrack .

Using the formulas above for the action of xx ,we see that these last elements are the elements [0,7e1+7e2+e3,e1+e2]\left\lbrack {0, - 7{e}_{1} + 7{e}_{2} + {e}_{3}, - {e}_{1} + {e}_{2}}\right\rbrack of VV corresponding to the elements 1,x21,x - 2 and x25x+6{x}^{2} - {5x} + 6 in the diagonalized form of xIA{xI} - A ,respectively. The elements f1=7e1+7e2+e3{f}_{1} = - 7{e}_{1} + 7{e}_{2} + {e}_{3} and f2=e1+e2{f}_{2} = - {e}_{1} + {e}_{2} are therefore Q[x]\mathbb{Q}\left\lbrack x\right\rbrack -module generators for the two cyclic factors of VV in its invariant factor decomposition as a Q[x]\mathbb{Q}\left\lbrack x\right\rbrack -module. The corresponding Q\mathbb{Q} -vector space bases for these two factors are then f1{f}_{1} and f2,xf2=Tf2,{f}_{2},x{f}_{2} = T{f}_{2}, i.e., 7e1+7e2+e3- 7{e}_{1} + 7{e}_{2} + {e}_{3} and e1+e2,T(e1+e2)=4e1+3e2.- {e}_{1} + {e}_{2},T\left( {-{e}_{1} + {e}_{2}}\right) = - 4{e}_{1} + 3{e}_{2}.

Using the formulas above for the action of xx ,we see that these last elements are the elements [0,7e1+7e2+e3,e1+e2]\left\lbrack {0, - 7{e}_{1} + 7{e}_{2} + {e}_{3}, - {e}_{1} + {e}_{2}}\right\rbrack of VV corresponding to the elements 1,x21,x - 2 and x25x+6{x}^{2} - {5x} + 6 in the diagonalized form of xIA{xI} - A ,respectively. The elements f1=7e1+7e2+e3{f}_{1} = - 7{e}_{1} + 7{e}_{2} + {e}_{3} and f2=e1+e2{f}_{2} = - {e}_{1} + {e}_{2} are therefore Q[x]\mathbb{Q}\left\lbrack x\right\rbrack -module generators for the two cyclic factors of VV in its invariant factor decomposition as a Q[x]\mathbb{Q}\left\lbrack x\right\rbrack -module. The corresponding Q\mathbb{Q} -vector space bases for these two factors are then f1{f}_{1} and f2,xf2=Tf2,{f}_{2},x{f}_{2} = T{f}_{2}, i.e., 7e1+7e2+e3- 7{e}_{1} + 7{e}_{2} + {e}_{3} and e1+e2,T(e1+e2)=4e1+3e2.- {e}_{1} + {e}_{2},T\left( {-{e}_{1} + {e}_{2}}\right) = - 4{e}_{1} + 3{e}_{2}.

Then the matrix

Then the matrix

P=(714713100)P = \left( \begin{array}{rrr} - 7 & - 1 & - 4 \\ 7 & 1 & 3 \\ 1 & 0 & 0 \end{array}\right)

conjugates AA into its rational canonical form:

conjugates AA into its rational canonical form:

P1AP=(200006015){P}^{-1}{AP} = \left( \begin{array}{rrr} 2 & 0 & 0 \\ 0 & 0 & - 6 \\ 0 & 1 & 5 \end{array}\right)

as one easily checks.

as one easily checks.

II. (Converting A Directly to Rational Canonical Form) We use the row operations involved in the diagonalization of xIA{xI} - A to determine the matrix P{P}^{\prime } of the algorithm above:

II. (Converting A Directly to Rational Canonical Form) We use the row operations involved in the diagonalization of xIA{xI} - A to determine the matrix P{P}^{\prime } of the algorithm above:

(100010001)C2C1(110010001)C1(110010001)\left( \begin{array}{lll} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{array}\right) \underset{{C}_{2} - {C}_{1}}{ \rightarrow }\left( \begin{array}{rrr} 1 & - 1 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{array}\right) \underset{-{C}_{1}}{ \rightarrow }\left( \begin{array}{rrr} - 1 & - 1 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{array}\right) \rightarrow
C1(A3I)C2C1(010010001)C3+7C2(017017001)C2C3(071071010)=P.{C}_{1} - \underset{ \mapsto {C}_{1}}{\left( {A - {3I}}\right) {C}_{2}}\left( \begin{array}{rrr} 0 & - 1 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{array}\right) \underset{{C}_{3} + 7{C}_{2}}{ \rightarrow }\left( \begin{array}{rrr} 0 & - 1 & - 7 \\ 0 & 1 & 7 \\ 0 & 0 & 1 \end{array}\right) \underset{{C}_{2} \leftrightarrow {C}_{3}}{ \rightarrow }\left( \begin{array}{rrr} 0 & - 7 & - 1 \\ 0 & 7 & 1 \\ 0 & 1 & 0 \end{array}\right) = {P}^{\prime }.

Here we have d1=1{d}_{1} = 1 and d2=2{d}_{2} = 2 ,corresponding to the second and third nonzero columns of P{P}^{\prime } ,respectively. The columns of PP are therefore given by

Here we have d1=1{d}_{1} = 1 and d2=2{d}_{2} = 2 ,corresponding to the second and third nonzero columns of P{P}^{\prime } ,respectively. The columns of PP are therefore given by

(771) and (110),  A(110)=(430)\left( \begin{array}{r} - 7 \\ 7 \\ 1 \end{array}\right) \text{ and }\left( \begin{array}{r} - 1 \\ 1 \\ 0 \end{array}\right) ,\;A\left( \begin{array}{r} - 1 \\ 1 \\ 0 \end{array}\right) = \left( \begin{array}{r} - 4 \\ 3 \\ 0 \end{array}\right)

respectively,which again gives the matrix PP above.

respectively,which again gives the matrix PP above.

(3) For the 3×33 \times 3 matrix AA it was not necessary to perform the lengthy calculations above merely to determine the rational canonical form (equivalently, the invariant factors),as we saw in Example 1. For n×nn \times n matrices with n4n \geq 4 ,however,the computation of the characteristic and minimal polynomials is in general not sufficient for the determination of all the invariant factors, so the more extensive calculations of the previous example may become necessary. For example, consider the matrix

(3) For the 3×33 \times 3 matrix AA it was not necessary to perform the lengthy calculations above merely to determine the rational canonical form (equivalently, the invariant factors),as we saw in Example 1. For n×nn \times n matrices with n4n \geq 4 ,however,the computation of the characteristic and minimal polynomials is in general not sufficient for the determination of all the invariant factors, so the more extensive calculations of the previous example may become necessary. For example, consider the matrix

D=(1244214810120123).D = \left( \begin{array}{rrrr} 1 & 2 & - 4 & 4 \\ 2 & - 1 & 4 & - 8 \\ 1 & 0 & 1 & - 2 \\ 0 & 1 & - 2 & 3 \end{array}\right) .

A short computation shows that the characteristic polynomial of DD is (x1)4{\left( x - 1\right) }^{4} . The possible minimal polynomials are then x1,(x1)2,(x1)3x - 1,{\left( x - 1\right) }^{2},{\left( x - 1\right) }^{3} and (x1)4{\left( x - 1\right) }^{4} . Clearly DI0D - I \neq 0 and another short computation shows that (DI)2=0{\left( D - I\right) }^{2} = 0 ,so the minimal polynomial for DD is (x1)2{\left( x - 1\right) }^{2} . There are then two possible sets of invariant factors:

简单计算表明,DD 的特征多项式是 (x1)4{\left( x - 1\right) }^{4} 。可能的最小多项式是 x1,(x1)2,(x1)3x - 1,{\left( x - 1\right) }^{2},{\left( x - 1\right) }^{3}(x1)4{\left( x - 1\right) }^{4} 。显然 DI0D - I \neq 0 ,另一次简单计算表明 (DI)2=0{\left( D - I\right) }^{2} = 0 ,因此 DD 的最小多项式是 (x1)2{\left( x - 1\right) }^{2} 。因此有两个可能的不变因子集合:

x1,x1,(x1)2   and   (x1)2,(x1)2.x - 1,x - 1,{\left( x - 1\right) }^{2}\;\text{ and }\;{\left( x - 1\right) }^{2},{\left( x - 1\right) }^{2}.

To determine the invariant factors for DD we apply the procedure of the previous example to the 4×44 \times 4 matrix

为了确定 DD 的不变因子,我们将前一个例子的方法应用到 4×44 \times 4 矩阵上

xID=(x12442x+14810x12012x3).{xI} - D = \left( \begin{matrix} x - 1 & - 2 & 4 & - 4 \\ - 2 & x + 1 & - 4 & 8 \\ - 1 & 0 & x - 1 & 2 \\ 0 & - 1 & 2 & x - 3 \end{matrix}\right) .

The diagonal matrix obtained from this matrix by elementary row and column opera-

通过初等行和列变换从该矩阵得到的对角矩阵是矩阵

tions is the matrix

这表明 的不变因子是 (一组使 对角化的初等行和列变换是 ,

(1000010000(x1)20000(x1)2),\left( \begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & {\left( x - 1\right) }^{2} & 0 \\ 0 & 0 & 0 & {\left( x - 1\right) }^{2} \end{matrix}\right) ,

which shows that the invariant factors for DD are (x1)2,(x1)2{\left( x - 1\right) }^{2},{\left( x - 1\right) }^{2} (one series of elementary row and column operations which diagonalize xID{xI} - D are R1R3,R1{R}_{1} \leftrightarrow {R}_{3}, - {R}_{1} ,

DD (x1)2,(x1)2{\left( x - 1\right) }^{2},{\left( x - 1\right) }^{2}

R2+2R1R2  ,  R3(x1)R1R3  ,  C3+(x1)C1C3  ,  C4+2C1C4  ,\begin{matrix} {R}_{2} + 2{R}_{1} \mapsto {R}_{2}\;,\;{R}_{3} - \left( {x - 1}\right) {R}_{1} \mapsto {R}_{3}\;,\;{C}_{3} + \left( {x - 1}\right) {C}_{1} \mapsto {C}_{3}\;,\;{C}_{4} + 2{C}_{1} \mapsto {C}_{4}\;, \end{matrix}

R2R4,R2,R3+2R2R3,R4(x+1)R2R4,C3+2C2C3,{R}_{2} \leftrightarrow {R}_{4}, - {R}_{2},{R}_{3} + 2{R}_{2} \mapsto {R}_{3},{R}_{4} - \left( {x + 1}\right) {R}_{2} \mapsto {R}_{4},{C}_{3} + 2{C}_{2} \mapsto {C}_{3}, C4+(x3)C2C4)\left. {{C}_{4} + \left( {x - 3}\right) {C}_{2} \mapsto {C}_{4}}\right) .

R2R4,R2,R3+2R2R3,R4(x+1)R2R4,C3+2C2C3,{R}_{2} \leftrightarrow {R}_{4}, - {R}_{2},{R}_{3} + 2{R}_{2} \mapsto {R}_{3},{R}_{4} - \left( {x + 1}\right) {R}_{2} \mapsto {R}_{4},{C}_{3} + 2{C}_{2} \mapsto {C}_{3}, C4+(x3)C2C4)\left. {{C}_{4} + \left( {x - 3}\right) {C}_{2} \mapsto {C}_{4}}\right)

I. (Invariant Factor Decomposition) If e1,e2,e3,e4{e}_{1},{e}_{2},{e}_{3},{e}_{4} is a basis for VV in this case,then using the row operations in this diagonalization as in the previous example we see that the generators of VV corresponding to the factors above are (x1)e12e2e3=0\left( {x - 1}\right) {e}_{1} - 2{e}_{2} - {e}_{3} = 0 , 2e1+(x+1)e2e4=0,e1,e2- 2{e}_{1} + \left( {x + 1}\right) {e}_{2} - {e}_{4} = 0,{e}_{1},{e}_{2} . Hence a vector space basis for the two direct factors in the invariant decomposition of VV in this case is given by e1,Te1{e}_{1},T{e}_{1} and e2,Te2{e}_{2},T{e}_{2} where The corresponding matrix PP relating these bases is

I. (不变因子分解)如果 e1,e2,e3,e4{e}_{1},{e}_{2},{e}_{3},{e}_{4} 是这个例子中 VV 的一个基,那么使用这个对角化中的行变换,就像前一个例子中那样,我们看到对应于上述因子的 VV 的生成元是 (x1)e12e2e3=0\left( {x - 1}\right) {e}_{1} - 2{e}_{2} - {e}_{3} = 02e1+(x+1)e2e4=0,e1,e2- 2{e}_{1} + \left( {x + 1}\right) {e}_{2} - {e}_{4} = 0,{e}_{1},{e}_{2} 。因此,在这种情况下,VV 的不变分解中的两个直和因子的一个向量空间基由 e1,Te1{e}_{1},T{e}_{1}e2,Te2{e}_{2},T{e}_{2} 给出,其中对应的矩阵 PP 关联这些基是

P=(1102021101000001)P = \left( \begin{array}{rrrr} 1 & 1 & 0 & 2 \\ 0 & 2 & 1 & - 1 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{array}\right)

so that P1DP{P}^{-1}{DP} is in rational canonical form:

使得 P1DP{P}^{-1}{DP} 是有理标准形:

P1DP=(0100120000010012){P}^{-1}{DP} = \left( \begin{array}{rrrr} 0 & - 1 & 0 & 0 \\ 1 & 2 & 0 & 0 \\ 0 & 0 & 0 & - 1 \\ 0 & 0 & 1 & 2 \end{array}\right)

as can easily be checked.

这可以很容易地验证。

II. (Converting D Directly to Rational Canonical Form) As in Example 2 we determine the matrix P{P}^{\prime } of the algorithm from the row operations used in the diagonalization of xID:{xI} - D :

II. (直接将有理标准形转换为 D)就像在例子 2 中那样,我们通过在 xID:{xI} - D : 的对角化中使用的行变换确定算法的矩阵 P{P}^{\prime }

(1000010000100001)C1C3(0010010010000001)C1(0010010010000001)\left( \begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}\right) \underset{{C}_{1} \leftrightarrow {C}_{3}}{ \rightarrow }\left( \begin{matrix} 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}\right) \underset{-{C}_{1}}{ \rightarrow }\left( \begin{matrix} 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ - 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}\right) \rightarrow
c1c2C1(0010210010000001)c1+(DI)c3C1(0010010000000001)c2c4(0010000100000100)\underset{ \mapsto {C}_{1}}{\overset{ \rightarrow }{{c}_{1} \rightarrow {c}_{2}}}\left( \begin{array}{rrrr} 0 & 0 & 1 & 0 \\ - 2 & 1 & 0 & 0 \\ - 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{array}\right) \underset{ \mapsto {C}_{1}}{\overset{ \rightarrow }{{c}_{1} + \left( {D - I}\right) {c}_{3}}}\left( \begin{array}{rrrr} 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{array}\right) \overset{ \rightarrow }{{c}_{2} \leftrightarrow {c}_{4}}\left( \begin{array}{rrrr} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{array}\right) \rightarrow
C2(0010000100000100)C22C3(0210000100000100)C2+(D+I)C4(0010000100000000)=P\underset{-{C}_{2}}{ \rightarrow }\left( \begin{matrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & - 1 & 0 & 0 \end{matrix}\right) \underset{{C}_{2} - 2{C}_{3}}{ \rightarrow }\left( \begin{matrix} 0 & - 2 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & - 1 & 0 & 0 \end{matrix}\right) \underset{{C}_{2} + \left( \overline{D + I}\right) {C}_{4}}{ \rightarrow }\left( \begin{matrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\right) = {P}^{\prime }

Here we have d1=2{d}_{1} = 2 and d2=2{d}_{2} = 2 ,corresponding to the third and fourth nonzero columns of P{P}^{\prime } . The columns of PP are therefore given by

在这里,我们有 d1=2{d}_{1} = 2d2=2{d}_{2} = 2 ,对应于 P{P}^{\prime } 的第三和第四个非零列。因此,PP 的列由以下给出

(1000),  D(1000)=(1210) and (0100),  D(0100)=(2101),\left( \begin{array}{l} 1 \\ 0 \\ 0 \\ 0 \end{array}\right) ,\;D\left( \begin{array}{l} 1 \\ 0 \\ 0 \\ 0 \end{array}\right) = \left( \begin{array}{l} 1 \\ 2 \\ 1 \\ 0 \end{array}\right) \text{ and }\left( \begin{array}{l} 0 \\ 1 \\ 0 \\ 0 \end{array}\right) ,\;D\left( \begin{array}{l} 0 \\ 1 \\ 0 \\ 0 \end{array}\right) = \left( \begin{array}{r} 2 \\ - 1 \\ 0 \\ 1 \end{array}\right) ,

respectively,which again gives the matrix PP above.

分别,这再次给出了上面的矩阵 PP

(4) In this example we determine all similarity classes of matrices AA with entries from Q\mathbb{Q} with characteristic polynomial (x41)(x21)\left( {{x}^{4} - 1}\right) \left( {{x}^{2} - 1}\right) . First note that any matrix with a degree

(4) 在这个例子中,我们确定所有具有 AA 元素的特征多项式为 (x41)(x21)\left( {{x}^{4} - 1}\right) \left( {{x}^{2} - 1}\right) 的矩阵 Q\mathbb{Q} 的相似类。首先注意,任何具有六次特征多项式的矩阵必须是一个 AA 矩阵。多项式 Q\mathbb{Q}(x41)(x21)\left( {{x}^{4} - 1}\right) \left( {{x}^{2} - 1}\right) 中分解为不可约项为 。由于 的最小多项式 与 具有相同的根,因此 能整除 。假设 是某些 的不变因子,那么 (特别是,所有不变因子都能整除 )和 。在这些约束条件下,唯一允许的列表是

6 characteristic polynomial must be a 6×66 \times 6 matrix. The polynomial (x41)(x21)\left( {{x}^{4} - 1}\right) \left( {{x}^{2} - 1}\right) factors into irreducibles in Q[x]\mathbb{Q}\left\lbrack x\right\rbrack as (x1)2(x+1)2(x2+1){\left( x - 1\right) }^{2}{\left( x + 1\right) }^{2}\left( {{x}^{2} + 1}\right) . Since the minimal polynomial mA(x){m}_{A}\left( x\right) for AA has the same roots as cA(x){c}_{A}\left( x\right) it follows that (x1)(x+1)(x2+1)\left( {x - 1}\right) \left( {x + 1}\right) \left( {{x}^{2} + 1}\right) divides mA(x){m}_{A}\left( x\right) . Suppose a1(x),,am(x){a}_{1}\left( x\right) ,\ldots ,{a}_{m}\left( x\right) are the invariant factors of some AA ,so am(x)=mA(x),ai(x)ai+1(x){a}_{m}\left( x\right) = {m}_{A}\left( x\right) ,{a}_{i}\left( x\right) \mid {a}_{i + 1}\left( x\right) (in particular,all the invariant factors divide mA(x){m}_{A}\left( x\right) ) and a1(x)a2(x)am(x)=(x41)(x21){a}_{1}\left( x\right) {a}_{2}\left( x\right) \cdots {a}_{m}\left( x\right) = \left( {{x}^{4} - 1}\right) \left( {{x}^{2} - 1}\right) . One easily sees that the only permissible lists under these constraints are

(5) 这个多项式分解为 (x1)2(x+1)2(x2+1){\left( x - 1\right) }^{2}{\left( x + 1\right) }^{2}\left( {{x}^{2} + 1}\right) 。假设 6×66 \times 6 是任何最小多项式能整除 Q[x]\mathbb{Q}\left\lbrack x\right\rbrack(x41)(x21)\left( {{x}^{4} - 1}\right) \left( {{x}^{2} - 1}\right) 矩阵,那么 (x1)2(x+1)2(x2+1){\left( x - 1\right) }^{2}{\left( x + 1\right) }^{2}\left( {{x}^{2} + 1}\right) 。对于 6×66 \times 6 的最小多项式的唯一限制是其次数最多为3(根据凯莱-哈密顿定理)。因此,此类矩阵 mA(x){m}_{A}\left( x\right) 的最小多项式的唯一可能是

(a) (x1)(x+1),  (x1)(x+1)(x2+1)\left( {x - 1}\right) \left( {x + 1}\right) ,\;\left( {x - 1}\right) \left( {x + 1}\right) \left( {{x}^{2} + 1}\right)

(a) (x1)(x+1),  (x1)(x+1)(x2+1)\left( {x - 1}\right) \left( {x + 1}\right) ,\;\left( {x - 1}\right) \left( {x + 1}\right) \left( {{x}^{2} + 1}\right)

(b) x1,  (x1)(x+1)2(x2+1)x - 1,\;\left( {x - 1}\right) {\left( x + 1\right) }^{2}\left( {{x}^{2} + 1}\right)

(b) x1,  (x1)(x+1)2(x2+1)x - 1,\;\left( {x - 1}\right) {\left( x + 1\right) }^{2}\left( {{x}^{2} + 1}\right)

(c) x+1,  (x1)2(x+1)(x2+1)x + 1,\;{\left( x - 1\right) }^{2}\left( {x + 1}\right) \left( {{x}^{2} + 1}\right)

(c) x+1,  (x1)2(x+1)(x2+1)x + 1,\;{\left( x - 1\right) }^{2}\left( {x + 1}\right) \left( {{x}^{2} + 1}\right)

(d) (x1)2(x+1)2(x2+1){\left( x - 1\right) }^{2}{\left( x + 1\right) }^{2}\left( {{x}^{2} + 1}\right) .

(d) (x1)2(x+1)2(x2+1){\left( x - 1\right) }^{2}{\left( x + 1\right) }^{2}\left( {{x}^{2} + 1}\right)

One can now easily write out the corresponding direct sums of companion matrices to obtain representatives of the 4 similarity classes. We shall see in the next section that there are still only 4 similarity classes even in M6(C){M}_{6}\left( \mathbb{C}\right) .

现在可以很容易地写出相应的伴随矩阵的直接和,以获得4个相似类的代表。我们将在下一节看到,即使在 M6(C){M}_{6}\left( \mathbb{C}\right) 中,仍然只有4个相似类。

(5) In this example we find all similarity classes of 3×33 \times 3 matrices AA with entries from Q\mathbb{Q} satisfying A6=I{A}^{6} = I . For each such AA ,its minimal polynomial divides x61{x}^{6} - 1 and in Q[x]\mathbb{Q}\left\lbrack x\right\rbrack the complete factorization of this polynomial is

(5) 在这个例子中,我们找到所有满足 A6=I{A}^{6} = I3×33 \times 3 矩阵 AA 的相似类。对于每个这样的 AA ,它的最小多项式能整除 x61{x}^{6} - 1 ,在 Q[x]\mathbb{Q}\left\lbrack x\right\rbrack 中,这个多项式的完全分解是

x61=(x1)(x+1)(x2x+1)(x2+x+1).{x}^{6} - 1 = \left( {x - 1}\right) \left( {x + 1}\right) \left( {{x}^{2} - x + 1}\right) \left( {{x}^{2} + x + 1}\right) .

Conversely,if BB is any 3×33 \times 3 matrix whose minimal polynomial divides x61{x}^{6} - 1 ,then B6=I{B}^{6} = I . The only restriction on the minimal polynomial for BB is that its degree is at most 3 (by the Cayley-Hamilton Theorem). The only possibilities for the minimal polynomial of such a matrix AA are therefore

反之,如果 BB 是任何最小多项式能整除 x61{x}^{6} - 13×33 \times 3 矩阵,那么 B6=I{B}^{6} = I 。对于 BB 的最小多项式的唯一限制是其次数最多为3(根据凯莱-哈密顿定理)。因此,此类矩阵 AA 的最小多项式的唯一可能是

(a) x1x - 1

(a) x1x - 1

(b) x+1x + 1

(b) x+1x + 1

(c) x2x+1{x}^{2} - x + 1

(c) x2x+1{x}^{2} - x + 1

(d) x2+x+1{x}^{2} + x + 1

(d) x2+x+1{x}^{2} + x + 1

(e) (x1)(x+1)\left( {x - 1}\right) \left( {x + 1}\right)

(e) (x1)(x+1)\left( {x - 1}\right) \left( {x + 1}\right)

(f) (x1)(x2x+1)\left( {x - 1}\right) \left( {{x}^{2} - x + 1}\right)

(f) (x1)(x2x+1)\left( {x - 1}\right) \left( {{x}^{2} - x + 1}\right)

(g) (x1)(x2+x+1)\left( {x - 1}\right) \left( {{x}^{2} + x + 1}\right)

(g) (x1)(x2+x+1)\left( {x - 1}\right) \left( {{x}^{2} + x + 1}\right)

(b) (x+1)(x2x+1)\left( {x + 1}\right) \left( {{x}^{2} - x + 1}\right)

(b) (x+1)(x2x+1)\left( {x + 1}\right) \left( {{x}^{2} - x + 1}\right)

(i) (x+1)(x2+x+1)\left( {x + 1}\right) \left( {{x}^{2} + x + 1}\right) .

(i) (x+1)(x2+x+1)\left( {x + 1}\right) \left( {{x}^{2} + x + 1}\right) .

Under the constraints of the rational canonical form these give rise to the following permissible lists of invariant factors:

在有理标准形的约束下,这些会产生以下允许的不变因子列表:

(i) x1,  x1,  x1x - 1,\;x - 1,\;x - 1

(i) x1,  x1,  x1x - 1,\;x - 1,\;x - 1

(ii) x+1,x+1,x+1x + 1,x + 1,x + 1

(ii) x+1,x+1,x+1x + 1,x + 1,x + 1

(iii) x1,  (x1)(x+1)x - 1,\;\left( {x - 1}\right) \left( {x + 1}\right)

(iii) x1,  (x1)(x+1)x - 1,\;\left( {x - 1}\right) \left( {x + 1}\right)

(iv) x+1,  (x1)(x+1)x + 1,\;\left( {x - 1}\right) \left( {x + 1}\right)

(iv) x+1,  (x1)(x+1)x + 1,\;\left( {x - 1}\right) \left( {x + 1}\right)

(v) (x1)(x2x+1)\left( {x - 1}\right) \left( {{x}^{2} - x + 1}\right)

(v) (x1)(x2x+1)\left( {x - 1}\right) \left( {{x}^{2} - x + 1}\right)

(vi) (x1)(x2+x+1)\left( {x - 1}\right) \left( {{x}^{2} + x + 1}\right)

(vi) (x1)(x2+x+1)\left( {x - 1}\right) \left( {{x}^{2} + x + 1}\right)

(vii) (x+1)(x2x+1)\left( {x + 1}\right) \left( {{x}^{2} - x + 1}\right)

(vii) (x+1)(x2x+1)\left( {x + 1}\right) \left( {{x}^{2} - x + 1}\right)

(viii) (x+1)(x2+x+1)\left( {x + 1}\right) \left( {{x}^{2} + x + 1}\right) .

(viii) (x+1)(x2+x+1)\left( {x + 1}\right) \left( {{x}^{2} + x + 1}\right) .

Note that it is impossible to have a suitable set of invariant factors if the minimal polynomial is x2+x+1{x}^{2} + x + 1 or x2x+1{x}^{2} - x + 1 . One can now write out the corresponding

注意,如果最小多项式是 x2+x+1{x}^{2} + x + 1x2x+1{x}^{2} - x + 1,则不可能有一组合适的不变因子。现在可以写出相应的

rational canonical forms; for example,(i) is II ,(ii) is I- I ,and (iii) is

有理标准形;例如,(i)是 II ,(ii)是 I- I ,而(iii)是

(100001010)\left( \begin{array}{lll} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{array}\right)

Note also that another way of phrasing this result is that any 3×33 \times 3 matrix with entries from Q whose order (multiplicatively,of course) divides 6 is similar to one of these 8 matrices, so this example determines all elements of orders 1,2,3 and 6 in the group GL3(Q){\mathrm{{GL}}}_{3}\left( \mathbb{Q}\right) (up to similarity).

还应注意,这个结果的另一种表述方式是,任何元素来自有理数域 Q 且其阶(乘法意义上)整除 6 的 3×33 \times 3 矩阵都与这 8 个矩阵之一相似,因此这个例子确定了群 GL3(Q){\mathrm{{GL}}}_{3}\left( \mathbb{Q}\right) 中阶数为 1、2、3 和 6 的所有元素(相似性不变)。

EXERCISES

练习

  1. Prove that similar linear transformations of VV (or n×nn \times n matrices) have the same characteristic and the same minimal polynomial.

  2. 证明相似的线性变换 VV(或 n×nn \times n 矩阵)具有相同的特征多项式和相同的最小多项式。

  3. Let MM be as in Lemma 19. Prove that the minimal polynomial of MM is the least common multiple of the minimal polynomials of A1,,Ak{A}_{1},\ldots ,{A}_{k} .

  4. MM 如引理19中所述。证明 MM 的最小多项式是 A1,,Ak{A}_{1},\ldots ,{A}_{k} 的最小多项式的最小公倍数。

  5. Prove that two 2×22 \times 2 matrices over FF which are not scalar matrices are similar if and only if they have the same characteristic polynomial.

  6. 证明两个在 FF 上的非纯量矩阵 2×22 \times 2 相似当且仅当它们具有相同的特征多项式。

  7. Prove that two 3×33 \times 3 matrices are similar if and only if they have the same characteristic and same minimal polynomials. Give an explicit counterexample to this assertion for 4×44 \times 4 matrices.

  8. 证明两个 3×33 \times 3 矩阵相似当且仅当它们具有相同的特征多项式和相同的最小多项式。为 4×44 \times 4 矩阵给出一个反例来反驳这一断言。

  9. Prove directly from the fact that the collection of all linear transformations of an nn dimensional vector space VV over FF to itself form a vector space over FF of dimension n2{n}^{2} that the minimal polynomial of a linear transformation TT has degree at most n2{n}^{2} .

  10. 直接从所有 nn 维向量空间 VV 到其自身的线性变换集合在 FF 上构成一个维度为 n2{n}^{2} 的向量空间这一事实出发,证明线性变换 TT 的最小多项式的次数最多为 n2{n}^{2}

  11. Prove that the constant term in the characteristic polynomial of the n×nn \times n matrix AA is (1)ndetA{\left( -1\right) }^{n}\det A and that the coefficient of xn1{x}^{n - 1} is the negative of the sum of the diagonal entries of AA (the sum of the diagonal entries of AA is called the trace of AA ). Prove that det AA is the product of the eigenvalues of AA and that the trace of AA is the sum of the eigenvalues of AA .

  12. 证明 n×nn \times n 矩阵 AA 的特征多项式中的常数项是 (1)ndetA{\left( -1\right) }^{n}\det A,并且 xn1{x}^{n - 1} 的系数是 AA 对角线元素之和的相反数(AA 对角线元素之和称为 AA 的迹)。证明 det AAAA 特征值的乘积,并且 AA 的迹是 AA 特征值之和。

  13. Determine the eigenvalues of the matrix

  14. 确定矩阵的特征值

(0100001000011000).\left( \begin{array}{llll} 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 1 & 0 & 0 & 0 \end{array}\right) .
  1. Verify that the characteristic polynomial of the companion matrix

  2. 验证伴随矩阵的特征多项式

(0000a01000a10100a20001an1)\left( \begin{matrix} 0 & 0 & 0 & \ldots & 0 & - {a}_{0} \\ 1 & 0 & 0 & \ldots & 0 & - {a}_{1} \\ 0 & 1 & 0 & \ldots & 0 & - {a}_{2} \\ \vdots & \vdots & \vdots & & \vdots & \vdots \\ 0 & 0 & 0 & \ldots & 1 & - {a}_{n - 1} \end{matrix}\right)

is

xn+an1xn1++a1x+a0.{x}^{n} + {a}_{n - 1}{x}^{n - 1} + \cdots + {a}_{1}x + {a}_{0}.
  1. Find the rational canonical forms of

  2. 找出以下矩阵的有理标准形

(011000100),  (c010c111c)   and   (422465153042046315308409303260140155512).\left( \begin{array}{rrr} 0 & - 1 & - 1 \\ 0 & 0 & 0 \\ - 1 & 0 & 0 \end{array}\right) ,\;\left( \begin{array}{rrr} c & 0 & - 1 \\ 0 & c & 1 \\ - 1 & 1 & c \end{array}\right) \;\text{ and }\;\left( \begin{array}{rrrr} {422} & {465} & {15} & - {30} \\ - {420} & - {463} & - {15} & {30} \\ {840} & {930} & {32} & - {60} \\ - {140} & - {155} & - 5 & {12} \end{array}\right) .
  1. Find all similarity classes of 6×66 \times 6 matrices over Q\mathbb{Q} with minimal polynomial (x+2)2(x1){\left( x + 2\right) }^{2}\left( {x - 1}\right) (it suffices to give all lists of invariant factors and write out some of their corresponding matrices).

  2. 找出所有在 Q\mathbb{Q} 上具有最小多项式 (x+2)2(x1){\left( x + 2\right) }^{2}\left( {x - 1}\right)6×66 \times 6 矩阵的相似类(给出所有不变因子的列表并写出它们对应的某些矩阵即可)。

  3. Find all similarity classes of 6×66 \times 6 matrices over C\mathbb{C} with characteristic polynomial (x41)(x21)\left( {{x}^{4} - 1}\right) \left( {{x}^{2} - 1}\right) .

  4. 找出所有在 6×66 \times 6 上具有特征多项式 (x41)(x21)\left( {{x}^{4} - 1}\right) \left( {{x}^{2} - 1}\right)C\mathbb{C} 矩阵的相似类。

  5. Find all similarity classes of 3×33 \times 3 matrices AA over F2{\mathbb{F}}_{2} satisfying A6=I{A}^{6} = I (compare with the answer we computed over Q\mathbb{Q} ). Do the same for 4×44 \times 4 matrices BB satisfying B20=I{B}^{20} = I .

  6. 找出所有在 F2{\mathbb{F}}_{2} 上满足 A6=I{A}^{6} = I3×33 \times 3 矩阵的相似类(与我们在 Q\mathbb{Q} 上计算出的答案进行比较)。对于满足 B20=I{B}^{20} = I4×44 \times 4 矩阵 BB 也做同样的事情。

  7. Prove that the number of similarity classes of 3×33 \times 3 matrices over Q\mathbb{Q} with a given characteristic polynomial in Q[x]\mathbb{Q}\left\lbrack x\right\rbrack is the same as the number of similarity classes over any extension field of Q\mathbb{Q} . Give an example to show that this is not true in general for 4×44 \times 4 matrices.

  8. 证明在 Q\mathbb{Q} 上具有给定特征多项式的 3×33 \times 3 矩阵的相似类数量与任何 Q\mathbb{Q} 的扩展域上的相似类数量相同。举一个例子说明这对于 4×44 \times 4 矩阵来说在一般情况下并不成立。

  9. Determine all possible rational canonical forms for a linear transformation with characteristic polynomial x2(x2+1)2{x}^{2}{\left( {x}^{2} + 1\right) }^{2} .

  10. 确定具有特征多项式 x2(x2+1)2{x}^{2}{\left( {x}^{2} + 1\right) }^{2} 的线性变换的所有可能的有理标准形。

  11. Determine up to similarity all 2×22 \times 2 rational matrices (i.e., M2(Q)\in {M}_{2}\left( \mathbb{Q}\right) ) of precise order 4 (multiplicatively,of course). Do the same if the matrix has entries from C\mathbb{C} .

  12. 确定 2×22 \times 2 有理矩阵(即 M2(Q)\in {M}_{2}\left( \mathbb{Q}\right) )的所有相似类,其精确阶数为4(当然是指乘法意义下的阶数)。如果矩阵的元素来自 C\mathbb{C} ,也做同样的事情。

  13. Show that x51=(x1)(x24x+1)(x2+5x+1){x}^{5} - 1 = \left( {x - 1}\right) \left( {{x}^{2} - {4x} + 1}\right) \left( {{x}^{2} + {5x} + 1}\right) in F19[x]{\mathbb{F}}_{19}\left\lbrack x\right\rbrack . Use this to determine up to similarity all 2×22 \times 2 matrices with entries from F19{\mathbb{F}}_{19} of (multiplicative) order 5 .

  14. 证明 x51=(x1)(x24x+1)(x2+5x+1){x}^{5} - 1 = \left( {x - 1}\right) \left( {{x}^{2} - {4x} + 1}\right) \left( {{x}^{2} + {5x} + 1}\right)F19[x]{\mathbb{F}}_{19}\left\lbrack x\right\rbrack 中。利用这一点确定所有在 F19{\mathbb{F}}_{19} 中具有(乘法)阶数为5的 2×22 \times 2 矩阵的相似类。

  15. Determine representatives for the conjugacy classes for GL3(F2)G{L}_{3}\left( {\mathbb{F}}_{2}\right) . [Compare your answer with Theorem 15 and Proposition 14 of Chapter 6.]

  16. 确定 GL3(F2)G{L}_{3}\left( {\mathbb{F}}_{2}\right) 的共轭类的代表。将你的答案与第6章的定理15和命题14进行比较。]

  17. Let VV be a finite dimensional vector space over Q\mathbb{Q} and suppose TT is a nonsingular linear transformation of VV such that T1=T2+T{T}^{-1} = {T}^{2} + T . Prove that the dimension of VV is divisible by 3. If the dimension of VV is precisely 3 prove that all such transformations TT are similar.

  18. VVQ\mathbb{Q} 上的有限维向量空间,假设 TTVV 的一个非奇异线性变换,且满足 T1=T2+T{T}^{-1} = {T}^{2} + T 。证明 VV 的维数能被3整除。如果 VV 的维数恰好为3,证明所有这样的变换 TT 都是相似的。

  19. Let VV be the infinite dimensional real vector space

  20. VV 是无限维实向量空间

R={(a0,a1,a2,)a0,a1,a2,R}.{\mathbb{R}}^{\infty } = \left\{ {\left( {{a}_{0},{a}_{1},{a}_{2},\ldots }\right) \mid {a}_{0},{a}_{1},{a}_{2},\cdots \in \mathbb{R}}\right\} .

Define the map T:VVT : V \rightarrow V by T(a0,a1,a2,)=(0,a0,a1,a2,)T\left( {{a}_{0},{a}_{1},{a}_{2},\ldots }\right) = \left( {0,{a}_{0},{a}_{1},{a}_{2},\ldots }\right) . Prove that TT has no eigenvectors.

通过 T(a0,a1,a2,)=(0,a0,a1,a2,)T\left( {{a}_{0},{a}_{1},{a}_{2},\ldots }\right) = \left( {0,{a}_{0},{a}_{1},{a}_{2},\ldots }\right) 定义映射 T:VVT : V \rightarrow V 。证明 TT 没有特征向量。

  1. Let \ell be a prime and let Φ(x)=x1x1=x1+x2++x+1Z[x]{\Phi }_{\ell }\left( x\right) = \frac{{x}^{\ell } - 1}{x - 1} = {x}^{\ell - 1} + {x}^{\ell - 2} + \ldots + x + 1 \in \mathbb{Z}\left\lbrack x\right\rbrack be the th{\ell }^{\mathrm{{th}}} cyclotomic polynomial,which is irreducible over Q\mathbb{Q} (Example 4 following Corollary 9.14). This exercise determines the smallest degree of a factor of Φ(x){\Phi }_{\ell }\left( x\right) modulo pp for any prime pp and so in particular determines when Φ(x){\Phi }_{\ell }\left( x\right) is irreducible modulo pp . (This actually determines the complete factorization of Φ(x){\Phi }_{\ell }\left( x\right) modulo pp - cf. Exercise 8 of Section 13.6.)

  2. \ell 为一个质数,Φ(x)=x1x1=x1+x2++x+1Z[x]{\Phi }_{\ell }\left( x\right) = \frac{{x}^{\ell } - 1}{x - 1} = {x}^{\ell - 1} + {x}^{\ell - 2} + \ldots + x + 1 \in \mathbb{Z}\left\lbrack x\right\rbrackth{\ell }^{\mathrm{{th}}} 的原根多项式,它在 Q\mathbb{Q} 上是不可约的(例 4,紧随推论 9.14)。这个练习确定了对于任意质数 ppΦ(x){\Phi }_{\ell }\left( x\right)pp 的因子的最小次数,特别地,确定了 Φ(x){\Phi }_{\ell }\left( x\right) 在模 pp 时的不可约性。 (这实际上确定了 Φ(x){\Phi }_{\ell }\left( x\right)pp 的完全分解 - 参见第 13.6 节练习 8。)

(a) Show that if p=p = \ell then Φ(x){\Phi }_{\ell }\left( x\right) is divisible by x1x - 1 in F[x]{\mathbb{F}}_{\ell }\left\lbrack x\right\rbrack .

(a) 证明如果 p=p = \ell ,那么在 F[x]{\mathbb{F}}_{\ell }\left\lbrack x\right\rbrackΦ(x){\Phi }_{\ell }\left( x\right) 可被 x1x - 1 整除。

(b) Suppose pp \neq \ell and let ff denote the order of pp in F×{\mathbb{F}}_{\ell }^{ \times } ,i.e., ff is the smallest power of pp with pf1  mod  {p}^{f} \equiv 1{\;\operatorname{mod}\;\ell } . Show that m=fm = f is the first value of mm for which the group GLm(Fp)G{L}_{m}\left( {\mathbb{F}}_{p}\right) contains an element AA of order \ell . [Use the formula for the order of this group at the end of Section 11.1.]

(b) 假设 pp \neq \ell 并且设 ff 表示 ppF×{\mathbb{F}}_{\ell }^{ \times } 中的阶,即 ffpp 的最小幂使得 pf1  mod  {p}^{f} \equiv 1{\;\operatorname{mod}\;\ell } 。证明 m=fm = f 是使得群 GLm(Fp)G{L}_{m}\left( {\mathbb{F}}_{p}\right) 包含一个阶为 \ell 的元素 AA 的第一个 mm 值。 [使用第 11.1 节末尾关于该群阶的公式。]

(c) Show that Φ(x){\Phi }_{\ell }\left( x\right) is not divisible by any polynomial of degree smaller than ff in Fp[x]{\mathbb{F}}_{p}\left\lbrack x\right\rbrack [consider the companion matrix for such a divisor and use (b)]. Let mA(x)Fp[x]{m}_{A}\left( x\right) \in {\mathbb{F}}_{p}\left\lbrack x\right\rbrack denote the minimal polynomial for the matrix AA in (b) and conclude that mA(x){m}_{A}\left( x\right) is irreducible of degree ff and divides Φ(x){\Phi }_{\ell }\left( x\right) in Fp[x]{\mathbb{F}}_{p}\left\lbrack x\right\rbrack .

(c) 证明 Φ(x){\Phi }_{\ell }\left( x\right) 不能被 Fp[x]{\mathbb{F}}_{p}\left\lbrack x\right\rbrack 中任何次数小于 ff 的多项式整除(考虑此类除数的伴随矩阵并使用 (b))。设 mA(x)Fp[x]{m}_{A}\left( x\right) \in {\mathbb{F}}_{p}\left\lbrack x\right\rbrack 表示 (b) 中矩阵 AA 的最小多项式,并得出 mA(x){m}_{A}\left( x\right) 是次数为 ff 的不可约多项式,并且在 Fp[x]{\mathbb{F}}_{p}\left\lbrack x\right\rbrack 中整除 Φ(x){\Phi }_{\ell }\left( x\right)

(d) In particular,prove that Φ(x){\Phi }_{\ell }\left( x\right) is irreducible modulo pp if and only if l1l - 1 is the smallest power of pp which is congruent to 1 modulo \ell ,i.e., pp is a primitive root modulo \ell .

(d) 特别地,证明 Φ(x){\Phi }_{\ell }\left( x\right) 在模 pp 意义下不可约当且仅当 l1l - 1pp 的最小幂,满足模 \ell 同余于1,即 pp 是模 \ell 的一个原根。

  1. Prove that the first two elementary row and column operations described before Theorem 21 do not change the determinant of the matrix and the third elementary operation multiplies the determinant by a unit. Conclude from Theorem 21 that the characteristic polynomial of AA differs by a unit from the product of the invariant factors of AA . Since both these polynomials are monic by definition, conclude that they are equal (this gives an alternate proof of Proposition 20).

  2. 证明定理21之前描述的前两个基本行和列操作不会改变矩阵的行列式,而第三个基本操作将行列式乘以一个单位元。从定理21得出结论,AA 的特征多项式与 AA 的不变因子的乘积相差一个单位元。由于这两个多项式按定义都是首一多项式,因此它们相等(这给出了命题20的另一种证明)。

The following exercises outline the proof of Theorem 21. They carry out explicitly the construction described in Exercises 16 to 19 of the previous section for the Euclidean Domain F[x]F\left\lbrack x\right\rbrack . Let VV be an nn -dimensional vector space with basis v1,v2,,vn{v}_{1},{v}_{2},\ldots ,{v}_{n} and let TT be the linear transformation of VV defined by the matrix AA and this choice of basis,i.e., TT is the linear transformation with

以下练习概述了定理21的证明。它们具体执行了上一节练习16至19中描述的构造,针对欧几里得域 F[x]F\left\lbrack x\right\rbrack 。设 VV 是一个 nn 维向量空间,其基为 v1,v2,,vn{v}_{1},{v}_{2},\ldots ,{v}_{n} ,并设 TT 是由矩阵 AA 和这个基定义的 VV 上的线性变换,即 TT 是具有

T(vj)=i=1naijvi,  j=1,2,,nT\left( {v}_{j}\right) = \mathop{\sum }\limits_{{i = 1}}^{n}{a}_{ij}{v}_{i},\;j = 1,2,\ldots ,n

where A=(aij)A = \left( {a}_{ij}\right) . Let F[x]nF{\left\lbrack x\right\rbrack }^{n} be the free module of rank nn over F[x]F\left\lbrack x\right\rbrack and let ξ1,ξ2,,ξn{\xi }_{1},{\xi }_{2},\ldots ,{\xi }_{n} denote a basis. Then we have a natural surjective F[x]F\left\lbrack x\right\rbrack -module homomorphism

其中 A=(aij)A = \left( {a}_{ij}\right) 。设 F[x]nF{\left\lbrack x\right\rbrack }^{n} 是秩为 nn 的自由模,且 F[x]F\left\lbrack x\right\rbrack 上的 ξ1,ξ2,,ξn{\xi }_{1},{\xi }_{2},\ldots ,{\xi }_{n} 表示一个基。那么我们有一个自然的满射 F[x]F\left\lbrack x\right\rbrack -模同态。

φ:F[x]nV\varphi : F{\left\lbrack x\right\rbrack }^{n} \rightarrow V

defined by mapping ξi{\xi }_{i} to vi,i=1,2,,n{v}_{i},i = 1,2,\ldots ,n . As indicated in the exercises of the previous section the invariant factors for the F[x]F\left\lbrack x\right\rbrack -module VV can be determined once we have determined a set of generators and the corresponding relations matrix for kerφ\ker \varphi . Since by definition xx acts on VV by the linear transformation TT ,we have

通过映射 ξi{\xi }_{i}vi,i=1,2,,n{v}_{i},i = 1,2,\ldots ,n 定义。如前一部分练习所示,一旦我们确定了生成集和相应的 kerφ\ker \varphi 关系矩阵,就可以确定 F[x]F\left\lbrack x\right\rbrack -模 VV 的不变因子。由于根据定义 xx 通过线性变换 TT 作用于 VV,因此我们有

x(vj)=i=1naijvi,  j=1,2,,n.x\left( {v}_{j}\right) = \mathop{\sum }\limits_{{i = 1}}^{n}{a}_{ij}{v}_{i},\;j = 1,2,\ldots ,n.
  1. Show that the elements

  2. 证明元素

νj=a1jξ1aj1  jξj1+(xajj)ξjaj+1  jξj+1anjξn{\nu }_{j} = - {a}_{1j}{\xi }_{1} - \cdots - {a}_{j - 1\;j}{\xi }_{j - 1} + \left( {x - {a}_{jj}}\right) {\xi }_{j} - {a}_{j + 1\;j}{\xi }_{j + 1} - \cdots - {a}_{nj}{\xi }_{n}

for j=1,2,,nj = 1,2,\ldots ,n are elements of the kernel of φ\varphi .

对于 j=1,2,,nj = 1,2,\ldots ,nφ\varphi 的核中的元素。

  1. (a) Show that xξj=vj+fjx{\xi }_{j} = {v}_{j} + {f}_{j} where fjFξ1++Fξn{f}_{j} \in F{\xi }_{1} + \cdots + F{\xi }_{n} is an element in the FF -vector space spanned by ξ1,,ξn{\xi }_{1},\ldots ,{\xi }_{n} .

  2. (a) 证明 xξj=vj+fjx{\xi }_{j} = {v}_{j} + {f}_{j} 其中 fjFξ1++Fξn{f}_{j} \in F{\xi }_{1} + \cdots + F{\xi }_{n} 是由 ξ1,,ξn{\xi }_{1},\ldots ,{\xi }_{n} 生成的 FF -向量空间中的元素。

(b) Show that

(b) 证明

F[x]ξ1++F[x]ξn=(F[x]v1++F[x]vn)+(Fξ1++Fξn).F\left\lbrack x\right\rbrack {\xi }_{1} + \cdots + F\left\lbrack x\right\rbrack {\xi }_{n} = \left( {F\left\lbrack x\right\rbrack {v}_{1} + \cdots + F\left\lbrack x\right\rbrack {v}_{n}}\right) + \left( {F{\xi }_{1} + \cdots + F{\xi }_{n}}\right) .
  1. Show that v1,v2,,vn{v}_{1},{v}_{2},\ldots ,{v}_{n} generate the kernel of φ\varphi . [Use the previous result to show that any element of kerφ\ker \varphi is the sum of an element in the module generated by ν1,ν2,,νn{\nu }_{1},{\nu }_{2},\ldots ,{\nu }_{n} and an element of the form b1ξ1++bnξn{b}_{1}{\xi }_{1} + \cdots + {b}_{n}{\xi }_{n} where the bi{b}_{i} are elements of FF . Then show that such an element is in kerφ\ker \varphi if and only if all the bi{b}_{i} are 0 since v1,,vn{v}_{1},\ldots ,{v}_{n} are a basis for VV over FF .]

  2. 证明 v1,v2,,vn{v}_{1},{v}_{2},\ldots ,{v}_{n} 生成 φ\varphi 的核。 [使用前面的结果证明 kerφ\ker \varphi 的任何元素都是 ν1,ν2,,νn{\nu }_{1},{\nu }_{2},\ldots ,{\nu }_{n} 生成的模中的元素和一个形式为 b1ξ1++bnξn{b}_{1}{\xi }_{1} + \cdots + {b}_{n}{\xi }_{n} 的元素的和,其中 bi{b}_{i}FF 的元素。然后证明如果且仅当所有 bi{b}_{i} 都为0时,这样的元素在 kerφ\ker \varphi 中,因为 v1,,vn{v}_{1},\ldots ,{v}_{n}VVFF 上的基。]

  3. Show that the generators v1,v2,,vn{v}_{1},{v}_{2},\ldots ,{v}_{n} of kerφ\ker \varphi have corresponding relations matrix

  4. 证明 kerφ\ker \varphi 的生成元 v1,v2,,vn{v}_{1},{v}_{2},\ldots ,{v}_{n} 对应的关系矩阵为

(xa11a21an1a12xa22an2a1na2nxann)=xIAt,\left( \begin{matrix} x - {a}_{11} & - {a}_{21} & \ldots & - {a}_{n1} \\ - {a}_{12} & x - {a}_{22} & \ldots & - {a}_{n2} \\ \vdots & \vdots & \ddots & \vdots \\ - {a}_{1n} & - {a}_{2n} & \ldots & x - {a}_{nn} \end{matrix}\right) = {xI} - {A}^{t},

where At{A}^{t} is the transpose of AA . Conclude that Theorem 21 and the algorithm for determining the invariant factors of AA follows by Exercises 16 to 19 in the previous section (note that the row and column operations necessary to diagonalize this relations matrix are the column and row operations necessary to diagonalize the matrix in Theorem 21, which explains why the invariant factor algorithm keeps track of the row operations used).

其中 At{A}^{t}AA 的转置。得出结论,定理21以及确定 AA 不变因子的算法可以通过前一部分的练习16到19得出(注意,对角化此关系矩阵所需的行和列操作是对角化定理21中矩阵所需的列和行操作,这解释了为什么不变因子算法需要跟踪使用的行操作)。