We now apply our results on finitely generated modules in the special case where the P.I.D. is the ring F[x] of polynomials in x with coefficients in a field F .
令 V 是一个定义在 F 上的有限维向量空间,其维度为 n,并且令 T 是 V 的一个固定线性变换(即从 V 到其自身的变换)。正如我们在第10章中看到的,我们可以将 V 视为一个 F[x] -模,其中元素 x 作用于 V 的方式是线性变换 T(因此任何 x 的多项式在 V 上的作用都是相同的 T 的多项式)。由于假设 V 在 F 上具有有限维度,因此按照定义,它作为 F -模是有限生成的,因此也必然作为 F[x] -模是有限生成的,所以上一节的分类定理适用。
任何非零自由 F[x] -模(同构于 F[x] 的副本的直接和)是 F 上的无穷维向量空间,所以如果 V 在 F 上有有限维数,那么它实际上必定是一个扭 F[x] -模(即,它的自由秩为 0)。由基本定理可知,V 作为 F[x] -模同构于循环、扭 F[x] -模的直接和。我们将看到这种 V 的分解将允许我们选择一个基,相对于该基,线性变换 T 的矩阵表示形式是特定的简单形式。当我们使用 V 的不变因子分解时,我们得到矩阵 T 的有理标准形,我们在本节中分析它。当我们使用初等因子分解(并且 F 包含 T 的所有特征值时),我们得到约当标准形,这在下一节中讨论,并且之前提到过,它是尽可能接近对角矩阵的 T 的矩阵表示。基本定理的唯一性部分确保了有理标准形和约当标准形的唯一性(这就是为什么它们被称为标准形)。
One important use of these canonical forms is to classify the distinct linear transformations of V . In particular they allow us to determine when two matrices represent the same linear transformation,i.e.,when two given n×n matrices are similar.
这些标准形的一个重要应用是分类 V 的不同线性变换。特别是它们允许我们确定两个矩阵是否代表相同的线性变换,即两个给定的 n×n 矩阵何时相似。
Note that this will be another instance where the structure of the space being acted upon (the invariant factor decomposition of V for example) is used to obtain significant information on the algebraic objects (in this case the linear transformations) which are acting. This will be considered in the case of groups acting on vector spaces in Chapter 18 (and goes under the name of Representation Theory of Groups).
注意,这将是另一个例子,其中一个作用于其上的空间的结构(例如 V 的不变因子分解)被用来获取关于代数对象(在这种情况下是线性变换)的显著信息。这将在第18章中考虑群作用于向量空间的情形(并以群表示论的名字出现)。
Before describing the rational canonical form in detail we first introduce some linear algebra.
在详细描述有理标准形之前,我们首先介绍一些线性代数知识。
Definition.
定义。
(1) An element λ of F is called an eigenvalue of the linear transformation T if there is a nonzero vector v∈V such that T(v)=λv . In this situation v is called an eigenvector of T with corresponding eigenvalue λ .
(1)如果存在一个非零向量 v∈V 使得 T(v)=λv ,则称 F 中的元素 λ 为线性变换 T 的特征值。在这种情况下,v 被称为对应于特征值 λ 的 T 的特征向量。
(2) If A is an n×n matrix with coefficients in F ,an element λ is called an eigenvalue of A with corresponding eigenvector v if v is a nonzero n×1 column vector such that Av=λv .
(2)如果 A 是一个在 F 中具有系数的 n×n 矩阵,那么如果 v 是一个非零的 n×1 列向量,且满足 Av=λv ,则称元素 λ 是 A 的特征值,对应的特征向量为 v。
(3) If λ is an eigenvalue of the linear transformation T ,the set {v∈V∣T(v)=λv} is called the eigenspace of T corresponding to the eigenvalue λ . Similarly,if λ is an eigenvalue of the n×n matrix A ,the set of n×1 matrices v with Av=λv is called the eigenspace of A corresponding to the eigenvalue λ .
(3)如果 λ 是线性变换 T 的特征值,那么集合 {v∈V∣T(v)=λv} 被称为对应于特征值 λ 的 T 的特征空间。类似地,如果 λ 是 n×n 矩阵 A 的特征值,那么满足 Av=λv 的 n×1 矩阵的集合 v 被称为对应于特征值 λ 的 A 的特征空间。
Note that if we fix a basis B of V then any linear transformation T of V has an associated n×n matrix A . Conversely,if A is any n×n matrix then the map T defined by T(v)=Av for v∈V ,where the v on the right is the n×1 vector consisting of the coordinates of v with respect to the fixed basis B of V ,is a linear transformation of V . Then v is an eigenvector of T with corresponding eigenvalue λ if and only if the coordinate vector of v with respect to B is an eigenvector of A with eigenvalue λ . In other words,the eigenvalues for the linear transformation T are the same as the eigenvalues for the matrix A of T with respect to any fixed basis for V .
请注意,如果我们固定 B 的一个基 V,那么任何 V 的线性变换 T 都有一个关联的 n×n 矩阵 A。反之,如果 A 是任意的 n×n 矩阵,那么由 T(v)=Av 定义的映射 T 对于 v∈V,其中右侧的 v 是相对于固定基 B 的 V 中 v 坐标组成的 n×1 向量,是一个 V 的线性变换。当且仅当相对于 B 的 v 坐标向量是 A 的特征向量,对应的特征值为 λ 时,v 是 T 的特征向量。换句话说,线性变换 T 的特征值与任何固定基对于 V 的 A 矩阵的特征值相同。
Definition. The determinant of a linear transformation from V to V is the determinant of any matrix representing the linear transformation (note that this does not depend on the choice of the basis used).
定义。从 V 到 V 的线性变换的行列式是表示该线性变换的任意矩阵的行列式(注意,这并不依赖于所使用的基的选择)。
Proposition 12. The following are equivalent:
命题 12。以下等价:
(1) λ is an eigenvalue of T
(1) λ 是 T 的特征值
(2) λI−T is a singular linear transformation of V
(2) λI−T 是 V 的奇异线性变换
(3) det(λI−T)=0 .
(3) det(λI−T)=0。
Proof: Since λ is an eigenvalue of T with corresponding eigenvector v if and only if v is a nonzero vector in the kernel of λI−T ,it follows that (1) and (2) are equivalent. (2) and (3) are equivalent by our results on determinants.
证明:由于 λ 是 T 的特征值,对应的特征向量为 v,当且仅当 v 是 λI−T 核中的非零向量,因此 (1) 和 (2) 等价。根据我们对行列式的研究结果,(2) 和 (3) 也是等价的。
Definition. Let x be an indeterminate over F . The polynomial det(xI−T) is called the characteristic polynomial of T and will be denoted cT(x) . If A is an n×n matrix with coefficients in F,det(xI−A) is called the characteristic polynomial of A and will be denoted cA(x) .
定义。设 x 是 F 上的一个不定元。多项式 det(xI−T) 被称为 T 的特征多项式,并表示为 cT(x)。如果 A 是一个在 F,det(xI−A) 中具有系数的 n×n 矩阵,则该矩阵的特征多项式被称为 A 的特征多项式,并表示为 cA(x)。
It is easy to see by expanding the determinant that the characteristic polynomial of either T or A is a monic polynomial of degree n=dimV . Proposition 12 says that the set of eigenvalues of T (or A ) is precisely the set of roots of the characteristic polynomial of T (of A ,respectively). In particular, T has at most n distinct eigenvalues.
通过展开行列式可以容易地看出,T 或 A 的特征多项式是次数为 n=dimV 的单变量多项式。命题 12 说明 T(或 A)的特征值集合恰好是 T(或分别的 A)特征多项式的根集合。特别是,T 最多有 n 个不同的特征值。
We have seen that V considered as a module over F[x] via the linear transformation T is a torsion F[x] -module. Let m(x)∈F[x] be the unique monic polynomial generating the annihilator of V in F[x] . Equivalently, m(x) is the unique monic polynomial of minimal degree annihilating V (i.e.,such that m(T) is the 0 linear transformation),and if f(x)∈F[x] is any polynomial annihilating V,m(x) divides f(x) . Since the ring of all n×n matrices over F is isomorphic to the collection of all linear transformations of V to itself (an isomorphism is obtained by choosing a basis for V ),it follows that for any n×n matrix A over F there is similarly a unique monic polynomial of minimal degree with m(A) the zero matrix.
我们已经看到,通过线性变换 T 将 V 视为 F[x] 的模是一个扭模。设 m(x)∈F[x] 是唯一生成 V 在 F[x] 中消灭子的单变量多项式。等价地,m(x) 是唯一消灭 V 的最小次数的单变量多项式(即,使得 m(T) 是零线性变换),如果 f(x)∈F[x] 是任何消灭 V,m(x) 的多项式,则 V,m(x) 除 f(x)。由于所有 n×n 矩阵在 F 上的环同构于所有将 V 映射到自身的线性变换的集合(通过为 V 选择一个基可以得到同构),因此对于任何在 F 上的 n×n 矩阵 A,也存在一个具有 m(A) 零矩阵的唯一最小次数的单变量多项式。
Definition. The unique monic polynomial which generates the ideal Ann(V) in F[x] is called the minimal polynomial of T and will be denoted mT(x) . The unique monic polynomial of smallest degree which when evaluated at the matrix A is the zero matrix is called the minimal polynomial of A and will be denoted mA(x) .
定义。生成理想 Ann(V) 在 F[x] 中的唯一单变量多项式称为 T 的最小多项式,并表示为 mT(x) 。当在矩阵 A 上求值时,唯一的最小次数的单变量多项式,其结果为零矩阵,称为 A 的最小多项式,并表示为 mA(x) 。
It is easy to see (cf. Exercise 5) that the degrees of these minimal polynomials are at most n2 where n is the dimension of V . We shall shortly prove that the minimal polynomial for T is a divisor of the characteristic polynomial for T (this is the Cayley-Hamilton Theorem),and similarly for A ,so in fact the degrees of these polynomials are at most n .
很容易看出(参见练习5),这些最小多项式的次数最多为 n2,其中 n 是 V 的维度。我们很快将证明 T 的最小多项式是 T 的特征多项式的因子(这是凯莱-哈密顿定理),对于 A 也是如此,因此实际上这些多项式的次数最多为 n。
We now describe the rational canonical form of the linear transformation T (respectively,of the n×n matrix A ). By Theorem 5 we have an isomorphism
These invariant factors ai(x) are only determined up to a unit in F[x] but since the units of F[x] are precisely the nonzero elements of F (i.e., the nonzero constant polynomials), we may make these polynomials unique by stipulating that they be monic.
这些不变因子 ai(x) 只在 F[x] 中的单位上确定,但由于 F[x] 的单位正是 F 的非零元素(即非零常数多项式),我们可以通过指定它们为单变量多项式来使这些多项式唯一。
Since the annihilator of V is the ideal (am(x)) (part (3) of Theorem 5),we immediately obtain:
由于 V 的消灭子是理想 (am(x))(定理5的第三部分),我们立即得到:
Proposition 13. The minimal polynomial mT(x) is the largest invariant factor of V . All the invariant factors of V divide mT(x) .
命题13。最小多项式 mT(x) 是 V 的最大不变因子。 V 的所有不变因子都整除 mT(x) 。
We shall see below how to calculate not only the minimal polynomial for T but also the other invariant factors.
我们将在下面看到如何计算 T 的最小多项式,以及其他不变因子。
We now choose a basis for each of the direct summands for V in the decomposition (1)above for which the matrix for T is quite simple. Recall that the linear transformation T acting on the left side of (1) is the element x acting by multiplication on each of the factors on the right side of the isomorphism in (1).
我们现在为 V 的每个直和项选择一个基,使得在分解 (1) 中的矩阵 T 非常简单。回想一下,线性变换 T 作用于 (1) 左侧的元素 x 通过乘法作用于同构 (1) 右侧的每个因子。
We have seen in the example following Proposition 1 of Chapter 11 that the elements 1,xˉ,xˉ2,…,xˉk−1 give a basis for the vector space F[x]/(a(x)) where a(x)=xk+bk−1xk−1+⋯+b1x+b0 is any monic polynomial in F[x] and xˉ=xmod(a(x)) . With respect to this basis the linear transformation of multiplication by x acts in a simple manner:
where the last equality is because xˉk+bk−1xˉk−1+⋯+b1xˉ+b0=0 since a(xˉ)=0 in F[x]/(a(x)) . With respect to this basis,the matrix formultiplication by x is therefore
其中最后一个等式是因为 xˉk+bk−1xˉk−1+⋯+b1xˉ+b0=0 ,由于 a(xˉ)=0 在 F[x]/(a(x)) 中。对于这个基,乘以 x 的矩阵形式因此是
Definition. Let a(x)=xk+bk−1xk−1+⋯+b1x+b0 be any monic polynomial in F[x] . The companion matrix of a(x) is the k×k matrix with 1’s down the first subdiagonal, −b0,−b1,…,−bk−1 down the last column and zeros elsewhere. The companion matrix of a(x) will be denoted by Ca(x) .
We apply this to each of the cyclic modules on the right side of (1) above and let Bi be the elements of V corresponding to the basis chosen above for the cyclic factor F[x]/(ai(x)) under the isomorphism in (1). Then by definition the linear transformation T acts on Bi by the companion matrix for ai(x) since we have seen that this is how multiplication by x acts. The union B of the Bi ’s gives a basis for V since the sum on the right of (1) is direct and with respect to this basis the linear transformation T has as matrix the direct sum of the companion matrices for the invariant factors, i.e.,
我们将此应用于上述(1)右侧的每个循环模块,并让 Bi 为 V 中与上述选择的循环因子 F[x]/(ai(x)) 在同构(1)下对应的元素。根据定义,线性变换 T 通过 ai(x) 的伴随矩阵作用于 Bi,因为我们已经看到这是乘以 x 的作用方式。所有 Bi 的并集给出了 V 的一个基,因为(1)右侧的和是直接的,并且对于这个基,线性变换 T 的矩阵是伴随矩阵的直接和,即,
Ca1(x)Ca2(x)⋱Cam(x).
(12.2)
Notice that this matrix is uniquely determined from the invariant factors of the F[x] - module V and,by Theorem 9,the list of invariant factors uniquely determines the module V up to isomorphism as an F[x] -module.
注意,这个矩阵由 F[x] - 模块的不变因子唯一确定,并且,根据定理9,不变因子的列表唯一地确定了作为 F[x] - 模块的 V 的同构。
Definition.
定义。
(1) A matrix is said to be in rational canonical form if it is the direct sum of companion matrices for monic polynomials a1(x),…,am(x) of degree at least one with a1(x)∣a2(x)∣⋯∣am(x) . The polynomials ai(x) are called the invariant factors of the matrix. Such a matrix is also said to be a block diagonal matrix with blocks the companion matrices for the ai(x) .
(2) A rational canonical form for a linear transformation T is a matrix representing T which is in rational canonical form.
(2)一个线性变换 T 的有理标准形是一个表示 T 的矩阵,且该矩阵处于有理标准形。
We have seen that any linear transformation T has a rational canonical form. We now see that this rational canonical form is unique (hence is called the rational canonical form for T ). To see this note that the process we used to determine the matrix of T from the direct sum decomposition is reversible. Suppose b1(x),b2(x),…,bt(x) are monic polynomials in F[x] of degree at least one such that bi(x)∣bi+1(x) for all i and suppose for some basis E of V ,that the matrix of T with respect to the basis E is the direct sum of the companion matrices of the bi(x) . Then V must be a direct sum of T -stable subspaces Di ,one for each bi(x) in such a way that the matrix of T on each Di is the companion matrix of bi(x) . Let Ei be the corresponding (ordered) basis of Di (so E is the union of the Ei ) and let ei be the first basis element in Ei . Then it is easy to see that Di is a cyclic F[x] -module with generator ei and that the annihilator of Di is bi(x) . Thus the torsion F[x] -module V decomposes into a direct sum of cyclic F[x] -modules in two ways, both of which satisfy the conditions of Theorem 5, i.e., both of which give lists of invariant factors. Since the invariant factors are unique by Theorem 9, ai(x) and bi(x) must differ by a unit factor in F[x] and since the polynomials are monic by assumption,we must have ai(x)=bi(x) for all i . This proves the following result:
我们已经看到,任何线性变换 T 都有一个有理标准形。现在我们知道这个有理标准形是唯一的(因此被称为 T 的有理标准形)。为了证明这一点,请注意我们用来从直接和分解确定 T 的矩阵的过程是可逆的。假设 b1(x),b2(x),…,bt(x) 是 F[x] 中的首一多项式,且次数至少为一个,使得对于所有 i 都有 bi(x)∣bi+1(x),并且假设对于某个 V 的基 E,T 关于基 E 的矩阵是 bi(x) 的伴随矩阵的直接和。那么 V 必须是 T -稳定子空间的直接和,每个 bi(x) 有一个,使得 T 在每个 Di 上的矩阵是 bi(x) 的伴随矩阵。设 Ei 是 Di 的相应(有序)基(因此 E 是 Ei 的并集),设 ei 是 Ei 中的第一个基元素。那么很容易看出 Di 是一个循环 F[x] -模,生成元为 ei,并且 Di 的消元子为 bi(x)。因此,挠 F[x] -模 V 以两种方式分解为循环 F[x] -模的直接和,这两种方式都满足定理 5 的条件,即都给出不变因子的列表。由于定理 9 证明了不变因子是唯一的,ai(x) 和 bi(x) 必须在 F[x] 中相差一个单位因子,并且由于假设多项式是首一的,我们必须对于所有 i 有 ai(x)=bi(x)。这证明了以下结果:
Theorem 14. (Rational Canonical Form for Linear Transformations) Let V be a finite dimensional vector space over the field F and let T be a linear transformation of V .
定理 14.(线性变换的有理标准形)设 V 是定义在域 F 上的有限维向量空间,设 T 是 V 的一个线性变换。
(1) There is a basis for V with respect to which the matrix for T is in rational canonical form, i.e., is a block diagonal matrix whose diagonal blocks are the companion matrices for monic polynomials a1(x),a2(x),…,am(x) of degree at least one with a1(x)∣a2(x)∣⋯∣am(x) .
(1)存在 V 的一个基,对于该基,T 的矩阵是有理标准形,即是一个对角块矩阵,其对角块是首一多项式 a1(x),a2(x),…,am(x) 的伴随矩阵,其次数至少为 1 并且 a1(x)∣a2(x)∣⋯∣am(x)。
(2) The rational canonical form for T is unique.
(2)T 的有理标准形是唯一的。
The use of the word rational is to indicate that this canonical form is calculated entirely within the field F and exists for any linear transformation T . This is not the case for the Jordan canonical form (considered later),which only exists if the field F contains the eigenvalues for T (cf. also the remarks following Corollary 18).
使用“有理”一词是为了表明这种标准形完全在域 F 内计算,并且对于任何线性变换 T 都存在。这与约当标准形(稍后讨论)的情况不同,后者仅在域 F 包含 T 的特征值时存在(参见 also the remarks following Corollary 18)。
The following result translates the notion of similar linear transformations (i.e., the same linear transformation up to a change of basis) into the language of modules and relates this notion to rational canonical forms.
Theorem 15. Let S and T be linear transformations of V . Then the following are equivalent:
定理 15. 设 S 和 T 是 V 的线性变换。那么以下条件是等价的:
(1) S and T are similar linear transformations
(1)S 和 T 是相似的线性变换
(2) the F[x] -modules obtained from V via S and via T are isomorphic F[x] - modules
(2)通过 V 和通过 S 从 F[x] 获得的 F[x] -模块与 T 是同构的 F[x] -模块
(3) S and T have the same rational canonical form.
(3)S 和 T 具有相同的有理标准形。
Proof: [(1) implies (2)] Assume there is a nonsingular linear transformation U such that S=UTU−1 . The vector space isomorphism U:V→V is also an F[x] -module homomorphism,where x acts on the first V via T and on the second via S ,since for example U(xv)=U(Tv)=UT(v)=SU(v)=x(Uv) . Hence this is an F[x] -module isomorphism of the two modules in (2).
证明:[(1) 蕴含 (2)] 假设存在一个非奇异线性变换 U 使得 S=UTU−1 。该向量空间同构 U:V→V 同时也是一个 F[x] -模同态,其中 x 通过 V 作用于第一个分量,通过 T 作用于第二个分量,因为例如 S 。因此,这是 (2) 中两个模的同构。
[(2) implies (3)] Assume (2) holds and denote by V1 the vector space V made into an F[x] -module via S and denote by V2 the space V made into an F[x] -module via T . Since V1≅V2 as F[x] -modules they have the same list of invariant factors. Thus S and T have a common rational canonical form.
[(2) 蕴含 (3)] 假设 (2) 成立,并记 V1 为通过 S 使向量空间 V 成为 F[x] -模,记 V2 为通过 T 使空间 V 成为 F[x] -模。由于 V1≅V2 作为 F[x] -模,它们具有相同的不变因子列表。因此 S 和 T 有一个共同的理性标准形。
[(3) implies (1)] Assume (3) holds. Since S and T have the same matrix representation with respect to some choice of (possibly different) bases of V by assumption, they are,up to a change of basis,the same linear transformation of V ,hence are similar.
[(3) 蕴含 (1)] 假设 (3) 成立。由于根据假设,S 和 T 在某些(可能是不同的)V 基底下的矩阵表示相同,它们在基底变换下是相同的 V 线性变换,因此是相似的。
Let A be any n×n matrix with entries from F . Let V be an n -dimensional vector space over F . Recall we can then define a linear transformation T on V by choosing a basis for V and setting T(v)=Av where v on the right hand side means the n×1 column vector of coordinates of v with respect to our chosen basis (this is just the usual identification of linear transformations with matrices). Then (of course) the matrix for this T with respect to this basis is the given matrix A . Put another way,any n×n matrix A with entries from the field F arises as the matrix for some linear transformation T of an n -dimensional vector space.
设 A 为任意 n×n 矩阵,其元素来自 F 。设 V 为 n 维的 F 上的向量空间。回顾我们可以在 V 上定义一个线性变换 T ,通过为 V 选择一个基并设置 T(v)=Av ,其中 v 右侧的 n×1 表示 v 关于我们选择的基的坐标的 n×1 列向量(这只是线性变换与矩阵的通常识别)。那么(当然)这个基下 T 的矩阵就是给定的矩阵 A 。换句话说,任何来自域 F 的 n×n 矩阵 A 都可以作为某个 n 维向量空间的线性变换 T 的矩阵。
This dictionary between linear transformations of vector spaces and matrices allows us to state our previous two results in the language of matrices:
这个线性变换的向量空间和矩阵之间的字典允许我们用矩阵的语言陈述我们之前得到的结果:
Theorem 16. (Rational Canonical Form for Matrices) Let A be an n×n matrix over the field F .
定理 16。(矩阵的有理标准形)设 A 为 F 上的 n×n 矩阵。
(1) The matrix A is similar to a matrix in rational canonical form,i.e.,there is an invertible n×n matrix P over F such that P−1AP is a block diagonal matrix whose diagonal blocks are the companion matrices for monic polynomials a1(x),a2(x),…,am(x) of degree at least one with a1(x)∣a2(x)∣⋯∣am(x).
(1)矩阵 A 与有理标准形的一个矩阵相似,即存在一个可逆的 n×n 矩阵 P 在 F 上,使得 P−1AP 是一个块对角矩阵,其对角块是单变量多项式 a1(x),a2(x),…,am(x) 的伴随矩阵,其次数至少为一个,并且 a1(x)∣a2(x)∣⋯∣am(x).
(2) The rational canonical form for A is unique.
(2)A 的有理标准形是唯一的。
Definition. The invariant factors of an n×n matrix over a field F are the invariant factors of its rational canonical form.
定义。一个 n×n 矩阵在域 F 上的不变因子是其有理标准形的不变因子。
Theorem 17. Let A and B be n×n matrices over the field F . Then A and B are similar if and only if A and B have the same rational canonical form.
定理 17。设 A 和 B 是定义在域 F 上的 n×n 矩阵。那么 A 和 B 相似当且仅当它们具有相同的理性标准形。
If A is a matrix with entries from a field F and F is a subfield of a larger field K then we may also consider A as a matrix over K . The next result shows that the rational canonical form for A and questions of similarity do not depend on which field contains the entries of A .
如果 A 是一个元素来自域 F 的矩阵,且 F 是更大域 K 的子域,那么我们也可以将 A 视为定义在 K 上的矩阵。下一个结果说明,A 的理性标准形以及相似性问题不依赖于包含 A 元素的域。
Corollary 18. Let A and B be two n×n matrices over a field F and suppose F is a subfield of the field K .
推论 18。设 A 和 B 是定义在域 F 上的两个 n×n 矩阵,并假设 F 是域 K 的子域。
(1) The rational canonical form of A is the same whether it is computed over K or over F . The minimal and characteristic polynomials and the invariant factors of A are the same whether A is considered as a matrix over F or as a matrix over K .
(1)无论在 K 上还是在 F 上计算,A 的理性标准形都是相同的。最小多项式和特征多项式以及 A 的不变因子,无论将 A 视为在 F 上的矩阵还是 K 上的矩阵,都是相同的。
(2) The matrices A and B are similar over K if and only if they are similar over F ,i.e.,there exists an invertible n×n matrix P with entries from K such that B=P−1AP if and only if there exists an (in general different) invertible n×n matrix Q with entries from F such that B=Q−1AQ .
(2)如果矩阵 A 和 B 在 K 上相似,当且仅当它们在 F 上相似,即存在一个可逆的 n×n 矩阵 P 其元素来自 K 使得 B=P−1AP 当且仅当存在一个(通常不同)的可逆的 n×n 矩阵 Q 其元素来自 F 使得 B=Q−1AQ 。
Proof: (1) Let M be the rational canonical form of A when computed over the smaller field F . Since M satisfies the conditions in the definition of the rational canonical form over K ,the uniqueness of the rational canonical form implies that M is also the rational canonical form of A over K . Hence the invariant factors of A are the same whether A is viewed over F or over K . In particular,since the minimal polynomial is the largest invariant factor of A it also does not depend on the field over which A is viewed. It is clear from the determinant definition of the characteristic polynomial of A that this polynomial depends only on the entries of A (we shall see shortly that the characteristic polynomial is the product of all the invariant factors for A ,which will give an alternate proof of this result).
证明:(1) 设 M 是当在较小的场 F 上计算时 A 的有理标准形。由于 M 满足在有理标准形的定义中关于 K 的条件,有理标准形的唯一性意味着 M 也是 A 在 K 上的有理标准形。因此,A 的不变因子无论是在 F 上还是在 K 上都是相同的。特别是,由于最小多项式是 A 的最大不变因子,它也不依赖于 A 所在的场。从 A 的特征多项式的行列式定义中可以清楚地看出,这个多项式只依赖于 A 的元素(我们很快将看到特征多项式是所有不变因子的乘积,这将为这个结果提供一个替代的证明)。
(2) If A and B are similar over the smaller field F they are clearly similar over K . Conversely,if A and B are similar over K ,they have the same rational canonical form over K . By (1) they have the same rational canonical form over F ,hence are similar over F by Theorem 17.
(2) 如果 A 和 B 在较小的场 F 上相似,那么它们显然在 K 上也相似。反之,如果 A 和 B 在 K 上相似,那么它们在 K 上具有相同的有理标准形。由 (1) 可知,它们在 F 上也具有相同的有理标准形,因此根据定理 17,它们在 F 上是相似的。
This corollary asserts in particular that the rational canonical form for an n×n matrix A is an n×n matrix with entries in the smallest field containing the entries of A . Further,this canonical form is the same matrix even if we allow conjugation of A by nonsingular matrices whose entries come from larger fields. This explains the terminology of rational canonical form.
这个推论特别指出,对于 n×n 矩阵 A 的有理标准形是一个元素在包含 A 元素的最小场中的 n×n 矩阵。进一步,即使我们允许通过来自较大场的非奇异矩阵对 A 进行共轭,这个标准形仍然是相同的矩阵。这就解释了为什么使用有理标准形这个术语。
The next proposition gives the connection between the characteristic polynomial of a matrix (or of a linear transformation) and its invariant factors and is quite useful for determining these invariant factors (particularly for matrices of small size).
(1) The characteristic polynomial of the companion matrix of a(x) is a(x) .
(1) a(x) 的伴随矩阵的特征多项式是 a(x)。
(2) If M is the block diagonal matrix
(2) 如果 M 是分块对角矩阵
M=A10⋮00A2⋮0……⋱…00⋮Ak,
given by the direct sum of matrices A1,A2,…,Ak then the characteristic polynomial of M is the product of the characteristic polynomials of A1,A2,…,Ak .
由矩阵 A1,A2,…,Ak 的直接和给定,那么 M 的特征多项式是 A1,A2,…,Ak 的特征多项式的乘积。
Proof: These are both straightforward exercises.
证明:这两个都是直接的练习。
Proposition 20. Let A be an n×n matrix over the field F .
命题20. 设 A 为定义在域 F 上的 n×n 矩阵。
(1) The characteristic polynomial of A is the product of all the invariant factors of A.
(1) A 的特征多项式是 A 的所有不变因子的乘积。
(2) (The Cayley-Hamilton Theorem) The minimal polynomial of A divides the characteristic polynomial of A .
(2) (凯莱-哈密顿定理)A 的最小多项式整除 A 的特征多项式。
(3) The characteristic polynomial of A divides some power of the minimal polynomial of A . In particular these polynomials have the same roots,not counting multiplicities.
(3) A 的特征多项式整除 A 的最小多项式的某个幂。特别地,这些多项式具有相同的根,不计重数。
The same statements are true if the matrix A is replaced by a linear transformation T of an n -dimensional vector space over F .
如果将矩阵 A 替换为 F 上 n 维向量空间的线性变换 T,上述结论同样成立。
Proof: Let B be the rational canonical form of A . By the previous lemma the block diagonal form of B shows that the characteristic polynomial of B is the product of the characteristic polynomials of the companion matrices of the invariant factors of A . By the first part of the lemma above, the characteristic polynomial of the companion matrix Ca(x) for a(x) is just a(x) ,which implies that the characteristic polynomial for B is the product of the invariant factors of A . Since A and B are similar,they have the same characteristic polynomial, which proves (1). Assertion (2) is immediate from (1) since the minimal polynomial for A is the largest invariant factor of A . The fact that all the invariant factors divide the largest one immediately implies (3). The final assertion is clear from the dictionary between linear transformations of vector spaces and matrices.
证明:设 B 为 A 的有理标准形。根据之前的引理,B 的块对角形式表明 B 的特征多项式是 A 的不变因子的伴随矩阵的特征多项式的乘积。由上述引理的第一部分,a(x) 的伴随矩阵 Ca(x) 的特征多项式就是 a(x),这意味着 B 的特征多项式是 A 的不变因子的乘积。由于 A 和 B 相似,它们具有相同的特征多项式,这证明了 (1)。从 (1) 直接得出断言 (2),因为 A 的最小多项式是 A 的最大不变因子。所有不变因子都能整除最大不变因子这一事实立即意味着 (3)。最后的断言从线性变换的向量空间与矩阵之间的字典关系可以清楚地看出。
Note that part (2) of the proposition is the assertion that the matrix A satisfies its own characteristic polynomial,i.e., cA(A)=0 as matrices,which is the usual formulation for the Cayley-Hamilton Theorem. Note also that it implies the degree of the minimal polynomial for A has degree at most n ,a result mentioned before.
注意,命题的第 (2) 部分是断言矩阵 A 满足其自身的特征多项式,即 cA(A)=0 作为矩阵,这是凯莱-哈密顿定理的通常表述。还应注意,它暗示了 A 的最小多项式的次数至多为 n,这是一个之前提到过的结果。
The relations in Proposition 20 are frequently quite useful in the determination of the invariant factors for a matrix A ,particularly for matrices of small degree (cf. Exercises 3 and 4 and the examples). The following result (which relies on Exercises 16 to 19 in the previous section and whose proof we outline in the exercises) computes the invariant factors in general.
Let A be an n×n matrix over the field F . Then xI−A is an n×n matrix with entries in F[x] . The three operations
设 A 是定义在域 F 上的一个 n×n 矩阵。那么 xI−A 是一个其元素属于 F[x] 的 n×n 矩阵。以下三种操作:
(a) interchanging two rows or columns
(a) 交换两行或两列
(b) adding a multiple (in F[x] ) of one row or column to another
(b) 将一行或一列的倍数(在 F[x] 中)加到另一行或另一列上
(c) multiplying any row or column by a unit in F[x] ,i.e.,by a nonzero element in F , are called elementary row and column operations.
(c) 将任何一行或一列乘以 F[x] 中的单位元,即乘以 F 中的非零元素,这些操作被称为基本的行和列变换。
Theorem 21. Let A be an n×n matrix over the field F . Using the three elementary row and column operations above,the n×n matrix xI−A with entries from F[x] can be put into the diagonal form (called the Smith Normal Form for A )
定理21。设 A 是定义在域 F 上的一个 n×n 矩阵。使用上述三种基本的行和列变换,可以将具有来自 F[x] 的元素的 n×n 矩阵 xI−A 转化为对角形式(称为 A 的 Smith 标准形)
1⋱1a1(x)a2(x)⋱an(x)
with monic nonzero elements a1(x),a2(x),…,am(x) of F[x] with degrees at least one and satisfying a1(x)∣a2(x)∣⋯∣am(x) . The elements a1(x),…,am(x) are the invariant factors of A .
Invariant Factor Decomposition Algorithm: Converting to Rational Canonical Form
不变因子分解算法:转换为有理标准形
As mentioned in the exercises near the end of the previous section, keeping track of the operations necessary to diagonalize xI−A will explicitly give a matrix P such that P−1AP is in rational canonical form. Equivalently,if V is a given F[x] -module with vector space basis [e1,e2,…,en] ,then P defines the change of basis giving the Invariant Factor Decomposition of V into a direct sum of cyclic F[x] -modules. In particular,if A is the matrix of the linear transformation T of the F[x] -module V defined by x (i.e., T(ej)=xej=i=1∑naijei where A=(aij)) ,then the matrix P defines the change of basis for V with respect to which the matrix for T is in rational canonical form.
如前一部分末尾的练习中所述,跟踪对角化 xI−A 所需的操作将明确给出一个矩阵 P,使得 P−1AP 处于有理标准形。等价地,如果 V 是一个给定的 F[x] -模,其向量空间基为 [e1,e2,…,en],那么 P 定义了基变换,给出 V 的不变因子分解,将其分解为循环 F[x] -模的直接和。特别是,如果 A 是由 x 定义的 F[x] -模 V 的线性变换 T 的矩阵(即 T(ej)=xej=i=1∑naijei 其中 A=(aij))),那么矩阵 P 定义了相对于 V 的基变换,该变换使得 T 的矩阵处于有理标准形。
We first describe the algorithm in the general context of determining the Invariant Factor Decomposition of a given F[x] -module V with vector space basis [e1,e2,…,en] (the proof is outlined in the exercises). We then describe the algorithm to convert a given n×n matrix A to rational canonical form (in which reference to an underlying vector space and associated linear transformation are suppressed).
我们首先在确定给定 F[x] -模 V 的不变因子分解的一般背景下描述算法(证明大纲在练习中给出)。然后我们描述将给定的 n×n 矩阵 A 转换为有理标准形的算法(其中省略了基础向量空间和关联线性变换的引用)。
Explicit numerical examples of this algorithm are given in Examples 2 and 3 following.
本算法的具体数值示例在接下来的示例 2 和 3 中给出。
Invariant Factor Decomposition Algorithm
不变因子分解算法
Let V be an F[x] -module with vector space basis [e1,e2,…,en] (so in particular these elements are generators for V as an F[x] -module). Let T be the linear transformation of V to itself defined by x and let A be the n×n matrix associated to T and this choice of basis for V ,i.e.,
设 V 是一个 F[x] -模,其向量空间基为 [e1,e2,…,en](特别地,这些元素作为 F[x] -模的生成元)。设 T 是从 V 到其自身的线性变换,定义为 x,设 A 是与 T 相关的 n×n 矩阵,以及 V 的基的选择,即,
T(ej)=xej=i=1∑naijei where A=(aij).
(1) Use the following three elementary row and column operations to diagonalize the matrix xI−A over F[x] ,keeping track of the row operations used:
(1)使用以下三个基本的行和列变换来对矩阵 xI−A 进行对角化,保持对行变换的使用进行跟踪:
(a) interchange two rows or columns (which will be denoted by Ri↔Rj for the interchange of the ith and jth rows and similarly by Ci↔Cj for columns),
(b) add a multiple (in F[x] ) of one row or column to another (which will be denoted by Ri+p(x)Rj↦Ri if p(x) times the jth row is added to the ith row,and similarly by Ci+p(x)Cj↦Ci for columns),
(c) multiply any row or column by a unit in F[x] ,i.e.,by a nonzero element in F (which will be denoted by uRi if the ith row is multiplied by u∈F× ,and similarly by uCi for columns).
(2) Beginning with the F[x] -module generators [e1,e2,…,en] ,for each row operation used in (1), change the set of generators by the following rules:
(a) If the ith row is interchanged with the jth row then interchange the ith and jth generators.
(a)如果第 ith 行与第 jth 行交换,则交换 ith 和 jth 生成元。
(b) If p(x) times the jth row is added to the ith row then subtract p(x) times the ith generator from the jth generator (note the indices). (c) If the ith row is multiplied by the unit u∈F then divide the ith generator by u .
(3) When xI−A has been diagonalized to the form in Theorem 21 the generators [e1,e2,…,en] for V will be in the form of F[x] -linear combinations of e1,e2,…,en . Use xej=T(ej)=i=1∑naijei to write these elements as F - linear combinations of e1,e2,…,en . When xI−A has been diagonalized,the first n−m of these linear combinations are 0 (providing a useful numerical check on the computations) and the remaining m linear combinations are nonzero,i.e., the generators for V are in the form [0,…,0,f1,…,fm] corresponding precisely to the diagonal elements in Theorem 21. The elements f1,…,fm are a set of F[x] -module generators for the cyclic factors in the invariant factor decomposition of V (with annihilators (a1(x)),…,(am(x)) ,respectively):
(3) 当 xI−A 已经对角化为定理21中的形式时,V 的生成器 [e1,e2,…,en] 将是 F[x] -线性组合的形式,由 e1,e2,…,en 组成。使用 xej=T(ej)=i=1∑naijei 将这些元素写成 F -线性组合的形式。当 xI−A 已经对角化时,这些线性组合中的前 n−m 个为0(为计算提供了一个有用的数值校验),其余的 m 线性组合不为零,即 V 的生成器是 [0,…,0,f1,…,fm] 的形式,精确对应定理21中的对角元素。元素 f1,…,fm 是 F[x] -模生成器集合,对应于 V 的不变因子分解中的循环因子(分别具有消灭子 (a1(x)),…,(am(x))):
V=F[x]f1⊕F[x]f2⊕…⊕F[x]fm
F[x]fi≅F[x]/(ai(x))i=1,2,…,m,
giving the Invariant Factor Decomposition of the F[x] -module V .
给出了 F[x] -模 V 的不变因子分解。
(4) The corresponding vector space basis for each cyclic factor of V is then given by the elements fi,Tfi,T2fi,…,Tdegai(x)−1fi .
(4) 每个 V 的循环因子的对应向量空间基由元素 fi,Tfi,T2fi,…,Tdegai(x)−1fi 给出。
(5) Write the kth element of the vector space basis computed in (4) in terms of the original vector space basis [e1,e2,…,en] and use the coordinates for the kth column of an n×n matrix P . Then P−1AP is in rational canonical form (with diagonal blocks the companion matrices for the ai(x) ). This is the matrix for the linear transformation T with respect to the vector space basis in (4).
We now describe the algorithm to convert a given n×n matrix A to rational canonical form,i.e.,to determine an n×n matrix P so that P−1AP is in rational canonical form. This is nothing more than the algorithm above applied to the vector space V=Fn of n×1 column vectors with standard basis [e1,e2,…,en] (where ei is the column vector with 1 in the ith position and 0 ’s elsewhere) and T is the linear transformation defined by A and this choice of basis. Explicit reference to this underlying vector space and associated linear transformation are suppressed, so the algorithm is purely matrix theoretic.
Converting an n×n Matrix to Rational Canonical Form
将n×n矩阵转换为有理标准形
Let A be an n×n matrix with entries in the field F .
设A是一个在字段F中的n×n矩阵。
(1) Use the following three elementary row and column operations to diagonalize the matrix xI−A over F[x] ,keeping track of the row operations used:
(1) 使用以下三个基本的行和列操作来对矩阵xI−A进行对角化,同时跟踪所使用的行操作:
(a) interchange two rows or columns (which will be denoted by Ri↔Rj for the interchange of the ith and jth rows and similarly by Ci↔Cj for columns),
(b) add a multiple (in F[x] ) of one row or column to another (which will be denoted by Ri+p(x)Rj↦Ri if p(x) times the jth row is added to the ith row,and similarly by Ci+p(x)Cj↦Ci for columns),
(c) multiply any row or column by a unit in F[x] ,i.e.,by a nonzero element in F (which will be denoted by uRi if the ith row is multiplied by u∈F× ,and similarly by uCi for columns).
(b) If Ri+p(x)Rj↦Ri then subtract the product of the matrix p(A) times the ith column of P′ from the jth column of P′ (i.e., Cj−p(A)Ci↦Cj for P′ - note the indices).
(3) When xI−A has been diagonalized to the form in Theorem 21 the first n−m columns of the matrix P′ are 0 (providing a useful numerical check on the computations) and the remaining m columns of P′ are nonzero. For each i=1,2,…,m , multiply the ith nonzero column of P′ successively by A0=I,A1,A2,…,Adi−1, where di is the integer in (1) above and use the resulting column vectors (in this order) as the next di columns of an n×n matrix P . Then P−1AP is in rational canonical form (whose diagonal blocks are the companion matrices for the polynomials a1(x),…,am(x) in (1)).
In the theory of canonical forms for linear transformations (or matrices) the characteristic polynomial plays the role of the order of a finite abelian group and the minimal polynomial plays the role of the exponent (after all, they are the same invariants, one for modules over the Principal Ideal Domain Z and the other for modules over the Principal Ideal Domain F[x] ) so we can solve problems directly analogous to those we considered for finite abelian groups in Chapter 5. In particular, this includes the following:
在线性变换(或矩阵)的标准形理论中,特征多项式扮演了有限阿贝尔群阶数的作用,而最小多项式扮演了指数的作用(毕竟,它们是相同的不变量,一个适用于主理想整环 Z 上的模,另一个适用于主理想整环 F[x] 上的模),因此我们可以直接解决与第5章中我们考虑的有限阿贝尔群类似的问题。特别是,这包括以下内容:
(A) determine the rational canonical form of a given matrix (analogous to decomposing a finite abelian group as a direct product of cyclic groups)
(A) 确定给定矩阵的有理标准形(类似于将有限阿贝尔群分解为循环群的直积)
(B) determine whether two given matrices are similar (analogous to determining whether two given finite abelian groups are isomorphic)
(B) 判断给定的两个矩阵是否相似(类似于判断给定的两个有限阿贝尔群是否同构)
(C) determine all similarity classes of matrices over F with a given characteristic polynomial (analogous to determining all abelian groups of a given order)
(C) 确定具有给定特征多项式的 F 上所有矩阵的相似类(类似于确定给定阶数的所有阿贝尔群)
(D) determine all similarity classes of n×n matrices over F with a given minimal polynomial (analogous to determining all abelian groups of rank at most n of a given exponent).
(D) 确定 n×n 矩阵在给定最小多项式 F 上的所有相似类(类似于确定秩至多 n 的给定指数的所有阿贝尔群)。
Examples
示例
(1) We find the rational canonical forms of the following matrices over Q and determine if they are similar:
A direct computation shows that all three of these matrices have the same characteristic polynomial: cA(x)=cB(x)=cC(x)=(x−2)2(x−3) . Since the minimal and characteristic polynomials have the same roots, the only possibilities for the minimal polynomials are (x−2)(x−3) or (x−2)2(x−3) . We quickly find that (A−2I)(A−3I)=0 , (B−2I)(B−3I)=0 (the 1,1-entry is nonzero) and (C−2I)(C−3I)=0 (the 1,2-entry is nonzero). It follows that
It follows immediately that there are no additional invariant factors for B and C . Since the invariant factors for A divide the minimal polynomial and have product the characteristic polynomial,we see that A has for invariant factors the polynomials x−2,(x−2)(x−3)=x2−5x+6 . (For 2×2 and 3×3 matrices the determination of the characteristic and minimal polynomials determines all the invariant factors, cf. Exercises 3 and 4.) We conclude that B and C are similar and neither is similar to A . The rational canonical forms are (note (x−2)2(x−3)=x3−7x2+16x−12 )
立刻得出 B 和 C 没有额外的不变因子。由于 A 的不变因子除以最小多项式并乘积等于特征多项式,我们发现 A 的不变因子是多项式 x−2,(x−2)(x−3)=x2−5x+6 。(对于 2×2 和 3×3 矩阵,特征多项式和最小多项式的确定决定了所有的不变因子,参见练习3和4。)我们得出结论,B 和 C 是相似的,且都不与 A 相似。有理标准形是(注意 (x−2)2(x−3)=x3−7x2+16x−12 )
(2) In the example above the rational canonical forms were obtained simply by determining the characteristic and minimal polynomials for the matrices. As mentioned, this is sufficient for 2×2 and 3×3 matrices since this information is sufficient to determine all of the invariant factors. For larger matrices, however, this is in general not sufficient (cf. the next example) and more work is required to determine the invariant factors. In this example we again compute the rational canonical form for the matrix A in Example 1 following the two algorithms outlined above. While this is computationally more difficult for this small matrix (as will be apparent), it has the advantage even in this case that it also explicitly computes a matrix P with P−1AP in rational canonical form.
在上述示例中,通过简单地确定矩阵的特征多项式和最小多项式来获得有理标准形。如前所述,这对于 2×2 和 3×3 矩阵来说是足够的,因为这一信息足以确定所有的不变因子。然而,对于更大的矩阵,这通常是不够的(参见下一个示例),需要更多的工作来确定不变因子。在这个示例中,我们再次计算示例 1 中矩阵 A 的有理标准形,遵循上述两种算法。虽然对于这个小矩阵来说计算上更加困难(正如将出现的),但即使在每种情况下,它也有优势,即它还显式地计算了一个 P 矩阵,其 P−1AP 在有理标准形中。
I. (Invariant Factor Decomposition) We use row and column operations (in Q[x] ) to reduce the matrix
I.(不变因子分解)我们使用行和列操作(在 Q[x] 中)来简化矩阵
xI−A=x−2002x−30−147x−2
to diagonal form. As in the invariant factor decomposition algorithm, we shall use the notation Ri↔Rj to denote the interchange of the ith and jth rows, Ri+aRj↦Ri if a times the jth row is added to the ith row,simply uRi if the ith row is multiplied by u (and similarly for columns,using C instead of R ). Note also that the first two operations we perform below are rather adhoc and were chosen simply to have integers everywhere in the computation:
到对角形式。和在不变因子分解算法中一样,我们将使用记号 Ri↔Rj 来表示交换 ith 和 jth 行,如果 a 倍的 jth 行加到 ith 行上,则是 Ri+aRj↦Ri ,如果 ith 行乘以 u ,则是简单地 uRi(对于列操作,使用 C 而不是 R)。请注意,下面我们执行的前两个操作相当 adhoc ,我们选择它们仅仅是为了在计算中到处都有整数:
This determines the invariant factors x−2,x2−5x+6 for this matrix,which we determined in Example 1 above. Let now V be a 3-dimensional vector space over Q with basis e1,e2,e3 and let T be the corresponding linear transformation (which defines the action of x on V ),i.e.,
这确定了该矩阵的不变因子 x−2,x2−5x+6,我们在上面的示例 1 中确定了它们。现在让 V 是一个 Q 上的三维向量空间,其基为 e1,e2,e3,并让 T 是相应的线性变换(它定义了 x 在 V 上的作用),即,
xe1=T(e1)=2e1
xe2=T(e2)=−2e1+3e2
xe3=T(e3)=14e1−7e2+2e3.
The row operations used in the reduction above were
Using the formulas above for the action of x ,we see that these last elements are the elements [0,−7e1+7e2+e3,−e1+e2] of V corresponding to the elements 1,x−2 and x2−5x+6 in the diagonalized form of xI−A ,respectively. The elements f1=−7e1+7e2+e3 and f2=−e1+e2 are therefore Q[x] -module generators for the two cyclic factors of V in its invariant factor decomposition as a Q[x] -module. The corresponding Q -vector space bases for these two factors are then f1 and f2,xf2=Tf2, i.e., −7e1+7e2+e3 and −e1+e2,T(−e1+e2)=−4e1+3e2.
Using the formulas above for the action of x ,we see that these last elements are the elements [0,−7e1+7e2+e3,−e1+e2] of V corresponding to the elements 1,x−2 and x2−5x+6 in the diagonalized form of xI−A ,respectively. The elements f1=−7e1+7e2+e3 and f2=−e1+e2 are therefore Q[x] -module generators for the two cyclic factors of V in its invariant factor decomposition as a Q[x] -module. The corresponding Q -vector space bases for these two factors are then f1 and f2,xf2=Tf2, i.e., −7e1+7e2+e3 and −e1+e2,T(−e1+e2)=−4e1+3e2.
Then the matrix
Then the matrix
P=−771−110−430
conjugates A into its rational canonical form:
conjugates A into its rational canonical form:
P−1AP=2000010−65
as one easily checks.
as one easily checks.
II. (Converting A Directly to Rational Canonical Form) We use the row operations involved in the diagonalization of xI−A to determine the matrix P′ of the algorithm above:
II. (Converting A Directly to Rational Canonical Form) We use the row operations involved in the diagonalization of xI−A to determine the matrix P′ of the algorithm above:
Here we have d1=1 and d2=2 ,corresponding to the second and third nonzero columns of P′ ,respectively. The columns of P are therefore given by
Here we have d1=1 and d2=2 ,corresponding to the second and third nonzero columns of P′ ,respectively. The columns of P are therefore given by
−771 and −110,A−110=−430
respectively,which again gives the matrix P above.
respectively,which again gives the matrix P above.
(3) For the 3×3 matrix A it was not necessary to perform the lengthy calculations above merely to determine the rational canonical form (equivalently, the invariant factors),as we saw in Example 1. For n×n matrices with n≥4 ,however,the computation of the characteristic and minimal polynomials is in general not sufficient for the determination of all the invariant factors, so the more extensive calculations of the previous example may become necessary. For example, consider the matrix
(3) For the 3×3 matrix A it was not necessary to perform the lengthy calculations above merely to determine the rational canonical form (equivalently, the invariant factors),as we saw in Example 1. For n×n matrices with n≥4 ,however,the computation of the characteristic and minimal polynomials is in general not sufficient for the determination of all the invariant factors, so the more extensive calculations of the previous example may become necessary. For example, consider the matrix
D=12102−101−441−24−8−23.
A short computation shows that the characteristic polynomial of D is (x−1)4 . The possible minimal polynomials are then x−1,(x−1)2,(x−1)3 and (x−1)4 . Clearly D−I=0 and another short computation shows that (D−I)2=0 ,so the minimal polynomial for D is (x−1)2 . There are then two possible sets of invariant factors:
To determine the invariant factors for D we apply the procedure of the previous example to the 4×4 matrix
为了确定 D 的不变因子,我们将前一个例子的方法应用到 4×4 矩阵上
xI−D=x−1−2−10−2x+10−14−4x−12−482x−3.
The diagonal matrix obtained from this matrix by elementary row and column opera-
通过初等行和列变换从该矩阵得到的对角矩阵是矩阵
tions is the matrix
这表明 的不变因子是 (一组使 对角化的初等行和列变换是 ,
1000010000(x−1)20000(x−1)2,
which shows that the invariant factors for D are (x−1)2,(x−1)2 (one series of elementary row and column operations which diagonalize xI−D are R1↔R3,−R1 ,
I. (Invariant Factor Decomposition) If e1,e2,e3,e4 is a basis for V in this case,then using the row operations in this diagonalization as in the previous example we see that the generators of V corresponding to the factors above are (x−1)e1−2e2−e3=0 , −2e1+(x+1)e2−e4=0,e1,e2 . Hence a vector space basis for the two direct factors in the invariant decomposition of V in this case is given by e1,Te1 and e2,Te2 where The corresponding matrix P relating these bases is
I. (不变因子分解)如果 e1,e2,e3,e4 是这个例子中 V 的一个基,那么使用这个对角化中的行变换,就像前一个例子中那样,我们看到对应于上述因子的 V 的生成元是 (x−1)e1−2e2−e3=0 ,−2e1+(x+1)e2−e4=0,e1,e2 。因此,在这种情况下,V 的不变分解中的两个直和因子的一个向量空间基由 e1,Te1 和 e2,Te2 给出,其中对应的矩阵 P 关联这些基是
P=1000121001002−101
so that P−1DP is in rational canonical form:
使得 P−1DP 是有理标准形:
P−1DP=0100−1200000100−12
as can easily be checked.
这可以很容易地验证。
II. (Converting D Directly to Rational Canonical Form) As in Example 2 we determine the matrix P′ of the algorithm from the row operations used in the diagonalization of xI−D:
II. (直接将有理标准形转换为 D)就像在例子 2 中那样,我们通过在 xI−D: 的对角化中使用的行变换确定算法的矩阵 P′
1000,D1000=1210 and 0100,D0100=2−101,
respectively,which again gives the matrix P above.
分别,这再次给出了上面的矩阵 P。
(4) In this example we determine all similarity classes of matrices A with entries from Q with characteristic polynomial (x4−1)(x2−1) . First note that any matrix with a degree
6 characteristic polynomial must be a 6×6 matrix. The polynomial (x4−1)(x2−1) factors into irreducibles in Q[x] as (x−1)2(x+1)2(x2+1) . Since the minimal polynomial mA(x) for A has the same roots as cA(x) it follows that (x−1)(x+1)(x2+1) divides mA(x) . Suppose a1(x),…,am(x) are the invariant factors of some A ,so am(x)=mA(x),ai(x)∣ai+1(x) (in particular,all the invariant factors divide mA(x) ) and a1(x)a2(x)⋯am(x)=(x4−1)(x2−1) . One easily sees that the only permissible lists under these constraints are
One can now easily write out the corresponding direct sums of companion matrices to obtain representatives of the 4 similarity classes. We shall see in the next section that there are still only 4 similarity classes even in M6(C) .
(5) In this example we find all similarity classes of 3×3 matrices A with entries from Q satisfying A6=I . For each such A ,its minimal polynomial divides x6−1 and in Q[x] the complete factorization of this polynomial is
(5) 在这个例子中,我们找到所有满足 A6=I 的 3×3 矩阵 A 的相似类。对于每个这样的 A ,它的最小多项式能整除 x6−1 ,在 Q[x] 中,这个多项式的完全分解是
x6−1=(x−1)(x+1)(x2−x+1)(x2+x+1).
Conversely,if B is any 3×3 matrix whose minimal polynomial divides x6−1 ,then B6=I . The only restriction on the minimal polynomial for B is that its degree is at most 3 (by the Cayley-Hamilton Theorem). The only possibilities for the minimal polynomial of such a matrix A are therefore
反之,如果 B 是任何最小多项式能整除 x6−1 的 3×3 矩阵,那么 B6=I 。对于 B 的最小多项式的唯一限制是其次数最多为3(根据凯莱-哈密顿定理)。因此,此类矩阵 A 的最小多项式的唯一可能是
(a) x−1
(a) x−1
(b) x+1
(b) x+1
(c) x2−x+1
(c) x2−x+1
(d) x2+x+1
(d) x2+x+1
(e) (x−1)(x+1)
(e) (x−1)(x+1)
(f) (x−1)(x2−x+1)
(f) (x−1)(x2−x+1)
(g) (x−1)(x2+x+1)
(g) (x−1)(x2+x+1)
(b) (x+1)(x2−x+1)
(b) (x+1)(x2−x+1)
(i) (x+1)(x2+x+1) .
(i) (x+1)(x2+x+1) .
Under the constraints of the rational canonical form these give rise to the following permissible lists of invariant factors:
在有理标准形的约束下,这些会产生以下允许的不变因子列表:
(i) x−1,x−1,x−1
(i) x−1,x−1,x−1
(ii) x+1,x+1,x+1
(ii) x+1,x+1,x+1
(iii) x−1,(x−1)(x+1)
(iii) x−1,(x−1)(x+1)
(iv) x+1,(x−1)(x+1)
(iv) x+1,(x−1)(x+1)
(v) (x−1)(x2−x+1)
(v) (x−1)(x2−x+1)
(vi) (x−1)(x2+x+1)
(vi) (x−1)(x2+x+1)
(vii) (x+1)(x2−x+1)
(vii) (x+1)(x2−x+1)
(viii) (x+1)(x2+x+1) .
(viii) (x+1)(x2+x+1) .
Note that it is impossible to have a suitable set of invariant factors if the minimal polynomial is x2+x+1 or x2−x+1 . One can now write out the corresponding
rational canonical forms; for example,(i) is I ,(ii) is −I ,and (iii) is
有理标准形;例如,(i)是 I ,(ii)是 −I ,而(iii)是
100001010
Note also that another way of phrasing this result is that any 3×3 matrix with entries from Q whose order (multiplicatively,of course) divides 6 is similar to one of these 8 matrices, so this example determines all elements of orders 1,2,3 and 6 in the group GL3(Q) (up to similarity).
Prove that similar linear transformations of V (or n×n matrices) have the same characteristic and the same minimal polynomial.
证明相似的线性变换 V(或 n×n 矩阵)具有相同的特征多项式和相同的最小多项式。
Let M be as in Lemma 19. Prove that the minimal polynomial of M is the least common multiple of the minimal polynomials of A1,…,Ak .
设 M 如引理19中所述。证明 M 的最小多项式是 A1,…,Ak 的最小多项式的最小公倍数。
Prove that two 2×2 matrices over F which are not scalar matrices are similar if and only if they have the same characteristic polynomial.
证明两个在 F 上的非纯量矩阵 2×2 相似当且仅当它们具有相同的特征多项式。
Prove that two 3×3 matrices are similar if and only if they have the same characteristic and same minimal polynomials. Give an explicit counterexample to this assertion for 4×4 matrices.
Prove directly from the fact that the collection of all linear transformations of an n dimensional vector space V over F to itself form a vector space over F of dimension n2 that the minimal polynomial of a linear transformation T has degree at most n2 .
直接从所有 n 维向量空间 V 到其自身的线性变换集合在 F 上构成一个维度为 n2 的向量空间这一事实出发,证明线性变换 T 的最小多项式的次数最多为 n2。
Prove that the constant term in the characteristic polynomial of the n×n matrix A is (−1)ndetA and that the coefficient of xn−1 is the negative of the sum of the diagonal entries of A (the sum of the diagonal entries of A is called the trace of A ). Prove that det A is the product of the eigenvalues of A and that the trace of A is the sum of the eigenvalues of A .
证明 n×n 矩阵 A 的特征多项式中的常数项是 (−1)ndetA,并且 xn−1 的系数是 A 对角线元素之和的相反数(A 对角线元素之和称为 A 的迹)。证明 det A 是 A 特征值的乘积,并且 A 的迹是 A 特征值之和。
Determine the eigenvalues of the matrix
确定矩阵的特征值
0001100001000010.
Verify that the characteristic polynomial of the companion matrix
00−1−100−100,c0−10c1−11c and 422−420840−140465−463930−15515−1532−5−3030−6012.
Find all similarity classes of 6×6 matrices over Q with minimal polynomial (x+2)2(x−1) (it suffices to give all lists of invariant factors and write out some of their corresponding matrices).
Find all similarity classes of 6×6 matrices over C with characteristic polynomial (x4−1)(x2−1) .
找出所有在 6×6 上具有特征多项式 (x4−1)(x2−1) 的 C 矩阵的相似类。
Find all similarity classes of 3×3 matrices A over F2 satisfying A6=I (compare with the answer we computed over Q ). Do the same for 4×4 matrices B satisfying B20=I .
Prove that the number of similarity classes of 3×3 matrices over Q with a given characteristic polynomial in Q[x] is the same as the number of similarity classes over any extension field of Q . Give an example to show that this is not true in general for 4×4 matrices.
Determine all possible rational canonical forms for a linear transformation with characteristic polynomial x2(x2+1)2 .
确定具有特征多项式 x2(x2+1)2 的线性变换的所有可能的有理标准形。
Determine up to similarity all 2×2 rational matrices (i.e., ∈M2(Q) ) of precise order 4 (multiplicatively,of course). Do the same if the matrix has entries from C .
确定 2×2 有理矩阵(即 ∈M2(Q) )的所有相似类,其精确阶数为4(当然是指乘法意义下的阶数)。如果矩阵的元素来自 C ,也做同样的事情。
Show that x5−1=(x−1)(x2−4x+1)(x2+5x+1) in F19[x] . Use this to determine up to similarity all 2×2 matrices with entries from F19 of (multiplicative) order 5 .
Determine representatives for the conjugacy classes for GL3(F2) . [Compare your answer with Theorem 15 and Proposition 14 of Chapter 6.]
确定 GL3(F2) 的共轭类的代表。将你的答案与第6章的定理15和命题14进行比较。]
Let V be a finite dimensional vector space over Q and suppose T is a nonsingular linear transformation of V such that T−1=T2+T . Prove that the dimension of V is divisible by 3. If the dimension of V is precisely 3 prove that all such transformations T are similar.
设 V 是 Q 上的有限维向量空间,假设 T 是 V 的一个非奇异线性变换,且满足 T−1=T2+T 。证明 V 的维数能被3整除。如果 V 的维数恰好为3,证明所有这样的变换 T 都是相似的。
Let V be the infinite dimensional real vector space
设 V 是无限维实向量空间
R∞={(a0,a1,a2,…)∣a0,a1,a2,⋯∈R}.
Define the map T:V→V by T(a0,a1,a2,…)=(0,a0,a1,a2,…) . Prove that T has no eigenvectors.
通过 T(a0,a1,a2,…)=(0,a0,a1,a2,…) 定义映射 T:V→V 。证明 T 没有特征向量。
Let ℓ be a prime and let Φℓ(x)=x−1xℓ−1=xℓ−1+xℓ−2+…+x+1∈Z[x] be the ℓth cyclotomic polynomial,which is irreducible over Q (Example 4 following Corollary 9.14). This exercise determines the smallest degree of a factor of Φℓ(x) modulo p for any prime p and so in particular determines when Φℓ(x) is irreducible modulo p . (This actually determines the complete factorization of Φℓ(x) modulo p - cf. Exercise 8 of Section 13.6.)
设 ℓ 为一个质数,Φℓ(x)=x−1xℓ−1=xℓ−1+xℓ−2+…+x+1∈Z[x] 为 ℓth 的原根多项式,它在 Q 上是不可约的(例 4,紧随推论 9.14)。这个练习确定了对于任意质数 p ,Φℓ(x) 模 p 的因子的最小次数,特别地,确定了 Φℓ(x) 在模 p 时的不可约性。 (这实际上确定了 Φℓ(x) 模 p 的完全分解 - 参见第 13.6 节练习 8。)
(a) Show that if p=ℓ then Φℓ(x) is divisible by x−1 in Fℓ[x] .
(a) 证明如果 p=ℓ ,那么在 Fℓ[x] 中 Φℓ(x) 可被 x−1 整除。
(b) Suppose p=ℓ and let f denote the order of p in Fℓ× ,i.e., f is the smallest power of p with pf≡1modℓ . Show that m=f is the first value of m for which the group GLm(Fp) contains an element A of order ℓ . [Use the formula for the order of this group at the end of Section 11.1.]
(b) 假设 p=ℓ 并且设 f 表示 p 在 Fℓ× 中的阶,即 f 是 p 的最小幂使得 pf≡1modℓ 。证明 m=f 是使得群 GLm(Fp) 包含一个阶为 ℓ 的元素 A 的第一个 m 值。 [使用第 11.1 节末尾关于该群阶的公式。]
(c) Show that Φℓ(x) is not divisible by any polynomial of degree smaller than f in Fp[x] [consider the companion matrix for such a divisor and use (b)]. Let mA(x)∈Fp[x] denote the minimal polynomial for the matrix A in (b) and conclude that mA(x) is irreducible of degree f and divides Φℓ(x) in Fp[x] .
(c) 证明 Φℓ(x) 不能被 Fp[x] 中任何次数小于 f 的多项式整除(考虑此类除数的伴随矩阵并使用 (b))。设 mA(x)∈Fp[x] 表示 (b) 中矩阵 A 的最小多项式,并得出 mA(x) 是次数为 f 的不可约多项式,并且在 Fp[x] 中整除 Φℓ(x)。
(d) In particular,prove that Φℓ(x) is irreducible modulo p if and only if l−1 is the smallest power of p which is congruent to 1 modulo ℓ ,i.e., p is a primitive root modulo ℓ .
(d) 特别地,证明 Φℓ(x) 在模 p 意义下不可约当且仅当 l−1 是 p 的最小幂,满足模 ℓ 同余于1,即 p 是模 ℓ 的一个原根。
Prove that the first two elementary row and column operations described before Theorem 21 do not change the determinant of the matrix and the third elementary operation multiplies the determinant by a unit. Conclude from Theorem 21 that the characteristic polynomial of A differs by a unit from the product of the invariant factors of A . Since both these polynomials are monic by definition, conclude that they are equal (this gives an alternate proof of Proposition 20).
证明定理21之前描述的前两个基本行和列操作不会改变矩阵的行列式,而第三个基本操作将行列式乘以一个单位元。从定理21得出结论,A 的特征多项式与 A 的不变因子的乘积相差一个单位元。由于这两个多项式按定义都是首一多项式,因此它们相等(这给出了命题20的另一种证明)。
The following exercises outline the proof of Theorem 21. They carry out explicitly the construction described in Exercises 16 to 19 of the previous section for the Euclidean Domain F[x] . Let V be an n -dimensional vector space with basis v1,v2,…,vn and let T be the linear transformation of V defined by the matrix A and this choice of basis,i.e., T is the linear transformation with
以下练习概述了定理21的证明。它们具体执行了上一节练习16至19中描述的构造,针对欧几里得域 F[x] 。设 V 是一个 n 维向量空间,其基为 v1,v2,…,vn ,并设 T 是由矩阵 A 和这个基定义的 V 上的线性变换,即 T 是具有
T(vj)=i=1∑naijvi,j=1,2,…,n
where A=(aij) . Let F[x]n be the free module of rank n over F[x] and let ξ1,ξ2,…,ξn denote a basis. Then we have a natural surjective F[x] -module homomorphism
defined by mapping ξi to vi,i=1,2,…,n . As indicated in the exercises of the previous section the invariant factors for the F[x] -module V can be determined once we have determined a set of generators and the corresponding relations matrix for kerφ . Since by definition x acts on V by the linear transformation T ,we have
通过映射 ξi 到 vi,i=1,2,…,n 定义。如前一部分练习所示,一旦我们确定了生成集和相应的 kerφ 关系矩阵,就可以确定 F[x] -模 V 的不变因子。由于根据定义 x 通过线性变换 T 作用于 V,因此我们有
Show that v1,v2,…,vn generate the kernel of φ . [Use the previous result to show that any element of kerφ is the sum of an element in the module generated by ν1,ν2,…,νn and an element of the form b1ξ1+⋯+bnξn where the bi are elements of F . Then show that such an element is in kerφ if and only if all the bi are 0 since v1,…,vn are a basis for V over F .]
证明 v1,v2,…,vn 生成 φ 的核。 [使用前面的结果证明 kerφ 的任何元素都是 ν1,ν2,…,νn 生成的模中的元素和一个形式为 b1ξ1+⋯+bnξn 的元素的和,其中 bi 是 F 的元素。然后证明如果且仅当所有 bi 都为0时,这样的元素在 kerφ 中,因为 v1,…,vn 是 V 在 F 上的基。]
Show that the generators v1,v2,…,vn of kerφ have corresponding relations matrix
where At is the transpose of A . Conclude that Theorem 21 and the algorithm for determining the invariant factors of A follows by Exercises 16 to 19 in the previous section (note that the row and column operations necessary to diagonalize this relations matrix are the column and row operations necessary to diagonalize the matrix in Theorem 21, which explains why the invariant factor algorithm keeps track of the row operations used).
其中 At 是 A 的转置。得出结论,定理21以及确定 A 不变因子的算法可以通过前一部分的练习16到19得出(注意,对角化此关系矩阵所需的行和列操作是对角化定理21中矩阵所需的列和行操作,这解释了为什么不变因子算法需要跟踪使用的行操作)。