1.2. Lecture 2

1.2.1. More results about the exponential map

For the next result about the exponential map it will be useful to know the following fact from linear algebra.

Lemma 1.2.1.

Let XGLn()X\in\operatorname{GL}_{n}(\mathbb{C}). Then XX is conjugate to a matrix of the form DUDU where DD is diagonal, UU is upper triangular with ‘1’s on the diagonal and DD and UU commute.

Proof.

(nonexaminable) This follows from Jordan normal form. Here’s a direct proof. Firstly write n\mathbb{C}^{n} as a direct sum of generalised eigenspaces for XX: if λ\lambda is an eigenvalue of XX then we can write the characteristic polynomial P(T)=(Tλ)aQ(T)P(T)=(T-\lambda)^{a}Q(T) where Q(T)Q(T) does not have λ\lambda as a root and a1a\geq 1 is an integer. Then the image of Q(X)Q(X) on n\mathbb{C}^{n} is the generalised eigenspace of λ\lambda. The kernel of Q(X)Q(X) is preserved by XX and XX does not have an eigenvalue equal to λ\lambda since λ\lambda is not a root of Q(X)Q(X), which must be the characteristic polynomial of XX acting on kerQ(X)\ker Q(X). Thus

imQ(X)kerQ(X)={0}\operatorname{im}Q(X)\cap\ker Q(X)=\{0\}

and by the rank-nullity theorem

n=imQ(X)kerQ(X)\mathbb{C}^{n}=\operatorname{im}Q(X)\oplus\ker Q(X)

is a decomposition of n\mathbb{C}^{n} as a direct sum of the λ\lambda generalised eigenspace and a subspace preserved by XX. Repeating for each eigenvector gives the required decomposition of n\mathbb{C}^{n}. This reduces the proof of the statement to the case where XX has only one eigenvalue λ\lambda. In this case, we can inductively choose a basis 𝐯1,,𝐯n{\bf v}_{1},\ldots,{\bf v}_{n} of n\mathbb{C}^{n} such that, for 1in1\leq i\leq n, the image of 𝐯i{\bf v}_{i} in /𝐯1,,𝐯i1\mathbb{C}/\left\langle{\bf v}_{1},\ldots,{\bf v}_{i-1}\right\rangle is an eigenvector of XX with eigenvalue λ\lambda. With respect to this basis, XX is then diagonal with λ\lambda’s on the diagonal, and we get the required decomposition with D=λIdD=\lambda\operatorname{Id}. ∎

Lemma 1.2.2.

The exponential function exp:𝔤𝔩n,GLn()\exp:\mathfrak{gl}_{n,\mathbb{C}}\to\operatorname{GL}_{n}(\mathbb{C}) is surjective.

Proof.

First prove it for all DD and UU in GLn()\operatorname{GL}_{n}(\mathbb{C}) as in Lemma 1.2.1. The case of diagonal matrices is more straightforward (Problem 1.2.3(a)) whereas for UU you can use that the power series for log(U)\log(U) in terms of powers of UIdU-\operatorname{Id} is actually a polynomial (Problem 1.2.3(b)).

Now consider DUDU. If D=exp(d)D=\exp(d) and U=exp(u)U=\exp(u) then

DU=exp(d)exp(u)=exp(d+u)DU=\exp(d)\exp(u)=\exp(d+u)

because dd and uu commute (so long as you choose dd and uu carefully — Problem 1.2.3(c)).

By Lemma 1.2.1 we have that for any XX in GLn()\operatorname{GL}_{n}(\mathbb{C}) there exists a PP in GLn()\operatorname{GL}_{n}(\mathbb{C}) such that P1XP=DUP^{-1}XP=DU where DD and UU have the form stated in the Lemma. By Lemma 1.1.2(v) we have

X=PDUP1=Pexp(d+u)P1=exp(P(d+u)P1)X=PDUP^{-1}=P\exp(d+u)P^{-1}=\exp(P(d+u)P^{-1})

and thus exp\exp is surjective. ∎

Remark 1.2.3.

The lemma is not true over \mathbb{R}; as we will see, the determinant of exp(X)\exp(X) is positive for all real matrices XX.

Lemma 1.2.4.

We have

detexp(X)=exptr(X).\det\exp(X)=\exp\operatorname{tr}(X).
Proof.

By Lemma 1.2.1 we can conjugate so that XX is an upper triangular matrix with diagonal entries λ1,,λn\lambda_{1},\ldots,\lambda_{n}, and then note that exp(X)\exp(X) is also upper triangular with diagonal entries exp(λ1),,exp(λn)\exp(\lambda_{1}),\ldots,\exp(\lambda_{n}).

Thus

detexp(X)=i=1nexp(λi)=exp(i=1nλi)=exptr(X).\det\exp(X)=\prod_{i=1}^{n}\exp(\lambda_{i})=\exp\left(\sum_{i=1}^{n}\lambda_{% i}\right)=\exp\operatorname{tr}(X).

The next proposition will be useful when we discuss Lie algebras of linear Lie groups.

Proposition 1.2.5 (Lie product formula).

For X,Y𝔤𝔩n,kX,Y\in\mathfrak{gl}_{n,k} we have

exp(X+Y)=limk(exp(Xk)exp(Yk))k.\exp(X+Y)=\lim_{k\to\infty}\left(\exp\left(\frac{X}{k}\right)\exp\left(\frac{Y% }{k}\right)\right)^{k}.
Proof.

We have

(exp(Xk)exp(Yk))k\displaystyle\left(\exp\left(\frac{X}{k}\right)\exp\left(\frac{Y}{k}\right)% \right)^{k} =((Id+Xk+O(1k2))(Id+Yk+O(1k2)))k\displaystyle=\left(\left(\operatorname{Id}+\frac{X}{k}+O\left(\frac{1}{k^{2}}% \right)\right)\left(\operatorname{Id}+\frac{Y}{k}+O\left(\frac{1}{k^{2}}\right% )\right)\right)^{k}
=(Id+X+Yk+O(1k2))k\displaystyle=\left(\operatorname{Id}+\frac{X+Y}{k}+O\left(\frac{1}{k^{2}}% \right)\right)^{k}
=Id+X+Y+O(1k)\displaystyle=\operatorname{Id}+X+Y+O\left(\frac{1}{k}\right)
=exp(X+Y+O(1k)).\displaystyle=\exp\left(X+Y+O\left(\frac{1}{k}\right)\right).

Now take the limit as kk\to\infty. ∎

1.2.2. One-parameter subgroups

Lemma 1.2.6.

The map from \mathbb{R} to GLn()\operatorname{GL}_{n}(\mathbb{C}) given by

texp(tX)t\longmapsto\exp(tX)

is a differentiable group homomorphism.

We have

ddtexp(tX)=Xexp(tX)=exp(tX)X.\frac{d}{dt}\exp(tX)=X\exp(tX)=\exp(tX)X.

In particular,

ddtexp(tX)|t=0=X.\left.\frac{d}{dt}\exp(tX)\right|_{t=0}=X.
Proof.

The given map is a group homomorphism by Lemma 1.1.2(iv).

By definition,

exp(tX)=k=0Xkk!tk.\exp(tX)=\sum_{k=0}^{\infty}\frac{X^{k}}{k!}t^{k}.

As this power series (and its termwise derivative) are uniformly convergent on any compact subset, we can compute its derivative by differentiating termwise, which gives

ddtexp(tX)=k=1Xk(k1)!tk1=Xexp(tX).\frac{d}{dt}\exp(tX)=\sum_{k=1}^{\infty}\frac{X^{k}}{(k-1)!}t^{k-1}=X\exp(tX).\qed
Definition 1.2.7.

A one-parameter subgroup of GLn()\operatorname{GL}_{n}(\mathbb{C}) is a differentiable group homomorphism f:GLn()f:\mathbb{R}\to\operatorname{GL}_{n}(\mathbb{C}). That is, a differentiable map such that

f(s+t)=f(s)f(t)f(s+t)=f(s)f(t)

for all s,ts,t\in\mathbb{R}.

The infinitesimal generator of a one-parameter subgroup ff is the element f(0)𝔤𝔩n,f^{\prime}(0)\in\mathfrak{gl}_{n,\mathbb{C}}.

The convention used here is a slight abuse of notation, f()f(\mathbb{R}) is the subgroup of GLn()\operatorname{GL}_{n}(\mathbb{C}) referred to in the definition but as the map defines the subgroup we just refer to the map.

Remark 1.2.8.

For a one-parameter subgroup ff, it actually suffices to require that ff is continuous. Differentiability then comes for free.

Indeed, if ff is continuous, the integral 0af(t)𝑑t\int_{0}^{a}f(t)dt exists. Moreover,

f(s)0af(t)𝑑t=0af(s+t)𝑑t=ss+af(t)𝑑t.f(s)\int_{0}^{a}f(t)dt=\int_{0}^{a}f(s+t)dt=\int_{s}^{s+a}f(t)dt.

The RHS is differentiable with respect to ss by the fundamental theorem of algebra. Therefore, to prove that f(s)f(s) is differentiable, we only need to show that there is an a>0a>0 such that 0af(t)𝑑t\int_{0}^{a}f(t)dt is an invertible matrix. Now consider the function

F(a)=1a0af(t)𝑑t.F(a)=\frac{1}{a}\int_{0}^{a}f(t)dt.

It is well-defined for a0a\neq 0 and lima0F(a)=Id\lim\limits_{a\to 0}F(a)=\operatorname{Id}. Hence, for 0<a10<a\ll 1, F(a)F(a) is invertible, and therefore so is aF(a)=0af(t)𝑑taF(a)=\int_{0}^{a}f(t)dt.

The following is a very important property of one-parameter subgroups: that they all come from the exponential map.

Proposition 1.2.9.

Let f:GLn()f:\mathbb{R}\to\operatorname{GL}_{n}(\mathbb{C}) be a one-parameter subgroup with infinitesimal generator XX.

Then

f(t)=exp(tX)f(t)=\exp(tX)

for all tt\in\mathbb{R}. That is, all one-parameter subgroups arise from the exponential function.

Proof.

From the definition of one-parameter subgroups, we have

f(t)=lims0f(s+t)f(t)s=f(t)lims0f(s)f(0)s=f(t)f(0)=f(t)X.f^{\prime}(t)=\lim_{s\to 0}\frac{f(s+t)-f(t)}{s}=f(t)\lim_{s\to 0}\frac{f(s)-f% (0)}{s}=f(t)f^{\prime}(0)=f(t)X.

Now consider the differential equation

f(t)=f(t)X.f^{\prime}(t)=f(t)X.

By Lemma 1.2.6 we have exp(tX)\exp(tX) is a solution with the initial condition that f(0)=Idf(0)=\operatorname{Id}. To show it is a unique solution suppose that g(t)g(t) is also a solution. Then

(g(t)exp(tX))=g(t)exp(tX)g(t)exp(tX)X=g(t)(Xexp(tX)exp(tX)X)=0,\left(g(t)\exp(-tX)\right)^{\prime}=g^{\prime}(t)\exp(-tX)-g(t)\exp(-tX)X=g(t)% \left(X\exp(-tX)-\exp(-tX)X\right)=0,

and thus g(t)exp(tX)=DGLn()g(t)\exp(-tX)=D\in\operatorname{GL}_{n}(\mathbb{C}). Applying the initial conditions we get D=IdD=\operatorname{Id} and g(t)=exp(tX)g(t)=\exp(tX). ∎

Example 1.2.10.

The map SO(3)={gGL3()|ggT=Id,detg=1}GL3()\mathbb{R}\to\mathrm{SO}(3)=\{g\in\operatorname{GL}_{3}(\mathbb{R})\,|\,gg^{T}% =\operatorname{Id},\,\det g=1\}\subseteq\operatorname{GL}_{3}(\mathbb{R}) taking θ\theta to rotation by θ\theta about a fixed axis is a one-parameter subgroup. Problem 1.2.3 asks you to find its infinitesimal generator.

1.2.3. Exercises

.

Problem 3. Let 𝔫\mathfrak{n} be the \mathbb{C}-vector space of strictly upper triangular matrices (0’s on the diagonal) and let N={gGLn()|g=Id+X,X𝔫}N=\{g\in\operatorname{GL}_{n}(\mathbb{C})\,|\,g=\operatorname{Id}+X,X\in% \mathfrak{n}\}.

In this problem we will see that the restriction of the exponential to 𝔫\mathfrak{n} is a diffeomorphism onto NN.

  1. (a)

    Let X𝔫X\in\mathfrak{n}. Show that Xn=0X^{n}=0.

  2. (b)

    Show that exp(X)N\exp(X)\in N for X𝔫X\in\mathfrak{n}.

  3. (c)

    Show that, for gNg\in N, the logarithm log(g)=k=1(1)k+1(gI)kk\log(g)=\sum_{k=1}^{\infty}(-1)^{k+1}\tfrac{(g-I)^{k}}{k} is in fact a finite sum (and hence converges).

  4. (d)

    Show that exp|𝔫\exp|_{\mathfrak{n}} and log|N\log|_{N} are inverses of each other. Hint: this boils down to an identity of formal power series, which you can actually deduce from the corresponding fact over \mathbb{R}.

Problem 4. Using the previous question, fill in the gaps of the proof from the notes that

exp:𝔤𝔩n,GLn()\exp:\mathfrak{gl}_{n,\mathbb{C}}\rightarrow\operatorname{GL}_{n}(\mathbb{C})

is surjective.

  1. (a)

    Show all matrices of the form DD have a preimage.

  2. (b)

    Show all matrices of the form UU have a preimage.

  3. (c)

    Show all matrices of the form DUDU have a preimage.

  4. (d)

    Is the exponential map exp:𝔰𝔩2,SL2()\exp:\mathfrak{sl}_{2,\mathbb{C}}\rightarrow\operatorname{SL}_{2}(\mathbb{C}) surjective? What about exp:𝔤𝔩2,GL2+()\exp:\mathfrak{gl}_{2,\mathbb{R}}\rightarrow\operatorname{GL}^{+}_{2}(\mathbb{% R})?

Problem 5. Let 𝐯3{\bf v}\in\mathbb{R}^{3} be a unit vector and let f:SO(3)f:\mathbb{R}\to SO(3) be the map with f(θ)f(\theta) being rotation by θ\theta about the axis 𝐯{\bf v} (the angle is measured anticlockwise as you look along the vector from the origin).

Show that ff is a one-parameter subgroup and find its infinitesimal generator in terms of 𝐯{\bf v}.