Representation Theory IV

1 Linear Lie groups and their Lie algebras

1.1 Linear Lie groups

We fix some notation:

  • 𝕂 denotes either the field or ;

  • 𝔤𝔩n,𝕂=Mn(𝕂) is the vector space of all n×n matrices over 𝕂.

Definition 1.1.

A (linear) Lie group is a closed subgroup of GLn(), for some n.

Remark 1.2.

The usual definition of a Lie group is a smooth manifold together with a group structure such that the group operations are smooth functions. It is a theorem (Cartan’s theorem, or the closed subgroup theorem) that every linear Lie group in the sense of definition 1.1 is a Lie group in this sense. Not every Lie group is a linear Lie group, but we will only be studying linear Lie groups; sometimes I might omit the word ‘linear’.

We give various examples (note that any subgroup defined by equalities of continuous functions will be closed):

  • the real general linear group GLn(): we simply impose the closed condition that all the entries of the matrix are real;

  • the (real or complex) special linear groups SLn(𝕂);

  • we may see (,+) as a linear Lie group in two ways: either as {(1x01):x}GL2() or as {ex:x}=(+,×).

  • if , is a bilinear form on n then we obtain a linear Lie group

    {gGLn():gv,gw=v,w}.

    There is a matrix A such that v,w=vTAw for all v,w; the bilinear form is symmetric if and only if A is symmetric, alternating if and only if A is skew-symmetric (AT=-A), and nondegenerate if and only if detA is nonzero. Then the group is:

    {gGLn():gTAg=A}.

    Some special cases follow.

  • The orthogonal and special orthogonal groups

    O(n)={gGLn():gTg=I}

    and SO(n)=O(n)SLn();

  • the unitary and special unitary groups

    U(n)={gGLn():gg=I}

    and SU(n)=U(n)SLn() (not strictly a special case of the above, but closely related);

  • the symplectic groups

    Sp(2n)={gGL2n():gTJg=J}

    where J=(0I-I0) and I is the n×n matrix with 1s on the antidiagonal and 0s elsewhere. This corresponds to a nondegenerate alternating bilinear form. Remark: it is also common to use J=(0In-In0).

We may visualise U(1) as the unit circle in the complex plane; the group SO(2) can also be visualised as this circle by identifying rotation by θ with eiθ.

The group SO(3) can be visualised as a unit ball in 3 with opposite points on its surface identified; this is identified in section 1.8 below.

Example 1.3.

Non-examples are GLn() (this is a subgroup of GLn(), but not closed), or (if α is an irrational real number) the subgroup

{(eix00eiαx):x}GL2().

This is a subgroup, isomorphic — as a group — to , but not closed. You should picture it as a string wound infinitely densely around a torus.

The idea of Lie theory is to simplify the study of these groups by just studying their structure ’very close to the identity’. This crucially uses that they are groups with a topology. By looking at the tangent spaces of these groups at the origin, you obtain Lie algebras; the group operation then turns into a structure called the Lie bracket.

1.2 The exponential map

Recall that 𝕂 denotes either or .

Definition 1.4.

Let X𝔤𝔩n,𝕂. We define

exp(X)=k=0Xkk!.

This series is convergent for all X𝔤𝔩n,𝕂. Let |||| be the matrix norm

||X||=(i,j|xij|2)1/2.

This satisfies the triangle inequality and also ||XY||||X||||Y|| — this can be proved using Cauchy–Schwarz. Then for any X𝔤𝔩n, with ||X||C, we have

||exp(X)||k=0||X||kk!k=0Ckk!=exp(C).

By the Weierstrass M-test, the power series for exp converges uniformly and absolutely on the closed ball {X:||X||C}, for any C>0. In particular, exp is uniformly absolutely convergent on all compact subsets of 𝔤𝔩n,𝕂, and hence is continuous.

Lemma 1.5.

We have (for all X,Y𝔤𝔩n,𝕂, s,t𝕂):

  1. 1.

    exp(0)=I.

  2. 2.

    exp(X+Y)=exp(X)exp(Y) if XY=YX. (This is NOT true in general).

  3. 3.

    exp(sX)exp(tX)=exp((s+t)X).

  4. 4.

    exp(X) is invertible, with inverse exp(-X). In particular, exp(X)GLn(𝕂).

  5. 5.

    gexp(X)g-1=exp(gXg-1).

Proof.

The first point is obvious. Let’s prove (2) from which (3) and (4) follow. By definition,

exp(X+Y) =k=0(X+Y)kk!
=k=0l=0k(kl)XlYk-lk! (using that X and Y commute!)
=k=0l=0kXlYk-ll!(k-l)!
=(l=0Xll!)(j=0Yjj!), (putting j=k-l)

which is equal to the right hand side. Rearranging the sums is valid by absolute convergence. Finally, (5) follows from gXkg-1=(gXg-1)k. ∎

Example 1.6.

Let X=(0100) and Y=(00-10). Then X2=Y2=0 so

exp(X)=I+X=(1101),exp(Y)=I+Y=(10-11).

But X+Y=(01-10), so exp(X+Y)=(cos1sin1-sin1cos1). Hence exp(X+Y)exp(X)exp(Y); see Problem 1.

In fact the exponential map is differentiable as a function of X. For this, recall that a function f:NM is differentiable at a point pN if there is a (necessarily unique) linear map Dpf:NM such that

limh0||f(p+h)-f(p)-Dpf(h)||||h||=0,

and in this case Dpf is called the derivative of f at p. (This definition is independent of the choice of norms on N and M).

Proposition 1.7.

The exponential map is differentiable at the origin (zero matrix), and its derivative at the origin is the identity map from 𝔤𝔩n, to itself.

Proof.

In the above definition we have, N=M=2n2, f=exp, p=0, and we claim D0exp is the identity. Thus we need to show

lim||X||0||exp(X)-exp(0)-X||||X||=lim||X||0||exp(X)-I-X||||X||=0,

which follows from the definition of the exponential map. Indeed,

||exp(X)-I-X||||X||=||k=2Xkk!||||X||||X||k=0||X||k(k+2)!<||X||e||X||,

which tends to zero as ||X||0. ∎

Remark 1.8.

In fact, the exponential function has derivatives to all orders at all points. One way to see this is that the matrix entries of exp are given by multivariable power series in the xij, and multivariable power series are differentiable (indeed, smooth) on the interior of their region of convergence. Caution: termwise derivatives of matrix functions cannot be calculated naively by applying the formal derivative of the power series. For example, it does not make sense to say that the derivative of X2 is 2X, so exp is not "its own derivative" in any meaningful sense.

By the inverse function theorem, it follows from the remark that

Corollary 1.9.

The exponential map is a “local diffeomorphism” at 0: there exist neighbourhoods U𝔤𝔩n,𝕂 containing 0 and VGLn(𝕂) containing I such that exp|U is a smooth bijection onto V with smooth inverse.

Remark 1.10.

In fact we can take V={XGLn():||X-I||<1}. The inverse of exp in this neighbourhood is

log(X)=k=0(-1)k(X-I)k+1k+1,

which is convergent when ||X-I||<1.

Of course, exp is not injective in general. For example, exp(2πik)=1 for k, and exp((02π-2π0))=I.

For the next result it will be useful to know the following facts from linear algebra.

Lemma 1.11.

Let XGLn(). Then X is conjugate to a matrix of the form DU where

  • D is diagonal

  • U is upper triangular with ‘1’s on the diagonal

  • D and U commute.

Proof.

(nonexaminable) This follows from Jordan normal form. Here’s a direct proof. Firstly write n as a direct sum of generalised eigenspaces for X: if λ is an eigenvalue of X then we can write the characteristic polynomial P(T)=(T-λ)aQ(T) where Q(T) does not have λ as a root and a1 is an integer. Then the image of Q(X) on n is the generalised eigenspace of λ. The kernel of Q(X) is preserved by X and X does not have an eigenvalue equal to λ since λ is not a root of Q(X), which must be the characteristic polynomial of X acting on kerQ(X). Thus

imQ(X)kerQ(X)={0}

and by the rank-nullity theorem

n=imQ(X)kerQ(X)

is a decomposition of n as a direct sum of the λ generalised eigenspace and a subspace preserved by X. Repeating for each eigenvalue gives the required decomposition of n. This reduces the proof of the statement to the case where X has only one eigenvalue λ. In this case, we can inductively choose a basis v1,,vn of n such that, for 1in, the image of vi in n/v1,,vi-1 is an eigenvector of X with eigenvalue λ. With respect to this basis, X is then upper triangular with λ’s on the diagonal, and we get the required decomposition with D=λI. ∎

Lemma 1.12.

We have

detexp(X)=exptr(X).
Proof.

Conjugate so that X is an upper triangular matrix with diagonal entries λ1,,λn, and then note that exp(X) is also upper triangular with diagonal entries exp(λ1),,exp(λn).

Thus

detexp(X)=i=1nexp(λi)=exp(i=1nλi)=exptr(X).

Lemma 1.13.

The exponential function exp:𝔤𝔩n,GLn() is surjective.

Proof.

Sketch. Details are in Problem 4. First prove it for D and U as in Lemma 1.11. The case of diagonal matrices is easy, and for U note that U-I is nilpotent so the power series for log(U) is a finite sum.

For general X, by conjugating we can reduce to the case where X=DU as above. If D=exp(d) and U=exp(u), then

DU=exp(d)exp(u)=exp(d+u)

because d and u commute (one must choose d and u carefully since exp is not injective). ∎

Remark 1.14.

The lemma is not true over ; by Lemma 1.12, the determinant of exp(X) is positive for all real matrices X.

The next proposition will be useful when we discuss Lie algebras of linear Lie groups.

Proposition 1.15.

(Lie product formula) We have

exp(X+Y)=limk(exp(Xk)exp(Yk))k.
Proof.

Let

F(t)=log(exp(tX)exp(tY)),

noting that for t sufficiently small, exp(tX)exp(tY) lies in the neighbourhood of I where log is defined. Since log and exp are differentiable and the product of differentiable functions is differentiable, we apply the chain rule to see that F(0)=X+Y. Therefore

F(t)=t(X+Y)+o(t).

(Here, the notation o(f), for any function f, denotes some function g such that g(t)/f(t)0 as t tends to some limit, usually 0 or . In this case, as t0.)

Setting t=1k and multiplying by k we get

kF(t)=X+Y+ko(1/k)=X+Y+o(1).

Therefore

(exp(Xk)exp(Yk))k=exp(kF(1k))=exp(X+Y+o(1)).

Taking the limit as k gives the result. ∎

1.3 One-parameter subgroups

Lemma 1.16.

The map from to GLn() given by

texp(tX)

is a differentiable group homomorphism.

We have

ddtexp(tX)=Xexp(tX)=exp(tX)X.

In particular,

ddtexp(tX)|t=0=X.
Proof.

The given map is a group homomorphism by Lemma 1.5 part 4.

By definition,

exp(tX)=k=0Xkk!tk.

As this power series (and its termwise derivative) are uniformly convergent on any compact subset, we can compute its derivative by differentiating termwise, which gives

ddtexp(tX)=k=1Xk(k-1)!tk-1=Xexp(tX).
Definition 1.17.

A one-parameter subgroup of GLn() is a differentiable group homomorphism f:GLn(). That is, a differentiable map such that

f(s+t)=f(s)f(t)

for all s,t.

The infinitesimal generator of a one-parameter subgroup f is the element f(0)𝔤𝔩n,.

By the previous lemma, for every X𝔤𝔩n, the map texp(tX) is a one-parameter subgroup, with infinitesimal generator X.

Example 1.18.

Let f:SO(3) be rotation about the z-axis:

f(t)=(cost-sint0sintcost0001).

Then f is a one-parameter subgroup, and its infinitesimal generator is

f(0)=(0-10100000).

In fact f(t)=exp(tf(0)). Problem 5 asks you to generalize this to rotation about an arbitrary axis.

Remark 1.19.

(non-examinable) For a one-parameter subgroup f, it actually suffices to require that f is continuous. Differentiability then comes for free.

Indeed, if f is continuous, the integral 0af(t)𝑑t exists. Moreover,

f(s)0af(t)𝑑t=0af(s+t)𝑑t=ss+af(t)𝑑t.

The RHS is differentiable with respect to s by the fundamental theorem of calculus. Therefore, to prove that f(s) is differentiable, we only need to show that there is an a>0 such that 0af(t)𝑑t is an invertible matrix. Now consider the function

F(a)=1a0af(t)𝑑t.

It is well-defined for a0 and lima0F(a)=I. Hence, for 0<a1, F(a) is invertible, and therefore so is aF(a)=0af(t)𝑑t.

The following is a very important property of one-parameter subgroups: that they all come from the exponential map.

Proposition 1.20.

Let f:GLn() be a one-parameter subgroup with infinitesimal generator X.

Then

f(t)=exp(tX)

for all t. That is, all one-parameter subgroups arise from the exponential function.

Proof.

From the definition of one-parameter subgroups, we have

f(t)=lims0f(s+t)-f(t)s=f(t)lims0f(s)-f(0)s=f(t)f(0)=f(t)X.

Now consider the differential equation

g(t)=g(t)X.

We know that both f(t) and exp(tX) are both solutions with the same initial condition that g(0)=I. Therefore they must be equal. ∎

1.4 Lie algebras

Definition 1.21.

Let GGLn() be a linear Lie group. We define its Lie algebra by

𝔤={X𝔤𝔩n,:exp(tX)G for all t}.

In other words, it is the set of X such that the one-parameter subgroup infinitesimally generated by X is contained in the group G. We write Lie(G) for this Lie algebra.

Remark 1.22.

It is not true that 𝔤={X𝔤𝔩n,:exp(X)G}. This is not even true for G={1}, why?

The Lie algebra can also be defined more geometrically as the tangent space to G at the identity; the above definition then becomes the “exponential characterization” of the Lie algebra. The equivalence is given by the following theorem:

Theorem 1.23.

With G and 𝔤 as above, we have

𝔤={X𝔤𝔩n,:X=γ(0) for some differentiable map γ:[-a,a]Ga>0}.

In other words, 𝔤 is the set of all possible tangent vectors to curves in G passing through I.

Proof.

We show that X lies in the right hand side (i.e. is a tangent vector) if and only if exp(tX)G for all t.

If exp(tX)G for all t then we may take the derivative of γ(t)=exp(tX) and conclude that X is a tangent vector at I.

Now assume that X is a tangent vector at I. Then there is a differentiable map γ:[-a,a]G, for some a>0, such that γ(0)=I and X=γ(0). By the local inverse property of exp, there is a neighbourhood of I on which log is defined and differentiable. For |t| small, set

A(t)=log(γ(t)).

Then A(0)=0, and by the chain rule (using that the derivative of log at I is the identity map) we have A(0)=X. Hence

A(t)=tX+o(t)

as t0.

Fix any t. For k large, t/k(-a,a) and γ(t/k) lies in the domain of log, so

(γ(tk))k=exp(kA(tk))=exp(tX+o(1))G.

Since G is closed and

limk(γ(tk))k=exp(tX),

with γ(t/k)G for all k, we conclude that exp(tX)G. ∎

We define the dimension of the Lie group G to be the dimension of the associated Lie algebra 𝔤. We now compute the Lie algebras of many of the groups that we are interested in:

Proposition 1.24.

The Lie algebras of GLn(𝕂), SLn(𝕂), O(n), SO(n), U(n), and SU(n) are given by

G𝔤=dim(𝔤)GLn(𝕂)𝔤𝔩n,𝕂dim𝕂𝔤𝔩n,𝕂=n2SLn(𝕂)𝔰𝔩n,𝕂={X𝔤𝔩n,𝕂:tr(X)=0}dim𝕂𝔰𝔩n,𝕂=n2-1O(n),SO(n)𝔬n=𝔰𝔬n={X𝔤𝔩n,:X+XT=0}dim𝔰𝔬n=n(n-1)2U(n)𝔲n={X𝔤𝔩n,:X+X=0}dim𝔲n=n2SU(n)𝔰𝔲n={X𝔲n:tr(X)=0}dim𝔰𝔲n=n2-1Sp(2n)𝔰𝔭2n={X𝔤𝔩2n,:XTJ+JX=0}dim𝔰𝔭2n=n(2n+1)

Here 𝔰𝔬n is the space of skew-symmetric real matrices and 𝔲n is the space of skew-Hermitian matrices.

Proof.

The first one is obvious for 𝕂= and left as an exercise for 𝕂=. For the second one, first suppose that tr(X)=0. Then detexp(tX)=exptr(tX)=1 so X𝔰𝔩n,𝕂. Conversely, if X𝔰𝔩n,K then 1=detexp(tX)=exp(ttr(X)) for all t; differentiating at t=0 gives tr(X)=0 as required.

For the third one, we need to find all X such that

exp(tX)exp(tX)T=exp(tX)exp(tXT)=I (1.1)

for all t. Taking the derivative for both sides with respect to t, we obtain

Xexp(tX)exp(tXT)+exp(tX)exp(tXT)XT=0.

Evaluating at t=0, we get

X+XT=0.

Thus 𝔬n{X𝔤𝔩n,:X+XT=0}.

Conversely, if X+XT=0, then equation (1.1) holds because

exp(tX)T=exp(tXT)=exp(-tX)=exp(tX)-1.

Thus {X𝔤𝔩n,:X+XT=0}𝔬n and we have equality by the opposite containment above.

For the dimension, notice that X satisfying X=-XT is determined by its upper triangular part and that the diagonal entries must be all zeros; as there are n(n-1)2 entries strictly above the diagonal, that is the dimension of 𝔬n.

Finally, it is clear that 𝔰𝔬n𝔬n. If X𝔬n then X+XT=0 and so tr(X)=0. Thus exp(tX)SLn()O(n)=SO(n) for all t and so X𝔰𝔬n, giving the claimed equality.

The unitary and symplectic Lie algebras can be computed in a similar way — homework! ∎

Proposition 1.25.

Let 𝔤 be the Lie algebra of a (linear) Lie group G. Then

  1. 1.

    𝔤 is a real vector space (inside 𝔤𝔩n,).

  2. 2.

    If X𝔤 and if gG, then gXg-1𝔤.

  3. 3.

    For X,Y𝔤,

[X,Y]=XY-YX𝔤.
Proof.

For the first part we must show closure under scalar multiplication and addition. For scalar multplication, we reparametrise: suppose X𝔤 and λ. Then texp(tλX) lies in G (as λt is still a real number) and so λX𝔤.

For addition, let X,Y𝔤. Then exp(tX),exp(tY)G for all t, so the product

γ(t)=exp(tX)exp(tY)

is a differentiable path in G with γ(0)=I and γ(0)=X+Y. Hence X+Y𝔤 by Theorem 1.23.

Part (2) follows from exp(t(gXg-1))=gexp(tX)g-1G and the definition of 𝔤.

For part (3), we know by part (2) that, for X,Y𝔤,

exp(tX)Yexp(-tX)𝔤.

Then

ddtexp(tX)Yexp(-tX)|t=0 =(Xexp(tX)Yexp(-tX)-exp(tX)Yexp(-tX)X)|t=0
=XY-YX.

But also by definition

ddtexp(tX)Yexp(-tX)|t=0=limt0exp(tX)Yexp(-tX)-Yt.

This is a limit of elements of the vector space 𝔤, which is a closed subset of 𝔤𝔩n,𝕂, and so must itself be an element of 𝔤. ∎

Definition 1.26.

A Lie algebra 𝔤 is an -vector space together with a bilinear map (Lie bracket)

[,]=𝔤×𝔤𝔤

that satisfies the following properties.

  1. 1.

    It is alternating: [Y,X]=-[X,Y] for all X,Y𝔤.

  2. 2.

    The Jacobi identity holds:

    [X,[Y,Z]]+[Y,[Z,X]]+[Z,[X,Y]]=0

    for all X,Y,Z𝔤.

Definition 1.27.

If 𝔤 is a Lie algebra, then a Lie subalgebra is a subspace 𝔥𝔤 that is closed under the Lie bracket.

Proposition 1.28.

The Lie bracket

[X,Y]=XY-YX

makes 𝔤𝔩n,𝕂 into a Lie algebra.

If GGLn(𝕂) is a Lie group, then 𝔤 is a (real) Lie subalgebra of 𝔤𝔩n,𝕂.

Proof.

For the first part, simply check the axioms directly (the Jacobi identity is a bit of a pain…).

We have already shown the second part (we only have to show that 𝔤 is closed under the bracket). ∎

Before computing the Lie algebras of various matrix groups, we give an example that doesn’t seem to be of this form.

Example 1.29.

Let 𝔤=3 and let [v,w]=v×w. Then this is a Lie algebra (just check the axioms).

In fact, 𝔤𝔰𝔬3. To see this, send the vector v to the infinitesimal generator of the one parameter subgroup of SO(3) given by ‘rotating around the axis v at speed |v|’.

Definition 1.30.

A Lie algebra 𝔤 is called abelian if [X,Y]=0 for all X,Y𝔤.

Definition 1.31.

The center of a Lie algebra 𝔤 is

{Z𝔤:[Z,X]=0 for all X𝔤}.

It is an abelian Lie subalgebra of 𝔤.

1.5 Lie group and Lie algebra homomorphisms

Definition 1.32.

A Lie group homomorphism ϕ:GG between two linear Lie groups G and G is a continuous group homomorphism.

An isomorphism is a bijective Lie group homomorphism whose inverse is also continuous.

Remark 1.33.

In fact, a continuous homorphism between linear Lie groups is automatically a smooth map of smooth manifolds, and if it is bijective then the inverse is automatically continuous.

Definition 1.34.

A homomorphism φ:𝔤𝔤 of Lie algebras is an -linear map such that

φ([X,Y])=[φ(X),φ(Y)]

for all X,Y𝔤.

An isomorphism is an invertible homomorphism.

Definition 1.35.

Let ϕ:GG be a Lie group homomorphism. Define the derivative (or derived homomorphism)

Dϕ:𝔤𝔤

by

Dϕ(X)=ddtϕ(exp(tX))|t=0

for X𝔤. 11 1 We can justify taking the derivative by appealing to Remark 1.19.

Remark 1.36.

In fact, Dϕ is the derivative of ϕ at the identity in the sense of smooth manifolds; recall that 𝔤 and 𝔤 are the tangent spaces to G and G at the identity.

Theorem 1.37.

Let ϕ:GG be a Lie group homomorphism with derivative Dϕ. Then

  1. 1.

    The following diagram commutes:

    𝔤Dϕ𝔤expexpGϕG.

    That is, for X𝔤 we have

    ϕ(exp(X))=exp(Dϕ(X)).
  2. 2.

    For all gG,X𝔤,

    Dϕ(gXg-1)=ϕ(g)Dϕ(X)ϕ(g)-1.
  3. 3.

    The map Dϕ:𝔤𝔤 is a Lie algebra homomorphism.

Proof.
  1. 1.

    Consider the one parameter subgroup f:G defined by f(t)=ϕ(exp(tX)). By construction, f(0)=Dϕ(X). By Proposition 1.20, one parameter subgroups are determined by their derivative at 0, so that we must have

    ϕ(exp(tX))=f(t)=exp(tDϕ(X)).
  2. 2.

    We have

    Dϕ(gXg-1) =ddtϕ(exp(tgXg-1))|t=0
    =ddtϕ(gexp(tX)g-1)|t=0
    =ddtϕ(g)ϕ(exp(tX))ϕ(g-1)|t=0
    =ϕ(g)ddtϕ(exp(tX))|t=0ϕ(g-1)
    =ϕ(g)Dϕ(X)ϕ(g)-1,

    as claimed.

  3. 3.

    To show that Dϕ is a Lie algebra homomorphism, we need to show that

    • Dϕ is -linear; and

    • Dϕ([X,Y])=[Dϕ(X),Dϕ(Y)].

    So let X,Y𝔤 and s. By definition,

    Dϕ(sX) =ddtϕ(exp(tsX))|t=0.
    If we now set μ=st, we can rewrite this as:
    ddtϕ(exp(tsX))|t=0 =sddμϕ(exp(μX))|μ=0
    =sDϕ(X).

    So Dϕ commutes with scalar multiplication. For additivity, we have

    Dϕ(X+Y)=ddtϕ(exp(t(X+Y)))|t=0.

    On the other hand, by Proposition 1.15 and using part (1)

    ϕ(exp(t(X+Y))) =limkϕ((exp(tkX)exp(tkY))k)
    =limkϕ(exp(tkX))ϕ(exp(tkY))k
    =limk(exp(tkDϕ(X))exp(tkDϕ(Y)))k
    =exp(t(Dϕ(X)+Dϕ(Y))).

    Taking the derivative at t=0, we conclude that

    Dϕ(X+Y)=Dϕ(X)+Dϕ(Y),

    showing additivity.

    Finally we show that Dϕ respects the Lie bracket. Let X,Y𝔤. By parts (1) and (2) we have

    Dϕ(exp(tX)Yexp(-tX))=exp(tDϕ(X))Dϕ(Y)exp(-tDϕ(X)).

    Take the derivative for both sides at t=0: the derivative of the RHS is [Dϕ(X),Dϕ(Y)], and the derivative of the LHS is Dϕ([X,Y]) (as Dϕ is linear).∎

1.6 Complex Lie groups and holomorphic homomorphisms

Definition 1.38.

A complex linear Lie group is a closed subgroup of GLn() whose Lie algebra is a complex subspace of 𝔤𝔩n, (as opposed to just a real subspace).

Note that 𝔲n and 𝔰𝔲n are only real Lie algebras and correspondingly U(n) and SU(n) are only real Lie groups, even though they consist of complex matrices. On the other hand, 𝔤𝔩n, and 𝔰𝔩n, are complex Lie algebras, so GLn() and SLn() are complex Lie groups.

A complex Lie algebra is a -vector space with a -bilinear Lie bracket satisfying the same axioms as for a Lie algebra. Thus the Lie algebra of a complex Lie group may be viewed as a complex Lie algebra (since the Lie bracket on 𝔤𝔩n, is clearly -bilinear).

Definition 1.39.

Suppose that G and G are complex Lie groups and ϕ:GG is a homomorphism. Then ϕ is holomorphic if Dϕ:𝔤𝔤 is -linear.

(This implies that ϕ is a holomorphic map of complex manifolds.)

Example 1.40.

The map det:GL2,GL1, is holomorphic. The map GL2,GL2, sending g to g¯ is not holomorphic.

1.7 Topological properties

We have shown that the differential ϕDϕ gives a map

D:Hom(G,G)Hom(𝔤,𝔤).

This raises two natural questions:

  1. 1.

    Is the map injective? Does the derivative Dϕ uniquely determine the Lie group homomorphism ϕ?

  2. 2.

    Is the map surjective? Or in other words, does every Lie algebra homomorphism φ ’exponentiate’ (or ’lift’) to a Lie group homomorphism ϕ such that Dϕ=φ? We say ’exponentiate’ since if yes, then ϕ would need to satisfy ϕ(exp(X))=exp(φ(X)). So this gives a formula for ϕ, at least on the image of exp. The question is whether this is well-defined (the exponential map is neither injective nor surjective in general) and whether this defines a homomorphism.

The answer to these questions is actually of topological nature, which we discuss in this subsection.

While we only defined linear Lie groups to be closed subgroups of GLn(), in fact they have much nicer topological properties than arbitrary closed subsets (which can be pretty wild, like the Cantor set).

Theorem 1.41.

(closed subgroup theorem) Let GGLn() be a closed subgroup, and let 𝔤 be its Lie algebra. Then for every gG there is an open subset gUG and an open subset 0V𝔤 such that Xgexp(X) is a homeomorphism VU. See Figure 1.

Remark 1.42.

An open subset is a subset UG such that U=GU for some open subset UGLn() (which has the usual definition in terms of open balls). Likewise, an open subset of 𝔤 means open in its usual Euclidean topology as a finite-dimensional complex vector space.

A homeomorphism is a continuous bijection with continuous inverse.

Chart
Figure 1: Chart ϕ in closed subgroup theorem.
Proof.

(Sketch, nonexaminable.) Since, for any g, the map ‘multiply by g’ is continuous with continuous inverse, it suffices to prove this when g is the identity element. Choose a sufficiently small open neighbourhood W of 0𝔤𝔩n, such that exp:Wexp(W) is a homeomorphism onto an open neighbourhood of IGLn() (for this, you can use the open mapping theorem). Set V=W𝔤 and U=exp(W)G. The tricky point is to show that, if gG is sufficiently close to the identity (so that log(g) is defined), then log(g)𝔤; this ensures that exp restricts to a homeomorphism VU. ∎

If you don’t want to take this theorem on faith, then feel free to include its conclusion as part of the definition of a linear Lie group (in all our examples, it would be straightforward to verify).

We say that G is connected if, for every x,yG, there is a continuous function (or path) γ:[0,1]G with γ(0)=x and γ(1)=y. For those of you taking courses in topology, this is actually the definition of path-connected; however, it follows from the closed subgroup theorem that Lie groups are locally path-connected, and being path-connected is equivalent to being connected for such spaces.

Let G be a linear Lie group and let G0 be the set of all gG such that there is a continuous path γ:[0,1]G with γ(0)=I and γ(1)=g.

Proposition 1.43.

The subset G0 is a normal subgroup of G.

Proof.

Let g,hG0, and let γ1,γ2 be paths with γi(0)=I and γ1(1)=g,γ2(1)=h.

Then define a path from I to gh by following γ1 and then gγ2. Concretely, define γ:[0,1]G by

γ(t)={γ1(2t)0t1/2gγ2(2t-1)1/2t1

and observe that this is a continuous path from I to gh. This shows G0 is closed under multiplication.

The identity and inverse axioms, and the normality, are left as exercises. ∎

Proposition 1.44.

The subgroup G0 is an open and closed subset of G.

Proof.

Firstly, if HG is an open subgroup then

GH=gGHgH

is a union of open subsets, so H is also closed.

To show G0 is open, it suffices to show that it contains an open subset U containing the identity, as then gU is an open subset containing g, for all gG. If V is a sufficiently small open ball around 0𝔤 then by Theorem 1.41 exp(V) is an open subset around IG, and exp(V) is path-connected since V is. Thus exp(V)G0 as required. ∎

It follows from this result that the quotient topology on G/G0 is discrete. 22 2 If you don’t know the definition of the quotient topology, please ignore this.

It is clear that G is connected if and only if G=G0.

Exercise 1.45.

Prove that SO(n) is connected for n2. Hint: show that it is path-connected, by induction.

Proposition 1.46.

The group SLn() is connected.

Proof.

Omitted, but here is a sketch.

  1. 1.

    Use Gram–Schmidt orthogonalisation to show that SLn()=SO(n)N+ where N+ is the group of upper triangular matrices with positive diagonal entries.

  2. 2.

    Show that SO(n) is connected (see previous exercise) and N+ is connected.

  3. 3.

    Deduce that SLn() is connected.

Remark 1.47.

There is an alternative proof: show that SLn() is generated by elementary matrices, and then connect every elementary matrix to the identity.

Among the Lie groups related to this course, GLn(), SLn(), SLn(), U(n), SU(n), SO(n), and Sp(2n) are connected, while GLn() and O(n) are not connected, with their connected components being GLn+()={gGLn():detg>0} and SO(n) respectively.

Proposition 1.48.

If X𝔤, then exp(X)G0.

Proof.

Note that for any X𝔤, the image {exp(tX):t} defines a curve in G containing the identity IG. This curve is in the connected component of the identity. So exp(tX)G0 for all t. ∎

Theorem 1.49.

Let G be a Lie group and 𝔤 be its Lie algebra. Then the subgroup generated by exp(𝔤) is G0.

In particular, if G is connected, then each element of G is a (non-unique) product of a finite number of exponentials.

Proof.

Since exp(𝔤) contains an open neighbourhood of the identity (by Theorem 1.41), it follows that exp(𝔤) generates an open subgroup H of G0, which is then necessarily closed. But since G0 is connected it has no proper nonempty open and closed subsets,33 3 This argument uses the basic fact that a path-connected topological space is connected. and so H=G0. ∎

As a corollary we immediately obtain the answer to the first question above.

Proposition 1.50.

Let G be a connected (linear) Lie group and let ϕ:GG be a Lie group homomorphism. Then the differential Dϕ:𝔤𝔤 uniquely determines ϕ.

Proof.

Since ϕ(exp(X))=exp(Dϕ(X)), the values Dϕ(X) determine ϕ on the subgroup generated by the exp(X), which is exactly G0=G. ∎

Exercise 1.51.

Show that, if G is a connected (linear) Lie group with Lie algebra 𝔤, then G is abelian if and only if 𝔤 is (see Definition 1.30).

What goes wrong when G is not connected?

Example 1.52.

Any finite group G can be embedded in GLn() for some n, and so regarded as a linear Lie group. Its Lie algebra is the zero vector space, so the derivative of a homomorphism GH is always zero. In other words, the Lie algebra knows nothing in this case.

Example 1.53.

Recall that on the orthogonal group O(n), the determinant (which is a continuous map!) takes the values {±1}. Hence O(n) is not connected; SO(n) is the connected component of the identity. (This is related to 𝔰𝔬n=𝔬n; that is, the condition X=-Xt automatically implies that X has trace zero and hence that exp(X) has determinant 1.)

Correspondingly, the determinant det on O(n) has zero differential, as it is constant on an open neighbourhood of the identity. This means that the differential on O(n) cannot distinguish the determinant from the trivial map (g1×).

We now turn to the second question, whether every Lie algebra homomorphism exponentiates to a Lie group homomorphism. In the light of what we have seen, it is sensible to restrict to the case of connected Lie groups. However, even with this restriction, the answer is in general no, as the next example shows!

Example 1.54.

The linear Lie groups GL1+()=>0 and U(1)={z:|z|=1} both have Lie algebra with trivial Lie bracket; in the second case we get the subspace i of 𝔤𝔩1, and identify it with by dividing by i.

The Lie algebra homomorphisms are all of the form ϕa:tat for some a. We consider which of these exponentiate to homomorphisms of Lie groups.

  1. 1.

    The map ϕa always exponentiates to a map >0>0, specifically the map

    xealog(x)=xa.
  2. 2.

    The map ϕa always exponentiates to a map >0U(1), specifically the map

    xeialog(x).
  3. 3.

    The map ϕa never exponentiates to a map U(1)>0 if a0. If it did, the map would have to send

    eixeax,

    and setting x=2π gives a=0.

  4. 4.

    The map ϕa exponentiates to a map U(1)U(1) if and only if a, in which case the map is

    zza.

    Indeed, the map would have to be

    eixeiax

    and setting x=2π shows that a, when the map is as claimed.

Exercise 1.55.

Check that the Lie algebra of SO(2) is also isomorphic to . Write down an isomorphism U(1)SO(2); what is the identification of Lie algebras it induces?

The key difference between >0 and U(1) is that the former is simply connected while the latter is not (it has fundamental group ). We explain this a bit further.

Recall we call a topological/metric space X simply connected if it is path-connected and if every loop can be continuous shrunk to a single point; rigorously, if every continuous map from the unit circle to X can be extended to a continuous map from the unit disc to X. In topology, the failure of a space to be simply connected is measured by the ’fundamental group’ π1(X): X is simply-connected if and only if π1(X) is trivial.

Theorem 1.56.

Let G be a simply connected (linear) Lie group. Let G be any other (linear) Lie group. Let 𝔤 and 𝔤 be their Lie algebras. Then every homomorphism 𝔤𝔤 exponentiates to a unique homomorphism GG.

Hence we have a 1-1 correspondence

{Hom(G,G)}{Hom(𝔤,𝔤)}.
Proof.

This is beyond the scope of this course. Note in the above example GL1+() is simply connected while the circle group U(1) is not. ∎

One can show that SLn() and SU(n) are simply connected. Here is a small table showing our connected groups and their fundamental groups.

G π1(G)
GLn()
SLn() 1
SL2()
SLn(), n3 C2
SO(2)
SO(n), n3 C2
U(n)
SU(n) 1
Sp(2n)
Remark 1.57.

It is not an accident that the fundamental groups of SLn() and SO(n) are isomorphic — Gram–Schmidt orthogonalization, as used in the proof of Proposition 1.46, shows that SLn() and SO(n) are homotopy equivalent. A similar remark applies to SLn() and SU(n).

If G is not connected, or its identity component is not simply connected, we can work in the following way.

  • There exists a ’universal cover’ G~ of G0 which is simply connected, and also has the structure of a Lie group (not necessarily linear, unfortunately). There is a surjective group homomorphism π:G~G0 with discrete kernel Zπ1(G0), so that G0G~/Z.

  • The kernel Z of π is isomorphic to the fundamental group π1(G0).

  • Homomorphisms out of G0 are in 1-1 correspondence with homomorphisms out of G~ which are trivial on Z.

  • The Lie algebras of G, G0 and G~ coincide (more precisely, the maps π:G~G0 and ι:G0G induce isomorphisms of Lie algebras).

  • In general G/G0 can be an arbitrary finite group! For this reason, it is common to restrict attention to connected Lie groups.

The diagram looks as follows:

𝔤expG~π𝔤expG0ιG.
Example 1.58.

The group U(1) is not simply connected. Here the universal cover is (,+) (this is a linear Lie group because it is isomorphic to the upper triangular 2x2 matrices with 1s on the diagonal — a similar argument shows that any vector space (with addition) is a Lie group). The map G~G0 is then

π: U(1)
x e2πix

and we see that the kernel of π is , which is indeed the fundamental group π1(U(1)).

1.8 The example of SU(2) and SO(3)

We illustrate the previous section with the example of SO(3). According to Table 1.7, the fundamental group of SO(3) is C2. We can visualize this as follows: An element of SO(3) is rotation by some angle θ[0,π] about some (oriented) axis. We can represent this as a vector in 3 of length θ in the direction of the axis. Elements of SO(3) then correspond to points in the closed ball in 3 of radius π. However, rotation by π about the axis v is the same as rotation by π about the axis -v, and so we must identify diametrically opposite points on the boundary of this ball.

Picture of

Figure 2: Picture of SO(3)

Now, the straight line in this three-dimensional sphere from a point on the boundary to its diametrically opposite point is a loop in SO(3) since the endpoints represent the same rotation. You can convince yourself that this loop cannot be shrunk to a point (proving it rigorously requires some topology). However, if you go around the loop twice, then that can (!) be shrunk to a point. The idea is to move one copy of the loop out to the boundary, then use the ‘opposite point’ identification to move it to the other side, when you get a normal loop inside the ball which may be shrunk. See Figure 3.

Contracting twice a loop
Figure 3: Contracting twice a loop

A nice physical illustration of this is provided by the “Dirac belt trick”; here is a video of this demonstrated with long hair!

According to the general picture of the previous section, there should be a Lie group homomorphism G~SO(3) whose kernel has order 2 and such that G~ is simply connected, and it turns out that we can take G~=SU(2). So we study this group for a bit.

Firstly, one can show (see problem 12) that every element of SU(2) has the form

(a-b¯ba¯)

for a,b with |a|2+|b|2=1. It follows that SU(2) is homeomorphic to the unit sphere S3 in 4, which is simply connected (that is, there is a continuous bijection SU(2)S3 with continuous inverse).

In Problem 8 we constructed an isomorphism of Lie algebras 𝔰𝔲2𝔰𝔬3. Since SU(2) is simply connected, Theorem 1.56 implies that this exponentiates to a Lie group homomorphism

π:SU(2)SO(3).

We will construct π explicitly later, using the adjoint representation (see Section 2.4).

For now, we record the key facts: π is surjective and

ker(π)={±I}C2.

In particular SO(3)SU(2)/{±I}, so π realises SU(2) as the universal cover of SO(3), and the kernel {±I} is (canonically) isomorphic to π1(SO(3)).