Tags Math

In this quick post I'll dispel a common confusion in the basic math of complex numbers. It's often useful to calculate the norm-square (also known as absolute square) of a complex number z. This norm-square is denoted |z|^2. One could naively expect that:

\[|z|^2=zz\]

However, that's false! The way to calculate norm-square is:

\[|z|^2=zz^*\]

Where z^* is the complex conjugate of z. But why? To understand the mechanics of this calculation, let's start by defining what a norm is.

The norm of a complex number

Informally, a norm is a generalization of the concept of "length" or "size". For a real number r, the norm is its absolute value |r|. No matter if the number is positive or negative, the norm is its "length" - the distance from the origin.

A norm is defined similarly for complex numbers. Here's a standard geometrical interpretation of a complex number z, showing both cartesian and polar coordinates:

Cartesian and polar representation of a complex number

The "norm" of z is the length of the blue line, or the distance of its endpoint from the origin (in the polar representationz=re^{i\theta}, it's exactly r). The norm of a complex number uses the same notation as the absolute value: |z|, because it means the same thing. Another common name for the norm of complex numbers is modulus.

Norm: a formal definition

The formal definition of a norm (from Wikipedia) talks about generalized vector spaces. Since complex numbers are also a vector space (of dimension 1), we can simplify the definition just for \mathbb{C} as follows:

A norm on \mathbb{C} is a real-valued function p:\mathbb{C}\rightarrow\mathbb{R} with the following properties:

  1. Triangle inequality: p(z+w)\leq p(z)+p(w) for all z,w\in \mathbb{C}
  2. Absolute homogeneity: p(sz)=|s|p(z) for all z\in \mathbb{C} and all s\in \mathbb{R}
  3. Positive definiteness: for all z\in \mathbb{C} if p(z)=0 then z=0.
  4. Non-negativity: p(z)\geq 0 for all z\in \mathbb{C}

For a complex number z, setting p(z)=|z| makes all these properties work out intuitively:

  1. Graphically, complex numbers add like vectors; therefore, given z and w, the length |z+w| is always at most the combined lengths |z|+|w|.
  2. When we scale z by a scalar s, its length is scaled similarly. Think about the polar representation of a complex number: z=re^{i\theta}. Multiplying that by s, we get sz=sre^{i\theta}, which has length sr.
  3. In the polar representation: if |z|=0, then r=0, meaning that z=0.
  4. By definition, a length is non-negative; it's the distance from the origin.

Why z squared is not a norm-square

Now it's time to go back to the question we started the post with. Why isn't zz (or z^2) the norm-square?

For a general z=x+iy, we can calculate zz as follows:

\[\begin{align*} zz&=(x+iy)(x+iy)\\ &=x^2+iyx+ixy-y^2\\ &=x^2-y^2+2ixy \end{align*}\]

This is clearly a complex number, with a real and an imaginary component. It doesn't seem to fit the requirement of being a "length" or distance from the origin, which we'd expect to be a real quantity.

Multiplying z by itself in polar coordinates can be insightful:

\[\begin{align*} zz&=re^{i\theta}re^{i\theta}\\ &=rre^{i\theta+i\theta}\\ &=r^2 e^{i\cdot 2\theta} \end{align*}\]

It seems like we almost get what we want, because the magnitude of zz is r^2, which seems like the right magnitude for the square of the distance from origin; but there's still an angle... Here's how it looks geometrically:

Polar representation of zz

Recall that when multiplying two complex numbers - their magnitudes multiply, but their angles add. In this case, we got the r^2 we needed, but with an angle of 2\theta, which is not what we need. If only we could get rid of the angle somehow... keep this thought in your head for the next section.

Looking at the formal definition of the norm, it's clear right away that zz won't do. The norm is defined as a real-valued function, whereas zz is not real-valued.

zz* is a norm-square

The conjugate z^* of z is defined as:

\[z^*=x-iy\]

Or in polar form:

\[z^*=re^{-i\theta}\]

Recall how we had that pesky angle remaining when calculating zz? Let's find a way to get rid of it; since angles add when we multiply complex numbers, to get rid of \theta we should multiply z by something with an angle of -\theta. Do you see where this is going?

\[zz^*=re^{i\theta}re^{-i\theta}=r^2e^{i(\theta-\theta)}=r^2\]

Voila! Multiplying z by its complex conjugate gives us a real number. Moreover, it gives us exactly the number we want - r^2. This is the norm-square, or |z|^2. The norm of z is |z| and is precisely r, the "length" of the complex number.

Let's verify this works in cartesian coordinates:

\[\begin{align*} zz^*&=(x+iy)(x-iy)\\ &=x^2+iyx-ixy+y^2\\ &=x^2+y^2 \end{align*}\]

Indeed, this makes intuitive sense because:

\[|z|=\sqrt{zz*}=\sqrt{x^2+y^2}\]

And this is exactly what we expect when calculating the length of z.

Conclusion

The norm square of a complex number z is denoted |z|^2. In this post we've seen why

\[|z|^2\neq z^2\]

(even though the mathematical notation makes it seem like this should be true). Instead, this is how the norm-square is actually calculated:

\[|z|^2=zz^*\]

With the norm itself being

\[|z|=\sqrt{zz^*}\]

Appendix: zz* and the formal definition of norm

Let's get back to the formal definition of norm and show that zz* satisfies it.

First of all, we've already seen that zz* is a real number, no matter what z is. Therefore, it can indeed serve as a real-valued function p:\mathbb{C}\rightarrow\mathbb{R}.

The proof of property 1 is a bit tedious, but straightforward using the Cauchy-Schwartz inequality.

For property 2, since s is a real number, the square root of its square is just its absolute value:

\[|sz|=\sqrt{(sx)^2+(sy)^2}=\sqrt{s^2(x^2+y^2)}=|s|\sqrt{x^2+y^2}=|s||z|\]

For property 3, consider a z such that:

\[|z|=\sqrt{x^2+y^2}=0\]

Since neither addend inside the square root can be negative, clearly both x and y must be zero, meaning that z=0.

Property 4 is similarly straightforward: given real components x and y, |z|=\sqrt{x^2+y^2} is non-negative.