Fermat's Last Theorem: 2006-05-07

Saturday, May 13, 2006

Cyclotomic Integers: Factoring Fermat's Last Theorem

Today's blog continues the discussion of Kummer's proof of Fermat's Last Theorem for regular primes. If you would like to review the historical context for this proof, start here.

The major reason why cyclotomic integers are interesting in relation to Fermat's Last Theorem is because they enable us to factor Fermat's Last Theorem in the following way:

zⁿ = xⁿ + yⁿ = (x + y)(x + αy)(x + α²y) .... (x + α^n-1y)

Below I will show how I can derive this factoring using the Fundamental Theorem of Algebra.

Lemma 1: Let α be a primitive root of unity such that n is an odd prime and αⁿ = 1, and let x,y,z be integers such that xⁿ + yⁿ = zⁿ, then:

zⁿ = xⁿ + yⁿ = (x + y)(x + αy)(x + α²y) .... (x + α^n-1y)

Proof:

(1) We know that xⁿ - 1 has n root from the Fundamental Theorem of Algebra.

(2) We also note that for all αⁱ where 0 ≤ i ≤ n-1, we have (αⁱ)ⁿ = 1.

NOTE: αⁿ = 1 so it is really the same as α⁰.

(3) Based on #2, the Fundamental Theorem of Algebra gives us:

xⁿ - 1 = (x - 1)*(x - α)*(x - α²)*...*(x - α^n-1)

QED

Theorem 1: if n is odd, then zⁿ = xⁿ + yⁿ = (x + y)(x + αy)(x + α²y) .... (x + α^n-1y)

Proof:

(1) aⁿ - 1 = (a - 1)*(a - α)*(a - α²)*...*(a - α^n-1) [From Lemma 1 above]

(2) Since a can be any value, let a = -x/y so that:

(-x/y)ⁿ - 1 = [(-x/y) - 1]*[(-x/y) - α]*...*[(-x/y) - α^n-1] = -(x)ⁿ/yⁿ - 1

(3) If we multiply (-y)ⁿ=-(yⁿ) to both sides, we get:

xⁿ + yⁿ = (x + y)*(x + yα)*...*(x + α^n-1y)

QED

Cyclotomic Integers: Division Algorithm

Friday, May 12, 2006

Cyclotomic Integers: Units and Primes

Tuesday, May 09, 2006

Basic Properties of Cyclotomic Integers

Today's blog continues the discussion of Kummer's proof of Fermat's Last Theorem for regular primes. If you would like to review the historical context for this proof, start here.

Today, I will review the basic properties of cyclotomic integers. Today's content comes directly from Chapter 4 of Harold M. Edwards' Fermat's Last Theorem: A Genetic Introduction to Algebraic Number Theory.

1. Notation

For Kummer's notation, he used λ to represent the odd prime number and α to represent the root of unity so that we have:

Definition 1:
α^λ = 1

2. Standard Form of Cyclotomic Integers

Lemma 1:
If a₀, a₁, ... a_λ-1 are integers, then all cyclotomic integers for a given value of λ can be represented in the following form:

a₀ + a₁α + a₂α² + ... + a_λ-1α^λ-1

Proof:

(1) Let's assume that we have cyclotomic integer = a₀ + a₁α + a₂α² + ... + a_λ-1α^λ-1 + a_λα^λ

(2) By definition 1 above, α^λ = 1

(3) So that we have:

(a₀ + a_λ) + a₁α + a₂α² + ... + a_λ-1α^λ-1

(4) We can do the same thing for any power of αⁱ where i ≥ λ

(5) So we can conclude that all values can be reduced to the form required.

QED

Lemma 2: For any given value of λ, 1 + α+α² + ... + α^λ-1 = 0

Proof:

(1) Since α^λ = 1, we have:

1 + α+α² + ... + α^λ-1 =α^λ + α+α² + ... + α^λ-1 =

= α(α^λ-1 + 1 + α+α² + ... + α^λ-2)

(2) Now, we know that α ≠ 0 since 0^λ = 0 which contradicts with definition 1.

(3) We also know that α ≠ 1 since α is a λth root of unity [using Euler's Identity, see here], we know that α = e^2iπ/λ

(4) So, therefore, 1 + α+α² + ... + α^λ-1= 0

QED

Corollary 2.1: for any given integer c, a₀ + a₁α + a₂α² + ... + a_λ-1α^λ-1= (a₀ + c) + (a₁ + c)α + (a₂ + c)α² + ... + (a_λ-1 + c)α^λ-1.

Proof:

(1) 1 + α + α² + ... + α^λ-1= 0 [From Lemma 2 above]

(2) c + cα + cα² + ... + cα^λ-1= c*0 = 0

(3) So that:

a₀ + a₁α + a₂α² + ... + a_λ-1α^λ-1= a₀ + a₁α + a₂α² + ... + a_λ-1α^λ-1 + 0 =

= a₀ + a₁α + a₂α² + ... + a_λ-1α^λ-1 + c + cα + cα² + ... + cα^λ-1=

= (a₀ + c) + (a₁ + c)α + (a₂ + c)α² + ... + (a_λ-1 + c)α^λ-1.

QED

3. Conjugates

Since each cyclotomic value can be represented as:

a₀ + a₁α + a₂α² + ... + a_λ-1α^λ-1

Kummer used the following shorthand to represent a cyclotomic integer:

f(α), g(α), φ(α), F(α), etc.

One important point that we find is that if f(α) = g(α), then f(α²) = g(α²) and so on up until λ - 1.

Lemma 2.5: Conjugates preserve relations between equations

That is, if f(α) = g(α), then f(αⁱ) = g(αⁱ) where i is a positive number less than λ, αⁱ ≠ 1 and α^λ = 1.

Proof:

(1) Let f(α) = a₀ + a₁α + ... + a_λ-1α^λ-1

(2) For any value f(αⁱ) we see that:

f(αⁱ) = a₀ + a₁αⁱ + ... + a_λ-1α^i*(λ-1)

(3) In step #1, let j be the possible values ranging from 1 to λ -1. Combining this with step #2, we get:

f(αⁱ) = ∑ a_jα^j*i

(4) To prove this lemma, we need to show each element j*i is congruent to a unique value of i modulo λ

In other words, we are trying to prove that each element of the f(αⁱ) is distinct.

(5) This turns out to be the case from Lemma 1 here.

QED

For this reason, we say that f(α), f(α²), ...., and f(α^λ-1) are conjugates of each other.

4. Norm

Definition 2: Norm of a cyclotomic integer f(α)

Nf(α) = f(α)*f(α²)*...*f(α^λ-1)

I will now use this definition in the following proofs.

Lemma 3: Nf(α) = Nf(αⁱ) for all values of i between 1 and λ-1.

Proof:

(1) Nf(αⁱ) = f(αⁱ)*f(α^2*i)*...*f(α^i(λ-1))

(2) Now, each value i, 2*i, 3*i, ... (λ-1)*i maps to a distinct value of 1,2,3,...,(λ-1) modulo λ (see Lemma 1 here)

(3) So in each case, i,2*i, etc. maps to a₁*λ+1, a₂*λ+2, etc.

(4) So we get Nf(αⁱ) = f(α^a₀*λ+1)*f(α^a₁*λ+2)*...*f(α^{a_λ-1*λ+λ-1}) where a_i is a nonnegative integer.

(5) Since α^n*λ=1, we get:

Nf(αⁱ) = f(α)*f(α²)*...*f(α^λ-1)

QED

Lemma 4: α^j = α^λ-j

Proof:

(1) From roots of unity and Euler's Formula, we know that:

α = e^(i2π/λ) = cos(2π/λ) + isin(2π/λ)

(2) We also know that the complex conjugate of a + bi is a - bi, so the complex conjugate for α is:

α = cos(2π/λ) - isin(2π/λ)

(3) Likewise, we know that the complex conjugate for α^j is:

α^j = cos(2jπ/λ) - isin(2jπ/λ)

(4) Using Euler's Formula, we see that:

e^-2jπ/λ = cos(-2jπ/λ) + isin(-2jπ/λ)

(5) Since cos(-x) = cos(x) and sin(-x) = -sin(x) [see here], we can use (#4) to get:

e^-2jπ/λ = cos(2jπ/λ) - isin(2jπ/λ)

which is from #3, the complex conjugate for α^j

(6) Now, e^-2jπ/λ = (e^2π/λ)^-j =

= α^-j = α^-j*α^λ =

= α^{λ - j}

QED

Corollary 4.1: f(α^j) = f(α^λ-j)

Proof:

(1) From Lemma 1, we have:

f(α) = a₀ + a₁α + a₂α² + ... + a_λ-1α^λ-1

(2) From this,

f(α^j) = a₀ + a₁α^j + a₂α^2*j + ... a_λ-1α^j*(λ-1)

(3) Now, from Lemma 4, we know that:

f(α^j) = a₀ + a₁α^λ-j + a₂α^{λ - 2*j} + ... a_λ-1α^{λ - j*(λ-1)}

(4) And, we know that:

f(α^λ-j) = a₀ + a₁α^λ-j + a₂α^(λ-j)*2 + ... a_λ-1α^{(λ - j)*(λ - 1)}

(5) Now,

n*λ - j*n ≡ λ - j*n (mod λ) [See here if you need a review of modular arithmetic]

(6) So that we see that step #3 and step #4 are equal so that:

f(α^j) = f(α^λ-j)

QED

Corollary 4.2: f(α^j)*f(α^λ-j) is a nonnegative real number

Proof:

(1) f(α^j) * f(α^λ-j) = f(α^j)* f(α^j) [From Corollary 4.1 above]

(2) So that:
f(α^j) * f(α^λ-j) = (a₀ + a₁α^j + ... + a_λ-1α^j*(λ-1))(a₀ + a₁α^j + ... + a_λ-1α^j*(λ-1)) =

= (a₀)² + (a₁)²(α^j*α^j) + ... + (a_λ-1)²*α^j*(λ-1)*α^j*(λ-1))

(3) Since each α*α is a nonnegative number, the conclusion follows.

QED

Lemma 5: For any cyclotomic integer f(α), its norm is a nonnegative rational integer.

Proof:

(1) Using Lemma 1 above, we know that:

Nf(α) = a₀ + a₁α + a₂α² + ... + a_λ-1α^λ-1

(2) By Lemma 3 above, we can substitute any conjugate α^j and get the same norm so that:

Nf(α^j) = Nf(α)

(3) But by changing to a conjugate, we keep the same coefficients but get the following:

Nf(α^j) = a₀ + a₁α^j + a₂α^j*2 + ... + a_λ-1α^(λ-1)*j

(4) Combining the two equations gets us:

a₀ + a₁α^j + a₂α^j*2 + ... + a_λ-1α^(λ-1)*j= a₀ + a₁α + a₂α² + ... + a_λ-1αλ-1

(5) Subtracting one from the other gives us:

a₀ - a₀ + (a₁ - a_j)α^j + ... = 0

(6) Since we know that each of these j,2*j,...,(λ-1)*j matches up with a value 1,2,...,λ-1, we know that:

a₁ = a_j

(7) Further, since j can be any value from 2 thru λ-1, we can conclude the following:

a₁ = a₂ = a₃ = ... = a_λ-1

(8) So that:
Nf(α) = a₀ + a₁(α + α² + ... + α^λ-1)

(9) From Lemma 2, we know that:

1 + α+α² + ... + α^λ-1 = 0

so that:

α+α² + ... + α^λ-1= -1

(10) So, we apply (#9) to (#8) to give us:

Nf(α) = a₀ - a₁
(11) We know that it is nonnegative since:

Nf(α) = [f(α¹)*f(α^λ-1)]*[f(α²)*f(α^λ-2)]*...

(12) From Corollary 4.2 above, we know that multiplication of (λ-1)/2 pairs of nonnegative values will result in a nonnegative value.

QED

Lemma 6: f(α)g(α) = h(α) → Nf(α)*Ng(α) = Nh(α)

Proof:

(1) Let f(α)g(α) = h(α)

(2) By Definition 2 above:

Nf(α) = f(α)*f(α²)*...*f(α^λ-1)

Ng(α) = g(α)*g(α²)*...*g(α^λ-1)

Nh(α) = h(α)*h(α²)*...*h(α^λ-1)

(3) Using step #1 gives us:

Nh(α) = f(α)*g(α)*f(α²)*g(α²)*...*f(α^λ-1)*g(α^λ-1) =

= f(α)*f(α²)*...*f(α^λ-1) *g(α)*g(α²)*...*g(α^λ-1) =

= Nf(α)*Ng(α)

QED

Sunday, May 07, 2006

Fermat's Last Theorem: Proof for regular primes

One of the highpoints of the 19th century mathematics is Kummer's proof of Fermat's Last Theorem for regular primes.

Kummer's theory of ideal numbers is one of the foundations of algebraic number theory. In future blogs, I will talk about some of the other very important proofs that came out at this time (impossibility of a general method for quintic equations, transcendence of π, and the fundamental theorem of algebra) and show how Dedekind reinterpreted many of these developments into the modern concepts of ideals, rings, groups, and fields.

Kummer's proof comes down to three major points.

(A) For certain primes (which Kummer called "regular primes"), cyclotomic integers can be said to have a form of unique factorization. [See here for discussion on ideal numbers and how they "save" unique factorization for cyclotomic integers]

(B) For a regular prime λ, there is no solution to x^λ + y^λ = z^λ where x,y,z are pairwise relatively prime all prime to λ

(C) For a regular prime λ, there is no solution to x^λ + y^λ = z^λ where x,y, z are pairwise relatively prime and where λ divides z.

For the full proof, go here.

References

Harold M. Edwards, Fermat's Last Theorem

Fermat's Last Theorem

Saturday, May 13, 2006

Cyclotomic Integers: Factoring Fermat's Last Theorem

Cyclotomic Integers: Division Algorithm

Friday, May 12, 2006

Cyclotomic Integers: Units and Primes

Tuesday, May 09, 2006

Basic Properties of Cyclotomic Integers

Sunday, May 07, 2006

Fermat's Last Theorem: Proof for regular primes

Topic Index

Completed Proofs

Recommended Books

Required Reading for Experts

About Me

Blog Archive