MathComps LogoMathComps
ProblemsHandoutsGuideNews
Sign in

Mathematical induction

Proofs
Author
Patrik Bak

1Introduction

The word induction generally means a mental process from the specific to the general. When proving problems, we often think this way – we play with small cases and investigate how to discover new ones from previously derived results. Mathematical induction is a formal proof method based on this idea. In this material, we will formalize it and then demonstrate it on various examples.

2Basic induction

We illustrate the idea of mathematical induction using dominoes. They have a property: if the kkk-th one falls, the (k+1)(k+1)(k+1)-th one will also fall. Thanks to this, if we knock over the first one, the second one falls as well, and therefore the third one, etc. The conclusion is that they all fall.

Example 1

Prove that the sum of the first nnn natural numbers is equal to n(n+1)2\frac{n(n+1)}22n(n+1)​.

✓Solution

For n=1n=1n=1 we get 1=1⋅221 = \frac{1\cdot 2}{2}1=21⋅2​, which holds. Assume that the statement holds for some nnn. For n+1n+1n+1, the new expression on the left is equal to

(1+2+⋯+n)+(n+1)=n(n+1)2+(n+1)=(n+1)(n+2)2,(1+2+\cdots+n)+(n+1) = \frac{n(n+1)}{2}+(n+1) = \frac{(n+1)(n+2)}{2},(1+2+⋯+n)+(n+1)=2n(n+1)​+(n+1)=2(n+1)(n+2)​,

which is exactly the expression on the right side for n+1n+1n+1. The proof by induction is complete.

In general, a proof by induction consists of two steps:

  • Proof of the statement for some initial value n0n_0n0​.
  • Proof that if the statement holds for some n≥n0n \ge n_0n≥n0​, then it holds for n+1n+1n+1.

Try this approach on these memorable identities.

Exercise 1

Prove the following identities for all natural numbers nnn:

  • 1+3+⋯+(2n−1)=n21+3+\cdots+(2n-1) = n^21+3+⋯+(2n−1)=n2
  • 12+22+⋯+n2=n(n+1)(2n+1)61^2+2^2+\cdots+n^2 = \frac{n(n+1)(2n+1)}{6}12+22+⋯+n2=6n(n+1)(2n+1)​
  • 13+23+⋯+n3=(1+2+⋯+n)21^3+2^3+\cdots+n^3 = (1+2+\cdots+n)^213+23+⋯+n3=(1+2+⋯+n)2
  • 11⋅2+12⋅3+⋯+1n(n+1)=nn+1\frac{1}{1\cdot 2}+\frac{1}{2\cdot 3}+\cdots+\frac{1}{n(n+1)} = \frac{n}{n+1}1⋅21​+2⋅31​+⋯+n(n+1)1​=n+1n​
  • 1⋅1!+2⋅2!+⋯+n⋅n!=(n+1)!−11\cdot 1!+2\cdot 2!+\cdots+n\cdot n! = (n+1)!-11⋅1!+2⋅2!+⋯+n⋅n!=(n+1)!−1
✓Solution

We will prove all five identities by induction on nnn.

  • By trying small cases, we guess the formula. For n=1n=1n=1, 1=121=1^21=12 holds. Assume that the statement holds for a given nnn. Then
    1+3+⋯+(2n−1)+(2n+1)=n2+2n+1=(n+1)2,1+3+\cdots+(2n-1)+(2n+1) = n^2+2n+1 = (n+1)^2,1+3+⋯+(2n−1)+(2n+1)=n2+2n+1=(n+1)2,
    which is exactly the statement for n+1n+1n+1.
  • For n=1n=1n=1, 1=1⋅2⋅361 = \frac{1\cdot 2\cdot 3}{6}1=61⋅2⋅3​ holds. Assume validity for a given nnn. Then
    12+⋯+n2+(n+1)2=n(n+1)(2n+1)6+(n+1)2==(n+1)(n+2)(2n+3)6,\begin{gather*} 1^2+\cdots+n^2+(n+1)^2 = \frac{n(n+1)(2n+1)}{6}+(n+1)^2 = \cr = \frac{(n+1)(n+2)(2n+3)}{6}, \end{gather*}12+⋯+n2+(n+1)2=6n(n+1)(2n+1)​+(n+1)2==6(n+1)(n+2)(2n+3)​,​
    where the last equality follows from factoring out (n+1)(n+1)(n+1) and simplifying
    n(2n+1)+6(n+1)6=2n2+7n+66=(n+2)(2n+3)6.\frac{n(2n+1)+6(n+1)}{6} = \frac{2n^2+7n+6}{6} = \frac{(n+2)(2n+3)}{6}.6n(2n+1)+6(n+1)​=62n2+7n+6​=6(n+2)(2n+3)​.
  • We know that
    1+2+⋯+n=n(n+1)2,1+2+\cdots+n = \frac{n(n+1)}{2},1+2+⋯+n=2n(n+1)​,
    so we are proving
    13+⋯+n3=(n(n+1)2)2.1^3+\cdots+n^3 = \left(\frac{n(n+1)}{2}\right)^2.13+⋯+n3=(2n(n+1)​)2.
    For n=1n=1n=1, 1=11=11=1 holds. Assume validity for a given nnn. Then
    13+⋯+n3+(n+1)3=(n(n+1)2)2+(n+1)3=(n+1)2(n24+n+1)=(n+1)2⋅(n+2)24==((n+1)(n+2)2)2.\begin{gather*} 1^3+\cdots+n^3+(n+1)^3 = \left(\frac{n(n+1)}{2}\right)^2+(n+1)^3 \cr = (n+1)^2\left(\frac{n^2}{4}+n+1\right) = (n+1)^2\cdot\frac{(n+2)^2}{4} = \cr = \left(\frac{(n+1)(n+2)}{2}\right)^2. \end{gather*}13+⋯+n3+(n+1)3=(2n(n+1)​)2+(n+1)3=(n+1)2(4n2​+n+1)=(n+1)2⋅4(n+2)2​==(2(n+1)(n+2)​)2.​
  • For n=1n=1n=1, 12=12\frac{1}{2} = \frac{1}{2}21​=21​ holds. Assume validity for a given nnn. Then
    11⋅2+⋯+1n(n+1)+1(n+1)(n+2)==nn+1+1(n+1)(n+2)=n(n+2)+1(n+1)(n+2)==n2+2n+1(n+1)(n+2)=(n+1)2(n+1)(n+2)=n+1n+2.\begin{gather*} \frac{1}{1\cdot 2}+\cdots+\frac{1}{n(n+1)}+\frac{1}{(n+1)(n+2)} = \cr = \frac{n}{n+1}+\frac{1}{(n+1)(n+2)} = \frac{n(n+2)+1}{(n+1)(n+2)} = \cr = \frac{n^2+2n+1}{(n+1)(n+2)} = \frac{(n+1)^2}{(n+1)(n+2)} = \frac{n+1}{n+2}. \end{gather*}1⋅21​+⋯+n(n+1)1​+(n+1)(n+2)1​==n+1n​+(n+1)(n+2)1​=(n+1)(n+2)n(n+2)+1​==(n+1)(n+2)n2+2n+1​=(n+1)(n+2)(n+1)2​=n+2n+1​.​
  • For n=1n=1n=1, 1=2!−1=11=2!-1=11=2!−1=1 holds. Assume validity for a given nnn. By adding (n+1)⋅(n+1)!(n+1)\cdot(n+1)!(n+1)⋅(n+1)! we get
    1⋅1!+⋯+n⋅n!+(n+1)⋅(n+1)!==(n+1)!−1+(n+1)⋅(n+1)!==(n+2)⋅(n+1)!−1=(n+2)!−1.\begin{gather*} 1\cdot 1!+\cdots+n\cdot n!+(n+1)\cdot(n+1)! = \cr = (n+1)!-1+(n+1)\cdot(n+1)! = \cr = (n+2)\cdot(n+1)!-1 = (n+2)!-1. \end{gather*}1⋅1!+⋯+n⋅n!+(n+1)⋅(n+1)!==(n+1)!−1+(n+1)⋅(n+1)!==(n+2)⋅(n+1)!−1=(n+2)!−1.​

Exercise 2

Prove by induction the following formulas for the sums of the first nnn terms of known sequences.

  • Sum of an arithmetic sequence:
    a+(a+d)+(a+2d)+⋯+(a+(n−1)d)=n(2a+(n−1)d)2.a + (a+d) + (a+2d) + \cdots + (a+(n-1)d) = \frac{n(2a+(n-1)d)}{2}.a+(a+d)+(a+2d)+⋯+(a+(n−1)d)=2n(2a+(n−1)d)​.
  • Sum of a geometric sequence (q≠1q \neq 1q=1):
    a+aq+aq2+⋯+aqn−1=a⋅qn−1q−1.a + aq + aq^2 + \cdots + aq^{n-1} = a\cdot\frac{q^n-1}{q-1}.a+aq+aq2+⋯+aqn−1=a⋅q−1qn−1​.
  • Sum of an arithmetico-geometric sequence (q≠1q \neq 1q=1):
    1+2q+3q2+⋯+nqn−1=1−(n+1)qn+nqn+1(1−q)2.1 + 2q + 3q^2 + \cdots + nq^{n-1} = \frac{1-(n+1)q^n+nq^{n+1}}{(1-q)^2}.1+2q+3q2+⋯+nqn−1=(1−q)21−(n+1)qn+nqn+1​.
✓Solution

We will prove all three formulas by induction on nnn.

  • For n=1n=1n=1, a=1⋅(2a)2=aa=\frac{1\cdot(2a)}{2}=aa=21⋅(2a)​=a holds. Assume validity for a given nnn. By adding the (n+1)(n+1)(n+1)-th term a+nda+nda+nd to the right side we get
    n(2a+(n−1)d)2+a+nd=(n+1)(2a+nd)2.\frac{n(2a+(n-1)d)}{2}+a+nd = \frac{(n+1)(2a+nd)}{2}.2n(2a+(n−1)d)​+a+nd=2(n+1)(2a+nd)​.
  • For n=1n=1n=1, a=a⋅q−1q−1a = a\cdot\frac{q-1}{q-1}a=a⋅q−1q−1​ holds. Assume validity for a given nnn. By adding aqnaq^naqn we get
    a⋅qn−1q−1+aqn=a⋅qn−1+qn(q−1)q−1=a⋅qn+1−1q−1.\begin{gather*} a\cdot\frac{q^n-1}{q-1}+aq^n = a\cdot\frac{q^n-1+q^n(q-1)}{q-1} = a\cdot\frac{q^{n+1}-1}{q-1}. \end{gather*}a⋅q−1qn−1​+aqn=a⋅q−1qn−1+qn(q−1)​=a⋅q−1qn+1−1​.​
  • For n=1n=1n=1, 1=1−2q+q2(1−q)2=11=\frac{1-2q+q^2}{(1-q)^2}=11=(1−q)21−2q+q2​=1 holds. Assume validity for a given nnn. By adding (n+1)qn(n+1)q^n(n+1)qn to the right side we get
    1−(n+1)qn+nqn+1(1−q)2+(n+1)qn(1−q)2(1−q)2.\begin{gather*} \frac{1-(n+1)q^n+nq^{n+1}}{(1-q)^2} + \frac{(n+1)q^n(1-q)^2}{(1-q)^2}. \end{gather*}(1−q)21−(n+1)qn+nqn+1​+(1−q)2(n+1)qn(1−q)2​.​
    After multiplying out and simplifying, the numerator simplifies to 1−(n+2)qn+1+(n+1)qn+21-(n+2)q^{n+1}+(n+1)q^{n+2}1−(n+2)qn+1+(n+1)qn+2.As a matter of interest, let us add that this formula can be obtained by differentiating the previous formula written for n+1n+1n+1.

Exercise 3

Prove the following identities for Fibonacci numbers, defined recursively as F1=1F_1=1F1​=1, F2=1F_2=1F2​=1 and Fn+2=Fn+1+FnF_{n+2}=F_{n+1}+F_nFn+2​=Fn+1​+Fn​:

  • F1+F2+⋯+Fn=Fn+2−1F_1+F_2+\cdots+F_n = F_{n+2}-1F1​+F2​+⋯+Fn​=Fn+2​−1
  • F1+F3+⋯+F2n−1=F2nF_1+F_3+\cdots+F_{2n-1} = F_{2n}F1​+F3​+⋯+F2n−1​=F2n​
  • F2+F4+⋯+F2n=F2n+1−1F_2+F_4+\cdots+F_{2n} = F_{2n+1}-1F2​+F4​+⋯+F2n​=F2n+1​−1
✓Solution

We prove all three by induction on nnn.

  • For n=1n=1n=1, F1=1=F3−1=2−1F_1 = 1 = F_3-1 = 2-1F1​=1=F3​−1=2−1 holds. Assume
    F1+⋯+Fn=Fn+2−1.F_1+\cdots+F_n = F_{n+2}-1.F1​+⋯+Fn​=Fn+2​−1.
    By adding Fn+1F_{n+1}Fn+1​ we have
    F1+⋯+Fn+Fn+1=Fn+2−1+Fn+1=Fn+3−1,F_1+\cdots+F_n+F_{n+1}=F_{n+2}-1+F_{n+1} = F_{n+3}-1,F1​+⋯+Fn​+Fn+1​=Fn+2​−1+Fn+1​=Fn+3​−1,
    which is the statement for n+1n+1n+1.
  • For n=1n=1n=1, F1=1=F2F_1=1=F_2F1​=1=F2​ holds. Assume
    F1+F3+⋯+F2n−1=F2n.F_1 + F_3 + \cdots + F_{2n-1}=F_{2n}.F1​+F3​+⋯+F2n−1​=F2n​.
    By adding F2n+1F_{2n+1}F2n+1​ we get
    F1+F3+⋯+F2n−1+F2n+1=F2n+F2n+1=F2n+2.F_1 + F_3 + \cdots + F_{2n-1}+F_{2n+1}=F_{2n}+F_{2n+1}=F_{2n+2}.F1​+F3​+⋯+F2n−1​+F2n+1​=F2n​+F2n+1​=F2n+2​.
  • For n=1n=1n=1, F2=1=F3−1F_2=1=F_3-1F2​=1=F3​−1 holds. Assume
    F2+F4+⋯+F2n=F2n+1−1.F_2+F_4+\cdots+F_{2n} = F_{2n+1}-1.F2​+F4​+⋯+F2n​=F2n+1​−1.
    By adding F2n+2F_{2n+2}F2n+2​ we have
    F2+F4+⋯+F2n+F2n+2=F2n+1−1+F2n+2=F2n+3−1.F_2+F_4+\cdots+F_{2n}+F_{2n+2}=F_{2n+1}-1+F_{2n+2}=F_{2n+3}-1.F2​+F4​+⋯+F2n​+F2n+2​=F2n+1​−1+F2n+2​=F2n+3​−1.

So far, we have proved statements for all natural numbers nnn. In the following example, we will show that this is not necessary and induction can sometimes start later. This example also opens up another type of problems where induction is useful – inequalities.

Example 2

For which natural numbers nnn does 2n≥n22^n \ge n^22n≥n2 hold?

✓Solution

By testing small cases, we get strange behavior: For n=1n=1n=1 and n=2n=2n=2 the statement holds. One might think it holds always. However, for n=3n=3n=3 we have 8≥98 \ge 98≥9. For n=4n=4n=4, things are fine again and we have 16≥1616 \ge 1616≥16. For n=5n=5n=5 then 32≥2532 \ge 2532≥25, further 64≥3264 \ge 3264≥32. The differences are increasing, it seems the statement will hold always from now on. We will formally prove it by mathematical induction for n≥4n \ge 4n≥4.

For n=4n=4n=4 the statement holds. Assume that it holds for a given n≥4n \ge 4n≥4, that is, 2n≥n22^n \ge n^22n≥n2. Then we estimate:

2n+1=2⋅2n≥2⋅n2=2n2.2^{n+1} = 2 \cdot 2^n \ge 2 \cdot n^2 = 2n^2.2n+1=2⋅2n≥2⋅n2=2n2.

It would therefore be sufficient to prove that 2n2≥(n+1)22n^2 \ge (n+1)^22n2≥(n+1)2, then altogether we would get 2n+1≥(n+1)22^{n+1} \ge (n+1)^22n+1≥(n+1)2.

We have 2n2≥(n+1)22n^2 \ge (n+1)^22n2≥(n+1)2 if and only if 2n2−(n+1)2=n(n−2)−12n^2-(n+1)^2 = n(n-2)-12n2−(n+1)2=n(n−2)−1 is non-negative. For n≥4n \ge 4n≥4, the expression n(n−2)n(n-2)n(n−2) is obviously increasing and for n=4n=4n=4 it equals 888, so the statement holds.

Note that the residual inequality 2n2≥(n+1)22n^2 \ge (n+1)^22n2≥(n+1)2 obviously holds already for n=3n=3n=3. The problem is that the original inequality does not hold for n=3n=3n=3, so the induction really could not have started earlier.

The answer to the question from the problem is all natural numbers except n=3n=3n=3.

Try a few inequalities for practice.

Exercise 4

Prove that for all natural numbers nnn it holds that 2n≥n2^n \ge n2n≥n.

✓Solution

For n=1n=1n=1, 2≥12 \ge 12≥1 holds. Assume that 2n≥n2^n \ge n2n≥n for a given nnn. Then

2n+1=2⋅2n≥2n≥n+1,2^{n+1} = 2\cdot 2^n \ge 2n \ge n+1,2n+1=2⋅2n≥2n≥n+1,

since 2n≥n+12n \ge n+12n≥n+1 for n≥1n \ge 1n≥1.

Exercise 5

Prove that for all natural numbers n≥4n \ge 4n≥4 it holds that n!>2nn! > 2^nn!>2n.

✓Solution

For n=4n=4n=4, 24>1624 > 1624>16 holds. Assume that n!>2nn! > 2^nn!>2n for a given n≥4n \ge 4n≥4. Then

(n+1)!=(n+1)⋅n!>(n+1)⋅2n,(n+1)! = (n+1)\cdot n! > (n+1)\cdot 2^n,(n+1)!=(n+1)⋅n!>(n+1)⋅2n,

now we use that n+1≥5>2n+1 \ge 5 > 2n+1≥5>2, so

(n+1)!>(n+1)⋅2n≥2⋅2n=2n+1.(n+1)! > (n+1) \cdot 2^n \ge 2 \cdot 2^n = 2^{n+1}.(n+1)!>(n+1)⋅2n≥2⋅2n=2n+1.

Exercise 6

Prove that for all natural numbers nnn it holds that

1n+1+1n+2+⋯+12n≥12.\frac{1}{n+1} + \frac{1}{n+2} + \cdots + \frac{1}{2n} \ge \frac{1}{2}.n+11​+n+21​+⋯+2n1​≥21​.
✓Solution

For n=1n=1n=1, 12≥12\frac{1}{2} \ge \frac{1}{2}21​≥21​ holds. Assume validity for a given nnn. Denote Sn=1n+1+⋯+12nS_n = \frac{1}{n+1}+\cdots+\frac{1}{2n}Sn​=n+11​+⋯+2n1​. Then

Sn+1=1n+2+⋯+12n+12n+1+12n+2==Sn−1n+1+12n+1+12n+2.\begin{gather*} S_{n+1} = \frac{1}{n+2}+\cdots+\frac{1}{2n}+\frac{1}{2n+1}+\frac{1}{2n+2} = \cr = S_n - \frac{1}{n+1}+\frac{1}{2n+1}+\frac{1}{2n+2}. \end{gather*}Sn+1​=n+21​+⋯+2n1​+2n+11​+2n+21​==Sn​−n+11​+2n+11​+2n+21​.​

It suffices to verify that

−1n+1+12n+1+12n+2≥0,-\frac{1}{n+1}+\frac{1}{2n+1}+\frac{1}{2n+2} \ge 0,−n+11​+2n+11​+2n+21​≥0,

after rewriting with a common denominator we get

12(2n+1)(n+1)>0\frac{1}{2(2n+1)(n+1)} > 02(2n+1)(n+1)1​>0

which obviously holds.

Exercise 7

Prove Bernoulli's inequality: for real x≥−1x \ge -1x≥−1 and natural nnn it holds that

(1+x)n≥1+nx.(1 + x)^n \ge 1 + nx.(1+x)n≥1+nx.
✓Solution

For n=1n=1n=1, 1+x≥1+x1+x \ge 1+x1+x≥1+x holds. Assume that

(1+x)n≥1+nx(1+x)^n \ge 1+nx(1+x)n≥1+nx

for a given nnn. Since 1+x≥01+x \ge 01+x≥0, we can multiply both sides by the expression (1+x)(1+x)(1+x):

(1+x)n+1≥(1+nx)(1+x)=1+(n+1)x+nx2.(1+x)^{n+1} \ge (1+nx)(1+x) = 1+(n+1)x+nx^2.(1+x)n+1≥(1+nx)(1+x)=1+(n+1)x+nx2.

We need to prove that the last expression is at least 1+(n+1)x1+(n+1)x1+(n+1)x, which is obvious, because nx2nx^2nx2 is non-negative.

We can also use induction to prove divisibility.

Example 3

Prove that for all natural numbers nnn it holds that 6∣n3−n6 \mid n^3-n6∣n3−n.

✓Solution

For n=1n=1n=1, 1−1=01-1=01−1=0 and 6∣06 \mid 06∣0 holds. Assume that 6∣n3−n6 \mid n^3-n6∣n3−n for a given nnn. Then

(n+1)3−(n+1)=n3+3n2+3n+1−n−1==(n3−n)+3n(n+1).\begin{gather*} (n+1)^3-(n+1) = n^3+3n^2+3n+1-n-1 = \cr = (n^3-n)+3n(n+1). \end{gather*}(n+1)3−(n+1)=n3+3n2+3n+1−n−1==(n3−n)+3n(n+1).​

The first term is divisible by 6 by the induction hypothesis. In the second term, the number n(n+1)n(n+1)n(n+1) is the product of two consecutive numbers, hence even, and after multiplying by 3 we get a multiple of 6.

Example 4

Prove that for all natural numbers nnn it holds that 3∣4n−13 \mid 4^n-13∣4n−1.

✓Solution

For n=1n=1n=1, 4−1=34-1=34−1=3 and 3∣33 \mid 33∣3 holds. Assume that 3∣4n−13 \mid 4^n-13∣4n−1 for a given nnn. Then

4n+1−1=4⋅4n−1=4(4n−1)+3.4^{n+1}-1 = 4\cdot 4^n-1 = 4(4^n-1)+3.4n+1−1=4⋅4n−1=4(4n−1)+3.

The first addend is divisible by 3 by the induction hypothesis, and 333 is obviously divisible by 3, therefore their sum is as well.

Exercise 8

Prove that for all natural numbers nnn it holds that 9∣4n+6n−19 \mid 4^n+6n-19∣4n+6n−1.

✓Solution

For n=1n=1n=1, 4+6−1=94+6-1=94+6−1=9 and 9∣99 \mid 99∣9 holds. Assume that 9∣4n+6n−19 \mid 4^n+6n-19∣4n+6n−1 for a given nnn. Then

4n+1+6(n+1)−1=4⋅4n+6n+5==4(4n+6n−1)−18n+9==4(4n+6n−1)+9(1−2n).\begin{gather*} 4^{n+1}+6(n+1)-1 = 4\cdot 4^n+6n+5 = \cr = 4(4^n+6n-1)-18n+9 = \cr = 4(4^n+6n-1)+9(1-2n). \end{gather*}4n+1+6(n+1)−1=4⋅4n+6n+5==4(4n+6n−1)−18n+9==4(4n+6n−1)+9(1−2n).​

The first addend is divisible by 9 by the induction hypothesis and the second is obviously a multiple of 9.

Exercise 9

Prove that for all natural numbers nnn it holds that 31∣5n+1+62n−131 \mid 5^{n+1}+6^{2n-1}31∣5n+1+62n−1.

✓Solution

For n=1n=1n=1, 52+61=315^2+6^1=3152+61=31 and 31∣3131 \mid 3131∣31 holds. Assume that 31∣5n+1+62n−131 \mid 5^{n+1}+6^{2n-1}31∣5n+1+62n−1. Then

5n+2+62n+1=5⋅5n+1+36⋅62n−1==5(5n+1+62n−1)+31⋅62n−1.\begin{gather*} 5^{n+2}+6^{2n+1} = 5\cdot 5^{n+1}+36\cdot 6^{2n-1} = \cr = 5(5^{n+1}+6^{2n-1})+31\cdot 6^{2n-1}. \end{gather*}5n+2+62n+1=5⋅5n+1+36⋅62n−1==5(5n+1+62n−1)+31⋅62n−1.​

The first addend is divisible by 31 by the induction hypothesis and the second is obviously a multiple of 31.

In a proof, it is really necessary to verify the first step, otherwise anything can be proved. For example, that a number of the form n2+n+1n^2+n+1n2+n+1 is always even. Namely, if it holds for a given nnn, then for n+1n+1n+1 we have (n+1)2+(n+1)+1(n+1)^2+(n+1)+1(n+1)2+(n+1)+1, which we easily convince ourselves is equal to (n2+n+1)+2(n+1)(n^2+n+1)+2(n+1)(n2+n+1)+2(n+1). By the induction hypothesis, n2+n+1n^2+n+1n2+n+1 is even, and obviously 2(n+1)2(n+1)2(n+1) is even, so we have the sum of two even numbers, thus the proof is complete.

The problem is that if we now plug in any nnn, we get an odd number. Where is the mistake? Well, we did not verify that the statement holds for n=1n=1n=1 – then the expression is equal to 3 and is odd. So if we were proving that this expression is always odd, then together with the step for n=1n=1n=1 the proof would be complete.

This problem presents another potential pitfall.

Problem 1

Find the mistake in the proof of this statement:

Let us have n≥2n \ge 2n≥2 lines in the plane such that no two are parallel. Then all these lines pass through a single point.

Proof: For n=2n = 2n=2 the statement holds, because two non-parallel lines intersect in a single point.

Induction step: Let the statement hold for some n≥2n \ge 2n≥2 lines. Let us take n+1n+1n+1 lines p1,p2,…,pn+1p_1, p_2, \dots, p_{n+1}p1​,p2​,…,pn+1​, out of which no two are parallel.

The first nnn lines p1,…,pnp_1, \dots, p_np1​,…,pn​ pass through a single point XXX by the induction hypothesis. Similarly, the lines p1,…,pn−1,pn+1p_1, \dots, p_{n-1}, p_{n+1}p1​,…,pn−1​,pn+1​ (there are also nnn of them) pass through a single point YYY. Since the lines p1,…,pn−1p_1, \dots, p_{n-1}p1​,…,pn−1​ pass through both points XXX and YYY, necessarily X=YX = YX=Y. Therefore, all n+1n+1n+1 lines pass through point XXX.

1Hint

It is evident that the statement does not hold already for n=3n=3n=3. To understand the mistake, try writing out the induction step for nnn equal to 2 (where we prove that the statement holds for 2+1=32+1=32+1=3 lines).

✓Solution

The mistake is in the sentence Since the lines p1,…,pn−1p_1, \dots, p_{n-1}p1​,…,pn−1​ pass through both points XXX and YYY, necessarily X=YX = YX=Y. For n=2n=2n=2, the sequence of lines p1,…,pn−1p_1,\dots,p_{n-1}p1​,…,pn−1​ is actually just a single line. Whenever n>2n>2n>2, the statement would indeed hold, and that causes the confusion.

3Strong induction

In previous problems, we always said in the induction step that the statement holds for some nnn and subsequently proved it for n+1n+1n+1. In fact, however, we can assume something stronger. For example, let's have a statement for all natural numbers nnn that we are proving by induction. First, of course, we prove it for n=1n=1n=1. Subsequently, we assume that it holds for all 1,2,…,n1,2,\dots,n1,2,…,n and prove it for n+1n+1n+1 – instead of just assuming that it holds for some nnn. This turn is very common in more complex proofs and is called strong induction. We illustrate it with an example:

Theorem 1

Prime factorization theorem

Every integer n>1n>1n>1 can be factored into a product of several, not necessarily distinct, primes.

Proof

The statement obviously holds for n=2n=2n=2, which is itself a prime. Assume that we have proved the statement for 2,3,…,n2,3,\dots,n2,3,…,n. Now let us take the number n+1n+1n+1. If it is a prime, we are done. If it is not a prime, it means that there exist two integers a>1a>1a>1 and b>1b>1b>1 such that n+1=abn+1=abn+1=ab. Since both aaa and bbb are more than 1, both are less than n+1n+1n+1, thus at most nnn. We can apply the induction hypothesis to both of them, namely, that they can be written as a product of primes. Therefore, their product ababab can also be written as a product of primes, which is what we needed to prove.

Notice that we could not use traditional induction in this proof; we strictly need the numbers a,ba,ba,b to be smaller than n+1n+1n+1.

Furthermore, realize that regular induction is a special case of strong induction – after all, the assumption that the statement holds for all 1,2,…,n1,2,\dots,n1,2,…,n also includes the fact that it holds for nnn, which is what we base regular induction on. When coming up with a solution by induction, it is therefore more useful to think straight away in the style of strong induction; we won't lose anything.

Another traditional result is the existence of a unique representation in the binary system.

Example 5

Prove that every natural number can be written as a sum of mutually distinct powers of two.

✓Solution

For n=1n=1n=1, 1=201=2^01=20 holds. Assume that the statement holds for all natural numbers less than nnn. If nnn is a power of two, we are done. Otherwise, we find the largest power of two 2k<n2^k < n2k<n. The number n−2kn-2^kn−2k is positive and less than nnn, so by the induction hypothesis, it can be written as a sum of distinct powers of two. Moreover, n−2k<2kn - 2^k < 2^kn−2k<2k (because 2k+1>n2^{k+1} > n2k+1>n), so 2k2^k2k does not appear in its decomposition. By adding 2k2^k2k, we get the representation of nnn as a sum of distinct powers of two.

Try a similar proof on harder examples:

Problem 2

Zeckendorf's theorem, existence

Prove that every positive integer can be written as a sum of mutually non-consecutive Fibonacci numbers.

1Hint

We proceed by induction. Let nnn be the number we are trying to write as a sum. It pays off to consider the largest Fibonacci number not exceeding nnn, e.g., FmF_mFm​, and apply the induction hypothesis to n−Fmn-F_mn−Fm​. Are we done?

2Hint

We are not done yet. We need to deal with the fact that the representation of n−Fmn-F_mn−Fm​ might contain Fm−1F_{m-1}Fm−1​, which would break the non-consecutiveness. But what if n−Fm=Fm−1+⋯n-F_m=F_{m-1}+\cdotsn−Fm​=Fm−1​+⋯?

✓Solution

For the smallest natural numbers, the statement holds (e.g., 1=F2,2=F31=F_2, 2=F_31=F2​,2=F3​). Assume that every number less than nnn can be written in the required way. Let us find the largest Fibonacci number Fm≤nF_m \le nFm​≤n. If n=Fmn=F_mn=Fm​, we are done. Otherwise, the number n−Fmn-F_mn−Fm​ is positive and strictly less than nnn, so by the induction hypothesis, we can write it as a sum of mutually non-consecutive Fibonacci numbers.

By adding FmF_mFm​, we would get the representation for nnn. A problem would only occur if the number Fm−1F_{m-1}Fm−1​ (or larger) appeared in the representation of n−Fmn-F_mn−Fm​. In such a case, the entire representation would be at least Fm−1F_{m-1}Fm−1​, and thus n−Fm≥Fm−1n-F_m \ge F_{m-1}n−Fm​≥Fm−1​. From this, however, we get n≥Fm+Fm−1=Fm+1n \ge F_m + F_{m-1} = F_{m+1}n≥Fm​+Fm−1​=Fm+1​, which is a contradiction with FmF_mFm​ being the largest Fibonacci number not exceeding nnn.

Therefore, the representation for n−Fmn-F_mn−Fm​ contains only numbers at most Fm−2F_{m-2}Fm−2​. Thus, supplementing with the number FmF_mFm​ does not create a pair of consecutive Fibonacci numbers, and we get a valid representation of the number nnn.

Another numeral system is the factorial base:

Problem 3

Factorial number system

Prove that every positive integer nnn can be written in the form

n=a1⋅1!+a2⋅2!+⋯+ak⋅k!,n = a_1 \cdot 1! + a_2 \cdot 2! + \cdots + a_k \cdot k!,n=a1​⋅1!+a2​⋅2!+⋯+ak​⋅k!,

where 0≤ai≤i0 \le a_i \le i0≤ai​≤i for every iii.

1Hint

We proceed by induction. Let nnn be the number we are trying to write in this way. Consider the largest number kkk such that k!≤nk! \le nk!≤n. The trick is to divide nnn by k!k!k! with a remainder, that is, to write n=a⋅k!+rn=a \cdot k! + rn=a⋅k!+r, where 0≤r<k!0 \le r < k!0≤r<k!. Where can we apply the induction hypothesis? What remains to be proved?

2Hint

First of all, realize that a≤ka \le ka≤k (prove it). This means that for r=0r=0r=0 we are done, and that for r>0r>0r>0 we can apply the induction hypothesis to rrr. Do we thus already obtain a satisfying expression?

✓Solution

For n=1n=1n=1 the statement holds, 1=1⋅1!1 = 1 \cdot 1!1=1⋅1!. Assume that every number less than nnn can be written in the required way. Find the largest number kkk such that k!≤nk! \le nk!≤n and divide nnn by k!k!k! with a remainder, so n=ak⋅k!+rn = a_k \cdot k! + rn=ak​⋅k!+r, where 0≤r<k!0 \le r < k!0≤r<k!. Since k!k!k! was the largest possible, originally we must have had n<(k+1)!n < (k+1)!n<(k+1)!, which implies

ak⋅k!+r<(k+1)!=(k+1)⋅k!ak⋅k!≤ak⋅k!+r<(k+1)⋅k!ak<k+1  ⟹  ak≤k.\begin{gather*} a_k \cdot k! + r < (k+1)! = (k+1) \cdot k! \cr a_k \cdot k! \le a_k \cdot k! + r < (k+1) \cdot k! \cr a_k < k+1 \implies a_k \le k. \end{gather*}ak​⋅k!+r<(k+1)!=(k+1)⋅k!ak​⋅k!≤ak​⋅k!+r<(k+1)⋅k!ak​<k+1⟹ak​≤k.​

We are guaranteed that the condition for the most significant digit is met. If r=0r=0r=0, we are done. Otherwise, we have 0<r<k!≤n0 < r < k! \le n0<r<k!≤n, so rrr is strictly less than nnn and we can apply the induction hypothesis to it. We thus get the representation r=a1⋅1!+⋯+am⋅m!r = a_1 \cdot 1! + \cdots + a_m \cdot m!r=a1​⋅1!+⋯+am​⋅m!.

Since r<k!r < k!r<k!, the largest factorial in the representation of rrr can be at most (k−1)!(k-1)!(k−1)!, thus m≤k−1m \le k-1m≤k−1. By substituting this representation for rrr into the expression n=ak⋅k!+rn = a_k \cdot k! + rn=ak​⋅k!+r, we thus obtain the sought representation of the number nnn, and no two factorials add up together.

Let us add that uniqueness can be proved for both the Fibonacci and the factorial number systems (and of course for binary and our decimal one as well). The proof is easy but technical – it relies on a general principle: Let us have a system where we can express 1 and in which it holds that the largest number we can express using kkk digits is 1 less than the smallest number we can express using k+1k+1k+1 digits. In such a system, both completeness (every number can be expressed) and uniqueness then hold. You can try to think this statement through for all the examined systems (binary, Fibonacci, and factorial); the already proved exercises can be a good help :)

Technically, one of the forms of strong induction is also when we use the induction hypothesis for only two smaller numbers, e.g., n−2n-2n−2 and n−1n-1n−1. This is often seen in problems about recurrence sequences.

Example 6

A sequence is defined by the rule a1=0a_1 = 0a1​=0, a2=1a_2 = 1a2​=1 and

an=3an−1−2an−2a_n = 3a_{n-1}-2a_{n-2}an​=3an−1​−2an−2​

for n≥3n \ge 3n≥3. Guess the explicit formula for ana_nan​ and prove it by induction.

✓Solution

From the small values a1=0,a2=1,a3=3,a4=7,a5=15,a6=31a_1=0, a_2=1, a_3=3, a_4=7, a_5=15, a_6=31a1​=0,a2​=1,a3​=3,a4​=7,a5​=15,a6​=31 we guess an=2n−1−1a_n = 2^{n-1}-1an​=2n−1−1. We prove this formula by induction:

For n=1n=1n=1, 20−1=02^0-1=020−1=0 holds. For n=2n=2n=2, 21−1=12^1-1=121−1=1 holds. Assume it holds for n−1n-1n−1 and nnn. Then

an+1=3an−2an−1=3(2n−1−1)−2(2n−2−1)==3⋅2n−1−3−2n−1+2=2⋅2n−1−1=2n−1.\begin{gather*} a_{n+1} = 3a_n-2a_{n-1} = 3(2^{n-1}-1)-2(2^{n-2}-1)= \cr = 3\cdot 2^{n-1} - 3 - 2^{n-1} + 2 = 2\cdot 2^{n-1}-1 = 2^n-1. \end{gather*}an+1​=3an​−2an−1​=3(2n−1−1)−2(2n−2−1)==3⋅2n−1−3−2n−1+2=2⋅2n−1−1=2n−1.​

Exercise 10

A sequence is defined by the rule a1=1a_1 = 1a1​=1, a2=5a_2 = 5a2​=5 and

an=5an−1−6an−2a_n = 5a_{n-1}-6a_{n-2}an​=5an−1​−6an−2​

for n≥3n \ge 3n≥3. Guess the explicit formula for ana_nan​ and prove it by induction.

✓Solution

By trying a1=1,a2=5,a3=19,a4=65,a5=211a_1=1, a_2=5, a_3=19, a_4=65, a_5=211a1​=1,a2​=5,a3​=19,a4​=65,a5​=211 we guess the formula an=3n−2na_n = 3^n-2^nan​=3n−2n. We prove this by induction:

For n=1n=1n=1, 3−2=13-2=13−2=1 holds. For n=2n=2n=2, 9−4=59-4=59−4=5 holds. Assume it holds for n−1n-1n−1 and nnn. Then

an+1=5an−6an−1=5(3n−2n)−6(3n−1−2n−1)==5⋅3n−5⋅2n−2⋅3n+3⋅2n==3⋅3n−2⋅2n=3n+1−2n+1.\begin{gather*} a_{n+1} = 5a_n-6a_{n-1} = 5(3^n-2^n)-6(3^{n-1}-2^{n-1}) = \cr = 5\cdot 3^n - 5\cdot 2^n - 2\cdot 3^n + 3\cdot 2^n = \cr = 3\cdot 3^n - 2\cdot 2^n = 3^{n+1}-2^{n+1}. \end{gather*}an+1​=5an​−6an−1​=5(3n−2n)−6(3n−1−2n−1)==5⋅3n−5⋅2n−2⋅3n+3⋅2n==3⋅3n−2⋅2n=3n+1−2n+1.​

Problem 4

Prove that for all natural numbers nnn it holds that Fn≤φn−1F_n \le \varphi^{n-1}Fn​≤φn−1, where FnF_nFn​ are the Fibonacci numbers and φ=1+52\varphi = \frac{1+\sqrt{5}}{2}φ=21+5​​ is the golden ratio.

1Hint

We verify numerically for n=1n=1n=1 and n=2n=2n=2. Then induction works. Namely, Fn=Fn−1+Fn−2F_n = F_{n-1}+F_{n-2}Fn​=Fn−1​+Fn−2​ for n≥3n \ge 3n≥3.

2Hint

The key in the proof by induction is the fact that ϕ\phiϕ has this magical property that ϕ2=ϕ+1\phi^2=\phi+1ϕ2=ϕ+1 (verify).

✓Solution

We will perform the proof by strong induction. For n=1n=1n=1 we have F1=1=φ0F_1 = 1 = \varphi^0F1​=1=φ0. For n=2n=2n=2 we have F2=1≤φ1F_2 = 1 \le \varphi^1F2​=1≤φ1, which holds, since φ>5/2>1\varphi > \sqrt{5}/2 > 1φ>5​/2>1.

Assume that the statement holds for a given nnn and n−1n-1n−1. For n+1n+1n+1 we have Fn+1=Fn+Fn−1F_{n+1} = F_n + F_{n-1}Fn+1​=Fn​+Fn−1​. From the induction hypothesis, we can estimate this sum:

Fn+1≤φn−1+φn−2=φn−2(φ+1).F_{n+1} \le \varphi^{n-1} + \varphi^{n-2} = \varphi^{n-2}(\varphi+1).Fn+1​≤φn−1+φn−2=φn−2(φ+1).

For φ\varphiφ, the identity φ2=φ+1\varphi^2 = \varphi+1φ2=φ+1 holds, which we easily verify. By substituting we have

Fn+1≤φn−2⋅φ2=φn.F_{n+1} \le \varphi^{n-2}\cdot \varphi^2 = \varphi^n.Fn+1​≤φn−2⋅φ2=φn.

Thus the induction step is proved.

Problem 5

Let xxx be a real number such that x+1xx+\frac{1}{x}x+x1​ is rational. Prove that then xn+1xnx^n+\frac{1}{x^n}xn+xn1​ is also rational for every natural nnn.

1Hint

We proceed by induction. In order to make the transition from nnn to n+1n+1n+1, we multiply the expression for nnn by a suitable expression.

2Hint

The crucial multiplication is by the expression x+1xx+\frac 1xx+x1​. Subsequently, it appears that our induction hypothesis must be strong.

✓Solution

We prove the statement by strong induction. For n=1n=1n=1, x+1xx+\frac{1}{x}x+x1​ is rational directly from the problem statement. For n=2n=2n=2 we have

x2+1x2=(x+1x)2−2,x^2+\frac{1}{x^2} = \left(x+\frac{1}{x}\right)^2-2,x2+x21​=(x+x1​)2−2,

which is obviously a rational number.

Assume that the statement holds for all natural numbers up to some nnn and n−1n-1n−1, where n≥2n \ge 2n≥2.

xn+1+1xn+1=(xn+1xn)(x+1x)−(xn−1+1xn−1).x^{n+1}+\frac{1}{x^{n+1}} = \left(x^n+\frac{1}{x^n}\right)\left(x+\frac{1}{x}\right) - \left(x^{n-1}+\frac{1}{x^{n-1}}\right).xn+1+xn+11​=(xn+xn1​)(x+x1​)−(xn−1+xn−11​).

By the induction hypothesis, xn+1xnx^n+\frac{1}{x^n}xn+xn1​ and xn−1+1xn−1x^{n-1}+\frac{1}{x^{n-1}}xn−1+xn−11​ are rational numbers. Since x+1xx+\frac{1}{x}x+x1​ is as well, the entire right side is rational. Thus the induction step is complete.

4Other forms of induction

In the tasks so far, we have encountered two types of induction hypotheses: that the statement holds for nnn or that it holds for suitable numbers not exceeding nnn. Consequently, we proved from this that the statement holds for n+1n+1n+1. However, imagination has no limits, and an induction can easily occur where we prove n+2n+2n+2 from nnn; therefore, we need to prove the statement for consecutive base cases.

Example 7

Prove that every integer n≥2n \ge 2n≥2 can be written in the form n=2a+3bn = 2a + 3bn=2a+3b, where a,ba,ba,b are non-negative integers.

✓Solution

We will do the proof by induction with a step of 2, which means we need two consecutive base cases. We verify them directly: 2=2⋅12 = 2\cdot 12=2⋅1 and 3=3⋅13 = 3\cdot 13=3⋅1.

Assume that the statement holds for a given n≥2n \ge 2n≥2, that is, n=2a+3bn = 2a+3bn=2a+3b. Then n+2=2(a+1)+3bn+2 = 2(a+1)+3bn+2=2(a+1)+3b, so the statement holds for n+2n+2n+2 as well.

Note. As a point of interest, let us add how it is in general: For two coprime positive integers ppp and qqq, it can be proved that every integer n≥pq−p−q+1n \ge pq-p-q+1n≥pq−p−q+1 can be written as n=pa+qbn=pa+qbn=pa+qb for non-negative a,ba,ba,b. The number pq−p−qpq-p-qpq−p−q is called the Frobenius number and it is the largest integer that cannot be written in this way. In our case it is 2⋅3−2−3=12\cdot 3-2-3=12⋅3−2−3=1.

Another fascinating form of induction can be seen in Cauchy's proof of the AM-GM inequality, where we first go up and then down.

Theorem 2

AM-GM inequality

For positive real numbers a1,a2,…,ana_1,a_2,\dots,a_na1​,a2​,…,an​ it holds that

a1+a2+⋯+ann≥a1a2⋯ann,\frac{a_1+a_2+\cdots+a_n}{n} \ge \sqrt[ n ] {a_1 a_2 \cdots a_n},na1​+a2​+⋯+an​​≥na1​a2​⋯an​​,

with equality occurring if and only if a1=a2=⋯=ana_1=a_2=\cdots=a_na1​=a2​=⋯=an​.

Proof

The proof consists of two steps: first we prove the inequality for all powers of two n=2kn = 2^kn=2k and then we show that from the validity for nnn variables follows the validity for n−1n-1n−1 variables. These two steps combined cover all natural numbers.

Step upwards (from nnn to 2n2n2n): For n=2n=2n=2 we need to prove that a1+a22≥a1a2\frac{a_1+a_2}{2} \ge \sqrt{a_1 a_2}2a1​+a2​​≥a1​a2​​, which is equivalent to (a1−a2)2≥0(\sqrt{a_1}-\sqrt{a_2})^2 \ge 0(a1​​−a2​​)2≥0, and this always holds.

Assume that the inequality holds for nnn variables. For 2n2n2n variables, we divide the numbers into two groups of nnn. Let us denote

A=a1+⋯+ann,B=an+1+⋯+a2nn.A = \frac{a_1+\cdots+a_n}{n}, \qquad B = \frac{a_{n+1}+\cdots+a_{2n}}{n}.A=na1​+⋯+an​​,B=nan+1​+⋯+a2n​​.

From the induction hypothesis for both groups we have

A≥a1⋯ann,B≥an+1⋯a2nn.A \ge \sqrt[ n ]{a_1 \cdots a_n}, \qquad B \ge \sqrt[ n ]{a_{n+1} \cdots a_{2n}}.A≥na1​⋯an​​,B≥nan+1​⋯a2n​​.

The average of all 2n2n2n numbers is A+B2\frac{A+B}{2}2A+B​. From the base case for two variables we know that A+B2≥AB\frac{A+B}{2} \ge \sqrt{AB}2A+B​≥AB​. Thus

a1+⋯+a2n2n=A+B2≥AB≥≥a1⋯ann⋅an+1⋯a2nn=a1⋯a2n2n.\begin{gather*} \frac{a_1+\cdots+a_{2n}}{2n} = \frac{A+B}{2} \ge \sqrt{AB} \ge \cr \ge \sqrt{\sqrt[ n ]{a_1 \cdots a_n} \cdot \sqrt[ n ]{a_{n+1} \cdots a_{2n}}} = \sqrt[ 2n ]{a_1 \cdots a_{2n}}. \end{gather*}2na1​+⋯+a2n​​=2A+B​≥AB​≥≥na1​⋯an​​⋅nan+1​⋯a2n​​​=2na1​⋯a2n​​.​

Step downwards (from nnn to n−1n-1n−1): Assume that the inequality holds for nnn variables, and let us take n−1n-1n−1 non-negative real numbers a1,…,an−1a_1,\dots,a_{n-1}a1​,…,an−1​. Let us set

an=a1+⋯+an−1n−1a_n = \frac{a_1+\cdots+a_{n-1}}{n-1}an​=n−1a1​+⋯+an−1​​

thus ana_nan​ is the average of the remaining numbers. Then

a1+⋯+an−1+ann=(n−1)an+ann=an.\frac{a_1+\cdots+a_{n-1}+a_n}{n} = \frac{(n-1)a_n+a_n}{n} = a_n.na1​+⋯+an−1​+an​​=n(n−1)an​+an​​=an​.

From the assumption for nnn variables we obtain

an=a1+⋯+ann≥a1⋯an−1⋅ann.a_n = \frac{a_1+\cdots+a_n}{n} \ge \sqrt[ n ]{a_1 \cdots a_{n-1} \cdot a_n}.an​=na1​+⋯+an​​≥na1​⋯an−1​⋅an​​.

By raising to the nnn-th power we have ann≥a1⋯an−1⋅ana_n^n \ge a_1 \cdots a_{n-1} \cdot a_nann​≥a1​⋯an−1​⋅an​, and after dividing by the positive ana_nan​ we get

ann−1≥a1⋯an−1,a_n^{n-1} \ge a_1 \cdots a_{n-1},ann−1​≥a1​⋯an−1​,

that is

(a1+⋯+an−1n−1)n−1≥a1⋯an−1,\left(\frac{a_1+\cdots+a_{n-1}}{n-1}\right)^{n-1} \ge a_1 \cdots a_{n-1},(n−1a1​+⋯+an−1​​)n−1≥a1​⋯an−1​,

which is exactly the AM-GM inequality for n−1n-1n−1 variables raised to the n−1n-1n−1 power.

5What we have learned

  • Statements involving natural numbers can often be proved by mathematical induction.
  • In it, we can assume not only that the statement holds for a given nnn, but directly that it holds e.g. for 1,2,…,n1,2,\dots,n1,2,…,n.
  • Sometimes it is worthwhile to also make large steps, e.g., from nnn to n+2n+2n+2, or 2n2n2n, or even backwards (from nnn to n−1n-1n−1).
  • Induction can be not only on one variable, but also on the sum of variables.

What to watch out for:

  • We always verify small cases and write down in the solution that we have done so.
  • Induction can fail because for the induction step we need e.g. n≥2n \ge 2n≥2 instead of n≥1n \ge 1n≥1; it pays off to do this induction step at least in our head for small nnn.
  • If the induction uses both n−1n-1n−1 and n−2n-2n−2, then we need two base cases.

6Problems

There are truly many examples of induction and, as we have seen, it can be used in virtually all areas of mathematics to prove many interesting ideas. You can try the following ones, ordered roughly by difficulty.

Problem 6

Prove that for every natural number nnn there exist natural numbers a,ba,ba,b such that (1+2)n=a2+b(1+\sqrt{2})^n = a\sqrt{2}+b(1+2​)n=a2​+b.

1Hint

We proceed by induction. The key is to multiply the assumed equality (1+2)n=a2+b(1+\sqrt{2})^n = a\sqrt{2}+b(1+2​)n=a2​+b by 1+21+\sqrt{2}1+2​ and simplify.

✓Solution

For n=1n=1n=1 we have (1+2)1=1⋅2+1(1+\sqrt{2})^1 = 1\cdot\sqrt{2}+1(1+2​)1=1⋅2​+1, so a=b=1a=b=1a=b=1. Assume that (1+2)n=a2+b(1+\sqrt{2})^n = a\sqrt{2}+b(1+2​)n=a2​+b for some natural a,ba,ba,b. Then

(1+2)n+1=(a2+b)(1+2)==(2a+b)2+(a+b).\begin{gather*} (1+\sqrt{2})^{n+1} = (a\sqrt{2}+b)(1+\sqrt{2}) = \cr = (2a+b)\sqrt{2}+(a+b). \end{gather*}(1+2​)n+1=(a2​+b)(1+2​)==(2a+b)2​+(a+b).​

Since a,ba,ba,b are natural, so are 2a+b2a+b2a+b and a+ba+ba+b, which is the statement for n+1n+1n+1.

Problem 7

We have nnn light bulbs in a row. Initially, all are off. Each minute we either switch on exactly one bulb that is off, or switch off exactly one bulb that is on. Prove that we can choose the moves so that we pass through every one of all possible configurations exactly once.

1Hint

To even be sure that we have passed through all configurations, we need to know how many there are.

2Hint

The answer to the question of the number of configurations follows from realizing that each bulb is either on or off, so we can use the multiplication rule.

3Hint

Let us try small cases. If we can construct an algorithm for some nnn, can we modify it to construct an algorithm for n+1n+1n+1?

4Hint

For n=1n=1n=1 a single switch suffices. For n=2n=2n=2 we proceed e.g.\ as follows: XX→XO→OX→XXXX \rightarrow XO \rightarrow OX \rightarrow XXXX→XO→OX→XX. What about n=3n=3n=3? The trick is to leave the last bulb off at the beginning and then switch it on at the right moment. Formally, we use the induction hypothesis that we can solve the problem for smaller nnn.

✓Solution

First of all, since each bulb can be either on or off, the total number of configurations for nnn bulbs is 2n2^n2n by the multiplication rule.

Now we prove the statement that all configurations are reachable by switching, by mathematical induction on nnn.

For n=1n=1n=1 we have 21=22^1=221=2 configurations; it suffices to switch the off bulb on. Assume that for some nnn there exists a suitable sequence of moves passing through all 2n2^n2n configurations, starting from the state where all bulbs are off.

For n+1n+1n+1 bulbs we proceed as follows: First we leave the (n+1)(n+1)(n+1)-st bulb off and on the first nnn bulbs we perform the assumed sequence of moves for nnn bulbs. This way we pass through all 2n2^n2n configurations in which the last bulb is off. Next, we switch on the (n+1)(n+1)(n+1)-st bulb. Now we perform the sequence of moves for the first nnn bulbs in reverse order (from end to start). Since the original sequence always changed exactly one bulb, the same holds for the reversed one. This way we successively pass through the remaining 2n2^n2n configurations in which the last bulb is on.

Altogether, we pass through 2n+2n=2n+12^n+2^n=2^{n+1}2n+2n=2n+1 configurations, and since we first passed through all those with the last bulb off and then all those with the last bulb on, no configuration was repeated. We have therefore visited every configuration exactly once, completing the induction step.

Problem 8

We have a chocolate bar of size 8×38 \times 38×3. We break it along gridlines into individual 1×11 \times 11×1 squares (along gridlines means we cannot break off e.g.\ a corner). What is the smallest number of breaks we need so that all pieces are 1×11 \times 11×1?

1Hint

When trying it out, we may notice that the number of breaks is always the same. Is this a coincidence? What is the number? How would we prove it?

2Hint

The trick is to solve a generalized problem: We have a chocolate bar m×nm \times nm×n, where m,nm,nm,n are natural numbers. We want to prove that breaking it up requires mn−1mn-1mn−1 breaks. The right approach is mathematical induction, e.g.\ on m+nm+nm+n.

3Hint

The idea of the induction hypothesis is that by breaking an m×nm \times nm×n bar we always produce two bars whose dimension sums are smaller than the original m+nm+nm+n. We can therefore apply the induction hypothesis, and the rest is just a bit of algebra.

✓Solution

We show that for a chocolate bar m×nm \times nm×n we always need exactly mn−1mn-1mn−1 breaks regardless of strategy. For the given bar 8×38 \times 38×3 this is 232323 breaks.

We prove the statement by strong induction on the sum of dimensions k=m+nk = m+nk=m+n. For k=2k=2k=2 we have a 1×11 \times 11×1 bar, which needs no breaking, corresponding to 1⋅1−1=01\cdot 1 - 1 = 01⋅1−1=0.

Assume that the statement holds for all bars with dimension sum strictly less than m+nm+nm+n. The first break of the bar m×nm \times nm×n produces two smaller pieces. Without loss of generality, assume we broke the side of length mmm. This creates pieces m1×nm_1 \times nm1​×n and m2×nm_2 \times nm2​×n, where m1+m2=mm_1+m_2=mm1​+m2​=m.

The dimension sums of both new pieces are m1+nm_1+nm1​+n and m2+nm_2+nm2​+n, which are obviously strictly less than m+nm+nm+n. By the induction hypothesis, fully breaking them requires m1n−1m_1n-1m1​n−1 and m2n−1m_2n-1m2​n−1 breaks. The total number of breaks for our bar is therefore

1+(m1n−1)+(m2n−1)=(m1+m2)n−1=mn−1.1 + (m_1n-1) + (m_2n-1) = (m_1+m_2)n - 1 = mn-1.1+(m1​n−1)+(m2​n−1)=(m1​+m2​)n−1=mn−1.

The induction step is complete. Every way of breaking up an m×nm \times nm×n bar requires exactly mn−1mn-1mn−1 steps, so the minimum number is also mn−1mn-1mn−1.

Problem 9*

We have 42 cities such that between every two there is a one-way street. Prove that there exists a city from which we can depart and pass through all other cities.

1Hint

The key is to use strong induction. We solve small cases. The crucial question however is: how can we use the case for smaller nnn for the larger one?

2Hint

The trick is to pick one city, say XXX, and split the remaining cities into two sets: the set of cities from which a street leads to XXX; and the set of cities to which a street leads from XXX. Some of these sets may be empty. However, both will certainly be smaller than the final set (since we set XXX aside).

✓Solution

We prove the statement by strong induction on the number of cities nnn.

For n=1n=1n=1 the statement obviously holds (the path consists of a single city). Assume that the statement holds for every number of cities kkk, where 1≤k<n1 \le k < n1≤k<n.

Let us have n≥2n \ge 2n≥2 cities. Pick any city XXX. We split the remaining n−1n-1n−1 cities into two disjoint sets: set AAA consisting of cities from which a street leads to XXX, and set BBB consisting of cities to which a street leads from XXX.

If set AAA is non-empty, 1≤∣A∣<n1 \le |A| < n1≤∣A∣<n holds. By the induction hypothesis for this set alone, there exists a path passing through all its cities; let this path end at some city U∈AU \in AU∈A. Since U∈AU \in AU∈A, a street leads from UUU to XXX.

If set BBB is non-empty, 1≤∣B∣<n1 \le |B| < n1≤∣B∣<n holds. Similarly, by the induction hypothesis for this set alone, there exists a path passing through all its cities; let this path start at some city V∈BV \in BV∈B. Since V∈BV \in BV∈B, a street leads from XXX to VVV.

We construct the overall path by joining these segments and distinguish cases depending on whether the sets were empty:

  • If both AAA and BBB are non-empty, we traverse the path in AAA ending at UUU, go from there to XXX, and then from XXX to VVV, from where we traverse the rest of the path in BBB.
  • If AAA is empty (so ∣B∣=n−1≥1|B|=n-1 \ge 1∣B∣=n−1≥1), we start directly at XXX and go to VVV, from where we continue with the path in BBB.
  • If BBB is empty (so ∣A∣=n−1≥1|A|=n-1 \ge 1∣A∣=n−1≥1), we traverse the path in AAA ending at UUU and go from there to XXX, where our path ends.

In every case we have constructed a path passing through all nnn cities, completing the induction step. The statement therefore holds for all natural nnn, in particular for n=42n=42n=42.

Problem 10*

Prove that for all natural numbers nnn and kkk it holds that

k!∣n(n+1)(n+2)⋯(n+k−1)k! \mid n(n+1)(n+2)\cdots(n+k-1)k!∣n(n+1)(n+2)⋯(n+k−1)

(in words: k!k!k! divides the product of kkk consecutive natural numbers)

1Hint

We have two natural variables nnn and kkk; it is not clear which one to induct on. The idea is to induct on both, first on kkk and, when trying to prove the claim for k+1k+1k+1, to induct on nnn.

2Hint

First we solve k=1k=1k=1. Then we assume the claim holds for k−1k-1k−1. Next we go by induction on nnn. After solving n=1n=1n=1, we assume validity for a given nnn. Then, denoting P(n)=n(n+1)⋯(n+k−1)P(n) = n(n+1)\cdots(n+k-1)P(n)=n(n+1)⋯(n+k−1), we need k!∣P(n+1)k! \mid P(n+1)k!∣P(n+1). The trick is to look at P(n+1)−P(n)P(n+1)-P(n)P(n+1)−P(n). Along the way, it pays off to find where we can use the induction hypothesis for k−1k-1k−1.

✓Solution

Denote P(n)=n(n+1)⋯(n+k−1)P(n) = n(n+1)\cdots(n+k-1)P(n)=n(n+1)⋯(n+k−1). We prove the statement by induction on kkk.

For k=1k=1k=1 we have P(n)=nP(n) = nP(n)=n and 1!=11! = 11!=1, which holds trivially. Assume the claim holds for k−1k-1k−1, i.e.\ (k−1)!(k-1)!(k−1)! divides the product of any k−1k-1k−1 consecutive numbers. We now prove that k!∣P(n)k! \mid P(n)k!∣P(n) for all nnn, by induction on nnn.

For n=1n=1n=1 we have P(1)=k!P(1) = k!P(1)=k!, which is obviously divisible by k!k!k!. Assume validity for a given nnn. Then

P(n+1)−P(n)=(n+1)(n+2)⋯(n+k)−n(n+1)⋯(n+k−1)==(n+1)(n+2)⋯(n+k−1)⋅[(n+k)−n]=k⋅(n+1)(n+2)⋯(n+k−1).\begin{gather*} P(n+1) - P(n) = (n+1)(n+2)\cdots(n+k) - n(n+1)\cdots(n+k-1)=\\ = (n+1)(n+2)\cdots(n+k-1)\cdot[(n+k)-n] = k\cdot (n+1)(n+2)\cdots(n+k-1). \end{gather*}P(n+1)−P(n)=(n+1)(n+2)⋯(n+k)−n(n+1)⋯(n+k−1)==(n+1)(n+2)⋯(n+k−1)⋅[(n+k)−n]=k⋅(n+1)(n+2)⋯(n+k−1).​

The expression (n+1)(n+2)⋯(n+k−1)(n+1)(n+2)\cdots(n+k-1)(n+1)(n+2)⋯(n+k−1) is a product of k−1k-1k−1 consecutive numbers, so by the induction hypothesis for k−1k-1k−1 it is divisible by (k−1)!(k-1)!(k−1)!. Thus P(n+1)−P(n)P(n+1)-P(n)P(n+1)−P(n) is divisible by k⋅(k−1)!=k!k\cdot(k-1)! = k!k⋅(k−1)!=k!. Since P(n)P(n)P(n) is divisible by k!k!k!, so is P(n+1)P(n+1)P(n+1).

Note. There is a simple combinatorial proof of this problem – the claim follows from the fact that the binomial coefficient

(n+k−1k)=n(n+1)(n+2)⋯(n+k−1)k!{n+k-1 \choose k} = \frac{n(n+1)(n+2)\cdots(n+k-1)}{k!}(kn+k−1​)=k!n(n+1)(n+2)⋯(n+k−1)​

is an integer. The solution above shows that it can also be proved number-theoretically.

Comments

Obsah

  • 1Introduction
  • 2Basic induction
  • 3Strong induction
  • 4Other forms of induction
  • 5What we have learned
  • 6Problems
  • Comments
MathComps LogoMathComps

The long-term vision of the project is to create a platform for beginners and advanced solvers of mathematical competitions, their tutors, and all supporters.

Navigation

  • Problems
  • Handouts
  • Guide

Project

  • About
  • Sponsors

© 2026 MathComps•Patrik Bak•Privacy and terms

MathComps LogoMathComps
ProblemsHandoutsGuideNews
Sign in