Mathematical induction
1Introduction
The word induction generally means a mental process from the specific to the general
. When proving problems, we often think this way – we play with small cases and investigate how to discover new ones from previously derived results. Mathematical induction is a formal proof method based on this idea. In this material, we will formalize it and then demonstrate it on various examples.
2Basic induction
We illustrate the idea of mathematical induction using dominoes. They have a property: if the -th one falls, the -th one will also fall. Thanks to this, if we knock over the first one, the second one falls as well, and therefore the third one, etc. The conclusion is that they all fall.
Example 1
Prove that the sum of the first natural numbers is equal to .
✓Solution
For we get , which holds. Assume that the statement holds for some . For , the new expression on the left is equal to
which is exactly the expression on the right side for . The proof by induction is complete.
In general, a proof by induction consists of two steps:
- Proof of the statement for some initial value .
- Proof that if the statement holds for some , then it holds for .
Try this approach on these memorable identities.
Exercise 1
Prove the following identities for all natural numbers :
✓Solution
We will prove all five identities by induction on .
- By trying small cases, we guess the formula. For , holds. Assume that the statement holds for a given . Thenwhich is exactly the statement for .
- For , holds. Assume validity for a given . Thenwhere the last equality follows from factoring out and simplifying
- We know thatso we are provingFor , holds. Assume validity for a given . Then
- For , holds. Assume validity for a given . Then
- For , holds. Assume validity for a given . By adding we get
Exercise 2
Prove by induction the following formulas for the sums of the first terms of known sequences.
- Sum of an arithmetic sequence:
- Sum of a geometric sequence ():
- Sum of an arithmetico-geometric sequence ():
✓Solution
We will prove all three formulas by induction on .
- For , holds. Assume validity for a given . By adding the -th term to the right side we get
- For , holds. Assume validity for a given . By adding we get
- For , holds. Assume validity for a given . By adding to the right side we getAfter multiplying out and simplifying, the numerator simplifies to .As a matter of interest, let us add that this formula can be obtained by differentiating the previous formula written for .
Exercise 3
Prove the following identities for Fibonacci numbers, defined recursively as , and :
✓Solution
We prove all three by induction on .
- For , holds. AssumeBy adding we havewhich is the statement for .
- For , holds. AssumeBy adding we get
- For , holds. AssumeBy adding we have
So far, we have proved statements for all natural numbers . In the following example, we will show that this is not necessary and induction can sometimes start later. This example also opens up another type of problems where induction is useful – inequalities.
Example 2
For which natural numbers does hold?
✓Solution
By testing small cases, we get strange behavior: For and the statement holds. One might think it holds always. However, for we have . For , things are fine again and we have . For then , further . The differences are increasing, it seems the statement will hold always from now on. We will formally prove it by mathematical induction for .
For the statement holds. Assume that it holds for a given , that is, . Then we estimate:
It would therefore be sufficient to prove that , then altogether we would get .
We have if and only if is non-negative. For , the expression is obviously increasing and for it equals , so the statement holds.
Note that the residual inequality obviously holds already for . The problem is that the original inequality does not hold for , so the induction really could not have started earlier.
The answer to the question from the problem is all natural numbers except .
Try a few inequalities for practice.
Exercise 4
Prove that for all natural numbers it holds that .
✓Solution
For , holds. Assume that for a given . Then
since for .
Exercise 5
Prove that for all natural numbers it holds that .
✓Solution
For , holds. Assume that for a given . Then
now we use that , so
Exercise 6
Prove that for all natural numbers it holds that
✓Solution
For , holds. Assume validity for a given . Denote . Then
It suffices to verify that
after rewriting with a common denominator we get
which obviously holds.
Exercise 7
Prove Bernoulli's inequality: for real and natural it holds that
✓Solution
For , holds. Assume that
for a given . Since , we can multiply both sides by the expression :
We need to prove that the last expression is at least , which is obvious, because is non-negative.
We can also use induction to prove divisibility.
Example 3
Prove that for all natural numbers it holds that .
✓Solution
For , and holds. Assume that for a given . Then
The first term is divisible by 6 by the induction hypothesis. In the second term, the number is the product of two consecutive numbers, hence even, and after multiplying by 3 we get a multiple of 6.
Example 4
Prove that for all natural numbers it holds that .
✓Solution
For , and holds. Assume that for a given . Then
The first addend is divisible by 3 by the induction hypothesis, and is obviously divisible by 3, therefore their sum is as well.
Exercise 8
Prove that for all natural numbers it holds that .
✓Solution
For , and holds. Assume that for a given . Then
The first addend is divisible by 9 by the induction hypothesis and the second is obviously a multiple of 9.
Exercise 9
Prove that for all natural numbers it holds that .
✓Solution
For , and holds. Assume that . Then
The first addend is divisible by 31 by the induction hypothesis and the second is obviously a multiple of 31.
In a proof, it is really necessary to verify the first step, otherwise anything can be proved. For example, that a number of the form is always even. Namely, if it holds for a given , then for we have , which we easily convince ourselves is equal to . By the induction hypothesis, is even, and obviously is even, so we have the sum of two even numbers, thus the proof
is complete.
The problem is that if we now plug in any , we get an odd number. Where is the mistake? Well, we did not verify that the statement holds for – then the expression is equal to 3 and is odd. So if we were proving that this expression is always odd, then together with the step for the proof would be complete.
This problem presents another potential pitfall.
Problem 1
Find the mistake in the proof
of this statement:
Let us have lines in the plane such that no two are parallel. Then all these lines pass through a single point.
Proof
: For the statement holds, because two non-parallel lines intersect in a single point.
Induction step: Let the statement hold for some lines. Let us take lines , out of which no two are parallel.
The first lines pass through a single point by the induction hypothesis. Similarly, the lines (there are also of them) pass through a single point . Since the lines pass through both points and , necessarily . Therefore, all lines pass through point .
1Hint
It is evident that the statement does not hold already for . To understand the mistake, try writing out the induction step for equal to 2 (where we prove that the statement holds for lines).
✓Solution
The mistake is in the sentence Since the lines pass through both points and , necessarily .
For , the sequence of lines is actually just a single line. Whenever , the statement would indeed hold, and that causes the confusion.
3Strong induction
In previous problems, we always said in the induction step that the statement holds for some and subsequently proved it for . In fact, however, we can assume something stronger. For example, let's have a statement for all natural numbers that we are proving by induction. First, of course, we prove it for . Subsequently, we assume that it holds for all and prove it for – instead of just assuming that it holds for some . This turn is very common in more complex proofs and is called strong induction. We illustrate it with an example:
Theorem 1
Prime factorization theoremEvery integer can be factored into a product of several, not necessarily distinct, primes.
Proof
The statement obviously holds for , which is itself a prime. Assume that we have proved the statement for . Now let us take the number . If it is a prime, we are done. If it is not a prime, it means that there exist two integers and such that . Since both and are more than 1, both are less than , thus at most . We can apply the induction hypothesis to both of them, namely, that they can be written as a product of primes. Therefore, their product can also be written as a product of primes, which is what we needed to prove.
Notice that we could not use traditional induction in this proof; we strictly need the numbers to be smaller than .
Furthermore, realize that regular induction is a special case of strong induction – after all, the assumption that the statement holds for all also includes the fact that it holds for , which is what we base regular induction on. When coming up with a solution by induction, it is therefore more useful to think straight away in the style of strong induction; we won't lose anything.
Another traditional result is the existence of a unique representation in the binary system.
Example 5
Prove that every natural number can be written as a sum of mutually distinct powers of two.
✓Solution
For , holds. Assume that the statement holds for all natural numbers less than . If is a power of two, we are done. Otherwise, we find the largest power of two . The number is positive and less than , so by the induction hypothesis, it can be written as a sum of distinct powers of two. Moreover, (because ), so does not appear in its decomposition. By adding , we get the representation of as a sum of distinct powers of two.
Try a similar proof on harder examples:
Problem 2
Zeckendorf's theorem, existenceProve that every positive integer can be written as a sum of mutually non-consecutive Fibonacci numbers.
1Hint
We proceed by induction. Let be the number we are trying to write as a sum. It pays off to consider the largest Fibonacci number not exceeding , e.g., , and apply the induction hypothesis to . Are we done?
2Hint
We are not done yet. We need to deal with the fact that the representation of might contain , which would break the non-consecutiveness. But what if ?
✓Solution
For the smallest natural numbers, the statement holds (e.g., ). Assume that every number less than can be written in the required way. Let us find the largest Fibonacci number . If , we are done. Otherwise, the number is positive and strictly less than , so by the induction hypothesis, we can write it as a sum of mutually non-consecutive Fibonacci numbers.
By adding , we would get the representation for . A problem would only occur if the number (or larger) appeared in the representation of . In such a case, the entire representation would be at least , and thus . From this, however, we get , which is a contradiction with being the largest Fibonacci number not exceeding .
Therefore, the representation for contains only numbers at most . Thus, supplementing with the number does not create a pair of consecutive Fibonacci numbers, and we get a valid representation of the number .
Another numeral system is the factorial base:
Problem 3
Factorial number systemProve that every positive integer can be written in the form
where for every .
1Hint
We proceed by induction. Let be the number we are trying to write in this way. Consider the largest number such that . The trick is to divide by with a remainder, that is, to write , where . Where can we apply the induction hypothesis? What remains to be proved?
2Hint
First of all, realize that (prove it). This means that for we are done, and that for we can apply the induction hypothesis to . Do we thus already obtain a satisfying expression?
✓Solution
For the statement holds, . Assume that every number less than can be written in the required way. Find the largest number such that and divide by with a remainder, so , where . Since was the largest possible, originally we must have had , which implies
We are guaranteed that the condition for the most significant digit is met. If , we are done. Otherwise, we have , so is strictly less than and we can apply the induction hypothesis to it. We thus get the representation .
Since , the largest factorial in the representation of can be at most , thus . By substituting this representation for into the expression , we thus obtain the sought representation of the number , and no two factorials add up together.
Let us add that uniqueness can be proved for both the Fibonacci and the factorial number systems (and of course for binary and our decimal one as well). The proof is easy but technical – it relies on a general principle: Let us have a system where we can express 1 and in which it holds that the largest number we can express using digits is 1 less than the smallest number we can express using digits. In such a system, both completeness (every number can be expressed) and uniqueness then hold. You can try to think this statement through for all the examined systems (binary, Fibonacci, and factorial); the already proved exercises can be a good help :)
Technically, one of the forms of strong induction is also when we use the induction hypothesis for only two smaller numbers, e.g., and . This is often seen in problems about recurrence sequences.
Example 6
A sequence is defined by the rule , and
for . Guess the explicit formula for and prove it by induction.
✓Solution
From the small values we guess . We prove this formula by induction:
For , holds. For , holds. Assume it holds for and . Then
Exercise 10
A sequence is defined by the rule , and
for . Guess the explicit formula for and prove it by induction.
✓Solution
By trying we guess the formula . We prove this by induction:
For , holds. For , holds. Assume it holds for and . Then
Problem 4
Prove that for all natural numbers it holds that , where are the Fibonacci numbers and is the golden ratio.
1Hint
We verify numerically for and . Then induction works. Namely, for .
2Hint
The key in the proof by induction is the fact that has this magical property that (verify).
✓Solution
We will perform the proof by strong induction. For we have . For we have , which holds, since .
Assume that the statement holds for a given and . For we have . From the induction hypothesis, we can estimate this sum:
For , the identity holds, which we easily verify. By substituting we have
Thus the induction step is proved.
Problem 5
Let be a real number such that is rational. Prove that then is also rational for every natural .
1Hint
We proceed by induction. In order to make the transition from to , we multiply the expression for by a suitable expression.
2Hint
The crucial multiplication is by the expression . Subsequently, it appears that our induction hypothesis must be strong.
✓Solution
We prove the statement by strong induction. For , is rational directly from the problem statement. For we have
which is obviously a rational number.
Assume that the statement holds for all natural numbers up to some and , where .
By the induction hypothesis, and are rational numbers. Since is as well, the entire right side is rational. Thus the induction step is complete.
4Other forms of induction
In the tasks so far, we have encountered two types of induction hypotheses: that the statement holds for or that it holds for suitable numbers not exceeding . Consequently, we proved from this that the statement holds for . However, imagination has no limits, and an induction can easily occur where we prove from ; therefore, we need to prove the statement for consecutive base cases.
Example 7
Prove that every integer can be written in the form , where are non-negative integers.
✓Solution
We will do the proof by induction with a step of 2, which means we need two consecutive base cases. We verify them directly: and .
Assume that the statement holds for a given , that is, . Then , so the statement holds for as well.
Note. As a point of interest, let us add how it is in general: For two coprime positive integers and , it can be proved that every integer can be written as for non-negative . The number is called the Frobenius number and it is the largest integer that cannot be written in this way. In our case it is .
Another fascinating form of induction can be seen in Cauchy's proof of the AM-GM inequality, where we first go up and then down.
Theorem 2
AM-GM inequalityFor positive real numbers it holds that
with equality occurring if and only if .
Proof
The proof consists of two steps: first we prove the inequality for all powers of two and then we show that from the validity for variables follows the validity for variables. These two steps combined cover all natural numbers.
Step upwards (from to ): For we need to prove that , which is equivalent to , and this always holds.
Assume that the inequality holds for variables. For variables, we divide the numbers into two groups of . Let us denote
From the induction hypothesis for both groups we have
The average of all numbers is . From the base case for two variables we know that . Thus
Step downwards (from to ): Assume that the inequality holds for variables, and let us take non-negative real numbers . Let us set
thus is the average of the remaining numbers. Then
From the assumption for variables we obtain
By raising to the -th power we have , and after dividing by the positive we get
that is
which is exactly the AM-GM inequality for variables raised to the power.
5What we have learned
- Statements involving natural numbers can often be proved by mathematical induction.
- In it, we can assume not only that the statement holds for a given , but directly that it holds e.g. for .
- Sometimes it is worthwhile to also make large steps, e.g., from to , or , or even backwards (from to ).
- Induction can be not only on one variable, but also on the sum of variables.
What to watch out for:
- We always verify small cases and write down in the solution that we have done so.
- Induction can fail because for the induction step we need e.g. instead of ; it pays off to do this induction step at least in our head for small .
- If the induction uses both and , then we need two base cases.
6Problems
There are truly many examples of induction and, as we have seen, it can be used in virtually all areas of mathematics to prove many interesting ideas. You can try the following ones, ordered roughly by difficulty.
Problem 6
Prove that for every natural number there exist natural numbers such that .
1Hint
We proceed by induction. The key is to multiply the assumed equality by and simplify.
✓Solution
For we have , so . Assume that for some natural . Then
Since are natural, so are and , which is the statement for .
Problem 7
We have light bulbs in a row. Initially, all are off. Each minute we either switch on exactly one bulb that is off, or switch off exactly one bulb that is on. Prove that we can choose the moves so that we pass through every one of all possible configurations exactly once.
1Hint
To even be sure that we have passed through all configurations, we need to know how many there are.
2Hint
The answer to the question of the number of configurations follows from realizing that each bulb is either on or off, so we can use the multiplication rule.
3Hint
Let us try small cases. If we can construct an algorithm for some , can we modify it to construct an algorithm for ?
4Hint
For a single switch suffices. For we proceed e.g.\ as follows: . What about ? The trick is to leave the last bulb off at the beginning and then switch it on at the right moment. Formally, we use the induction hypothesis that we can solve the problem for smaller .
✓Solution
First of all, since each bulb can be either on or off, the total number of configurations for bulbs is by the multiplication rule.
Now we prove the statement that all configurations are reachable by switching, by mathematical induction on .
For we have configurations; it suffices to switch the off bulb on. Assume that for some there exists a suitable sequence of moves passing through all configurations, starting from the state where all bulbs are off.
For bulbs we proceed as follows: First we leave the -st bulb off and on the first bulbs we perform the assumed sequence of moves for bulbs. This way we pass through all configurations in which the last bulb is off. Next, we switch on the -st bulb. Now we perform the sequence of moves for the first bulbs in reverse order (from end to start). Since the original sequence always changed exactly one bulb, the same holds for the reversed one. This way we successively pass through the remaining configurations in which the last bulb is on.
Altogether, we pass through configurations, and since we first passed through all those with the last bulb off and then all those with the last bulb on, no configuration was repeated. We have therefore visited every configuration exactly once, completing the induction step.
Problem 8
We have a chocolate bar of size . We break it along gridlines into individual squares (along gridlines means we cannot break off e.g.\ a corner). What is the smallest number of breaks we need so that all pieces are ?
1Hint
When trying it out, we may notice that the number of breaks is always the same. Is this a coincidence? What is the number? How would we prove it?
2Hint
The trick is to solve a generalized problem: We have a chocolate bar , where are natural numbers. We want to prove that breaking it up requires breaks. The right approach is mathematical induction, e.g.\ on .
3Hint
The idea of the induction hypothesis is that by breaking an bar we always produce two bars whose dimension sums are smaller than the original . We can therefore apply the induction hypothesis, and the rest is just a bit of algebra.
✓Solution
We show that for a chocolate bar we always need exactly breaks regardless of strategy. For the given bar this is breaks.
We prove the statement by strong induction on the sum of dimensions . For we have a bar, which needs no breaking, corresponding to .
Assume that the statement holds for all bars with dimension sum strictly less than . The first break of the bar produces two smaller pieces. Without loss of generality, assume we broke the side of length . This creates pieces and , where .
The dimension sums of both new pieces are and , which are obviously strictly less than . By the induction hypothesis, fully breaking them requires and breaks. The total number of breaks for our bar is therefore
The induction step is complete. Every way of breaking up an bar requires exactly steps, so the minimum number is also .
Problem 9*
We have 42 cities such that between every two there is a one-way street. Prove that there exists a city from which we can depart and pass through all other cities.
1Hint
The key is to use strong induction. We solve small cases. The crucial question however is: how can we use the case for smaller for the larger one?
2Hint
The trick is to pick one city, say , and split the remaining cities into two sets: the set of cities from which a street leads to ; and the set of cities to which a street leads from . Some of these sets may be empty. However, both will certainly be smaller than the final set (since we set aside).
✓Solution
We prove the statement by strong induction on the number of cities .
For the statement obviously holds (the path consists of a single city). Assume that the statement holds for every number of cities , where .
Let us have cities. Pick any city . We split the remaining cities into two disjoint sets: set consisting of cities from which a street leads to , and set consisting of cities to which a street leads from .
If set is non-empty, holds. By the induction hypothesis for this set alone, there exists a path passing through all its cities; let this path end at some city . Since , a street leads from to .
If set is non-empty, holds. Similarly, by the induction hypothesis for this set alone, there exists a path passing through all its cities; let this path start at some city . Since , a street leads from to .
We construct the overall path by joining these segments and distinguish cases depending on whether the sets were empty:
- If both and are non-empty, we traverse the path in ending at , go from there to , and then from to , from where we traverse the rest of the path in .
- If is empty (so ), we start directly at and go to , from where we continue with the path in .
- If is empty (so ), we traverse the path in ending at and go from there to , where our path ends.
In every case we have constructed a path passing through all cities, completing the induction step. The statement therefore holds for all natural , in particular for .
Problem 10*
Prove that for all natural numbers and it holds that
(in words: divides the product of consecutive natural numbers)
1Hint
We have two natural variables and ; it is not clear which one to induct on. The idea is to induct on both, first on and, when trying to prove the claim for , to induct on .
2Hint
First we solve . Then we assume the claim holds for . Next we go by induction on . After solving , we assume validity for a given . Then, denoting , we need . The trick is to look at . Along the way, it pays off to find where we can use the induction hypothesis for .
✓Solution
Denote . We prove the statement by induction on .
For we have and , which holds trivially. Assume the claim holds for , i.e.\ divides the product of any consecutive numbers. We now prove that for all , by induction on .
For we have , which is obviously divisible by . Assume validity for a given . Then
The expression is a product of consecutive numbers, so by the induction hypothesis for it is divisible by . Thus is divisible by . Since is divisible by , so is .
Note. There is a simple combinatorial proof of this problem – the claim follows from the fact that the binomial coefficient
is an integer. The solution above shows that it can also be proved number-theoretically.