The real numbers
You probably have a good intuition about the natural numbers (). Just count: . You also know how and work on very well. From there defining the integers () and then the rationals () is quite natural: it’d be great if we could reverse and , and defining is how we do that.1
Getting from to was not hard. But where on earth do the reals () come from?
Wrong answers only
We’re going to work backwards a little. We’ll touch on natural ideas that edge closer to the real numbers, then shine a spotlight on why the rationals are lacking (and how they naturally lead into the reals), and then finally revisit some of these topics with our newfound understanding of the reals.
At the same time I will reference the reals without explaining what they are yet. Sorry about that, but it helps to be able to say “this idea doesn’t perfectly match the final answer” even if we don’t know what the final answer is yet.
Algebraic numbers
You might think we’d like to invent the real numbers because we want to exist, or more precisely, we want to have a solution. Not entirely. There are two problems with : it contains too many numbers and at the same time too few.
Let’s take our desire of ’s existence to the extreme. We want roots for all rational polynomials: this necessarily implies the Fundamental Theorem of Algebra holds, i.e. that we can factor every polynomial into the form where all are rational.
Actually, the set of real numbers that are solutions to a rational polynomial are called the algebraic numbers (). Algebraic numbers are countable (because polynomials are countable) yet the reals are uncountable. By the way, the algebraic numbers are “enough” in this setting: the Fundamental Theorem of Algebra holds for . There is no reason to go “beyond” .
So there are real numbers that are not algebraic. The real numbers are, in one sense, more than the algebraic numbers. So ensuring roots to all polynomials could not possibly be the rationale.
At the same time there is no solution for in . So the algebraic numbers contain too many numbers, that is they contain complex numbers.
Cauchy sequences
I’d like to be able to write an infinite sequence of digits, like , and still have that be a number.
Yeah, this is pretty close. One of the official reasons for defining real numbers is so that this sequence of digits is some real number. (It’s a bit more general than that, but it is close!)
I want Intermediate Value Theorem to work
Yes! For the Intermediate Value Theorem we need continuity, which doesn’t do much for integers. A concept of continuity does exist for rationals and works reasonably well. But actually the intermediate value theorem fails for rationals.
Here is an example: consider the function if and if where sends rationals to rationals. This is continuous: if you know - proofs you can check this yourself, and if you don’t you can take my word for it. (In the reals this is discontinuous only at and since doesn’t exist it is now continuous!) Clearly we have and , but we have no satisfying . Ouch.
The same goes for many other theorems like Mean Value Theorem. All these things where we need “something in between” fail on !
So why is lacking?
Consider the set . You can easily bound this set from above. For instance, every element in is less than .
But is there a least upper bound? No. No matter what upper bound you give me, I can always give you a smaller one. If you give me , I will give you . And then you can give me . And so on and so forth: we can keep one-upping each other.
Obviously if we took the reals for this set we’d have a least upper bound: . But we want something more general. And in fact the following is true:
Given any non-empty set bounded from above, has a least upper bound .
By “least upper bound”, we mean given any upper bound for , we have . Also we’d obviously like for to contain .
Constructing the reals
But we cannot just will into existence. We actually need to prove it exists somehow. Here is the construction:
Dedekind Cuts. A Dedekind Cut is a non-empty set satisfying two properties: - contains no largest element. - If and then .
Intuitively, we cut the rational number line in half. All points to the left are in ; all points to the right are not in . Of course, this point at which we cut the rationals is not necessarily a rational. We cannot even rigorously define such a point (if we could, no reason to bother with Dedekind Cuts): we can only reason in terms of the elements of .
Here is the kicker. These Dedekind Cuts are our real numbers. They don’t merely represent real numbers. They are, under the hood, what we are referring to when we say “real number”. Now it is important to note there are other ways to define the real numbers such that they satisfy the properties we want them to. And it turns out these notions of the real numbers are all equivalent (formally they are isomorphic). This is well outside the scope of this post, but for now we take this definition of the reals and run with it.
Recall our wishlist: we wanted a least upper bound property and we wanted . Implicit in this wishlist is a notion of order (i.e. ), a notion of addition, and a notion of multiplication. Furthermore they have to be consistent with the respective notions from . Let’s get down to defining and checking these.
What are rational numbers in the reals?
A rational number , represented as a real, is simply . It is easy to check this is a Dedekind Cut.
Ordering
We say if and only if . It is easy to check that , where and are the reals associated with and .
Also we always have at least one of or , and when both are true . (When people say is a complete ordered field, this is what they mean by “complete ordered”.)
Addition
We say . We can check this is a Dedekind cut and it preserves ordering the same way addition does in the rationals.
Also we may define the additive inverse: where is not the minimal element in .
Multiplication
This is a bit more annoying. First we have to define positive and negative reals in the obvious way. (Remember reals are cuts and comparison with is through subset comparison.) For positive reals we define . If is negative and is positive then . There are more cases that I cannot be bothered to list here because they provide no further instructive value.
Right answers only
The details above are not particularly important. Here is what matters: is a superset of that preserves ordering, addition, and multiplication. Furthermore, it gains the least upper bound property (“any non-empty set bounded from above has a least upper bound”). Through that it gains some other properties.
Cauchy sequences
The terms in the sequence eventually get arbitrarily close to each other. Formally, for any , there is some such that for all , . (Any sequence satisfying this condition is called a Cauchy sequence.) If we consider as a sequence of rationals, it doesn’t converge to anything. We know this because it converges to in the reals, which is not a rational number, and limits are unique in the reals. (This is not obvious and requires proof, but here we take it for granted.)
So there are rational Cauchy sequences that do not have a limit. This is the other sense in which the rationals were lacking. The reals, however, do not have this problem. Any Cauchy sequence in the reals also converges to a real number.
The Intermediate Value Theorem
The full proof in all its detail is in Section 1.6 of Charles Pugh’s Real Mathematical Analysis.
Formally, the Intermediate Value Theorem states
Let be a continuous function on and suppose . If then there is some such that .
Continuity is defined with the usual - criteria: if you don’t know them and are curious you may easily Google it.
This is true in the reals, and the proof fully uses the least upper bound property. The proof also relies on the facts that continuous functions on a closed interval of are bounded: because it is longer we do not reproduce it here. We just take it for granted. It is proven in a very similar fashion.
Let be the set of values taken by for ; in set notation, . Obviously each is non-empty and we took for granted that was bounded, so has a least upper bound; let it be . Set . Clearly is non-empty as and is bounded from above by . We claim .
We may use the - condition of continuity to show that and both lead to contradictions. Very informally, continuity tells us that if , then there’s some slightly larger than such that as well (and also for all ). Likewise for . So we must have .
In more technical terms, is a field: we have a and operation, an identity with (i.e. ), an identity with (i.e. ), and additive/multiplicative inverses. Actually, is the smallest field containing .↩︎