.. index:: algorithms; greatest common divisor Euclid's algorithm greatest common divisor algorithm .. _gcd: Greatest Common Divisor ========================= Euclid's Algorithm ----------------------- The greatest common divisor of two non-zero integers is a great example to illustrate the power of loops. Everyone learns about the *concept* of a greatest common divisor when faced with a fraction that is not in *reduced* form. Consider the fraction :math:`\frac{2}{4}`, which is the same as :math:`\frac{1}{2}`. The fraction :math:`\frac{2}{4}` can be reduced, because the numerator and denominator both have greatest common factor of 2. That is, :math:`\frac{2}{4} = \frac{1 \cdot 2}{2 \cdot 2}`. So the factor of 2 can be canceled from both the numerator and the denominator. Euclid (the mathematician from classic times and author of *Elements*) is credited with having come up with a clever algorithm for how to compute the greatest common divisor efficiently. It is written as follows, where :math:`a \bmod b` means ``a % b`` in C#. .. math:: gcd(a, b) = gcd(b, a \bmod b) gcd(a, 0) = a It is common in mathematics to list functions as one or more *cases*. The way you read this is as follows: - In general, the greatest common divisor of ``a`` and ``b`` is the same as computing the greatest common divisor of ``b`` and the remainder of ``a`` divided by ``b``. - In the case where ``b`` is zero, the result is ``a``. This makes sense because ``a`` divides itself and 0. To gain some appreciation of how the definition *always* allows you to compute the greatest common divisor, it is worthwhile to try it out for a couple of numbers where you *know* the greatest common divisor. For example, we already know that the greatest common divisor of 10 and 15 is 5. Let's use Euclid's method to verify this: - :math:`gcd(10, 15) = gcd(15, 10 \bmod 15) = gcd(15, 10)` - :math:`gcd(15, 10) = gcd(10, 15 \bmod 10) = gcd(10, 5)` - :math:`gcd(10, 5) = gcd(5, 10 \bmod 5) = gcd(5, 0)` - :math:`gcd(5, 0) = 5` Notice that in the example above, the first number (10) was smaller than the second (15), and the first transformation just swapped the numbers, so the larger number was first. Thereafter the first number is always larger. GCD "Brute Force" Method ------------------------ Now that we've gotten the preliminaries out of the way and have a basic mathematical explanation for how to calculate the greatest common divisor, we'll take a look at how to translate this into code using the machinery of while loops that you've recently learned. The way GCD is formulated above is, indeed, the most clever way to calculate the greatest common divisor. Yet the way we learn about the greatest common divisor in elementary school (at least at first) is to learn how to factor the numbers a and b, often in a brute force way. So for example, when calculating the greatest common divisor of 10 and 15, we can immediately see it, because we know that both of these numbers are divisible by 5 (e.g. 5 * 2 = 10 and 5 * 3 = 15). So the greatest common divisor is 5. But if we had something more tricky to do like 810 and 729, we might have to think a bit more. Before we learn to find the factors of numbers, we will often just "try" numbers until we get the greatest common divisor. This sort of trial process can take place in a loop, where we start at 1 and end at min(a, b). Why the minimum? We know that none of the values after the minimum can divide both a and b (in integer division), because no larger number can divide a smaller positive number. The smaller number would be the (non-zero) remainder. Now take a look at a basic version of GCD: .. literalinclude:: ../../examples/introcs/g_c_d_basic/g_c_d_basic.cs :start-after: chunk-gcd-begin :end-before: chunk-gcd-end :linenos: :dedent: 6 This code works as follows: - We begin by finding ``Math.Min(a, b)``. This is how to compute the minimum of any two values in C#. Technically, we don't need to use the minimum of a and b, but there is no point in doing any more work than necessary. - We'll use the variable ``i`` as the loop index, starting at 1. - The variable ``gcd`` will hold the largest currently known common divisor. We start with 1, which divides any integer, and we will look for a higher value that also divides a and b. - The line ``while (i <= n)`` is used to indicate that we are iterating the values of ``i`` until the minimum of ``a`` and ``b`` (computed earlier) is reached. - The line ``if (a % i == 0 && b % i == 0)`` is used to check whether we have found a new value that replaces our previous *candidate* for the GCD. A value can only be a candidate for the GCD if it divides a and b without a remainder. The modulus operator ``%`` is our way of determining whether there is a remainder from the division operation ``a / i`` or ``b / i``. - The line ``i++`` is our way of going to the next value of ``i`` to be tested as the new GCD. - When this loop terminates, the greatest common divisor has been found. So this gives you a relatively straightforward way of calculating the greatest common divisor. While simple, it is not necessarily the most efficient way of determining the GCD. If you think about what is going on, this loop could run a significant number of times. For example, if you were calculating the GCD two very large numbers, say, one billion (1,000,000,000) and two billion (2,000,000,000) it is painfully evident that you would consider a large number of values (a billion, in fact) before obtaining the candidate GCD, which we know is 1,000,000,000. Brute-Force GCD Exercise ~~~~~~~~~~~~~~~~~~~~~~~~~ The code above goes though all integers 2 through ``min(a, b)``. That is not generally necessary when the GCD is greater than 1, even with a brute-force mindset. Write a ``g_c_d_basic_faster.cs`` to do this with a slightly different ``GreatestCommonDivisor`` function. [#gcdbrute]_ GCD Subtraction Method ---------------------- The subtraction method (also attributable to Euclid) to compute the Greatest Common Divisor works as follows: - Based on the *mathematical* definition in the previous section, the greatest common divisor algorithm saves a step when we already have ``a`` and ``b`` in the *right order*. - The *right order* means that :math:`a > b`. As we noted earlier, the cleverness of the *mathematical* definition is that ``a`` and ``b`` are swapped as the first step to ensure that :math:`a > b`, after which we can repeatedly divide to get the GCD. - Division, of course, is a form of repetitive subtraction, so the way to divide by ``b`` is to repeatedly subtract it (from a) until ``a`` is no longer greater than ``b``. - The subtraction method basically makes no attempt to put ``a`` and ``b`` in the right order. Instead, we just write similar loops to allow for the possibility of either order. - A simple check must be performed to ensure that the approach of repeated subtraction actually resulted in the GCD. This will happen if ``a`` and ``b`` bump into one another, thereby meaning that we have computed the GCD. .. literalinclude:: ../../examples/introcs/g_c_d_subtraction_method/g_c_d_subtraction_method.cs :start-after: chunk-gcd-begin :end-before: chunk-gcd-end :linenos: :dedent: 6 A look at the source code more or less follows the above explanation. Let's start by looking at the inner loop at line 5, ``while (a > b)``. In this loop, we are repeatedly subtracting ``b`` from ``a``, which we know we can do, because ``a`` started out as being larger than ``b``. At the end of loop ``a`` is reduced to either #. ``b``, in which case ``b`` exactly divided the earlier ``a``, and ``b`` is the GCD, or #. a number less than ``b``, namely :math:`a \bmod b` (or in C# terms ``a % b``), and the process continues.... The loop on line 9 is similar to the loop in line 5. For the same reasons as we already explained, ``b`` ends up equal to ``a``, which is the GCD, or ``b`` ends up as :math:`b \bmod a`. As discussed above, if ``a`` and ``b`` end up as the same number, that is the GCD. On the other hand, the first GCD algorithm example showed how remainders may need to be to be calculated over and over. The outer loop in this version keeps this up until ``a`` and ``b`` are reduced to equal values. At this point the inner loops would make no further changes, and the common value is the GCD. As an exercise to the reader, you may want to consider adding some ``Console.WriteLine()`` statements to print the values of ``a`` and ``b`` within each loop, and after both loops have executed. It will allow you to see in visual terms how this method does its work. .. index:: greatest common divisor; iterative .. _gcd-remainder-loop: GCD Remainder Loop -------------------- There are several ways to code the shorter Euclidean algorithm at the beginning of this GCD section. It is a repetitive pattern, and a loop can be used. There are two parameters, ``a`` and ``b``, to the gcd, and they can be successively changed, suggesting a loop. What is the continuation condition? You stop when ``b`` is 0, so you continue while ``b != 0``. The parameters ``a`` and ``b`` need to be replaced by ``b`` and ``a % b``. One extra variable needs to be introduced to make this double change work. The simplest is to introduce a variable ``r`` for the remainder. Check and see for yourself that you need an extra variable like ``r``:: /// Return the greatest comon divisor of nonnegative numbers, /// not both 0. public static int GreatestCommonDivisor (int a, int b) { while (b != 0) { int r = a % b; a = b; b = r; } return a; } More verbose demonstration code, that prints the progress each time through loop is in :repsrc:`g_c_d_remainder_loop/g_c_d_remainder_loop.cs`. .. index:: double: recursion; greatest common divisor .. _gcd-recursive: Preview: Recursive GCD ---------------------- The first statement of Euclid's algorithm said (in C#) when :: gcd(a, b) = gcd(b, a % b) It is saying the result of the function with one set of parameters is equal to calling the function with another set of parameters. If we put this into a C# function definition, it would mean the instructions for the function say to *call itself*. This is a broadly useful technique called *recursion*, where a function calls *itself* inside its definition. We don't expect you to master this technique immediately but do feel that it is important you at least *hear* about it and see its tremendous power:: /// Return the greatest comon divisor of nonnegative numbers, /// not both 0. public static int GreatestCommonDivisor (int a, int b) { if (b == 0) { // base return a; // case } else { return GreatestCommonDivisor (b, a % b); // recursion } } - Recalling our earlier definition, the case :math:`gcd(a, 0) = a` is handled directly by lines commented as "base case". - And the case :math:`gcd(a, b) = gcd(b, a \bmod b)` is handled by line with comment "recursion", with the function calling itself. In :repsrc:`g_c_d_euclid_recursive/g_c_d_euclid_recursive.cs` is a wordier demonstration version that prints to the screen the progress at each recursive call. The recursive version of the ``gcd`` function *refers to itself* by *calling* itself. Though this seems circular, you can see from the examples that it works very well. The important point is that the calls to the same function are not completely the same: *Successive* calls have *smaller* second numbers, and the second number eventually reaches 0, and in that case there is a direct final answer. Hence the function is not really circular. This recursive version is a much more direct translation of the original mathematical algorithm than the looping version! The general idea of recursion is for a function to call itself with *simpler* parameters, until a simple enough place is reached, where the answer can be directly calculated. .. [#gcdbrute] The original brute-force gcd approach always goes through all the integers between 1 and ``min(a, b)``. There is a way to stop the first time the real gcd is reached. How can you arrange that?