# Methods of computing square roots

(Redirected from Babylonian method)
The sequence we get by computing the square root of two with the Babylonian method with different starting points

In numerical analysis, a branch of mathematics, there are several square root algorithms or methods of computing the principal square root of a non-negative real number. For the square roots of a negative or complex number, see below.

Finding ${\displaystyle {\sqrt {S}}}$ is the same as solving the equation ${\displaystyle f(x)=x^{2}-S=0\,\!}$ for a positive ${\displaystyle x}$. Therefore, any general numerical root-finding algorithm can be used. Newton's method, for example, reduces in this case to the so-called Babylonian method:

${\displaystyle x_{n+1}=x_{n}-{\frac {f(x_{n})}{f'(x_{n})}}=x_{n}-{\frac {x_{n}^{2}-S}{2x_{n}}}={\frac {1}{2}}\left(x_{n}+{\frac {S}{x_{n}}}\right)}$

These methods generally yield approximate results, but can be made arbitrarily precise by increasing the number of calculation steps.

## Rough estimation

Many square root algorithms require an initial seed value. If the initial seed value is far away from the actual square root, the algorithm will be slowed down. It is therefore useful to have a rough estimate, which may be very inaccurate but easy to calculate. With ${\displaystyle S}$  expressed in scientific notation as ${\displaystyle a\times 10^{2n}}$  where ${\displaystyle 1\leq a<100}$  and n is an integer, the square root ${\displaystyle {\sqrt {S}}={\sqrt {a}}\times 10^{n}}$  can be estimated as

${\displaystyle {\sqrt {S}}\approx {\begin{cases}2\cdot 10^{n}&{\text{if }}a<10,\\6\cdot 10^{n}&{\text{if }}a\geq 10.\end{cases}}}$

The factors two and six are used because they approximate the geometric means of the lowest and highest possible values with the given number of digits: ${\displaystyle {\sqrt {{\sqrt {1}}\cdot {\sqrt {10}}}}={\sqrt[{4}]{10}}\approx 2\,}$  and ${\displaystyle {\sqrt {{\sqrt {10}}\cdot {\sqrt {100}}}}={\sqrt[{4}]{1000}}\approx 6\,}$ .

For ${\displaystyle S=125348=12.5348\times 10^{4}}$ , the estimate is ${\displaystyle {\sqrt {S}}\approx 6\cdot 10^{2}=600}$ .

When working in the binary numeral system (as computers do internally), by expressing ${\displaystyle S}$  as ${\displaystyle a\times 2^{2n}}$  where ${\displaystyle 0.1_{2}\leq a<10_{2}}$ , the square root ${\displaystyle {\sqrt {S}}={\sqrt {a}}\times 2^{n}}$  can be estimated as ${\displaystyle {\sqrt {S}}\approx 2^{n}}$ , since the geometric mean of the lowest and highest possible values is ${\displaystyle {\sqrt {{\sqrt {0.1_{2}}}\cdot {\sqrt {10_{2}}}}}={\sqrt[{4}]{1}}=1}$ .

For ${\displaystyle S=125348=1\;1110\;1001\;1010\;0100_{2}=1.1110\;1001\;1010\;0100_{2}\times 2^{16}\,}$  the binary approximation gives ${\displaystyle {\sqrt {S}}\approx 2^{8}=1\;0000\;0000_{2}=256\,.}$

These approximations are useful to find better seeds for iterative algorithms, which results in faster convergence.

## Babylonian method

Graph charting the use of the Babylonian method for approximating a square root of 100 (±10) using starting values x0 = 50, x0 = 1, and x0 = −5. Note that a positive starting value yields the positive root, and a negative starting value the negative root.

Perhaps the first algorithm used for approximating ${\displaystyle {\sqrt {S}}}$  is known as the Babylonian method, despite there being no direct evidence, beyond informed conjecture, that the eponymous Babylonian mathematicians employed exactly this method.[1] The method is also known as Heron's method, after the first-century Greek mathematician Hero of Alexandria who gave the first explicit description of the method in his AD 60 work Metrica.[2] The basic idea is that if x is an overestimate to the square root of a non-negative real number S then S/x will be an underestimate, or vice versa, and so the average of these two numbers may reasonably be expected to provide a better approximation (though the formal proof of that assertion depends on the inequality of arithmetic and geometric means that shows this average is always an overestimate of the square root, as noted in the article on square roots, thus assuring convergence).

More precisely, if x is our initial guess of ${\displaystyle {\sqrt {S}}}$  and e is the error in our estimate such that S = (x+ e)2, then we can expand the binomial and solve for

${\displaystyle e={\frac {S-x^{2}}{2x+e}}\approx {\frac {S-x^{2}}{2x}},}$  since ${\displaystyle e\ll x}$ .

Therefore, we can compensate for the error and update our old estimate as

${\displaystyle x+e\approx x+{\frac {S-x^{2}}{2x}}={\frac {S+x^{2}}{2x}}={\frac {{\frac {S}{x}}+x}{2}}\equiv x_{\text{revised}}}$

Since the computed error was not exact, this becomes our next best guess. The process of updating is iterated until desired accuracy is obtained. This is a quadratically convergent algorithm, which means that the number of correct digits of the approximation roughly doubles with each iteration. It proceeds as follows:

1. Begin with an arbitrary positive starting value x0 (the closer to the actual square root of S, the better).
2. Let xn + 1 be the average of xn and S/xn (using the arithmetic mean to approximate the geometric mean).
3. Repeat step 2 until the desired accuracy is achieved.

It can also be represented as:

${\displaystyle x_{0}\approx {\sqrt {S}},}$
${\displaystyle x_{n+1}={\frac {1}{2}}\left(x_{n}+{\frac {S}{x_{n}}}\right),}$
${\displaystyle {\sqrt {S}}=\lim _{n\to \infty }x_{n}.}$

This algorithm works equally well in the p-adic numbers, but cannot be used to identify real square roots with p-adic square roots; one can, for example, construct a sequence of rational numbers by this method that converges to +3 in the reals, but to −3 in the 2-adics.

It can be derived from Newton's method (which it predates by 16 centuries):

Writing ${\displaystyle f(x)=x^{2}-S}$ ,
${\displaystyle x_{n+1}=x_{n}-{\dfrac {f(x_{n})}{f'(x_{n})}}=x_{n}-{\dfrac {x_{n}^{2}-S}{2x_{n}}}={\dfrac {1}{2}}{\Bigl (}2x_{n}-{\bigl (}x_{n}-{\dfrac {S}{x_{n}}}{\bigr )}{\Bigr )}={\dfrac {1}{2}}{\Bigl (}x_{n}+{\dfrac {S}{x_{n}}}{\Bigr )}_{.}}$

### Example

To calculate S, where S = 125348, to six significant figures, use the rough estimation method above to get

{\displaystyle {\begin{aligned}{\begin{array}{rlll}x_{0}&=6\cdot 10^{2}&&=600.000\\[0.3em]x_{1}&={\frac {1}{2}}\left(x_{0}+{\frac {S}{x_{0}}}\right)&={\frac {1}{2}}\left(600.000+{\frac {125348}{600.000}}\right)&=404.457\\[0.3em]x_{2}&={\frac {1}{2}}\left(x_{1}+{\frac {S}{x_{1}}}\right)&={\frac {1}{2}}\left(404.457+{\frac {125348}{404.457}}\right)&=357.187\\[0.3em]x_{3}&={\frac {1}{2}}\left(x_{2}+{\frac {S}{x_{2}}}\right)&={\frac {1}{2}}\left(357.187+{\frac {125348}{357.187}}\right)&=354.059\\[0.3em]x_{4}&={\frac {1}{2}}\left(x_{3}+{\frac {S}{x_{3}}}\right)&={\frac {1}{2}}\left(354.059+{\frac {125348}{354.059}}\right)&=354.045\\[0.3em]x_{5}&={\frac {1}{2}}\left(x_{4}+{\frac {S}{x_{4}}}\right)&={\frac {1}{2}}\left(354.045+{\frac {125348}{354.045}}\right)&=354.045\end{array}}\end{aligned}}}

Therefore, 125348 ≈ 354.045.

### Convergence

Suppose that x0 > 0 and S > 0. Then for any natural number n, xn > 0. Let the relative error in xn be defined by

${\displaystyle \varepsilon _{n}={\frac {x_{n}}{\sqrt {S}}}-1>-1}$

and thus

${\displaystyle x_{n}={\sqrt {S}}\cdot (1+\varepsilon _{n}).}$

Then it can be shown that

${\displaystyle \varepsilon _{n+1}={\frac {\varepsilon _{n}^{2}}{2(1+\varepsilon _{n})}}\geq 0.}$

And thus that

${\displaystyle \varepsilon _{n+2}\leq \min \left\{{\frac {\varepsilon _{n+1}^{2}}{2}},{\frac {\varepsilon _{n+1}}{2}}\right\}}$

and consequently that convergence is assured, and quadratic.

#### Worst case for convergence

If using the rough estimate above with the Babylonian method, then the least accurate cases in ascending order are as follows:

{\displaystyle {\begin{aligned}S&=1;&x_{0}&=2;&x_{1}&=1.250;&\varepsilon _{1}&=0.250.\\S&=10;&x_{0}&=2;&x_{1}&=3.500;&\varepsilon _{1}&<0.107.\\S&=10;&x_{0}&=6;&x_{1}&=3.833;&\varepsilon _{1}&<0.213.\\S&=100;&x_{0}&=6;&x_{1}&=11.333;&\varepsilon _{1}&<0.134.\end{aligned}}}

Thus in any case,

${\displaystyle \varepsilon _{1}\leq 2^{-2}.\,}$
${\displaystyle \varepsilon _{2}<2^{-5}<10^{-1}.\,}$
${\displaystyle \varepsilon _{3}<2^{-11}<10^{-3}.\,}$
${\displaystyle \varepsilon _{4}<2^{-23}<10^{-6}.\,}$
${\displaystyle \varepsilon _{5}<2^{-47}<10^{-14}.\,}$
${\displaystyle \varepsilon _{6}<2^{-95}<10^{-28}.\,}$
${\displaystyle \varepsilon _{7}<2^{-191}<10^{-57}.\,}$
${\displaystyle \varepsilon _{8}<2^{-383}<10^{-115}.\,}$

Rounding errors will slow the convergence. It is recommended to keep at least one extra digit beyond the desired accuracy of the xn being calculated to minimize round off error.

#### Scaling Heron’s algorithm

Efficiency of Heron algorithm can be enhanced by prescaling.

If ${\displaystyle S\in [0.5,2.0]}$  then, using a starting guess ${\displaystyle x_{0}=1.0}$ , Heron algorithm proceeds with quadratic convergence and 5 iterations of the algorithm are sufficient to compute ${\displaystyle {\sqrt {S}}}$  with 16 digit accuracy.

Prescaling consists in finding an integer number k such that ${\displaystyle S=2^{2k}{\tilde {S}}}$ , where ${\displaystyle {\tilde {S}}\in [0.5,2.0]}$ , then solve the intermediate problem ${\displaystyle z={\sqrt {\tilde {S}}}}$  by performing a fixed number of iterations of the Heron method starting from ${\displaystyle z_{0}=1.0}$ , finally obtain the solution ${\displaystyle x=2^{k}z}$ .

This technique is particularly effective when working with floating point numbers in binary format (e.g. IEEE-754), as the determination of k and ${\displaystyle {\tilde {S}}}$  is trivial. In fact, these numbers are represented in the format ${\displaystyle 1.xxx\dots 2^{p}}$  where mantissa and exponent are immediately available. Then if p is even, then ${\displaystyle {\tilde {S}}=1.xxx\dots }$  and ${\displaystyle k=p/2}$ , otherwise ${\displaystyle {\tilde {S}}=1.xxx\dots /2.0}$  and ${\displaystyle k=(p+1)/2}$ . Note that there is no floating point mathematical operation involved here, as multiplication and division by powers of 2 can be done with trivial integer operations on the exponent. Similarly, no floating point operation is required when computing the final solution ${\displaystyle x=2^{k}z}$ .

## Bakhshali method

This method for finding an approximation to a square root was described in an ancient Indian mathematical manuscript called the Bakhshali manuscript. It is equivalent to two iterations of the Babylonian method beginning with x0. Thus, the algorithm is quartically convergent, which means that the number of correct digits of the approximation roughly quadruples with each iteration.[3] The original presentation, using modern notation, is as follows: To calculate ${\displaystyle {\sqrt {S}}}$ , let x02 be the initial approximation to S. Then, successively iterate as:

${\displaystyle a_{n}={\frac {S-x_{n}^{2}}{2x_{n}}},}$
${\displaystyle b_{n}=x_{n}+a_{n},}$
${\displaystyle x_{n+1}=b_{n}-{\frac {a_{n}^{2}}{2b_{n}}}.}$

Written explicitly, it becomes

${\displaystyle x_{n+1}=(x_{n}+a_{n})-{\frac {a_{n}^{2}}{2(x_{n}+a_{n})}}.}$

Let x0 = N be an integer which is the nearest perfect square to S. Also, let the difference d = S - N2, then the first iteration can be written as:

${\displaystyle {\sqrt {S}}\approx N+{\frac {d}{2N}}-{\frac {d^{2}}{8N^{3}+4Nd}}={\frac {8N^{4}+8N^{2}d+d^{2}}{8N^{3}+4Nd}}={\frac {N^{4}+6N^{2}S+S^{2}}{4N^{3}+4NS}}={\frac {N^{2}(N^{2}+6S)+S^{2}}{4N(N^{2}+S)}}.}$

This gives a rational approximation to the square root.

### Example

Using the same example as given in Babylonian method, let ${\displaystyle S=125348.}$  Then, the first iterations gives

${\displaystyle x_{0}=600}$
${\displaystyle a_{1}={\frac {125348-600^{2}}{2\times 600}}=-195.543}$
${\displaystyle b_{1}=600+(-195.543)=404.456}$
${\displaystyle x_{1}=404.456-{\frac {(-195.543)^{2}}{2\times 404.456}}=357.186}$

Likewise the second iteration gives

${\displaystyle a_{2}={\frac {125348-357.186^{2}}{2\times 357.186}}=-3.126}$
${\displaystyle b_{2}=357.186+(-3.126)=354.06}$
${\displaystyle x_{2}=354.06-{\frac {(-3.1269)^{2}}{2\times 354.06}}=354.046}$

## Digit-by-digit calculation

This is a method to find each digit of the square root in a sequence. It is slower than the Babylonian method, but it has several advantages:

• It can be easier for manual calculations.
• Every digit of the root found is known to be correct, i.e., it does not have to be changed later.
• If the square root has an expansion that terminates, the algorithm terminates after the last digit is found. Thus, it can be used to check whether a given integer is a square number.
• The algorithm works for any base, and naturally, the way it proceeds depends on the base chosen.

Napier's bones include an aid for the execution of this algorithm. The shifting nth root algorithm is a generalization of this method.

### Basic principle

First, consider the case of finding the square root of a number Z, that is the square of a two-digit number XY, where X is the tens digit and Y is the units digit. Specifically:

Z = (10X + Y)2 = 100X2 + 20XY + Y2

Now using the Digit-by-Digit algorithm, we first determine the value of X. X is the largest digit such that X2 is less or equal to Z from which we removed the two rightmost digits.

In the next iteration, we pair the digits, multiply X by 2, and place it in the tenth's place while we try to figure out what the value of Y is.

Since this is a simple case where the answer is a perfect square root XY, the algorithm stops here.

The same idea can be extended to any arbitrary square root computation next. Suppose we are able to find the square root of N by expressing it as a sum of n positive numbers such that

${\displaystyle N=(a_{1}+a_{2}+a_{3}+\dotsb +a_{n})^{2}.}$

By repeatedly applying the basic identity

${\displaystyle (x+y)^{2}=x^{2}+2xy+y^{2},}$

the right-hand-side term can be expanded as

{\displaystyle {\begin{aligned}&(a_{1}+a_{2}+a_{3}+\dotsb +a_{n})^{2}\\=&\,a_{1}^{2}+2a_{1}a_{2}+a_{2}^{2}+2(a_{1}+a_{2})a_{3}+a_{3}^{2}+\dotsb +a_{n-1}^{2}+2\left(\sum _{i=1}^{n-1}a_{i}\right)a_{n}+a_{n}^{2}\\=&\,a_{1}^{2}+[2a_{1}+a_{2}]a_{2}+[2(a_{1}+a_{2})+a_{3}]a_{3}+\dotsb +\left[2\left(\sum _{i=1}^{n-1}a_{i}\right)+a_{n}\right]a_{n}.\end{aligned}}}

This expression allows us to find the square root by sequentially guessing the values of ${\displaystyle a_{i}}$ s. Suppose that the numbers ${\displaystyle a_{1},\ldots ,a_{m-1}}$  have already been guessed, then the m-th term of the right-hand-side of above summation is given by ${\displaystyle Y_{m}=[2P_{m-1}+a_{m}]a_{m},}$  where ${\displaystyle P_{m-1}=\sum _{i=1}^{m-1}a_{i}}$  is the approximate square root found so far. Now each new guess ${\displaystyle a_{m}}$  should satisfy the recursion

${\displaystyle X_{m}=X_{m-1}-Y_{m},}$

such that ${\displaystyle X_{m}\geq 0}$  for all ${\displaystyle 1\leq m\leq n,}$  with initialization ${\displaystyle X_{0}=N.}$  When ${\displaystyle X_{n}=0,}$  the exact square root has been found; if not, then the sum of ${\displaystyle a_{i}}$ s gives a suitable approximation of the square root, with ${\displaystyle X_{n}}$  being the approximation error.

For example, in the decimal number system we have

${\displaystyle N=(a_{1}\cdot 10^{n-1}+a_{2}\cdot 10^{n-2}+\cdots +a_{n-1}\cdot 10+a_{n})^{2},}$

where ${\displaystyle 10^{n-i}}$  are place holders and the coefficients ${\displaystyle a_{i}\in \{0,1,2,\ldots ,9\}}$ . At any m-th stage of the square root calculation, the approximate root found so far, ${\displaystyle P_{m-1}}$  and the summation term ${\displaystyle Y_{m}}$  are given by

${\displaystyle P_{m-1}=\sum _{i=1}^{m-1}a_{i}\cdot 10^{n-i}=10^{n-m+1}\sum _{i=1}^{m-1}a_{i}\cdot 10^{m-i-1},}$
${\displaystyle Y_{m}=[2P_{m-1}+a_{m}\cdot 10^{n-m}]a_{m}\cdot 10^{n-m}=[20\sum _{i=1}^{m-1}a_{i}\cdot 10^{m-i-1}+a_{m}]a_{m}\cdot 10^{2(n-m)}.}$

Here since the place value of ${\displaystyle Y_{m}}$  is an even power of 10, we only need to work with the pair of most significant digits of the remaining term ${\displaystyle X_{m-1}}$  at any m-th stage. The section below codifies this procedure.

It is obvious that a similar method can be used to compute the square root in number systems other than the decimal number system. For instance, finding the digit-by-digit square root in the binary number system is quite efficient since the value of ${\displaystyle a_{i}}$  is searched from a smaller set of binary digits {0,1}. This makes the computation faster since at each stage the value of ${\displaystyle Y_{m}}$  is either ${\displaystyle Y_{m}=0}$  for ${\displaystyle a_{m}=0}$  or ${\displaystyle Y_{m}=2P_{m-1}+1}$  for ${\displaystyle a_{m}=1}$ . The fact that we have only two possible options for ${\displaystyle a_{m}}$  also makes the process of deciding the value of ${\displaystyle a_{m}}$  at m-th stage of calculation easier. This is because we only need to check if ${\displaystyle Y_{m}\leq X_{m-1}}$  for ${\displaystyle a_{m}=1.}$  If this condition is satisfied, then we take ${\displaystyle a_{m}=1}$ ; if not then ${\displaystyle a_{m}=0.}$  Also, the fact that multiplication by 2 is done by left bit-shifts helps in the computation.

### Decimal (base 10)

Write the original number in decimal form. The numbers are written similar to the long division algorithm, and, as in long division, the root will be written on the line above. Now separate the digits into pairs, starting from the decimal point and going both left and right. The decimal point of the root will be above the decimal point of the square. One digit of the root will appear above each pair of digits of the square.

Beginning with the left-most pair of digits, do the following procedure for each pair:

1. Starting on the left, bring down the most significant (leftmost) pair of digits not yet used (if all the digits have been used, write "00") and write them to the right of the remainder from the previous step (on the first step, there will be no remainder). In other words, multiply the remainder by 100 and add the two digits. This will be the current value c.
2. Find p, y and x, as follows:
• Let p be the part of the root found so far, ignoring any decimal point. (For the first step, p = 0.)
• Determine the greatest digit x such that ${\displaystyle x(20p+x)\leq c}$ . We will use a new variable y = x(20p + x).
• Note: 20p + x is simply twice p, with the digit x appended to the right).
• Note: x can be found by guessing what c/(20·p) is and doing a trial calculation of y, then adjusting x upward or downward as necessary.
• Place the digit ${\displaystyle x}$  as the next digit of the root, i.e., above the two digits of the square you just brought down. Thus the next p will be the old p times 10 plus x.
3. Subtract y from c to form a new remainder.
4. If the remainder is zero and there are no more digits to bring down, then the algorithm has terminated. Otherwise go back to step 1 for another iteration.

#### Examples

Find the square root of 152.2756.

          1  2. 3  4
/
\/  01 52.27 56

01                   1*1 <= 1 < 2*2                 x = 1
01                     y = x*x = 1*1 = 1
00 52                22*2 <= 52 < 23*3              x = 2
00 44                  y = (20+x)*x = 22*2 = 44
08 27             243*3 <= 827 < 244*4           x = 3
07 29               y = (240+x)*x = 243*3 = 729
98 56          2464*4 <= 9856 < 2465*5        x = 4
98 56            y = (2460+x)*x = 2464*4 = 9856
00 00          Algorithm terminates: Answer is 12.34


Find the square root of 2.

          1. 4  1  4  2
/
\/  02.00 00 00 00

02                  1*1 <= 2 < 2*2                 x = 1
01                    y = x*x = 1*1 = 1
01 00               24*4 <= 100 < 25*5             x = 4
00 96                 y = (20+x)*x = 24*4 = 96
04 00            281*1 <= 400 < 282*2           x = 1
02 81              y = (280+x)*x = 281*1 = 281
01 19 00         2824*4 <= 11900 < 2825*5       x = 4
01 12 96           y = (2820+x)*x = 2824*4 = 11296
06 04 00      28282*2 <= 60400 < 28283*3     x = 2
The desired precision is achieved:
The square root of 2 is about 1.4142


### Binary numeral system (base 2)

Inherent to digit-by-digit algorithms is a search and test step: find a digit, ${\displaystyle \,e}$ , when added to the right of a current solution ${\displaystyle \,r}$ , such that ${\displaystyle \,(r+e)\cdot (r+e)\leq x}$ , where ${\displaystyle \,x}$  is the value for which a root is desired. Expanding: ${\displaystyle \,r\cdot r+2re+e\cdot e\leq x}$ . The current value of ${\displaystyle \,r\cdot r}$ —or, usually, the remainder—can be incrementally updated efficiently when working in binary, as the value of ${\displaystyle \,e}$  will have a single bit set (a power of 2), and the operations needed to compute ${\displaystyle \,2\cdot r\cdot e}$  and ${\displaystyle \,e\cdot e}$  can be replaced with faster bit shift operations.

#### Example

Here we obtain the square root of 81, which when converted into binary gives 1010001. The numbers in the left column gives the option between that number or zero to be used for subtraction at that stage of computation. The final answer is 1001, which in decimal is 9.

             1 0 0 1
---------
√ 1010001

1      1
1
---------
101     01
0
--------
1001     100
0
--------
10001    10001
10001
-------
0


This gives rise to simple computer implementations:[4]

short isqrt(short num) {
short res = 0;
short bit = 1 << 14; // The second-to-top bit is set: 1 << 30 for 32 bits

// "bit" starts at the highest power of four <= the argument.
while (bit > num)
bit >>= 2;

while (bit != 0) {
if (num >= res + bit) {
num -= res + bit;
res = (res >> 1) + bit;
}
else
res >>= 1;
bit >>= 2;
}
return res;
}


Using the notation above, the variable "bit" corresponds to ${\displaystyle e_{m}^{2}}$  which is ${\displaystyle (2^{m})^{2}=4^{m}}$ , the variable "res" is equal to ${\displaystyle 2re_{m}}$ , and the variable "num" is equal to the current ${\displaystyle X_{m}}$  which is the difference of the number we want the square root of and the square of our current approximation with all bits set up to ${\displaystyle 2^{m+1}}$ . Thus in the first loop, we want to find the highest power of 4 in "bit" to find the highest power of 2 in ${\displaystyle e}$ . In the second loop, if num is greater than res + bit, then ${\displaystyle X_{m}}$  is greater than ${\displaystyle 2re_{m}+e_{m}^{2}}$  and we can subtract it. The next line, we want to add ${\displaystyle e_{m}}$  to ${\displaystyle r}$  which means we want to add ${\displaystyle 2e_{m}^{2}}$  to ${\displaystyle 2re_{m}}$  so we want res = res + bit<<1. Then update ${\displaystyle e_{m}}$  to ${\displaystyle e_{m-1}}$  inside res which involves dividing by 2 or another shift to the right. Combining these 2 into one line leads to res = res>>1 + bit. If ${\displaystyle X_{m}}$  isn't greater than ${\displaystyle 2re_{m}+e_{m}^{2}}$  then we just update ${\displaystyle e_{m}}$  to ${\displaystyle e_{m-1}}$  inside res and divide it by 2. Then we update ${\displaystyle e_{m}}$  to ${\displaystyle e_{m-1}}$  in bit by dividing it by 4. The final iteration of the 2nd loop has bit equal to 1 and will cause update of ${\displaystyle e}$  to run one extra time removing the factor of 2 from res making it our integer approximation of the root.

Faster algorithms, in binary and decimal or any other base, can be realized by using lookup tables—in effect trading more storage space for reduced run time.[5]

## Exponential identity

Pocket calculators typically implement good routines to compute the exponential function and the natural logarithm, and then compute the square root of S using the identity found using the properties of logarithms (${\displaystyle \ln x^{n}=n\ln x}$ ) and exponentials (${\displaystyle e^{\ln x}=x}$ ):

${\displaystyle {\sqrt {S}}=e^{{\frac {1}{2}}\ln S}.}$

The denominator in the fraction corresponds to the nth root. In the case above the denominator is 2, hence the equation specifies that the square root is to be found. The same identity is used when computing square roots with logarithm tables or slide rules.

## Vedic duplex method for extracting a square root

The Vedic duplex method from the book 'Vedic Mathematics' is a variant of the digit-by-digit method for calculating the square root.[6] The duplex is the square of the central digit plus double the cross-product of digits equidistant from the center. The duplex is computed from the quotient digits (square root digits) computed thus far, but after the initial digits. The duplex is subtracted from the dividend digit prior to the second subtraction for the product of the quotient digit times the divisor digit. For perfect squares the duplex and the dividend will get smaller and reach zero after a few steps. For non-perfect squares the decimal value of the square root can be calculated to any precision desired. However, as the decimal places proliferate, the duplex adjustment gets larger and longer to calculate. The duplex method follows the Vedic ideal for an algorithm, one-line, mental calculation. It is flexible in choosing the first digit group and the divisor. Small divisors are to be avoided by starting with a larger initial group.

### Basic principle

We proceed as with the digit-by-digit calculation by assuming that we want to express a number N as a square of the sum of n positive numbers as

${\displaystyle N=(a_{0}+a_{1}+\cdots +a_{n-1})^{2}}$
${\displaystyle =a_{0}^{2}+2a_{0}\sum _{i=1}^{n-1}a_{i}+a_{1}^{2}+2a_{1}\sum _{i=2}^{n-1}a_{i}+\cdots +a_{n-1}^{2}.}$

Define divisor as ${\displaystyle q=2a_{0}}$  and the duplex for a sequence of m numbers as

${\displaystyle d_{m}={\begin{cases}a_{\lceil m/2\rceil }^{2}+\sum _{i=1}^{\lfloor m/2\rfloor }2a_{i}a_{m-i+1}&{\text{for }}m{\text{ odd}}\\\sum _{i=1}^{m/2}2a_{i}a_{m-i+1}&{\text{for }}m{\text{ even}}.\\\end{cases}}}$

Thus, we can re-express the above identity in terms of the divisor and the duplexes as

${\displaystyle N-a_{0}^{2}=\sum _{i=1}^{n-1}(qa_{i}+d_{i}).}$

Now the computation can proceed by recursively guessing the values of ${\displaystyle a_{m}}$  so that

${\displaystyle X_{m}=X_{m-1}-qa_{m}-d_{m},}$

such that ${\displaystyle X_{m}\geq 0}$  for all ${\displaystyle 1\leq m\leq n-1}$ , with initialization ${\displaystyle X_{0}=N-a_{0}^{2}.}$  When ${\displaystyle X_{m}=0}$  the algorithm terminates and the sum of ${\displaystyle a_{i}}$ s give the square root. The method is more similar to long division where ${\displaystyle X_{m-1}}$  is the dividend and ${\displaystyle X_{m}}$  is the remainder.

For the case of decimal numbers, if

${\displaystyle N=(a_{0}\cdot 10^{n-1}+a_{1}\cdot 10^{n-2}+\cdots +a_{n-2}\cdot 10+a_{n-1})^{2}}$

where ${\displaystyle a_{i}\in \{0,1,2,\ldots ,9\}}$ , then the initiation ${\displaystyle X_{0}=N-a_{0}^{2}\cdot 10^{2(n-1)}}$  and the divisor will be ${\displaystyle q=2a_{0}\cdot 10^{n-1}}$ . Also the product at any m-th stage will be ${\displaystyle qa_{m}\cdot 10^{n-m-1}=2a_{0}a_{m}\cdot 10^{2n-m-2}}$  and the duplexes will be ${\displaystyle d_{m}={\tilde {d}}_{m}\cdot 10^{2n-m-3}}$ , where ${\displaystyle {\tilde {d}}_{m}}$  are the duplexes of the sequence ${\displaystyle a_{1},a_{2},\ldots ,a_{m}}$ . At any m-th stage, we see that the place value of the duplex ${\displaystyle {\tilde {d}}_{m}}$  is one less than the product ${\displaystyle 2a_{0}a_{m}}$ . Thus, in actual calculations it is customary to subtract the duplex value of the m-th stage at (m+1)-th stage. Also, unlike the previous digit-by-digit square root calculation, where at any given m-th stage, the calculation is done by taking the most significant pair of digits of the remaining term ${\displaystyle X_{m-1}}$ , the duplex method uses only a single most significant digit of ${\displaystyle X_{m-1}}$ .

In other words, to calculate the duplex of a number, double the product of each pair of equidistant digits plus the square of the center digit (of the digits to the right of the colon).

Number Calculation = Duplex
3 32 = 9
142(1·4) = 8
574 2(5·4) + 72 = 89
1,421 2(1·1) + 2(4·2) = 2 + 16 = 18
10,523 2(1·3) + 2(0·2) + 52 = 6+0+25 = 31
406,739 2(4·9)+ 2(0·3)+ 2(6·7) = 72+0+84 = 156

In a square root calculation the quotient digit set increases incrementally for each step.

### Example

Consider the perfect square 2809 = 532. Use the duplex method to find the square root of 2,809.

• Set down the number in groups of two digits.
• Define a divisor, a dividend and a quotient to find the root.
• Given 2809. Consider the first group, 28.
• Find the nearest perfect square below that group.
• The root of that perfect square is the first digit of our root.
• Since 28 > 25 and 25 = 52, take 5 as the first digit in the square root.
• For the divisor take double this first digit (2 · 5), which is 10.
• Next, set up a division framework with a colon.
• 28: 0 9 is the dividend and 5: is the quotient. (Note: the quotient should always be a single digit number, and it should be such that the dividend in the next stage is non-negative.)
• Put a colon to the right of 28 and 5 and keep the colons lined up vertically. The duplex is calculated only on quotient digits to the right of the colon.
• Calculate the remainder. 28: minus 25: is 3:.
• Append the remainder on the left of the next digit to get the new dividend.
• Here, append 3 to the next dividend digit 0, which makes the new dividend 30. The divisor 10 goes into 30 just 3 times. (No reserve needed here for subsequent deductions.)
• Repeat the operation.
• The zero remainder appended to 9. Nine is the next dividend.
• This provides a digit to the right of the colon so deduct the duplex, 32 = 9.
• Subtracting this duplex from the dividend 9, a zero remainder results.
• Ten into zero is zero. The next root digit is zero. The next duplex is 2(3·0) = 0.
• The dividend is zero. This is an exact square root, 53.
Find the square root of 2809.
Set down the number in groups of two digits.
The number of groups gives the number of whole digits in the root.
Put a colon after the first group, 28, to separate it.
From the first group, 28, obtain the divisor, 10, since
28>25=52 and by doubling this first root, 2x5=10.
Gross dividend:     28:  0  9. Using mental math:
Divisor: 10)     3  0   Square: 10)  28:  30  9
Duplex, Deduction:     25: xx 09  Square root:  5:   3. 0
Dividend:         30 00
Remainder:      3: 00 00
Square Root, Quotient:      5:  3. 0


## A two-variable iterative method

This method is applicable for finding the square root of ${\displaystyle 0  and converges best for ${\displaystyle S\approx 1}$ . This, however, is no real limitation for a computer based calculation, as in base 2 floating point and fixed point representations, it is trivial to multiply ${\displaystyle S\,\!}$  by an integer power of 4, and therefore ${\displaystyle {\sqrt {S}}}$  by the corresponding power of 2, by changing the exponent or by shifting, respectively. Therefore, ${\displaystyle S\,\!}$  can be moved to the range ${\displaystyle {\frac {1}{2}}\leq S<2}$ . Moreover, the following method does not employ general divisions, but only additions, subtractions, multiplications, and divisions by powers of two, which are again trivial to implement. A disadvantage of the method is that numerical errors accumulate, in contrast to single variable iterative methods such as the Babylonian one.

The initialization step of this method is

${\displaystyle a_{0}=S\,\!}$
${\displaystyle c_{0}=S-1\,\!}$

${\displaystyle a_{n+1}=a_{n}-a_{n}c_{n}/2\,\!}$
${\displaystyle c_{n+1}=c_{n}^{2}(c_{n}-3)/4\,\!}$

Then, ${\displaystyle a_{n}\rightarrow {\sqrt {S}}}$  (while ${\displaystyle c_{n}\rightarrow 0}$ ).

Note that the convergence of ${\displaystyle c_{n}\,\!}$ , and therefore also of ${\displaystyle a_{n}\,\!}$ , is quadratic.

The proof of the method is rather easy. First, rewrite the iterative definition of ${\displaystyle c_{n}\,\!}$  as

${\displaystyle 1+c_{n+1}=(1+c_{n})(1-c_{n}/2)^{2}\,\!}$ .

Then it is straightforward to prove by induction that

${\displaystyle S(1+c_{n})=a_{n}^{2}}$

and therefore the convergence of ${\displaystyle a_{n}\,\!}$  to the desired result ${\displaystyle {\sqrt {S}}}$  is ensured by the convergence of ${\displaystyle c_{n}\,\!}$  to 0, which in turn follows from ${\displaystyle -1 .

This method was developed around 1950 by M. V. Wilkes, D. J. Wheeler and S. Gill[7] for use on EDSAC, one of the first electronic computers.[8] The method was later generalized, allowing the computation of non-square roots.[9]

## Iterative methods for reciprocal square roots

The following are iterative methods for finding the reciprocal square root of S which is ${\displaystyle 1/{\sqrt {S}}}$ . Once it has been found, find ${\displaystyle {\sqrt {S}}}$  by simple multiplication: ${\displaystyle {\sqrt {S}}=S\cdot (1/{\sqrt {S}})}$ . These iterations involve only multiplication, and not division. They are therefore faster than the Babylonian method. However, they are not stable. If the initial value is not close to the reciprocal square root, the iterations will diverge away from it rather than converge to it. It can therefore be advantageous to perform an iteration of the Babylonian method on a rough estimate before starting to apply these methods.

• Applying Newton's method to the equation ${\displaystyle (1/x^{2})-S=0}$  produces a method that converges quadratically using three multiplications per step:
${\displaystyle x_{n+1}={\frac {x_{n}}{2}}\cdot (3-S\cdot x_{n}^{2}).}$
${\displaystyle y_{n}=S\cdot x_{n}^{2}}$ , and
${\displaystyle x_{n+1}={\frac {x_{n}}{8}}\cdot (15-y_{n}\cdot (10-3\cdot y_{n}))}$ .

### Goldschmidt’s algorithm

Some computers use Goldschmidt's algorithm to simultaneously calculate ${\displaystyle {\sqrt {S}}}$  and ${\displaystyle 1/{\sqrt {S}}}$ . Goldschmidt's algorithm finds ${\displaystyle {\sqrt {S}}}$  faster than Newton-Raphson iteration on a computer with a fused multiply–add instruction and either a pipelined floating point unit or two independent floating-point units.[10]

The first way of writing Goldschmidt's algorithm begins

${\displaystyle b_{0}=S}$
${\displaystyle Y_{0}\approx 1/{\sqrt {S}}}$  (typically using a table lookup)
${\displaystyle y_{0}=Y_{0}}$
${\displaystyle x_{0}=Sy_{0}}$

and iterates

${\displaystyle b_{n+1}=b_{n}Y_{n}^{2}}$
${\displaystyle Y_{n+1}=(3-b_{n+1})/2}$
${\displaystyle x_{n+1}=x_{n}Y_{n+1}}$
${\displaystyle y_{n+1}=y_{n}Y_{n+1}}$

until ${\displaystyle b_{i}}$  is sufficiently close to 1, or a fixed number of iterations. The iterations converge to

${\displaystyle \lim _{n\to \infty }x_{n}={\sqrt {S}}}$ , and
${\displaystyle \lim _{n\to \infty }y_{n}=1/{\sqrt {S}}}$ .

Note that it is possible to omit either ${\displaystyle x_{n}}$  and ${\displaystyle y_{n}}$  from the computation, and if both are desired then ${\displaystyle x_{n}=Sy_{n}}$  may be used at the end rather than computing it through in each iteration.

A second form, using fused multiply-add operations, begins

${\displaystyle y_{0}\approx 1/{\sqrt {S}}}$  (typically using a table lookup)
${\displaystyle x_{0}=Sy_{0}}$
${\displaystyle h_{0}=y_{0}/2}$

and iterates

${\displaystyle r_{n}=0.5-x_{n}h_{n}}$
${\displaystyle x_{n+1}=x_{n}+x_{n}r_{n}}$
${\displaystyle h_{n+1}=h_{n}+h_{n}r_{n}}$

until ${\displaystyle r_{i}}$  is sufficiently close to 0, or a fixed number of iterations. This converges to

${\displaystyle \lim _{n\to \infty }x_{n}={\sqrt {S}}}$ , and
${\displaystyle \lim _{n\to \infty }2h_{n}=1/{\sqrt {S}}}$ .

## Taylor series

If N is an approximation to ${\displaystyle {\sqrt {S}}}$ , a better approximation can be found by using the Taylor series of the square root function:

${\displaystyle {\sqrt {N^{2}+d}}=N\sum _{n=0}^{\infty }{\frac {(-1)^{n}(2n)!}{(1-2n)n!^{2}4^{n}}}{\frac {d^{n}}{N^{2n}}}=N(1+{\frac {d}{2N^{2}}}-{\frac {d^{2}}{8N^{4}}}+{\frac {d^{3}}{16N^{6}}}-{\frac {5d^{4}}{128N^{8}}}+\cdots )}$

As an iterative method, the order of convergence is equal to the number of terms used. With two terms, it is identical to the Babylonian method. With three terms, each iteration takes almost as many operations as the Bakhshali approximation, but converges more slowly.[citation needed] Therefore, this is not a particularly efficient way of calculation. To maximize the rate of convergence, choose N so that ${\displaystyle {\frac {|d|}{N^{2}}}\,}$  is as small as possible.

## CORDIC

A completely different method for computing the square root is based on the CORDIC algorithm, which uses only very simple operations (addition, subtraction, with bitshift and table lookup to implement multiplication). The square root of S may be obtained as the output ${\displaystyle x_{n}}$  using the hyperbolic coordinate system in vectoring mode, with the following initialization:[11]

${\displaystyle x_{0}=S+1}$
${\displaystyle y_{0}=S-1}$
${\displaystyle \omega _{0}=0}$

## Continued fraction expansion

Quadratic irrationals (numbers of the form ${\displaystyle {\frac {a+{\sqrt {b}}}{c}}}$ , where a, b and c are integers), and in particular, square roots of integers, have periodic continued fractions. Sometimes what is desired is finding not the numerical value of a square root, but rather its continued fraction expansion, and hence its rational approximation. Let S be the positive number for which we are required to find the square root. Then assuming a to be a number that serves as an initial guess and r to be the remainder term, we can write ${\displaystyle S=a^{2}+r.}$  Since we have ${\displaystyle S-a^{2}=({\sqrt {S}}+a)({\sqrt {S}}-a)=r}$ , we can express the square root of S as

${\displaystyle {\sqrt {S}}=a+{\frac {r}{a+{\sqrt {S}}}}.}$

By applying this expression for ${\displaystyle {\sqrt {S}}}$  to the denominator term of the fraction, we have

${\displaystyle {\sqrt {S}}=a+{\frac {r}{a+(a+{\frac {r}{a+{\sqrt {S}}}})}}=a+{\frac {r}{2a+{\frac {r}{a+{\sqrt {S}}}}}}.}$

Proceeding this way, we get a generalized continued fraction for the square root as

${\displaystyle {\sqrt {S}}=a+{\frac {r|}{|2a}}+{\frac {r|}{|2a}}+{\frac {r|}{|2a}}+\cdots }$

For any S a possible choice for a and r is a = 1 and r = S - 1, yielding

${\displaystyle {\sqrt {S}}=1+{\frac {S-1|}{|2}}+{\frac {S-1|}{|2}}+{\frac {S-1|}{|2}}+\cdots }$

For example, for the square root of 2, we can take a = 1 and r = 1, giving us

${\displaystyle {\sqrt {2}}=1+{\frac {1|}{|2}}+{\frac {1|}{|2}}+{\frac {1|}{|2}}+\cdots }$

Taking the first three denominators give the rational approximation of 2 as [1;2,2,2] = 17/12 = 1.41667, correct up to first three decimal places. Taking the first five denominators gives the rational approximation to 2 as [1;2,2,2,2,2] = 99/70 = 1.4142857, correct up to first five decimal places. Taking more denominators give better approximations.

As another example, for the square root of 3, we can select a = 2 and r = -1, giving us

${\displaystyle {\sqrt {3}}=2-{\frac {1|}{|4}}-{\frac {1|}{|4}}-{\frac {1|}{|4}}-\cdots }$

The first three denominators gives 3 as 1.73214, correct up to the first four decimal places. Note that it is not necessary to choose an integer valued a. For instance, we can take a = 2 and r = 1, such that

${\displaystyle {\sqrt {3}}={\sqrt {2}}+{\frac {1|}{|2{\sqrt {2}}}}+{\frac {1|}{|2{\sqrt {2}}}}+{\frac {1|}{|2{\sqrt {2}}}}+\cdots }$

We can do the same for the whole numbers as well. For instance,

${\displaystyle 2={\sqrt {4}}=1+{\frac {3|}{|2}}+{\frac {3|}{|2}}+{\frac {3|}{|2}}+\cdots }$

### Algorithm

The following iterative algorithm[12] can be used to obtain the continued fraction expansion in canonical form (S is any natural number that is not a perfect square):

${\displaystyle m_{0}=0\,\!}$
${\displaystyle d_{0}=1\,\!}$
${\displaystyle a_{0}=\left\lfloor {\sqrt {S}}\right\rfloor \,\!}$
${\displaystyle m_{n+1}=d_{n}a_{n}-m_{n}\,\!}$
${\displaystyle d_{n+1}={\frac {S-m_{n+1}^{2}}{d_{n}}}\,\!}$
${\displaystyle a_{n+1}=\left\lfloor {\frac {{\sqrt {S}}+m_{n+1}}{d_{n+1}}}\right\rfloor =\left\lfloor {\frac {a_{0}+m_{n+1}}{d_{n+1}}}\right\rfloor \!.}$

Notice that mn, dn, and an are always integers. The algorithm terminates when this triplet is the same as one encountered before. The algorithm can also terminate on ai when ai = 2 a0,[13] which is easier to implement.

The expansion will repeat from then on. The sequence [a0; a1, a2, a3, ...] is the continued fraction expansion:

${\displaystyle {\sqrt {S}}=a_{0}+{\cfrac {1}{a_{1}+{\cfrac {1}{a_{2}+{\cfrac {1}{a_{3}+\,\ddots }}}}}}}$

### Example, square root of 114 as a continued fraction

Begin with m0 = 0; d0 = 1; and a0 = 10 (102 = 100 and 112 = 121 > 114 so 10 chosen).

{\displaystyle {\begin{aligned}{\sqrt {114}}&={\frac {{\sqrt {114}}+0}{1}}=10+{\frac {{\sqrt {114}}-10}{1}}=10+{\frac {({\sqrt {114}}-10)({\sqrt {114}}+10)}{{\sqrt {114}}+10}}\\&=10+{\frac {114-100}{{\sqrt {114}}+10}}=10+{\frac {1}{\frac {{\sqrt {114}}+10}{14}}}.\end{aligned}}}
${\displaystyle m_{1}=d_{0}\cdot a_{0}-m_{0}=1\cdot 10-0=10\,.}$
${\displaystyle d_{1}={\frac {S-m_{1}^{2}}{d_{0}}}={\frac {114-10^{2}}{1}}=14\,.}$
${\displaystyle a_{1}=\left\lfloor {\frac {a_{0}+m_{1}}{d_{1}}}\right\rfloor =\left\lfloor {\frac {10+10}{14}}\right\rfloor =\left\lfloor {\frac {20}{14}}\right\rfloor =1\,.}$

So, m1 = 10; d1 = 14; and a1 = 1.

${\displaystyle {\frac {{\sqrt {114}}+10}{14}}=1+{\frac {{\sqrt {114}}-4}{14}}=1+{\frac {114-16}{14({\sqrt {114}}+4)}}=1+{\frac {1}{\frac {{\sqrt {114}}+4}{7}}}.}$

Next, m2 = 4; d2 = 7; and a2 = 2.

${\displaystyle {\frac {{\sqrt {114}}+4}{7}}=2+{\frac {{\sqrt {114}}-10}{7}}=2+{\frac {14}{7({\sqrt {114}}+10)}}=2+{\frac {1}{\frac {{\sqrt {114}}+10}{2}}}.}$
${\displaystyle {\frac {{\sqrt {114}}+10}{2}}=10+{\frac {{\sqrt {114}}-10}{2}}=10+{\frac {14}{2({\sqrt {114}}+10)}}=10+{\frac {1}{\frac {{\sqrt {114}}+10}{7}}}.}$
${\displaystyle {\frac {{\sqrt {114}}+10}{7}}=2+{\frac {{\sqrt {114}}-4}{7}}=2+{\frac {98}{7({\sqrt {114}}+4)}}=2+{\frac {1}{\frac {{\sqrt {114}}+4}{14}}}.}$
${\displaystyle {\frac {{\sqrt {114}}+4}{14}}=1+{\frac {{\sqrt {114}}-10}{14}}=1+{\frac {14}{14({\sqrt {114}}+10)}}=1+{\frac {1}{\frac {{\sqrt {114}}+10}{1}}}.}$
${\displaystyle {\frac {{\sqrt {114}}+10}{1}}=20+{\frac {{\sqrt {114}}-10}{1}}=20+{\frac {14}{{\sqrt {114}}+10}}=20+{\frac {1}{\frac {{\sqrt {114}}+10}{14}}}.}$

Now, loop back to the second equation above.

Consequently, the simple continued fraction for the square root of 114 is

${\displaystyle {\sqrt {114}}=[10;1,2,10,2,1,20,1,2,10,2,1,20,1,2,10,2,1,20,\dots ].\,}$  (sequence A010179 in the OEIS)

Its decimal value is approximately 10.67707 82520 31311 21....

### Generalized continued fraction

A more rapid method is to evaluate its generalized continued fraction. From the formula derived there:

${\displaystyle {\sqrt {z}}={\sqrt {x^{2}+y}}=x+{\cfrac {y}{2x+{\cfrac {y}{2x+{\cfrac {y}{2x+\ddots }}}}}}=x+{\cfrac {2x\cdot y}{2(2z-y)-y-{\cfrac {y^{2}}{2(2z-y)-{\cfrac {y^{2}}{2(2z-y)-\ddots }}}}}}}$

and the fact that 114 is 2/3 of the way between 102=100 and 112=121 results in

${\displaystyle {\sqrt {114}}={\cfrac {\sqrt {1026}}{3}}={\cfrac {\sqrt {32^{2}+2}}{3}}={\cfrac {32}{3}}+{\cfrac {2/3}{64+{\cfrac {2}{64+{\cfrac {2}{64+{\cfrac {2}{64+\ddots }}}}}}}}={\cfrac {32}{3}}+{\cfrac {2}{192+{\cfrac {18}{192+{\cfrac {18}{192+\ddots }}}}}},}$

which is simply the aforementioned [10;1,2, 10,2,1, 20,1,2, 10,2,1, 20,1,2, ...] evaluated at every third term. Combining pairs of fractions produces

${\displaystyle {\sqrt {114}}={\cfrac {\sqrt {32^{2}+2}}{3}}={\cfrac {32}{3}}+{\cfrac {64/3}{2050-1-{\cfrac {1}{2050-{\cfrac {1}{2050-\ddots }}}}}}={\cfrac {32}{3}}+{\cfrac {64}{6150-3-{\cfrac {9}{6150-{\cfrac {9}{6150-\ddots }}}}}},}$

which is now [10;1,2, 10,2,1,20,1,2, 10,2,1,20,1,2, ...] evaluated at the third term and every six terms thereafter.

## Lucas sequence method

the Lucas sequence of the first kind Un(P,Q) is defined by the recurrence relations:

${\displaystyle U_{n}(P,Q)={\begin{cases}0&{\text{if }}n=0\\1&{\text{if }}n=1\\P\cdot U_{n-1}(P,Q)-Q\cdot U_{n-2}(P,Q)&{\text{Otherwise}}\end{cases}}}$

and the characteristic equation of it is:

${\displaystyle x^{2}-P\cdot x+Q=0}$

it has the discriminant ${\displaystyle D=P^{2}-4Q}$  and the roots:

${\displaystyle {\begin{matrix}x_{1}={\frac {P+{\sqrt {D}}}{2}},&x_{2}={\frac {P-{\sqrt {D}}}{2}}\end{matrix}}}$

all that yield the following positive value:

${\displaystyle \lim _{n\to \infty }{\frac {U_{n+1}}{U_{n}}}=x_{1}}$

so when we want ${\displaystyle {\sqrt {a}}}$ , we can choose ${\displaystyle P=2}$  and ${\displaystyle Q=1-a}$ , and then calculate ${\displaystyle x_{1}=1+{\sqrt {a}}}$  using ${\displaystyle U_{n+1}}$  and ${\displaystyle U_{n}}$ for large value of ${\displaystyle n}$ . The most effective way to calculate ${\displaystyle U_{n+1}}$  and ${\displaystyle U_{n}}$ is:

${\displaystyle {\begin{bmatrix}U_{n}\\U_{n+1}\end{bmatrix}}={\begin{bmatrix}0&1\\-Q&P\end{bmatrix}}\cdot {\begin{bmatrix}U_{n-1}\\U_{n}\end{bmatrix}}={\begin{bmatrix}0&1\\-Q&P\end{bmatrix}}^{n}\cdot {\begin{bmatrix}U_{0}\\U_{1}\end{bmatrix}}}$

Summary:

${\displaystyle {\begin{bmatrix}0&1\\a-1&2\end{bmatrix}}^{n}\cdot {\begin{bmatrix}0\\1\end{bmatrix}}={\begin{bmatrix}U_{n}\\U_{n+1}\end{bmatrix}}}$

then when ${\displaystyle n\to \infty }$ :

${\displaystyle {\sqrt {a}}={\frac {U_{n+1}}{U_{n}}}-1}$

## Using Pell's equation

Pell's equation (also known as Brahmagupta equation since he was the first to give a solution to this particular equation) and its variants yield a method for efficiently finding continued fraction convergents of square roots of integers. However, it can be complicated to execute, and usually not every convergent is generated. The ideas behind the method are as follows:

• If (p, q) is a solution (where p and q are integers) to the equation ${\displaystyle p^{2}=S\cdot q^{2}\pm 1\!}$ , then ${\displaystyle {\frac {p}{q}}}$  is a continued fraction convergent of ${\displaystyle {\sqrt {S}}}$ , and as such, is an excellent rational approximation to it.
• If (pa, qa) and (pb, qb) are solutions, then so is:
${\displaystyle p=p_{a}p_{b}+S\cdot q_{a}q_{b}\,\!}$
${\displaystyle q=p_{a}q_{b}+p_{b}q_{a}\,\!}$
(compare to the multiplication of quadratic integers)
• More generally, if (p1, q1) is a solution, then it is possible to generate a sequence of solutions (pn, qn) satisfying:
${\displaystyle p_{m+n}=p_{m}p_{n}+S\cdot q_{m}q_{n}\,\!}$
${\displaystyle q_{m+n}=p_{m}q_{n}+p_{n}q_{m}\,\!}$

The method is as follows:

• Find positive integers p1 and q1 such that ${\displaystyle p_{1}^{2}=S\cdot q_{1}^{2}\pm 1}$ . This is the hard part; It can be done either by guessing, or by using fairly sophisticated techniques.
• To generate a long list of convergents, iterate:
${\displaystyle p_{n+1}=p_{1}p_{n}+S\cdot q_{1}q_{n}\,\!}$
${\displaystyle q_{n+1}=p_{1}q_{n}+p_{n}q_{1}\,\!}$
• To find the larger convergents quickly, iterate:
${\displaystyle p_{2n}=p_{n}^{2}+S\cdot q_{n}^{2}\,\!}$
${\displaystyle q_{2n}=2p_{n}q_{n}\,\!}$
Notice that the corresponding sequence of fractions coincides with the one given by the Hero's method starting with ${\displaystyle \textstyle {\frac {p_{1}}{q_{1}}}}$ .
• In either case, ${\displaystyle {\frac {p_{n}}{q_{n}}}}$  is a rational approximation satisfying
${\displaystyle \left|{\frac {p_{n}}{q_{n}}}-{\sqrt {S}}\right|<{\frac {1}{q_{n}^{2}\cdot {\sqrt {S}}}}.}$

## Approximations that depend on the floating point representation

A number is represented in a floating point format as ${\displaystyle m\times b^{p}}$  which is also called scientific notation. Its square root is ${\displaystyle {\sqrt {m}}\times b^{p/2}}$  and similar formulae would apply for cube roots and logarithms. On the face of it, this is no improvement in simplicity, but suppose that only an approximation is required: then just ${\displaystyle b^{p/2}}$  is good to an order of magnitude. Next, recognise that some powers, p, will be odd, thus for 3141.59 = 3.14159 × 103 rather than deal with fractional powers of the base, multiply the mantissa by the base and subtract one from the power to make it even. The adjusted representation will become the equivalent of 31.4159 × 102 so that the square root will be 31.4159 × 10.

If the integer part of the adjusted mantissa is taken, there can only be the values 1 to 99, and that could be used as an index into a table of 99 pre-computed square roots to complete the estimate. A computer using base sixteen would require a larger table, but one using base two would require only three entries: the possible bits of the integer part of the adjusted mantissa are 01 (the power being even so there was no shift, remembering that a normalised floating point number always has a non-zero high-order digit) or if the power was odd, 10 or 11, these being the first two bits of the original mantissa. Thus, 6.25 = 110.01 in binary, normalised to 1.1001 × 22 an even power so the paired bits of the mantissa are 01, while .625 = 0.101 in binary normalises to 1.01 × 2−1 an odd power so the adjustment is to 10.1 × 2−2 and the paired bits are 10. Notice that the low order bit of the power is echoed in the high order bit of the pairwise mantissa. An even power has its low-order bit zero and the adjusted mantissa will start with 0, whereas for an odd power that bit is one and the adjusted mantissa will start with 1. Thus, when the power is halved, it is as if its low order bit is shifted out to become the first bit of the pairwise mantissa.

A table with only three entries could be enlarged by incorporating additional bits of the mantissa. However, with computers, rather than calculate an interpolation into a table, it is often better to find some simpler calculation giving equivalent results. Everything now depends on the exact details of the format of the representation, plus what operations are available to access and manipulate the parts of the number. For example, Fortran offers an EXPONENT(x) function to obtain the power. Effort expended in devising a good initial approximation is to be recouped by thereby avoiding the additional iterations of the refinement process that would have been needed for a poor approximation. Since these are few (one iteration requires a divide, an add, and a halving) the constraint is severe.

Many computers follow the IEEE (or sufficiently similar) representation, and a very rapid approximation to the square root can be obtained for starting Newton's method. The technique that follows is based on the fact that the floating point format (in base two) approximates the base-2 logarithm. That is ${\displaystyle \log _{2}(m\times 2^{p})=p+\log _{2}(m)}$

So for a 32-bit single precision floating point number in IEEE format (where notably, the power has a bias of 127 added for the represented form) you can get the approximate logarithm by interpreting its binary representation as a 32-bit integer, scaling it by ${\displaystyle 2^{-23}}$ , and removing a bias of 127, i.e.

${\displaystyle x_{\text{int}}\cdot 2^{-23}-127\approx \log _{2}(x).}$

For example, 1.0 is represented by a hexadecimal number 0x3F800000, which would represent ${\displaystyle 1065353216=127\cdot 2^{23}}$  if taken as an integer. Using the formula above you get ${\displaystyle 1065353216\cdot 2^{-23}-127=0}$ , as expected from ${\displaystyle \log _{2}(1.0)}$ . In a similar fashion you get 0.5 from 1.5 (0x3FC00000).

To get the square root, divide the logarithm by 2 and convert the value back. The following program demonstrates the idea. Note that the exponent's lowest bit is intentionally allowed to propagate into the mantissa. One way to justify the steps in this program is to assume ${\displaystyle b}$  is the exponent bias and ${\displaystyle n}$  is the number of explicitly stored bits in the mantissa and then show that

${\displaystyle (((x_{\text{int}}/2^{n}-b)/2)+b)\cdot 2^{n}=(x_{\text{int}}-2^{n})/2+((b+1)/2)\cdot 2^{n}.}$

/* Assumes that float is in the IEEE 754 single precision floating point format
* and that int is 32 bits. */
float sqrt_approx(float z)
{
int val_int = *(int*)&z; /* Same bits, but as an int */
/*
* To justify the following code, prove that
*
* ((((val_int / 2^m) - b) / 2) + b) * 2^m = ((val_int - 2^m) / 2) + ((b + 1) / 2) * 2^m)
*
* where
*
* b = exponent bias
* m = number of mantissa bits
*
* .
*/

val_int -= 1 << 23; /* Subtract 2^m. */
val_int >>= 1; /* Divide by 2. */
val_int += 1 << 29; /* Add ((b + 1) / 2) * 2^m. */

return *(float*)&val_int; /* Interpret again as float */
}


The three mathematical operations forming the core of the above function can be expressed in a single line. An additional adjustment can be added to reduce the maximum relative error. So, the three operations, not including the cast, can be rewritten as

val_int = (1 << 29) + (val_int >> 1) - (1 << 22) + a;


where a is a bias for adjusting the approximation errors. For example, with a = 0 the results are accurate for even powers of 2 (e.g., 1.0), but for other numbers the results will be slightly too big (e.g.,1.5 for 2.0 instead of 1.414... with 6% error). With a = -0x4B0D2, the maximum relative error is minimized to ±3.5%.

If the approximation is to be used for an initial guess for Newton's method to the equation ${\displaystyle (1/x^{2})-S=0}$ , then the reciprocal form shown in the following section is preferred.

### Reciprocal of the square root

A variant of the above routine is included below, which can be used to compute the reciprocal of the square root, i.e., ${\displaystyle x^{-{1 \over 2}}}$  instead, was written by Greg Walsh. The integer-shift approximation produced a relative error of less than 4%, and the error dropped further to 0.15% with one iteration of Newton's method on the following line.[14] In computer graphics it is a very efficient way to normalize a vector.

float invSqrt(float x)
{
float xhalf = 0.5f*x;
union
{
float x;
int i;
} u;
u.x = x;
u.i = 0x5f375a86 - (u.i >> 1);
/* The next line can be repeated any number of times to increase accuracy */
u.x = u.x * (1.5f - xhalf * u.x * u.x);
return u.x;
}


Some VLSI hardware implements inverse square root using a second degree polynomial estimation followed by a Goldschmidt iteration.[15]

## Negative or complex square

If S < 0, then its principal square root is

${\displaystyle {\sqrt {S}}={\sqrt {\vert S\vert }}\,\,i\,.}$

If S = a+bi where a and b are real and b ≠ 0, then its principal square root is

${\displaystyle {\sqrt {S}}={\sqrt {\frac {\vert S\vert +a}{2}}}\,+\,\operatorname {sgn}(b){\sqrt {\frac {\vert S\vert -a}{2}}}\,\,i\,.}$

This can be verified by squaring the root.[16][17] Here

${\displaystyle \vert S\vert ={\sqrt {a^{2}+b^{2}}}}$

is the modulus of S. The principal square root of a complex number is defined to be the root with the non-negative real part.

## Notes

1. ^ Fowler, David; Robson, Eleanor (1998). "Square Root Approximations in Old Babylonian Mathematics: YBC 7289 in Context". Historia Mathematica. 25 (25): 376. doi:10.1006/hmat.1998.2209. Retrieved 14 October 2018.
2. ^ Heath, Thomas (1921). A History of Greek Mathematics, Vol. 2. Oxford: Clarendon Press. pp. 323–324.
3. ^ Bailey, David; Borwein, Jonathan (2012). "Ancient Indian Square Roots: An Exercise in Forensic Paleo-Mathematics" (PDF). American Mathematical Monthly. 119 (8). pp. 646–657. Retrieved 2017-09-14.
4. ^ Fast integer square root by Mr. Woo's abacus algorithm (archived)
5. ^ Integer Square Root function
6. ^ Sri Bharati Krisna Tirthaji (2008) [1965]. Vedic Mathematics or Sixteen Simple Mathematical Formulae from the Vedas. Motilal Banarsidass. ISBN 978-8120801639.
7. ^ M. V. Wilkes, D. J. Wheeler and S. Gill, "The Preparation of Programs for an Electronic Digital Computer", Addison-Wesley, 1951.
8. ^ M. Campbell-Kelly, "Origin of Computing", Scientific American, September 2009.
9. ^ J. C. Gower, "A Note on an Iterative Method for Root Extraction", The Computer Journal 1(3):142–143, 1958.
10. ^ Markstein, Peter (November 2004). Software Division and Square Root Using Goldschmidt's Algorithms (PDF). 6th Conference on Real Numbers and Computers. Dagstuhl, Germany. CiteSeerX 10.1.1.85.9648.
11. ^ Meher, Pramod Kumar; Valls, Javier; Juang, Tso-Bing; Sridharan, K.; Maharatna, Koushik (2008-08-22). "50 Years of CORDIC: Algorithms, Architectures and Applications" (PDF). IEEE Transactions on Circuits & Systems-I: Regular Papers (published 2009-09-09). 56 (9): 1893–1907. doi:10.1109/TCSI.2009.2025803. Retrieved 2016-01-03.
12. ^ Beceanu, Marius. "Period of the Continued Fraction of sqrt(n)" (PDF). Theorem 2.3. Archived from the original (PDF) on 21 December 2015. Retrieved 21 December 2015.
13. ^ Gliga, Alexandra Ioana (March 17, 2006). On continued fractions of the square root of prime numbers (PDF). Corollary 3.3.
14. ^ Fast Inverse Square Root by Chris Lomont
15. ^ "High-Speed Double-Precision Computation of Reciprocal, Division, Square Root and Inverse Square Root" by José-Alejandro Piñeiro and Javier Díaz Bruguera 2002 (abstract)
16. ^ Abramowitz, Miltonn; Stegun, Irene A. (1964). Handbook of mathematical functions with formulas, graphs, and mathematical tables. Courier Dover Publications. p. 17. ISBN 978-0-486-61272-0., Section 3.7.26, p. 17
17. ^ Cooke, Roger (2008). Classical algebra: its nature, origins, and uses. John Wiley and Sons. p. 59. ISBN 978-0-470-25952-8., Extract: page 59