Fixed point iterations - Numerical Analysis

#%config InlineBackend.figure_format = 'svg'
from pylab import *

3.7.1 Newton as fixed point iteration

The Newton-Raphson method to find the zeros of $f(x)$ can be written as

x_{n+1} = \phi(x_n), \qquad \phi(x) = x - \frac{f(x)}{f'(x)}

(1)

At a root α with $f'(\alpha) \ne 0$ , we have

f(\alpha) = 0, \qquad \phi(\alpha) = \alpha - \frac{f(\alpha)}{f'(\alpha)} = \alpha

(2)

Thus the root is a fixed point of the function ϕ. We have converted the problem of root finding into one of finding the fixed points of a map via the fixed point iteration $x_{n+1} = \phi(x_n)$ .
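The iteration $x_{n+1} = \phi(x_n)$ can be sketched in a few lines. This is a minimal illustration, not a definitive implementation: the function name `fixed_point`, the stopping test on successive iterates, and the parameters `tol` and `maxiter` are assumptions made here, and the test function $f(x) = x^2 - 3$ is chosen only as an example.

```python
def fixed_point(phi, x0, tol=1.0e-12, maxiter=100):
    """Iterate x_{n+1} = phi(x_n) until two successive iterates differ by less than tol.

    Returns the last iterate and the number of iterations performed.
    (Hypothetical helper; name and stopping rule are illustrative choices.)
    """
    x = x0
    for n in range(1, maxiter + 1):
        x_new = phi(x)
        if abs(x_new - x) < tol:
            return x_new, n
        x = x_new
    return x, maxiter  # tolerance not reached within maxiter iterations

# Newton map phi(x) = x - f(x)/f'(x) for the example f(x) = x**2 - 3
f_sqrt = lambda x: x**2 - 3.0
df_sqrt = lambda x: 2.0 * x
phi_newton = lambda x: x - f_sqrt(x) / df_sqrt(x)

root, niter = fixed_point(phi_newton, 2.0)
print("root = %.15f after %d iterations" % (root, niter))
```

Any other map ϕ can be passed in the same way, which makes it easy to compare different fixed point forms of the same root finding problem.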

Example 1

To find $\sqrt{a}$ for $a > 0$ , we can find the roots of $f(x) = x^2 - a$ . We can rewrite this as a fixed point relation in many ways, e.g.,

$x = x + c(x^2 - a)$ for some $c \ne 0$
$x = \frac{a}{x}$
$x = \half(x + \frac{a}{x})$ , which is Newton method.

Not all of these fixed point iterations converge.

a = 3.0
c = 0.1
x1, x2, x3 = 2.0, 2.0, 2.0
print("%6d %18.10e %18.10e %18.10e" % (0,x1,x2,x3))
for i in range(10):
    x1 = x1 + c*(x1**2 - a)
    x2 = a/x2
    x3 = 0.5*(x3 + a/x3)
    print("%6d %18.10e %18.10e %18.10e" % (i+1,x1,x2,x3))

     0   2.0000000000e+00   2.0000000000e+00   2.0000000000e+00
     1   2.1000000000e+00   1.5000000000e+00   1.7500000000e+00
     2   2.2410000000e+00   2.0000000000e+00   1.7321428571e+00
     3   2.4432081000e+00   1.5000000000e+00   1.7320508100e+00
     4   2.7401346820e+00   2.0000000000e+00   1.7320508076e+00
     5   3.1909684895e+00   1.5000000000e+00   1.7320508076e+00
     6   3.9091964797e+00   2.0000000000e+00   1.7320508076e+00
     7   5.1373781913e+00   1.5000000000e+00   1.7320508076e+00
     8   7.4766436594e+00   2.0000000000e+00   1.7320508076e+00
     9   1.2766663700e+01   1.5000000000e+00   1.7320508076e+00
    10   2.8765433904e+01   2.0000000000e+00   1.7320508076e+00

Only the third form, the Newton method, converges to the root. The first form diverges, and the second oscillates between 2 and 1.5.

3.7.2 Fixed point, contraction map

Proof 2

Uniqueness. Suppose ϕ has two fixed points $\alpha, \beta \in [a,b]$ . Then

|\alpha - \beta| = |\phi(\alpha) - \phi(\beta)| \le \lambda |\alpha - \beta|

(8)

and hence

(1-\lambda) |\alpha - \beta| \le 0

(9)

Since $0 < \lambda < 1$ , we conclude that only equality is possible, which implies that $\alpha =\beta$ .

Convergence. Since $\phi([a,b]) \subset [a,b]$ , the sequence generated by $x_n = \phi(x_{n-1})$ is contained in $[a,b]$ .

|\alpha - x_{n+1}| = |\phi(\alpha) - \phi(x_n)| \le \lambda |\alpha - x_n|

(10)

and by induction

|\alpha - x_n| \le \lambda^n |\alpha - x_0|

(11)

As $n \to \infty$ , $\lambda^n \to 0$ so that $x_n \to \alpha$ .

Error bound. Using the contraction property and the triangle inequality, we have

|\alpha - x_0| \le |\alpha - x_1| + |x_1 - x_0| \le \lambda |\alpha - x_0| + |x_1 - x_0|

(12)

and hence

|\alpha - x_0| \le \frac{1}{1-\lambda} |x_1 - x_0|

(13)

Combining this with (11), we get

|\alpha - x_n| \le \lambda^n |\alpha - x_0| \le \frac{\lambda^n}{1-\lambda} |x_1 - x_0|

(14)
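The a priori bound (14) can be checked numerically. The sketch below assumes the example map $\phi(x) = \cos x$ on $[0,1]$, which is not from the text: it maps $[0,1]$ into itself and has $\lambda = \max |\sin x| = \sin 1 < 1$ there. The reference value of α is obtained simply by iterating many times.

```python
import math

phi = math.cos           # assumed example map, a contraction on [0, 1]
lam = math.sin(1.0)      # contraction constant: max |phi'(x)| on [0, 1]

# Reference fixed point: iterate until the error is at rounding level
alpha = 0.0
for _ in range(200):
    alpha = phi(alpha)

x0 = 0.0
x1 = phi(x0)
x = x0
bound_holds = True
print("%3s %14s %14s" % ("n", "|alpha - x_n|", "bound (14)"))
for n in range(11):
    err = abs(alpha - x)
    bound = lam**n / (1.0 - lam) * abs(x1 - x0)
    print("%3d %14.6e %14.6e" % (n, err, bound))
    bound_holds = bound_holds and (err <= bound)
    x = phi(x)

print("bound satisfied for all n:", bound_holds)
```

The bound is typically pessimistic, because λ is the worst-case contraction factor over the whole interval while the actual error reduction is governed by the behaviour of ϕ near α.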

3.7.3 Differentiability and contractivity

Theorem 3

Assume that $\phi : [a,b] \to [a,b]$ is continuously differentiable and

\lambda = \max_{a \le x \le b} |\phi'(x)| < 1

(19)

Then

1. ϕ has a unique fixed point $\alpha \in [a,b]$ .
2. For any $x_0 \in [a,b]$ , the iterations $x_{n+1} = \phi(x_{n})$ converge to α.
3. $|\alpha - x_n| \le \lambda^n |\alpha - x_0| \le \frac{\lambda^n}{1-\lambda} | x_1 - x_0|$ and $\lim_{n \to \infty} \frac{\alpha - x_{n+1}}{\alpha - x_n} = \phi'(\alpha)$

(20)

Proof 3

(1,2) By the mean value theorem, for any $x, y \in [a,b]$ there is some ξ between $x$ and $y$ such that

\phi(x) - \phi(y) = \phi'(\xi) (x-y)

(21)

and hence

|\phi(x) - \phi(y)| = |\phi'(\xi)| |x-y| \le \lambda |x-y|

(22)

so that ϕ is a contraction map, and (1) and (2) follow from the previous theorem.

(3) Now

\alpha - x_{n+1} = \phi(\alpha) - \phi(x_n) = \phi'(\xi_n) (\alpha - x_n), \qquad \xi_n \textrm{ between } \alpha, x_n

(23)

and $\xi_n \to \alpha$ since $x_n \to \alpha$ . Because $\phi'$ is continuous, it follows that

\lim_{n \to \infty} \frac{\alpha - x_{n+1}}{\alpha - x_n} = \lim_{n \to \infty} \phi'(\xi_n) = \phi'(\alpha)

(24)

If $\phi'(\alpha) \ne 0$ , then the sequence $\{ x_n \}$ converges to α with order $p=1$ , i.e., we have linear convergence.
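The limit in (24) can be observed numerically. As a hedged illustration, the sketch below again assumes the map $\phi(x) = \cos x$ (an example chosen here, not taken from the text), for which $\phi'(\alpha) = -\sin\alpha \ne 0$ , so the error ratios should settle near that value.

```python
import math

phi = math.cos  # assumed example map with phi'(alpha) != 0

# Reference fixed point obtained by iterating to rounding level
alpha = 0.0
for _ in range(200):
    alpha = phi(alpha)

x = 0.0
ratio = 0.0
for n in range(30):
    x_new = phi(x)
    # error ratio (alpha - x_{n+1}) / (alpha - x_n)
    ratio = (alpha - x_new) / (alpha - x)
    if n % 5 == 4:
        print("%3d %18.10e %14.6e" % (n + 1, x_new, ratio))
    x = x_new

dphi_alpha = -math.sin(alpha)
print("last ratio  = %14.6e" % ratio)
print("phi'(alpha) = %14.6e" % dphi_alpha)
```

The ratio is negative here, which means the iterates alternate from one side of α to the other while the error shrinks by a roughly constant factor per step.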

Proof 4

Pick a number λ satisfying

|\phi'(\alpha)| < \lambda < 1

(25)

Then pick an interval $I = [\alpha - \delta, \alpha + \delta]$ such that

\max_{x \in I} |\phi'(x)| \le \lambda < 1

(26)

Now for any $x \in I$ , $|\alpha - x| \le \delta$ and

|\alpha - \phi(x)| = |\phi(\alpha) - \phi(x)| = |\phi'(\xi)| \cdot |\alpha - x|

(27)

where ξ is between $\alpha, x$ and hence in $I$ , so that

|\alpha - \phi(x)| \le \lambda |\alpha - x| < |\alpha - x| \le \delta

(28)

Hence we have $\phi(I) \subset I$ . Now apply the previous theorem using $[a,b] = [\alpha- \delta,\alpha+\delta]$ .
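The condition $|\phi'(\alpha)| < 1$ explains the behaviour of the three iterations in Example 1. The sketch below evaluates the derivatives of the three iteration functions at $\alpha = \sqrt{a}$ for $a = 3$ , $c = 0.1$ ; the derivative formulas are worked out by hand here and the variable names are illustrative.

```python
import math

a, c = 3.0, 0.1
alpha = math.sqrt(a)

# Derivatives of the three iteration functions of Example 1, evaluated at alpha
dphi1 = 1.0 + 2.0 * c * alpha        # phi1(x) = x + c*(x**2 - a)
dphi2 = -a / alpha**2                # phi2(x) = a/x
dphi3 = 0.5 * (1.0 - a / alpha**2)   # phi3(x) = (x + a/x)/2, the Newton map

for name, d in [("phi1", dphi1), ("phi2", dphi2), ("phi3", dphi3)]:
    print("%s: |phi'(alpha)| = %10.6f" % (name, abs(d)))
```

The first form has $|\phi'(\alpha)| > 1$ for this choice of $c$ , so the fixed point repels nearby iterates; it would satisfy the condition for $-1/\sqrt{a} < c < 0$ . The second form has $|\phi'(\alpha)| = 1$ , so the theorem does not apply, and in fact $\phi(\phi(x)) = x$ , which is the oscillation seen in the table. The Newton form has $\phi'(\alpha) = 0$ .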

3.7.4 Order of convergence

If $\phi'(\alpha) \ne 0$ , then we only get linear convergence. For faster convergence, derivatives of ϕ need to vanish at the root.

Proof 5

Since $\phi'(\alpha) = 0$ , we can choose an interval $I = [\alpha-\delta,\alpha+\delta]$ in which $|\phi'(x)| < 1$ . Using previous theorem, we know the iterations converge for any initial guess $x_0 \in I$ .

Now by Taylor expansion around α

\begin{aligned} x_{n+1} &= \phi(x_n) \\ &= \phi(\alpha) + (x_n-\alpha) \phi'(\alpha) + \ldots + \frac{(x_n-\alpha)^{p-1}} {(p-1)!} \phi^{(p-1)}(\alpha) \\ & \quad + \frac{(x_n-\alpha)^p}{p!} \phi^{(p)}(\xi_n), \qquad \textrm{$\xi_n$ between $x_n$ and $\alpha$} \end{aligned}

(33)

Using the given conditions on ϕ

x_{n+1} = \alpha + \frac{(x_n-\alpha)^p}{p!} \phi^{(p)}(\xi_n)

(34)

and hence we have $p$ ’th order convergence

|x_{n+1} - \alpha| \le M |x_n-\alpha|^p, \qquad M = \frac{1}{p!} \max_{x \in I} |\phi^{(p)}(x)|

(35)

The last result also follows easily since $\xi_n \to \alpha$ .

Example 3 (Newton-Raphson method)

The iteration function for Newton-Raphson method is

\phi(x) = x - \frac{f(x)}{f'(x)}

(36)

for which

\phi'(x) = \frac{f(x) f''(x)}{[f'(x)]^2}, \qquad \phi''(x) = \frac{f''(x)}{f'(x)} + \frac{f(x) f'''(x)}{[f'(x)]^2} - \frac{2 f(x) [f''(x)]^2}{[f'(x)]^3}

(37)

If $f'(\alpha) \ne 0$ , then

\phi'(\alpha) = 0, \qquad \phi''(\alpha) = \frac{f''(\alpha)}{f'(\alpha)}

(38)

and hence Newton method converges with order at least $p=2$ .
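The order can be estimated from computed errors using $p \approx \log(e_{n+1}/e_n) / \log(e_n/e_{n-1})$ with $e_n = |x_n - \alpha|$ . The sketch below applies this to the Newton iteration for $f(x) = x^2 - 3$ from Example 1; the estimator and the variable names are illustrative assumptions, and only a few iterations are used because the error quickly reaches rounding level.

```python
import math

a = 3.0
alpha = math.sqrt(a)

# A few Newton iterations for f(x) = x**2 - a, starting from x0 = 2
x = 2.0
errors = [abs(x - alpha)]
for n in range(3):
    x = 0.5 * (x + a / x)
    errors.append(abs(x - alpha))

# Estimated order from three consecutive errors
p_est = math.log(errors[3] / errors[2]) / math.log(errors[2] / errors[1])

# Asymptotic error constant: |phi''(alpha)|/2 = |f''(alpha)| / (2 |f'(alpha)|) = 1/(2 alpha)
const_est = errors[3] / errors[2]**2
const_exact = 1.0 / (2.0 * alpha)

print("estimated order p        = %.4f" % p_est)
print("e_{n+1}/e_n^2            = %.6f" % const_est)
print("|phi''(alpha)|/2         = %.6f" % const_exact)
```

The estimate is only meaningful while the errors are well above machine precision, which for a quadratically convergent method leaves just a handful of usable iterates.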

Example 4

Consider the function

f(x) = (x-1)^2 \sin(x)

(39)

for which $x=1$ is a double root.

def f(x):
    return (x-1.0)**2 * sin(x)

def df(x):
    return 2.0*(x-1.0)*sin(x) + (x-1.0)**2 * cos(x)

x = linspace(0.0,2.0,100)
plot(x, f(x))
xlabel('x')
ylabel('f(x)')
grid(True);

Here is the Newton method

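def newton(x0,m=1.0):
    n = 50                       # maximum number of iterates stored
    x = zeros(n)
    x[0] = x0
    print("%6d %24.14e" % (0,x[0]))
    for i in range(1,n):
        # Newton update scaled by m; m = 1 is the standard method
        x[i] = x[i-1] - m*f(x[i-1])/df(x[i-1])
        # ratio of successive differences, printed in the last column
        if i > 1:
            r = (x[i] - x[i-1])/(x[i-1]-x[i-2])
        else:
            r = 0.0
        print("%6d %24.14e %14.6e" % (i,x[i],r))
        # stop once the residual is at rounding level
        if abs(f(x[i])) < 1.0e-14:
            break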

The Newton method gives

newton(2.0)

     0     2.00000000000000e+00
     1     1.35163555744248e+00   0.000000e+00
     2     1.18244356861394e+00   2.609520e-01
     3     1.09450383817604e+00   5.197630e-01
     4     1.04837639635805e+00   5.245347e-01
     5     1.02452044172239e+00   5.171749e-01
     6     1.01235093391487e+00   5.101245e-01
     7     1.00619920246051e+00   5.055037e-01
     8     1.00310567444820e+00   5.028711e-01
     9     1.00155437342780e+00   5.014666e-01
    10     1.00077757303341e+00   5.007412e-01
    11     1.00038888338214e+00   5.003726e-01
    12     1.00019446594325e+00   5.001868e-01
    13     1.00009723903915e+00   5.000935e-01
    14     1.00004862103702e+00   5.000468e-01
    15     1.00002431089794e+00   5.000234e-01
    16     1.00001215554384e+00   5.000117e-01
    17     1.00000607779564e+00   5.000059e-01
    18     1.00000303890375e+00   5.000029e-01
    19     1.00000151945336e+00   5.000015e-01
    20     1.00000075972705e+00   5.000007e-01
    21     1.00000037986362e+00   5.000004e-01
    22     1.00000018993183e+00   5.000002e-01
    23     1.00000009496592e+00   5.000001e-01

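which shows convergence towards $x=1$ . Since $f(1) = f'(1) = 0$ , the formula for $\phi'$ in (37) must be interpreted as a limit at the root,

\phi'(1) = \lim_{x \to 1} \frac{f(x) f''(x)}{[f'(x)]^2} = \half < 1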

(40)

Consistent with this, we observe that Newton method is converging but only linearly; the last column above shows that

\frac{|x_{n+1} - x_n|}{|x_n - x_{n-1}|} \approx \half = \phi'(1), \qquad n \to \infty

(41)

3.7.5 Multiple roots and Newton method

Let α be a root of $f(x)$ with multiplicity $m$ , so that

f(\alpha) = f'(\alpha) = \ldots = f^{(m-1)}(\alpha) = 0, \qquad f^{(m)}(\alpha) \ne 0

(42)

A Taylor expansion around α gives

f(x) = \frac{(x-\alpha)^m}{m!} f^{(m)}(\xi_x), \qquad \textrm{$\xi_x$ between $ \alpha$ and $x$}

(43)

Near α, $f(x)$ behaves like

f(x) = (x-\alpha)^m h(x), \qquad h(\alpha) \ne 0

(44)

Let us apply Newton method to this function. We have

f'(x) = (x-\alpha)^m h'(x) + m(x-\alpha)^{m-1} h(x)

(45)

so that the iteration function of Newton method is

\phi(x) = x - \frac{f(x)}{f'(x)} = x - \frac{(x-\alpha) h(x)}{mh(x) + (x-\alpha)h'(x)}

(46)

This satisfies

\phi'(\alpha) = 1 - \frac{1}{m} \ne 0 \qquad \textrm{if} \quad m \ge 2

(47)

Thus Newton method converges since $|\phi'(\alpha)| < 1$ , but only linearly, with rate of convergence $c = \frac{m-1}{m}$ .
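The rate $c = \frac{m-1}{m}$ can be observed for other multiplicities as well. As a hedged illustration, the sketch below assumes the test function $f(x) = (x-1)^3 \sin(x)$ , which has a triple root at $x = 1$ and is not from the text; the names `f3` and `df3` are chosen here to avoid clashing with the functions of Example 4.

```python
import math

m = 3        # assumed multiplicity of the root for this illustration
alpha = 1.0  # known root of the assumed test function

def f3(x):
    return (x - alpha)**m * math.sin(x)

def df3(x):
    return m * (x - alpha)**(m - 1) * math.sin(x) + (x - alpha)**m * math.cos(x)

x = 2.0
ratio = 0.0
for n in range(25):
    x_new = x - f3(x) / df3(x)               # standard Newton step
    ratio = (alpha - x_new) / (alpha - x)     # error ratio
    x = x_new

print("last error ratio = %.6f" % ratio)
print("(m-1)/m          = %.6f" % ((m - 1) / m))
```

The larger the multiplicity, the closer the rate is to 1 and the slower the unmodified Newton method becomes.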

To improve Newton method for multiple roots, we need an iteration function with $\phi'(\alpha) = 0$ . This is satisfied for

\phi(x) = x - m \frac{f(x)}{f'(x)}

(48)

and

\lim_{n \to \infty} \frac{\alpha - x_{n+1}}{(\alpha - x_n)^2} = - \half \phi''(\alpha)

(49)

which recovers the quadratic convergence of Newton method.
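The claim $\phi'(\alpha) = 0$ for the modified iteration (48) can be verified with a short computation. The following is a sketch that assumes $h$ is continuous with bounded derivative near α; the auxiliary function $g = f/f'$ is introduced here only for convenience.

```latex
\begin{aligned}
g(x) &= \frac{f(x)}{f'(x)} = \frac{(x-\alpha) h(x)}{m h(x) + (x-\alpha) h'(x)}, \qquad g(\alpha) = 0 \\
g'(\alpha) &= \lim_{x \to \alpha} \frac{g(x) - g(\alpha)}{x - \alpha}
            = \lim_{x \to \alpha} \frac{h(x)}{m h(x) + (x-\alpha) h'(x)} = \frac{1}{m} \\
\phi(x) &= x - m g(x) \quad \Longrightarrow \quad \phi'(\alpha) = 1 - m g'(\alpha) = 1 - m \cdot \frac{1}{m} = 0
\end{aligned}
```

The same computation with the factor $m$ replaced by 1 gives $\phi'(\alpha) = 1 - \frac{1}{m}$ , in agreement with (47).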

Example 5

We repeat the previous example with $m=2$ , since $x=1$ is a double root.

newton(2.0, m=2)

     0     2.00000000000000e+00
     1     7.03271114884954e-01   0.000000e+00
     2     1.06293359720705e+00  -2.773614e-01
     3     1.00108318855754e+00  -1.719679e-01
     4     1.00000037565568e+00   1.750696e-02
     5     1.00000000000005e+00   3.469257e-04

We now recover quadratic convergence, which is clearly faster than linear convergence: the iteration stops after 5 steps instead of 23. Compare the number of correct decimal places in the root between the two methods.

The viewpoint of fixed point iterations helped us to understand why Newton method converges only linearly to multiple roots, and also helped to fix it so that quadratic convergence is recovered.
