Approximation
Consider a function \(f(x) = \log(1+x)\), for \(x > 0\) as shown in Figure 1.5. This is a nonlinear function, and we all know that nonlinear functions are not fun to deal with. For example, if you want to integrate the function \(\int_a^b x \log(1+x) \; dx\), then the logarithm will force you to do integration by parts. However, in many practical problems, you may not need the full range of \(x > 0\). Suppose that you are only interested in values \(x \ll 1\). Then the logarithm can be approximated, and thus the integral can also be approximated.


To see how this is even possible, we show in Figure 1.5 the nonlinear function \(f(x) = \log(1+x)\) and an approximation \(\widehat{f}(x) = x\). The approximation is carefully chosen such that for \(x \ll 1\), the approximation \(\widehat{f}(x)\) is close to the true function \(f(x)\). Therefore, we can argue that for \(x \ll 1\),
thereby simplifying the calculation. For example, if you want to integrate \(x\log (1+x)\) for \(0 < x < 0.1\), then the integral can be approximated by \(\int_0^{0.1} x\log (1+x) \; dx \approx \int_0^{0.1} x^2 \; dx = \frac{x^3}{3} = 3.33 \times 10^{-4}\). (The actual integral is \(3.21 \times 10^{-4}\).) In this section, we will learn about the basic approximation techniques. We will use them when we discuss limit theorems in Chapter 6, as well as various distributions, such as from binomial to Poisson.
1.2.1Taylor approximation
Given a function \(f: \R \rightarrow \R\), it is often useful to analyze its behavior by approximating \(f\) using its local information. Taylor approximation (or Taylor series) is one of the tools for such a task. We will use the Taylor approximation on many occasions.
Let \(f: \R \rightarrow \R\) be a continuous function with infinite derivatives. Let \(a \in \R\) be a fixed constant. The Taylor approximation of \(f\) at \(x = a\) is
where \(f^{(n)}\) denotes the \(n\)th-order derivative of \(f\).
Taylor approximation is a geometry-based approximation. It approximates the function according to the offset, slope, curvature, and so on. According to Definition def:Taylor, the Taylor series has an infinite number of terms. If we use a finite number of terms, we obtain the \(n\)th-order Taylor approximation:
Here, the big-O notation \(\calO(\varepsilon^k)\) means any term that has an order at least power \(k\). For small \(\varepsilon\), i.e., \(\varepsilon \ll 1\), a high-order term \(\calO(\varepsilon^k) \approx 0\) for large \(k\).
Let \(f(x) = \sin x\). Then the Taylor approximation at \(x = 0\) is
We can expand further to higher orders, which yields
We show the first few approximations in Figure 1.6.
One should be reminded that Taylor approximation approximates a function \(f(x)\) at a particular point \(x = a\). Therefore, the approximation of \(f\) near \(x = 0\) and the approximation of \(f\) near \(x = \pi/2\) are different. For example, the Taylor approximation at \(x = \pi/2\) for \(f(x) = \sin x\) is


1.2.2Exponential series
An immediate application of the Taylor approximation is to derive the exponential series.
Let \(x\) be any real number. Then,
Proof. Let \(f(x) = e^x\) for any \(x\). Then, the Taylor approximation around \(x = 0\) is
Evaluate \(\displaystyle \sum_{k=0}^{\infty} \frac{\lambda^k e^{-\lambda}}{k!}\).
This result will be useful for Poisson random variables in Chapter 3.
If we substitute \(x = j\theta\) where \(j = \sqrt{-1}\), then we can show that
Matching the real and imaginary terms, we can show that
This gives the infinite series representations of the two trigonometric functions.
1.2.3Logarithmic approximation
Taylor approximation also allows us to find approximations to logarithmic functions. We start by presenting a lemma.
Let \(0 < x < 1\) be a constant. Then,
Proof. Let \(f(x) = \log(1+x)\). Then, the derivatives of \(f\) are
Taylor approximation at \(x = 0\) gives
The difference between this result and the result we showed in the beginning of this section is the order of polynomials we used to approximate the logarithm:
- sep0ex
- First-order: \(\log(1+x) = x\)
- Second-order: \(\log(1+x) = x - x^2/2\).
What order of approximation is good? It depends on where you want the approximation to be good, and how far you want the approximation to go. The difference between first-order and second-order approximations is shown in Figure 1.7.


When we prove the Central Limit Theorem in Chapter 6, we need to use the following result.
The proof of this equation can be done using the Taylor approximation. Consider \(N \log\left(1+\frac{s^2}{2N}\right)\). By the logarithmic lemma, we can obtain the second-order approximation:
Therefore, multiplying both sides by \(N\) yields
Putting the limit \(N \rightarrow \infty\), we can show that
Taking the exponential of both sides yields $$\exp\left\{ \lim_{N \rightarrow \infty} N \log \bigg(1+\frac{s^2}{2N}\bigg)\right\} = \exp\left\{\frac{s^2}{2}\right\}.$$ Moving the limit outside the exponential yields the result. Figure 1.8 provides a pictorial illustration.
