
## Section4.2Riemann Sums

###### Motivating Questions
• How can we use a Riemann sum to estimate the area between a given curve and the horizontal axis over a particular interval?

• What are the differences among left, right, middle, and random Riemann sums?

• How can we write Riemann sums in an abbreviated form?

In Section 4.1, we learned that if we have a moving object with velocity function $v\text{,}$ whenever $v(t)$ is positive, the area between $y = v(t)$ and the $t$-axis over a given time interval tells us the distance traveled by the object over that time period; in addition, if $v(t)$ is sometimes negative and we view the area of any region below the $t$-axis as having an associated negative sign, then the sum of these signed areas over a given interval tells us the moving object's change in position over the time interval.

For instance, for the velocity function given in Figure 4.2.1, if the areas of shaded regions are $A_1\text{,}$ $A_2\text{,}$ and $A_3$ as labeled, then the total distance $D$ traveled by the moving object on $[a,b]$ is

\begin{equation*} D = A_1 + A_2 + A_3\text{,} \end{equation*}

while the total change in the object's position on $[a,b]$ is

\begin{equation*} s(b) - s(a) = A_1 - A_2 + A_3\text{.} \end{equation*}

Because the motion is in the negative direction on the interval where $v(t) \lt 0\text{,}$ we subtract $A_2$ when determining the object's total change in position.

Of course, finding $D$ and $s(b)-s(a)$ for the situation given in Figure 4.2.1 presumes that we can actually find the areas represented by $A_1\text{,}$ $A_2\text{,}$ and $A_3\text{.}$ In most of our work in Section 4.1, such as in Activities 4.1.3 and 4.1.4, we worked with velocity functions that were either constant or linear, so that by finding the areas of rectangles and triangles, we could find the area bounded by the velocity function and the horizontal axis exactly. But when the curve that bounds a region is not one for which we have a known formula for area, we are unable to find this area exactly. Indeed, this is one of our biggest goals in Chapter 4: to learn how to find the exact area bounded between a curve and the horizontal axis for as many different types of functions as possible.

To begin, we expand on the ideas in Activity 4.1.2, where we encountered a nonlinear velocity function and approximated the area under the curve using four and eight rectangles, respectively. In the following preview activity, we focus on three different options for deciding how to find the heights of the rectangles we will use.

###### Preview Activity4.2.1

A person walking along a straight path has her velocity in miles per hour at time $t$ given by the function $v(t) = 0.25t^3-1.5t^2+3t+0.25\text{,}$ for times in the interval $0 \le t \le 2\text{.}$ The graph of this function is also given in each of the three diagrams in Figure 4.2.2.

Note that in each diagram, we use four rectangles to estimate the area under $y = v(t)$ on the interval $[0,2]\text{,}$ but the method by which the four rectangles' respective heights are decided varies among the three individual graphs.

1. How are the heights of rectangles in the left-most diagram being chosen? Explain, and hence determine the value of

\begin{equation*} S = A_1 + A_2 + A_3 + A_4 \end{equation*}

by evaluating the function $y = v(t)$ at appropriately chosen values and observing the width of each rectangle. Note, for example, that

\begin{equation*} A_3 = v(1) \cdot \frac{1}{2} = 2 \cdot \frac{1}{2} = 1\text{.} \end{equation*}
2. Explain how the heights of rectangles are being chosen in the middle diagram and find the value of

\begin{equation*} T = B_1 + B_2 + B_3 + B_4\text{.} \end{equation*}
3. Likewise, determine the pattern of how heights of rectangles are chosen in the right-most diagram and determine

\begin{equation*} U = C_1 + C_2 + C_3 + C_4\text{.} \end{equation*}
4. Of the estimates $S\text{,}$ $T\text{,}$ and $U\text{,}$ which do you think is the best approximation of $D\text{,}$ the total distance the person traveled on $[0,2]\text{?}$ Why?

### Subsection4.2.1Sigma Notation

It is apparent from several different problems we have considered that sums of areas of rectangles is one of the main ways to approximate the area under a curve over a given interval. Intuitively, we expect that using a larger number of thinner rectangles will provide a way to improve the estimates we are computing. As such, we anticipate dealing with sums with a large number of terms. To do so, we introduce the use of so-called sigma notation, named for the Greek letter $\Sigma\text{,}$ which is the capital letter $S$ in the Greek alphabet.

For example, say we are interested in the sum

\begin{equation*} 1 + 2 + 3 + \cdots + 100\text{,} \end{equation*}

which is the sum of the first 100 natural numbers. Sigma notation provides a shorthand notation that recognizes the general pattern in the terms of the sum. It is equivalent to write

\begin{equation*} \sum_{k=1}^{100} k = 1 + 2 + 3 + \cdots + 100\text{.} \end{equation*}

We read the symbol $\sum_{k=1}^{100} k$ as “the sum from $k$ equals 1 to 100 of $k\text{.}$” The variable $k$ is usually called the index of summation, and the letter that is used for this variable is immaterial. Each sum in sigma notation involves a function of the index; for example,

\begin{equation*} \sum_{k=1}^{10} (k^2 + 2k) = (1^2 + 2\cdot 1) + (2^2 + 2\cdot 2) + (3^2 + 2\cdot 3) + \cdots + (10^2 + 2\cdot 10)\text{,} \end{equation*}

and more generally,

\begin{equation*} \sum_{k=1}^n f(k) = f(1) + f(2) + \cdots + f(n)\text{.} \end{equation*}

Sigma notation allows us the flexibility to easily vary the function being used to track the pattern in the sum, as well as to adjust the number of terms in the sum simply by changing the value of $n\text{.}$ We test our understanding of this new notation in the following activity.

###### Activity4.2.2

For each sum written in sigma notation, write the sum long-hand and evaluate the sum to find its value. For each sum written in expanded form, write the sum in sigma notation.

1. $\sum_{k=1}^{5} (k^2 + 2)$

2. $\sum_{i=3}^{6} (2i-1)$

3. $3 + 7 + 11 + 15 + \cdots + 27$

4. $4 + 8 + 16 + 32 + \cdots + 256$

5. $\sum_{i=1}^{6} \frac{1}{2^i}$

### Subsection4.2.2Riemann Sums

When a moving body has a positive velocity function $y = v(t)$ on a given interval $[a,b]\text{,}$ we know that the area under the curve over the interval is the total distance the body travels on $[a,b]\text{.}$ While this is the fundamental motivating force behind our interest in the area bounded by a function, we are also interested more generally in being able to find the exact area bounded by $y = f(x)$ on an interval $[a,b]\text{,}$ regardless of the meaning or context of the function $f\text{.}$ For now, we continue to focus on determining an accurate estimate of this area through the use of a sum of the areas of rectangles, doing so in the setting where $f(x) \ge 0$ on $[a,b]\text{.}$ Throughout, unless otherwise indicated, we also assume that $f$ is continuous on $[a,b]\text{.}$

The first choice we make in any such approximation is the number of rectangles.

If we say that the total number of rectangles is $n\text{,}$ and we desire $n$ rectangles of equal width to subdivide the interval $[a,b]\text{,}$ then each rectangle must have width $\Delta x = \frac{b-a}{n}\text{.}$ We observe further that $x_1 = x_0 + \Delta x\text{,}$ $x_2 = x_0 + 2 \Delta x\text{,}$ and thus in general $x_{i} = a + i\Delta x\text{,}$ as pictured in Figure 4.2.3.

We use each subinterval $[x_i, x_{i+1}]$ as the base of a rectangle, and next must choose how to decide the height of the rectangle that will be used to approximate the area under $y = f(x)$ on the subinterval. There are three standard choices: use the left endpoint of each subinterval, the right endpoint of each subinterval, or the midpoint of each. These are precisely the options encountered in Preview Activity 4.2.1 and seen in Figure 4.2.2. We next explore how these choices can be reflected in sigma notation.

If we now consider an arbitrary positive function $f$ on $[a,b]$ with the interval subdivided as shown in Figure 4.2.3, and choose to use left endpoints, then on each interval of the form $[x_{i}, x_{i+1}]\text{,}$ the area of the rectangle formed is given by

\begin{equation*} A_{i+1} = f(x_i) \cdot \Delta x\text{,} \end{equation*}

as seen in Figure 4.2.4.

If we let $L_n$ denote the sum of the areas of rectangles whose heights are given by the function value at each respective left endpoint, then we see that

\begin{align*} L_n =\mathstrut \amp A_1 + A_2 + \cdots + A_{i+1} + \cdots + A_n\\ =\mathstrut \amp f(x_0) \cdot \Delta x + f(x_1) \cdot \Delta x + \cdots + f(x_i) \cdot \Delta x + \cdots + f(x_{n-1}) \cdot \Delta x\text{.} \end{align*}

In the more compact sigma notation, we have

\begin{equation*} L_n = \sum_{i = 0}^{n-1} f(x_i) \Delta x\text{.} \end{equation*}

Note particularly that since the index of summation begins at $0$ and ends at $n-1\text{,}$ there are indeed $n$ terms in this sum. We call $L_n$ the left Riemann sum for the function $f$ on the interval $[a,b]\text{.}$

There are now two fundamental issues to explore: the number of rectangles we choose to use and the selection of the pattern by which we identify the height of each rectangle. It is best to explore these choices dynamically, and the applet 1 Marc Renault, Geogebra Calculus Applets.found at http://gvsu.edu/s/a9 is a particularly useful one. There we see the image shown in Figure 4.2.5, but with the opportunity to adjust the slider bars for the left endpoint and the number of subintervals.

By moving the sliders, we can see how the heights of the rectangles change as we consider left endpoints, midpoints, and right endpoints, as well as the impact that a larger number of narrower rectangles has on the approximation of the exact area bounded by the function and the horizontal axis.

To see how the Riemann sums for right endpoints and midpoints are constructed, we consider Figure 4.2.6.

For the sum with right endpoints, we see that the area of the rectangle on an arbitrary interval $[x_i, x_{i+1}]$ is given by $B_{i+1} = f(x_{i+1}) \cdot \Delta x\text{,}$ so that the sum of all such areas of rectangles is given by

\begin{align*} R_n =\mathstrut \amp B_1 + B_2 + \cdots + B_{i+1} + \cdots + B_n\\ =\mathstrut \amp f(x_1) \cdot \Delta x + f(x_2) \cdot \Delta x + \cdots + f(x_{i+1}) \cdot \Delta x + \cdots + f(x_{n}) \cdot \Delta x\\ =\mathstrut \amp \sum_{i=1}^{n} f(x_i) \Delta x\text{.} \end{align*}

We call $R_n$ the right Riemann sum for the function $f$ on the interval $[a,b]\text{.}$ For the sum that uses midpoints, we introduce the notation

\begin{equation*} \overline{x}_{i+1} = \frac{x_{i} + x_{i+1}}{2} \end{equation*}

so that $\overline{x}_{i+1}$ is the midpoint of the interval $[x_i, x_{i+1}]\text{.}$ For instance, for the rectangle with area $C_1$ in Figure 4.2.6, we now have

\begin{equation*} C_1 = f(\overline{x}_1) \cdot \Delta x\text{.} \end{equation*}

Hence, the sum of all the areas of rectangles that use midpoints is

\begin{align*} M_n =\mathstrut \amp C_1 + C_2 + \cdots + C_{i+1} + \cdots + C_n\\ =\mathstrut \amp f(\overline{x_1}) \cdot \Delta x + f(\overline{x_2}) \cdot \Delta x + \cdots + f(\overline{x}_{i+1}) \cdot \Delta x + \cdots + f(\overline{x}_{n}) \cdot \Delta x\\ =\mathstrut \amp \sum_{i=1}^{n} f(\overline{x}_i) \Delta x\text{,} \end{align*}

and we say that $M_n$ is the middle Riemann sum for $f$ on $[a,b]\text{.}$

When $f(x) \ge 0$ on $[a,b]\text{,}$ each of the Riemann sums $L_n\text{,}$ $R_n\text{,}$ and $M_n$ provides an estimate of the area under the curve $y = f(x)$ over the interval $[a,b]\text{;}$ momentarily, we will discuss the meaning of Riemann sums in the setting when $f$ is sometimes negative. We also recall that in the context of a nonnegative velocity function $y = v(t)\text{,}$ the corresponding Riemann sums are approximating the distance traveled on $[a,b]$ by the moving object with velocity function $v\text{.}$

There is a more general way to think of Riemann sums, and that is to not restrict the choice of where the function is evaluated to determine the respective rectangle heights. That is, rather than saying we'll always choose left endpoints, or always choose midpoints, we simply say that a point $x_{i+1}^*$ will be selected at random in the interval $[x_i, x_{i+1}]$ (so that $x_i \le x_{i+1}^* \le x_{i+1}$), which makes the Riemann sum given by

\begin{equation*} f(x_1^*) \cdot \Delta x + f(x_2^*) \cdot \Delta x + \cdots + f(x_{i+1}^*) \cdot \Delta x + \cdots + f(x_n^*) \cdot \Delta x = \sum_{i=1}^{n} f(x_i^*) \Delta x\text{.} \end{equation*}

At http://gvsu.edu/s/a9, the applet noted earlier and referenced in Figure 4.2.5, by unchecking the “relative” box at the top left, and instead checking “random,” we can easily explore the effect of using random point locations in subintervals on a given Riemann sum. In computational practice, we most often use $L_n\text{,}$ $R_n\text{,}$ or $M_n\text{,}$ while the random Riemann sum is useful in theoretical discussions. In the following activity, we investigate several different Riemann sums for a particular velocity function.

###### Activity4.2.3

Suppose that an object moving along a straight line path has its velocity in feet per second at time $t$ in seconds given by $v(t) = \frac{2}{9}(t-3)^2 + 2\text{.}$

1. Carefully sketch the region whose exact area will tell you the value of the distance the object traveled on the time interval $2 \le t \le 5\text{.}$

2. Estimate the distance traveled on $[2,5]$ by computing $L_4\text{,}$ $R_4\text{,}$ and $M_4\text{.}$

3. Does averaging $L_4$ and $R_4$ result in the same value as $M_4\text{?}$ If not, what do you think the average of $L_4$ and $R_4$ measures?

4. For this question, think about an arbitrary function $f\text{,}$ rather than the particular function $v$ given above. If $f$ is positive and increasing on $[a,b]\text{,}$ will $L_n$ over-estimate or under-estimate the exact area under $f$ on $[a,b]\text{?}$ Will $R_n$ over- or under-estimate the exact area under $f$ on $[a,b]\text{?}$ Explain.

### Subsection4.2.3When the function is sometimes negative

For a Riemann sum such as

\begin{equation*} L_n = \sum_{i=0}^{n-1} f(x_i) \Delta x\text{,} \end{equation*}

we can of course compute the sum even when $f$ takes on negative values. We know that when $f$ is positive on $[a,b]\text{,}$ the corresponding left Riemann sum $L_n$ estimates the area bounded by $f$ and the horizontal axis over the interval.

For a function such as the one pictured in Figure 4.2.7, where in the first figure a left Riemann sum is being taken with 12 subintervals over $[a,d]\text{,}$ we observe that the function is negative on the interval $b \le x \le c\text{,}$ and so for the four left endpoints that fall in $[b,c]\text{,}$ the terms $f(x_i) \Delta x$ have negative function values. This means that those four terms in the Riemann sum produce an estimate of the opposite of the area bounded by $y = f(x)$ and the $x$-axis on $[b,c]\text{.}$

In Figure 4.2.7, we also see evidence that by increasing the number of rectangles used in a Riemann sum, it appears that the approximation of the area (or the opposite of the area) bounded by a curve appears to improve. For instance, in the middle graph, we use 24 left rectangles, and from the shaded areas, it appears that we have decreased the error from the approximation that uses 12. When we proceed to Section 4.3, we will discuss the natural idea of letting the number of rectangles in the sum increase without bound.

For now, it is most important for us to observe that, in general, any Riemann sum of a continuous function $f$ on an interval $[a,b]$ approximates the difference between the area that lies above the horizontal axis on $[a,b]$ and under $f$ and the area that lies below the horizontal axis on $[a,b]$ and above $f\text{.}$ In the notation of Figure 4.2.7, we may say that

\begin{equation*} L_{24} \approx A_1 - A_2 + A_3\text{,} \end{equation*}

where $L_{24}$ is the left Riemann sum using 24 subintervals shown in the middle graph, and $A_1$ and $A_3$ are the areas of the regions where $f$ is positive on the interval of interest, while $A_2$ is the area of the region where $f$ is negative. We will also call the quantity $A_1 - A_2 + A_3$ the net signed area bounded by $f$ over the interval $[a,d]\text{,}$ where by the phrase “signed area” we indicate that we are attaching a minus sign to the areas of regions that fall below the horizontal axis.

Finally, we recall from the introduction to this present section that in the context where the function $f$ represents the velocity of a moving object, the total sum of the areas bounded by the curve tells us the total distance traveled over the relevant time interval, while the total net signed area bounded by the curve computes the object's change in position on the interval.

###### Activity4.2.4

Suppose that an object moving along a straight line path has its velocity $v$ (in feet per second) at time $t$ (in seconds) given by

\begin{equation*} v(t) = \frac{1}{2}t^2 - 3t + \frac{7}{2}\text{.} \end{equation*}
1. Compute $M_5\text{,}$ the middle Riemann sum, for $v$ on the time interval $[1,5]\text{.}$ Be sure to clearly identify the value of $\Delta t$ as well as the locations of $t_0\text{,}$ $t_1\text{,}$ $\cdots\text{,}$ $t_5\text{.}$ In addition, provide a careful sketch of the function and the corresponding rectangles that are being used in the sum.

2. Building on your work in (a), estimate the total change in position of the object on the interval $[1,5]\text{.}$

3. Building on your work in (a) and (b), estimate the total distance traveled by the object on $[1,5]\text{.}$

4. Use appropriate computing technology 2 For instance, consider the applet at http://gvsu.edu/s/a9 and change the function and adjust the locations of the blue points that represent the interval endpoints $a$ and $b\text{.}$to compute $M_{10}$ and $M_{20}\text{.}$ What exact value do you think the middle sum eventually approaches as $n$ increases without bound? What does that number represent in the physical context of the overall problem?

### Subsection4.2.4Summary

• A Riemann sum is simply a sum of products of the form $f(x_i^*) \Delta x$ that estimates the area between a positive function and the horizontal axis over a given interval. If the function is sometimes negative on the interval, the Riemann sum estimates the difference between the areas that lie above the horizontal axis and those that lie below the axis.

• The three most common types of Riemann sums are left, right, and middle sums, plus we can also work with a more general, random Riemann sum. The only difference among these sums is the location of the point at which the function is evaluated to determine the height of the rectangle whose area is being computed in the sum. For a left Riemann sum, we evaluate the function at the left endpoint of each subinterval, while for right and middle sums, we use right endpoints and midpoints, respectively.

• The left, right, and middle Riemann sums are denoted $L_n\text{,}$ $R_n\text{,}$ and $M_n\text{,}$ with formulas

\begin{align*} L_n = f(x_0) \Delta x + f(x_1) \Delta x + \cdots + f(x_{n-1}) \Delta x \amp= \sum_{i = 0}^{n-1} f(x_i) \Delta x,\\ R_n = f(x_1) \Delta x + f(x_2) \Delta x + \cdots + f(x_{n}) \Delta x \amp= \sum_{i = 1}^{n} f(x_i) \Delta x,\\ M_n = f(\overline{x}_1) \Delta x + f(\overline{x}_2) \Delta x + \cdots + f(\overline{x}_{n}) \Delta x \amp= \sum_{i = 1}^{n} f(\overline{x}_i) \Delta x\text{,} \end{align*}

where $x_0 = a\text{,}$ $x_i = a + i\Delta x\text{,}$ and $x_n = b\text{,}$ using $\Delta x = \frac{b-a}{n}\text{.}$ For the midpoint sum, $\overline{x}_{i} = (x_{i-1} + x_i)/2\text{.}$

### SubsectionExercises

###### 4

Consider the function $f(x) = 3x + 4\text{.}$

1. Compute $M_4$ for $y=f(x)$ on the interval $[2,5]\text{.}$ Be sure to clearly identify the value of $\Delta x\text{,}$ as well as the locations of $x_0, x_1, \ldots, x_4\text{.}$ Include a careful sketch of the function and the corresponding rectangles being used in the sum.

2. Use a familiar geometric formula to determine the exact value of the area of the region bounded by $y = f(x)$ and the $x$-axis on $[2,5]\text{.}$

3. Explain why the values you computed in (a) and (b) turn out to be the same. Will this be true if we use a number different than $n = 4$ and compute $M_n\text{?}$ Will $L_4$ or $R_4$ have the same value as the exact area of the region found in (b)?

4. Describe the collection of functions $g$ for which it will always be the case that $M_n\text{,}$ regardless of the value of $n\text{,}$ gives the exact net signed area bounded between the function $g$ and the $x$-axis on the interval $[a,b]\text{.}$

###### 5

Let $S$ be the sum given by

\begin{equation*} S = ((1.4)^2 + 1) \cdot 0.4 + ((1.8)^2 + 1) \cdot 0.4 + ((2.2)^2 + 1) \cdot 0.4 + ((2.6)^2 + 1) \cdot 0.4 +((3.0)^2 + 1) \cdot 0.4\text{.} \end{equation*}
1. Assume that $S$ is a right Riemann sum. For what function $f$ and what interval $[a,b]$ is $S$ this function's Riemann sum? Why?

2. How does your answer to (a) change if $S$ is a left Riemann sum? a middle Riemann sum?

3. Suppose that $S$ really is a right Riemann sum. What is geometric quantity does $S$ approximate?

4. Use sigma notation to write a new sum $R$ that is the right Riemann sum for the same function, but that uses twice as many subintervals as $S\text{.}$

###### 6

A car traveling along a straight road is braking and its velocity is measured at several different points in time, as given in the following table.

1. Plot the given data on a set of axes with time on the horizontal axis and the velocity on the vertical axis.

2. Estimate the total distance traveled during the car the time brakes using a middle Riemann sum with 3 subintervals.

3. Estimate the total distance traveled on $[0,1.8]$ by computing $L_6\text{,}$ $R_6\text{,}$ and $\frac{1}{2}(L_6 + R_6)\text{.}$

4. Assuming that $v(t)$ is always decreasing on $[0,1.8]\text{,}$ what is the maximum possible distance the car traveled before it stopped? Why?

###### 7

The rate at which pollution escapes a scrubbing process at a manufacturing plant increases over time as filters and other technologies become less effective. For this particular example, assume that the rate of pollution (in tons per week) is given by the function $r$ that is pictured in Figure 4.2.9.

1. Use the graph to estimate the value of $M_4$ on the interval $[0,4]\text{.}$

2. What is the meaning of $M_4$ in terms of the pollution discharged by the plant?

3. Suppose that $r(t) = 0.5 e^{0.5t}\text{.}$ Use this formula for $r$ to compute $L_5$ on $[0,4]\text{.}$

4. Determine an upper bound on the total amount of pollution that can escape the plant during the pictured four week time period that is accurate within an error of at most one ton of pollution.