Summation notation

Section 8.2 Summation notation

Subsection 8.2.1 Basic definitions

Summation notation is a compact form for writing sums. It is written with three pieces. \[ \sum_{\fbox{\(k=1\)}}^\fbox{\(5\)} \fbox{\(2k\)} \] The part below the greek letter sigma (\(\Sigma\)) tells us to start with \(k=1\text{.}\) We increment \(k\) by \(1\) until we get to the value above the sigma. In this case we let \(k\) take on the values \(1, 2, 3, 4, 5\text{.}\) We next substitute those values of \(k\) into the expression to the right of the sigma successively and take the sum all of the resulting terms. Here is a table of the resulting values:

\begin{align*} k \amp\amp 2k\\ 1 \amp\amp 2\\ 2 \amp\amp 4\\ 3 \amp\amp 6\\ 4 \amp\amp 8\\ 5 \amp\amp 10 \end{align*}

which evaluates to

\begin{equation*} 2+4+6+8+10=30 \end{equation*}

and so we write

\begin{equation*} \sum_{k=1}^5 2k=30\text{.} \end{equation*}

The variable \(k\) is called the index of summation. The number above the sigma is called the limit of summation. The example shows us how to write a sum of even numbers. We can write the sum of odd numbers, too.

\begin{equation*} \sum_{k=1}^8 (2k-1)=1+3+5+7+9+11+13+15=64 \text{.} \end{equation*}

It is not necessary to use \(k\) as a variable or to start the summation at \(k=1\text{.}\) It is easy to verify that

\begin{equation*} \sum_{k=1}^8(2k-1)=\sum_{l=0}^7(2l+1)=64 \text{.} \end{equation*}

The number above the sigma can also be written as a variable:

\begin{gather*} \sum_{k=1}^n a_k=a_1+a_2+\cdots+a_{n-1}+a_n\\ \sum_{k=0}^n a_kb_k=a_0b_0+a_1b_1+a_2b_2+\cdots+a_{n-1}b_{n-1}+a_nb_n\\ \sum_{k=1}^n (2k-1) = 1+3+5+7+\cdots+(2n-1) \end{gather*}

Next we take the last expression and vary \(n\text{,}\) the limit of summation. Here is a table that includes some small values of \(n\text{:}\)

Table 8.2.1.

\(\sum_{k=1}^1(2k-1)=1\)	\(n=1\)
\(\sum_{k=1}^2(2k-1)=4\)	\(n=2\)
\(\sum_{k=1}^3(2k-1)=9\)	\(n=3\)
\(\sum_{k=1}^4(2k-1)=16\)	\(n=4\)
\(\vdots\)
\(\sum_{k=1}^n(2k-1)=1+3+5+\cdots+2n-1=?\)

Checkpoint 8.2.2.

What simple expression involving \(n\) can replace the question mark?

The summands can be symbolic, too. If \(f(x)\) is any function, we may write

\begin{gather*} \sum_{i=1}^r f(i)=f(1)+f(2)+\cdots+f(r-1)+f(r) \end{gather*}

Subsection 8.2.2 Alternating sums

A sum is alternating if the signs of consecutive terms are alternate between positive and negative. An example: \(1-2+3-4+5\text{.}\) These sums appear frequently enough to have this special name. They are easy to write using summation notation because of a simple fact:

\begin{equation*} (-1)^k= \begin{cases} 1 \amp \text{if \(k\) is even}\\ -1 \amp \text{if \(k\) is odd} \end{cases} \end{equation*}

We can then write

\begin{equation*} \sum_{k=1}^6 (-1)^k k^2=-1+4-9+16-25+36=21 \text{.} \end{equation*}

We can interchange the order of the signs by adding (or subtracting) one to the exponent of \(-1\text{:}\)

\begin{equation*} \sum_{k=1}^6 (-1)^{k+1} k^2=1-4+9-16+25-36=-21\text{.} \end{equation*}

Subsection 8.2.3 Manipulation with summation notation identities

Summation notation is more that mere shorthand. There are rules of manipulation that are quite useful.

For any constant \(c\text{,}\)

\begin{equation*} \sum_{k=1}^n c=\underbrace{c+c+\cdots+c+c}_{n \textrm{ summands}}=nc \end{equation*}

and

\begin{equation*} \sum_{k=1}^n ca_k =ca_1+\cdots+ca_n =c(a_1+\cdots+a_n) =c\sum_{k=1}^n a_k \end{equation*}

The process of the last equation is sometimes referred to as pulling a constant across the summation sign.

In addition,

\begin{equation*} \sum_{k=1}^n(a_k+b_k)= \sum_{k=1}^na_k+\sum_{k=1}^nb_k,\\ \sum_{k=1}^n(a_k-b_k)= \sum_{k=1}^na_k-\sum_{k=1}^nb_k \end{equation*}

and

\begin{equation*} \sum_{k=0}^na_k= \sum_{k=t}^{n+t}a_{k-t} \end{equation*}

The last identity is sometimes called shifting the index of summation.

Example 8.2.3.

Suppose we want to evaluate the sum of the first \(n\) positive integers. In other words, we want to find

\begin{equation*} s_n=1+2+\cdots+n=\sum_{k=1}^nk. \end{equation*}

If we are convinced that \(\sum_{k=1}^n(2k-1)=1+3+5+\cdots+2n-1=n^2\text{,}\) then

\begin{align*} n^2\amp =\sum_{k=1}^n(2k-1)\\ \amp =\sum_{k=1}^n(2k)-\sum_{k=1}^n1\\ \amp =2\sum_{k=1}^nk-n\\ \amp =2s_n-n \end{align*}

which yields

\begin{equation*} s_n=\frac12n(n+1). \end{equation*}

Checkpoint 8.2.4.

Evaluate the following sums

\(\displaystyle \sum_{j=1}^5 2j\)
\(\displaystyle \sum_{k=1}^5 k(k-1)\)
\(\displaystyle \sum_{i=0}^4 2^i\)
\(\displaystyle \sum_{\ell=1}^3\ell^2+\frac1\ell\)
\(\displaystyle \sum_{j=1}^5 ij\)

Solution

\(\displaystyle 2+4+6+8+10=30\)
\(\displaystyle 2+6+12+20+30=70\)
\(\displaystyle 1+2+4+8+16=31\)
\(\displaystyle 1+1+4+\frac12+9+\frac13=15\frac56\)
\(\displaystyle i+2i+3i+4i+5i=15i\)

Checkpoint 8.2.5.

Evaluate \(\sum_{k=0}^n 2^k\) for \(n=1,2,3,4\text{.}\) Find a simple expression that works for any \(n\text{.}\) Justify your answer.

Solution

\begin{equation*} \begin{array}{cc} n \amp \sum_{k=0}^n 2^k\\ \hline 1 \amp 3\\ 2 \amp 7\\ 3\amp 15\\ 4\amp 31 \end{array} \end{equation*}

We note that each answer is one less than a power of \(2\text{.}\) This leads to the equation \[ \sum_{k=0}^n 2^k=2^{n+1}-1. \] To justify the equation we note that \[ 2(\sum_{k=0}^n 2^k)=\sum_{k=0}^n 2^{k+1}=\sum_{k=1}^{n+1} 2^k =\bigl(\sum_{k=0}^{n+1} 2^k\bigr) -1 \] so we get the sum for a given value on \(n\) by doubling the value for \(n-1\) and adding \(1\text{.}\)

Subsection 8.2.4 Double summations

Sometimes it is useful to have a formula with two summations, each with its own index of summation. It is interpreted in the following way: \[ \sum_{i=1}^m \sum_{j=1}^n f(i,j)= \sum_{i=1}^m \bigl(\sum_{j=1}^n f(i,j)\bigr) \] and so, for example, \[ \sum_{i=1}^3 \sum_{j=1}^2 i^j=\sum_{i=1}^3 (i^1+i^2)=(1+1)+(2+4)+(3+9)=20 \]

Double summations are often used with matrices. Suppose we have a matrix

\begin{equation*} A= \begin{bmatrix} a_{1,1}\amp a_{1,2}\amp \cdots \amp a_{1,n}\\ a_{2,1}\amp a_{2,2}\amp \cdots \amp a_{2,n}\\ \vdots\amp \vdots\amp \vdots\amp \vdots\\ a_{m,1}\amp a_{m,2}\amp \cdots \amp a_{m,n} \end{bmatrix} \end{equation*}

Then \[ \sum_{j=1}^n a_{i,j} = a_{i,1}+a_{i,2}+\cdots+a_{i,n} \] which is the sum of the entries in the \(i\)-th row of the matrix. Similarly \[ \sum_{i=1}^m a_{i,j} = a_{1,j}+a_{2,j}+\cdots+a_{m,j} \] which is the sum of the entries in the \(j\)-th column of the matrix. We can now see that

\begin{equation*} \sum_{i=1}^m\sum_{j=1}^n a_{i,j} =\sum_{j=1}^n\sum_{i=1}^m a_{i,j} \end{equation*}

since each sides of the equation is equal to the sum of all of the entries in the matrix. This is called interchanging the order of summation.

Another way of looking at these double summations \(\sum_{i=1}^m \sum_{j=1}^n f(i,j)\) is that it is the sum of \(f(i,j)\) as \(i\) takes on values from \(1\) to \(m\) and \(j\) takes on values from \(1\) to \(n\text{.}\) Clearly there are \(mn\) summands altogether.