Week 2: Exercises#

The exercises are intended to be done by hand unless otherwise stated (such as when you are asked to plot a graph or run a script).

Exercises – Long Day#

1: Jacobian Matrices for Various Functions#

Below we define functions of the form \(\pmb{f}: \operatorname{dom}(\pmb{f}) \to \mathbb{R}^k\), where \(\operatorname{dom}(\pmb{f}) \subseteq \mathbb{R}^n\), and where \(n\) and \(k\) can be read off from the functional expression. In this exercise we will not concern ourselves with determining the precise domain \(\operatorname{dom}(\pmb{f})\), but simply mention that if, for example, \(\ln(x_3)\) appears in the functional expression, it is of course a requirement that \(x_3 > 0\).

Question a#

  1. Let \({f}(x_1, x_2, x_3) = x_1^2x_2 + 2x_3\). Compute the Jacobian matrix \(J_{f}(\pmb{x})\) and evaluate it at the point \(\pmb{x} = (1, -1, 3)\). Confirm that the Jacobian matrix of a scalar function of multiple variables has only one row.

  2. Let \(\pmb{f}(x) = (3x, x^2, \sin(2x))\). Compute the Jacobian matrix \(J_{\pmb{f}}(x)\) and evaluate it at the point \(x = 2\). Confirm that the Jacobian matrix of a vector function of a single variable has only one column.

  3. Let \(\pmb{f}(x_1, x_2) = (x_1^2, -3x_2, 12x_1)\). Compute the Jacobian matrix \(J_{\pmb{f}}(\pmb{x})\) and evaluate it at the point \(\pmb{x} = (2, 0)\).

  4. Let \(\pmb{f}(x_1, x_2, x_3) = (x_2 \sin(x_3), 3x_1x_2 \ln(x_3))\). Compute the Jacobian matrix \(J_{\pmb{f}}(\pmb{x})\) and evaluate it at the point \(\pmb{x} = (-1, 3, 2)\).

  5. Let \(\pmb{f}(x_1, x_2, x_3) = (x_1 e^{x_2}, 3x_2 \sin(x_2), -x_1^2 \ln(x_2 + x_3))\). Compute the Jacobian matrix \(J_{\pmb{f}}(\pmb{x})\) and evaluate it at the point \(\pmb{x} = (1, 0, 1)\).
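Although the exercises are meant to be done by hand, a computed Jacobian matrix is easy to double-check afterwards. A minimal SymPy sketch for the function in part 1 (assuming SymPy is installed):

```python
from sympy import symbols, Matrix

x1, x2, x3 = symbols('x1 x2 x3')
f = Matrix([x1**2 * x2 + 2*x3])   # a scalar function, written as a 1-vector

# the Jacobian with respect to (x1, x2, x3): a single row, as expected
J = f.jacobian(Matrix([x1, x2, x3]))
J_val = J.subs({x1: 1, x2: -1, x3: 3})
print(J_val)
```

The same pattern (a `Matrix` of component functions followed by `.jacobian`) works for parts 2 to 5 as well.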

Question b#

All the functions from the previous question are differentiable. How can this be argued? For which of the functions can we compute the Hessian matrix? Compute the Hessian matrix for the functions where it is defined.

Question c#

Let \(\pmb{v} = (1,1,1)\). Normalize the vector \(\pmb{v}\) and denote the result by \(\pmb{e}\). Check that \(||\pmb{e}||=1\). Calculate the directional derivative of the scalar function \({f}(x_1, x_2, x_3) = x_1^2x_2 + 2x_3\) at the point \(\pmb{x} = (1, -1, 3)\) in the direction given by \(\pmb{v}\). Then calculate \(J_f(\pmb{x}) \pmb{e}\). Compare it with the directional derivative. Are they equal? If so, is that a coincidence?
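If you want to verify your hand computation, both the normalization and the product \(J_f(\pmb{x})\pmb{e}\) can be checked with SymPy. A sketch:

```python
from sympy import symbols, Matrix, sqrt, simplify

x1, x2, x3 = symbols('x1 x2 x3')
f = Matrix([x1**2 * x2 + 2*x3])

v = Matrix([1, 1, 1])
e = v / v.norm()                       # the normalized direction vector

J = f.jacobian(Matrix([x1, x2, x3]))
J_at = J.subs({x1: 1, x2: -1, x3: 3})  # Jacobian at the point (1, -1, 3)
dir_deriv = (J_at * e)[0]              # the product J_f(x) e
print(dir_deriv)                       # compare with your hand computation
```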

2: The Jacobian Matrix of a Neural Network#

We consider a neural network \(\Phi: \mathbb{R}^2 \to \mathbb{R}^3\) with one hidden layer. The network sends an input \(\pmb{x}\) to an output \(\pmb{y}\) via the following steps:

  1. Affine transformation (layer 1): \(\pmb{z} = W_1 \pmb{x} + \pmb{b}_1\). We write \(T_{W_1,\pmb{b}_1}(\pmb{x}) = W_1 \pmb{x} + \pmb{b}_1\).

  2. Activation (ReLU): \(\pmb{h} = \operatorname{ReLU}(\pmb{z})\)

  3. Linear transformation (output without activation): \(\pmb{y} = W_2 \pmb{h}\). We write \(L_{W_2}(\pmb{h}) = W_2 \pmb{h}\).

So, the last activation function is the identity.

The parameters are given by:

\[\begin{split} W_1 = \begin{bmatrix} 1 & -1 \\ 1 & 1 \end{bmatrix}, \quad \pmb{b}_1 = \begin{bmatrix} -1 \\ 0 \end{bmatrix}, \quad W_2 = \begin{bmatrix} 2 & 1 \\ 0 & -2 \\ -1 & 3 \end{bmatrix} \end{split}\]

We wish to determine the Jacobian matrix \(J_{\Phi}(\pmb{x})\).

Question a#

Let us first consider the scalar ReLU function \(\sigma = \text{ReLU}\) from \(\mathbb{R}\) to \(\mathbb{R}\). Find an expression for \(\sigma'(z)\) given by a piecewise-defined function. The expression is not defined for \(z=0\). Why not? Then find an expression for \(\pmb{J}_{\pmb{\operatorname{ReLU}}}\) for the ReLU vector function from \(\mathbb{R}^2\) to \(\mathbb{R}^2\).

We write \(\Lambda = \pmb{J}_{\pmb{\operatorname{ReLU}}}\) in the rest of this exercise.
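As a quick sanity check of your piecewise expression, here is a small numerical sketch of \(\sigma'\) and of \(\Lambda\) (NumPy assumed; the value returned at \(z=0\) is purely a convention, since the derivative does not exist there):

```python
import numpy as np

def relu_prime(z):
    # derivative of the scalar ReLU; it does not exist at z = 0,
    # and returning 0.0 there is only a convention
    return 1.0 if z > 0 else 0.0

def Lambda(z):
    # Jacobian of the ReLU vector function: a diagonal matrix
    # with relu_prime(z_i) on the diagonal
    return np.diag([relu_prime(zi) for zi in z])

print(Lambda(np.array([-3.0, 2.0])))
```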

Question b#

Find the Jacobian matrices for the two functions from, respectively, steps 1 and 3.

Question c#

Use the chain rule to establish a general expression for the Jacobian matrix \(J_{\Phi}(\pmb{x})\).

Question d#

We now consider the specific input \(\pmb{x}_0 = \begin{bmatrix} 0 \\ 2 \end{bmatrix}\). Find the Jacobian matrix \(J_{\Phi}(\pmb{x}_0)\).

Question e#

We have now found the Jacobian matrix \(J_{\Phi}(\pmb{x}_0)\). If we change the input \(x_1\) by a tiny amount \(\epsilon\) (that is, \(\Delta \pmb{x} = [\epsilon, 0]^T\)), by about how much will the output vector \(\pmb{y}\) then change? Use your Jacobian matrix to answer this question.
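Your answer can be sanity-checked numerically: evaluate the network at \(\pmb{x}_0\) and at \(\pmb{x}_0 + \Delta\pmb{x}\), and compare the difference quotient with \(J_{\Phi}(\pmb{x}_0)\Delta\pmb{x}\). A NumPy sketch using the parameters of the exercise:

```python
import numpy as np

# the parameters from the exercise
W1 = np.array([[1.0, -1.0], [1.0, 1.0]])
b1 = np.array([-1.0, 0.0])
W2 = np.array([[2.0, 1.0], [0.0, -2.0], [-1.0, 3.0]])

def phi(x):
    z = W1 @ x + b1
    h = np.maximum(z, 0.0)   # ReLU
    return W2 @ h

x0 = np.array([0.0, 2.0])
eps = 1e-6
dy = phi(x0 + np.array([eps, 0.0])) - phi(x0)
print(dy / eps)   # should match J_Phi(x0) applied to [1, 0]^T
```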

Note

The Jacobian matrix describes the sensitivity of the network: it tells us precisely how much, and in which direction, the output changes when we make small adjustments to the input. This insight is important for neural networks, since it tells us which input parameters have the largest influence on the result.

Question f (Optional)#

Verify the calculation symbolically with Sympy. You can use this example for inspiration:

```python
from sympy import symbols, Matrix, Max

x1, x2 = symbols('x1 x2')
x = Matrix([x1, x2])

# note: these matrices differ from those in the exercise --
# the example is only meant as a template
W1 = Matrix([[1, -1], [2, 1]])
b1 = Matrix([-1, 0])
W2 = Matrix([[2, 1], [0, -3]])

z = W1 * x + b1
h = Matrix([Max(0, z[0]), Max(0, z[1])])  # ReLU(z)

y = W2 * h

J = y.jacobian(x)
J_val = J.subs({x1: 0, x2: 2})

print("Evaluated Jacobian matrix:")
display(J_val)  # display() requires a Jupyter/IPython environment
```

Evaluated Jacobian matrix:

\[\begin{split}\displaystyle \left[\begin{matrix}2 & 1\\-6 & -3\end{matrix}\right]\end{split}\]

3: Description of Sets in the Plane#

In each of the four cases below, draw a sketch of the given set \(\,A\,\), its interior \(\,A^{\circ}\,\), its boundary \(\,\partial A\,\) and its closure \(\,\bar{A}\,\). Furthermore, determine whether \(\,A\,\) is open, closed or neither. Finally, specify whether \(\,A\,\) is bounded or unbounded.

  1. \(\{(x,y) \mid xy\neq 0\}\)

  2. \(\{(x,y) \mid 0<x<1 \wedge 1\leq y\leq 3\}\)

  3. \(\{(x,y) \mid y\geq x^2 \wedge y<2 \}\)

  4. \(\{(x,y) \mid x^2+y^2-2x+6y\leq 15 \}\)
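For case 4, a few well-chosen test points let you check the description you obtain after completing the square. A small sketch (the sample points are chosen for illustration):

```python
def in_A4(x, y):
    # test the defining inequality of set 4 directly
    return x**2 + y**2 - 2*x + 6*y <= 15

# after completing the square you should be able to predict these
# results from the centre and radius you found
print(in_A4(1, -3), in_A4(1, 2), in_A4(6, -3), in_A4(1, 2.0001))
```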

4: All Linear Maps from \(\mathbb{R}^n\) to \(\mathbb{R}\)#

Let \(L: \mathbb{R}^n \to \mathbb{R}\) be an (arbitrary) linear mapping. Let \(e = \pmb{e}_1, \pmb{e}_2, \dots, \pmb{e}_n\) be the standard basis for \(\mathbb{R}^n\), and let \(\beta\) be the standard basis for \(\mathbb{R}\). Recall the standard basis from Mathematics 1a. Since the dimension of \(\mathbb{R}\) (over \(\mathbb{R}\)) is one, the standard basis for \(\mathbb{R}\) is simply the number \(1\).

Show that there exists a column vector \(\pmb{c} \in \mathbb{R}^n\) such that

\[\begin{equation*} L(\pmb{x}) = \pmb{c}^T \pmb{x} = \langle \pmb{x}, \pmb{c} \rangle \end{equation*}\]

where \(\langle \cdot, \cdot \rangle\) denotes the usual inner product on \(\mathbb{R}^n\). (The column vector is uniquely determined, but proving this is not part of this question.)
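The heart of the argument is that a linear map is determined by its values on the basis vectors, which suggests \(c_i = L(\pmb{e}_i)\). A numerical sketch with a made-up linear map (the map and its coefficients are purely illustrative):

```python
import numpy as np

def L(x):
    # a hypothetical linear map R^3 -> R, chosen only for illustration
    return 3*x[0] - x[1] + 2*x[2]

n = 3
# build c by applying L to the standard basis vectors e_1, ..., e_n
c = np.array([L(np.eye(n)[i]) for i in range(n)])

x = np.array([1.0, 4.0, -2.0])
print(L(x), c @ x)   # the two numbers agree, as the exercise predicts
```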

5: Linear(?) Vector Functions#

We consider the following two functions:

  1. \(f: \mathbb{R}^{2 \times 2} \to \mathbb{R}^{2 \times 2}, f(X) = C X B\), where \(C = \operatorname{diag}(2,1) \in \mathbb{R}^{2 \times 2}\) and \(B = \begin{bmatrix} 1 & 1 \\ 0 & 1 \end{bmatrix}\).

  2. \(g: \mathbb{R}^n \to \mathbb{R}, g(\pmb{x}) = \pmb{x}^T A \pmb{x}\), where \(A\) is an \(n \times n\) matrix (and isn’t the zero matrix).

Determine for each function whether it is a linear map. If the map is linear, find the mapping matrix with respect to:

  1. the standard basis \(E=\begin{bmatrix} 1 & 0 \\ 0 & 0 \end{bmatrix}, \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}, \begin{bmatrix} 0 & 0 \\ 1 & 0 \end{bmatrix}, \begin{bmatrix} 0 & 0 \\ 0 & 1 \end{bmatrix}\) in \(\mathbb{R}^{2 \times 2}\). Recall this example from Math1a

  2. the standard basis \(e\) in \(\mathbb{R}^n\). Recall this result from Math1a
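Before proving or disproving linearity, you can probe it numerically: test whether \(f(\alpha X+\beta Y)=\alpha f(X)+\beta f(Y)\) on random inputs. A single counterexample disproves linearity, while agreement is only evidence, not a proof. A sketch (the choice of \(A\) is illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
C = np.diag([2.0, 1.0])
B = np.array([[1.0, 1.0], [0.0, 1.0]])
A = np.eye(2)   # any nonzero A will do for probing g

f = lambda X: C @ X @ B       # f(X) = C X B
g = lambda x: x @ A @ x       # g(x) = x^T A x

X, Y = rng.standard_normal((2, 2)), rng.standard_normal((2, 2))
x, y = rng.standard_normal(2), rng.standard_normal(2)
a, b = 2.0, -3.0

print(np.allclose(f(a*X + b*Y), a*f(X) + b*f(Y)))  # consistent with linearity
print(np.isclose(g(a*x + b*y), a*g(x) + b*g(y)))   # generically fails for g
```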

6: The Simple Chain Rule#

In this exercise we will be working with the simple chain rule given here.

We first consider a real function of two real variables given by the expression

\[\begin{equation*} g(x,y)=\ln(9-x^2-y^2). \end{equation*}\]

Question a#

Determine the largest possible domain of \(g\), and characterize it using concepts such as open, closed, bounded, and unbounded.

We now consider a parametrized curve \(\pmb{r}\) in the \((x,y)\) plane given by

\[\begin{equation*} \pmb{r}(u)=(u,u^3)\,,\,u\in \left[-1.2\,,\,1.2\right]. \end{equation*}\]

Question b#

Which curve are we talking about (you are familiar with its equation)?

We now consider the composite function

\[\begin{equation*} h(u) = g(\pmb{r}(u)). \end{equation*}\]

Question c#

What are the domain and co-domain of \(h = g \circ \pmb{r}\)?

Question d#

Determine \(h'(1)\) using two different approaches:

  1. Determine a functional expression for \(h(u)\) and differentiate it as usual.

  2. Use the chain rule from Section 3.7.
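Both approaches can be carried out in SymPy and compared; the following sketch mirrors the two steps above (assuming SymPy is installed):

```python
from sympy import symbols, ln, diff, simplify

u = symbols('u')
x, y = u, u**3                       # the curve r(u) = (u, u^3)

# approach 1: substitute first, then differentiate as usual
h = ln(9 - x**2 - y**2)
h1 = diff(h, u).subs(u, 1)

# approach 2: the chain rule h'(u) = grad g(r(u)) . r'(u)
xs, ys = symbols('x y')
g = ln(9 - xs**2 - ys**2)
chain = (diff(g, xs)*diff(x, u) + diff(g, ys)*diff(y, u)).subs({xs: x, ys: y})
h2 = chain.subs(u, 1)

print(simplify(h1 - h2))   # 0: the two approaches agree
```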

7: Partial Derivatives but not Differentiable#

We start with a simple function \(f\), which is differentiable everywhere. Let \(f:\mathbb{R}^2 \to \mathbb{R}\) be given by

\[\begin{equation*} f(x_1,x_2)=x_1^2-4x_1+x_2^2. \end{equation*}\]

Question a#

Let \(\pmb{x}_0 = (x_1,x_2) \in \mathbb{R}^2\) be an arbitrary point. Show that \(f\) is differentiable at \(\pmb{x}_0\), and calculate the gradient of \(f\) at \(\pmb{x}_0\).

Soft version: Use the result in this theorem

Hard version: Solve the question directly using the definition of differentiability in Section 3.6. We follow this latter approach in the hints and answer below.

Question b#

To conclude differentiability from the partial derivatives (see this theorem), the partial derivatives are required to be continuous. Why is it not enough that the partial derivatives merely exist? We will investigate this with an example. But first we generalize a well-known theorem (from high school) about functions of one variable: if a function is differentiable at a point, it is also continuous at that point.

Show that if a function of two variables is differentiable at a point \(\pmb{x}_0\), then it is also continuous at that point.

And now to the example that has named the exercise. We consider the function

\[\begin{equation*} g(x_1,x_2) = \begin{cases} \frac{x_1^2x_2}{x_1^4+x_2^2}, & \text{for } (x_1,x_2) \neq (0,0) \\ 0, & \text{for } (x_1,x_2)=(0,0) \end{cases} \end{equation*}\]

Question c#

Show that the partial derivatives of \(g\) exist at \((0,0)\), but that \(g\) is not differentiable at this point.
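A numerical experiment can guide your argument: compare the behaviour of \(g\) along the coordinate axes with its behaviour along the parabola \(x_2 = x_1^2\). A sketch:

```python
def g(x1, x2):
    # the function from the exercise, with the special value at the origin
    return x1**2 * x2 / (x1**4 + x2**2) if (x1, x2) != (0, 0) else 0.0

# approach (0,0) along the axes and along the parabola x2 = x1**2
for t in [0.1, 0.01, 0.001]:
    print(g(t, 0), g(0, t), g(t, t**2))
```

The values along the parabola do not tend to \(g(0,0) = 0\), which is the key observation for the non-differentiability argument.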

8: The Generalized Chain Rule#

In this exercise we will be using this theorem: Generalized chain rule

We are given the functions:

  1. \(\pmb{f} : \mathbb{R}^3 \to \mathbb{R}^2\) defined by \(\pmb{f}(x_1, x_2, x_3) = (f_1(x_1, x_2, x_3), f_2(x_1, x_2, x_3))\), where

    \[\begin{align*} f_1(x_1, x_2, x_3) &= x_1^2 + x_2^2 + x_3^2, \\ f_2(x_1, x_2, x_3) &= e^{x_1 + x_2} \, \cos(x_3). \end{align*}\]
  2. \(g : \mathbb{R}^2 \to \mathbb{R}\) defined by \(g(y_1, y_2) = y_1 \, \sin(y_2)\).

  3. The composition of these two functions: \(h = g \circ \pmb{f}\).

In this task we will calculate the Jacobian matrix of \(h\) (with respect to the variables \(x_1, x_2,\) and \(x_3\)) using the generalized chain rule. You are welcome to do the calculations in SymPy.

Question a#

Find a functional expression for \(h\) as well as the domain and co-domain. Calculate the gradient of \(h\).

Question b#

Calculate the Jacobian matrix of \(\pmb{f}\). Calculate the Jacobian matrix of \(g\). What is the connection between the gradient and the Jacobian matrix of \(g\)?

Question c#

Now apply the chain rule and the Jacobian matrices from the previous questions to find the Jacobian matrix of \(h\). Compare it with the answer in question a.
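If you do the calculations in SymPy, the comparison in this question can be automated; the sketch below forms \(J_g(\pmb{f}(\pmb{x}))\,J_{\pmb{f}}(\pmb{x})\) and subtracts the directly computed Jacobian of \(h\):

```python
from sympy import symbols, Matrix, exp, cos, sin, simplify

x1, x2, x3, y1, y2 = symbols('x1 x2 x3 y1 y2')
X = Matrix([x1, x2, x3])

f = Matrix([x1**2 + x2**2 + x3**2, exp(x1 + x2)*cos(x3)])
g = Matrix([y1*sin(y2)])

Jf = f.jacobian(X)                      # 2x3 Jacobian of f
Jg = g.jacobian(Matrix([y1, y2]))       # 1x2 Jacobian of g

# chain rule: J_h(x) = J_g(f(x)) * J_f(x)
Jh_chain = Jg.subs({y1: f[0], y2: f[1]}) * Jf

# direct computation of J_h for comparison
h = g.subs({y1: f[0], y2: f[1]})
Jh_direct = h.jacobian(X)

print(simplify(Jh_chain - Jh_direct))   # the zero matrix
```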

9: Level Curves and Directional Derivatives of Scalar Functions#

A function \(f:\mathbb{R}^2\rightarrow\mathbb{R}\) is given by the expression

\[\begin{equation*} f(x,y)=x^2+y^2. \end{equation*}\]

Another function \(g:\mathbb{R}^2\rightarrow\mathbb{R}\) is given by the expression

\[\begin{equation*} g(x,y)=x^2-4x+y^2. \end{equation*}\]

Question a#

Describe the level curves given by \(f(x,y)=c\) for the values \(c\in\{1,2,3,4,5\}\).

Question b#

Determine the gradient of \(f\) at the point \((1,1)\) and find the directional derivative of \(f\) at \((1,1)\) in the direction given by the unit direction vector \(\pmb{e}=(1,0)\).

Question c#

Describe the level curves given by \(g(x,y)=c\) for the values \(c \in\{-3,-2,-1,0,1\}\).

Question d#

Determine the gradient of \(g\) at the point \((1,2)\) and find the directional derivative of \(g\) at \((1,2)\) in the direction towards the origin, \((0,0)\).

10: Gradient Vector Fields and the Hessian Matrix#

Question a#

The gradient vector of \(f(x_1, x_2) = x_1^2 \sin(x_2)\) is \(\nabla f(\pmb{x}) = (2x_1 \sin(x_2), x_1^2 \cos(x_2))\). The gradient vector can therefore be considered as a map \(\nabla f : \operatorname{dom}(f) \to \mathbb{R}^2\). Write down this map as a function (where you specify \(\operatorname{dom}(f)\)), and plot it as a vector field.
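A possible plotting sketch with NumPy and Matplotlib (the grid, the backend, and the file name are arbitrary choices):

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")          # non-interactive backend; adjust as needed
import matplotlib.pyplot as plt

def grad_f(x1, x2):
    # the gradient field from the exercise
    return 2*x1*np.sin(x2), x1**2*np.cos(x2)

x1, x2 = np.meshgrid(np.linspace(-2, 2, 15), np.linspace(-3, 3, 15))
u, v = grad_f(x1, x2)

plt.quiver(x1, x2, u, v)
plt.xlabel("x1"); plt.ylabel("x2"); plt.title("Gradient field of f")
plt.savefig("gradient_field.png")
```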

Question b#

Now calculate the Jacobian matrix \(\pmb{J}_{\nabla f}(x_1,x_2)\) of \(\nabla f : \mathbb{R}^2 \to \mathbb{R}^2\) at the point \((x_1,x_2)\).

Question c#

Calculate the Hessian matrix \(\pmb{H}_{f}(x_1,x_2)\) of \(f : \mathbb{R}^2 \to \mathbb{R}\) at the point \((x_1,x_2)\) and compare it to the answer to the previous question.
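The comparison can also be verified symbolically: SymPy's `hessian` and the Jacobian of the gradient should produce the same matrix. A sketch:

```python
from sympy import symbols, sin, hessian, Matrix

x1, x2 = symbols('x1 x2')
f = x1**2 * sin(x2)

grad = Matrix([f.diff(x1), f.diff(x2)])     # the gradient as a column vector
J_grad = grad.jacobian(Matrix([x1, x2]))    # Jacobian of the gradient field
H = hessian(f, (x1, x2))                    # Hessian of f

print(J_grad == H)   # True: the two matrices coincide
```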


Theme Exercise – Short Day#

This day is dedicated to the Theme Exercise: Theme 1: The Gradient Method.