Optimization

Finding the “best” way to do a specific task in economics often involves what is called an optimization problem.

I — Univariate Optimization¶

Stationary Points¶

Generally, we say that $x^*$ is a stationary point of a differentiable function $f(x)$ when its slope evaluated at $x^*$ is zero, i.e., when

f'(x^*) = 0.

Necessary First-Order Condition (F.O.C)¶

More formally, suppose a function $f(x)$ is differentiable in some interval $I$ and $x^*$ is an interior point of $I$. Then for $x = x^*$ to be a maximum or minimum point of $f(x)$ in $I$, a necessary condition is that it is a stationary point of $f(x)$, i.e., that $x = x^*$ satisfies

f'(x^*) = 0.

Example 1

Let

h(x) = -x^2 + 8x - 15.

F.O.C:

-2x + 8 = 0.

So the stationary point is $x^* = 4$ and $y^* = 1$ .

Example 2

Let

z(x) = x^2 - 8x + 17.

F.O.C:

2x - 8 = 0.

So the stationary point is $x^* = 4$ and $y^* = 1$ .

We have the same stationary point, but clearly these are different functions. The former is $\cap$ -shaped and the latter is $\cup$ -shaped.
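As a quick numerical check of the two examples above, the following Python sketch approximates each derivative with a central difference and confirms that both slopes vanish at $x = 4$. The helper name `derivative` and its step size are illustrative assumptions, not part of the original notes.

```python
def derivative(f, x, step=1e-6):
    """Central-difference approximation of f'(x); the step size is an assumed choice."""
    return (f(x + step) - f(x - step)) / (2 * step)

def h(x):
    return -x**2 + 8*x - 15   # cap-shaped parabola from Example 1

def z(x):
    return x**2 - 8*x + 17    # cup-shaped parabola from Example 2

x_star = 4.0
slope_h = derivative(h, x_star)
slope_z = derivative(z, x_star)
print(slope_h, slope_z)        # both approximately zero
print(h(x_star), z(x_star))    # both functions take the value 1.0 at x = 4
```

The first-order condition alone cannot tell the two functions apart: both checks pass even though one point is a peak and the other a trough.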

Sufficient Second-Order Condition (S.O.C) for Maximum/Minimum¶

Following the two examples above, we can characterize a stationary point as a maximum or minimum by taking the second derivative.

If $f''(x^*) < 0$, the stationary point is a (local) maximum.
If $f''(x^*) > 0$, the stationary point is a (local) minimum.

Example 1

h''(x) = -2.

Since this is negative, the stationary point $(4,1)$ is a maximum.

Example 2

z''(x) = 2.

Since this is positive, the stationary point $(4,1)$ is a minimum.

Example 3

Let

p(x) = \frac{1}{3}x^3 - \frac{1}{2}x^2 - 2x + 20.

The F.O.C gives

p'(x) = x^2 - x - 2 = 0,

which implies $x^* = -1$ and $x^* = 2$ .

The S.O.C gives

p''(x) = 2x - 1.

At $x^* = -1$ , $p''(-1) = -3 < 0$ → maximum
At $x^* = 2$ , $p''(2) = 3 > 0$ → minimum
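The two steps of Example 3 can be sketched in a few lines of Python: solve the F.O.C with the quadratic formula, then read off the sign of the second derivative at each root. The function names `p_prime`, `p_second`, and `classify` are hypothetical helpers introduced only for this illustration.

```python
import math

def p_prime(x):
    return x**2 - x - 2      # first derivative of p

def p_second(x):
    return 2*x - 1           # second derivative of p

def classify(second_derivative_value):
    """Apply the sufficient second-order condition at a stationary point."""
    if second_derivative_value < 0:
        return "maximum"
    if second_derivative_value > 0:
        return "minimum"
    return "inconclusive"

# Roots of x^2 - x - 2 = 0 from the quadratic formula.
a, b, c = 1.0, -1.0, -2.0
disc = math.sqrt(b*b - 4*a*c)
stationary_points = sorted([(-b - disc) / (2*a), (-b + disc) / (2*a)])

results = {x: classify(p_second(x)) for x in stationary_points}
print(results)
```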

S.O.C Is Sufficient but Not Necessary¶

Consider

y = x^4.

The F.O.C, $4x^3 = 0$, gives $x^* = 0$.

The second derivative, $y'' = 12x^2$, also equals 0 at $x^* = 0$, so it is neither positive nor negative there. Yet $x^* = 0$ is clearly a minimum, since $x^4 > 0$ for every $x \ne 0$.

Global and Local Minimum/Maximum¶

We must distinguish between global and local extrema.

Global Maximum¶

If $f(x)$ is everywhere differentiable and has stationary point $x^*$ , then $x^*$ is a global maximum if

f'(x) \ge 0 \text{ for all } x \le x^*, \quad \text{and} \quad f'(x) \le 0 \text{ for all } x \ge x^*.

That is, the function increases up to $x^*$ and decreases afterward.

Global Minimum¶

Similarly, $x^*$ is a global minimum if

f'(x) \le 0 \text{ for all } x \le x^*, \quad \text{and} \quad f'(x) \ge 0 \text{ for all } x \ge x^*.
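These sign conditions can be illustrated on the function $h(x) = -x^2 + 8x - 15$ from Example 1. The sketch below checks the sign of $h'(x)$ on a finite grid around $x^* = 4$; the grid itself is an assumed sample, so the check illustrates the condition without proving it for all $x$.

```python
def h_prime(x):
    return -2*x + 8   # derivative of h(x) = -x^2 + 8x - 15

x_star = 4.0
# Assumed sample of the domain: points from -6 to 14 in steps of 0.5.
grid = [x_star + k * 0.5 for k in range(-20, 21)]

rises_before = all(h_prime(x) >= 0 for x in grid if x <= x_star)
falls_after = all(h_prime(x) <= 0 for x in grid if x >= x_star)
is_global_max_on_grid = rises_before and falls_after
print(is_global_max_on_grid)
```

Swapping the two inequalities gives the corresponding check for a global minimum.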

Local Maximum / Minimum¶

We call $x^*$ a local maximum (or minimum) if $f(x^*)$ is the largest (or smallest) value of $f$ only within some neighborhood of $x^*$, not necessarily over the entire domain.

Concavity and Convexity¶

If $f(x)$ is strictly concave on $(m,n)$ and has a stationary point $x^*$ with $m < x^* < n$ , then $x^*$ is a local maximum.
If $f(x)$ is strictly concave everywhere, it has at most one stationary point, which is a global maximum.

Conversely,

If $f(x)$ is strictly convex on $(m,n)$ and has a stationary point $x^*$ with $m < x^* < n$, then $x^*$ is a local minimum.
If $f(x)$ is strictly convex everywhere, it has at most one stationary point, which is a global minimum.

Inflection Points¶

Consider the function

k(x) = 1 + (x - 4)^3.

The F.O.C gives a stationary point $(4,1)$ .

The S.O.C evaluated at this point is 0, so it is inconclusive.

We must then take higher-order derivatives.

If the first nonzero higher-order derivative evaluated at the stationary point is:

Odd-order → inflection point
Even-order → maximum or minimum (depending on sign)

If the first nonzero derivative at the stationary point $c$ is of even order ($n = 2, 4, 6, \ldots$):

Derivative Sign	Result	Visual Intuition
$f^{(n)}(c) > 0$	Local Minimum	The function “curves up” away from the point in all directions.
$f^{(n)}(c) < 0$	Local Maximum	The function “curves down” away from the point in all directions.

For this example,

k'''(4) \ne 0,

which is the third (odd) derivative. Hence, $(4,1)$ is an inflection point.
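The higher-order derivative test can be sketched as a small Python function. The caller supplies the derivative values at the stationary point, starting with the second derivative; the function name and this input convention are assumptions made for the illustration.

```python
def classify_stationary_point(higher_derivatives):
    """higher_derivatives[i] is the derivative of order i + 2 at the stationary point."""
    for order, value in enumerate(higher_derivatives, start=2):
        if value != 0:
            if order % 2 == 1:
                return "inflection point"          # first nonzero derivative has odd order
            return "local minimum" if value > 0 else "local maximum"
    return "inconclusive"                          # every supplied derivative is zero

# k(x) = 1 + (x - 4)^3 at x = 4: k'' = 6(x - 4) = 0 and k''' = 6.
print(classify_stationary_point([0, 6]))
# y = x^4 at x = 0: y'' = 12x^2 = 0, y''' = 24x = 0, y'''' = 24.
print(classify_stationary_point([0, 0, 24]))
```

The second call covers the earlier $y = x^4$ case, where the second derivative alone was inconclusive.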

Economic Applications¶

A Monopolist’s Optimal Pricing Scheme
Strategic Behavior of Duopolists
Rules versus Discretion in Monetary Policy
The Inflation Tax and Seigniorage
The Golden Rule

II — Multivariate Optimization¶

We now generalize the univariate techniques to multivariate optimization.

Multivariate First-Order Condition¶

If we have a function

y = f(x_1, x_2, \ldots, x_n)

that is differentiable with respect to each of its arguments and has a stationary point at $(x_1^*, x_2^*, \ldots, x_n^*)$ , then each of the partial derivatives at that point equals zero.

That is,

f_1(x_1^*, x_2^*, \ldots, x_n^*) = 0 \\ f_2(x_1^*, x_2^*, \ldots, x_n^*) = 0 \\ \vdots \\ f_n(x_1^*, x_2^*, \ldots, x_n^*) = 0

Example 1

Consider the bivariate function

g(x_1, x_2) = 6x_1 - x_1^2 + 16x_2 - 4x_2^2

The first-order conditions are

g_1(x_1, x_2) = 6 - 2x_1 = 0 \\ g_2(x_1, x_2) = 16 - 8x_2 = 0

The single stationary point is therefore

x_1^* = 3, \quad x_2^* = 2

and the value of the function at this point is

g(3,2) = 25.

We will show later using the second-order condition that this stationary point represents a maximum.

Let’s visualize the equation and its stationary point.

If we take a slice of the function $g(x_1, x_2)$ at $x_2 = 2$, the stationary point is achieved at $x_1 = 3$. Similarly, taking a slice at $x_1 = 3$ shows a stationary point at $x_2 = 2$.

Example 2

Consider the function

h(x_1, x_2) = x_1^2 + 4x_2^2 - 2x_1 - 16x_2 + x_1 x_2

The first-order conditions give

h_1(x_1, x_2) = 2x_1 - 2 + x_2 = 0 \\ h_2(x_1, x_2) = 8x_2 - 16 + x_1 = 0

Hence the single stationary point is

x_1^* = 0, \quad x_2^* = 2

and the value of the function at this point is

h(0,2) = -16.
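Because both first-order conditions are linear in $x_1$ and $x_2$, the stationary point can be found by solving a $2 \times 2$ linear system. The sketch below uses Cramer's rule; `solve_2x2` is a hypothetical helper written for this example, not a library function.

```python
def solve_2x2(a11, a12, a21, a22, b1, b2):
    """Solve a11*x1 + a12*x2 = b1 and a21*x1 + a22*x2 = b2 by Cramer's rule."""
    det = a11 * a22 - a12 * a21
    if det == 0:
        raise ValueError("the system has no unique solution")
    x1 = (b1 * a22 - a12 * b2) / det
    x2 = (a11 * b2 - a21 * b1) / det
    return x1, x2

def h(x1, x2):
    return x1**2 + 4*x2**2 - 2*x1 - 16*x2 + x1*x2

# F.O.C rearranged: 2*x1 + x2 = 2 and x1 + 8*x2 = 16.
x1_star, x2_star = solve_2x2(2, 1, 1, 8, 2, 16)
print(x1_star, x2_star, h(x1_star, x2_star))
```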

Below is a visualization of this function with its tangent plane at the stationary point.

Second-Order Condition in the Bivariate Case¶

For the univariate case, the second differential of a function can be considered as the differential of the first differential and denoted as

d(dy) = d^2 y.

For $y = f(x)$ , the second differential is

d^2 y = f''(x)(dx)^2,

whose sign is that of $f''(x)$ for any $dx \ne 0$, because $(dx)^2 > 0$.

Second Differential in the Bivariate Case¶

For a bivariate function $y = f(x_1, x_2)$ , the total differential is

dy = f_1(x_1, x_2)\,dx_1 + f_2(x_1, x_2)\,dx_2.

Taking the total differential of this expression, with $dx_1$ and $dx_2$ held constant and assuming $f_{12} = f_{21}$, yields the second total differential:

d^2 y = f_{11}(dx_1)^2 + f_{22}(dx_2)^2 + 2f_{12}dx_1 dx_2.

Sufficient Conditions for Local Maxima and Minima¶

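If $d^2 y < 0$ for all $(dx_1, dx_2) \ne (0, 0)$, the stationary point is a local maximum.
If $d^2 y > 0$ for all $(dx_1, dx_2) \ne (0, 0)$, the stationary point is a local minimum.

Setting $dx_2 = 0$ or $dx_1 = 0$ in turn shows that a necessary condition for $d^2 y > 0$ in every direction (a minimum) is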

f_{11} > 0 \quad \text{and} \quad f_{22} > 0,

and for a maximum,

f_{11} < 0 \quad \text{and} \quad f_{22} < 0.

However, the cross-partial derivative $f_{12}$ must also be considered.

Completing the Square¶

By completing the square, the second differential can be rewritten as a weighted sum of two squares, which has the same sign for every nonzero $(dx_1, dx_2)$ under the condition:

f_{11} f_{22} > (f_{12})^2.
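The algebra behind this condition can be written out explicitly. The following is a sketch of the standard derivation, assuming $f_{11} \ne 0$ and $f_{12} = f_{21}$:

```latex
\begin{aligned}
d^2 y &= f_{11}(dx_1)^2 + 2 f_{12}\,dx_1\,dx_2 + f_{22}(dx_2)^2 \\
      &= f_{11}\left[(dx_1)^2 + 2\frac{f_{12}}{f_{11}}\,dx_1\,dx_2
         + \frac{(f_{12})^2}{(f_{11})^2}(dx_2)^2\right]
         + \left[f_{22} - \frac{(f_{12})^2}{f_{11}}\right](dx_2)^2 \\
      &= f_{11}\left(dx_1 + \frac{f_{12}}{f_{11}}\,dx_2\right)^2
         + \frac{f_{11} f_{22} - (f_{12})^2}{f_{11}}\,(dx_2)^2 .
\end{aligned}
```

Both squared terms carry the sign of $f_{11}$ exactly when $f_{11} f_{22} - (f_{12})^2 > 0$, so under that condition $d^2 y$ is negative in every direction if $f_{11} < 0$ and positive in every direction if $f_{11} > 0$.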

Second-Order Condition for a Maximum¶

If $y = f(x_1, x_2)$ has a stationary point $(x_1^*, x_2^*)$ and

f_{11}(x_1^*, x_2^*) < 0 \quad \text{and} \quad f_{11} f_{22} > (f_{12})^2,

then the function reaches a maximum at that point.

Second-Order Condition for a Minimum¶

If instead the stationary point $(x_1^*, x_2^*)$ satisfies

f_{11}(x_1^*, x_2^*) > 0 \quad \text{and} \quad f_{11} f_{22} > (f_{12})^2,

then the function reaches a minimum at that point.

Let’s continue with the example above.

The second partial derivatives of

h(x_1,x_2) = x_1^2 + 4x_2^2 - 2x_1 - 16x_2 + x_1x_2

are

h_{11}(x_1,x_2) = 2 \quad \text{and} \quad h_{22}(x_1,x_2) = 8

Both are positive. The cross-partial derivative is

h_{12}(x_1,x_2) = 1.

Since

h_{11} h_{22} > (h_{12})^2,

that is,

16 > 1,

the stationary point $(0,2)$ is a minimum.

As another example, consider

g(x_1,x_2) = 6x_1 - x_1^2 + 16x_2 - 4x_2^2.

The second partial derivatives are

g_{11}(x_1,x_2) = -2 \quad \text{and} \quad g_{22}(x_1,x_2) = -8,

and the cross-partial derivative is

g_{12}(x_1,x_2) = 0.

Since the second partial derivatives are both negative and

g_{11} g_{22} > (g_{12})^2,

that is,

16 > 0,

we have the conditions for a maximum.
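The two checks above follow the same recipe, which can be sketched as a small Python function. The name `classify_bivariate` is a hypothetical helper; it encodes only the two sufficient conditions stated in this section and reports anything else as undecided.

```python
def classify_bivariate(f11, f22, f12):
    """Apply the bivariate second-order conditions at a stationary point."""
    discriminant = f11 * f22 - f12**2
    if discriminant > 0 and f11 < 0:
        return "maximum"
    if discriminant > 0 and f11 > 0:
        return "minimum"
    return "no conclusion from these conditions"

result_h = classify_bivariate(f11=2, f22=8, f12=1)     # second partials of h
result_g = classify_bivariate(f11=-2, f22=-8, f12=0)   # second partials of g
print(result_h, result_g)
```

Note that $f_{22}$ need not be tested separately: when $f_{11} f_{22} > (f_{12})^2 \ge 0$, the two own second partials must have the same sign.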

Second-Order Condition in the General Multivariate Case¶

Let us use the tools of matrix algebra to develop a set of conditions that enables us to find the sign of the second total differential of a multivariate function.

First, assume a bivariate case for which the second total differential is given by

d^2 y = f_{11}(dx_1)^2 + f_{22}(dx_2)^2 + 2 f_{12}(dx_1)(dx_2).

This expression can be written in matrix form as the quadratic form of the two variables $dx_1$ and $dx_2$ as follows:

d^2 y = \begin{bmatrix} dx_1 & dx_2 \end{bmatrix} \begin{bmatrix} f_{11} & f_{12} \\ f_{21} & f_{22} \end{bmatrix} \begin{bmatrix} dx_1 \\ dx_2 \end{bmatrix}.

In other words, the second total differential (or second total derivative) for a multivariate function can be written more generally as

d^2 y = dx' H dx = \begin{bmatrix} dx_1 & dx_2 & \cdots & dx_n \end{bmatrix} \begin{bmatrix} f_{11} & f_{12} & \cdots & f_{1n} \\ f_{21} & f_{22} & \cdots & f_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ f_{n1} & f_{n2} & \cdots & f_{nn} \end{bmatrix} \begin{bmatrix} dx_1 \\ dx_2 \\ \vdots \\ dx_n \end{bmatrix}.

Here, $H$ is the Hessian matrix, and $dx$ is the column vector of differentials.

All that remains is to determine the sign definiteness of the quadratic form by determining the sign definiteness of the Hessian.
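To make the quadratic form concrete, the sketch below evaluates $dx' H dx$ directly for the Hessian of $g(x_1, x_2) = 6x_1 - x_1^2 + 16x_2 - 4x_2^2$ from the earlier example. The function `quadratic_form` and the sample of directions are assumptions chosen for illustration; checking a handful of directions is suggestive only, which is why a test on $H$ itself is needed.

```python
def quadratic_form(H, dx):
    """Return dx' H dx for a square matrix H (list of rows) and a vector dx."""
    n = len(dx)
    return sum(dx[i] * H[i][j] * dx[j] for i in range(n) for j in range(n))

H_g = [[-2, 0],
       [0, -8]]   # Hessian of g: g11 = -2, g12 = g21 = 0, g22 = -8

# An arbitrary sample of displacement vectors (dx1, dx2).
directions = [(1, 0), (0, 1), (1, 1), (1, -1)]
values = [quadratic_form(H_g, dx) for dx in directions]
print(values)   # every sampled direction gives a negative value
```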

Interpreting the Second-Order Condition¶

The sign of the second total differential $d^2 y$ determines the local curvature of the function and therefore whether a critical point is a local maximum, minimum, or neither.

Because

d^2 y = dx' H dx,

the problem reduces to determining the sign definiteness of the Hessian matrix $H$ .

Positive and Negative Definiteness¶

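The Hessian $H$ is positive definite if $dx' H dx > 0$ for every nonzero $dx$, and negative definite if $dx' H dx < 0$ for every nonzero $dx$. It is indefinite if $dx' H dx$ takes both positive and negative values. These cases correspond to the curvature of the function at a critical point.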

Second-Order Conditions for Optimization¶

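Suppose $y = f(x_1, \ldots, x_n)$ and $\nabla f = 0$ at a point $x^*$. Then:

If $H$ is negative definite at $x^*$, then $x^*$ is a local maximum.
If $H$ is positive definite at $x^*$, then $x^*$ is a local minimum.
If $H$ is indefinite at $x^*$, then $x^*$ is a saddle point.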

Sylvester’s Criterion (Practical Test)¶

In practice, definiteness is checked using the leading principal minors of the Hessian, i.e., the determinants of its upper-left $k \times k$ submatrices.

Bivariate Case ( $n = 2$ )¶

Let

H = \begin{bmatrix} f_{11} & f_{12} \\ f_{21} & f_{22} \end{bmatrix}.

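Then, with $\det(H) = f_{11} f_{22} - f_{12} f_{21}$:

$f_{11} > 0$ and $\det(H) > 0$ → $H$ is positive definite (local minimum)
$f_{11} < 0$ and $\det(H) > 0$ → $H$ is negative definite (local maximum)
$\det(H) < 0$ → $H$ is indefinite (saddle point)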

This criterion is widely used in economics because it avoids computing the quadratic form directly.

Example

Consider the function

y = -x_1^2 - 2x_2^2 + 4x_1 x_2.

The Hessian matrix is

H = \begin{bmatrix} -2 & 4 \\ 4 & -4 \end{bmatrix}.

Compute the determinant:

\det(H) = (-2)(-4) - 16 = -8 < 0.

Since the determinant is negative, the Hessian is indefinite, and the critical point is a saddle point.
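The same check extends to any number of variables through the leading principal minors. The sketch below is a minimal pure-Python version, assuming a symmetric Hessian given as a list of rows; `det`, `leading_principal_minors`, and `definiteness` are hypothetical helper names, and the cofactor expansion is only practical for small matrices.

```python
def det(M):
    """Determinant by cofactor expansion along the first row (small matrices only)."""
    if len(M) == 1:
        return M[0][0]
    total = 0
    for j, entry in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def leading_principal_minors(H):
    """Determinants of the upper-left k x k blocks, for k = 1, ..., n."""
    return [det([row[:k] for row in H[:k]]) for k in range(1, len(H) + 1)]

def definiteness(H):
    minors = leading_principal_minors(H)
    if all(m > 0 for m in minors):
        return "positive definite"
    # Negative definite: minors alternate in sign, starting with a negative one.
    if all((m < 0) if k % 2 == 1 else (m > 0) for k, m in enumerate(minors, start=1)):
        return "negative definite"
    return "not definite by this test"

H = [[-2, 4],
     [4, -4]]
minors_H = leading_principal_minors(H)
print(minors_H, definiteness(H))
```

For this $2 \times 2$ example the second minor is the negative determinant computed above, which is what rules out both definite cases and identifies the saddle point.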

Economic Interpretation¶

Concavity (negative definite Hessian) typically reflects diminishing marginal returns and ensures that an interior stationary point is a maximum.
Convexity (positive definite Hessian) ensures that an interior stationary point is a minimum, as in cost minimization problems.
Indefiniteness indicates a saddle point: the stationary point is a maximum in some directions and a minimum in others, a pattern common in strategic or general equilibrium settings.

Optimizing Multivariate Functions¶

i. $f(x,y) = 3x^2 - xy + 2y^2 - 4x - 7y + 12$ ¶

ii. $f(x, y) = 60x + 34y - 4xy - 6x^2 - 3y^2 + 5$ ¶

iii. $f(x,y) = 48y - 3x^2 - 6xy - 2y^2 + 72x$ ¶

iv. $f(x, y) = 5x^2 - 3y^2 - 30x + 7y + 4xy$ ¶

v. $f(x,y) = x^3 - 3x + y^2 - 4y$ ¶