Lagrange Dual Problem: an introduction to the Lagrange dual problem, an important topic in operations research that textbooks do not…

1. Preface

For the past few months, I have been troubled by the following questions:

How do we construct the dual problem of a linear program? Is there a general method for doing this?
Why is the number of variables in the dual problem equal to the number of constraints in the primal problem?
Why is the number of constraints in the dual problem equal to the number of variables in the primal problem?
Why is the sign of the $\footnotesize i$-th variable of the dual problem determined by the direction of the $\footnotesize i$-th constraint of the primal problem?
Why is the direction of the $\footnotesize j$-th constraint of the dual problem determined by the sign of the $\footnotesize j$-th variable of the primal problem?

If you also want to know the reasons behind these facts, read on for a deeper understanding of the dual problem.

2. Lagrange Dual Problem

2.1 Dual Pair

The two problems

\small z=\max\{c(x)\,|\,x\in X\}

\small w=\min\{w(u)\,|\,u\in U\}

form a (weak)-dual pair if $\footnotesize c(x)\leq w(u)$ for all $\footnotesize x \in X$ and all $\footnotesize u\in U$, which implies $\footnotesize z\leq w$. When $\footnotesize z=w$, they form a strong-dual pair.

2.2 Primal Problem

Consider an optimization problem

\small \begin{aligned} &p^*=\min f_0(x) \\ s.t.\,\, &f_i(x)\leq 0,i=1,\cdots,m \end{aligned}

We denote by $\footnotesize D$ the domain of the problem, with $\footnotesize D\subseteq \mathbb{R}^n$. A point $\footnotesize x\in D$ is feasible if it satisfies all the constraints. The above is referred to as the primal problem.

One purpose of Lagrange duality is to find a lower bound on the optimal value $\footnotesize p^*$ of a minimization problem.

2.3 Dual Problem

To the problem we associate the Lagrangian $\footnotesize \mathcal{L}: \mathbb{R}^n\times \mathbb{R}^m\rightarrow \mathbb{R}$

\small \mathcal{L}(x,\lambda) = f_0(x) + \sum_{i=1}^m\lambda_if_i(x)

The variables $\footnotesize \lambda\in \mathbb{R}^m$ are called Lagrange multipliers.

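For every feasible $\footnotesize x$ (that is, $\footnotesize x\in D$ with $\footnotesize f_i(x)\leq 0$ for all $\footnotesize i$) and every $\footnotesize \lambda\geq 0$, each term $\footnotesize \lambda_if_i(x)$ is nonpositive, so

\small f_0(x)\geq \mathcal{L}(x,\lambda)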

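Moreover, $\footnotesize \max_{\lambda\geq 0}\mathcal{L}(x,\lambda)$ equals $\footnotesize f_0(x)$ when $\footnotesize x$ is feasible and $\footnotesize +\infty$ otherwise, so the primal problem can be expressed exactly as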

\small p^* = \min_{x\in D}\max_{\lambda\geq 0} \mathcal{L}(x,\lambda)

We then define the Lagrange dual function

\small g(\lambda) = \min_{x\in \mathbb{R}^n} \mathcal{L}(x,\lambda)

Thus, for every feasible $\footnotesize x$ and every $\footnotesize \lambda\geq 0$, we obtain

\small f_0(x)\geq \mathcal{L}(x,\lambda) \geq g(\lambda)

so the problem

\small \begin{aligned} &d^* = \max g(\lambda) \\ s.t.\,\,& \lambda \geq 0 \end{aligned}

and the primal problem form a (weak)-dual pair, that is, $\footnotesize d^*\leq p^*$. The above problem is called the Lagrange dual problem.
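
To see the construction at work, here is a toy instance chosen only for illustration: take $\footnotesize f_0(x)=x^2$ with the single constraint $\footnotesize f_1(x)=1-x\leq 0$, so that $\footnotesize p^*=1$ is attained at $\footnotesize x=1$. The Lagrangian and its minimizer over $\footnotesize x$ are

\small \mathcal{L}(x,\lambda)=x^2+\lambda(1-x),\qquad \frac{\partial \mathcal{L}}{\partial x}=2x-\lambda=0\ \Rightarrow\ x=\frac{\lambda}{2}

so the dual function and the dual optimum are

\small g(\lambda)=\frac{\lambda^2}{4}+\lambda\left(1-\frac{\lambda}{2}\right)=\lambda-\frac{\lambda^2}{4},\qquad d^*=\max_{\lambda\geq 0} g(\lambda)=g(2)=1

In this instance $\footnotesize d^*=p^*$, so the lower bound happens to be tight; in general only $\footnotesize d^*\leq p^*$ is guaranteed.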

2.4 Cases With Equality Constraints

More generally, consider problems that also have equality constraints:

\small \begin{aligned} &p^*=\min f_0(x) \\ s.t.& \begin{cases} f_i(x)\leq 0,i=1,\cdots,m \\ h_i(x) = 0,i=1,\cdots,p \end{cases} \end{aligned}

Rewrite the problem as:

\small \begin{aligned} &p^*=\min f_0(x) \\ s.t.& \begin{cases} f_i(x)\leq 0,&i=1,\cdots,m \\ h_i(x) \leq 0,&i=1,\cdots,p \\ -h_i(x) \leq 0,&i=1,\cdots,p \end{cases} \end{aligned}

Using multipliers $\footnotesize v^+_i\geq 0$ and $\footnotesize v^-_i\geq 0$ for the constraints $\footnotesize h_i(x) \leq 0$ and $\footnotesize -h_i(x) \leq 0$ respectively, we write the associated Lagrangian as

\small \begin{aligned} \mathcal{L}(x,\lambda,v^+,v^-) &= f_0(x) + \sum_{i=1}^m\lambda_if_i(x) + \sum_{i=1}^pv^+_ih_i(x)+\sum_{i=1}^pv^-_i(-h_i(x)) \\ & = f_0(x) + \sum_{i=1}^m\lambda_if_i(x) + \sum_{i=1}^pv_ih_i(x) \end{aligned}

where $\footnotesize v=v^+ - v^-$ is unrestricted in sign, because it is the difference of two nonnegative multipliers.

Thus, inequality constraints in the original problem correspond to sign-constrained multipliers, while equality constraints correspond to free multipliers.
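
A small worked instance, again chosen only for illustration, shows why the multiplier of an equality constraint must be allowed to be negative. Take $\footnotesize f_0(x)=x_1^2+x_2^2$ with the single constraint $\footnotesize h_1(x)=x_1+x_2-1=0$; the primal optimum is $\footnotesize p^*=\tfrac{1}{2}$ at $\footnotesize x_1=x_2=\tfrac{1}{2}$. The Lagrangian and its minimizer over $\footnotesize x$ are

\small \mathcal{L}(x,v)=x_1^2+x_2^2+v(x_1+x_2-1),\qquad 2x_1+v=0,\ 2x_2+v=0\ \Rightarrow\ x_1=x_2=-\frac{v}{2}

so the dual function and the dual optimum are

\small g(v)=\frac{v^2}{2}+v(-v-1)=-\frac{v^2}{2}-v,\qquad d^*=\max_{v} g(v)=g(-1)=\frac{1}{2}

The maximizer is $\footnotesize v=-1$. Had we imposed $\footnotesize v\geq 0$, the best value would be $\footnotesize g(0)=0$, a strictly weaker bound than $\footnotesize p^*=\tfrac{1}{2}$.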

3. Examples

Based on the above theory, let's construct the dual problem of a linear program.

3.1 Inequality Form

Consider the following form,

\small \begin{aligned} &\max c^Tx \\ s.t.& \begin{cases} Ax\leq b \\ x\geq 0 \end{cases} \end{aligned}

Rewrite it as a minimization problem with constraints of the form $\footnotesize f_i(x)\leq 0$:

\small \begin{aligned} &\min -c^Tx \\ s.t.& \begin{cases} Ax-b\leq 0 \\ -x\leq 0 \end{cases} \end{aligned}

Construct the Lagrangian

\small \mathcal{L}(x,\lambda,v) = -c^Tx+\lambda^T(Ax-b)+v^T(-x) = (-c^T+\lambda^TA-v^T)x-\lambda^Tb

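Every term added to the objective is nonpositive when $\footnotesize x$ is feasible, so for feasible $\footnotesize x$

\small -c^Tx\geq \mathcal{L}(x,\lambda,v),\forall\lambda\geq 0,v\geq 0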

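The Lagrangian is linear in $\footnotesize x$, so its minimum over $\footnotesize x$ is $\footnotesize -\infty$ unless the coefficient of $\footnotesize x$ vanishes. If $\footnotesize -c^T+\lambda^TA-v^T =0$, then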

\small g(\lambda,v) = \min_x \mathcal{L}(x,\lambda,v) = -\lambda^Tb

\small -c^Tx\geq \mathcal{L}(x,\lambda,v)\geq g(\lambda,v),\forall \lambda\geq 0,v\geq 0

So the dual problem is

\small \begin{aligned} &\max -\lambda^Tb \\ s.t.& \begin{cases} -c^T+\lambda^TA = v^T \\ \lambda\geq 0,v\geq 0 \end{cases} \end{aligned}

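Eliminating $\footnotesize v\geq 0$ turns the equality into $\footnotesize A^T\lambda\geq c$, and maximizing $\footnotesize -\lambda^Tb$ is the same as minimizing $\footnotesize \lambda^Tb$, so the final form is: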

\small \begin{aligned} &\min \lambda^Tb \\ s.t.& \begin{cases} A^T\lambda \geq c\\ \lambda\geq 0 \end{cases} \end{aligned}
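
The primal/dual pair above can be checked numerically on a small instance. The sketch below is only an illustration under stated assumptions: the data `A`, `b`, `c` are made up, the helper names `vertices` and `dot` are hypothetical, and the brute-force vertex enumeration works only for two variables and relies on the optimum being attained at a vertex, which holds for this instance.

```python
from itertools import combinations

def vertices(G, h, tol=1e-9):
    """Vertices of {y in R^2 : G y <= h}, found by intersecting pairs of boundary lines."""
    rows = list(zip(G, h))
    found = []
    for (g1, h1), (g2, h2) in combinations(rows, 2):
        det = g1[0] * g2[1] - g1[1] * g2[0]
        if abs(det) < tol:
            continue  # parallel boundaries: no unique intersection point
        # Cramer's rule for the system g1.y = h1, g2.y = h2
        y = ((h1 * g2[1] - g1[1] * h2) / det,
             (g1[0] * h2 - g2[0] * h1) / det)
        # keep the intersection only if it satisfies every constraint
        if all(g[0] * y[0] + g[1] * y[1] <= rhs + tol for g, rhs in rows):
            found.append(y)
    return found

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

# Illustrative data (assumed): max c^T x  s.t.  A x <= b, x >= 0
A = [[1.0, 1.0], [1.0, 3.0]]
b = [4.0, 6.0]
c = [3.0, 2.0]
nonneg = [[-1.0, 0.0], [0.0, -1.0]]  # encodes -y <= 0

# Primal feasible set: A x <= b, -x <= 0
primal_vertices = vertices(A + nonneg, b + [0.0, 0.0])

# Dual feasible set: A^T lam >= c, lam >= 0, written as -A^T lam <= -c, -lam <= 0
neg_At = [[-a for a in col] for col in zip(*A)]
dual_vertices = vertices(neg_At + nonneg, [-cj for cj in c] + [0.0, 0.0])

z = max(dot(c, x) for x in primal_vertices)    # primal optimum (max)
w = min(dot(b, lam) for lam in dual_vertices)  # dual optimum (min)

# Weak duality: c^T x <= b^T lam for every pair of feasible vertices
weak_ok = all(dot(c, x) <= dot(b, lam) + 1e-9
              for x in primal_vertices for lam in dual_vertices)
print(z, w, weak_ok)
```

For this data both optimal values come out as 12 (primal vertex $\footnotesize x=(4,0)$, dual vertex $\footnotesize \lambda=(3,0)$), so the pair is not only weakly but strongly dual here.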

3.2 A General Form

Consider the following problem

\small \begin{aligned} &\max\, c_1x_1+c_2x_2+c_3x_3 \\ s.t.& \begin{cases} a_{11}x_1+a_{12}x_2+a_{13}x_3 \leq b_1 \\ a_{21}x_1+a_{22}x_2+a_{23}x_3 \geq b_2 \\ a_{31}x_1+a_{32}x_2+a_{33}x_3 = b_3 \\ x_1\geq 0,x_2\leq 0, x_3\text{ free} \end{cases} \end{aligned}

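Rewrite the problem as a minimization with every constraint in the form $\footnotesize f_i(x)\leq 0$, splitting the equality into two inequalities: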

\small \begin{aligned} &\min\,-(c_1x_1+c_2x_2+c_3x_3) \\ s.t.& \begin{cases} a_{11}x_1+a_{12}x_2+a_{13}x_3-b_1 &\leq 0 \\ -(a_{21}x_1+a_{22}x_2+a_{23}x_3-b_2)&\leq 0 \\ a_{31}x_1+a_{32}x_2+a_{33}x_3 - b_3 &\leq 0 \\ -(a_{31}x_1+a_{32}x_2+a_{33}x_3 - b_3)&\leq 0 \\ -x_1\leq 0\\ x_2\leq 0 \end{cases} \end{aligned}

Construct the Lagrangian

\small \begin{aligned} \mathcal{L}(x,\lambda,v) =\,& -c_1x_1-c_2x_2-c_3x_3 + \\ &\lambda_1^+(a_{11}x_1+a_{12}x_2+a_{13}x_3-b_1) +\\ &\lambda_2^-(-(a_{21}x_1+a_{22}x_2+a_{23}x_3-b_2))+\\ &\lambda_3^+(a_{31}x_1+a_{32}x_2+a_{33}x_3 - b_3) +\\ &\lambda_3^-(-(a_{31}x_1+a_{32}x_2+a_{33}x_3 - b_3))+\\ &v_1^-(-x_1) + v_2^+x_2 \\ =\, &(-c_1+\lambda_1^+a_{11}-\lambda_2^-a_{21} + (\lambda_3^+ -\lambda_3^-)a_{31} - v_1^-)x_1 +\\ &(-c_2+\lambda_1^+a_{12}-\lambda_2^-a_{22} + (\lambda_3^+ -\lambda_3^-)a_{32}+v_2^+)x_2 +\\ &(-c_3+\lambda_1^+a_{13}-\lambda_2^-a_{23} + (\lambda_3^+ -\lambda_3^-)a_{33})x_3- \\ & (\lambda_1^+b_1-\lambda_2^-b_2+(\lambda_3^+ -\lambda_3^-)b_3) \end{aligned}

\small -c_1x_1-c_2x_2-c_3x_3\geq \mathcal{L}(x,\lambda,v),\forall\lambda_1^+,\lambda_2^-,\lambda_3^+,\lambda_3^-,v_1^-,v_2^+\geq 0

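The Lagrangian is linear in $\footnotesize x$, so $\footnotesize \min_x \mathcal{L}(x,\lambda,v)$ is finite only if the coefficient of each $\footnotesize x_j$ vanishes:

\small \begin{aligned} &-c_1+\lambda_1^+a_{11}-\lambda_2^-a_{21} + (\lambda_3^+ -\lambda_3^-)a_{31} - v_1^-=0\\ &-c_2+\lambda_1^+a_{12}-\lambda_2^-a_{22} + (\lambda_3^+ -\lambda_3^-)a_{32}+v_2^+=0\\ &-c_3+\lambda_1^+a_{13}-\lambda_2^-a_{23} + (\lambda_3^+ -\lambda_3^-)a_{33}=0 \end{aligned}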

Let $\footnotesize \lambda_1 = \lambda_1^+,\lambda_2 = -\lambda_2^-,\lambda_3 = \lambda_3^+ -\lambda_3^-$, so that $\footnotesize \lambda_1\geq 0,\lambda_2\leq 0,\lambda_3\text{ free}$. In terms of these variables the conditions read

\small \begin{aligned} &-c_1+\lambda_1a_{11}+\lambda_2a_{21} +\lambda_3a_{31} = v_1^- \geq 0\\ &-c_2+\lambda_1a_{12}+\lambda_2a_{22} +\lambda_3a_{32} = -v_2^+ \leq 0\\ &-c_3+\lambda_1a_{13}+\lambda_2a_{23} +\lambda_3a_{33} = 0 \end{aligned}

and therefore

\small g(\lambda,v) = \min_x \mathcal{L}(x,\lambda,v) =- (\lambda_1^+b_1-\lambda_2^-b_2+(\lambda_3^+ -\lambda_3^-)b_3) = -(\lambda_1b_1+\lambda_2b_2+\lambda_3b_3)

Maximizing $\footnotesize g$ is equivalent to minimizing $\footnotesize -g$, so the dual problem is

\small \begin{aligned} &\min\, \lambda_1b_1+\lambda_2b_2+\lambda_3b_3 \\ s.t.& \begin{cases} \lambda_1a_{11}+\lambda_2a_{21} +\lambda_3a_{31} \geq c_1 \\ \lambda_1a_{12}+\lambda_2a_{22} +\lambda_3a_{32} \leq c_2 \\ \lambda_1a_{13}+\lambda_2a_{23} +\lambda_3a_{33} = c_3 \\ \lambda_1\geq 0,\lambda_2\leq 0,\lambda_3\text{ free} \end{cases} \end{aligned}
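
The sign rules that fall out of this derivation can be collected into a small lookup. The sketch below is a hypothetical helper (the function name, the string encodings such as `">=0"`, and the numeric data are all assumptions made for illustration); it covers only a primal problem stated as a maximization, as in this section.

```python
# Rules read off from the derivation, for a primal "max" problem:
# the direction of constraint i fixes the sign of dual variable lambda_i,
# and the sign of variable x_j fixes the direction of dual constraint j.
CONSTRAINT_TO_DUAL_VAR = {"<=": ">=0", ">=": "<=0", "=": "free"}
VAR_TO_DUAL_CONSTRAINT = {">=0": ">=", "<=0": "<=", "free": "="}

def dualize_max_lp(c, A, b, constraint_senses, variable_signs):
    """Data of the dual "min" LP of: max c^T x  s.t.  A x (senses) b, x (signs)."""
    return {
        "objective": list(b),                          # minimize b^T lambda
        "matrix": [list(col) for col in zip(*A)],      # rows of A^T
        "rhs": list(c),                                # right-hand side c
        "constraint_senses": [VAR_TO_DUAL_CONSTRAINT[s] for s in variable_signs],
        "variable_signs": [CONSTRAINT_TO_DUAL_VAR[s] for s in constraint_senses],
    }

# Same sign pattern as the example above, with made-up numbers
dual = dualize_max_lp(
    c=[1, 2, 3],
    A=[[1, 0, 2], [0, 1, 1], [3, 1, 0]],
    b=[4, 5, 6],
    constraint_senses=["<=", ">=", "="],
    variable_signs=[">=0", "<=0", "free"],
)
print(dual["constraint_senses"], dual["variable_signs"])
```

The primal has one row of `A` per constraint and one column per variable, so transposing `A` is what swaps the two counts in the dual.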

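This answers the questions from the preface: each primal constraint contributes one multiplier, which becomes a dual variable whose sign follows from the direction of that constraint, and each primal variable contributes one coefficient condition, which becomes a dual constraint whose direction follows from the sign of that variable. Thanks for your attention!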
