Method of conjugate directions.

he formula ( Orthogonality of residues ) shows the steepest descent method sometimes takes steps in the same direction that was taken before. This is a waste of efficiency. We would like to find a procedure that never takes the same direction. Instead of search directions we would like to find a set of linearly independent search directions . Of course, if the directions are chosen badly then the best length of step in every direction would be impossible to determine. Since we are minimizing it makes sense to choose directions orthogonal with respect to the scalar product . This way we can perform a -minimization in every direction $d_{k}$ (see the section ( Method of steepest descent )) and guarantee that the iteration $x_{k}$ remains -optimal with respect to the directions of the previous steps. Therefore such procedure has to hit the solution after steps or less (under theoretical assumption that numerical errors are absent).

We produce a set search direction by performing the Gram-Schmidt orthogonalization (see the section ( Gram-Schmidt orthogonalization )) with respect to the scalar product and starting from any set of linearly independent vectors. Let be a result of such orthogonalization: MATH The following calculations are motivationally similar to the calculations of the section ( Method of steepest descent ). We start from any point $x_{0}$ . At a step we look at the line MATH and seek MATH Note at this point that changing from to for any vector that is -orthogonal to $d_{k}$ would not alter the result of minimization. Hence, if we perform this procedure times we arrive to a point that is optimal with respect to every vector . Therefore, such point is a solution.

To perform the minimization we calculate the derivative MATH At this point we substitute the formula ( Connection between SLA and minimization ) and $x^{\ast}$ disappears from the calculation. MATH We equate the derivative to zero and obtain MATH We collect the results: MATH Similarly to the section ( Method of steepest descent ) one can eliminate one matrix multiplication by transforming the equation $\left( \#\right)$ . We multiply by and subtract : MATH

Algorithm

(Conjugate directions) Let be a set of vectors with the property MATH Take any point $x_{0}$ , calculate MATH and iterate for MATH

Note for future use that MATH means MATH or MATH In fact, $r_{k+1}$ is orthogonal to all directions for previous steps because, as we noted already, $x_{k+1}$ remains -optimal:

(Orthogonality of residues 2)

Notation.

Index.

Contents.