Skip to content

Powell's example of cyclic non-convergence in coordinate descent with exact minimization

License

Notifications You must be signed in to change notification settings

wmkouw/cd-powells

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Powell's example of cyclic non-convergence in coordinate descent.

Consider the objective function:

	f(x,y,z) = -xy -yz -xz + (x-1)^2 + (-x-1)^2 + (y-1)^2 + (-y-1)^2 + (z-1)^2 + (-z-1)^2 .

It can be shown that the minimizers of this function are the two points: [1, 1, 1] and [-1, -1, -1]. Exact minimization with respect to each coordinate yields the following solutions:

	x <- (1 + |y + z|/2)*sign(y + z)
	y <- (1 + |x + z|/2)*sign(x + z)
	z <- (1 + |x + y|/2)*sign(x + y)

If one starts the coordinate descent procedure at the point [-1 - ϵ, 1 + ϵ/2, -1 - ϵ/4] for a small number ϵ, and updates the coordinates in x,y,z order, then the optimization procedure will not converge. It cycles around the other vertices of the [-1, 1] cube, as shown in the animation below.

The example is just to show that exact CD will not necessarily converge. One can enforce convergence in this function by randomly selecting the coordinate to be updated or by using a gradient-based update.

References

About

Powell's example of cyclic non-convergence in coordinate descent with exact minimization

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages