2020/11/11

Suppose that, given a convex function $f: \mathbb{R}^d \to \mathbb{R}$, we would like to find its minimum by iterating \begin{align*} \theta_t = \theta_{t-1} - \gamma \nabla f(\theta_{t-1}). \end{align*} How fast does this iteration converge to the minimum of $f$?
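As a quick illustration, here is a minimal sketch of this iteration on a convex quadratic; the matrix, step size, and iteration count below are illustrative choices, not the post's setup.

```python
import numpy as np

# Gradient descent on the convex quadratic f(theta) = 0.5 * theta^T A theta
# (A positive definite), whose gradient is A theta. A and gamma are
# illustrative choices.
rng = np.random.default_rng(0)
d = 5
B = rng.standard_normal((d, d))
A = B @ B.T + np.eye(d)            # positive definite, so f is convex
grad_f = lambda theta: A @ theta   # gradient of 0.5 * theta^T A theta

gamma = 1.0 / np.linalg.eigvalsh(A).max()  # step size of order 1/L for smooth f
theta = rng.standard_normal(d)
for t in range(200):
    theta = theta - gamma * grad_f(theta)

print(np.linalg.norm(theta))  # should be close to the minimizer theta* = 0
```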
2016/09/23
A simple bound for optimisation using a grid
If I give you a function on $[0,1]$ and a computer and ask you to find its minimum, what would you do? Since you have a computer, you can be lazy: just put a grid on $[0,1]$, evaluate the function at the grid points, and take the smallest value. This will give you something close to the true minimum. But how close?
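Here is a minimal sketch of that grid strategy; the test function and grid size are illustrative choices. For reference, if $f$ is $L$-Lipschitz and the grid has spacing $1/n$, the best grid value is within $L/(2n)$ of the true minimum, which is presumably the kind of bound the post derives.

```python
import numpy as np

# Evaluate f on a uniform grid over [0,1] and take the smallest value.
# The test function f is an arbitrary illustrative choice.
f = lambda x: (x - 0.3) ** 2 + 0.1 * np.sin(20 * x)

n = 1000
grid = np.linspace(0.0, 1.0, n + 1)   # grid spacing 1/n
values = f(grid)
i = np.argmin(values)
print(grid[i], values[i])  # grid point achieving the smallest value on the grid
```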
2013/06/22
Nonnegative Matrix Factorization
Introduction.
In this post, I derive the nonnegative matrix factorization (NMF) algorithm proposed by Lee and Seung (1999), obtaining the multiplicative updates from a gradient descent point of view following the treatment in their later NIPS paper, Algorithms for Nonnegative Matrix Factorization. The code for this blog post can be accessed from here.
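For concreteness, here is a minimal sketch of the Lee-Seung multiplicative updates for the squared Frobenius objective $\|V - WH\|_F^2$; the matrix sizes, rank, iteration count, and eps are illustrative choices, and this is not the linked code.

```python
import numpy as np

def nmf(V, k, n_iter=200, eps=1e-10):
    """Lee-Seung multiplicative updates for min ||V - W H||_F^2, W, H >= 0."""
    n, m = V.shape
    rng = np.random.default_rng(0)
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)   # update H with W fixed
        W *= (V @ H.T) / (W @ H @ H.T + eps)   # update W with H fixed
    return W, H

V = np.random.default_rng(1).random((30, 20))  # nonnegative data
W, H = nmf(V, k=5)
print(np.linalg.norm(V - W @ H))  # reconstruction error after the updates
```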
categories: machine learning, matrix factorizations, optimization
2013/05/23
Stochastic gradient descent
In this post, I introduce a widely used stochastic optimization technique, namely stochastic gradient descent. I also implement the algorithm for the linear regression problem and provide the Matlab code.
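As a minimal sketch of the idea (in Python here, whereas the post provides Matlab code), the loop below runs SGD on a synthetic least-squares linear regression problem; the data, step size, and number of passes are illustrative choices.

```python
import numpy as np

# SGD for least-squares linear regression on synthetic data.
rng = np.random.default_rng(0)
n, d = 1000, 3
X = rng.standard_normal((n, d))
w_true = np.array([1.0, -2.0, 0.5])
y = X @ w_true + 0.1 * rng.standard_normal(n)

w = np.zeros(d)
gamma = 0.01                          # constant step size
for epoch in range(20):
    for i in rng.permutation(n):      # one pass over shuffled examples
        grad = (X[i] @ w - y[i]) * X[i]   # gradient of 0.5 * (x_i^T w - y_i)^2
        w -= gamma * grad

print(w)  # should be roughly w_true
```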
categories: machine learning, optimization, probability