Training Models with Regression and Gradient Descent

Gradient Descent

The goal of gradient descent is still to minimize the cost function, but it follows an iterative process:

Start with a random
Calculate the gradient for the current
Update as
Repeat 2-3 until some stopping criterion is met

where is the learning rate, or the size of step to take in the direction opposite the gradient.