About

gradient-descent

Tag Info

Info Newest Frequent Score Active Unanswered

Gradient Descent is an algorithm for finding the minimum of a function. It iteratively calculates partial derivatives (gradients) of the function and descends in steps proportional to those partial derivatives. One major application of Gradient Descent is fitting a parameterized model to a set of data: the function to be minimized is an error function for the model.

Wiki:

Gradient descent is a first-order iterative optimization algorithm. It is an optimization algorithm used to find the values of parameters (coefficients) of a function (f) that minimizes a cost function (cost).

To find a local minimum of a function using gradient descent, one takes steps proportional to the negative of the gradient (or of the approximate gradient) of the function at the current point.

Gradient descent is best used when the parameters cannot be calculated analytically (e.g. using linear algebra) and must be searched for by an optimization algorithm.

Gradient descent is also known as steepest descent, or the method of steepest descent.

Tag usage:

Questions on gradient-descent should be about implementation and programming problems, not about the theoretical properties of the optimization algorithm. Consider whether your question might be better suited to Cross Validated, the StackExchange site for statistics, machine learning and data analysis.

Read more:

History Excerpt history

Stats

created	12 years, 8 months ago
viewed	111 times
active	5 years, 1 month ago
editors	2

Recent Hot Answers

Pytorch, use loss that don't return gradient

more »

Collectives™ on Stack Overflow

About

Stats

Top Answerers

Recent Hot Answers

Collectives™ on Stack Overflow

About

Stats

Top Answerers

Recent Hot Answers

Related Tags