Various Optimization Algorithms For Training Neural Network [Repost]

from

https://towardsdatascience.com/optimizers-for-training-neural-network-59450d71caf6

 


Optimizers help to get results faster

Gradient Descent
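As a rough illustration of plain gradient descent, the sketch below repeatedly steps against the gradient of a loss. The function name, the learning rate `lr`, and the toy loss J(theta) = (theta - 3)^2 are assumptions chosen for the example, not taken from the original article.

```python
def gradient_descent(grad, theta, lr=0.1, steps=100):
    """Repeatedly move theta against the gradient of the loss."""
    for _ in range(steps):
        theta = theta - lr * grad(theta)
    return theta

# Hypothetical loss J(theta) = (theta - 3)^2, whose gradient is 2 * (theta - 3).
theta_gd = gradient_descent(lambda t: 2.0 * (t - 3.0), theta=0.0)
print(theta_gd)
```

Every step here uses the exact gradient of the full loss, which is what separates this variant from the stochastic ones.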

Stochastic Gradient Descent

Mini-Batch Gradient Descent
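The three gradient descent variants differ mainly in how many samples feed each update. A minimal sketch, assuming a hypothetical one-parameter least-squares problem (fit y ≈ w * x): setting `batch_size=1` behaves like stochastic gradient descent, and setting it to the dataset size recovers full-batch gradient descent. All names and hyperparameters are illustrative.

```python
import random

def minibatch_sgd(xs, ys, batch_size, lr=0.05, epochs=200, seed=0):
    """Fit y ~ w * x by least squares, updating w once per mini-batch."""
    rng = random.Random(seed)
    w = 0.0
    indices = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(indices)  # visit the samples in a new order each epoch
        for start in range(0, len(indices), batch_size):
            batch = indices[start:start + batch_size]
            # Mean gradient of (w * x - y)^2 over the samples in this batch.
            grad = sum(2.0 * (w * xs[i] - ys[i]) * xs[i] for i in batch) / len(batch)
            w -= lr * grad
    return w

# Toy data generated from y = 2x (an assumption for the example).
xs = [0.5, 1.0, 1.5, 2.0]
ys = [1.0, 2.0, 3.0, 4.0]

w_sgd = minibatch_sgd(xs, ys, batch_size=1)         # stochastic gradient descent
w_mini = minibatch_sgd(xs, ys, batch_size=2)        # mini-batch gradient descent
w_full = minibatch_sgd(xs, ys, batch_size=len(xs))  # full-batch gradient descent
print(w_sgd, w_mini, w_full)
```

On this noiseless toy data all three settings approach the same slope; on real data the smaller batches give noisier but cheaper updates.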

Momentum
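Momentum keeps a running velocity so that past gradients continue to influence the current step. The sketch below uses the common form v = gamma * v + lr * grad, theta = theta - v; the names `gamma` and `lr` and the toy quadratic loss are assumptions for illustration.

```python
def momentum_descent(grad, theta, lr=0.1, gamma=0.9, steps=500):
    """Gradient descent with a velocity term that accumulates past gradients."""
    velocity = 0.0
    for _ in range(steps):
        velocity = gamma * velocity + lr * grad(theta)
        theta = theta - velocity
    return theta

# Hypothetical loss J(theta) = (theta - 3)^2.
theta_momentum = momentum_descent(lambda t: 2.0 * (t - 3.0), theta=0.0)
print(theta_momentum)
```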

Nesterov Accelerated Gradient


NAG vs momentum at local minima
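The difference between NAG and plain momentum can be sketched in a few lines: NAG evaluates the gradient at a look-ahead point (where the current velocity is about to carry the parameters) rather than at the current position. This is a minimal sketch under the same assumed toy loss and hyperparameter names as a typical momentum example, not the original article's code.

```python
def nesterov_descent(grad, theta, lr=0.1, gamma=0.9, steps=500):
    """Momentum update whose gradient is taken at the look-ahead position."""
    velocity = 0.0
    for _ in range(steps):
        lookahead = theta - gamma * velocity  # where momentum alone would take us
        velocity = gamma * velocity + lr * grad(lookahead)
        theta = theta - velocity
    return theta

# Hypothetical loss J(theta) = (theta - 3)^2.
theta_nag = nesterov_descent(lambda t: 2.0 * (t - 3.0), theta=0.0)
print(theta_nag)
```

Because the gradient is measured after the provisional momentum move, the update can start slowing down before it overshoots a minimum.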

Adagrad


The derivative of the loss function with respect to the given parameters at time step t.


Parameter update for a given parameter i at time/iteration t
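The per-parameter update can be sketched as follows, assuming the standard Adagrad rule in which each parameter's step is divided by the square root of its accumulated squared gradients. The two-parameter toy loss, the learning rate, and the function name are assumptions for the example.

```python
import math

def adagrad(grad, theta, lr=0.5, eps=1e-8, steps=500):
    """Adagrad over a list of parameters; each index i gets its own step size."""
    theta = list(theta)
    sq_sum = [0.0] * len(theta)  # running sum of squared gradients, per parameter
    for _ in range(steps):
        g = grad(theta)
        for i in range(len(theta)):
            sq_sum[i] += g[i] ** 2
            theta[i] -= lr * g[i] / math.sqrt(sq_sum[i] + eps)
    return theta

# Hypothetical loss J(a, b) = (a - 1)^2 + 10 * (b + 2)^2 with very different curvatures.
theta_adagrad = adagrad(lambda t: [2.0 * (t[0] - 1.0), 20.0 * (t[1] + 2.0)], [0.0, 0.0])
print(theta_adagrad)
```

Since the accumulated sum only grows, the effective learning rate of each parameter only shrinks over time.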

AdaDelta


Update the parameters
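A minimal scalar sketch of the AdaDelta update, assuming the usual formulation: a decaying average of squared gradients replaces Adagrad's ever-growing sum, and a decaying average of squared updates takes the place of a hand-set learning rate. The decay `rho`, `eps`, and the toy loss are assumed values for illustration.

```python
import math

def adadelta(grad, theta, rho=0.95, eps=1e-6, steps=100):
    """Scalar AdaDelta: the step size is the ratio of two running RMS values."""
    avg_sq_grad = 0.0   # decaying average of squared gradients
    avg_sq_delta = 0.0  # decaying average of squared parameter updates
    for _ in range(steps):
        g = grad(theta)
        avg_sq_grad = rho * avg_sq_grad + (1.0 - rho) * g * g
        delta = -math.sqrt(avg_sq_delta + eps) / math.sqrt(avg_sq_grad + eps) * g
        avg_sq_delta = rho * avg_sq_delta + (1.0 - rho) * delta * delta
        theta = theta + delta
    return theta

# Hypothetical loss J(theta) = (theta - 3)^2.
theta_one_step = adadelta(lambda t: 2.0 * (t - 3.0), 0.0, steps=1)
theta_adadelta = adadelta(lambda t: 2.0 * (t - 3.0), 0.0, steps=100)
print(theta_one_step, theta_adadelta)
```

With a small `eps` the first steps are tiny, so on this toy problem the parameter moves toward the minimum slowly at first.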

Adam


First- and second-order moment estimates of the gradient


Update the parameters
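Putting the moment estimates and the parameter update together, a scalar Adam step might look like the sketch below. It assumes the standard formulation with bias-corrected estimates; the hyperparameter values are the commonly used defaults except for `lr`, and the toy loss is an assumption for the example.

```python
import math

def adam(grad, theta, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8, steps=1000):
    """Scalar Adam with bias-corrected first and second moment estimates."""
    m = 0.0  # decaying average of gradients (first moment)
    v = 0.0  # decaying average of squared gradients (second moment)
    for t in range(1, steps + 1):
        g = grad(theta)
        m = beta1 * m + (1.0 - beta1) * g
        v = beta2 * v + (1.0 - beta2) * g * g
        m_hat = m / (1.0 - beta1 ** t)  # correct the bias toward zero at small t
        v_hat = v / (1.0 - beta2 ** t)
        theta = theta - lr * m_hat / (math.sqrt(v_hat) + eps)
    return theta

# Hypothetical loss J(theta) = (theta - 3)^2.
theta_first = adam(lambda t: 2.0 * (t - 3.0), 0.0, steps=1)
theta_adam = adam(lambda t: 2.0 * (t - 3.0), 0.0, steps=1000)
print(theta_first, theta_adam)
```

Thanks to the bias correction, the very first step has a magnitude of roughly `lr` regardless of how large the gradient is.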

Comparison between various optimizers


Comparison 1


Comparison 2

Conclusions

 
