Which multiplicative and additive factors should I use for an adaptive learning rate in neural networks?

Posted on 2024-12-03 15:52:25

I am new to neural networks and, to get a grip on the matter, I have implemented a basic feed-forward MLP, which I currently train through back-propagation. I am aware that there are more sophisticated and better ways to do that, but in Introduction to Machine Learning they suggest that, with one or two tricks, basic gradient descent can be effective for learning from real-world data. One of the tricks is an adaptive learning rate.

The idea is to increase the learning rate by a constant value a when the error gets smaller, and decrease it by a fraction b of the learning rate when the error gets larger. So basically the learning rate change is determined by:

+(a)

if we're learning in the right direction, and

-(b * <learning rate>)

if we're ruining our learning. However, the book gives no advice on how to set these parameters. I don't expect a precise suggestion, since parameter tuning is a whole topic of its own, but at least a hint at their order of magnitude would help. Any ideas?
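To make the rule concrete, here is a minimal Python sketch of the update as I understand it. The function name `adapt_learning_rate`, the error sequence, and the values of `a` and `b` are placeholders I made up purely for illustration; they are not taken from the book and are not meant as recommendations. I also assume that an unchanged error counts as "not improving".

```python
def adapt_learning_rate(lr, prev_error, curr_error, a, b):
    """Return the learning rate after one adaptation step.

    The rate grows by the constant `a` when the error decreased,
    and shrinks by the fraction `b` of itself otherwise.
    Assumption: an unchanged error is treated like an increase.
    """
    if curr_error < prev_error:
        return lr + a          # additive increase: +(a)
    return lr - b * lr         # multiplicative decrease: -(b * lr)


# Hypothetical values, chosen only to exercise both branches.
lr = 0.1
errors = [1.0, 0.8, 0.9, 0.7]  # made-up per-epoch errors
for prev_error, curr_error in zip(errors, errors[1:]):
    lr = adapt_learning_rate(lr, prev_error, curr_error, a=0.01, b=0.1)
```

With this form, repeated increases grow the rate linearly while repeated decreases shrink it geometrically, which is why the relative size of `a` and `b` matters.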

Thank you,
Tunnuz
