在Pytorch中实施不可能的培训损失

发布于 2025-02-04 04:45:39 字数 751 浏览 4 评论 0原文

我正在尝试实施本研究论文中提出的不可能的训练损失：。此损失是负模型损失（NLLLOSS）的更新版本。

这种损失的主要思想是，它在培训过程中避免了不需要的令牌。

这是我的代码：

def NLLLoss(logs, targets, c, alpha=0.1):
    out = torch.zeros_like(targets, dtype=torch.float)
    for i in range(len(targets)):
        # out[i] = logs[i][targets[i]] # The original implementation
        out[i] = alpha * (1 - logs[i][c[i]]) * logs[i][targets[i]]
    return -out.sum()/len(out)

注释的行是原始的NLLLOSS实现。这个代码很好，但是我想知道，此实现正确吗？

原文

I am trying to implement the Unlikelihood Training loss that was proposed in this research paper: NEURAL TEXT DEGENERATION WITH UNLIKELIHOOD TRAINING. This loss is an updated version of the negative log-likelihood loss (NLLLOSS).

The main idea of this loss is that it avoids unwanted tokens during the training process.

This is my code:

def NLLLoss(logs, targets, c, alpha=0.1):
    out = torch.zeros_like(targets, dtype=torch.float)
    for i in range(len(targets)):
        # out[i] = logs[i][targets[i]] # The original implementation
        out[i] = alpha * (1 - logs[i][c[i]]) * logs[i][targets[i]]
    return -out.sum()/len(out)

The commented line is the original NLLLoss implementation. This code well, but I was wondering, is this implementation correct?

分享到QQ

分享到微博