BCELoss，BCEWithLogitsLoss和CrossEntropyLoss

这篇具有很好参考价值的文章主要介绍了BCELoss，BCEWithLogitsLoss和CrossEntropyLoss。希望对大家有所帮助。如果存在错误或未考虑完全的地方，请大家不吝赐教，您也可以点击"举报违法"按钮提交疑问。

二分类

1. BCELoss

2. BCEWithLogitsLoss

多分类

1. CrossEntropyLoss

举例

二分类

两个损失：BCELoss，BCEWithLogitsLoss

1. BCELoss

输入：（[B,C], [B,C]）,代表（prediction,target）的维度，其中，B是Batchsize，C为样本的class，即样本的类别数。

输出：一个标量

等价于：BCELoss + sigmoid

import torch
from torch import nn

input = torch.randn(3) # (3,1) 随机生成一个输入，没有被sigmoid。
print(input)
print(input.shape)
target=torch.Tensor([0., 1., 1.])
loss1=nn.BCELoss()
print("BCELoss:",loss1(torch.sigmoid(input), target))#需要sigmod


输出：
BCELoss: tensor(1.0053)

2. BCEWithLogitsLoss

输入：（[B,C], [B,C]）,输出：一个标量

import torch
from torch import nn

input = torch.randn(3) # (3,1) 随机生成一个输入，没有被sigmoid。
print(input)
print(input.shape)
target=torch.Tensor([0., 1., 1.])
loss2=nn.BCEWithLogitsLoss()
print("BCEWithLogitsLoss:",loss2(input,target))#不需要sigmoid


输出：
BCEWithLogitsLoss: tensor(1.0053)

多分类

1. CrossEntropyLoss

输入：（[B,C], [B]） 输出：一个标量（这个minibatch的mean/sum的loss）

nn.CrossEntropyLoss计算过程：
input: logits(未经过softmax的模型的"输出”)

softmax(input)
-log(softmax(input))
用target做选择提取(关于logsoftmax)· mean

等价于：nn.CrossEntropyLoss = nn.NLLLoss(nn.LogSoftmax)

import torch
from torch import nn

loss2 = nn.CrossEntropyLoss(reduction="none")
target2 = torch.tensor([0, 1, 2])
predict2 = torch.tensor([[0.9, 0.2, 0.8], [0.5, 0.2, 0.4], [0.4, 0.2, 0.9]])
print(predict2.shape) # torch.Size([3, 3])
print(target2.shape) # torch.Size([3])
print(loss2(predict2, target2))

# #结果计算为：
# tensor([0.8761, 1.2729, 0.7434])

举例

1. BCEWithLogitsLoss计算ACC和Loss：

参考：https://github.com/Loche2/IMDB_RNN/blob/master/training.py

criterion = nn.BCEWithLogitsLoss()
# 计算准确率
def binary_accuracy(predicts, y):
    rounded_predicts = torch.round(torch.sigmoid(predicts))
    correct = (rounded_predicts == y).float()
    accuracy = correct.sum() / len(correct)
    return accuracy


# 训练
def train(model, iterator, optimizer, criterion):
    model.train()
    epoch_loss = 0
    epoch_accuracy = 0
    for batch in tqdm(iterator, desc=f'Epoch [{epoch + 1}/{EPOCHS}]', delay=0.1):
        optimizer.zero_grad()
        predictions = model(batch.text[0]).squeeze(1)
        loss = criterion(predictions, batch.label)
        accuracy = binary_accuracy(predictions, batch.label)
        loss.backward()
        optimizer.step()
        epoch_loss += loss.item()
        epoch_accuracy += accuracy.item()
    return epoch_loss / len(iterator), epoch_accuracy / len(iterator)

2. 计算ACC和Loss

# 截取情感分析部分代码 
    criterion = nn.CrossEntropyLoss()
    total_loss = 0.0
    correct_predictions = 0
    total_predictions = 0
   for batch in train_loader:
        input_ids = batch['input_ids'].to(device)
        labels = batch['label'].to(device)

        optimizer.zero_grad()

        logits = model(input_ids)
        loss_sentiment = criterion(logits, labels.long())
       
        loss_sentiment.backward()
        optimizer.step()

        total_loss += loss_sentiment.item()

        # get sentiment accuracy
        predicted_labels = torch.argmax(logits, dim=1)
        correct_predictions += torch.sum(predicted_labels == labels).item()
        total_predictions += labels.size(0)

    accuracy = correct_predictions / total_predictions
    loss = total_loss / len(train_loader)

也可以直接看github上别人写的例子：https://github.com/songyouwei/ABSA-PyTorch/blob/master/train.py

参考：

深刻剖析与实战BCELoss详解（主）和BCEWithLogitsLoss（次）以及与普通CrossEntropyLoss的区别（次）-CSDN博客

另外提出一个问题：

二分类必须用BCEWithLogitsLoss吗，也可以用CrossEntropyLoss吧？

(1)如果用CrossEntropyLoss的话，只要让网络的fc层为nn.Linear(hidden_size, 2)就行，这样就和多分类一样算。另外CrossEntropyLoss里面包含了softmax，所以在计算loss的时候也不需要过softmax再算loss.

(2)如果用BCEWithLogitsLoss的话，就按照上面举例中BCEWithLogitsLoss计算Loss，只是如上面代码可是，再计算Acc的时候将predict使用sigimoid缩放到0，1来计算预测正确的个数。

注：仅供学习记录，理解或者学习有误请与我联系

参考问题：二分类问题，应该选择sigmoid还是softmax？ - 知乎文章来源地址https://www.toymoban.com/news/detail-846075.html

到了这里，关于BCELoss，BCEWithLogitsLoss和CrossEntropyLoss的文章就介绍完了。如果您还想了解更多内容，请在右上角搜索TOY模板网以前的文章或继续浏览下面的相关文章，希望大家以后多多支持TOY模板网！

Toy模板网

BCELoss，BCEWithLogitsLoss和CrossEntropyLoss

二分类

1. BCELoss

2. BCEWithLogitsLoss

多分类

1. CrossEntropyLoss

举例

觉得文章有用就打赏一下文章作者

支付宝扫一扫打赏

微信扫一扫打赏

支付宝扫一扫领取红包，优惠每天领

二维码1

二维码2

BCELoss，BCEWithLogitsLoss和CrossEntropyLoss

二分类

1. BCELoss

2. BCEWithLogitsLoss

多分类

1. CrossEntropyLoss

举例

相关文章

nn.BCEWithLogitsLoss中weight参数和pos_weight参数的作用及用法

【PyTorch】进阶学习：探索BCEWithLogitsLoss的正确使用---二元分类问题中的logits与标签形状问题

成功解决使用BCEWithLogitsLoss时ValueError: Target size (torch.Size([4])) must be the same as input size (to

torch.nn.BCELoss

pytorch中BCELoss 和 binary_cross_entropy_with_logits之间的区别

觉得文章有用就打赏一下文章作者

支付宝扫一扫打赏

微信扫一扫打赏

支付宝扫一扫领取红包，优惠每天领

二维码1

二维码2