In PyTorch, how can I one-hot encode a grayscale image like this for semantic segmentation?
I'm using the pretrained DeepLabV3 model for image segmentation, and it gives me an output with shape BxCxWxH, where B=batch size, C=number of classes, W=width and H=height. If I take the depth-wise argmax of this output, I get a WxH result where every pixel represents a class. For this output, I have a grayscale image as the label, with WxH shape. However, the pixel values in the grayscale label image are not in the range of 0 to the number of classes, but roughly 0.0xx to 0.2, so I can't use it to calculate the loss. To do that, I have to one-hot encode the label image, but I don't know how.
For example, the label image has the following values:
tensor([[[0.0824, 0.0824, 0.0824, ..., 0.0431, 0.0431, 0.0317],
[0.0824, 0.0824, 0.0824, ..., 0.0431, 0.0431, 0.0317],
[0.0824, 0.0824, 0.0824, ..., 0.0431, 0.0431, 0.0317],
...,
[0.0275, 0.0275, 0.0275, ..., 0.0275, 0.0275, 0.0317],
[0.0275, 0.0275, 0.0275, ..., 0.0275, 0.0275, 0.0317],
[0.0275, 0.0275, 0.0275, ..., 0.0275, 0.0275, 0.0317]]])
with 14152 unique pixel values. The size of the image is 1024x1024. How could I one-hot encode this image?
The Dataset is the KITTI Semantics Pixel level.
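One way to handle this, assuming each distinct float value corresponds to one class (e.g. a class ID divided by 255 when the PNG was loaded), is to map the unique pixel values to consecutive integer class indices and one-hot encode those. A minimal sketch (the small `label` tensor here is a made-up stand-in for the real 1024x1024 ground truth):

```python
import torch
import torch.nn.functional as F

# Hypothetical label tensor with arbitrary float values, standing in
# for the real grayscale ground-truth image.
label = torch.tensor([[0.0824, 0.0431],
                      [0.0275, 0.0317]])

# Map each unique float value to a consecutive class index.
values = label.unique()                   # sorted unique pixel values
indices = torch.bucketize(label, values)  # WxH tensor of class indices

# One-hot encode: WxHxC, then permute to CxWxH to match the model output.
one_hot = F.one_hot(indices, num_classes=len(values)).permute(2, 0, 1).float()
```

Note that if the loss is `nn.CrossEntropyLoss`, the `indices` tensor is all that's needed; it expects integer class indices as the target, not a one-hot tensor.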
1 Answer
Okay, so I finally found the problem: I had accidentally resized the input ground-truth image by passing it through a torchvision.transforms instance, turning it into an interpolated tensor with lots of spurious pixel values. Without this operation, I get the normal pixel values for the ground-truth image.
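The underlying issue is that the default (bilinear) interpolation blends neighboring class IDs into meaningless intermediate values. If a label map must be resized, nearest-neighbor interpolation preserves the original label values. A sketch of the difference, using a tiny made-up label map:

```python
import torch

# Tiny stand-in label map with three classes (0, 1, 2).
label = torch.tensor([[0, 0, 2, 2],
                      [0, 0, 2, 2],
                      [1, 1, 1, 1],
                      [1, 1, 1, 1]], dtype=torch.float32)

# Nearest-neighbor downsampling keeps only values that already exist
# in the label map; bilinear would produce blended in-between values.
resized = torch.nn.functional.interpolate(
    label[None, None],      # add batch and channel dims: 1x1xHxW
    size=(2, 2),
    mode="nearest",
).squeeze().long()          # back to HxW integer class indices
```

With torchvision transforms, the equivalent is passing `interpolation=transforms.InterpolationMode.NEAREST` to `transforms.Resize` for the label images, while keeping the default interpolation for the input images.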