Training loop does not start with num_workers > 1 in the DataLoader

Posted on 2025-01-26 13:56:07

I have an NLP classification problem. My DataLoader is created like this:

train_patentload = DataLoader(train_patentset, batch_size=4, shuffle=True, num_workers=2)

When I run the training loop, it hangs and never starts, although the same code runs normally when num_workers=2 is removed. I have been stuck on this for a while and would appreciate any help.
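Since the hang only appears with worker processes, it may help to report how those processes would be started. The following is only a diagnostic sketch, not a fix: describe_worker_setup is a hypothetical helper name, and the TOKENIZERS_PARALLELISM check assumes the tokenizer comes from the Hugging Face libraries, which the code in this question does not confirm.

```python
import multiprocessing as mp
import os
import sys


def describe_worker_setup():
    """Collect settings that influence how DataLoader worker processes start."""
    return {
        # "fork", "spawn" or "forkserver"; the default differs by platform
        "start_method": mp.get_start_method(),
        # True in a REPL or Jupyter kernel, where classes are defined in __main__
        "interactive": hasattr(sys, "ps1") or "ipykernel" in sys.modules,
        # None if unset; assumption: the tokenizer is a Hugging Face one,
        # whose library reads this environment variable
        "tokenizers_parallelism": os.environ.get("TOKENIZERS_PARALLELISM"),
    }


if __name__ == "__main__":
    for key, value in describe_worker_setup().items():
        print(f"{key}: {value}")
```

With the "spawn" start method, the worker processes re-import the main module, so the code that iterates over the DataLoader normally needs to sit under an if __name__ == "__main__": guard. Including this output in the question would make the setup easier to reproduce.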

The code of the Dataset:

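import numpy as np
import torch
from torch.utils.data import Dataset

class PatentDataset(Dataset):
    def __init__(self, df):
        self.df = df

    def __len__(self):
        # Use the dataframe stored on the instance, not a global named df
        return len(self.df)

    def __getitem__(self, ind):
        # Map each score to a 5-element cumulative label vector
        conv_dict = {0 : [1., 0., 0., 0., 0.], 0.25 : [1., 1., 0., 0., 0.], 0.5 : [1., 1., 1., 0., 0.], 0.75 : [1., 1., 1., 1., 0.], 1 : [1., 1., 1., 1., 1.]}
        # Text columns are everything between the first and last column
        inputs = self.df.iloc[ind, 1:-1].to_list()
        text = ' #^& '.join(inputs)
        label = np.array(conv_dict[self.df.iloc[ind, -1]])
        label = torch.as_tensor(label)
        # tokenizer is defined elsewhere (not shown in this question)
        text = tokenizer(text, padding='max_length', max_length = 256, truncation=True, return_tensors="pt")
        return text, label

train_patentset = PatentDataset(train)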

where train is the dataframe.
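
For readers following the label handling: conv_dict in __getitem__ is a cumulative encoding, where a score of k/4 becomes k+1 leading ones in a vector of length five. A minimal standalone sketch of the same mapping is below. It assumes scores are always one of 0, 0.25, 0.5, 0.75 or 1. The name encode_score and the num_levels parameter are hypothetical and not part of the code above.

```python
def encode_score(score, num_levels=5):
    """Return the cumulative label vector for a score in {0, 0.25, 0.5, 0.75, 1}."""
    steps = num_levels - 1
    level = round(score * steps)  # 0 -> 0, 0.25 -> 1, ..., 1 -> 4
    # Reject anything that is not exactly one of the expected scores
    if not 0 <= level <= steps or level / steps != score:
        raise ValueError(f"unexpected score: {score!r}")
    return [1.0] * (level + 1) + [0.0] * (steps - level)


if __name__ == "__main__":
    print(encode_score(0.5))  # [1.0, 1.0, 1.0, 0.0, 0.0]
```

Computing the vector this way gives the same result as the dictionary lookup without rebuilding the dictionary on every __getitem__ call, and it raises an error on an unexpected score.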
