PyTorch Temporal Fusion Transformer 预测输出长度

发布于 2025-01-17 13:16:02 字数 872 浏览 4 评论 0原文

我已经在一些训练数据上训练了时间融合变压器，并希望对一些看不见的数据进行预测。为此，我使用 pytorch_forecasting TimeSeriesDataSet 数据结构

testing = TimeSeriesDataSet.from_dataset(training, df[lambda x: x.year >validation_cutoff] ，predict=True，stop_randomization=True)

鉴于

df[lambda x: x.year > validation_cutoff].shape
(97036, 13)

我希望

testing.data['reals'].shape
torch.Size([97036, 9])

收到包含 97036 行的预测输出向量。因此，我继续生成我的预测，如下所示

test_dataloader = testing.to_dataloader(train=False, batch_size=128 * 10, num_workers=0)
raw_predictions, x = best_tft.predict(testing, mode="raw", return_x=True)

但是，我收到的输出大小

raw_predictions['prediction'].shape
torch.Size([25476, 1, 7])

为为什么其中一些 97036 个观测值被删除？

否则，我如何找出这 97036 个观测值中哪些被删除以及为什么被删除？

原文

I have trained a temporal fusion transformer on some training data and would like to predict on some unseen data. To do so, I'm using the pytorch_forecasting TimeSeriesDataSet data structures

testing = TimeSeriesDataSet.from_dataset(training, df[lambda x: x.year > validation_cutoff], predict=True, stop_randomization=True)

with

df[lambda x: x.year > validation_cutoff].shape
(97036, 13)

Given that

testing.data['reals'].shape
torch.Size([97036, 9])

I would expect to receive a prediction output vector containing 97036 rows. So I proceed to generate my predictions like so

test_dataloader = testing.to_dataloader(train=False, batch_size=128 * 10, num_workers=0)
raw_predictions, x = best_tft.predict(testing, mode="raw", return_x=True)

However, I receive an output of the size

raw_predictions['prediction'].shape
torch.Size([25476, 1, 7])

Why are some of these 97036 observations being removed?

Or else, how can I find out which if these 97036 observations are being dropped and why the are being removed?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

阳光①夏 2025-01-24 13:16:02

摆脱mode =“ raw”，以便在max_prediction horizon范围内获取预测。它将为每个组和max_prediction Horizon的每一行和列提供一个预测。

torch.Size([25476, 1, 7])

根据测试集的日期范围，这一次，每次在测试集上都会给出一个预测。

Get rid of mode="raw" in order to get a forecast on the max_prediction horizon range. It is going to give one forecast for each individual row of group and columns of max_prediction horizon.

torch.Size([25476, 1, 7])

This gives one prediction, per one granular group, at a time on the test set, depending on the date range of the test set.

回复收藏 0 原文

埋情葬爱 2025-01-24 13:16:02

在 TimeSeriesDataSet 的源代码中，有一些过滤器可以删除短时间序列。当您在 TimeSeriesDataSet.from_dataset 中设置 predict=True 时，它会将 min_prediction_length 设置为 max_prediction_length。然后，当要创建实际的测试数据加载器时，所有短于 min_prediction_length 的时间序列都会被删除，这会从测试集中删除整个数据，从而留下恰好 0 个观测值。到底为什么要这样实现，我不知道。要进行预测，只需设置：

testing = TimeSeriesDataSet.from_dataset(training, df[lambda x: x.year > validation_cutoff], predict=False, stop_randomization=True)

In the source code of the TimeSeriesDataSet there are filters to remove short time series. When you set predict=True in TimeSeriesDataSet.from_dataset, it sets the min_prediction_length to max_prediction_length. Then, when the actual test dataloader is to be created, all of the time series that are shorter than min_prediction_length are removed, which removes the entire data from the testing set, which leaves you with exactly 0 observations. Exactly why it is implemented in this way, I don't know. To make predictions just set:

testing = TimeSeriesDataSet.from_dataset(training, df[lambda x: x.year > validation_cutoff], predict=False, stop_randomization=True)

回复收藏 0 原文

~没有更多了~

关于作者

还不是爱你

暂无简介

文章

26 人气

关注发私信

友情链接

文江博客

PyTorch Temporal Fusion Transformer 预测输出长度

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

alipaysp_snBf0MSZIv

梦断已成空

瞎闹

凯凯我们等你回来

寄意

似梦非梦

友情链接

PyTorch Temporal Fusion Transformer 预测输出长度

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

alipaysp_snBf0MSZIv

梦断已成空

瞎闹

凯凯我们等你回来

寄意

似梦非梦

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。