将内存 blob 分配给 py-torch 输出张量（C++ API）

发布于 2025-01-19 23:41:42 字数 1450 浏览 5 评论 0原文

我正在使用Py-torch培训线性模型，并将其保存到带有“保存”功能调用的文件中。我还有另一个代码将模型加载到C ++中并执行推理。我想指示火炬CPP库在最终输出张量上使用特定的内存斑点。这甚至可能吗？如果是，怎么样？在下面，您可以看到我要实现的一小部分。

#include <iostream>
#include <memory>

#include <torch/script.h>

int main(int argc, const char* argv[]) {
  if (argc != 3) {
    std::cerr << "usage: example-app <path-to-exported-script-module>\n";
    return -1;
  }
  long numElements = (1024*1024)/sizeof(float) * atoi(argv[2]);

  float *a = new float[numElements]; 
  float *b = new float[numElements];
  float *c = new float[numElements*4];

  for (int i = 0; i < numElements; i++){
    a[i] = i;
    b[i] = -i;
  }

  //auto options = torch::TensorOptions().dtype(torch::kFloat64);
  at::Tensor a_t = torch::from_blob((float*) a, {numElements,1});
  at::Tensor b_t = torch::from_blob((float*) b, {numElements,1});
  at::Tensor out = torch::from_blob((float*) c, {numElements,4});

  at::Tensor c_t = at::cat({a_t,b_t}, 1);
  at::Tensor d_t = at::reshape(c_t, {numElements,2});

  torch::jit::script::Module module;
  try {
    module = torch::jit::load(argv[1]);
  }
  catch (const c10::Error& e) {
    return -1;
  }


  out =  module.forward({d_t}).toTensor();
  std::cout<< out.sizes() << "\n";

  delete [] a;
  delete [] b;
  delete [] c;

  return 0;
}

因此，我将内存分配到“ C”中，然后从此内存中施放张量。我将此内存存储在名为“ Out”的张量中。当我调用正向方法时，我加载模型。我观察到所得数据被复制/移至“ OUT”张量。但是，我想指示火炬直接存储在“输出”内存中。这可能吗？

原文

I am training a linear model using py-torch and I am saving it to a file with the "save" function call. I have another code that loads the model in C++ and performs inference.
I would like to instruct the Torch CPP Library to use a specific memory blob at the final output tensor. Is this even possible? If yes, how? Below you can see a small example of what I am trying to achieve.

#include <iostream>
#include <memory>

#include <torch/script.h>

int main(int argc, const char* argv[]) {
  if (argc != 3) {
    std::cerr << "usage: example-app <path-to-exported-script-module>\n";
    return -1;
  }
  long numElements = (1024*1024)/sizeof(float) * atoi(argv[2]);

  float *a = new float[numElements]; 
  float *b = new float[numElements];
  float *c = new float[numElements*4];

  for (int i = 0; i < numElements; i++){
    a[i] = i;
    b[i] = -i;
  }

  //auto options = torch::TensorOptions().dtype(torch::kFloat64);
  at::Tensor a_t = torch::from_blob((float*) a, {numElements,1});
  at::Tensor b_t = torch::from_blob((float*) b, {numElements,1});
  at::Tensor out = torch::from_blob((float*) c, {numElements,4});

  at::Tensor c_t = at::cat({a_t,b_t}, 1);
  at::Tensor d_t = at::reshape(c_t, {numElements,2});

  torch::jit::script::Module module;
  try {
    module = torch::jit::load(argv[1]);
  }
  catch (const c10::Error& e) {
    return -1;
  }


  out =  module.forward({d_t}).toTensor();
  std::cout<< out.sizes() << "\n";

  delete [] a;
  delete [] b;
  delete [] c;

  return 0;
}

So, I am allocating memory into "c" and then I am casting creating a tensor out of this memory. I store this memory into a tensor named "out". I load the model when I call the forward method. I observe that the resulted data are copied/moved into the "out" tensor. However, I would like to instruct Torch to directly store into "out" memory. Is this possible?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

聚集的泪 2025-01-26 23:41:42

在 libtorch 源代码中的某个地方（我不记得在哪里，我会尝试找到该文件），有一个类似于下面的运算符（注意最后一个 &&），

torch::tensor& operator=(torch::Tensor rhs) &&;

如果我没记错的话，它可以满足您的需要。基本上，torch 假设如果您将张量 rhs 分配给右值引用张量，那么您实际上意味着将 rhs 复制到底层存储中。

所以在你的情况下，那就是

std::move(out) = module.forward({d_t}).toTensor();

或

torch::from_blob((float*) c, {numElements,4}) = module.forward({d_t}).toTensor();

Somewhere in libtorch source code (I don' remember where, I'll try to find the file), there is an operator which is something like below (notice the last &&)

torch::tensor& operator=(torch::Tensor rhs) &&;

and which does what you need if I remember correctly. Basically torch assumes that if you allocate a tensor rhs to an rvalue reference tensor, then you actually mean to copy rhs into the underlying storage.

So in your case, that would be

std::move(out) = module.forward({d_t}).toTensor();

torch::from_blob((float*) c, {numElements,4}) = module.forward({d_t}).toTensor();

回复收藏 0 原文

~没有更多了~

关于作者

羅雙樹

暂无简介

文章

28 人气

关注发私信

友情链接

文江博客

将内存 blob 分配给 py-torch 输出张量（C++ API）

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

helenabai_sz

993438968

若能看破又如何

情未る

纪平伟

bobowiki

友情链接

将内存 blob 分配给 py-torch 输出张量（C++ API）

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

helenabai_sz

993438968

若能看破又如何

情未る

纪平伟

bobowiki

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。