How should I use Celery when the task result is large?



What's the best way to handle tasks executed in Celery where the result is large? I'm thinking of things like table dumps and the like, where I might be returning data in the hundreds of megabytes.

I'm thinking that the naive approach of cramming the message into the result database is not going to serve me here, much less if I use AMQP for my result backend. However, for some of these exports latency is an issue: depending on the particular instance of the export, sometimes I have to block until it returns and emit the export data directly from the task client (an HTTP request came in for the export content, it doesn't exist yet, and it must be provided in the response to that request ... no matter how long that takes).

So, what's the best way to write tasks for this?
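
For context, the blocking case I have in mind looks roughly like this (a sketch assuming Django for the HTTP layer; dump_table and export_view are hypothetical names, not real code I have):

from celery import shared_task
from django.http import HttpResponse  # assuming Django for the HTTP layer

@shared_task
def dump_table(table_name):
    # Placeholder: the real task would produce the (potentially
    # multi-hundred-megabyte) export payload here.
    return '-- dump of %s' % table_name

def export_view(request, table_name):
    # Fire the task, then block until the worker finishes, however
    # long that takes, and emit the data straight in the response.
    result = dump_table.delay(table_name)
    data = result.get()  # waits on the result backend
    return HttpResponse(data, content_type='application/octet-stream')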


拥抱影子 2024-10-10 20:36:23


One option would be to have a static HTTP server running on all of your worker machines. Your task can then dump the large result to a unique file in the static root and return a URL reference to the file. The receiver can then fetch the result at its leisure.

E.g., something vaguely like this:

import socket
from celery import shared_task  # `@task` in older Celery releases

@shared_task
def dump_db(db):
    # Some code to dump the DB to /srv/http/static/<db>.sql,
    # then return a URL pointing at that file on this worker.
    return 'http://%s/%s.sql' % (socket.gethostname(), db)

You would of course need some means of reaping old files, as well as guaranteeing uniqueness, and probably other issues, but you get the general idea.
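
To sketch the uniqueness and retrieval side (the uuid-based naming and the fetch helper are my assumptions, not part of the original answer):

import uuid
import urllib.request

def unique_dump_path(db):
    # e.g. /srv/http/static/mydb-3f2a....sql -- avoids collisions when
    # the same database is exported more than once.
    return '/srv/http/static/%s-%s.sql' % (db, uuid.uuid4().hex)

def fetch_result(url, dest):
    # Stream in chunks so a multi-hundred-megabyte dump never has to
    # fit in memory on the receiving side.
    with urllib.request.urlopen(url) as resp, open(dest, 'wb') as out:
        while True:
            chunk = resp.read(1 << 20)  # 1 MiB at a time
            if not chunk:
                break
            out.write(chunk)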

变身佩奇 2024-10-10 20:36:23


I handle this by structuring my app to write the multi-megabyte results into files, which I then mmap into memory so they are shared among all processes that use that data... This totally finesses the question of how to get the results to another machine, but if the results are that large, it sounds like these tasks are internal tasks coordinating between server processes.
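
A minimal sketch of that pattern, using Python's mmap module (the file path is illustrative, not from the answer):

import mmap

RESULT_PATH = '/var/tmp/export-result.bin'  # hypothetical location

def write_result(data):
    # Producer: write the multi-megabyte result to disk once.
    with open(RESULT_PATH, 'wb') as f:
        f.write(data)

def read_result():
    # Consumers: map the file read-only; the OS shares the pages among
    # all processes that map it, so nothing is copied per process.
    with open(RESULT_PATH, 'rb') as f:
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as m:
            return bytes(m)  # or slice lazily instead of copying it all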
