如何知道一组 RabbitMQ 任务何时完成？

发布于 2024-12-09 12:02:00 字数 816 浏览 4 评论 0原文

我正在使用 RabbitMQ 让工作进程对视频文件进行编码。我想知道所有文件何时完成 - 即所有工作进程何时完成。

我能想到的唯一方法是使用数据库。当视频完成编码时：

UPDATE videos SET status = 'complete' WHERE filename = 'foo.wmv'
-- etc etc etc as each worker finishes --

然后检查是否所有视频都已编码：

SELECT count(*) FROM videos WHERE status != 'complete'

但是如果我要这样做，那么我觉得我正在失去 RabbitMQ 作为多个分布式工作进程机制的好处，因为我仍然需要手动维护数据库队列。

RabbitMQ 依赖项有标准机制吗？也就是说，“等待这 5 个任务完成，一旦完成，然后开始一个新任务？”

我不想让父进程将这些任务添加到队列中，然后“等待”每个任务返回“已完成”状态。然后我必须为每组视频维护一个单独的进程，此时与单个线程池概念相比，我已经失去了解耦工作进程的优势。

我是否在要求一些不可能的事情？或者，是否有广泛采用的标准解决方案来管理我错过的队列中任务的整体状态？

编辑：搜索后，我发现了这个类似的问题：获取结果使用 RabbitMQ 进行长时间运行的任务

人们对此有什么特别的想法吗？

原文

I am using RabbitMQ to have worker processes encode video files. I would like to know when all of the files are complete - that is, when all of the worker processes have finished.

The only way I can think to do this is by using a database. When a video finishes encoding:

UPDATE videos SET status = 'complete' WHERE filename = 'foo.wmv'
-- etc etc etc as each worker finishes --

And then to check whether or not all of the videos have been encoded:

SELECT count(*) FROM videos WHERE status != 'complete'

But if I'm going to do this, then I feel like I am losing the benefit of RabbitMQ as a mechanism for multiple distributed worker processes, since I still have to manually maintain a database queue.

Is there a standard mechanism for RabbitMQ dependencies? That is, a way to say "wait for these 5 tasks to finish, and once they are done, then kick off a new task?"

I don't want to have a parent process add these tasks to a queue and then "wait" for each of them to return a "completed" status. Then I have to maintain a separate process for each group of videos, at which point I've lost the advantage of decoupled worker processes as compared to a single ThreadPool concept.

Am I asking for something which is impossible? Or, are there standard widely-adopted solutions to manage the overall state of tasks in a queue that I have missed?

Edit: after searching, I found this similar question: Getting result of a long running task with RabbitMQ

Are there any particular thoughts that people have about this?

分享到QQ

分享到微博