Java-高效的调度结构?
我对这个问题的长度表示歉意,但我认为包含足够的细节很重要,因为我正在寻找解决问题的合适方法,而不是简单的代码建议!
一般描述:
我正在开发一个项目,该项目需要能够以某个相对重复间隔“安排”任务。
这些间隔以某个内部时间表示,表示为随着程序执行而递增的整数(所以不等于实时)。每次发生这种情况时,都会询问计划以检查是否有在此时间步执行的任何任务。
如果执行了任务,则应重新安排其在相对于当前时间的位置再次运行(例如,以 5 个时间步长)。该相对位置简单地存储为任务对象的整数属性。
问题:
我正在努力决定如何构建它 - 部分原因是它是一组稍微难以查找的搜索词。
就目前情况而言,我认为每次计时器递增时,我都需要:
- 在计划中的“0”位置执行任务
- 将这些任务重新添加到计划中的相对位置(例如,每 5 次重复的任务)步骤将返回到位置 5)
- 计划中的每组任务的“执行时间”都会减一(例如位置 1 的任务将移动到位置 0)
假设:
有几个假设可能会限制可能的我可以使用的解决方案:
- 间隔必须是相对的,而不是特定时间,并且被定义为距离当前时间的整数步数
- 这些间隔可以采用任何整数值,例如不受限制。。
- 多个任务可以安排在同一时间步长,但它们的执行顺序并不重要
- 所有执行都应保留在单个线程中 - 多线程解决方案不适用 由于其他限制而适合
我的主要问题是:
我如何设计此时间表以有效地工作?哪些数据类型/集合可能有用?
我应该考虑另一种结构/方法吗?
我是否错误地放弃了调度框架(例如 Quartz),它似乎在“实时”时域而不是“非实时”时域?
非常感谢您提供任何可能的帮助。如果有需要,请随时评论以获取更多信息,我会在需要的地方进行编辑!
I apologise for the length of this problem, but I thought it important to include sufficient detail given that I'm looking for a suitable approach to my problem, rather than a simple code suggestion!
General description:
I am working on a project that requires tasks being able to be 'scheduled' at some relative repeating interval.
These intervals are in terms of some internal time, that is represented as an integer that is incremented as the program executes (so not equal to real time). Each time this happens, the schedule will be interogated to check for any tasks due to execute at this timestep.
If a task is executed, it should then be rescheduled to run again at a position relative to the current time (e.g. in 5 timesteps). This relative position is simply stored as an integer property of the Task object.
The problem:
I am struggling somewhat to decide upon how I should structure this- partly because it is a slightly difficult set of search terms to look for.
As it stands, I am thinking that each time the timer is incremented I need to:
- Execute tasks at the '0' position in the schedule
- Re-add those tasks to the schedule again at their relative position (e.g. a task that repeats every 5 steps will be returned to the position 5)
- Each group of tasks in the schedule will have their 'time until execution' decremented one (e.g. a task at position 1 will move to position 0)
Assumptions:
There are a couple of assumptions that may limit the possible solutions I can use:
- The interval must be relative, not a specific time, and is defined to be an integer number of steps from the current time
- These intervals may take any integer value, e.g. are not bounded.
- Multiple tasks may be scheduled for the same timestep, but their order of execution is not important
- All execution should remain in a single thread- multi-threaded solutions are not suitable due to other constraints
The main questions I have are:
How could I design this Schedule to work in an efficient manner? What datatypes/collections may be useful?
Is there another structure/approach I should consider?
Am I wrong to dismiss scheduling frameworks (e.g. Quartz), which appear to work more in the 'real' time domain rather 'non-real' time domain?
Many thanks for any possible help. Please feel free to comment for further information if neccessary, I will edit wherever needed!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(7)
嗯,Quartz 是非常强大的工具,但是它的配置可能性有限,因此如果您需要特定功能,您应该编写自己的解决方案。
然而,研究Quartz源代码和数据结构是个好主意,因为它们已经成功地解决了很多问题,你会发现fg数据库级别的进程间同步、运行延迟任务等。
我'我曾经编写过自己的调度程序,该调度程序适用于 Quartz 可能不容易适应的任务,但是一旦我学习了 Quartz,我就了解了我的解决方案可以改进多少,知道它是如何在 Quartz 中完成的。
Well, Quartz is quite powerfull tools, however it has limited configuration possibilities, so if you need specific features, you should propably write your own solution.
However, it's a good idea to study the Quartz source code and data structures, because they have successfully dealt with much problems you would find f.g. inter-process synchronization on database level, running delayed tasks etc.
I've written once my own scheduler, which was adapted to tasks where propably Quartz would not be easy to adapt, but once I've learned Quartz I've understood how much I could improve in my solutions, knowing how it was done in Quartz.
怎么样,它使用您自己的 Ticks 和 executeNextInterval() :
您可能想添加一些错误处理,但它应该可以完成您的工作。
这里有一些单元测试:)
How about this, it uses your own Ticks with executeNextInterval() :
You might want to add some error handling, but it should do your job.
And here are some UnitTests for it :)
循环链表可能就是您正在寻找的数据结构。您无需递减每个任务元素中的字段,只需递增任务循环列表中“当前”字段的索引即可。伪代码结构可能看起来像这样:
任何时候你安排一个新任务,你只需将它添加到当前“tick”前面 N 个刻度的位置
A circular linked list might be the data structure you're looking for. Instead of decrementing fields in each task element, you simply increment the index of the 'current' field in the circular list of tasks. A pseudocode structure might look something like this:
any time you schedule a new task, you just add it in the position N ticks forward of the current 'tick'
这里有一些想法:
一切保持简单。如果您没有数百万个任务,则不需要优化的数据结构(除了骄傲或过早优化的冲动)。
避免相对时间。使用绝对内部刻度。如果添加任务,请将“下次运行”设置为当前刻度值。将其添加到列表中,按时间对列表进行排序。
查找任务时,从列表的开头开始,选择时间 <= 当前刻度的所有内容,运行该任务。
将所有这些任务收集到另一个列表中。全部运行完毕后,根据当前的刻度和增量计算“下次运行”(这样就不会得到循环的任务),将它们全部添加到列表中,排序。
Here are a couple of thoughts:
Keep everything simple. If you don't have millions of tasks, there is no need for an optimized data structure (except pride or the urge for premature optimization).
Avoid relative times. Use an absolute internal tick. If you add a task, set the "run next time" to the current tick value. Add it to the list, sort the list by time.
When looking for tasks, start at the head of the list and pick everything which has a time <= current tick, run the task.
Collect all those tasks in another list. After all have run, calculate the "run next time" based on the current tick and the increment (so you don't get tasks that loop), add all of them to the list, sort.
看一下
DelayQueue
使用PriorityQueue
来维护此类有序事件列表的方式。DelayQueue
实时工作,因此可以使用Condition
和LockSupport
中提供的可变定时等待方法。您可以实现类似SyntheticDelayQueue
的功能,其行为方式与DelayQueue
相同,但使用您自己的合成时间服务。显然,您必须替换 jdk 免费提供的定时等待/信号机制,而这对于高效地完成可能并不简单。Take a look at the way
DelayQueue
uses aPriorityQueue
to maintain such an ordered list of events.DelayQueue
works using real time and hence can use the variable timed wait methods available inCondition
andLockSupport
. You could implement something like aSyntheticDelayQueue
that behaves in the same way asDelayQueue
but uses your own synthetic time service. You would obviously have to replace the timed wait/signalling mechanisms that come for free with the jdk though and this might be non trivial to do efficiently.如果我必须这样做,我会创建一个简单的队列(链表变体)。该队列将包含一个动态数据结构(例如简单列表),其中包含需要完成的所有任务。在每个时间间隔(或时间步),进程读取队列的第一个节点,执行它在该节点的列表中找到的指令。在每次执行结束时,它将计算重新调度并将新执行添加到队列中的另一个节点,或者在将指令存储在该节点内之前创建直到该位置的节点。然后删除第一个节点,并在下一个时间步执行第二个节点(现在是第一个)。该系统也不需要跟踪任何整数,并且所需的所有数据结构都可以在 java 语言中找到。这应该可以解决你的问题。
If I had to do it, I'd create a simple queue ( linked list variant). This queue would contain a dynamic data structure (simple list for example) containing all the tasks that need to be done. At each time interval (or time-step), the process reads the first node of the queue, executes the instructions it finds in the list of that node. At the end of each execution it would compute the rescheduling and add the new execution to another node in the queue or create nodes up to that position before storing the instruction within that node. The first node is then removed and the second node (now the first) is executed at the next time-step. This system would also not require any integers to be kept track of and all data structures needed are found in the java language. This should solve your problem.
使用 ScheduledExecutorService。它内置了您需要的一切。使用起来非常简单:
Use a ScheduledExecutorService. It has everything you need built right in. Here's how simple it is to use: