Scheduling of processes waiting on a semaphore

Posted 2024-12-25 15:32:12

It is always said that when the count of a semaphore is 0, processes requesting the semaphore are blocked and added to a wait queue.
When some process releases the semaphore and the count increases from 0 to 1, a blocked process is activated. This can be any process, randomly picked from the blocked processes.

Now my question is:
If they are added to a queue, why is the activation of blocked processes NOT in FIFO order? I think it would be easier to pick the next process from the queue rather than picking a process at random and granting it the semaphore. If there is some idea behind this random logic, please explain it. Also, how does the kernel select a process at random from the queue? Getting a random element out of a queue is complex as far as the queue data structure is concerned.

Tags: various OSes, since each has a kernel (usually written in C++) and a mutex shares a similar concept.
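To make the model in the question concrete, here is a minimal user-space sketch (my own illustration, not any particular kernel's code) of a counting semaphore that releases waiters in strict FIFO order, using a ticket counter on top of a pthread mutex and condition variable. All names such as `fifo_sem_t` are invented for the example.

```c
/* Minimal user-space sketch of a FIFO ("fair") counting semaphore.
 * This is not kernel code; it only illustrates the textbook model in the
 * question: waiters are queued, and the longest-waiting one goes first. */
#include <pthread.h>

typedef struct {
    pthread_mutex_t lock;
    pthread_cond_t  cond;
    unsigned count;        /* semaphore value                      */
    unsigned next_ticket;  /* ticket handed to the next arrival    */
    unsigned serving;      /* ticket allowed to take the semaphore */
} fifo_sem_t;

void fifo_sem_init(fifo_sem_t *s, unsigned count)
{
    pthread_mutex_init(&s->lock, NULL);
    pthread_cond_init(&s->cond, NULL);
    s->count = count;
    s->next_ticket = 0;
    s->serving = 0;
}

void fifo_sem_wait(fifo_sem_t *s)
{
    pthread_mutex_lock(&s->lock);
    unsigned my_ticket = s->next_ticket++;      /* join the back of the queue */
    while (s->count == 0 || my_ticket != s->serving)
        pthread_cond_wait(&s->cond, &s->lock);  /* block until it is my turn  */
    s->count--;
    s->serving++;                               /* hand the turn to the next waiter */
    pthread_cond_broadcast(&s->cond);
    pthread_mutex_unlock(&s->lock);
}

void fifo_sem_post(fifo_sem_t *s)
{
    pthread_mutex_lock(&s->lock);
    s->count++;
    pthread_cond_broadcast(&s->cond);           /* wake waiters; only the front ticket proceeds */
    pthread_mutex_unlock(&s->lock);
}
```

A real kernel blocks and unblocks threads directly instead of re-checking a condition variable, but the queueing discipline is the same; the answers below explain why real implementations often do not promise this strict ordering.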

Comments (5)

古镇旧梦 2025-01-01 15:32:13

A FIFO is the simplest data structure for the waiting list in a system that doesn't support priorities, but it's not the absolute answer otherwise. Depending on the scheduling algorithm chosen, different threads might have different absolute priorities, or some sort of decaying priority might be in effect, in which case the OS might choose the thread which has had the least CPU time in some preceding interval. Since such strategies are widely used (particularly the latter), the usual rule is to consider that you don't know (although with absolute priorities, it will be one of the threads with the highest priority).
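To illustrate the priority case, here is a hypothetical sketch of a wakeup path that scans the wait list and picks the highest-priority waiter instead of simply popping the head; the structure and all names are invented for the example, and FIFO order only breaks ties.

```c
/* Hypothetical wakeup selection: wake the highest-priority waiter rather
 * than the oldest one. Strict '>' keeps FIFO order among equal priorities. */
#include <stddef.h>

struct waiter {
    struct waiter *next;   /* singly linked wait list, oldest first */
    int priority;          /* larger value = more important         */
    int thread_id;
};

/* Returns the waiter to wake, or NULL if the list is empty. */
struct waiter *pick_waiter(struct waiter *head)
{
    struct waiter *best = head;
    for (struct waiter *w = head; w != NULL; w = w->next) {
        if (w->priority > best->priority)
            best = w;
    }
    return best;
}
```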

森罗 2025-01-01 15:32:13

When a process is scheduled "at random", it's not that a process is randomly chosen; it's that the selection process is not predictable.

The algorithm used by Windows kernels is that there is a queue of threads (Windows schedules "threads", not "processes") waiting on a semaphore. When the semaphore is released, the kernel schedules the next thread waiting in the queue. However, scheduling the thread does not immediately make that thread start executing; it merely makes the thread able to execute by putting it in the queue of threads waiting to run. The thread will not actually run until a CPU has no threads of higher priority to execute.

While that thread is waiting in the scheduling queue, another thread that is actually executing may wait on the same semaphore. In a traditional queue system, that new thread would have to stop executing and go to the end of the queue waiting in line for that semaphore.

In recent Windows kernels, however, the new thread does not have to stop and wait for that semaphore. If the thread that was assigned the semaphore is still sitting in the run queue, the semaphore may be reassigned to the new thread, causing the old thread to go back to waiting on the semaphore again.

The advantage of this is that the thread that was about to have to wait in the queue for the semaphore and then wait in the queue to run will not have to wait at all. The disadvantage is that you cannot predict which thread will actually get the semaphore next, and it's not fair, so a thread waiting on the semaphore could potentially starve.
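As a rough illustration of why such a scheme is unfair (my own sketch, not actual Windows code), consider a "barging" semaphore whose acquire path tries an atomic fast path before ever touching a wait queue: a thread that is already running can take a freshly posted count ahead of a waiter that was woken but has not been scheduled yet.

```c
/* Sketch of a "barging" (unfair) counting semaphore. The fast path never
 * queues, so a running thread can steal a count from a woken-but-not-yet-
 * running waiter. sched_yield() stands in for "block in the kernel". */
#include <stdatomic.h>
#include <stdbool.h>
#include <sched.h>

typedef struct {
    atomic_int count;
} barging_sem_t;

/* Fast path: succeed immediately if a count is available. */
static bool barging_try_acquire(barging_sem_t *s)
{
    int c = atomic_load(&s->count);
    while (c > 0) {
        if (atomic_compare_exchange_weak(&s->count, &c, c - 1))
            return true;            /* took the count without ever queueing */
    }
    return false;
}

void barging_acquire(barging_sem_t *s)
{
    /* Slow path: a real kernel would enqueue and block here. The key point
     * is that after waking up, the thread must retry the fast path and may
     * find the count already taken by someone who never waited. */
    while (!barging_try_acquire(s))
        sched_yield();
}

void barging_release(barging_sem_t *s)
{
    atomic_fetch_add(&s->count, 1); /* any runnable thread may now grab it first */
}
```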

一个人的旅程 2025-01-01 15:32:13

It is not that it CAN'T be FIFO; in fact, I'd bet many implementations ARE, for just the reasons that you state. The spec isn't that the process is chosen at random; it is that the order isn't specified, so your program shouldn't rely on it being chosen in any particular way. (It COULD be chosen at random; just because that isn't the fastest approach doesn't mean it can't be done.)

优雅的叶子 2025-01-01 15:32:13

All of the other answers here are great descriptions of the basic problem - especially around thread priorities and ready queues. Another thing to consider, however, is IO. I'm only talking about Windows here, since it is the only platform I know with any authority, but other kernels are likely to have similar issues.

On Windows, when an IO completes, something called a kernel-mode APC (Asynchronous Procedure Call) is queued against the thread which initiated the IO in order to complete it. If the thread happens to be waiting on a scheduler object (such as the semaphore in your example), then the thread is removed from the wait queue for that object, which causes the (internal kernel-mode) wait to complete with (something like) STATUS_ALERTED. Now, since these kernel-mode APCs are an implementation detail and you can't see them from user mode, the kernel implementation of WaitForMultipleObjects restarts the wait at that point, which causes your thread to get pushed to the back of the queue. From a kernel-mode perspective, the queue is still in FIFO order, since the first caller of the underlying wait API is still at the head of the queue; however, from your point of view, way up in user mode, you just got pushed to the back of the queue due to something you didn't see and quite possibly had no control over. This makes the queue order appear random from user mode. The implementation is still a simple FIFO, but because of IO it doesn't look like one from a higher level of abstraction.

I'm guessing a bit more here, but I would have thought that Unix-like OSes have similar constraints around signal delivery and places where the kernel needs to hijack a process to run in its context.

Now this doesn't always happen, but the documentation has to be conservative, and unless the order is explicitly guaranteed to be FIFO (which, as described above, it can't be - for Windows at least), the ordering is described in the documentation as "random" or "undocumented" or something similar, because an effectively random process controls it. It also gives the OS vendors latitude to change the ordering at some later time.
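A user-mode analogue of this restart behaviour is the alertable wait: `WaitForSingleObjectEx` returns `WAIT_IO_COMPLETION` when an APC runs, and the caller simply waits again, effectively rejoining the tail of the queue. The answer above describes the kernel doing a similar restart internally for kernel-mode APCs even when the wait is not alertable; the wrapper below is only a sketch of the user-mode pattern, not the kernel's implementation.

```c
/* Windows-only sketch: wait on a semaphore with an alertable wait and
 * restart whenever an APC interrupts it. Each restart effectively puts the
 * thread at the back of the wait queue again. */
#include <windows.h>

DWORD wait_for_semaphore(HANDLE semaphore)
{
    for (;;) {
        DWORD status = WaitForSingleObjectEx(semaphore,
                                             INFINITE,
                                             TRUE /* alertable */);
        if (status != WAIT_IO_COMPLETION)
            return status;          /* WAIT_OBJECT_0, WAIT_FAILED, ... */
        /* An APC interrupted the wait: loop and wait again. From this
         * thread's point of view it just lost its place in the queue. */
    }
}
```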

一口甜 2025-01-01 15:32:13

Process scheduling algorithms are very specific to system functionality and operating system design, so it is hard to give a good answer to this question. If I am on a general PC, I want something with good throughput and average wait/response time. If I am on a system where I know the priority of all my jobs and know I absolutely want all my high-priority jobs to run first (and don't care about preemption/starvation), then I want a priority algorithm.

As far as a random selection goes, the motivation could be for various reasons, one being an attempt at good throughput, etc., as mentioned above. However, it would be non-deterministic (hypothetically) and impossible to prove. This property could be an exploitation of probability (random samples, etc.), but, again, the proofs could only be based on empirical data on whether this really works.
