What is the difference between a thread and a fiber?

Posted 2024-07-18 00:57:50

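What is the difference between a thread and a fiber? I've heard of fibers in Ruby, and I've read that they are available in other languages too. Could somebody explain to me in simple terms what the difference is between a thread and a fiber?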


空袭的梦i 2024-07-25 00:57:50

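In the simplest terms, threads are generally considered to be preemptive (although this may not always be true, depending on the operating system), while fibers are considered to be lightweight, cooperative threads. Both are separate execution paths for your application.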

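With threads: the current execution path may be interrupted or preempted at any time (note: this statement is a generalization and may not always hold, depending on the OS, threading package, etc.). This means that data integrity is a big issue for threads, because one thread may be stopped in the middle of updating a chunk of data, leaving that data in a bad or incomplete state. It also means that the operating system can take advantage of multiple CPUs and CPU cores by running more than one thread at the same time, leaving it up to the developer to guard data access.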
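As a minimal sketch of what "guarding data access" means in practice, the Python snippet below increments a shared counter from several threads and protects the read-modify-write with a lock. The function name `run_workers` and its parameters are made up for this illustration; only the standard `threading` module is used.

```python
import threading


def run_workers(num_threads: int, increments: int) -> int:
    """Increment a shared counter from several threads, guarded by a lock."""
    counter = 0
    lock = threading.Lock()

    def worker() -> None:
        nonlocal counter
        for _ in range(increments):
            # "counter += 1" is a read-modify-write. Without the lock, a
            # thread could be preempted between the read and the write,
            # and another thread's update could be lost.
            with lock:
                counter += 1

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counter


if __name__ == "__main__":
    print(run_workers(4, 10_000))
```

With the lock in place the result is always `num_threads * increments`. Removing the `with lock:` line makes the outcome depend on where the scheduler happens to interrupt each thread.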

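With fibers: the current execution path is only interrupted when the fiber yields execution (same note as above). This means that fibers always start and stop in well-defined places, so data integrity is much less of an issue. Also, because fibers are often managed in user space, expensive context switches and CPU state changes are not needed, which makes switching from one fiber to the next extremely efficient. On the other hand, since no two fibers can run at exactly the same time, using fibers alone will not take advantage of multiple CPUs or multiple CPU cores.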
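Python has no type called "fiber", but a generator behaves much like one for the purpose of this explanation: it runs until it reaches a `yield`, and only then can something else run. The sketch below is an illustration under that assumption; `fiber`, `run_round_robin` and `demo` are hypothetical names, not part of any library.

```python
from collections import deque
from typing import Deque, Generator, Iterable, List

Fiber = Generator[None, None, None]


def fiber(name: str, steps: int, log: List[str]) -> Fiber:
    """A cooperative task: does one step of work, then yields control."""
    for i in range(steps):
        log.append(f"{name}:{i}")
        yield  # the only place where a switch to another fiber can happen


def run_round_robin(fibers: Iterable[Fiber]) -> None:
    """A tiny user-space scheduler: resume each ready fiber in turn."""
    ready: Deque[Fiber] = deque(fibers)
    while ready:
        current = ready.popleft()
        try:
            next(current)  # run until the fiber's next yield
        except StopIteration:
            continue  # the fiber finished; do not reschedule it
        ready.append(current)


def demo() -> List[str]:
    log: List[str] = []
    run_round_robin([fiber("A", 2, log), fiber("B", 3, log)])
    return log


if __name__ == "__main__":
    print(demo())  # ['A:0', 'B:0', 'A:1', 'B:1', 'B:2']
```

The switch itself is an ordinary function return inside one thread, which is why no kernel involvement is needed.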


笑脸一如从前 2024-07-25 00:57:50

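Threads use preemptive scheduling, whereas fibers use cooperative scheduling.

With a thread, the control flow can be interrupted at any time, and another thread can take over. With multiple processors or cores, several threads can run at the same time (true parallelism, whether on separate cores or on one core with simultaneous multithreading, SMT). As a result, you have to be very careful about concurrent data access and protect your data with mutexes, semaphores, condition variables, and so on. It is often very tricky to get right.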

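With a fiber, control only switches when you tell it to, typically with a function call named something like yield(). This makes concurrent data access easier, since you don't have to worry about the atomicity of data structures or about mutexes. As long as you don't yield, there is no danger of being preempted and of another fiber trying to read or modify the data you're working with. The flip side is that if your fiber gets into an infinite loop, no other fiber can run, because you never yield.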
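To make the "as long as you don't yield" point concrete, here is a small Python sketch that again uses generators as stand-ins for fibers. One fiber performs a two-step update on shared data and an auditor fiber checks an invariant every time it is scheduled. All names (`transfer`, `auditor`, `run_all`, `demo`) are invented for this example.

```python
from typing import Dict, Generator, Iterable, List, Tuple

Fiber = Generator[None, None, None]


def transfer(accounts: Dict[str, int], amount: int, rounds: int) -> Fiber:
    """Move money from 'a' to 'b' in two steps, yielding only afterwards."""
    for _ in range(rounds):
        accounts["a"] -= amount  # the invariant is temporarily broken here...
        accounts["b"] += amount  # ...and restored before the switch point
        yield


def auditor(accounts: Dict[str, int], total: int, checks: int,
            seen: List[bool]) -> Fiber:
    """Record whether the total is intact each time this fiber gets to run."""
    for _ in range(checks):
        seen.append(accounts["a"] + accounts["b"] == total)
        yield


def run_all(fibers: Iterable[Fiber]) -> None:
    """Resume every live fiber in turn until all of them have finished."""
    ready = list(fibers)
    while ready:
        for f in list(ready):
            try:
                next(f)
            except StopIteration:
                ready.remove(f)


def demo() -> Tuple[Dict[str, int], List[bool]]:
    accounts = {"a": 100, "b": 0}
    seen: List[bool] = []
    run_all([transfer(accounts, 10, 5), auditor(accounts, 100, 5, seen)])
    return accounts, seen
```

The auditor never observes the half-finished update, and no lock is involved. The same structure also shows the downside: if `transfer` looped forever without reaching its `yield`, the auditor would never run.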

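You can also mix threads and fibers, which gives you the problems of both. This is not recommended, but it can sometimes be the right thing to do if done carefully.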


拿命拼未来 2024-07-25 00:57:50

First I would recommend reading this explanation of the difference between processes and threads as background material.

Once you've read that, it's pretty straightforward. Threads can be implemented either in the kernel, in user space, or the two can be mixed. Fibers are basically threads implemented in user space.

What is typically called a thread is a thread of execution implemented in the kernel: what's known as a kernel thread. The scheduling of a kernel thread is handled exclusively by the kernel, although a kernel thread can voluntarily release the CPU by sleeping if it wants. A kernel thread has the advantage that it can use blocking I/O and let the kernel worry about scheduling. Its main disadvantage is that thread switching is relatively slow, since it requires trapping into the kernel.

Fibers are user space threads whose scheduling is handled in user space by one or more kernel threads under a single process. This makes fiber switching very fast. If you group all the fibers accessing a particular set of shared data under the context of a single kernel thread and have their scheduling handled by that single kernel thread, then you can eliminate synchronization issues, since the fibers will effectively run in serial and you have complete control over their scheduling. Grouping related fibers under a single kernel thread is important, since the kernel thread they are running in can be pre-empted by the kernel. This point is not made clear in many of the other answers. Also, if you use blocking I/O in a fiber, the entire kernel thread it is part of blocks, including all the fibers that are part of that kernel thread.
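The blocking I/O point can be illustrated with a rough Python sketch in which generators stand in for fibers and `time.sleep` stands in for a blocking system call such as a read on a slow device. The names `blocking_fiber`, `quick_fiber`, `run_all` and `demo` are hypothetical and exist only for this example.

```python
import time
from typing import Generator, Iterable, List, Tuple

Fiber = Generator[None, None, None]
Event = Tuple[str, float]  # (label, seconds since the demo started)


def blocking_fiber(delay: float, log: List[Event], t0: float) -> Fiber:
    log.append(("blocker:start", time.perf_counter() - t0))
    # Stand-in for blocking I/O: the whole thread waits here, and since no
    # yield is reached while waiting, no other fiber on this thread can run.
    time.sleep(delay)
    log.append(("blocker:done", time.perf_counter() - t0))
    yield


def quick_fiber(log: List[Event], t0: float) -> Fiber:
    log.append(("quick:ran", time.perf_counter() - t0))
    yield


def run_all(fibers: Iterable[Fiber]) -> None:
    """Resume every live fiber in turn until all of them have finished."""
    ready = list(fibers)
    while ready:
        for f in list(ready):
            try:
                next(f)
            except StopIteration:
                ready.remove(f)


def demo(delay: float = 0.05) -> List[Event]:
    log: List[Event] = []
    t0 = time.perf_counter()
    run_all([blocking_fiber(delay, log, t0), quick_fiber(log, t0)])
    return log
```

Although `quick_fiber` has no work of its own that takes any time, its log entry is only written after the full delay has elapsed, because the thread that would have run it was blocked.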

In section 11.4 "Processes and Threads in Windows Vista" in Modern Operating Systems, Tanenbaum comments:

Although fibers are cooperatively scheduled, if there are multiple
threads scheduling the fibers, a lot of careful synchronization is
required to make sure fibers do not interfere with each other. To
simplify the interaction between threads and fibers, it is often
useful to create only as many threads as there are processors to run
them, and affinitize the threads to each run only on a distinct set of
available processors, or even just one processor. Each thread can
then run a particular subset of the fibers, establishing a one-to-many
relationship between threads and fibers which simplifies
synchronization. Even so there are still many difficulties with
fibers. Most Win32 libraries are completely unaware of fibers, and
applications that attempt to use fibers as if they were threads will
encounter various failures. The kernel has no knowledge of fibers,
and when a fiber enters the kernel, the thread it is executing on may
block and the kernel will schedule an arbitrary thread on the
processor, making it unavailable to run other fibers. For these
reasons fibers are rarely used except when porting code from other
systems that explicitly need the functionality provided by fibers.
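The one-to-many arrangement described in the quote can be sketched in Python as follows: each kernel thread owns one shard of data and runs its own subset of cooperative tasks (generators here, as stand-ins for fibers), so no shard is ever touched by two threads. This is an illustration only. The names are invented, processor affinity is not shown, and CPython's global interpreter lock means the sketch demonstrates the structure rather than a parallel speed-up.

```python
import threading
from typing import Generator, Iterable, List

Fiber = Generator[None, None, None]


def adder(shard: List[int], value: int, times: int) -> Fiber:
    """Add to a shard that only fibers on the same thread ever touch."""
    for _ in range(times):
        shard[0] += value  # no lock: fibers on one thread run serially
        yield


def run_fibers(fibers: Iterable[Fiber]) -> None:
    """Per-thread scheduler: resume each live fiber in turn until all finish."""
    ready = list(fibers)
    while ready:
        for f in list(ready):
            try:
                next(f)
            except StopIteration:
                ready.remove(f)


def run_partitioned(num_threads: int, fibers_per_thread: int, times: int) -> int:
    # One shard per thread; every fiber is bound to exactly one thread.
    shards: List[List[int]] = [[0] for _ in range(num_threads)]
    threads = []
    for shard in shards:
        fibers = [adder(shard, 1, times) for _ in range(fibers_per_thread)]
        threads.append(threading.Thread(target=run_fibers, args=(fibers,)))
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    # The shards are combined only after every thread has finished.
    return sum(shard[0] for shard in shards)
```

Calling `run_partitioned(4, 3, 100)` returns 1200. If fibers from different threads shared a shard instead, the preemption problems of ordinary threads would come back.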


我的奇迹 2024-07-25 00:57:50

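In Win32, a fiber is a sort of user-managed thread. A fiber has its own stack, its own instruction pointer and so on, but fibers are not scheduled by the OS: you have to call SwitchToFiber explicitly. Threads, by contrast, are preemptively scheduled by the operating system. So, roughly speaking, a fiber is a thread that is managed at the application/runtime level rather than being a true OS thread.

The consequences are that fibers are cheaper and that the application has more control over scheduling. This can be important if the application creates a lot of concurrent tasks and/or wants to closely optimize when they run. For example, a database server might choose to use fibers rather than threads.

(There may be other usages of the same term; as noted, this is the Win32 definition.)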


若相惜即相离 2024-07-25 00:57:50

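Note that in addition to threads and fibers, Windows 7 introduces User-Mode Scheduling:

User-mode scheduling (UMS) is a lightweight mechanism that applications can use to schedule their own threads. An application can switch between UMS threads in user mode without involving the system scheduler, and regain control of the processor if a UMS thread blocks in the kernel. UMS threads differ from fibers in that each UMS thread has its own thread context instead of sharing the thread context of a single thread. The ability to switch between threads in user mode makes UMS more efficient than thread pools for managing large numbers of short-duration work items that require few system calls.

For more information about threads, fibers and UMS, watch Dave Probert: Inside Windows 7 - User Mode Scheduler (UMS).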