Thread context switch vs. process context switch

Posted on 2024-10-26 22:32:40

Could anyone tell me what exactly is done in each of these two situations, and what the main cost of each is?

苍景流年 2024-11-02 22:32:40

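The main difference between a thread switch and a process switch is that the virtual memory space stays the same during a thread switch, whereas it changes during a process switch.
Both kinds involve handing control to the operating system kernel to perform the context switch. Entering and leaving the kernel, together with saving and restoring the registers, is the largest fixed cost of a context switch.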

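A fuzzier cost is that a context switch disrupts the processor's caching mechanisms. Basically, when you context switch, much of the memory that the processor "remembers" in its caches effectively becomes useless to the incoming code. The one big distinction here is that when you change virtual memory spaces, the processor's Translation Lookaside Buffer (TLB) or equivalent gets flushed (unless the hardware tags TLB entries by address space), which makes memory accesses much more expensive for a while. This does not happen during a thread switch.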
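
To get a rough feel for the fixed cost described above, here is a minimal sketch that bounces one byte back and forth over a pair of pipes, first between two threads of one process and then between a parent and a forked child. The function names and the `rounds` parameter are made up for this illustration. It assumes a POSIX system (`os.fork`), and the numbers it produces are only an upper-bound proxy: each round trip also includes system call and Python interpreter overhead, and on a multi-core machine the two sides may run on different cores, in which case it measures wake-up latency rather than a switch on a single core.

```python
import os
import threading
import time


def _echo(read_fd, write_fd, rounds):
    """Echo one byte back for each byte received."""
    for _ in range(rounds):
        os.write(write_fd, os.read(read_fd, 1))


def _ping(write_fd, read_fd, rounds):
    """Send a byte, block until the echo arrives; return seconds per round trip."""
    start = time.perf_counter()
    for _ in range(rounds):
        os.write(write_fd, b"x")
        os.read(read_fd, 1)
    return (time.perf_counter() - start) / rounds


def thread_round_trip(rounds=2000):
    """Ping-pong between two threads of one process (same address space)."""
    a_r, a_w = os.pipe()  # main thread -> peer
    b_r, b_w = os.pipe()  # peer -> main thread
    peer = threading.Thread(target=_echo, args=(a_r, b_w, rounds))
    peer.start()
    per_trip = _ping(a_w, b_r, rounds)
    peer.join()
    for fd in (a_r, a_w, b_r, b_w):
        os.close(fd)
    return per_trip


def process_round_trip(rounds=2000):
    """Ping-pong between a parent and a forked child (separate address spaces)."""
    a_r, a_w = os.pipe()  # parent -> child
    b_r, b_w = os.pipe()  # child -> parent
    pid = os.fork()
    if pid == 0:  # child: echo, then exit without running parent cleanup
        try:
            _echo(a_r, b_w, rounds)
        finally:
            os._exit(0)
    per_trip = _ping(a_w, b_r, rounds)
    os.waitpid(pid, 0)
    for fd in (a_r, a_w, b_r, b_w):
        os.close(fd)
    return per_trip


if __name__ == "__main__":
    print(f"threads:   {thread_round_trip() * 1e6:.1f} us per round trip")
    print(f"processes: {process_round_trip() * 1e6:.1f} us per round trip")
```

Each round trip forces the two sides to block and wake at least twice, so halving the result gives a crude per-handoff figure. Pinning both sides to one core (for example with `taskset` on Linux) is needed before the result says much about actual switches, and isolating the cache and TLB effects would need hardware performance counters rather than wall-clock time.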

画尸师 2024-11-02 22:32:40

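A process context switch involves switching the memory address space. This includes memory addresses, mappings, page tables, and kernel resources, which makes it a relatively expensive operation. On some architectures it even means flushing various processor caches that cannot be shared across address spaces. For example, x86 has to flush the TLB, and some ARM processors have to flush the entire L1 cache!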

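A thread switch is a context switch from one thread to another within the same process (switching from one thread to another across processes is just a process switch). Switching the processor state (such as the program counter and register contents) is generally very efficient.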
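
The reason no address-space switch is needed between threads of one process is that they already share one. A small sketch of that difference, assuming a POSIX system with `os.fork`; the names `counter`, `thread_delta`, and `process_delta` are invented for this example:

```python
import os
import threading

# Shared module-level state, living in this process's address space.
counter = {"value": 0}


def bump():
    counter["value"] += 1


def thread_delta():
    """Run bump() in another thread; return the change seen by the caller."""
    before = counter["value"]
    worker = threading.Thread(target=bump)
    worker.start()
    worker.join()
    return counter["value"] - before


def process_delta():
    """Run bump() in a forked child; return the change seen by the parent."""
    before = counter["value"]
    pid = os.fork()
    if pid == 0:
        bump()       # modifies the child's private copy only
        os._exit(0)
    os.waitpid(pid, 0)
    return counter["value"] - before


if __name__ == "__main__":
    print(thread_delta())   # 1: the thread wrote to the same memory
    print(process_delta())  # 0: the child wrote to its own address space
```

The thread's write is visible to the caller because both run on the same page tables. The child's write is not, because after `fork` it has its own address space, and that is the thing the kernel has to swap on a process switch.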

青春有你 2024-11-02 22:32:40

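First, the operating system brings the outgoing thread into kernel mode if it is not already there, because a thread switch can only be performed between threads that are running in kernel mode. Then the scheduler is invoked to decide which thread to switch to. Once the decision is made, the kernel saves the part of the thread context that lives in the CPU (the CPU registers) to a dedicated place in memory (frequently the top of the outgoing thread's kernel stack). The kernel then switches from the kernel stack of the outgoing thread to the kernel stack of the incoming thread. After that, it loads the previously stored context of the incoming thread from memory into the CPU registers. Finally, it returns control to user mode, but in the user mode of the new thread.
When the OS has determined that the incoming thread runs in another process, the kernel performs one additional step: it sets the new active virtual address space.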

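The main cost in both scenarios is related to cache pollution. In most cases, the working set used by the outgoing thread differs significantly from the working set used by the incoming thread. As a result, the incoming thread starts its life with an avalanche of cache misses, evicting old, now useless data from the caches and loading new data from memory. The same is true for the TLB (Translation Lookaside Buffer, which is on the CPU). When the virtual address space is reset (the threads run in different processes) the penalty is even worse, because resetting the virtual address space flushes the entire TLB, even if the new thread actually needs to load only a few new entries. As a result, the new thread starts its time quantum with lots of TLB misses and frequent page walks. The direct cost of a thread switch is also not negligible (from ~250 up to ~1500-2000 cycles) and depends on the CPU complexity, the states of both threads, and the sets of registers they actually use.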
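
To put the direct-cost figures into wall-clock terms, the cycle counts can be divided by the clock frequency. The 3 GHz clock below is an assumption chosen only for illustration, and `cycles_to_ns` is a helper made up for this example:

```python
def cycles_to_ns(cycles, clock_ghz):
    """Convert a cycle count to nanoseconds at the given clock frequency.

    A clock of N GHz completes N cycles per nanosecond.
    """
    return cycles / clock_ghz


if __name__ == "__main__":
    ASSUMED_CLOCK_GHZ = 3.0  # illustrative value, not taken from the answer
    for cycles in (250, 1500, 2000):
        ns = cycles_to_ns(cycles, ASSUMED_CLOCK_GHZ)
        print(f"{cycles} cycles ~ {ns:.1f} ns")
    # 250 cycles ~ 83.3 ns
    # 1500 cycles ~ 500.0 ns
    # 2000 cycles ~ 666.7 ns
```

At that assumed clock the direct cost comes to well under a microsecond per switch. This figure covers only the direct cost and not the cache and TLB misses that follow the switch.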

P.S.: A good post about context switch overhead: http://blog.tsunanet.net/2010/11/how-long-does-it-take-to-make-context.html
