Why do threads share the heap space?

Posted 2024-09-11 03:08:17

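Each thread has its own stack, but all threads share a common heap.

It's clear to everyone that the stack is for local/method variables and the heap is for instance/class variables.

What is the benefit of sharing the heap among threads?

Several threads run simultaneously, so sharing memory can lead to problems such as concurrent modification, and to overhead such as mutual exclusion. Which contents of the heap do threads share?

Why is this the case? Why not have each thread own its own heap as well? Can anyone provide a real-world example of how threads make use of shared memory?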


请你别敷衍 2024-09-18 03:08:17

What do you do when you want to pass data from one thread to another? (If you never did that, you'd be writing separate programs, not one multi-threaded program.) There are two major approaches:

The approach you seem to take for granted is shared memory: except for data that has a compelling reason to be thread-specific (such as the stack), all data is accessible to all threads. Basically, there is a shared heap. That gives you speed: a thread changes some data in place, and other threads can see the change without anything being copied. (Limitation: that visibility is not automatic when the threads execute on different processors. There the programmer needs to work especially hard, using explicit synchronization, to use shared memory correctly and efficiently.) Most major imperative languages, in particular Java and C#, favor this model.
A variation on this approach is to have one heap per thread, plus a shared heap. This requires the programmer to decide which data to put where, and that often doesn't mesh well with existing programming languages.
The dual approach is message passing: each thread has its own data space. When a thread wants to communicate with another thread, it has to send that thread an explicit message, which copies the data from the sender's heap to the recipient's heap. In this setting many communities prefer to call the threads processes. That gives you safety: since a thread can't overwrite some other thread's memory on a whim, a lot of bugs are avoided. Another benefit is distribution: you can make your threads run on separate machines without having to change a single line in your program. You can find message passing libraries for most languages, but they tend to be less well integrated than shared memory. Good languages in which to learn message passing are Erlang and JoCaml.
In fact, message passing environments usually use shared memory behind the scenes, at least as long as the threads are running on the same machine/processor. This saves a lot of time and memory, since passing a message from one thread to another then doesn't require making a copy of the data. But since the shared memory is not exposed to the programmer, its inherent complexity is confined to the language/library implementation.
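As a rough illustration of the last point, here is a minimal sketch in Java, one of the languages named above. It assumes a `BlockingQueue` is an acceptable stand-in for a message channel. The class name `MessagePassingSketch`, the method `sumViaQueue`, and the `DONE` sentinel are hypothetical names invented for this example. The two threads only ever exchange messages, yet the queue object itself sits on the shared heap, and the library hides the locking.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

// Hypothetical sketch: all names here are illustrative, not from the answer.
class MessagePassingSketch {
    private static final int DONE = -1; // sentinel message: "nothing more to send"

    // A producer thread sends 1..n as messages; the caller receives and sums them.
    static int sumViaQueue(int n) {
        // The mailbox is an ordinary heap object shared by both threads.
        // Its internal synchronization is the library's problem, not ours.
        BlockingQueue<Integer> mailbox = new ArrayBlockingQueue<>(4);

        Thread producer = new Thread(() -> {
            try {
                for (int i = 1; i <= n; i++) {
                    mailbox.put(i); // blocks while the mailbox is full
                }
                mailbox.put(DONE);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        producer.start();

        int sum = 0;
        try {
            while (true) {
                int message = mailbox.take(); // blocks while the mailbox is empty
                if (message == DONE) {
                    break;
                }
                sum += message;
            }
            producer.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while receiving", e);
        }
        return sum;
    }

    public static void main(String[] args) {
        System.out.println(sumViaQueue(5)); // prints 15
    }
}
```

Neither thread touches the other's variables directly, so the only place where shared-memory complexity shows up is inside the queue implementation.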


无言温柔 2024-09-18 03:08:17

Because otherwise they would be processes. Sharing memory is the whole idea of threads.


一曲爱恨情仇 2024-09-18 03:08:17

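Processes generally don't share heap space. There are APIs that permit it, but by default processes are separate.

Threads share heap space.

That's the "practical idea": two ways to use memory, shared and not shared.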


把回忆走一遍 2024-09-18 03:08:17

In many languages/runtimes the stack is used (among other things) to hold function/method parameters and local variables. If threads shared a stack, things would get really messy.

void MyFunc(int a) // Stored on the stack
{
   int b; // Stored on the stack
}

When the call to 'MyFunc' is done, the stack is popped and a and b are no longer on the stack. Because threads don't share stacks, there is no threading issue for the variables a and b.
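A small sketch of that point, written in Java on the assumption that the same reasoning carries over from the C-style snippet. The names `StackLocalsSketch`, `myFunc`, and `runOnTwoThreads` are hypothetical. Two threads run the same method at the same time, and each gets its own copies of `a` and `b` in its own stack frame, so no locking is needed for them.

```java
// Hypothetical sketch: all names here are illustrative, not from the answer.
class StackLocalsSketch {
    // 'a' and 'b' live in the stack frame of whichever thread makes the call.
    static int myFunc(int a) {
        int b = 0;
        for (int i = 0; i < 100_000; i++) {
            b += a; // touches only this thread's private copy of b
        }
        return b;
    }

    // Runs myFunc concurrently on two threads and returns both results.
    static int[] runOnTwoThreads() {
        int[] results = new int[2]; // heap array; each thread writes only its own slot
        Thread t1 = new Thread(() -> results[0] = myFunc(1));
        Thread t2 = new Thread(() -> results[1] = myFunc(2));
        t1.start();
        t2.start();
        try {
            t1.join(); // join() also makes each thread's write visible to the caller
            t2.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting", e);
        }
        return results;
    }

    public static void main(String[] args) {
        int[] r = runOnTwoThreads();
        System.out.println(r[0] + " " + r[1]); // prints 100000 200000
    }
}
```

The only data the threads hand back goes through the `results` array, which is on the heap precisely because it has to outlive both calls.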

Because of the nature of the stack (pushing/popping), it's not really suited for keeping 'long-term' state or state shared across function calls. That kind of state has to live outside the stack, like this:

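// Not on any stack: there is one copy, visible to every function.
// (Strictly, a C global sits in static storage rather than on the
// malloc heap, but like the heap it is shared by all threads.)
int globalValue;

void Foo()
{
   int b = globalValue; // Gets the current value of globalValue (local copy on the stack)

   globalValue = 10;
}

void Bar()
{
   int b = globalValue; // Gets the current value of globalValue (local copy on the stack)

   globalValue = 20;
}


int main(void)
{
   globalValue = 0;
   Foo();
   // globalValue is now 10
   Bar();
   // globalValue is now 20
   return 0;
}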
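The snippet above is single-threaded. As a hedged sketch of what changes once several threads touch the same off-stack state, here is a Java version in which a static `AtomicInteger` stands in for `globalValue`. The class `SharedStateSketch` and the method `incrementFromThreads` are hypothetical names, and the choice of an atomic counter is an assumption made for this example. A plain `int` field updated with `globalValue++` could lose increments here.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch: all names here are illustrative, not from the answer.
class SharedStateSketch {
    // One shared counter, reachable from every thread (not on any stack).
    static final AtomicInteger globalValue = new AtomicInteger();

    // Starts 'threads' workers that each add 1 to globalValue 'perThread' times.
    static int incrementFromThreads(int threads, int perThread) {
        globalValue.set(0);
        Thread[] workers = new Thread[threads];
        for (int i = 0; i < threads; i++) {
            workers[i] = new Thread(() -> {
                for (int j = 0; j < perThread; j++) {
                    globalValue.incrementAndGet(); // atomic read-modify-write
                }
            });
            workers[i].start();
        }
        try {
            for (Thread worker : workers) {
                worker.join();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting", e);
        }
        return globalValue.get();
    }

    public static void main(String[] args) {
        System.out.println(incrementFromThreads(4, 10_000)); // prints 40000
    }
}
```

This is the trade-off the question asks about: the shared state is cheap to reach from any thread, but every update to it has to be coordinated.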
