当前位置：文江博客话题详情

函数式编程和多核架构

发布于 2024-07-07 02:26:31 字数 66 浏览 4 评论 0原文

我在某处读到，函数式编程适合利用计算中的多核趋势。我真的不明白。它与 lambda 演算和冯诺依曼架构有关吗？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

吻风 2024-07-14 02:26:31

函数式编程最大限度地减少或消除了副作用，因此更适合分布式编程。即多核处理。

换句话说，许多难题可以同时在不同的核心上独立解决，而不必担心一个操作会影响另一个操作，就像在其他编程风格中一样。

回复收藏 0 原文

浅紫色的梦幻 2024-07-14 02:26:31

处理并行处理最困难的事情之一是锁定数据结构以防止损坏。如果两个线程在没有完全锁定数据结构的情况下同时改变数据结构，则可能会导致从无效数据到死锁的任何情况。

相反，函数式编程语言倾向于强调不可变数据。任何状态都与逻辑分离，数据结构一旦创建就无法修改。锁定的需要大大减少。

另一个好处是，一些非常容易并行化的过程（例如迭代）被抽象为函数。在 C++ 中，您可能有一个 for 循环，对列表中的每个项目运行一些数据处理。但是编译器无法知道这些操作是否可以安全地并行运行——也许其中一个操作的结果取决于它之前的操作。当使用像map()或reduce()这样的函数时，编译器可以知道调用之间不存在依赖关系。因此可以同时处理多个项目。

回复收藏 0 原文

献世佛 2024-07-14 02:26:31

我在某处读到，函数式编程适合利用计算中的多核趋势......我并没有真正理解这个想法。与 lambda 演算和冯·诺依曼架构有关吗？

您引用的信念背后的论点是，纯函数式编程控制副作用，这使得引入并行性变得更加容易和安全，因此，纯函数式编程语言在多核计算机的上下文中应该是有利的。

不幸的是，由于以下几个原因，这种信念早已被证明是错误的：

纯函数式数据结构的绝对性能很差。因此，在性能方面（这是并行编程的唯一目的），纯函数式编程是朝着错误方向迈出的一大第一步。
纯函数式数据结构的扩展性很差，因为它们强调共享资源，包括分配器/GC 和主内存带宽。因此，随着核心数量的增加，并行化的纯函数式程序通常获得的加速效果很差。
纯函数式编程导致性能不可预测。因此，真正的纯函数式程序在并行化时通常会出现性能下降，因为粒度实际上是随机的。

例如，Haskell 社区经常引用的混蛋两行快速排序通常会运行数千次比用更传统的语言（如 F#）编写的真正的就地快速排序慢几倍。此外，虽然您可以轻松地并行化优雅的 Haskell 程序，但您不太可能看到任何性能改进，因为所有不必要的复制都会使单个核心饱和多核机器的整个主内存带宽，从而使并行性变得毫无价值。事实上，没有人能够在 Haskell 中编写任何类型的具有竞争力的性能的通用并行排序。 Haskell 标准库提供的最先进的排序通常比传统的替代方案慢数百倍。

然而，函数式编程更常见的定义是一种强调使用一流函数的风格，实际上在多核编程环境中非常有用，因为这种范例非常适合分解并行程序。例如，请参阅 .NET 4 中 System.Threading.Tasks 命名空间中新的高阶 Parallel.For 函数。

I've read somewhere that functional programming is suitable to take advantage of multi-core trend in computing... I didn't really get the idea. Is it related to the lambda calculus and von neumann architecture?

The argument behind the belief you quoted is that purely functional programming controls side effects which makes it much easier and safer to introduce parallelism and, therefore, that purely functional programming languages should be advantageous in the context of multicore computers.

Unfortunately, this belief was long since disproven for several reasons:

The absolute performance of purely functional data structures is poor. So purely functional programming is a big initial step in the wrong direction in the context of performance (which is the sole purpose of parallel programming).
Purely functional data structures scale badly because they stress shared resources including the allocator/GC and main memory bandwidth. So parallelized purely functional programs often obtain poor speedups as the number of cores increases.
Purely functional programming renders performance unpredictable. So real purely functional programs often see performance degradation when parallelized because granularity is effectively random.

For example, the bastardized two-line quicksort often cited by the Haskell community typically runs thousands of times slower than a real in-place quicksort written in a more conventional language like F#. Moreover, although you can easily parallelize the elegant Haskell program, you are unlikely to see any performance improvement whatsoever because all of the unnecessary copying makes a single core saturate the entire main memory bandwidth of a multicore machine, rendering parallelism worthless. In fact, nobody has ever managed to write any kind of generic parallel sort in Haskell that is competitively performant. The state-of-the-art sorts provided by Haskell's standard library are typically hundreds of times slower than conventional alternatives.

However, the more common definition of functional programming as a style that emphasizes the use of first-class functions does actually turn out to be very useful in the context of multicore programming because this paradigm is ideal for factoring parallel programs. For example, see the new higher-order Parallel.For function from the System.Threading.Tasks namespace in .NET 4.

回复收藏 0 原文

咋地 2024-07-14 02:26:31

当没有副作用时，评估顺序并不重要。然后可以并行计算表达式。

回复收藏 0 原文

以为你会在 2024-07-14 02:26:31

基本论点是，像 C/C++ 等语言很难自动并行化，因为函数可以设置全局变量。考虑两个函数调用：

a = foo(b, c);
d = bar(e, f);

虽然 foo 和 bar 没有共同的参数，并且一个不依赖于另一个的返回代码，但它们仍然可能具有依赖关系，因为 foo 可能会设置 bar 所依赖的全局变量（或其他副作用）。

函数式语言保证 foo 和 bar 是独立的：没有全局变量，也没有副作用。因此 foo 和 bar 可以在不同的内核上自动安全地运行，无需程序员干预。

The basic argument is that it is difficult to automatically parallelize languages like C/C++/etc because functions can set global variables. Consider two function calls:

a = foo(b, c);
d = bar(e, f);

Though foo and bar have no arguments in common and one does not depend on the return code of the other, they nonetheless might have dependencies because foo might set a global variable (or other side effect) which bar depends upon.

Functional languages guarantee that foo and bar are independant: there are no globals, and no side effects. Therefore foo and bar could be safely run on different cores, automatically, without programmer intervention.

回复收藏 0 原文