并发素数生成器
我正在解决projecteuler.net上的问题来学习如何在Erlang中编程,并且我在创建一个素数生成器时遇到了最困难的事情,它可以在不到一分钟的时间内创建所有低于200万的素数。 使用顺序风格,我已经编写了三种类型的生成器,包括埃拉托色尼筛法,但它们的性能都不够好。
我认为并发筛子会很好用,但我收到了 bad_arity 消息,我不确定为什么。 关于为什么我会遇到这个问题或者如何正确编码有什么建议吗?
这是我的代码,注释掉的部分是我试图使事情并发的地方:
-module(primeserver). -compile(export_all). start() -> register(primes, spawn(fun() -> loop() end)). is_prime(N) -> rpc({is_prime,N}). rpc(Request) -> primes ! {self(), Request}, receive {primes, Response} -> Response end. loop() -> receive {From, {is_prime, N}} -> if N From ! {primes, false}; N =:= 2 -> From ! {primes, true}; N rem 2 =:= 0 -> From ! {primes, false}; true -> Values = is_not_prime(N), Val = not(lists:member(true, Values)), From ! {primes, Val} end, loop() end. for(N,N,_,F) -> [F(N)]; for(I,N,S,F) when I + S [F(I)|for(I+S, N, S, F)]; for(I,N,S,F) when I + S =:= N -> [F(I)|for(I+S, N, S, F)]; for(I,N,S,F) when I + S > N -> [F(I)]. get_list(I, Limit) -> if I [I*A || A [] end. is_not_prime(N) -> for(3, N, 2, fun(I) -> List = get_list(I,trunc(N/I)), lists:member(N,lists:flatten(List)) end ). %%L = for(1,N, fun() -> spawn(fun(I) -> wait(I,N) end) end), %%SeedList = [A || A %% lists:foreach(fun(X) -> %% Pid ! {in_list, X} %% end, SeedList) %% end, L). %%wait(I,N) -> %% List = [I*A || A lists:member(X,List) %% end.
I'm going through the problems on projecteuler.net to learn how to program in Erlang, and I am having the hardest time creating a prime generator that can create all of the primes below 2 million, in less than a minute. Using the sequential style, I have already written three types of generators, including the Sieve of Eratosthenes, and none of them perform well enough.
I figured a concurrent Sieve would work great, but I'm getting bad_arity messages, and I'm not sure why. Any suggestions on why I have the problem, or how to code it properly?
Here's my code, the commented out sections are where I tried to make things concurrent:
-module(primeserver). -compile(export_all). start() -> register(primes, spawn(fun() -> loop() end)). is_prime(N) -> rpc({is_prime,N}). rpc(Request) -> primes ! {self(), Request}, receive {primes, Response} -> Response end. loop() -> receive {From, {is_prime, N}} -> if N From ! {primes, false}; N =:= 2 -> From ! {primes, true}; N rem 2 =:= 0 -> From ! {primes, false}; true -> Values = is_not_prime(N), Val = not(lists:member(true, Values)), From ! {primes, Val} end, loop() end. for(N,N,_,F) -> [F(N)]; for(I,N,S,F) when I + S [F(I)|for(I+S, N, S, F)]; for(I,N,S,F) when I + S =:= N -> [F(I)|for(I+S, N, S, F)]; for(I,N,S,F) when I + S > N -> [F(I)]. get_list(I, Limit) -> if I [I*A || A [] end. is_not_prime(N) -> for(3, N, 2, fun(I) -> List = get_list(I,trunc(N/I)), lists:member(N,lists:flatten(List)) end ). %%L = for(1,N, fun() -> spawn(fun(I) -> wait(I,N) end) end), %%SeedList = [A || A %% lists:foreach(fun(X) -> %% Pid ! {in_list, X} %% end, SeedList) %% end, L). %%wait(I,N) -> %% List = [I*A || A lists:member(X,List) %% end.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(10)
另一种可供考虑的替代方案是使用概率素数生成。 Joe 的书中有一个这样的例子(“主服务器”),我认为它使用 Miller-Rabin...
Another alternative to consider is to use probabalistic prime generation. There is an example of this in Joe's book (the "prime server") which uses Miller-Rabin I think...
素数并行算法:http://www.cs.cmu.edu/~丑闻/cacm/node8.html
Primes parallel algorithm : http://www.cs.cmu.edu/~scandal/cacm/node8.html
“badarity”错误意味着您尝试使用错误数量的参数调用“fun”。 在这种情况下...
%%L = for(1,N, fun() -> spawn(fun(I) -> wait(I,N) end) end),
for/3 函数需要一个 fun arity 1 的 fun,spawn/1 函数期望 arity 0 的 fun。请尝试这样做:
传递给 spawn 的 fun 继承其环境的所需部分(即 I),因此无需显式传递它。
虽然计算素数总是很有趣,但请记住,这不是 Erlang 旨在解决的问题。 Erlang 是为大规模 Actor 式并发而设计的。 它很可能在所有数据并行计算的例子上表现得相当糟糕。 在许多情况下,ML 中的顺序解决方案速度非常快,以至于任何数量的核心都不足以让 Erlang 赶上,例如 F# 和 .NET 任务并行库 对于此类操作无疑是更好的工具。
The 'badarity' error means that you're trying to call a 'fun' with the wrong number of arguments. In this case...
%%L = for(1,N, fun() -> spawn(fun(I) -> wait(I,N) end) end),
The for/3 function expects a fun of arity 1, and the spawn/1 function expects a fun of arity 0. Try this instead:
The fun passed to spawn inherits needed parts of its environment (namely I), so there's no need to pass it explicitly.
While calculating primes is always good fun, please keep in mind that this is not the kind of problem Erlang was designed to solve. Erlang was designed for massive actor-style concurrency. It will most likely perform rather badly on all examples of data-parallel computation. In many cases, a sequential solution in, say, ML will be so fast that any number of cores will not suffice for Erlang to catch up, and e.g. F# and the .NET Task Parallel Library would certainly be a much better vehicle for these kinds of operations.
我使用 Go 和通道编写了一个埃拉托色式并发素数筛。
这是代码: http://github.com/aht/gosieve
我在这里写了关于它的博客:< a href="http://blog.onideas.ws/eratosthenes.go" rel="noreferrer">http://blog.onideas.ws/eratosthenes.go
该程序可以筛选出前一百万个素数(15,485,863 以内的所有素数)大约需要 10 秒。 筛子是并发的,但算法主要是同步的:goroutines(“参与者”——如果你喜欢的话)之间需要太多的同步点,因此它们不能并行地自由漫游。
I wrote an Eratosthenesque concurrent prime sieve using the Go and channels.
Here is the code: http://github.com/aht/gosieve
I blogged about it here: http://blog.onideas.ws/eratosthenes.go
The program can sieve out the first million primes (all primes upto 15,485,863) in about 10 seconds. The sieve is concurrent, but the algorithm is mainly synchronous: there are far too many synchronization points required between goroutines ("actors" -- if you like) and thus they can not roam freely in parallel.
您可以找到四种不同的 Erlang 实现来查找素数(其中两个基于埃拉托斯特尼筛法)此处。 此链接还包含比较 4 种解决方案性能的图表。
You can find four different Erlang implementations for finding prime numbers (two of which are based on the Sieve of Eratosthenes) here. This link also contains graphs comparing the performance of the 4 solutions.
埃拉托斯特尼筛法相当容易实现,但正如您所发现的那样,它并不是最有效的。 您尝试过阿特金筛吗?
阿特金筛@维基百科
The Sieve of Eratosthenes is fairly easy to implement but -- as you have discovered -- not the most efficient. Have you tried the Sieve of Atkin?
Sieve of Atkin @ Wikipedia
两个快速单进程 Erlang 素数生成器; sprimes 在大约 2.7 秒内生成 2m 以下的所有素数,fprimes 在我的计算机上生成大约 3 秒(配备 2.4 GHz Core 2 Duo 的 Macbook)。 两者都基于埃拉托斯特尼筛法,但由于 Erlang 最适合列表而不是数组,因此两者都保留一个非消除素数列表,检查当前头的整除性并保留已验证素数的累加器。 两者都还实现了一个质数轮来对列表进行初始缩减。
懒惰:懒惰/ 1和懒惰:下一个/ 1指的是伪惰性无限列表的简单实现:
通过筛子生成素数并不是并发的好地方(但它可以使用并行性来检查整除性,尽管该操作不是足够复杂,足以证明我迄今为止编写的所有并行过滤器的额外开销是合理的)。
`
Two quick single-process erlang prime generators; sprimes generates all primes under 2m in ~2.7 seconds, fprimes ~3 seconds on my computer (Macbook with a 2.4 GHz Core 2 Duo). Both are based on the Sieve of Eratosthenes, but since Erlang works best with lists, rather than arrays, both keep a list of non-eliminated primes, checking for divisibility by the current head and keeping an accumulator of verified primes. Both also implement a prime wheel to do initial reduction of the list.
lazy:lazy/1 and lazy:next/1 refer to a simple implementation of pseudo-lazy infinite lists:
Prime generation by sieves is not a great place for concurrency (but it could use parallelism in checking for divisibility, although the operation is not sufficiently complex to justify the additional overhead of all parallel filters I have written thus far).
`
我喜欢欧拉计划。
关于素数生成器的主题,我是埃拉托色尼筛法的忠实粉丝。
对于 2,000,000 以下的数字,您可以尝试简单的 isPrime 检查实现。 我不知道你会如何在 erlang 中做到这一点,但逻辑很简单。
c# 在 1 分钟内运行了 2,000,000 个这样的列表
编辑: 顺便说一句,埃拉托色尼筛可以轻松实现并且运行速度很快,但是当你开始时会变得笨拙进入庞大的名单。 最简单的实现,使用布尔数组和 int 值,运行速度非常快。 问题是您开始遇到值大小和数组长度的限制。 -- 切换到字符串或位数组实现会有所帮助,但您仍然面临着以大值迭代列表的挑战。
I love Project Euler.
On the subject of prime generators, I am a big fan of the Sieve of Eratosthenes.
For the purposes of the numbers under 2,000,000 you might try a simple isPrime check implementation. I don't know how you'd do it in erlang, but the logic is simple.
c# ran a list like this for 2,000,000 in well under the 1 minute mark
Edit: On a side note, the sieve of Eratosthenes can be implemented easily and runs quickly, but gets unwieldy when you start getting into huge lists. The simplest implementation, using a boolean array and int values runs extremely quickly. The trouble is that you begin running into limits for the size of your value as well as the length of your array. -- Switching to a string or bitarray implementation helps, but you still have the challenge of iterating through your list at large values.
欧拉计划问题(我想说的是前 50 个问题中的大多数,如果不是更多的话)主要是关于蛮力,以及在选择边界时的独创性。
请记住测试任何 N 是否为素数(通过暴力),您只需要查看它是否可以被任何素数整除到 Floor(sqrt(N)) + 1,而不是 N/2。
祝你好运
Project Euler problems (I'd say most of the first 50 if not more) are mostly about brute force with a splash of ingenuity in choosing your bounds.
Remember to test any if N is prime (by brute force), you only need to see if its divisible by any prime up to floor(sqrt(N)) + 1, not N/2.
Good luck
这是一个VB版本
here is a vb version