快速计算 (a*b) mod c for c=2^N +-1

发布于 2024-07-17 11:46:36 字数 581 浏览 15 评论 0原文

在 32 位整数数学中，加法和乘法的基本数学运算是隐式计算 mod 2^32 的，这意味着您的结果将是加法或乘法的最低位。

如果您想使用不同的模数计算结果，您当然可以使用不同语言中任意数量的 BigInt 类。对于值 a,b,c < 2^32 你可以计算 64 位长整数的中间值，并使用内置的 % 运算符来减少到正确的答案

但我被告知，当 C 是时，有一些特殊的技巧可以有效地计算 a*b mod C形式 (2^N)-1 或 (2^N)+1，不使用 64 位数学或 BigInt 库，并且非常高效，比任意模数评估更高效，并且还可以正确计算通常情况下的情况如果包含中间乘法，则会溢出 32 位 int。

不幸的是，尽管听说这种特殊情况有一个快速评估方法，但我实际上还没有找到该方法的描述。 “这不是高德纳的作品吗？” “这不是维基百科上的某个地方吗？” 是我听到的咕哝。

这显然是随机数生成器中的一种常见技术，随机数生成器执行 a*b mod 2147483647 的乘法，因为 2147483647 是等于 2^31 -1 的质数。

那我就请教一下专家吧。我找不到任何讨论的这种巧妙的特殊情况乘以 mod 方法是什么？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

碍人泪离人颜 2024-07-24 11:46:36

我认为技巧如下（我将以 10 为基数进行计算，因为这样更容易，但原理应该成立）

假设您要乘以 a*b mod 10000-1，

a = 1234 = 12 * 100 + 34
b = 5432 = 54 * 100 + 32

现在a*b = 12 * 54 * 10000 + 34 * 54 * 100 + 12 * 32 * 100 + 34 * 32

12 * 54 * 10000 =  648 * 10000
34 * 54 * 100   = 1836 * 100
12 * 32 * 100   =  384 * 100
34 * 32         = 1088

自 x * 10000 ≡ x (mod 10000-1) > [1]，第一项和最后一项变为648+1088。第二项和第三项是“技巧”出现的地方。请注意：

1836 = 18 * 100 + 36
1836 * 100 ≡ 18 * 10000 + 3600 ≡ 3618 (mod 10000-1).

这本质上是一个循环移位。给出结果 648 + 3618 + 8403 + 1088。还要注意，在所有情况下，相乘的数字都 < 10000（因为 a < 100 且 b < 100），因此如果您只能将多个 2 位数字组合在一起并将它们相加，那么这是可以计算的。

在二进制中，也会有类似的结果。

从a和b开始，都是32位。假设您想将它们相乘 mod 2^31 - 1，但您只有 16 位乘法器（给出 32 位）。该算法将是这样的：

 a = 0x12345678
 b = 0xfedbca98
 accumulator = 0
 for (x = 0; x < 32; x += 16)
     for (y = 0; y < 32; y += 16)
         // do the multiplication, 16-bit * 16-bit = 32-bit
         temp = ((a >> x) & 0xFFFF) * ((b >> y) & 0xFFFF)

         // add the bits to the accumulator, shifting over the right amount
         total_bits_shifted = x + y
         for (bits = 0; bits < total_bits_shifted + 32; bits += 31)
             accumulator += (temp >> (bits - total_bits_shifted)) & 0x7FFFFFFF

         // do modulus if it overflows
         if (accumulator > 0x7FFFFFFFF)
             accumulator = (accumulator >> 31) + (accumulator & 0x7FFFFFFF);

已经晚了，所以累加器部分可能无法工作。我认为原则上这是对的。有人可以随意编辑此内容以使其正确。

展开后，这也相当快，我猜这就是 PRNG 使用的方法。

[1]: x*10000 ≡ x*(9999+1) ≡ 9999*x + x ≡ x (mod 9999)

I think the trick is the following (I'm going to do it in base 10, because it's easier, but the principle should hold)

Suppose you are multiplying a*b mod 10000-1, and

a = 1234 = 12 * 100 + 34
b = 5432 = 54 * 100 + 32

now a*b = 12 * 54 * 10000 + 34 * 54 * 100 + 12 * 32 * 100 + 34 * 32

12 * 54 * 10000 =  648 * 10000
34 * 54 * 100   = 1836 * 100
12 * 32 * 100   =  384 * 100
34 * 32         = 1088

Since x * 10000 ≡ x (mod 10000-1) [1], the first and last terms become 648+1088. The second and third terms are where the 'trick' come in. Note that:

1836 = 18 * 100 + 36
1836 * 100 ≡ 18 * 10000 + 3600 ≡ 3618 (mod 10000-1).

This is essentially a circular shift. Giving the results of 648 + 3618 + 8403 + 1088. And also note that in all cases, the multiplied numbers are < 10000 (since a < 100 and b < 100), so this is calculable if you only could multiple 2 digit numbers together, and add them.

In binary, it's going to work out similarly.

Start with a and b, both are 32 bits. Suppose you want to multiply them mod 2^31 - 1, but you only have a 16 bit multiplier (giving 32 bits). The algorithm would be something like this:

 a = 0x12345678
 b = 0xfedbca98
 accumulator = 0
 for (x = 0; x < 32; x += 16)
     for (y = 0; y < 32; y += 16)
         // do the multiplication, 16-bit * 16-bit = 32-bit
         temp = ((a >> x) & 0xFFFF) * ((b >> y) & 0xFFFF)

         // add the bits to the accumulator, shifting over the right amount
         total_bits_shifted = x + y
         for (bits = 0; bits < total_bits_shifted + 32; bits += 31)
             accumulator += (temp >> (bits - total_bits_shifted)) & 0x7FFFFFFF

         // do modulus if it overflows
         if (accumulator > 0x7FFFFFFFF)
             accumulator = (accumulator >> 31) + (accumulator & 0x7FFFFFFF);

It's late, so the accumulator part of that probably won't work. I think in principle it's right though. Someone feel free to edit this to make it right.

Unrolled, this is pretty fast, as well, which is what the PRNG use, I'm guessing.

[1]: x*10000 ≡ x*(9999+1) ≡ 9999*x + x ≡ x (mod 9999)

回复收藏 0 原文

一笔一画续写前缘 2024-07-24 11:46:36

假设您可以将 a*b 计算为 p*2^N+q。这可能需要 64 位计算，或者您可以将 a 和 b 分成 16 位部分并在 32 位上计算。

然后 a*b mod 2^N-1 = p+q mod 2^N-1 因为 2^N mod 2^N-1 = 1。

并且a*b mod 2^N+1 = -p+q mod 2^N+1 因为2^N mod 2^N+1 = -1。

在这两种情况下，都不会被 2^N-1 或 2^N+1 除。

回复收藏 0 原文

百变从容 2024-07-24 11:46:36

快速搜索发现了这个： http://home.pipeline.com/ ~hbaker1/AB-mod-N.pdf。不幸的是，对我来说已经太晚了，无法充分理解这一点，只能写下简化的公式，但它可能在那篇论文的某个地方。

回复收藏 0 原文

反差帅 2024-07-24 11:46:36

您可以使用蒙哥马利缩减（还有其他描述）以减少模乘计算的成本。不过，这仍然没有使用 N 是正/负 2 的幂的属性。

回复收藏 0 原文

_蜘蛛 2024-07-24 11:46:36

您要查找的恒等式为 x mod N = (x mod 2^q)- c*floor(x/2^q)，假设 N = 2^q + c 且 c 是任意整数（但通常为 ±1）。

您可能想阅读 Richard Crandall 和 Carl Pomerance 所著的《素数：计算视角》中的第 9.2.3 节：“特殊形式的模”。除了理论之外，它还包含实现上述关系的算法的伪代码。

回复收藏 0 原文

淡淡绿茶香 2024-07-24 11:46:36

我找到了关于这个主题的相当广泛的页面，不仅仅是讨论算法甚至是问题和解决方案的具体历史以及人们使用该解决方案的方式。

回复收藏 0 原文

~没有更多了~

关于作者

新一帅帅

暂无简介

文章

26 人气

关注发私信

友情链接

文江博客

快速计算 (a*b) mod c for c=2^N +-1

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（6）

关于作者

相关话题

热门标签

推荐作者

佚名

羁客

天天爱笑的徐老师

星

夏日落

隐诗

友情链接

快速计算 (a*b) mod c for c=2^N +-1

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（6）

关于作者

相关话题

热门标签

推荐作者

佚名

羁客

天天爱笑的徐老师

星

夏日落

隐诗

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。