浮点算术 - Double 类型的模运算符

发布于 2024-09-03 02:56:22 字数 1112 浏览 1 评论 0原文

所以我试图找出为什么模运算符返回如此大的异常值。

如果我有代码：

double result = 1.0d % 0.1d;，

它将给出0.099999999999999995的结果。我期望值为 0

请注意，使用除法运算符不存在此问题 - double result = 1.0d / 0.1d;

将给出结果 10.0，这意味着余数应该为 0 >。

让我明确一点：我对错误的存在并不感到惊讶，令我惊讶的是与实际的数字相比，错误是如此之大。 0.0999 ~= 0.1 并且 0.1 与 0.1d 处于同一数量级，仅与 1.0d 相差一个数量级。它不像你可以将它与 double.epsilon 进行比较，或者说“如果其差异 < 0.00001，则它相等”。

我在 StackOverflow 上阅读了有关此主题的以下文章 one 两个三个，等等。

谁能建议解释一下为什么这个错误这么大？任何避免将来遇到问题的建议（我知道我可以使用十进制代替，但我担心它的性能）。

编辑：我应该特别指出，我知道 0.1 是一个无限以二进制重复一系列数字 - 这有什么关系吗？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

中二柚 2024-09-10 02:56:22

出现该错误的原因是 double 无法精确表示 0.1——它可以表示的最接近值是 0.100000000000000005551115123126。现在，当你用 1.0 除以它时，它会得到一个略小于 10 的数字，但双精度数又不能准确表示它，因此它最终会向上舍入到 10。但是当你执行 mod 时，它可以给你稍微小于 10 的数字。余数小于 0.1。

由于 0 = 0.1 mod 0.1，mod 中的实际误差是 0.1 - 0.09999999... -- 非常小。

如果将 % 运算符的结果加上 9 * 0.1，它将再次为您提供 1.0。

编辑

有关舍入内容的更多详细信息 - 特别是因为这个问题是混合精度危险的一个很好的例子。

浮点数的 a % b 计算方式通常为 a - (b * Floor(a/b))。问题是，它可能会以比这些操作更高的内部精度一次性完成（并且在每个阶段将结果四舍五入为 fp 数字），因此它可能会给您带来不同的结果。很多人看到的一个例子是，Intel x86/x87 硬件使用 80 位精度进行中间计算，而仅使用 64 位精度计算内存中的值。因此，上面等式中 b 中的值来自内存，因此是一个不完全是 0.1 的 64 位 fp 数字（感谢 dan04 提供了精确值），因此当它计算 1.0/0.1 时，得到 9.999999999999999944488848768742172978818416595458984375 （四舍五入到 80 位）。现在，如果将其四舍五入为 64 位，则为 10.0，但如果保留 80 位内部并对其进行取整，则它会截断为 9.0，从而得到 .0999999999999999500399638918679556809365749359130859375 作为最终答案。

因此，在这种情况下，您会看到一个很大的明显错误，因为您使用的是非连续阶跃函数（下限），这意味着内部值的微小差异可能会导致您超过该阶跃。但由于 mod 本身是一个不连续的阶跃函数，这是可以预料的，这里的实际误差是 0.1-0.0999...因为 0.1 是 mod 函数范围内的不连续点。

The error comes about because a double can't exactly represent 0.1 -- the closest it can represent is something like 0.100000000000000005551115123126. Now when you divide 1.0 by that it gives you a number slightly less than 10, but again a double can't exactly represent it, so it ends up rounding up to 10. But when you do the mod, it can give you that slightly less than 0.1 remainder.

since 0 = 0.1 mod 0.1, the actual error in the mod is 0.1 - 0.09999999... -- very small.

If you add the result of the % operator to 9 * 0.1, it will give you 1.0 again.

Edit

A bit more detail on the rounding stuff -- particularly as this problem is a good example of the perils of mixed precision.

The way a % b for floating point numbers is usually computed is as a - (b * floor(a/b)). The problem is that it may be done all at once with more internal precision than you'd get with those operations (and rounding the result to a fp number at each stage), so it might give you a different result. One example that a lot of people see is with the Intel x86/x87 hardware is using 80-bit precision for intermediate computations and only 64-bit precision for values in memory. So the value in b in the equation above is coming from memory and is thus a 64-bit fp number that's not quite 0.1 (thank dan04 for the exact value), so when it computes 1.0/0.1 it gets 9.99999999999999944488848768742172978818416595458984375 (rounded to 80 bits). Now if you round that to 64 bits, it would be 10.0, but if you keep the 80 bit internal and do the floor on it, it truncates to 9.0 and thus gets .0999999999999999500399638918679556809365749359130859375 as the final answer.

So in this case, you're seeing a large apparent error because you're using a noncontinuous step function (floor) which means that a very tiny difference in an internal value can push you over the step. But since mod is itself a noncontinuous step function thats to be expected and the real error here is 0.1-0.0999... as 0.1 is the discontinuous point in the range of the mod function.

回复收藏 0 原文