What is the total number of unique values for a double in the range [0.0, 1.0)?

Posted 2024-10-23 14:40:26, 925 words, 10 views, 0 comments

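Random.NextDouble() (a Double in the range [0.0, 1.0)) is sometimes multiplied by a large Int64 (let Int64 big = 9000000000L), and the result is floored to obtain a random Int64 value larger than what can be obtained from Random.Next() (an Int32 in the range [0, Int32.MaxValue)).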

Random r = new Random();
long big = 9000000000L;
long answer = (long) (r.NextDouble() * big);

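It seems to me that the total number of unique values for a Double in the range [0.0, 1.0) provides an upper bound on the number of unique Int64 values it can possibly generate. A loose upper bound, in fact, as many different Doubles will map to the same Int64.

Hence, I would like to know: what is the total number of unique values for a double in the range [0.0, 1.0)?

Even better if you can tell me the largest value "big" can take so that "answer" can be any value in the range [0, big), and whether the distribution of values of "answer" is uniform, assuming that Random.NextDouble() is uniform.

Edit: Double (double) here refers to an IEEE 754 double-precision floating-point value, while Int64 (long) and Int32 (int) refer to 64-bit and 32-bit signed two's complement integers respectively.

Inspired by this question: Generating 10 digits unique random number in java

While I used C#, this question is language-agnostic and is more about discrete mathematics than programming. It bothers me not mainly out of mathematical curiosity, but as a programmer who wants to use a formula only if it does what it is supposed to do, including from a security viewpoint.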



油焖大侠 2024-10-30 14:40:26

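IEEE-754 doubles have 11 bits of exponent and 52 bits of mantissa. Assuming the sign bit is 0 (positive), a value whose exponent field is in the range 0x001 to 0x3FE is a normal floating-point number between 0 and 1. The mantissa is interpreted with a leading 1 that is not stored. For each of these 0x3FE exponent values there are 2^52 mantissa values. In addition, if the exponent field is 0x000, the mantissa is interpreted without that leading 1, but as if the exponent were 0x001. That gives a total of 0x3FF = 1023 exponent values for which every mantissa is valid, and therefore 1023*2^52 values in total. Negative 0 may also be counted, which adds one more.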

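If random doubles were drawn uniformly from all of these representable values, multiplying to produce an Int64 would indeed be biased. However, any reasonable random library approximates a uniform distribution on [0, 1), and converting that to an Int64 should not introduce bias. The largest value of "big" that still allows every integer in [0, big) to be produced is 2^53, since the 2^52 numbers between 1/2 and 1 are spaced 2^(-53) apart. In practice, though, these numbers are often produced by dividing a random integer by the size of the integer range (usually Int32), which means you cannot actually obtain more distinct values than that source provides. Consider combining two Int32 values directly, for example by shifting one left by 32 bits and merging them into an Int64. (Be careful, though: the generator's state space may be only 32 bits.)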


烟─花易冷 2024-10-30 14:40:26

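As a corollary to your question: the C# Random class internally uses a generator that "gives it" numbers between 0 and Int32.MaxValue - 1. It then divides that number by Int32.MaxValue (technically, it multiplies by the reciprocal of that number) to return a double. So in C#, only Int32.MaxValue distinct doubles can be returned (one for each of 0...Int32.MaxValue - 1).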


俏︾媚 2024-10-30 14:40:26

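IEEE 754 is pretty clear on the precision of doubles:

http://en.wikipedia.org/wiki/IEEE_754-2008

You have 52 bits of precision plus one additional assumed bit.

Your exponents run from -1022 to 1023, which is about 11 bits, including a sign.

The 64th bit is the overall sign of the number.

We'll ignore subnormal numbers.

You're asking about exponents between -1022 and 0. This means about 10 of the 11 exponent bits are available to you.

You have 52+1 bits of mantissa available.

That is about 62 bits of usable precision, enough to represent roughly 2**62 distinct values.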


能否归途做我良人 2024-10-30 14:40:26

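@wnoise pretty much nailed it, but here's my two cents.

IEEE floats can be compared and incremented as integers, with some restrictions; see this question for details. So, if we reinterpret +0.0 and 1.0 as 64-bit integers, we get the number of steps between zero and one: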

#include <cstdint>
#include <cstring>
#include <iostream>

int main()
{
        double zero = 0.0;
        double one = 1.0;
        std::uint64_t z, o;
        // Copy the bit patterns with memcpy: dereferencing a reinterpret_cast
        // pointer to an unrelated type violates the strict aliasing rules.
        std::memcpy(&z, &zero, sizeof z);
        std::memcpy(&o, &one, sizeof o);
        std::cout << z << std::endl;
        std::cout << o << std::endl;
}

This gives me 0 and 4607182418800017408, respectively, i.e. there are 4607182418800017408 unique double values in the range [0.0, 1.0).

