为什么将整数转换为 float16 很危险？

发布于 2025-01-13 11:16:33 字数 426 浏览 2 评论 0原文

我最近遇到了一个令人惊讶且恼人的错误，其中我将整数转换为 float16 并且值发生了变化：

>>> import numpy as np
>>> np.array([2049]).astype(np.float16)
array([2048.], dtype=float16)
>>> np.array([2049]).astype(np.float16).astype(np.int32)
array([2048.], dtype=int32)

这可能不是一个错误，因为它也发生在 PyTorch 上。我猜它与半浮点表示有关，但我无法弄清楚为什么 2049 是第一个被错误转换的整数。

这个问题与Python并不特别相关（我猜）

原文

I have run recently into a surprising and annoying bug in which I converted an integer into a float16 and the value changed:

>>> import numpy as np
>>> np.array([2049]).astype(np.float16)
array([2048.], dtype=float16)
>>> np.array([2049]).astype(np.float16).astype(np.int32)
array([2048.], dtype=int32)

This is likely not a bug, because it happens also for PyTorch. I guess it is related to half-float representation, but I couldn't figure out why 2049 is the first integer that is badly casted.

The question is not specially related to Python (I guess)

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

救星 2025-01-20 11:16:33

你是对的，它通常与浮点数的定义方式有关（正如其他人所说，在 IEEE 754 中）。让我们看一下：

浮点数由符号 s（此处为 1 位）、尾数 m（此处为 10 位）和指数 e（此处为 5 位，即 −14 ≤ e ≤ 15）表示。然后计算浮点数 x，

x=s*[1].m*b**e,

其中基 b 为 2，[1] 为固定（免费）位。

最多 2**11 我们的整数可以用尾数精确表示，其中

2** 11-1 表示为 m = bin(2**10-1) 和 e = bin(10)
2**11 表示用 m = bin(0) 和 e = bin(11) 表示

，那么事情就变得有趣了：

2**11+1 不能用我们的尾数精确表示，并且是四舍五入的。
2**11+2 可以表示（通过 m = bin(0) 和 e = bin(11)）

等等...

观看此视频以获取详细示例https://www.youtube.com/watch?v=L8OYx1I8qNg

You are right, its in general related to how floating-point numbers are defined (In IEEE 754 as others said). Lets look into it:

The float is represented by a sign s (here 1 bit), a mantissa m (here 10 bits) and an exponent e (here 5 bits for −14 ≤ e ≤ 15). The float x is then calculated by