如果 SHA-1 哈希值只有 160 位，为什么它却有 40 个字符长？

发布于 2024-09-19 06:13:45 字数 206 浏览 15 评论 0 原文

问题的标题说明了一切。我一直在研究 SHA-1，在大多数地方我看到它是 40 个十六进制字符长，对我来说是 640 位。难道不能只用 10 个十六进制字符来表示 160bit = 20byte 吗？一个十六进制字符可以代表 2 个字节，对吗？为什么它是所需时间的两倍？我的理解中缺少什么。

如果使用 Base32 或 Base36 ，SHA-1 甚至不能只有 5 个或更少的字符吗？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

我ぃ本無心為│何有愛 2024-09-26 06:13:45

一个十六进制字符只能表示16个不同的值，即4位。 (16 = 2⁴)

40 × 4 = 160。

不，您需要 5 个以上的 36 进制字符。

总共有 2¹⁶⁰ 种不同的 SHA-1 哈希值。

2¹⁶⁰ = 16⁴⁰，所以这是我们需要 40 个十六进制数字的另一个原因。

但是 2¹⁶⁰ = 36^{160 log₃₆2} = 36^30.9482...，所以你仍然需要 31 个字符使用base-36。

回复收藏 0 原文

倾城月光淡如水﹏ 2024-09-26 06:13:45

我认为OP的困惑来自于表示SHA1哈希值的字符串需要40个字节（至少如果您使用ASCII），这等于320位（而不是640位）。

原因是哈希值是二进制的，而十六进制字符串只是其编码。因此，如果您要使用更有效的编码（或根本不编码），则只能占用 160 位空间（20 字节），但问题是它不是二进制安全的。

不过，您可以使用 base64，在这种情况下，您需要大约 27-28 个字节（或字符）而不是 40 个（请参阅

回复收藏 0 原文

最冷一天 2024-09-26 06:13:45

每个 8 位字节有两个十六进制字符，而不是每个十六进制字符有两个字节。

如果您使用 8 位字节（如 SHA-1 定义中所示），则十六进制字符会对字节内的单个高或低 4 位半字节进行编码。所以一个完整的字节需要两个这样的字符。

回复收藏 0 原文

云胡 2024-09-26 06:13:45

我的答案与之前的答案仅在我的理论中关于OP混乱的确切起源以及我提供的阐明的婴儿步骤中有所不同。

一个字符根据使用的编码占用不同的字节数（原因如下）。因此，40 个 Java 字符等于 80 个字节 = 640 位，OP 的计算和 10 个 Java 字符确实会封装 SHA-1 哈希的正确信息量。

然而，与数千个可能的 Java 字符不同，只有 16 个不同的十六进制字符，即 0、1、2、3、4、5、6、7、8、9、A、B、 C、D、E 和 F。但这些与 Java 字符不一样，并且比 Java 字符 0 到 9 和 A 到 F 的编码占用的空间要少得多。它们是表示仅由以下字符表示的所有可能值的符号。 4 位：

0  0000    4  0100    8  1000    C  1100
1  0001    5  0101    9  1001    D  1101
2  0010    6  0110    A  1010    E  1110
3  0011    7  0111    B  1011    F  1111

因此每个十六进制字符只有半个字节，40 个十六进制字符为我们提供了 20 个字节 = 160 位 - SHA-1 哈希的长度。

My answer only differs from the previous ones in my theory as to the EXACT origin of the OP's confusion, and in the baby steps I provide for elucidation.

A character takes up different numbers of bytes depending on the encoding used (see here). There are a few contexts these days when we use 2 bytes per character, for example when programming in Java (here's why). Thus 40 Java characters would equal 80 bytes = 640 bits, the OP's calculation, and 10 Java characters would indeed encapsulate the right amount of information for a SHA-1 hash.

Unlike the thousands of possible Java characters, however, there are only 16 different hex characters, namely 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D, E and F. But these are not the same as Java characters, and take up far less space than the encodings of the Java characters 0 to 9 and A to F. They are symbols signifying all the possible values represented by just 4 bits:

0  0000    4  0100    8  1000    C  1100
1  0001    5  0101    9  1001    D  1101
2  0010    6  0110    A  1010    E  1110
3  0011    7  0111    B  1011    F  1111

Thus each hex character is only half a byte, and 40 hex characters gives us 20 bytes = 160 bits - the length of a SHA-1 hash.

回复收藏 0 原文