当前位置：文江博客话题详情

在java对象中存储大十六进制数（md5）的最有效方法

发布于 2024-11-18 02:23:29 字数 346 浏览 2 评论 0原文

考虑以下用例，将文件的 MD5 总和存储在 java（或 groovy）对象中的最有效方法（性能和存储空间最佳）是什么：

我需要与数千个进行比较其他 md5 和。
我可能需要将其存储在 HSQLDB 中，以便可以根据 md5 提取记录/group by
可以将其存储在 Map 中作为

我试图避免存储的键它作为 String 作为字符串比较将更加昂贵并且占用更多空间。 BigInteger(string,radix) 会更高效吗？另外，如果持久化到数据库应该选择什么数据类型？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

表情可笑 2024-11-25 02:23:29

创建一个包装 byte[] 且不提供突变的类。如果您想将其用作映射中的键，那么它需要具有可比性或具有哈希码。使用 byte[]，您可以更轻松地从前 32 位计算简单的哈希码。

回复收藏 0 原文

∞梦里开花 2024-11-25 02:23:29

为了在 Java 中比较速度，将其存储为两个 long 值可能是最快的。对于持久性来说，如果您的数据库和持久性工具支持的话，存储为字节数组是最有意义的。否则，存储为十六进制或 Base-64 编码文本是相当常见的，并且可以与访问同一数据库的其他应用程序良好地互操作。

回复收藏 0 原文

怎樣才叫好 2024-11-25 02:23:29

如果需要执行大量比较，可以将 MD5 值存储为 2 个长整数，这样您最多只需要执行 4 次逻辑运算即可与另一个 MD5 值进行检查。

基本上，提供一个接受输入的类，原始摘要数据为 byte[] 并使用

ByteBuffer bb = ByteBuffer.wrap(digestData);
long[] bits = new long[] {
    bb.getLong(),
    bb.getLong()
};

与另一个 long[] MD5 数组进行比较，并使用

boolean eq = ((bits[0]^otherBits[0]) | (bits[1]^otherBits[1])) == 0);

Reconstruct the MD5 with

ByteBuffer bb = ByteBuffer.allocate(16);
bb.putLong(bits[0]);
bb.putLong(bits[1]);

byte[] digestData = new byte[16];
bb.get(digestData);

注意：我并不建议每次比较时都将 byte[] 转换为 long[]，这只是存储摘要的方法进行比较。最后一个重建片段是可选的，您应该将数据保留为 byte[] 并仅比较 long[] 数组。在数据库中，将数据存储为 32 字节的十六进制值。

If you need to perform a lot of comparisons, you could store the MD5 value as 2 long integers, that way you only need to perform at most 4 logical operations to check against another MD5 value.

Basically, provide a class that will accept an input, a raw digest data as byte[] and use

ByteBuffer bb = ByteBuffer.wrap(digestData);
long[] bits = new long[] {
    bb.getLong(),
    bb.getLong()
};

Compare with another long[] MD5 array with

boolean eq = ((bits[0]^otherBits[0]) | (bits[1]^otherBits[1])) == 0);

Reconstruct the MD5 with

ByteBuffer bb = ByteBuffer.allocate(16);
bb.putLong(bits[0]);
bb.putLong(bits[1]);

byte[] digestData = new byte[16];
bb.get(digestData);

Note : I am not suggesting to convert the byte[] into long[] for every comparisons, this is simply how to store the digest for comparisons. The last reconstruction snippet is optional, you should keep the data as byte[] and compare the long[] arrays only. In the database, store the data as a 32 bytes hexadecimal value.

回复收藏 0 原文

~没有更多了~