ByteBuffer 的 HashSet（实际上是整数）来分隔唯一的 & ByteBuffer 数组中的非唯一元素

发布于 2024-10-23 00:09:34 字数 407 浏览 2 评论 0原文

我有一个 ByteBuffer 数组（实际上代表整数）。我想要单独的 unique &数组中非唯一的 ByteBuffer（即整数）。因此我使用这种类型的 HashSet：
HashSet; columnsSet = new HashSet()

只是想知道 HashSet 是否是一个好方法？当我为 ByteBuffer 这样做时，我是否会付出更多的成本，而不是为 Integer 这样做？

（实际上，我正在从数据库读取序列化数据，需要在此操作后写回，因此我想避免字节缓冲区到整数之间的序列化和反序列化！）

您对此的想法表示赞赏。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

凉城凉梦凉人心 2024-10-30 00:09:34

创建 ByteBuffer 比从重用的 ByteBuffer 中读取/写入要昂贵得多。

存储整数最有效的方法是使用int类型。如果您想要一组这些，您可以使用使用 int 原语的 TIntHashSet。您可以使用 O(1) 预分配对象执行多次读取/反序列化/存储和反向操作。

回复收藏 0 原文

烟若柳尘 2024-10-30 00:09:34

首先，它会起作用。两个 ByteBuffer 上的 equals() 开销肯定会更高，但可能不足以抵消不必反序列化的好处（不过，我并不完全认为确定这是否会是一个大问题）。

我很确定性能将渐近相同，但更节省内存的解决方案是对数组进行排序，然后线性地遍历它并测试连续元素的相等性。

举个例子，假设您的缓冲区包含以下内容：

1 2 5 1

对它进行排序：

1 1 2 5

开始迭代后，您会得到 ar[0].equals(ar[1]) 并且您知道这些是重复项。继续这样直到n-1。

First of all, it will work. The overhead of equals() on two ByteBuffers will definitely be higher, but perhaps not enough to offset the benefits of not having to deserialize (though, I'm not entirely sure if that would be such a big problem).

I'm pretty sure that the performance will asymptotically be the same, but a more memory-efficient solution is to sort your array, then step through it linearly and test successive elements for equality.

An example, suppose your buffers contain the following: