散列方法允许增加桶的数量，而不会弄乱之前的数据映射

发布于 2024-11-04 08:16:58 字数 738 浏览 1 评论 0原文

是否有一种算法/方法可以让我在不重建数据/重新散列的情况下增加存储桶的数量。

实践中的问题： 假设您有一群由字符串“用户名”标识的用户。然后，您将这些“用户名”散列到存储桶列表中。

This is done by something like:
String username = "user";
int index = username.hash();
int bucketIndex = index % bucketlist.size();

因此，在这个方案中，如果我想增加“桶”的数量，那么还需要移动桶中的数据。这样它就可以与用不同的数字取模得到的新桶索引相匹配。

这实际上只是一个映射。在哪里可以找到属于给定用户的存储桶。

可能的愚蠢解决方案： 具有旧的存储桶尺寸和新的存储桶尺寸。然后尝试在两个桶中查找。然后使用 new bucketlist.size() 慢慢移动所有用户，使其匹配。这不需要在散列和移动时完全停止。

需要什么： 确实是所有用户的举动都不好。在许多桶中寻找正确的桶也并不理想。

重点是能够仅通过使用算法来确定要使用列表中的哪个存储桶。

并且不可能将存储桶列表的大小作为用户名的一部分。

如果它的作用大致相同，则不需要像这里所做的那样进行散列。

我不知道这个问题是否有任何明智的答案......

原文

Is there an algorithm/method that lets me increase the number of buckets without rebuilding the data/ re-hashing.

The problem in practice:
Say you have a bunch of users that are identified by a string, "username".
Then you hash these "usernames" to a list of buckets.

This is done by something like:
String username = "user";
int index = username.hash();
int bucketIndex = index % bucketlist.size();

So in this scheme if I one would want to increase the number of "buckets", one would also need to move the data in the buckets. So that it matches the new bucket index that one gets with doing modulo with a different number.

This is really just a mapping. Where to find the bucket that belongs to a given user.

Possible dumb solutions:
Have both the old bucket size and the new bucket size. And then try to look in two buckets.
Then slowly move all the users so that it matches by using new bucketlist.size(). This would not require a total stop, while hashing and moving.

What's needed:
It is really the moving of all users that is bad. And looking in many buckets to find the correct one is also not ideal.

And the whole point is to be able to pinpoint which bucket in the list to use just by using an algorithm.

And it is not possible to have the size of the bucket list as part of the username.

It does not need to be hashing like it is done here if it roughly does the same.

I don't know if there is any sensible answer to this...

分享到QQ

分享到微博