Python - 为什么使用 uuid4() 以外的任何东西来表示唯一字符串？

发布于 2024-08-25 03:15:44 字数 668 浏览 11 评论 0原文

我看到一些针对上传图像名称、会话 ID 等的唯一字符串生成的实现已被放弃，其中许多都使用 SHA1 或其他哈希值。

我并不是质疑使用这样的自定义方法的合法性，而只是质疑原因。如果我想要一个唯一的字符串，我只需这样说：

>>> import uuid
>>> uuid.uuid4()
UUID('07033084-5cfd-4812-90a4-e4d24ffb6e3d')

我就完成了。在阅读 uuid 之前我并不是很信任，所以我这样做了：

>>> import uuid
>>> s = set()
>>> for i in range(5000000):  # That's 5 million!
>>>     s.add(str(uuid.uuid4()))
...
...
>>> len(s)
5000000

没有一个中继器（考虑到赔率是 1.108e+50，我不希望有一个中继器，但看到它的实际应用令人欣慰）。通过组合 2 个 uuid4() 来创建字符串，您甚至可以将成功率降低一半。

那么，话虽如此，为什么人们花时间在 random() 和其他独特字符串等上？关于 uuid 是否存在重要的安全问题或其他问题？

原文

I see quit a few implementations of unique string generation for things like uploaded image names, session IDs, et al, and many of them employ the usage of hashes like SHA1, or others.

I'm not questioning the legitimacy of using custom methods like this, but rather just the reason. If I want a unique string, I just say this:

>>> import uuid
>>> uuid.uuid4()
UUID('07033084-5cfd-4812-90a4-e4d24ffb6e3d')

And I'm done with it. I wasn't very trusting before I read up on uuid, so I did this:

>>> import uuid
>>> s = set()
>>> for i in range(5000000):  # That's 5 million!
>>>     s.add(str(uuid.uuid4()))
...
...
>>> len(s)
5000000

Not one repeater (I wouldn't expect one now considering the odds are like 1.108e+50, but it's comforting to see it in action). You could even half the odds by just making your string by combining 2 uuid4()s.

So, with that said, why do people spend time on random() and other stuff for unique strings, etc? Is there an important security issue or other regarding uuid?

分享到QQ

分享到微博