我需要一种算法,可以将一个 32 位有符号整数一对一映射(即无冲突)到另一个 32 位有符号整数。
我真正关心的是足够的熵,以便函数的输出看起来是随机的。基本上我正在寻找一种类似于 XOR 密码的密码,但它可以生成更任意的输出。安全性并不是我真正关心的,尽管默默无闻才是。
出于澄清目的进行编辑:
- 算法必须是对称的,以便我可以在没有密钥对的情况下反转操作。
- 该算法必须是双射的,每个 32 位输入数字必须生成一个 32 位唯一数字。
- 函数的输出必须足够模糊,仅向输入添加一个就会对输出产生很大的影响。
预期结果示例:
F(100) = 98456
F(101) = -758
F(102) = 10875498
F(103) = 986541
F(104) = 945451245
F(105) = -488554
就像 MD5 一样,改变一件事可能会改变很多事情。
我正在寻找一个数学函数,因此手动映射整数对我来说不是一个解决方案。对于那些询问的人来说,算法速度并不是很重要。
I need an algorithm that can do a one-to-one mapping (ie. no collision) of a 32-bit signed integer onto another 32-bit signed integer.
My real concern is enough entropy so that the output of the function appears to be random. Basically I am looking for a cipher similar to XOR Cipher but that can generate more arbitrary-looking outputs. Security is not my real concern, although obscurity is.
Edit for clarification purpose:
- The algorithm must be symmetric, so that I can reverse the operation without a keypair.
- The algorithm must be bijective, every 32-bit input number must generate a 32-bit unique number.
- The output of the function must be obscure enough, adding only one to the input should result big effect on the output.
Example expected result:
F(100) = 98456
F(101) = -758
F(102) = 10875498
F(103) = 986541
F(104) = 945451245
F(105) = -488554
Just like MD5, changing one thing may change lots of things.
I am looking for a mathematical function, so manually mapping integers is not a solution for me. For those who are asking, algorithm speed is not very important.
发布评论
评论(11)
使用任何 32 位分组密码!根据定义,分组密码以可逆的方式将其范围内的每个可能的输入值映射到唯一的输出值,并且根据设计,在没有密钥的情况下很难确定任何给定值将映射到什么。只需选择一个密钥,如果安全性或模糊性很重要,请将其保密,然后使用密码作为转换。
要将这个想法扩展到非 2 次方范围,请参阅我的帖子 使用分组密码进行安全排列。
解决您的具体问题:
Use any 32-bit block cipher! By definition, a block cipher maps every possible input value in its range to a unique output value, in a reversible fashion, and by design, it's difficult to determine what any given value will map to without the key. Simply pick a key, keep it secret if security or obscurity is important, and use the cipher as your transformation.
For an extension of this idea to non-power-of-2 ranges, see my post on Secure Permutations with Block Ciphers.
Addressing your specific concerns:
下面的论文为您提供了 4 或 5 个映射示例,为您提供了函数而不是构建映射集: www.cs.auckland.ac.nz/~john-rugis/pdf/BijectiveMapping.pdf
The following paper gives you 4 or 5 mapping examples, giving you functions rather than building mapped sets: www.cs.auckland.ac.nz/~john-rugis/pdf/BijectiveMapping.pdf
如果您的目标只是获得粗略大小的数字的看似随机的排列,那么还有另一种可能的方法:将数字集减少为素数。
形式的映射
然后您可以使用f(i) = (i * a + b) % p
,如果 p 确实是素数,则这将是所有 a != 0 和所有 b 的双射。对于较大的 a 和 b 来说,它看起来相当随机。
例如,在我偶然发现这个问题的情况下,我使用 1073741789 作为小于 1 << 的数字范围的素数。 30. 这使我只丢失 35 个数字,这对我来说没问题。
我的编码是 then
,解码是
注意 507371178 * 233233408 % 1073741789 == 1,所以这两个数字是模 1073741789 的数字域的逆数(您可以使用扩展欧几里德算法计算出这些字段中的逆数)。
我相当随意地选择了 a 和 b,我只是确保它们大约是 p 大小的一半。
If your goal is simply to get a seemingly random permutation of numbers of a roughly defined size, then there is another possible way: reduce the set of numbers to a prime number.
Then you can use a mapping of the form
f(i) = (i * a + b) % p
and if p is indeed a prime, this will be a bijection for all a != 0 and all b. It will look fairly random for larger a and b.
For example, in my case for which I stumbled on this question, I used 1073741789 as a prime for the range of numbers smaller than 1 << 30. That makes me lose only 35 numbers, which is fine in my case.
My encoding is then
and the decoding is
Note that 507371178 * 233233408 % 1073741789 == 1, so those two numbers are inverse the field of numbers modulo 1073741789 (you can figure out inverse numbers in such fields with the extended euclidean algorithm).
I chose a and b fairly arbitrarily, I merely made sure they are roughly half the size of p.
我将尝试通过一个更简单的示例来解释我的解决方案,然后可以轻松地将其扩展到您的大型示例。
假设我有一个 4 位数字。有 16 个不同的值。将其视为一个四维立方体:
(来源:ams.org)
。
每个顶点代表这些数字之一,每一位代表一个维度。因此它基本上是 XYZW,其中每个维度只能具有值 0 或 1。现在假设您使用不同顺序的维度。例如 XZYW。现在每个顶点的数量都改变了!
您可以对任意数量的维度执行此操作,只需排列这些维度即可。如果您不关心安全性,那么这对您来说可能是一个不错的快速解决方案。另一方面,我不知道输出是否足够“模糊”以满足您的需求,当然在完成大量映射之后,映射可以反转(这可能是优点或缺点,具体取决于您的需求。)
I will try to explain my solution to this on a much simpler example, which then can be easily extended for your large one.
Say i have a 4 bit number. There are 16 distinct values. Look at it as if it was a four dimensional cube:
(source: ams.org)
.
Every vertex represents one of those numbers, every bit represents one dimension. So its basicaly XYZW, where each of the dimensions can have only values 0 or 1. Now imagine you use a different order of dimensions. For example XZYW. Each of the vertices now changed its number!
You can do this for any number of dimensions, just permute those dimensions. If security is not your concern this could be a nice fast solution for you. On the other hand, i dont know if the output will be "obscure" enough for your needs and certainly after a large amount of mapping done, the mapping can be reversed (which may be an advantage or disadvantage, depending on your needs.)
除了生成随机查找表之外,您还可以使用函数组合:
Apart from generating random lookup-tables, you can use a combination of functions:
您可以使用随机生成的查找表吗?只要表中的随机数是唯一的,您就可以获得双射映射。但它不是对称的。
针对所有 32 位值使用一个 16 GB 查找表可能不切实际,但您可以对高位字和低位字使用两个单独的 16 位查找表。
PS:我认为你可以生成一个对称双射查找表,如果这很重要的话。该算法将从一个空的 LUT 开始:
选择第一个元素,为其分配随机映射。要使映射对称,也分配逆数:
选择下一个数字,再次分配随机映射,但选择尚未分配的数字。 (即在本例中,不要选择 1 或 3)。重复此操作直至 LUT 完成。这应该生成随机双射对称映射。
Can you use a random generated lookup-table? As long as the random numbers in the table are unique, you get a bijective mapping. It's not symmetric, though.
One 16 GB lookup-table for all 32 bit values is probably not practical, but you could use two separate 16-bit lookup tables for the high-word and the low word.
PS: I think you can generate a symmetric bijective lookup table, if that's important. The algorithm would start with an empty LUT:
Pick the first element, assign it a random mapping. To make the mapping symmetric, assign the inverse, too:
Pick the next number, again assign a random mapping, but pick a number that's not been assigned yet. (i.e. in this case, don't pick 1 or 3). Repeat until the LUT is complete. This should generate a random bijective symmetric mapping.
取一个数,乘以9,取反数,除以9。
应该足够晦涩了!
编辑:它不是 0 结尾整数的双射
您可以随时添加特定规则,例如:
取一个数,除以 10 x 倍,乘以 9,逆数字,除以 9,乘以 10^x。
所以
它有效!
编辑2:为了更加晦涩,您可以添加任意数字,并在最后减去。
Take a number, multiplies by 9, inverse digits, divide by 9.
Should be obscure enough !!
Edit : it is not a bijection for 0 ending integer
You can always add a specific rule like :
Take a number, divide by 10 x times, multiplies by 9, inverse digits, divide by 9, multiples by 10^x.
And so
W00t it works !
Edit 2 : For more obscurness, you can add an arbitrary number, and substract at the end.
这是我的简单想法:
正如 PeterK 提出的那样,您可以移动数字的位,但您可以为每个数字进行不同的位排列,并且仍然能够破译它。
密码是这样的:
将输入数字视为位数组
I[0..31]
,将输出视为O[0..31]
。准备一个由 64 个随机生成的数字组成的数组
K[0..63]
。这将是你的钥匙。从第一个随机数 (
I[K[0] mod 32]
) 确定的位置取出输入数字的位,并将其放在结果的开头 (O[0]< /代码>)。现在要决定将哪个位放置在
O[1]
处,请使用之前使用过的位。如果为0,则使用K[1]生成I
中要取的位置,如果为1,则使用K[2](简单来说就是跳过一个随机数)。现在这效果不太好,因为你可能会两次使用相同的位。为了避免这种情况,请在每次迭代后重新对位进行编号,并忽略已使用的位。要生成获取
O[1]
的位置,请使用I[K[p] mod 31]
,其中 p 为 1 或 2,具体取决于位O[0]
,因为还剩下 31 位,编号从 0 到 30。为了说明这一点,我举一个例子:
我们有一个 4 位数字,和 8 个随机数:25, 5 , 28, 19, 14, 20, 0, 18.
25 mod 4 = 1,所以我们将取位置为 1 的位(从 0 开始计数)
我们刚刚取了一位值为 1 的位,所以我们随机跳过一个数字并使用 28。还剩下 3 位,因此为了计算位置,我们取 28 mod 3 = 1。我们取剩余位的第一个(从 0 开始计数):
我们再次跳过一个数字,并取 14。14 mod 2 = 0,所以我们取第 0 位:
现在没关系,但前一位是 0,所以我们取 20。 20 mod 1 = 0:
就是这样。
破译这样的数字很容易,只需做同样的事情即可。放置代码第一位的位置从密钥中得知,接下来的位置由先前插入的位确定。
显然,这具有仅移动位的所有缺点(例如 0 变为 0,MAXINT 变为 MAXINT),但似乎很难找到某人如何在不知道密钥的情况下加密该数字,而密钥必须是秘密的。
Here is my simple idea:
You can move around the bits of the number, as PeterK proposed, but you can have a different permutation of bits for each number, and still be able to decipher it.
The cipher goes like this:
Treat the input number as an array of bits
I[0..31]
, and the output asO[0..31]
.Prepare an array
K[0..63]
of 64 randomly generated numbers. This will be your key.Take the bit of input number from position determined by the first random number (
I[K[0] mod 32]
) and place it at the beginning of your result (O[0]
). Now to decide which bit to place atO[1]
, use the previously used bit. If it is 0, use K[1] to generate position inI
from which to take, it it is 1, use K[2] (which simply means skip one random number).Now this will not work well, as you may take the same bit twice. In order to avoid it, renumber the bits after each iteration, omitting the used bits. To generate the position from which to take
O[1]
useI[K[p] mod 31]
, where p is 1 or 2, depending on the bitO[0]
, as there are 31 bits left, numbered from 0 to 30.To illustrate this, I'll give an example:
We have a 4-bit number, and 8 random numbers: 25, 5, 28, 19, 14, 20, 0, 18.
25 mod 4 = 1, so we'll take bit whose position is 1 (counting from 0)
We've just taken a bit of value 1, so we skip one random number and use 28. There are 3 bits left, so to count position we take 28 mod 3 = 1. We take the first (counting from 0) of the remaining bits:
Again we skip one number, and take 14. 14 mod 2 = 0, so we take the 0th bit:
Now it doesn't matter, but the previous bit was 0, so we take 20. 20 mod 1 = 0:
And this is it.
Deciphering such a number is easy, one just has to do the same things. The position at which to place the first bit of the code is known from the key, the next positions are determined by the previously inserted bits.
This obviously has all the disadvantages of anything which just moves the bits around (for example 0 becomes 0, and MAXINT becomes MAXINT), but is seems harder to find how someone has encrypted the number without knowing the key, which has to be secret.
如果您不想使用正确的加密算法(可能出于性能和复杂性原因),您可以使用更简单的密码,例如 维吉尼亚密码。该密码实际上被描述为le chiffre indéchiffrable(法语,意为“牢不可破的密码”)。
下面是一个简单的 C# 实现,它根据相应的键值来移动值:
当输入稍有变化时,该算法不会在输出中产生较大的变化。但是,您可以使用另一个双射运算而不是加法来实现此目的。
If you don't want to use proper cryptographic algorithms (perhaps for performance and complexity reasons) you can instead use a simpler cipher like the Vigenère cipher. This cipher was actually described as le chiffre indéchiffrable (French for 'the unbreakable cipher').
Here is a simple C# implementation that shifts values based on a corresponding key value:
This algorithm does not create a big shift in the output when the input is changed slightly. However, you can use another bijective operation instead of addition to achieve that.
在一张大纸上画一个大圆圈。从圆顶部顺时针方向,等间距地写入从 0 到 MAXINT 的所有整数。逆时针写入从 0 到 MININT 的所有整数,再次等距。请注意,MININT 位于圆圈底部的 MAXINT 旁边。现在在一张硬卡片的两面复制这个图形。将硬卡通过两者的中心固定在圆圈上。选择一个旋转角度,任何你喜欢的角度。现在您有了一个 1-1 映射,它可以满足您的一些要求,但可能还不够模糊。松开卡,将其围绕一个直径(任何直径)翻转。重复这些步骤(以任何顺序),直到获得满意的双射。
如果您一直密切关注,那么用您喜欢的语言对其进行编程应该不难。
澄清以下评论:如果您仅将卡片旋转到纸张上,那么方法就像您抱怨的一样简单。但是,当您翻转卡片时,映射并不等于任何
m
的(x+m) mod MAXINT
。例如,如果您不旋转卡片并将其围绕直径翻转到 0(位于钟面顶部),则 1 映射到 -1,2 映射到 -2,依此类推。(x+m) mod MAXINT
仅对应于卡片的旋转。Draw a large circle on a large sheet of paper. Write all the integers from 0 to MAXINT clockwise from the top of the circle, equally spaced. Write all the integers from 0 to MININT anti-clockwise, equally spaced again. Observe that MININT is next to MAXINT at the bottom of the circle. Now make a duplicate of this figure on both sides of a piece of stiff card. Pin the stiff card to the circle through the centres of both. Pick an angle of rotation, any angle you like. Now you have a 1-1 mapping which meets some of your requirements, but is probably not obscure enough. Unpin the card, flip it around a diameter, any diameter. Repeat these steps (in any order) until you have a bijection you are happy with.
If you have been following closely it shouldn't be difficult to program this in your preferred language.
For Clarification following the comment: If you only rotate the card against the paper then the method is as simple as you complain. However, when you flip the card over the mapping is not equivalent to
(x+m) mod MAXINT
for anym
. For example, if you leave the card unrotated and flip it around the diameter through 0 (which is at the top of the clock face) then 1 is mapped to -1, 2 to -2, and so forth.(x+m) mod MAXINT
corresponds to rotations of the card only.将数字一分为二(16 个最高有效位和 16 个最低有效位),并将两个 16 位结果中的位视为两副牌中的牌。混合套牌,将一张套牌压入另一套牌中。
因此,如果您的初始数字是
b31,b30,...,b1,b0
您最终会得到b15,b31,b14,b30,...,b1,b17,b0,b16
。它实施起来很快而且很快,反之亦然。如果您查看结果的十进制表示形式,该系列看起来相当晦涩难懂。
您可以手动映射 0 ->最大值和最大值-> 0 以避免它们映射到自身。
Split the number in two (16 most significant bits and 16 least significant bits) and consider the bits in the two 16-bit results as cards in two decks. Mix the decks forcing one into the other.
So if your initial number is
b31,b30,...,b1,b0
you end up withb15,b31,b14,b30,...,b1,b17,b0,b16
. It's fast and quick to implement, as is the inverse.If you look at the decimal representation of the results, the series looks pretty obscure.
You can manually map 0 -> maxvalue and maxvalue -> 0 to avoid them mapping onto themselves.