密码分析：两个明文文件的异或

发布于 2024-11-01 15:36:50 字数 404 浏览 1 评论 0原文

我有一个文件，其中包含两个异或纯文本文件的结果。如何攻击该文件以解密任一明文文件？我搜索了很多，但找不到任何答案。谢谢！

编辑：

嗯，我还有两个密文，我对它们进行异或以获得两个明文的异或。我之所以问这个问题，是因为据布鲁斯·施奈尔（Bruce Schneier）说。 198，应用密码学，1996 “...她可以将它们异或在一起，并得到两个彼此异或的明文消息。这很容易破解，然后她可以将其中一个明文与密文进行异或以获得密钥流。” （这与简单的流密码有关）但除此之外，他没有提供任何解释。这就是我在这里问的原因。原谅我的无知。

另外，使用的算法很简单，并且使用长度为 3 的对称密钥。

进一步编辑：

我忘记添加：我假设使用简单的流密码进行加密。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

他不在意 2024-11-08 15:36:50

我不是密码分析师，但如果您对文件的特征有所了解，您可能就有机会。

例如，假设您知道两个原始明文：

包含纯 ASCII 英语文本
都是有关体育（或其他）的文章

给定这两条信息，您可能采取的一种方法是使用以下单词扫描密文“解密”：您可能期望其中包含“足球”、“球员”、“得分”等。在密文的位置 0、位置 1、位置 2 等处使用“足球”执行解密。

如果解密字节序列的结果看起来是一个单词或单词片段，那么您很有可能从这两个文件中找到了明文。这可能会给你一些周围明文的线索，你可以看看这是否会导致合理的解密。等等。

使用您可能期望出现在明文中的其他单词/短语/片段重复此过程。

回应你的问题的编辑：施奈尔所说的是，如果有人有 2 个使用相同密钥进行异或加密的密文，对这些密文进行异或将“取消”密钥流，因为：

(A ^ k) - ciphertext of A
(B ^ k) - ciphertext of B

(A ^ k) ^ (B ^ k) - the two ciphertexts XOR'ed together which simplifies to:

A ^ B ^ k ^ k - which continues to simplify to
A ^ B ^ 0
A ^ B

所以现在，攻击者有了一个新的仅由两个明文组成的密文。如果攻击者知道其中一个明文（假设攻击者可以合法访问 A，但不能合法访问 B），则可用于恢复另一个明文：

A ^ (A ^ B)
(A ^ A) ^ B
0 ^ B
B

现在攻击者拥有 B 的明文

。实际上比这更糟糕 - 如果攻击者拥有 A 和 A 的密文，那么他就可以恢复密钥流。

但是，我上面给出的猜测方法是上述方法的变体，攻击者使用（希望是好的）猜测而不是已知的明文。显然这并不那么容易，但它是相同的概念，并且不需要从已知的明文开始就可以完成。现在，攻击者有了一个密文，当他正确猜出某些明文时，该密文会“告诉”他（因为解密后会产生其他明文）。因此，即使原始 XOR 运算中使用的密钥是随机乱码，攻击者在进行有根据的猜测时也可以使用已“删除”随机乱码的文件来获取信息。

I'm no cryptanalyst, but if you know something about the characteristics of the files you might have a chance.

For example, lets assume that you know that both original plaintexts:

contain plain ASCII English text
are articles about sports (or whatever)

Given those 2 pieces of information, one approach you might take is to scan through the ciphertext 'decrypting' using words that you might expect to be in them, such as "football", "player", "score", etc. Perform the decryption using "football" at position 0 of the ciphertext, then at position 1, then 2 and so on.

If the result of decrypting a sequence of bytes appears to be a word or word fragment, then you have a good chance that you've found plaintext from both files. That may give you a clue as to some surrounding plaintext, and you can see if that results in a sensible decryption. And so on.

Repeat this process with other words/phrases/fragments that you might expect to be in the plaintexts.

In response to your question's edit: what Schneier is talking about is that if someone has 2 ciphertexts that have been XOR encrypted using the same key, XORing those ciphertexts will 'cancel out' the keystream, since:

(A ^ k) - ciphertext of A
(B ^ k) - ciphertext of B

(A ^ k) ^ (B ^ k) - the two ciphertexts XOR'ed together which simplifies to:

A ^ B ^ k ^ k - which continues to simplify to
A ^ B ^ 0
A ^ B

So now, the attacker has a new ciphertext that's composed only of the two plaintexts. If the attacker knows one of the plaintexts (say the attacker has legitimate access to A, but not B), that can be used to recover the other plaintext:

A ^ (A ^ B)
(A ^ A) ^ B
0 ^ B
B

Now the attacker has the plaintext for B.

It's actually worse than this - if the attacker has A and the ciphertext for A then he can recover the keystream already.

But, the guessing approach I gave above is a variant of the above with the attacker using (hopefully good) guesses instead of a known plaintext. Obviously it's not as easy, but it's the same concept, and it can be done without starting with known plaintext. Now the attacker has a ciphertext that 'tells' him when he's correctly guessed some plaintext (because it results in other plaintext from the decryption). So even if the key used in the original XOR operation is random gibberish, an attacker can use the file that has that random gibberish 'removed' to gain information when he's making educated guesses.

回复收藏 0 原文

呆头 2024-11-08 15:36:50

您需要利用这两个文件都是纯文本的事实。从这个事实可以得出很多含义。假设两个文本都是英文文本，您可以使用某些字母比其他字母更受欢迎的事实。请参阅本文。

另一个提示是注意正确英文文本的结构。例如，每当一个语句结束，下一个语句开始时，就会出现一个（点、空格、大写字母）序列。

请注意，在 ASCII 代码中，空格是二进制“0010 0000”，更改字母中的该位将更改字母大小写（从小写到大写，反之亦然）。如果两个文件都是纯文本，将会有大量使用空间的异或运算，对吧？
在此页面上分析可打印字符表。

另外，最后您可以使用拼写检查器。

我知道我没有为你的问题提供解决方案。
我只是给了你一些提示。玩得开心，请分享您的发现。
这确实是一项有趣的任务。

回复收藏 0 原文

忆梦 2024-11-08 15:36:50

这很有趣。施奈尔的书确实说打破这一点很容易。然后他就把这件事搁置了。我想你必须给读者留下一些练习！

Dawson 和 Nielson 发表了一篇文章，显然描述了一种自动化的文本文件的此任务的流程。购买单篇文章有点贵。然而，第二篇论文的标题为自动密码分析的自然语言方法
Two-time Pads 引用了 Dawson 和 Nielsen 的工作并描述了他们所做的一些假设（主要是文本限制为 27 个字符）。但第二篇论文似乎是免费提供的，并描述了他们自己的系统。我不确定它是否免费，但它在约翰霍普金斯大学的服务器上公开可用。

那篇论文大约有 10 页长，看起来很有趣。我现在没有时间阅读它，但稍后可能会。我觉得很有趣（并且很能说明问题）的是，需要一篇 10 页的论文来描述另一位密码学家描述为“简单”的任务。

回复收藏 0 原文