Python - Trying to manipulate the bits of a file
I have very recently started to learn Python, and I chose to learn things by trying to solve a problem that I find interesting. This problem is to take a file (binary or not) and encrypt it using a simple method, something like replacing every "1001 0001" in it with a "0010 0101", and vice-versa.
However, I didn't find a way to do it. When reading the file, I can create an array in which each element contains one byte of data, with the read() method. But how can I replace this byte with another one, if it is one of the bytes I chose to replace, and then write the resulting information into the output encrypted file?
Thanks in advance!
Comments (4)
To swap the bytes 10010001 and 00100101:
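The code that accompanied this answer is not preserved here. As a minimal sketch of the byte swap, assuming Python 3 and files opened in binary mode (the names swap_bytes and encrypt_file are hypothetical, chosen for illustration):

```python
A = 0b10010001  # 0x91
B = 0b00100101  # 0x25


def swap_bytes(data: bytes) -> bytes:
    """Return a copy of data with every A byte replaced by B and vice versa."""
    swapped = {A: B, B: A}
    # Iterating over a bytes object yields integers in range(256).
    return bytes(swapped.get(value, value) for value in data)


def encrypt_file(src_path: str, dst_path: str) -> None:
    """Read src_path as raw bytes, swap A and B, and write the result to dst_path."""
    with open(src_path, "rb") as src:
        data = src.read()
    with open(dst_path, "wb") as dst:
        dst.write(swap_bytes(data))
```

Because the swap is its own inverse, running encrypt_file on the output restores the original file.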
read() returns an immutable string (in Python 3, a bytes object when the file is opened in binary mode), so you'll first need to convert it to a mutable sequence such as a list of characters. Then go through the list and change the bytes as needed, and finally join the list back into a new string to write to the output file.
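One way to follow this approach in Python 3 is to use a bytearray as the mutable copy, which plays the role of the "list of characters". This is a sketch under that assumption, not the answerer's own code; swap_in_place and its default arguments are hypothetical names:

```python
def swap_in_place(buffer: bytearray, a: int = 0x91, b: int = 0x25) -> None:
    """Swap every occurrence of byte value a with b (and b with a) inside buffer."""
    for index, value in enumerate(buffer):
        if value == a:
            buffer[index] = b
        elif value == b:
            buffer[index] = a


# Usage: copy the immutable result of read() into a bytearray, then mutate it.
data = bytearray(b"\x91 and \x25")
swap_in_place(data)
# bytes(data) can now be written to a file opened with mode "wb".
```

The length of the buffer never changes, so assigning to elements while iterating is safe here.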
Following Aaron's answer, once you have a string, you can also use translate or replace.
Docs: Python string.
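A short sketch of the translate approach, assuming Python 3, where the relevant methods are bytes.maketrans and bytes.translate (SWAP_TABLE and swap_with_translate are hypothetical names):

```python
# Build a 256-entry translation table that maps 0x91 -> 0x25 and 0x25 -> 0x91
# and leaves every other byte value unchanged.
SWAP_TABLE = bytes.maketrans(b"\x91\x25", b"\x25\x91")


def swap_with_translate(data: bytes) -> bytes:
    """Return data with the two byte values swapped, using a single translate call."""
    return data.translate(SWAP_TABLE)
```

Note that replace alone cannot swap two values in one pass: data.replace(a, b).replace(b, a) would turn every a and every b into a, so a swap with replace needs a temporary placeholder value that does not occur in the data.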
I'm sorry about this somewhat relevant wall of text -- I'm just in a teaching mood.
If you want to optimize such an operation, I suggest using NumPy. The advantage is that the entire translation is done with a single NumPy operation, and NumPy operations are implemented in C, so it is about as fast as you can get from Python.
In the example below I simply XOR every byte with 0b11111111 using a lookup table: the first element is the translation of 0b00000000, the second the translation of 0b00000001, the third the translation of 0b00000010, and so on. By altering the lookup table, you can do any kind of translation that does not change within the file.

To highlight the simplicity of it all I've done no argument checking. Invoke the script like this:
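The script and its invocation are not preserved in this copy of the answer. The following is a minimal sketch of the lookup-table idea, assuming NumPy is installed; the names xor_table, swap_table and translate are hypothetical. The second table shows how the same mechanism covers the swap of 0b10010001 and 0b00100101 asked about in the question.

```python
import numpy as np

# Entry i holds the translation of byte value i.
# Here every byte is XORed with 0b11111111 (all bits flipped).
xor_table = np.arange(256, dtype=np.uint8) ^ np.uint8(0b11111111)

# Identity table with only two entries exchanged.
swap_table = np.arange(256, dtype=np.uint8)
swap_table[0b10010001] = 0b00100101
swap_table[0b00100101] = 0b10010001


def translate(data: bytes, lookup_table: np.ndarray) -> bytes:
    """Translate every byte of data through lookup_table in one indexing operation."""
    values = np.frombuffer(data, dtype=np.uint8)
    # Indexing the table with an array of byte values looks up all bytes at once.
    return lookup_table[values].tobytes()
```

For whole files, np.fromfile(path, dtype=np.uint8) and the array method tofile(path) can replace the bytes round trip.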
To directly answer your question, if you want to swap 0b10010001 and 0b00100101, you replace the lookup_table = ... line with this:

Of course there is no lookup table encryption that isn't easily broken using frequency analysis. But as you may know, encryption using a one-time pad is unbreakable, as long as the pad is safe. This modified script encrypts or decrypts using a one-time pad (which you'll have to create yourself, store to a file, and somehow (there's the rub) securely transmit to the intended recipient of the message):
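The modified script itself is missing from this copy. As a rough illustration of the one-time pad idea only, here is a standard-library sketch rather than the NumPy version the answer describes; make_pad and xor_with_pad are hypothetical names. It assumes the pad is truly random, at least as long as the message, and never reused.

```python
import secrets


def make_pad(length: int) -> bytes:
    """Create a pad of cryptographically strong random bytes."""
    return secrets.token_bytes(length)


def xor_with_pad(message: bytes, pad: bytes) -> bytes:
    """XOR each message byte with the pad byte at the same position.

    The same call both encrypts and decrypts, because (m ^ p) ^ p == m.
    """
    if len(pad) < len(message):
        raise ValueError("pad must be at least as long as the message")
    return bytes(m ^ p for m, p in zip(message, pad))
```

The sender would write the pad and the ciphertext to separate files opened in binary mode, and the recipient would apply xor_with_pad to the ciphertext with the same pad.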
Example usage (linux):
Recipient then does:
Voilà! Fast and unbreakable encryption with three lines (plus two import lines) in Python.