解压缩文件会导致“BadZipFile:文件不是 zip 文件”
我有两个 zip 文件,它们都可以用 Windows 资源管理器和 7-zip 很好地打开。
但是,当我使用 Python 的 zipfile 模块 [ zipfile.ZipFile("filex.zip") ] 打开它们时,其中一个被打开,但另一个给出错误“BadZipfile:文件不是 zip 文件
” 。
我通过使用 7-Zip 打开并查看其属性(如 7Zip.ZIP 所示)来确保后一个文件是有效的 Zip 文件。当我用文本编辑器打开该文件时,前两个字符是“PK”,表明它确实是一个zip文件。
我正在使用 Python 2.5,并且真的不知道如何进行此操作。我在 Windows 和 Ubuntu 上都进行了尝试,两个平台上都存在问题。
更新: Windows 上 Python 2.5.4 的回溯:
Traceback (most recent call last):
File "<module1>", line 5, in <module>
zipfile.ZipFile("c:/temp/test.zip")
File "C:\Python25\lib\zipfile.py", line 346, in init
self._GetContents()
File "C:\Python25\lib\zipfile.py", line 366, in _GetContents
self._RealGetContents()
File "C:\Python25\lib\zipfile.py", line 378, in _RealGetContents
raise BadZipfile, "File is not a zip file"
BadZipfile: File is not a zip file
基本上,当调用 _EndRecData
函数从“中央目录末尾”记录获取数据时,注释长度检出失败 [ endrec [7] == len(comment) ]。
_EndRecData
函数中的局部变量值如下:
END_BLOCK: 4096,
comment: '\x00',
data: '\xd6\xf6\x03\x00\x88,N8?<e\xf0q\xa8\x1cwK\x87\x0c(\x82a\xee\xc61N\'1qN\x0b\x16K-\x9d\xd57w\x0f\xa31n\xf3dN\x9e\xb1s\xffu\xd1\.....', (truncated)
endrec: ['PK\x05\x06', 0, 0, 4, 4, 268, 199515, 0],
filesize: 199806L,
fpin: <open file 'c:/temp/test.zip', mode 'rb' at 0x045D4F98>,
start: 4073
I have two zip files, both of them open well with Windows Explorer and 7-zip.
However when i open them with Python's zipfile module [ zipfile.ZipFile("filex.zip") ], one of them gets opened but the other one gives error "BadZipfile: File is not a zip file
".
I've made sure that the latter one is a valid Zip File by opening it with 7-Zip and looking at its properties (says 7Zip.ZIP). When I open the file with a text editor, the first two characters are "PK", showing that it is indeed a zip file.
I'm using Python 2.5 and really don't have any clue how to go about for this. I've tried it both with Windows as well as Ubuntu and problem exists on both platforms.
Update: Traceback from Python 2.5.4 on Windows:
Traceback (most recent call last):
File "<module1>", line 5, in <module>
zipfile.ZipFile("c:/temp/test.zip")
File "C:\Python25\lib\zipfile.py", line 346, in init
self._GetContents()
File "C:\Python25\lib\zipfile.py", line 366, in _GetContents
self._RealGetContents()
File "C:\Python25\lib\zipfile.py", line 378, in _RealGetContents
raise BadZipfile, "File is not a zip file"
BadZipfile: File is not a zip file
Basically when the _EndRecData
function is called for getting data from End of Central Directory" record, the comment length checkout fails [ endrec[7] == len(comment) ].
The values of locals in the _EndRecData
function are as following:
END_BLOCK: 4096,
comment: '\x00',
data: '\xd6\xf6\x03\x00\x88,N8?<e\xf0q\xa8\x1cwK\x87\x0c(\x82a\xee\xc61N\'1qN\x0b\x16K-\x9d\xd57w\x0f\xa31n\xf3dN\x9e\xb1s\xffu\xd1\.....', (truncated)
endrec: ['PK\x05\x06', 0, 0, 4, 4, 268, 199515, 0],
filesize: 199806L,
fpin: <open file 'c:/temp/test.zip', mode 'rb' at 0x045D4F98>,
start: 4073
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(16)
名为 file 的文件可能会让 python 感到困惑 - 尝试将其命名为其他名称。如果它仍然不起作用,请尝试以下代码:
files named file can confuse python - try naming it something else. if it STILL wont work, try this code:
我遇到了同样的问题。我的问题是它是 gzip 文件而不是 zip 文件。我切换到类
gzip.GzipFile
,它的工作就像一个魅力。I run into the same issue. My problem was that it was a gzip instead of a zip file. I switched to the class
gzip.GzipFile
and it worked like a charm.astronautlevel 的解决方案适用于大多数情况,但 Zip 中的压缩数据和 CRC 也可以包含相同的 4 个字节。您应该执行
rfind
(而不是find
),查找 pos+20,然后将 write\x00\x00
添加到文件末尾(告诉 zip 应用程序“注释”部分的长度为 0 字节)。astronautlevel's solution works for most cases, but the compressed data and CRCs in the Zip can also contain the same 4 bytes. You should do an
rfind
(notfind
), seek to pos+20 and then add write\x00\x00
to the end of the file (tell zip applications that the length of the 'comments' section is 0 bytes long).我遇到了同样的问题,并且能够解决我的文件的这个问题,请参阅我的答案
zipfile 无法处理某种类型的 zip 数据?
I had the same problem and was able to solve this issue for my files, see my answer at
zipfile cant handle some type of zip data?
我对 python 很陌生,我面临着完全相同的问题,以前的方法都不起作用。
在解压缩之前尝试打印“损坏的”文件会返回一个空字节对象。
事实证明,我试图在将文件写入磁盘后立即解压缩该文件,而不关闭文件处理程序。
关闭文件流然后解压缩文件解决了我的问题。
I'm very new at python and i was facing the exact same issue, none of the previous methods were working.
Trying to print the 'corrupted' file just before unzipping it returned an empty byte object.
Turned out, I was trying to unzip the file right after writing it to disk, without closing the file handler.
Closing the file stream then unzipping the file resolved my issue.
有时,某些 zip 文件包含损坏的文件,解压缩 zip 时会出现 badzipfile 错误。但是有像 7zip winrar 这样的工具可以忽略这些错误并成功解压缩 zip 文件。您可以创建一个子进程并使用此代码来解压缩 zip 文件,而不会出现 BadZipFile 错误。
Sometime there are zip file which contain corrupted files and upon unzipping the zip gives badzipfile error. but there are tools like 7zip winrar which ignores these errors and successfully unzip the zip file. you can create a sub process and use this code to unzip your zip file without getting BadZipFile Error.
我遇到了这个问题,正在寻找一个好的、干净的解决方案;但直到我找到这个答案之前一直没有解决方案。我遇到了与@marsl(答案中)相同的问题。在我的例子中,它是一个 gzip 文件而不是 zip 文件。
我可以用这种方法取消归档并解压缩我的 gzip 文件:
I faced this problem and was looking for a good and clean solution; But there was no solution until I found this answer. I had the same problem that @marsl (among the answers) had. It was a gzipfile instead of a zipfile in my case.
I could unarchive and decompress my gzipfile with this approach:
显示从 Python 获得的完整回溯——这可能会提示具体问题是什么。 未回答:什么软件产生了错误文件,在什么平台上?
更新:回溯表明检测文件中的“中央目录结束”记录时出现问题 - 请参阅从 C:\Python25\Lib\zipfile.py 第 128 行开始的函数 _EndRecData
建议:
(1)通过上述函数进行追踪
(2)在最新的Python上尝试一下
(3)回答上述问题。
(4) 阅读此 以及
google("BadZipfile: File is not a zip file")
发现的任何其他似乎相关的内容Show the full traceback that you got from Python -- this may give a hint as to what the specific problem is. Unanswered: What software produced the bad file, and on what platform?
Update: Traceback indicates having problem detecting the "End of Central Directory" record in the file -- see function _EndRecData starting at line 128 of C:\Python25\Lib\zipfile.py
Suggestions:
(1) Trace through the above function
(2) Try it on the latest Python
(3) Answer the question above.
(4) Read this and anything else found by
google("BadZipfile: File is not a zip file")
that appears to be relevant您是否尝试过更新的 python,或者如果这太麻烦,只需使用更新的 zipfile.py ?我已成功使用 Python 2.6.2(当时最新)中的 zipfile.py 副本与 Python 2.5 来打开 Py2.5s zipfile 模块不支持的一些 zip 文件。
Have you tried a newer python, or if that is too much trouble, simply a newer zipfile.py? I have successfully used a copy of zipfile.py from Python 2.6.2 (latest at the time) with Python 2.5 in order to open some zip files that weren't supported by Py2.5s zipfile module.
在某些情况下,您必须确认 zip 文件是否实际上是 gzip 格式。我就是这种情况,我通过以下方法解决了这个问题:
In some cases, you have to confirm if the zip file is actually in gzip format. this was the case for me and i solved it by :
为此,我认为这是在文件未完全下载时发生的。所以我只是在下载代码中删除它。
您可以将我的代码与 pip install Ultimate-utils 一起使用以获得最新版本。
for this this happened when the file wasn't downloaded fully I think. So I just delete it in my download code.
you can use my code with pip install ultimate-utils for the most up to date version.
在另一种情况下,当 ml/dl 模型具有不同格式时,会出现此警告。
举个例子:
你想打开pickle,但模型格式是.sav
解决方案:
您需要将格式更改为原始格式
泡菜 --> .pkl
张量流--> .h5
ETC。
In the other case, this warning showing up when the ml/dl model has different format.
For the example:
you want to open pickle, but the model format is .sav
Solution:
you need to change the format to original format
pickle --> .pkl
tensorflow --> .h5
etc.
就我而言,该目录中缺少 zip 文件本身 - 因此,当我尝试解压缩它时,我收到错误
“BadZipFile:文件不是 zip 文件”
。我将 .zip 文件移至该目录后问题得到解决。在运行 python 脚本之前,请确认该文件确实存在于您的目录中。In my case, the zip file itself was missing from that directory - thus when I tried to unzip it, I got the error
"BadZipFile: File is not a zip file"
. It got resolved after I moved the .zip file to the directory. Please confirm that the file is indeed present in your directory before running the python script.就我而言,zip 文件刚刚损坏。使用 NanaZip 或 7zip 解压缩它会出现错误消息,例如“zip 文件已损坏”
In my case, the zip file is just broken. Unzip it with NanaZip or 7zip gives me error message like "the zip file is broken"
当我尝试从驱动器解压缩文件时,我也遇到了类似的问题。使用在线文件压缩网站来压缩您的文件,这不会破坏文件,并且不会为我引发错误。
I also faced a similar problem when I tried to unzip my file from the drive. Use the online file zipping websites to zip your file which does not break the file and the error doesn't raise for me.
就我而言,zip 文件已损坏。我尝试使用 urllib.request.urlretrieve 下载 zip 文件,但由于某种原因该文件无法完全下载。
我连接到 VPN,文件下载得很好,并且我能够打开该文件。
In my case, the zip file was corrupted. I was trying to download the zip file with
urllib.request.urlretrieve
but the file wouldn't completely download for some reason.I connected to a VPN, the file downloaded just fine, and I was able to open the file.