re.compile(pattern, file) 调用导致系统崩溃

发布于 2024-11-08 18:49:48 字数 381 浏览 0 评论 0原文

我有一个需要解析的文件。解析是增量构建的，以便在每次迭代时表达式变得更加具体。

使系统超载的代码段大致如下所示：

    for item in ret:
        pat = r'a\sstyle=".+class="VEAPI_Pushpin"\sid="msftve(.+?)".+>%s<'%item[1]
        r=re.compile(pat, re.DOTALL)
        match = r.findall(f)

该文件是一个相当大的 HTML 文件（从 bing 地图解析），每个答案必须与其确切的 id 匹配。

在应用此更改之前，工作流程非常好。我可以做些什么来避免这种情况吗？或者优化代码？

原文

I have a file I need to parse. The parsing is built incrementally, such that on each iteration the expressions becomes more case specific.

The code segment which overloads the system looks roughly like this:

    for item in ret:
        pat = r'a\sstyle=".+class="VEAPI_Pushpin"\sid="msftve(.+?)".+>%s<'%item[1]
        r=re.compile(pat, re.DOTALL)
        match = r.findall(f)

The file is a rather large HTML file (parsed from bing maps), and each answer must match its exact id.

Before appying this change the workflow was very good. Is there anything I can do to avoid this? Or to optimize the code?

分享到QQ

分享到微博