How to search/replace text in WordprocessingML

Posted on 2024-11-04 10:30:29


In WordprocessingML (the format MS Word documents are saved in), is there any way to search through the text easily?

The main problem I run into is that the WordprocessingML format breaks each paragraph down into "runs".

For example, to store the sentence "Module 1: Some Section Title", WordprocessingML specifies the XML markup to be:

  <w:p w:rsidR="00F9529C" w:rsidRDefault="00F9529C" w:rsidP="00F9529C">
   <w:pPr>
    <w:pStyle w:val="Heading1_5019"/>
   </w:pPr>
   <w:bookmarkStart w:id="0" w:name="_Toc247333659"/>
   <w:r>
    <w:t>M</w:t>
   </w:r>
   <w:r w:rsidRPr="007D2739">
    <w:t xml:space="preserve">odule 1: </w:t>
   </w:r>
   <w:r>
    <w:t>Some Section Title</w:t>
   </w:r>
   <w:bookmarkEnd w:id="0"/>
  </w:p>

As you can see, the sentence was broken into "M", "odule 1: ", and "Some Section Title". This arrangement makes it impossible to search for the sentence as a whole with a plain string match on any single run. Is there any way to get around this?

To clarify, I am trying to do this in PHP using DOMDocument.


南巷近海 2024-11-11 10:30:29

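I've written some example code that shows how to search and replace text in an Open XML WordprocessingML document. My approach is this: once you have found a paragraph that contains the text you need to replace, break all the runs in that paragraph into runs of a single character each. It is then straightforward to find a contiguous set of runs that matches your search string. You can then create a new run containing the replacement text and delete the single-character runs that matched the search string. I've implemented this using the XML DOM (System.Xml.XmlDocument). You can find the example code in the blog post "Searching and Replacing Text in Open XML WordprocessingML Documents". I've also recorded a short screencast that shows how the algorithm works: http://www.youtube.com/watch?v=w128hJUu3GM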
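The steps above can be sketched roughly as follows. This is a minimal Python illustration using the standard-library `xml.dom.minidom`, not the C# code from the blog post. The helper names (`child_elements`, `run_text`, `make_run`, `split_runs`, `replace_in_paragraph`) are made up for this example. It assumes every text run holds only an optional `w:rPr` and `w:t` elements (no tabs, breaks, or fields), that the new run should take the formatting of the first matched character, and that only the first occurrence is replaced.

```python
from xml.dom import minidom

W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def child_elements(node, local_name):
    """Direct child elements of `node` named w:<local_name>."""
    return [c for c in node.childNodes
            if c.nodeType == c.ELEMENT_NODE
            and c.namespaceURI == W_NS and c.localName == local_name]


def run_text(run):
    """Concatenated text of all w:t children of a run."""
    return "".join(
        node.data
        for t in child_elements(run, "t")
        for node in t.childNodes
        if node.nodeType in (node.TEXT_NODE, node.CDATA_SECTION_NODE)
    )


def make_run(doc, template_run, text):
    """New w:r with a copy of template_run's w:rPr (if any) and `text`."""
    run = doc.createElementNS(W_NS, "w:r")
    for rpr in child_elements(template_run, "rPr"):
        run.appendChild(rpr.cloneNode(True))
    t = doc.createElementNS(W_NS, "w:t")
    t.setAttribute("xml:space", "preserve")  # keep leading/trailing spaces
    t.appendChild(doc.createTextNode(text))
    run.appendChild(t)
    return run


def split_runs(doc, paragraph):
    """Replace every text run in the paragraph with one run per character."""
    for run in child_elements(paragraph, "r"):
        text = run_text(run)
        if not text:
            continue  # leave runs without text untouched
        for ch in text:
            paragraph.insertBefore(make_run(doc, run, ch), run)
        paragraph.removeChild(run)


def replace_in_paragraph(doc, paragraph, search, replacement):
    """Replace the first occurrence of `search`; return True if it was found."""
    full_text = "".join(run_text(r) for r in child_elements(paragraph, "r"))
    if not search or search not in full_text:
        return False
    split_runs(doc, paragraph)
    # After splitting, character i of the paragraph text lives in runs[i].
    runs = [r for r in child_elements(paragraph, "r") if run_text(r)]
    start = "".join(run_text(r) for r in runs).find(search)
    matched = runs[start:start + len(search)]
    paragraph.insertBefore(make_run(doc, matched[0], replacement), matched[0])
    for r in matched:
        paragraph.removeChild(r)
    return True


# The paragraph from the question, reduced to its runs.
SAMPLE = (
    f'<w:p xmlns:w="{W_NS}">'
    '<w:r><w:t>M</w:t></w:r>'
    '<w:r w:rsidRPr="007D2739"><w:t xml:space="preserve">odule 1: </w:t></w:r>'
    '<w:r><w:t>Some Section Title</w:t></w:r>'
    '</w:p>'
)
doc = minidom.parseString(SAMPLE)
paragraph = doc.documentElement
found = replace_in_paragraph(doc, paragraph, "Module 1", "Unit 1")
new_text = "".join(run_text(r) for r in child_elements(paragraph, "r"))
print(found, new_text)
```

The sketch leaves the rest of the paragraph as single-character runs, which is still valid markup but verbose; merging adjacent runs with identical formatting is left out here. PHP's DOMDocument offers the same DOM operations (`createElementNS`, `insertBefore`, `removeChild`, `cloneNode`), so the same structure should carry over.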
