c# - 以编程方式将pdf转换为word

发布于 2024-08-18 08:29:06 字数 102 浏览 2 评论 0原文

有谁知道以编程方式将 PDF 文件转换为 word .doc 文件(不是 docx)的好解决方案?我已经尝试过 SautinSoft 的解决方案,但即使它能完成工作,但它的质量并不是最好的。

Does anyone know of a good solution for converting PDF files to a word .doc files (not docx) programmatically? I've tried SautinSoft's solution but even though it does the job, it's not the best quality.

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(4

面犯桃花 2024-08-25 08:29:06

我们提供了一个名为 EasyConverter SDK 的解决方案,您不妨尝试一下:

http:// www.pdfonline.com/easyconverter/sdk/index.htm

如果您想在尝试评估版本之前快速了解结果,您可以先使用此处的在线转换器:

http://www.pdfonline.com/pdf2word/index.asp

确实有很多考虑因素将 PDF 等大多数静态格式转换为 Word。 EasyConverter SDK 适用于大多数商业文档,而营销文档(通常使用更精美的布局)通常更具挑战性。

We offer a solution called EasyConverter SDK that you may wish to give a try:

http://www.pdfonline.com/easyconverter/sdk/index.htm

If you want to get a quick idea of what the results would look like before trying the evaluation version, you can use the online converter here first:

http://www.pdfonline.com/pdf2word/index.asp

There are indeed many considerations when converting a mostly static format like PDF to Word. EasyConverter SDK works nicely for most business documents while marketing documents (which typically utilize fancier layouts) are usually more challenging.

独留℉清风醉 2024-08-25 08:29:06

就像“解决方案”中一样,可能是一种实现方法,但您必须自己深入研究:

PDF 文件格式......很难理解。首先,它根本无法与Word格式相比。它的格式旨在在所有平台和打印机上产生一致的外观,其中 Word 稍微不那么严格。

首先,编辑 PDF 文件也相当困难:因为你没有像 Word 那样的“文本”;它更像是字母块。这些都是单独定位的。

我看到的唯一可行解决方案如下:

  1. 将 PDF 渲染为图像。 (因此需要 PDF 渲染库!)
  2. 将此图像附加到 .doc 中。 (因此需要一个 .DOC 编写库!)

我认为这也是 SautinSoft 正在做的事情;这就是它质量不好的原因。如果您想要良好的质量,图像可能会变得相当大(即您无法获得通用字体或重复图形等优化,就像 PDF 文件那样)。

As in "solution", a way to do it, probably, but you'd have to digg into this yourself:

The PDF file format is... quite hard to understand. First of all, it can't be compared to Word format at all. It's format is designed to produce a consistent look on all platforms and printers, Word therein, is a little less strict.

Editing PDF files, first, is quite hard too: because you don't have "text" like in Word; it's more like chunks of letters. These are all positioned individually.

The only doable solution I see is the following:

  1. Render the PDF to an image. (Thus requires a PDF rendering library!)
  2. Append this image into a .doc. (Thus requires a .DOC writing library!)

I think it's what SautinSoft is doing too; that's the reason of it's bad quality. Images can get quite huge if you want good quality (i.e. you can't get the optimization like generic fonts or repeating graphics, like you have with PDF files).

会发光的星星闪亮亮i 2024-08-25 08:29:06

将 PDF 转换为 SVG 并将 SVG 嵌入到 Word 文档中。

Convert the PDF to SVG and embed the SVG in the Word document.

ペ泪落弦音 2024-08-25 08:29:06

PDF 是一种“结束文件”显示格式,因此它会丢弃 Word 文件中需要的大量细节(例如流程)。那里有一些工具,但您不太可能对结果完全满意。

有一篇博客文章更好地解释了这些问题 http:// pdf.jpedal.org/java-pdf-blog/bid/12670/PDF-text

PDF is an 'endfile' display format so it throws away a lot of detail you would need in a word file (such as flow). There are tools out there but you are not likely to be totally happy with the results.

There is a blog post explaining the issues better at http://pdf.jpedal.org/java-pdf-blog/bid/12670/PDF-text

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文