Removing hyperlinks from a PDF document (iTextSharp)

Posted 2024-11-04 10:56:02

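I am trying to use iTextSharp (I am very new to the product) to remove hyperlinks from a PDF document. Does anyone know whether this is possible? I have been digging through the API and have not found an obvious way to do it.

My problem is that I am maintaining a system that embeds PDFs in an iframe, and the links inside the PDFs cause users to end up browsing a website within the iframe rather than in a new window or tab. I am therefore looking for a way to strip the links from the PDF at request time.

Thanks in advance, Scott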

Hello爱情风 2024-11-11 10:56:02

The links people click on are annotations in a given page's /Annots array.

You have two options:

Destroy the entire /Annots array
Search through the /Annots array and remove all the link annotations

Simply blasting the annotation array is easy:

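 // Assumes an open PdfReader `reader` and a PdfStamper `stamper` that writes the output.
 PdfDictionary pageDict = reader.getPageN(1); // pages are numbered from 1
 pageDict.remove(PdfName.ANNOTS);             // drops every annotation on the page

 stamper.close();                             // writes out the modified PDF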

The problem is that you might be destroying annotations that you want to keep along with those you don't.

The solution is to walk the /Annots array and keep only the entries that are not link annotations.

PdfDictionary pageDict = reader.getPageN(1); // repeat for every page you need to clean
PdfArray annots = pageDict.getAsArray(PdfName.ANNOTS);
if (annots != null) {
  PdfArray newAnnots = new PdfArray();
  for (int i = 0; i < annots.size(); ++i) {
    PdfDictionary annotDict = annots.getAsDict(i);
    // Keep everything that is not a link annotation.
    if (annotDict == null || !PdfName.LINK.equals(annotDict.getAsName(PdfName.SUBTYPE))) {
      // Entries in /Annots are normally PdfIndirectReferences.
      // Adding the resolved dict directly would be A Bad Thing,
      // so copy the original array entry (the reference) instead.
      newAnnots.add(annots.getPdfObject(i));
    }
  }
  pageDict.put(PdfName.ANNOTS, newAnnots);
}

This removes every link annotation, including links that only jump within the document, not just those that open a website. If you need to be more selective, check the PDF specification, section 12.5.6.5 (link annotations) and section 12.6.4.7 (URI actions).
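To be more selective, the rule from those two sections of the spec can be sketched roughly as follows: a link annotation carries an action dictionary under /A, and a URI action is one whose /S entry is /URI. The sketch below is only a model of that decision using plain Java maps in place of PDF dictionaries; it does not call iText, and the names `UriLinkFilter`, `isUriLink` and `withoutUriLinks` are made up for illustration. With iText you would presumably read the same keys from the annotation dictionary via `getAsDict` and `getAsName`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Plain-Java model of the filtering rule. The maps stand in for PDF
// dictionaries and are NOT iText types; all names here are hypothetical.
final class UriLinkFilter {

    // True when the annotation is a /Link whose action (/A) has type /S = /URI.
    static boolean isUriLink(Map<String, Object> annot) {
        if (!"Link".equals(annot.get("Subtype"))) {
            return false; // not a link annotation at all
        }
        Object action = annot.get("A");
        if (!(action instanceof Map)) {
            return false; // e.g. a link that uses /Dest to jump inside the document
        }
        return "URI".equals(((Map<?, ?>) action).get("S"));
    }

    // Returns a new list holding every annotation except URI links, in order.
    static List<Map<String, Object>> withoutUriLinks(List<Map<String, Object>> annots) {
        List<Map<String, Object>> kept = new ArrayList<>();
        for (Map<String, Object> annot : annots) {
            if (!isUriLink(annot)) {
                kept.add(annot);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<Map<String, Object>> annots = List.of(
            Map.<String, Object>of("Subtype", "Link",
                "A", Map.<String, Object>of("S", "URI", "URI", "http://example.com")),
            Map.<String, Object>of("Subtype", "Link", "Dest", "chapter2"),
            Map.<String, Object>of("Subtype", "Text", "Contents", "reviewer note"));
        // Only the first entry is a URI link, so two annotations survive.
        System.out.println(withoutUriLinks(annots).size());
    }
}
```

In this model the in-document link and the text note are kept, and only the link that opens a URL is dropped, which is the behaviour the iframe scenario in the question seems to call for.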

溇涏 2024-11-11 10:56:02

With PDFsharp you can do it this way:

// Requires: using PdfSharp.Pdf; using PdfSharp.Pdf.IO;
void RemoveHyperlinks(string sourcePDF, string targetPDF)
{
    using (PdfDocument PDFDoc = PdfReader.Open(sourcePDF, PdfDocumentOpenMode.Import))
    using (PdfDocument PDFNewDoc = new PdfDocument())
    {
        // Copy each page into the new document, dropping its annotations first.
        for (int Pg = 0; Pg < PDFDoc.Pages.Count; Pg++)
        {
            PdfPage page = PDFDoc.Pages[Pg];
            page.Annotations.Clear(); // removes all annotations, not only links
            PDFNewDoc.AddPage(page);
        }

        PDFNewDoc.Save(targetPDF);
    }
}
