飞碟对细胞中图像的处理是一个错误吗?

发布于 2024-11-01 21:19:33 字数 16657 浏览 1 评论 0 原文

我设法将我的示例缩减为以下内容(它使用了一些相当大的数据 URL,但 stackoverflow 应该很好地将它们从侧面流出......):

static
{
    String oldValue = System.getProperty("java.protocol.handler.pkgs");
    if (oldValue == null)
    {
        System.setProperty("java.protocol.handler.pkgs", "org.xhtmlrenderer.protocols");
    }
    else if (!oldValue.contains("org.xhtmlrenderer.protocols"))
    {
        System.setProperty("java.protocol.handler.pkgs", oldValue + "|org.xhtmlrenderer.protocols");
    }
}

@Test
public void testLegacyCellSpacing() throws Exception
{
    ITextRenderer renderer = new ITextRenderer();

    renderer.setDocumentFromString("<html>\n" +
                                   "  <head>\n" +
                                   "    <title>Some title</title>\n" +
                                   "  </head>\n" +
                                   "  <body>\n" +
                                   "    <div>\n" +
                                   "<table cellpadding=\"0\" cellspacing=\"0\">\n" +
                                   "<tr>\n" +
                                   "<td>\n" +
                                   "   <img src=\"\" width=\"185\" height=\"83\"/>\n" +
                                   "</td>\n" +
                                   "</tr>\n" +
                                   "<tr>\n" +
                                   "<td>\n" +
                                   "   <img src=\"\" width=\"185\" height=\"84\"/>\n" +
                                   "</td>\n" +
                                   "</tr>\n" +
                                   "<tr>\n" +
                                   "<td>\n" +
                                   "   <img src=\"\" width=\"185\" height=\"83\"/>\n" +
                                   "</td>\n" +
                                   "</tr>\n" +
                                   "</table>\n" +
                                   "    </div>\n" +
                                   "  </body>\n" +
                                   "</html>", null);

    renderer.layout();

    File pdfFile = File.createTempFile("TestFlyingSaucer", ".pdf");
    OutputStream output = new BufferedOutputStream(new FileOutputStream(pdfFile));
    try
    {
        renderer.createPDF(output, true);
    }
    finally
    {
        output.close();
    }

    System.out.println(); // breakpointing here to look at the output.  no good assertions available for checking this problem yet. :(
}

此代码运行,并生成一个 PDF,每个图像之间的空格不正确。我无法弄清楚是什么导致了这些空格,但我也无法弄清楚这是否是一个错误。我所知道的是:

  1. 网络浏览器不会在图像之间显示空格。
  2. 如果您删除图像并将其替换为固定大小的 div 之类的内容,问题就会神奇地消失(这证明飞碟正在使用旧的 cellpadding 和 cellspacing 属性做正确的事情,这是我最初关心的问题。)
  3. 如果您添加单元格的 1 像素边框 (td { border: 1px Solid red; }),它表明额外的空间位于 td 元素之外。
  4. 如果删除 之间的空格,问题似乎就会消失。

对我来说,元素内部的额外空间可能会导致元素外部的额外填充/边距,这似乎很奇怪。也许我完全误解了 HTML,所以我希望有人能在我将问题塞进他们的跟踪器之前在这里纠正我。

(另外:抢占“不要为此使用表格”的典型下意识答案 - 这不是我的 HTML,我试图在清理后将现实世界的 HTML 渲染为 PDF。此示例显示清理后的 HTML以排除清理代码中的问题。)

I managed to whittle my example down to the following (it uses some reasonably hefty data URLs but stackoverflow should flow them off the side nicely...):

static
{
    String oldValue = System.getProperty("java.protocol.handler.pkgs");
    if (oldValue == null)
    {
        System.setProperty("java.protocol.handler.pkgs", "org.xhtmlrenderer.protocols");
    }
    else if (!oldValue.contains("org.xhtmlrenderer.protocols"))
    {
        System.setProperty("java.protocol.handler.pkgs", oldValue + "|org.xhtmlrenderer.protocols");
    }
}

@Test
public void testLegacyCellSpacing() throws Exception
{
    ITextRenderer renderer = new ITextRenderer();

    renderer.setDocumentFromString("<html>\n" +
                                   "  <head>\n" +
                                   "    <title>Some title</title>\n" +
                                   "  </head>\n" +
                                   "  <body>\n" +
                                   "    <div>\n" +
                                   "<table cellpadding=\"0\" cellspacing=\"0\">\n" +
                                   "<tr>\n" +
                                   "<td>\n" +
                                   "   <img src=\"\" width=\"185\" height=\"83\"/>\n" +
                                   "</td>\n" +
                                   "</tr>\n" +
                                   "<tr>\n" +
                                   "<td>\n" +
                                   "   <img src=\"\" width=\"185\" height=\"84\"/>\n" +
                                   "</td>\n" +
                                   "</tr>\n" +
                                   "<tr>\n" +
                                   "<td>\n" +
                                   "   <img src=\"\" width=\"185\" height=\"83\"/>\n" +
                                   "</td>\n" +
                                   "</tr>\n" +
                                   "</table>\n" +
                                   "    </div>\n" +
                                   "  </body>\n" +
                                   "</html>", null);

    renderer.layout();

    File pdfFile = File.createTempFile("TestFlyingSaucer", ".pdf");
    OutputStream output = new BufferedOutputStream(new FileOutputStream(pdfFile));
    try
    {
        renderer.createPDF(output, true);
    }
    finally
    {
        output.close();
    }

    System.out.println(); // breakpointing here to look at the output.  no good assertions available for checking this problem yet. :(
}

This code runs, and generates a PDF which has incorrect spaces between each image. I can't figure out what is causing the spaces, but I can't figure out if it's a bug either. What I do know:

  1. Web browsers don't show spaces between the images.
  2. If you remove the images and replace it with something like a fixed size div, the problem magically goes away (this proves that flying saucer is doing the right thing with the legacy cellpadding and cellspacing attributes, which was my initial concern.)
  3. If you add a 1 pixel border to the cells (td { border: 1px solid red; }), it shows that the extra space is outside the td elements.
  4. If you remove the whitespace between <td> and <img the problem appears to vanish.

It seems odd to me that extra space inside the element could result in extra padding/margins outside the element. Perhaps I'm completely misunderstanding HTML though, so I'm hoping that someone will set me straight here before I jam an issue in their tracker.

(Also: Preempting the typical knee-jerk answer of "don't use tables for that" - this is not my HTML, I'm trying to render real world HTML to PDF after cleaning it up. This example shows post-cleanup HTML to rule out the problem being in the cleanup code.)

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(1

小伙你站住 2024-11-08 21:19:33

浏览器对 HTML 中许多地方的空白很敏感,包括元素内部和元素之间,甚至是您意想不到的地方。例如,如果您有一个带有 li 元素的 ul,如果您使用 CSS 使用 li {display:inline} 来设置项目的样式,水平菜单, 和下一个

  • 之间的任何空格都会将菜单项分开!
  • 在这种特殊情况下,看到第一个子元素缩进的空格肯定是不寻常的,但我不能明确地说根据 HTML 标准它是否会被视为错误。最好的解决方案确实是简单地消除引起问题的空白。

    Browsers are sensitive to white space in HTML in many places, including inside and in between elements, and even in places you would not expect. For instance, if you had a ul with li elements, if you use CSS to style the items with li {display:inline} to make a horizontal menu, any white space between </li> and the next <li> will space the menu items apart!

    In this particular case, seeing the white space indenting the first child element is certainly unusual but I can't say categorically if it would be considered a bug according to the HTML standards. Your best solution is indeed to simply eliminate white space where it causes issues.

    ~没有更多了~
    我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
    原文