iText 内存管理 - PdfReader/Watermarking 负载过多

发布于 2024-11-29 17:34:48 字数 2028 浏览 5 评论 0原文

我正在给文档添加水印，并且我不想将它们完全加载到内存中，因为它们可能非常大。我发现 RandomAccessFileOrArray 可以缓冲读取，它做得很好，但仍然加载了太多我不喜欢的内容。

也就是说，在我加载 5 Mb PDF 文件后，使用的内存增加了 23Mb ！当我开始给它加水印时，它又跳了 27Mb ！之后，使用的内存逐渐增加，但并不可怕。

这种行为有理由吗？您知道如何定义 PdfReader 或 RandomAccessFileOrArray 或其他内容的缓冲区大小吗？

感谢您的意见。

printMem 方法通过显示空闲-已用-总计来显示内存的状态。

这是我的代码

printMem("Before load");
    PdfReader reader = null;
    try {
        reader = new PdfReader(new RandomAccessFileOrArray(new FileInputStream("C:/TEMP/zip/100258.pdf")),null);
        printMem("After load");
        FileOutputStream out = new FileOutputStream(f);
        PdfStamper stamp = new PdfStamper(reader, out);

        int numPages = reader.getNumberOfPages();
        int page=1;
        BaseFont baseFont = 
            BaseFont.createFont(BaseFont.HELVETICA_BOLDOBLIQUE,
                BaseFont.WINANSI, BaseFont.EMBEDDED);
        float width;
        float height;

        while (page <= numPages) {
            printMem("Page " + page);
            PdfContentByte cb = stamp.getOverContent(page);
            height = reader.getPageSizeWithRotation(page).getHeight() / 2;
            width = reader.getPageSizeWithRotation(page).getWidth() / 2;

            cb.saveState();
            cb.setColorFill(MEDIUM_GRAY);

            // Primary Text
            cb.beginText();
            cb.setFontAndSize(baseFont, PRIMARY_FONT_SIZE);
            cb.showTextAligned(Element.ALIGN_CENTER, "WatermarkText", width,
                    height, TEXT_TILT_ANGLE);
            cb.endText();

            cb.restoreState();
            page++;
        }
        stamp.close();
    } catch(Throwable e) {
        reader = null;
        System.gc();
    }

这是部分输出：

Before load | 1566248160 6615840 1572864000
After load | 1542392472 30471528 1572864000
Page 1 | 1515096880 57767120 1572864000
Page 2 | 1515095992 57768008 1572864000
Page 47 | 1512998840 59865160 1572864000
Page 48 | 1512998840 59865160 1572864000

原文

I'm watermarking documents, and I don't want to have to load them completely to memory, as they can be quite large. I found that RandomAccessFileOrArray that kind of buffers the reading, which it does fine but still loads too much to my liking.

That is, after I load a 5 Mb PDF file, the used memory increases 23Mb ! And when I start watermarking it it jumps another 27Mb ! After that used memory gradually increases, but not horribly.

Is there a reason to such behaviour ? Would you know a way to define the buffer size of the PdfReader or RandomAccessFileOrArray or something else ?

Thanks for your input.

The method printMem shows the status of the memory by showing free - used - total.

Here is my code

printMem("Before load");
    PdfReader reader = null;
    try {
        reader = new PdfReader(new RandomAccessFileOrArray(new FileInputStream("C:/TEMP/zip/100258.pdf")),null);
        printMem("After load");
        FileOutputStream out = new FileOutputStream(f);
        PdfStamper stamp = new PdfStamper(reader, out);

        int numPages = reader.getNumberOfPages();
        int page=1;
        BaseFont baseFont = 
            BaseFont.createFont(BaseFont.HELVETICA_BOLDOBLIQUE,
                BaseFont.WINANSI, BaseFont.EMBEDDED);
        float width;
        float height;

        while (page <= numPages) {
            printMem("Page " + page);
            PdfContentByte cb = stamp.getOverContent(page);
            height = reader.getPageSizeWithRotation(page).getHeight() / 2;
            width = reader.getPageSizeWithRotation(page).getWidth() / 2;

            cb.saveState();
            cb.setColorFill(MEDIUM_GRAY);

            // Primary Text
            cb.beginText();
            cb.setFontAndSize(baseFont, PRIMARY_FONT_SIZE);
            cb.showTextAligned(Element.ALIGN_CENTER, "WatermarkText", width,
                    height, TEXT_TILT_ANGLE);
            cb.endText();

            cb.restoreState();
            page++;
        }
        stamp.close();
    } catch(Throwable e) {
        reader = null;
        System.gc();
    }

And here is the partial output:

Before load | 1566248160 6615840 1572864000
After load | 1542392472 30471528 1572864000
Page 1 | 1515096880 57767120 1572864000
Page 2 | 1515095992 57768008 1572864000
Page 47 | 1512998840 59865160 1572864000
Page 48 | 1512998840 59865160 1572864000

分享到QQ

分享到微博