How can I fill the empty spaces in PDF text with null values using PDFBox?

Posted 2025-01-22 07:14:28

I am using Java PDFBox to read a PDF.

The PDF is very long, with more than 40 pages, and I need to extract more than 100 elements from each page. Doing this manually with coordinates would take me forever.

Is there a way to get the text of a PDF page row by row, with each empty cell filled with some null value?

For example, when I parse this table:

using the code:

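    // document is an already loaded PDDocument; LOG is an SLF4J-style logger
    PDFTextStripper stripper = new PDFTextStripper();
    // order the extracted text by its position on the page
    stripper.setSortByPosition(true);

    // restrict extraction to page 30 only
    stripper.setStartPage(30);
    stripper.setEndPage(30);
    LOG.info("page 30 \n{}", stripper.getText(document));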

I get this:

016         1 300 
030        17 994        41 629        15 712 
042           676           676

The problem is that I can't tell whether a row contains one value or two, or which column each value belongs to!
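
One possible direction, sketched here under stated assumptions rather than as a definitive answer: the plain string returned by `getText` no longer carries horizontal positions, so empty cells cannot be recovered from it. PDFBox does expose positions if you subclass `PDFTextStripper` and override `writeString(String, List<TextPosition>)`, where each `TextPosition` offers `getXDirAdj()`. Once every text fragment has an x coordinate, it can be assigned to a column, and columns that receive nothing stay `null`. The sketch below shows only that assignment step in plain Java. The `ColumnFiller` and `Cell` names, the column boundaries, and the x values are all hypothetical and would have to be measured from the real page.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: maps positioned text fragments to table columns.
class ColumnFiller {

    // A text fragment plus the x coordinate where it starts on the page
    // (for example, a value collected from TextPosition.getXDirAdj()).
    static class Cell {
        final String text;
        final float x;

        Cell(String text, float x) {
            this.text = text;
            this.x = x;
        }
    }

    // columnUpperBounds must be sorted ascending; entry i is the largest x
    // that still belongs to column i. Columns with no fragment remain null.
    // Fragments to the right of the last bound are ignored.
    static String[] toRow(List<Cell> cells, float[] columnUpperBounds) {
        String[] row = new String[columnUpperBounds.length];
        for (Cell cell : cells) {
            for (int i = 0; i < columnUpperBounds.length; i++) {
                if (cell.x <= columnUpperBounds[i]) {
                    // Join fragments that land in the same column.
                    row[i] = (row[i] == null) ? cell.text : row[i] + " " + cell.text;
                    break;
                }
            }
        }
        return row;
    }

    public static void main(String[] args) {
        // Assumed boundaries and positions, for illustration only.
        float[] bounds = {100f, 200f, 300f, 400f};
        List<Cell> line = Arrays.asList(new Cell("016", 50f), new Cell("1 300", 210f));
        System.out.println(Arrays.toString(toRow(line, bounds)));
        // prints [016, null, 1 300, null]
    }
}
```

With this shape, a row such as `042 676 676` would come out with explicit `null` entries for the columns it skips, provided the column boundaries are the same on every page.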
