PDF reader for Java that works like pdf.js
We have a project where we use pdf.js to render a PDF into a web page. pdf.js creates HTML container elements for the PDF pages, and the content of each page is split into HTML `span` elements in the view.
The attached image shows how the PDF text is rendered in the view. It also shows that each `span` has a `data-key`, and that a `span` does not correspond to a line in the PDF.
Now I need a PDF reader for Java that reads the content and breaks it into the same `span` chunks, either with their `data-key` or just the `span` chunks in order.
There are a lot of Java libraries that read PDF content line by line, but that does not solve my issue. I need a Java library that can break the content into chunks equivalent to the `span` elements in the view.
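To make the requirement concrete, here is a minimal sketch of the difference between line-based and span-based chunking. It assumes the PDF parser can report individual text runs in content order. `TextRun`, `SpanChunk`, and `SpanChunker` are hypothetical names, not types from any existing library, and the sequential integer key only stands in for `data-key`, whose real format may differ.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical model of one text run as reported by a PDF parser (not a real library type). */
final class TextRun {
    final String text;
    final double x; // left edge of the run
    final double y; // baseline of the run

    TextRun(String text, double x, double y) {
        this.text = text;
        this.x = x;
        this.y = y;
    }
}

/** One span-like chunk: an assumed sequential key plus the run's text. */
final class SpanChunk {
    final int key; // stand-in for data-key; the real scheme is not known
    final String text;

    SpanChunk(int key, String text) {
        this.key = key;
        this.text = text;
    }
}

final class SpanChunker {
    /** Desired behaviour: one chunk per run, kept in the order the runs arrive. */
    static List<SpanChunk> toSpans(List<TextRun> runs) {
        List<SpanChunk> spans = new ArrayList<>();
        int key = 0;
        for (TextRun run : runs) {
            spans.add(new SpanChunk(key++, run.text));
        }
        return spans;
    }

    /** For contrast: line-oriented output, merging consecutive runs that share a baseline. */
    static List<String> toLines(List<TextRun> runs, double tolerance) {
        List<String> lines = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        Double baseline = null;
        for (TextRun run : runs) {
            // A baseline jump larger than the tolerance starts a new line.
            if (baseline != null && Math.abs(run.y - baseline) > tolerance) {
                lines.add(current.toString());
                current.setLength(0);
            }
            current.append(run.text);
            baseline = run.y;
        }
        if (current.length() > 0) {
            lines.add(current.toString());
        }
        return lines;
    }
}
```

For the runs "Hello " and "world" on one baseline followed by "Next line" on another, `toSpans` yields three chunks while `toLines` yields two lines. The first result is the shape I am looking for.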