How to convert a document generated by Jsoup (Java HTML parser) to a string

Posted 2024-11-26 21:06:29, word count 184, views 4, comments 0

I have a document that was created with jsoup like this:

Document doc = Jsoup.connect("http://en.wikipedia.org/").get();

How do I convert that doc into a string?


故事↓在人 2024-12-03 21:06:29

Have you tried:

Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
String htmlString = doc.toString();

Since Document extends Element, it also has the method html(), which "Retrieves the element's inner HTML" according to the API (http://jsoup.org/apidocs/). So this should work as well:

Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
String htmlString = doc.html();

Additional info:

Each Document object holds a reference to an instance of the inner class Document.OutputSettings, which can be accessed via Document's outputSettings() method. There you can enable or disable pretty-printing with the setter prettyPrint(true/false). See the API for Document and Document.OutputSettings for further information.
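To try the calls above without a network connection, here is a minimal sketch that parses an in-memory string instead of fetching Wikipedia. It assumes jsoup is on the classpath; the class name DocumentToString, the helper render(), and the sample markup are made up for illustration.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;

public class DocumentToString {

    // Hypothetical helper: parse raw HTML and serialize it again,
    // with pretty-printing switched on or off via Document.OutputSettings.
    static String render(String rawHtml, boolean pretty) {
        Document doc = Jsoup.parse(rawHtml);
        doc.outputSettings().prettyPrint(pretty);
        return doc.html();
    }

    public static void main(String[] args) {
        // Sample markup (an assumption, stands in for the fetched page)
        String raw = "<html><head><title>Demo</title></head>"
                   + "<body><p>Hello</p></body></html>";
        Document doc = Jsoup.parse(raw);

        // Three ways to get the document as a String
        String viaToString = doc.toString();
        String viaHtml = doc.html();
        String viaOuterHtml = doc.outerHtml();
        System.out.println(viaToString.length() + " / "
                + viaHtml.length() + " / " + viaOuterHtml.length());

        // Indented, multi-line output
        System.out.println(render(raw, true));
        // Compact output without added line breaks
        System.out.println(render(raw, false));
    }
}
```

Turning pretty-printing off is usually the safer choice when the string is going to be compared or stored, since the indented form adds whitespace that was not in the input.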


溺渁∝ 2024-12-03 21:06:29

doc.toString() works, and so does doc.outerHtml().


微凉徒眸意 2024-12-03 21:06:29

Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
// Select part of the page and serialize it to an HTML string
Elements post = doc.select("div.post-content");
String dd = post.toString();
// Parse that string back into a new Document
Document ddd = Jsoup.parse(dd);

After parsing the string back into a Document, you can use the Document methods on it:

Elements scriptTag = ddd.getElementsByTag("script");
System.out.println(scriptTag);
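The round trip described in this answer (select, convert to a string, parse again) can be sketched offline as follows. This is only an illustration under assumptions: jsoup is on the classpath, the class FragmentRoundTrip and the helper scriptCount() are hypothetical names, and the sample markup stands in for a real page. It uses Elements.outerHtml() rather than toString() to make the intent explicit.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.select.Elements;

public class FragmentRoundTrip {

    // Hypothetical helper: select elements, serialize them to a String,
    // re-parse that String, and count the <script> tags in the result.
    static int scriptCount(String rawHtml, String cssQuery) {
        Document doc = Jsoup.parse(rawHtml);
        Elements selected = doc.select(cssQuery);

        // Outer HTML of every matched element, joined into one String
        String fragment = selected.outerHtml();

        // The String is parsed into a fresh, independent Document
        Document reparsed = Jsoup.parse(fragment);
        return reparsed.getElementsByTag("script").size();
    }

    public static void main(String[] args) {
        // Sample markup (an assumption): one script inside the post, one outside
        String raw = "<div class=\"post-content\"><p>Text</p>"
                   + "<script>var a = 1;</script></div>"
                   + "<script>var b = 2;</script>";

        // Only the script inside div.post-content survives the round trip
        System.out.println(scriptCount(raw, "div.post-content"));
    }
}
```

Note that the selected Elements can also be queried directly, for example with post.select("script"), so re-parsing is only needed when a standalone Document is actually required.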


About the author

土豪我们做朋友吧
