PHP 用 DOM 解析（无结果）

发布于 2024-10-15 03:39:52 字数 664 浏览 8 评论 0原文

我正在尝试检索位于此 span 类属性中的正文文本。

<span id="" style="color:#525B64;">The quick brown fox jumped over the lazy dog.</span>

我在我的网络服务器上测试了它，没有收到任何错误，但页面是空白的。我对此很陌生，所以我不知道从这里该去哪里。

这是我的代码。

<?php
// Load remote file, supress parse errors
libxml_use_internal_errors(TRUE);
$dom = new DOMDocument;
$dom->loadHTMLFile('http://somewebpage.com');
libxml_clear_errors();

// use XPath to find all nodes with a class attribute of header
$xp = new DOMXpath($dom);
$nodes = $xp->query('//span[@class="msgBody"]');

// output first item's content
echo $nodes->item(0)->nodeValue;
?>

原文

I am trying to retreive the body text located in this span class attribute.

<span id="" style="color:#525B64;">The quick brown fox jumped over the lazy dog.</span>

I tested it on my web server and I get no errors but the page is blank. I'm very new to this so I do not know where to go from here.

Here is my code.

<?php
// Load remote file, supress parse errors
libxml_use_internal_errors(TRUE);
$dom = new DOMDocument;
$dom->loadHTMLFile('http://somewebpage.com');
libxml_clear_errors();

// use XPath to find all nodes with a class attribute of header
$xp = new DOMXpath($dom);
$nodes = $xp->query('//span[@class="msgBody"]');

// output first item's content
echo $nodes->item(0)->nodeValue;
?>

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

情魔剑神 2024-10-22 03:39:52

这段代码中一切看起来都很好。

我想做的是：

删除抑制解析错误的行。
使用 file_get_contents 加载远程文件，看看它是否
使用 //* 之类的 XPath 正确加载查询文档，并循环遍历生成的 DOMNodeList（使用 foreach）查看树是否正确构建。

顺便提一句。为了抑制 ->loadHTMLFile() 方法报告的解析错误，我使用 @ 运算符。

回复收藏 0 原文

茶色山野 2024-10-22 03:39:52

DOM 为所有内容创建节点：属性、文本、注释、元素，凡是你能想到的东西。因此，您并不是在追求跨度节点的值，即使看起来是这样，您实际上想要获取跨度内的 TextNode 并获取其值。尝试类似的操作：

echo $nodes->item(0)->childNodes->item(0)->nodeValue

您也可以直接从 xpath 查询中获取此内容：（

$nodes = $xp->query('//span[@class="msgBody"]/text()');

尽管我个人在 xpath 方面从未有过太多运气。）

The DOM creates nodes for everthing: attributes, text, comments, elements, you name it. So you're not after the value of the span node even though it might seem that way, you actually want to get the TextNode inside of the span and get its value instead. Try something like:

echo $nodes->item(0)->childNodes->item(0)->nodeValue

You can also get this directly from the xpath query:

$nodes = $xp->query('//span[@class="msgBody"]/text()');

(Though I've never had much luck with xpath, personally.)

回复收藏 0 原文

王权女流氓 2024-10-22 03:39:52

您确定您正在解析的文档中只有一个包含此类的 span 元素吗？

也许 ->item(0) 返回空元素并且所需元素是列表中的下一个元素？

回复收藏 0 原文

薔薇婲 2024-10-22 03:39:52

这种行为通常是由于默认命名空间（检查是否有类似的内容：xmlhs="http://www.w3.org/1999/xhtml"< /代码>）。

在 XPath 表达式中使用默认命名空间中的元素名称是 xpath 标记中最常见的常见问题 - 只需搜索“xpath 默认命名空间”即可找到许多好的答案。

回复收藏 0 原文

~没有更多了~

关于作者

情魔剑神

暂无简介

文章

25 人气

关注发私信

lanyue

文章 0 评论 0

关注

海螺姑娘

文章 0 评论 0

关注

Demos

文章 0 评论 0

关注

亢龙有悔

文章 0 评论 0

关注

海未深

文章 0 评论 0

关注

浅忆流年

文章 0 评论 0

友情链接

文江博客

PHP 用 DOM 解析（无结果）

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（4）

关于作者

相关话题

热门标签

推荐作者

lanyue

海螺姑娘

Demos

亢龙有悔

海未深

浅忆流年

友情链接

PHP 用 DOM 解析（无结果）

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（4）

关于作者

相关话题

热门标签

推荐作者

lanyue

海螺姑娘

Demos

亢龙有悔

海未深

浅忆流年

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。