XPath to query multiple selectors
I want to get values and attributes from a selector, and then get the attributes and values of its children, all from one query.

Allow me to give an example. This is the structure:

<div class='message'>
    <div>
        <a href='http://www.whatever.com'>Text</a>
    </div>
    <div>
        <img src='image_link.jpg' />
    </div>
</div>
<div class='message'>
    <div>
        <a href='http://www.whatever2.com'>Text2</a>
    </div>
    <div>
        <img src='image_link2.jpg' />
    </div>
</div>

So I would like to make a single query that matches all of those at once. Something like this:

// $dom is the DOMDocument() set up after loading the HTML with $dom->loadHTML($html);
$dom_xpath = new DOMXPath($dom);
$elements = $dom_xpath->query('//div[@class="message"], //div[@class="message"] //a, //div[@class="message"] //img');
foreach ($elements as $ele) {
    echo $ele[0]->getAttribute('class'); // should return 'message'
    echo $ele[1]->getAttribute('href');  // should return 'http://www.whatever.com' in the 1st loop and 'http://www.whatever2.com' in the 2nd loop
    echo $ele[2]->getAttribute('src');   // should return 'image_link.jpg' in the 1st loop and 'image_link2.jpg' in the 2nd loop
}

Is there some way of doing that using multiple XPath selectors, as I did in the example, to avoid running queries all the time and save some CPU?

Comments (2)
Use the union operator (|) to combine the three paths in a single expression.

Note that this will return a flattened result set (so to speak). In other words, you won't access the elements in groups of three like your example shows. Instead, you'll just iterate everything the expressions matched (in document order). For this reason, it might be even smarter to simply iterate the nodes returned by //div[@class="message"] and use DOM methods to access their children (for the other elements).
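
To make the difference concrete, here is a minimal PHP sketch of both approaches. The wrapper element with id "root", the variable names ($flat, $groups), and the exact union string are assumptions made for illustration. The second half uses relative XPath queries with a context node rather than plain DOM traversal, which is one way (not the only one) to reach the children of each message div.

```php
<?php
// Sample markup from the question, wrapped in one root element (an assumption).
$html = <<<'HTML'
<div id="root">
  <div class='message'>
    <div><a href='http://www.whatever.com'>Text</a></div>
    <div><img src='image_link.jpg' /></div>
  </div>
  <div class='message'>
    <div><a href='http://www.whatever2.com'>Text2</a></div>
    <div><img src='image_link2.jpg' /></div>
  </div>
</div>
HTML;

$dom = new DOMDocument();
$dom->loadHTML($html);
$xpath = new DOMXPath($dom);

// 1) Union: a single query, but the result is one flat node list.
$union = '//div[@class="message"] | //div[@class="message"]//a | //div[@class="message"]//img';
$flat = [];
foreach ($xpath->query($union) as $node) {
    $flat[] = $node->nodeName; // no grouping: each match is a separate item
}

// 2) Grouped: iterate the message divs, then query relative to each one.
$groups = [];
foreach ($xpath->query('//div[@class="message"]') as $message) {
    // The second argument is the context node; './/' searches below it only.
    $link  = $xpath->query('.//a', $message)->item(0);
    $image = $xpath->query('.//img', $message)->item(0);
    $groups[] = [
        'class' => $message->getAttribute('class'),
        'href'  => $link ? $link->getAttribute('href') : null,
        'src'   => $image ? $image->getAttribute('src') : null,
    ];
}

print_r($flat);
print_r($groups);
```

The grouped version runs more queries than the union, but each relative query only walks the subtree of one message div, and the result already has the three-per-message shape the question asks for.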

Use an XPath expression that selects all three attributes belonging to the $k-th div in the document whose class attribute has the string value "message" (the div itself and any of its descendants).

You can evaluate N such XPath expressions, for $k from 1 to N, where N is the total count of //div[@class='message'].
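
A sketch of that loop in PHP follows. The expression (//div[@class="message"])[$k]//@* is an assumption: it is one expression consistent with the description above, not one confirmed by this answer. The root wrapper and the names $n and $perMessage are likewise only illustrative.

```php
<?php
// Sample markup from the question, wrapped in one root element (an assumption).
$html = <<<'HTML'
<div id="root">
  <div class='message'>
    <div><a href='http://www.whatever.com'>Text</a></div>
    <div><img src='image_link.jpg' /></div>
  </div>
  <div class='message'>
    <div><a href='http://www.whatever2.com'>Text2</a></div>
    <div><img src='image_link2.jpg' /></div>
  </div>
</div>
HTML;

$dom = new DOMDocument();
$dom->loadHTML($html);
$xpath = new DOMXPath($dom);

// N = total number of message divs; count() yields a float, so cast it.
$n = (int) $xpath->evaluate('count(//div[@class="message"])');

$perMessage = [];
for ($k = 1; $k <= $n; $k++) {
    // Assumed expression: every attribute on the k-th message div
    // and on any of its descendants.
    $expr = sprintf('(//div[@class="message"])[%d]//@*', $k);
    $attrs = [];
    foreach ($xpath->query($expr) as $attr) {
        $attrs[$attr->nodeName] = $attr->nodeValue;
    }
    $perMessage[] = $attrs;
}

print_r($perMessage);
```

Note that this evaluates N + 1 expressions in total (one count plus one per message), so it groups the attributes per message but does not reduce the number of queries.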

XSLT-based verification: when the transformation is applied to the provided XML document (wrapped in a single top element to make it well-formed), the XPath expression is evaluated twice and the selected attributes are formatted and output.