I want to crawl some e-commerce websites, such as Walmart or Target, and collect information about products and their prices. I found that using the Schema Markup Validator or Google's Rich Results Test to extract structured data (price, description, etc.) would be easier than searching the HTML for attributes with XPath. However, I don't know how to download the extracted data so that I can integrate it into my Java code.
Or better, is there any way I can do this programmatically in Java with the help of these schema testing tools?
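To make the goal concrete, here is a minimal sketch of the kind of extraction I have in mind. It does not call either testing tool. It assumes the product page embeds its schema.org data as JSON-LD inside `<script type="application/ld+json">` tags, which I have not verified for Walmart or Target. The class name `JsonLdExtractor`, the method `extractJsonLd`, and the sample HTML are all hypothetical, and the regex is only a stand-in for a real HTML parser.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: pulls raw JSON-LD blocks out of an HTML string.
public class JsonLdExtractor {

    // Matches <script ... type="application/ld+json" ...> BODY </script>,
    // case-insensitively and across line breaks.
    private static final Pattern JSON_LD = Pattern.compile(
            "<script[^>]*type\\s*=\\s*[\"']application/ld\\+json[\"'][^>]*>(.*?)</script>",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Returns the trimmed body of every JSON-LD script block, in document order.
    public static List<String> extractJsonLd(String html) {
        List<String> blocks = new ArrayList<>();
        Matcher m = JSON_LD.matcher(html);
        while (m.find()) {
            blocks.add(m.group(1).trim());
        }
        return blocks;
    }

    public static void main(String[] args) {
        // Made-up page fragment, not taken from any real site.
        String html = "<html><head>"
                + "<script type=\"application/ld+json\">"
                + "{\"@type\":\"Product\",\"name\":\"Widget\","
                + "\"offers\":{\"price\":\"9.99\"}}"
                + "</script></head><body></body></html>";

        for (String json : extractJsonLd(html)) {
            System.out.println(json);
        }
    }
}
```

Fetching the page and parsing the returned JSON strings into objects are left out here; the strings would still need a JSON library to get at fields such as the price.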