Auto-detect the product data feed of an arbitrary e-commerce website?
My web app needs to access an arbitrary E-Commerce store and determine whether or not it has a product data feed (i.e. a Google Base feed; an RSS/ATOM feed of all products in the store). Also, I need to extract the location of this feed.
The best solution I can think of so far is to maintain a comprehensive list of known locations of these feeds for given E-Commerce platforms and check them one by one for the site, crossing them off the list as they come back 404.
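A minimal sketch of that probing approach, under stated assumptions: the entries in `CANDIDATE_PATHS` are hypothetical placeholders and not real platform feed locations, and the helper names `looks_like_feed`, `fetch` and `find_feed` are invented for illustration. Because some stores return a 200 HTML page for unknown URLs, the sketch also checks that the response body has an RSS, Atom or RDF root element instead of relying on the 404 alone.

```python
import urllib.request
import xml.etree.ElementTree as ET
from urllib.parse import urljoin

# Hypothetical candidate paths, for illustration only; real platforms differ.
CANDIDATE_PATHS = ["/feed.xml", "/products.rss", "/products.atom"]

# Root element names used by RSS 2.0, Atom and RSS 1.0 (RDF) documents.
FEED_ROOT_TAGS = {"rss", "feed", "RDF"}


def looks_like_feed(body: bytes) -> bool:
    """Return True if body parses as XML with an RSS/Atom/RDF root element."""
    try:
        root = ET.fromstring(body)
    except ET.ParseError:
        return False
    # Drop any "{namespace}" prefix from the root tag before comparing.
    local_name = root.tag.rsplit("}", 1)[-1]
    return local_name in FEED_ROOT_TAGS


def fetch(url: str, timeout: float = 10.0):
    """Return the response body, or None on HTTP errors (such as 404),
    network errors, or a malformed URL."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.read()
    except (OSError, ValueError):
        # urllib.error.URLError and HTTPError are subclasses of OSError.
        return None


def find_feed(base_url: str, paths=CANDIDATE_PATHS, fetcher=fetch):
    """Try each candidate path in order; return the first URL whose body
    looks like a feed, or None if none of them do."""
    for path in paths:
        url = urljoin(base_url, path)
        body = fetcher(url)
        if body is not None and looks_like_feed(body):
            return url
    return None
```

Calling `find_feed("https://shop.example")` would probe each candidate in turn. The `fetcher` parameter exists only so the probing logic can be exercised without network access.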
Two questions:
- Can you think of a better approach?
- How would I go about generating this list of known product data feed locations? In my experience, they are generally not made public (unlike blog RSS feeds).
Thanks so much! :)
-Rich
Comments (1)
Use search engine APIs to discover feeds. You could try using the Google, Bing and Yahoo Search APIs to discover product feeds on the domains you are interested in. This could be done as follows:
Obviously, this assumes that the feeds have been found and indexed by the search engines.
I don't believe there is such a thing as a "known location" for a product data feed. However, you could try including the following patterns in your algorithm: