Screen-scraping an ASP.NET web page to retrieve data displayed in a GridView

Posted on 2024-07-15 23:17:12

I am using Ruby to screen-scrape a web page (built with ASP.NET) that uses a GridView to display data. I can read the data shown on page 1 of the grid, but I cannot figure out how to move to the next page of the grid so that I can read all of the data.

The problem is that the page-number hyperlinks are not normal hyperlinks with a URL. They are JavaScript hyperlinks that cause a postback to the same page.

An example of the hyperlink:

<a href="javascript:__doPostBack('gvw_offices','Page$6')" style="color:Black;">6</a>
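
For reference, the two arguments in a link like this are what the browser posts back as the `__EVENTTARGET` and `__EVENTARGUMENT` form fields. A minimal sketch of pulling them out of the href with plain Ruby follows; `parse_do_postback` and `DO_POSTBACK` are hypothetical names used only for illustration, not part of any library.

```ruby
# Hypothetical helper: extract the two arguments of a __doPostBack link.
# Returns [event_target, event_argument], or nil if the href is not a
# __doPostBack link.
DO_POSTBACK = /__doPostBack\(\s*'([^']*)'\s*,\s*'([^']*)'\s*\)/

def parse_do_postback(href)
  match = DO_POSTBACK.match(href)
  match && [match[1], match[2]]
end

# The href from the example link above.
href = %q{javascript:__doPostBack('gvw_offices','Page$6')}
target, argument = parse_do_postback(href)
puts "target=#{target} argument=#{argument}"
```

Here the first value identifies the grid control and the second names the page being requested.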

鱼忆七猫命九 2024-07-22 23:17:12

If you are already using Ruby for the processing, I recommend Watir, a Ruby library designed for browser testing. It gives you a much nicer interface to the DOM elements on the page, and it makes clicking links like this one easy:

ie.link(:text, '6').click

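It also gives you easier ways to navigate the table. Automating the whole process is straightforward:

# `ie` is the Watir browser object, already showing page 1 of the grid.
(1..total_number_of_pages).each do |page|
  # The current page is normally rendered as plain text, not a link,
  # so only click the pager link for pages after the first.
  ie.link(:text, page.to_s).click if page > 1
  # table processing goes here
end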

I don't know your use case, but this approach has advantages and disadvantages. It actually runs a browser instance, so if you need to run this frequently, quietly in the background, in a completely automated way, it may not be the best approach. On the other hand, if launching a browser instance is acceptable, you don't have to worry about the postback mechanics at all: you just click the link as a user would.
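
If launching a browser is not an option, the postback can also be replayed over plain HTTP. The sketch below assumes standard ASP.NET Web Forms behaviour: `__doPostBack(target, argument)` copies its two arguments into the hidden form fields `__EVENTTARGET` and `__EVENTARGUMENT` and submits the form, so the request has to carry those two fields plus the page's other hidden fields such as `__VIEWSTATE`. The helper names (`hidden_fields`, `postback_form`, `fetch_grid_page`) are hypothetical, the regex-based extraction stands in for a real HTML parser, and cookie or session handling and HTML-entity decoding are left out.

```ruby
require 'net/http'
require 'uri'

# Collect name/value pairs of all hidden <input> fields in the page HTML
# (for example __VIEWSTATE and, when present, __EVENTVALIDATION).
# A regex keeps this sketch dependency-free; a real HTML parser is more robust.
def hidden_fields(html)
  fields = {}
  html.scan(/<input\b[^>]*>/i) do |tag|
    next unless tag =~ /type\s*=\s*["']hidden["']/i
    name  = tag[/\bname\s*=\s*["']([^"']*)["']/i, 1]
    value = tag[/\bvalue\s*=\s*["']([^"']*)["']/i, 1] || ''
    fields[name] = value if name
  end
  fields
end

# Build the form data that emulates __doPostBack(target, argument):
# the page's own hidden fields, with the two event fields filled in.
def postback_form(html, target, argument)
  hidden_fields(html).merge(
    '__EVENTTARGET'   => target,
    '__EVENTARGUMENT' => argument
  )
end

# POST the form back to the same URL and return the HTML of the requested
# grid page. `html` must be the most recently fetched page, because the
# hidden field values change with every response.
def fetch_grid_page(url, html, grid_id, page_number)
  form = postback_form(html, grid_id, 'Page$' + page_number.to_s)
  Net::HTTP.post_form(URI(url), form).body
end

# Offline demonstration on a tiny, made-up HTML fragment.
sample = '<input type="hidden" name="__VIEWSTATE" value="abc123=" />'
p postback_form(sample, 'gvw_offices', 'Page$6')
```

To walk the whole grid, you would fetch the page once with a normal GET, then call `fetch_grid_page` for pages 2 onward, passing in the HTML returned by the previous request each time.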

Watir: http://wtr.rubyforge.org/