如何使用xml解析html表格？

发布于 2024-12-12 01:46:55 字数 704 浏览 0 评论 0原文

我有一个 html 表格，其格式

<table>
 <th>
     <td> td1 </td>
     <td> td2 </td>
     <td> td3 </td>
     <td> td4 </td>
     <td> td5 </td>
     <td> td6 </td>
     <td> td7 </td>
     <td> td8 </td>
     <td> td9 </td>
     <td> td10 <td>
   </th>
</table>

需要解析表格主体内每行的单元格。我使用 javascript 循环遍历该行，为了保存我正在使用 webmethod 的 html 内容（因为在保存时，我的页面将重新加载，并且我将丢失我的 html 表，以避免我使用 webmethod 将其存储在会话中）也发生在我的 javascript 调用中）。问题是我的客户端脚本有时会被跳过，并且我无法保存我的 html 内容。所以我想到在一个脚本调用中将html内容作为一个整体发送，并在服务器端进行解析。

现在，我需要知道如何从服务器端解析它。有人可以帮我用 xml 解析它吗？

原文

I'm having an html table of the format

<table>
 <th>
     <td> td1 </td>
     <td> td2 </td>
     <td> td3 </td>
     <td> td4 </td>
     <td> td5 </td>
     <td> td6 </td>
     <td> td7 </td>
     <td> td8 </td>
     <td> td9 </td>
     <td> td10 <td>
   </th>
</table>

I need to parse through the cells in each row within the table body. I looped through the row using a javascript and inorder to save the html content i'm using webmethod( because on saving, my page will reload and i will lose my html table, to avoid that i stored it on a session using webmethod and this too happens within my javascript call) . The issue is my client side script is getting skipped at times and i'm not able to save my html content. So i thought of to send the html content as a whole in one script call and do the parsing in server-side.

Now, i need to know how to parse it from server-side. Can some-body help me to parse it using xml?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

对风讲故事 2024-12-19 01:46:56

我认为您应该尝试

CodePlex 中的

HTML Agility Pack
Html Agility Pack (HAP) 到底是什么？
这是一个敏捷的 HTML 解析器，它构建一个读/写 DOM 并支持
普通 XPATH 或 XSLT（实际上您不必了解 XPATH 也不必
XSLT 来使用它，不用担心...）。它是一个 .NET 代码库，允许
您可以解析“网络外”HTML 文件。解析器非常宽容
带有“现实世界”格式错误的 HTML。对象模型非常类似于
什么建议 System.Xml，但用于 HTML 文档（或流）。