Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 10 years ago.
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
接受
或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
发布评论
评论(2)
显然 SgmlReader 已在这里更新:
https://github.com/MindTouch/SGMLReader
Apparently SgmlReader's updated here:
https://github.com/MindTouch/SGMLReader
HTML 是 SGML 的一种实现。 如果你想正确解析 HTML,你将需要一个 SGML 解析器。 SGMLreader 似乎很好地满足了这些需求,我打算自己使用它。 我建议使用 HTML tidy。 它是一个本机应用程序,但它的 .net 绑定确实存在。 如果您需要完全托管的代码,那么 SGMLreader 就是您的最佳选择。
HTML is an implementation of SGML. If you want to parse HTML properly, you will need an SGML parser. SGMLreader appears to fit those needs well, and I plan to use it myself. I would suggest using HTML tidy. It is a native application, but .net bindings for it do exist. If you need entirely managed code, then the SGMLreader is the way to go.