如何在 mshtml.HTMLDocument (.NET) 中禁用 Javascript

发布于 2024-07-06 08:21:50 字数 432 浏览 6 评论 0原文

我有这样的代码：

Dim Document As New mshtml.HTMLDocument
Dim iDoc As mshtml.IHTMLDocument2 = CType(Document, mshtml.IHTMLDocument2)
iDoc.write(html)
iDoc.close()

但是，当我加载这样的 HTML 时，它会执行其中的所有 Javascript，并从“html”代码请求某些资源。

我想禁用 javascript 和所有其他弹出窗口（例如证书错误）。

我的目标是使用 mshtml 文档中的 DOM 以可靠的方式从 HTML 中提取一些标签（而不是一堆正则表达式）。

或者是否有另一个 IE/Office DLL，我可以只加载 HTML，而无需考虑 IE 相关的弹出窗口或活动脚本？

原文

I've got a code like this :

Dim Document As New mshtml.HTMLDocument
Dim iDoc As mshtml.IHTMLDocument2 = CType(Document, mshtml.IHTMLDocument2)
iDoc.write(html)
iDoc.close()

However when I load an HTML like this it executes all Javascripts in it as well as doing request to some resources from "html" code.

I want to disable javascript and all other popups (such as certificate error).

My aim is to use DOM from mshtml document to extract some tags from the HTML in a reliable way (instead of bunch of regexes).

Or is there another IE/Office DLL which I can just load an HTML wihtout thinking about IE related popups or active scripts?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

桃扇骨 2024-07-13 08:21:50

Dim Document As New mshtml.HTMLDocument
Dim iDoc As mshtml.IHTMLDocument2 = CType(Document, mshtml.IHTMLDocument2)
'add this code
iDoc.designMode="On"
iDoc.write(html)iDoc.close()

Dim Document As New mshtml.HTMLDocument
Dim iDoc As mshtml.IHTMLDocument2 = CType(Document, mshtml.IHTMLDocument2)
'add this code
iDoc.designMode="On"
iDoc.write(html)iDoc.close()

回复收藏 0 原文

复古式 2024-07-13 08:21:50

如果您已经将“html”作为字符串，并且您只想访问它的 DOM 视图，那么为什么要将它“渲染”到浏览器控件呢？

我不熟悉.Net 技术，但必须有某种 StringToDOM/StringToJSON 类型的东西更适合您的需求。

同样，如果您上面使用的“html”变量是 URL，则只需使用 wget 或类似工具将标记检索为字符串，并使用适用的工具进行解析。

我会寻找 .Net XML/DOM 库并使用它。（再次，我认为这将是该语言的一部分，但我不确定）

PS 经过快速谷歌后我发现了这个（来源）。不确定如果您要在 HTMLDocument 中使用它是否会有帮助。

    if(typeof(DOMParser) == 'undefined') {
      DOMParser = function() {}
      DOMParser.prototype.parseFromString = function(str, contentType) {
      if(typeof(ActiveXObject) != 'undefined') {
        var xmldata = new ActiveXObject('MSXML.DomDocument');
        xmldata.async = false;
        xmldata.loadXML(str);
        return xmldata;
     } else if(typeof(XMLHttpRequest) != 'undefined') {
        var xmldata = new XMLHttpRequest;
        if(!contentType) {
          contentType = 'application/xml';
        }
        xmldata.open('GET', 'data:' + contentType + ';charset=utf-8,' + encodeURIComponent(str), false);
        if(xmldata.overrideMimeType) {
          xmldata.overrideMimeType(contentType);
        }
        xmldata.send(null);
        return xmldata.responseXML;
     }
  }
}

If you have the 'html' as a string already, and you just want access to the DOM view of it, why "render" it to a browser control at all?

I'm not familiar with .Net technology, but there has to be some sort of StringToDOM/StringToJSON type of thing that would better suit your needs.

Likewise, if the 'html' variable you are using above is a URL, then just use wget or similar to retrieve the markup as a string, and parse with an applicable tool.

I'd look for a .Net XML/DOM library and use that. (again, I would figure that this would be part of the language, but I'm not sure)

PS after a quick Google I found this (source). Not sure if it would help, if you were to use this in your HTMLDocument instead.

    if(typeof(DOMParser) == 'undefined') {
      DOMParser = function() {}
      DOMParser.prototype.parseFromString = function(str, contentType) {
      if(typeof(ActiveXObject) != 'undefined') {
        var xmldata = new ActiveXObject('MSXML.DomDocument');
        xmldata.async = false;
        xmldata.loadXML(str);
        return xmldata;
     } else if(typeof(XMLHttpRequest) != 'undefined') {
        var xmldata = new XMLHttpRequest;
        if(!contentType) {
          contentType = 'application/xml';
        }
        xmldata.open('GET', 'data:' + contentType + ';charset=utf-8,' + encodeURIComponent(str), false);
        if(xmldata.overrideMimeType) {
          xmldata.overrideMimeType(contentType);
        }
        xmldata.send(null);
        return xmldata.responseXML;
     }
  }
}

回复收藏 0 原文