InputStreamReader不从某些网站读取HTML代码
我使用以下代码
if (HttpURLConnection.HTTP_OK == con.getResponseCode()) {
BufferedReader bf = new BufferedReader(new InputStreamReader(con.getInputStream()));
String line;
while ((line = bf.readLine()) != null) {
sb.append(line);
sb.append("/n");
}
System.out.println(sb);
}
从我的连接中读取HTML代码。代码效果很好,并且在某些网站上得到了正确的事情,但是其他网站似乎对此有所防御?无论哪种方式,我都不会收到完整的HTML连接代码。 这是应该打印出的实际结果。
I use the following piece of code
if (HttpURLConnection.HTTP_OK == con.getResponseCode()) {
BufferedReader bf = new BufferedReader(new InputStreamReader(con.getInputStream()));
String line;
while ((line = bf.readLine()) != null) {
sb.append(line);
sb.append("/n");
}
System.out.println(sb);
}
to read HTML code from my connection. Code works just fine and gets the right thing on some websites, but other ones seem to have some kind of defence against this? Either way, I don't receive full HTML code of connection.
Here's an example of what should be printed out expected result vs what I get actual result.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
一些Web服务器要求您指定用户代理,在这里您可以找到它们的列表: https://user-agents.net/
您可以尝试使用Windows 10(64bit)上的Firefox 100.0之类的东西:
Some web servers require you to specify a user agent, here you can find a list of them: https://user-agents.net/
You could try something like Firefox 100.0 on Windows 10 (64bit):