Selenium（Python）Webdriver JavaScript（Noscrypt）

发布于 2025-02-04 20:54:49 字数 904 浏览 0 评论 0原文

我正在尝试从网站上刮擦数据，以提供学生的注释来进行分析我会尝试这么好的

from selenium import webdriver
#set chromodriver.exe path
driver = webdriver.Chrome(executable_path="C:\\chromedriver.exe")
#set page load timeout

#launch URL
driver.get("https://amatti.education.gov.dz/")

当运行此代码打开网站时，第一件事： [该网站打开正常] [1] https://i.sstatic.net/ay7qj.png 网站打开后，将转到此网站：

[打开后访问此站点] [2] https://i.sstatic.net/nwvea.png

我注意到网站的html中有很好这意味着，如果浏览器不支持JavaScript将转到URL：Google.com

<noscript>
    <meta http-equiv="refresh" content="0; url=http://www.google.com/" />
</noscript>

有任何解决方案可以自动化此网站 [1]： https://i.sstatic.net/AY7QJ.PNG [2]： https://i.sstatic.net/nwvea.png

原文

I am trying to scraping data from a site provide note of student to make analysis
I try this good

from selenium import webdriver
#set chromodriver.exe path
driver = webdriver.Chrome(executable_path="C:\\chromedriver.exe")
#set page load timeout

#launch URL
driver.get("https://amatti.education.gov.dz/")

the first thing happen when run this code is open the site :
[the site open normal][1]
https://i.sstatic.net/ay7QJ.png
after the site open it go to this site :

[after open go to this site][2]
https://i.sstatic.net/NWvEa.png

I notice there is this good in the html of the site
that mean if the browser not support JavaScript will go to URL : google.com

<noscript>
    <meta http-equiv="refresh" content="0; url=http://www.google.com/" />
</noscript>

there is any solution to automate this site
[1]: https://i.sstatic.net/ay7QJ.png
[2]: https://i.sstatic.net/NWvEa.png

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

微暖i 2025-02-11 20:54:49

我找到了解决方案
问题来自网络驱动器
该网站知道有机器人刮擦数据
所以我使用这个论点

options.add_argument("--disable-blink-features=AutomationControlled")

，它的工作很好

I found the solution
the problem comes from WebDrive
the site knows there is bot scraping data
so i use this argument

options.add_argument("--disable-blink-features=AutomationControlled")

and its work fine

回复收藏 0 原文

~没有更多了~

关于作者

终止放荡

暂无简介

文章

27 人气

关注发私信

甲如呢乙后呢

文章 0 评论 0

关注

王权女流氓

文章 0 评论 0

关注

云雾

文章 0 评论 0

关注

wyh2033345759

文章 0 评论 0

关注

乖乖

文章 0 评论 0

关注

qq_xR3jkM

文章 0 评论 0

友情链接

文江博客

Selenium（Python）Webdriver JavaScript（Noscrypt）

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

甲如呢乙后呢

王权女流氓

云雾

wyh2033345759

乖乖

qq_xR3jkM

友情链接

Selenium（Python）Webdriver JavaScript（Noscrypt）

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

甲如呢乙后呢

王权女流氓

云雾

wyh2033345759

乖乖

qq_xR3jkM

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。