如何监控正版页面点击量
我正在尝试监控真实的页面点击率。这就是我的网站所做的。我有一个文章目录,人们可以在其中发布文章。当他们的文章发布时,他们会根据访问其页面的唯一用户数量获得报酬。所以页面点击量很重要。这是我面临的问题。
我需要什么:
- 我不想跟踪小型搜索引擎或机器人的页面点击。
- 我希望四大搜索引擎都能浏览我的网站,因为我可以通过 IP 地址监控它们,而不会将它们的访问算作页面点击。对于垃圾邮件机器人来说这是无法做到的,因为它们在伪装成真人或主要搜索引擎方面做得很好。
问题:
- 互联网上存在垃圾邮件机器人 不尊重 robots.txt 文件
- 有些机器人试图冒充真正的人类用户。通过操纵用户代理和标头中的其他内容。
- 总是检查数据库中是否有良好的 IP 地址可能会影响性能。
- 人类可以绕过验证码,只允许他们的机器人查看我的页面
可能的解决方案:
- 在每个页面上都需要验证码。如果验证码通过。然后将 IP 地址记录为良好,或者在用户计算机上提交一个 cookie,表明他们通过了。
- 允许所有主要搜索引擎 IP 地址,这样它们就不会收到验证码
- 购买机器人检测软件
- 要求查看者每 7 天传递一次验证码
获得准确的人工页面浏览量对于该网站的正常运行至关重要。你们还有其他想法吗
I am trying to monitor genuine page hits. Here is what my site does. I have an article directory where people can post articles. When their article is posted they are paid depending on the amount of unique users visit their pages. So page hits are important. Here is the problem I am facing.
What I need:
- I don't want to track page hits by minor search engines or robots.
- I would like the major 4 search engines to surf my site because I can monitor them by IP address and not count their visit as a page hit. This cannot be done for spam bots because they do a good job of passing as a real human or major search engine.
Problems:
- There are spam bots on the internet
that do not honor the robot.txt file - There are bots that try to fake being a real human user. By manipulating the user agent and other things in the header.
- Performance may suffer by always checking the database for good IP addresses
- A human being can bypass the captha only to allow their robot to view my pages
Possible solutions:
- Require a captcha on every page. If the captcha passes. then log the IP address as good or submit a cookie on the users machine indicating they passed.
- Allow all major search engines IP address, so they will not be presented with a captcha
- Purchase a bot detection software
- Require the viewer to pass a captca every 7 days
Getting accurate human page views is critical for this site to work properly. Do you guys have any other ideas
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
您可以将其留给 Google Analytics。它可以很好地解决您想要解决的问题,而且是免费的。
You could just leave it to Google Analytics. It does a very good job solving the kind of problem you're trying to solve and it's free.
您有理由不使用现有的服务或解决方案吗?
如果您只想监控页面点击量,请在您的网站上设置 Google Analytics 或类似服务,它们会比手动解决方案更好地过滤噪音。
Do you have a reason not to use an existing service or solution?
If you just want to monitor page hits, set up Google Analytics or a similar service on your site, and they'll do a better job of filtering out the noise than a hand-rolled solution possibly could.