Implementing a CAPTCHA after 50% of an article
We are planning to put a large number of business research reports and articles from our intranet onto the Internet. However, we don't want others to copy the content and host it themselves.
I read about CAPTCHA protection and was wondering whether this is possible: readers should be able to read 50% of an article for free, after which they must solve a CAPTCHA to read the rest. (This way we make life a little harder for copycats.)
Any pointers on how to implement this? The content is in HTML, and we have programming experience in Perl and PHP. We can hire others if required.
Additionally, search engines will only be able to crawl half of each article. Will they penalize the site for not being able to crawl the rest, since they can't solve the CAPTCHA?
Thanks.
Comments (4)
There's a really good CAPTCHA service provided by reCAPTCHA: http://recaptcha.net/
There is a PHP class that you can use to do all the hard work.
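To give a rough idea of what such a helper class does on the server, here is a minimal sketch in Python (chosen only for illustration; the same steps apply in PHP or Perl). It assumes the current reCAPTCHA `siteverify` endpoint, which is not the URL given in this answer, and the names `captcha_passed` and `_post_form` are hypothetical. The `post` parameter exists only so the network call can be swapped out when testing.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the current reCAPTCHA verification endpoint (not from this thread).
VERIFY_URL = "https://www.google.com/recaptcha/api/siteverify"


def _post_form(url, fields):
    """POST form-encoded fields and return the response body as text."""
    data = urllib.parse.urlencode(fields).encode("ascii")
    with urllib.request.urlopen(url, data=data, timeout=10) as resp:
        return resp.read().decode("utf-8")


def captcha_passed(secret, token, remote_ip=None, post=_post_form):
    """Return True only if the verification service confirms the token.

    secret    -- the site's private key (kept on the server)
    token     -- the value the CAPTCHA widget submitted with the form
    remote_ip -- optional visitor IP address
    """
    if not token:
        return False  # nothing was submitted, so there is nothing to verify
    fields = {"secret": secret, "response": token}
    if remote_ip:
        fields["remoteip"] = remote_ip
    try:
        result = json.loads(post(VERIFY_URL, fields))
    except (OSError, ValueError):
        return False  # network failure or malformed reply: treat as not solved
    return isinstance(result, dict) and result.get("success") is True
```

The check has to run on the server: if the decision were made in the browser, a scraper could simply skip it.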
It's important to bear in mind that search engines can't solve a CAPTCHA, so they will only index the first half of the report. As long as that half contains most of the right keywords, it shouldn't cause a major problem. Don't make the mistake of "detecting" a search engine and showing it different content from what a normal user sees (known as cloaking), as the major search engines treat this as spamming.
An alternative solution would be to use a service like Copyscape (http://www.copyscape.com/) to protect your content.
I know this is not what you're asking, but please take into account that CAPTCHAs are universally broken and will not protect your content. You said the first half is free; does that mean you intend to charge for the other half? A CAPTCHA won't help you here at all...
But even if you're just trying to prevent automated scraping, CAPTCHA still won't do the trick. Check out my answer to another captcha question... Or you can go straight to the ppt I presented at OWASP last year.
Have your PHP programmer output 50% of the article. At the bottom, add a CAPTCHA. If the user enters the correct CAPTCHA, output 100% of the article.
As a PHP programmer, I use http://www.phpcaptcha.org to implement CAPTCHAs.
No, the search engine won't penalize you, but the part of the article behind the CAPTCHA will not show up in search results.
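The flow described in this answer (serve half, show a CAPTCHA, then serve everything) could be sketched roughly as follows. This is an illustration in Python rather than PHP, under a few assumptions: the article is paragraph-based HTML, "50%" is measured by character count, and the session is a plain dictionary standing in for whatever session store the site uses. The names `split_preview`, `mark_unlocked`, `render_article`, and `CAPTCHA_FORM` are hypothetical.

```python
import re
from typing import Tuple

# Hypothetical placeholder; a real page would embed the CAPTCHA widget here.
CAPTCHA_FORM = '<form method="post"><!-- CAPTCHA widget goes here --></form>'


def split_preview(html: str, fraction: float = 0.5) -> Tuple[str, str]:
    """Split article HTML into (preview, remainder).

    The cut is made at the first closing </p> tag at or beyond `fraction`
    of the document length, so the preview never ends mid-paragraph.
    """
    ends = [m.end() for m in re.finditer(r"</p\s*>", html, flags=re.IGNORECASE)]
    if not ends:
        return html, ""  # no paragraph markup: nothing sensible to gate
    target = len(html) * fraction
    cut = next((e for e in ends if e >= target), ends[-1])
    return html[:cut], html[cut:]


def mark_unlocked(session: dict, article_id: str) -> None:
    """Record in the visitor's session that the CAPTCHA was solved."""
    session.setdefault("unlocked", set()).add(article_id)


def render_article(html: str, session: dict, article_id: str) -> str:
    """Return the full article if unlocked, else the preview plus the form."""
    if article_id in session.get("unlocked", set()):
        return html
    preview, remainder = split_preview(html)
    if not remainder:
        return html
    return preview + CAPTCHA_FORM
```

The point of this design is that the second half is never sent to the browser until the CAPTCHA has been verified on the server. Hiding it with CSS or JavaScript would not slow a scraper down, because the text would still be in the page source.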
As already mentioned, reCAPTCHA is a good way to go.
Have a look at Captcha::reCAPTCHA on CPAN, which according to the CPAN Ratings reviews "works out of the box".
If you want a CAPTCHA, there are plenty of modules on CPAN that do this ;-)
Hope that helps.