ReCAPTCHA 是如何工作的?

发布于 2024-08-05 03:45:12 字数 298 浏览 6 评论 0原文

我对这篇文章的阅读表明,ReCAPTCHA 的好处是它可以人类验证书籍 OCR/数字化中无法识别的单词。它通过使用“你是人类吗?”中的这些词来做到这一点。测试。所以 ReCAPTCHA 一石二鸟。伟大的!

但我不明白。如果数字化过程无法识别该单词,那么假设的人类输入的输入是根据什么进行验证的?这是如何运作的?

My reading of this article suggests that a benefit of ReCAPTCHA is that it can have humans verify words not recognised in the OCR/digitization of books. It does this by using these words in "Are you human?" tests. So ReCAPTCHA kills two birds with one stone. Great!

But I dont get it. If the word can't be recognised by the digitization process then what is the input entered, by the supposed human being, verified against? How does this work?

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(2

苦妄 2024-08-12 03:45:12

它显示两个字。其中之一计算机已经知道,而另一个则不知道。它假设如果你正确地了解了已知的一个,那么你也必须了解另一个。

你不知道这两者中的哪一个是已知的,所以理论上你无法欺骗它。此外,它还会与多人重播一个单词以获得独立确认,然后将其作为有效答案发送回源(报纸公司、书籍扫描组)。

但是如果计算机无法读取这样的
验证码,系统如何知道
谜题的正确答案?这是
how:每个无法阅读的新单词
通过 OCR 正确地提供给用户
与另一个词连用
答案是已知的。这
然后要求用户阅读这两个单词。
如果他们解决了问题
答案已知,系统假设
他们的答案对于新的来说是正确的
一。然后系统给出新的
图像给其他一些人
以更高的置信度确定
原来的答案是否是
正确。

http://recaptcha.net/learnmore.html

It shows two words. One of them the computer already knows, the other, it doesn't. It assumes that if you get the known one right, that you must know the other.

You don't know which of the two is already known so you, theoretically can't trick it. Additionally, it will replay a word with multiple people to get independent confirmation before sending it back to the source (newspaper company, book scanning group) as a valid answer.

But if a computer can't read such a
CAPTCHA, how does the system know the
correct answer to the puzzle? Here's
how: Each new word that cannot be read
correctly by OCR is given to a user in
conjunction with another word for
which the answer is already known. The
user is then asked to read both words.
If they solve the one for which the
answer is known, the system assumes
their answer is correct for the new
one. The system then gives the new
image to a number of other people to
determine, with higher confidence,
whether the original answer was
correct.

http://recaptcha.net/learnmore.html

新一帅帅 2024-08-12 03:45:12

引自了解 reCAPTCHA 的工作原理

但是如果计算机无法读取这样的验证码,系统如何知道谜题的正确答案呢?具体方法如下:将 OCR 无法正确读取的每个新单词与已知答案的另一个单词一起提供给用户。然后要求用户阅读这两个单词。如果他们解决了已知答案的问题,系统就会假设他们的答案对于新问题来说是正确的。然后,系统将新图像提供给其他一些人,以更高的置信度确定原始答案是否正确。

Quoted from LEARN HOW reCAPTCHA WORKS

But if a computer can't read such a CAPTCHA, how does the system know the correct answer to the puzzle? Here's how: Each new word that cannot be read correctly by OCR is given to a user in conjunction with another word for which the answer is already known. The user is then asked to read both words. If they solve the one for which the answer is known, the system assumes their answer is correct for the new one. The system then gives the new image to a number of other people to determine, with higher confidence, whether the original answer was correct.

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文