OCR RSA 密钥卡（安全令牌）

发布于 2024-08-16 07:01:15 字数 882 浏览 8 评论 0原文

我组装了一个快速的 WinForm/嵌入式 IE 浏览器控件，每天早上登录我们公司的银行网站并抓取/导出所需的存款信息（该银行是一家小型区域银行）。由于我们有几十个从同一个主账户提取的“伪账户”，因此检索实际上需要 10-15 分钟。

无论如何，唯一的问题是我们的商业银行帐户需要 RSA 安全令牌 (http://www.rsa.com/node.aspx?id=1156)--如果你不熟悉，它是一个小设备，每 15(?) 秒显示一个随机的 6 位数字，所以我必须在开始之前提示输入这个值。这是基于网站登录的安全模型之上的，因此即使您创建了一个无法执行任何操作的只读帐户，您仍然必须输入 RSA 号码。我们为不同的人提供了 5 个这样的令牌公司。

从我们的角度来看，这是令人讨厌的安全问题。我开玩笑说使用网络摄像头对钥匙扣上的数字进行 OCR 识别，这样他们就不必输入它——主要是为了在早上有人到达之前完成抓取/导出。好吧，他们问我是否真的能做到。

现在我问你，你认为从相机生成的 JPEG 图像中可靠地 OCR 这些数字需要多努力（多少小时）？我已经知道我可以轻松获取 JPEG。我认为您会尝试登录 3 次，因此确实需要达到 99% 的准确率。我可以在休息时间处理这个问题，但他们不希望我花超过几个小时的时间，所以我想尽可能多地利用现有代码。这是一个 7 段显示器（如闹钟），因此它并不完全是 OCR 包用来查看的文本。

另外，显示屏侧面还有一个倒计时器；通常，当它下降到 1 格时，您会等到下一个数字出现，然后从 5 格重新开始（就像手机上的信号强度）。因此，这也需要是 OCRd，但它不是文本。

不管怎样，当我打字的时候，我想得越多，我就越不相信我能真正把它做好，所以也许我应该在业余时间做这件事？

原文

I put together a quick WinForm/embedded IE browser control which logs into our company's bank website each morning and scrapes/exports the desired deposit information (the bank is a smallish regional bank). Since we have a few dozen "pseudoaccounts" that draw from the same master account, this actually takes 10-15 minutes to retrieve.

Anyway, the only problem is that our business bank account reuires an RSA security token (http://www.rsa.com/node.aspx?id=1156)--if you are not familiar, it is a small device which shows a random 6 digit number every 15(?) seconds, so I have to prompt for this value before starting. This is on top of the website's login based security model, so even if you create a read-only account that can't do anything, you still have to put the RSA number in. We have 5 of these tokens for different people in the company.

From our perspective this is nusiance security. I was joking about using a web camera to OCR the digits from the key fob so they didn't have to type it in -- mainly so that the scraping/export would be done before anyone arrives in the morning. Well, they asked if I could really do it.

So now I ask you, how hard (how many hours) do you think it would take to OCR these digits reliably from a JPEG image produced by the camera? I already know I can get the JPEG easily. I think you get 3 tries to log in, so it really needs to hit a 99% accuracy rate. I could work on this on my off time, but they don't want me to put more than a few hours into it, so I want to leverage as much existing code as possible. This is a 7-segment display (like an alarm clock) so it's not exactly text that an OCR package would be used to seeing.

Also--there is a countdown timer on the side of the display; typically when it is down to 1 bar, you wait until the next number appears and it starts over at 5 bars (like signal strength on your cell phone). So this would need to be OCRd as well but it is not text.

Anyway the more I think about it as I type this, the less convinced I am that I can truly get this right, so maybe I should just work on it in my spare time?

分享到QQ

分享到微博