Scraping data from a Flash-only site
I know a bit of JavaScript, HTML, CSS, VBA and general programming structures (functions, loops, etc.).
How would you approach this problem? A sports betting website posts numerical data (odds) on its Flash-only website. The betting odds change very frequently.
I need to get this data, preferably by scraping it every second and writing the data and a timestamp to a text file.
They use the RTMPS protocol to send the data to my browser. So is packet sniffing impossible? I am a n00b, so forgive me if I am wrong.
I know pixel scraping is possible, but it's a real pain to get working.
I saved the website and reverse engineered the SWF file into an FLA file. I think I have the complete source code of the site. Fortunately the code is very descriptive and readable, but it is enormous.
The question is: how can I get the data not by pixel scraping or packet sniffing, but by intruding on the data stream, "listening" to it, and capturing the data I want?
I am pretty sure there is NO function like getOddsGameID(223) or anything similar. I think the odds transfer may be initiated by the server rather than by some timed function within the script itself. So I need to more or less pretend to be a browser and get the raw data before it goes into the Flash interpreter.
Comments (1)
You could try using Adobe's swf2html tool to extract the strings. It may get you what you need, although I haven't had much success with it.
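Whichever way the odds end up being captured, the logging half of the question (one sample per second, each written to a text file with a timestamp) is the easy part. Here is a minimal sketch in Python. It assumes a hypothetical `fetch_odds` callable that returns the current odds as a string; that callable, the tab-separated line format, and the function names are all illustrative assumptions, not anything taken from the site's code.

```python
import time
from datetime import datetime, timezone
from typing import Callable


def format_record(odds: str, when: datetime) -> str:
    """Build one log line: ISO-8601 timestamp, a tab, then the odds text."""
    return f"{when.isoformat()}\t{odds}\n"


def log_odds(
    fetch_odds: Callable[[], str],  # hypothetical: returns the current odds as text
    path: str,
    samples: int,
    interval: float = 1.0,
) -> None:
    """Append `samples` timestamped readings to `path`, pausing `interval` seconds after each."""
    with open(path, "a", encoding="utf-8") as out:
        for _ in range(samples):
            out.write(format_record(fetch_odds(), datetime.now(timezone.utc)))
            out.flush()  # keep the file current even if the process is killed
            time.sleep(interval)
```

Calling `log_odds(my_fetcher, "odds.txt", samples=3600)` would record roughly an hour of readings, where `my_fetcher` stands in for whatever capture method turns out to work.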