Need suggestions on a tile caching mechanism for a VNC-like application

Posted on 2024-11-14 21:42:12

I'm developing a "remote screencasting" application (similar to VNC, but not quite the same), in which I transfer updated tiles of screen pixels over the network. I'd like to implement a caching mechanism, and I'd like to hear your recommendations.

Here is how I think it should be done. For each tile coordinate, there is a fixed-size stack (cache) to which I add updated tiles. When saving a tile, I calculate some kind of checksum (CRC-16 would probably suffice, right?) of the tile data (i.e. its pixels). When I get a new tile (from a new screenshot of the desktop), I calculate its checksum and compare it against the checksums of all items in the stack for that tile coordinate. If a checksum matches, then instead of sending the tile I send a special message, e.g. "get tile from cache stack at position X". This means I need to keep identical cache stacks on the server and on the client.
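
A minimal Python sketch of the scheme described above might look like the following. The names `TileCache`, `encode_update` and `apply_update` and the message tuples are hypothetical, chosen only for illustration; `binascii.crc_hqx` (CRC-CCITT) stands in for "some CRC-16". The sketch compares checksums only, so a checksum collision would make the client display the wrong tile.

```python
import binascii
from collections import deque


class TileCache:
    """Fixed-depth stack of recent tiles for one tile coordinate (hypothetical)."""

    def __init__(self, depth=5):
        # Newest entry sits at index 0; the oldest is dropped when full.
        self.entries = deque(maxlen=depth)

    def lookup(self, pixels):
        """Return the stack position whose checksum matches, or None."""
        crc = binascii.crc_hqx(pixels, 0)  # 16-bit CRC-CCITT
        for pos, (cached_crc, _) in enumerate(self.entries):
            if cached_crc == crc:
                return pos
        return None

    def push(self, pixels):
        self.entries.appendleft((binascii.crc_hqx(pixels, 0), pixels))


def encode_update(caches, coord, pixels, depth=5):
    """Server side: send a cache reference if possible, else the tile itself."""
    cache = caches.setdefault(coord, TileCache(depth))
    pos = cache.lookup(pixels)
    if pos is not None:
        return ("cache_ref", coord, pos)
    cache.push(pixels)
    return ("tile", coord, pixels)


def apply_update(caches, message, depth=5):
    """Client side: mirror the server's stack and return the tile to draw."""
    kind, coord, payload = message
    cache = caches.setdefault(coord, TileCache(depth))
    if kind == "cache_ref":
        return cache.entries[payload][1]
    cache.push(payload)
    return payload


# Usage: the same tile reappearing is sent as a short reference.
server, client = {}, {}
tile_a = b"\x00" * 48
tile_b = b"\x00" * 47 + b"\xff"
for tile in (tile_a, tile_b, tile_a):
    msg = encode_update(server, (0, 0), tile)
    assert apply_update(client, msg) == tile
```

Both sides push only when a full tile is transmitted and leave the stack untouched on a cache hit, which is what keeps the two stacks identical without extra synchronization.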

Here are my questions:

What should the default stack size (depth) be? If the stack size is 5, the last 5 tiles at each coordinate are saved, so the total cache size is 5 times the size of the screen's pixel buffer. For big screens the raw RGB buffer of the screen is approx. 5 megabytes, so a 10-level stack means a 50 MB cache, right? So what should the cache depth be? I think maybe 10, but I need your suggestions.
I'm compressing the tiles to JPEG before sending them over the network. Should I cache the JPEG tiles, or the raw RGB tiles before compression? The logical choice would be to cache raw tiles, as that avoids unnecessary JPEG encoding for tiles that are found in the cache. But storing RGB pixels requires a much bigger cache. So what's the best option - before or after compression?
Is a CRC-16 checksum alone enough for comparing new screen tiles with the tiles in the cache stack? That is, should I additionally do a byte-by-byte comparison of the tiles when the CRCs match, or is that redundant? Is the collision probability low enough to be ignored?
In general, what do you think about the scheme I described? What would you change in it? Any suggestions would be appreciated!
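
The cache-size and collision questions above come down to simple arithmetic, which the following rough Python sketch makes explicit. It rests on assumptions not stated in the question: a hypothetical 1680x1050 screen at 3 bytes per pixel (which gives roughly the 5 MB buffer mentioned), and an idealized checksum whose values are uniformly distributed, so that two different tiles collide with probability 2^-16. Real CRC-16 behaviour on real screen content may differ.

```python
def cache_bytes(width, height, depth, bytes_per_pixel=3):
    """Total raw-pixel cache size when every tile coordinate keeps `depth` tiles."""
    return width * height * bytes_per_pixel * depth


def false_match_probability(depth, checksum_bits=16):
    """Chance that a tile NOT in the cache still matches one of `depth`
    cached checksums, assuming uniformly distributed checksum values."""
    return 1.0 - (1.0 - 2.0 ** -checksum_bits) ** depth


# Hypothetical 1680x1050 RGB screen with a 10-level stack.
total = cache_bytes(1680, 1050, depth=10)
print(total, "bytes, about", round(total / 2**20, 1), "MiB")

# Per-lookup chance of a false match against a full 10-level stack.
print(false_match_probability(10))
```

Under these assumptions the per-lookup false-match chance is roughly depth / 65536, and it applies to every changed tile in every frame, so the number of lookups over a session matters as much as the per-lookup figure.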
