Can you force a crash if a write occurs to a given memory location with finer than page granularity?

Posted 2024-08-25 22:39:33

I'm writing a program that uses shared memory for performance reasons. Sockets and pipes have been evaluated as alternatives and are not fast enough for my task; generally speaking, any IPC method that involves copies is too slow. In the shared memory region I am writing many structs of a fixed size. There is one program responsible for writing the structs into shared memory, and many clients that read from it. However, there is one member of each struct that clients need to write to (a reference count, which they will update atomically). All of the other members should be read-only to the clients.

Because clients need to change that one member, they can't map the shared memory region as read-only. But they shouldn't be tinkering with the other members either, and since these programs are written in C++, memory corruption is possible. Ideally, it should be as difficult as possible for one client to crash another. I'm only worried about buggy clients, not malicious ones, so imperfect solutions are allowed.

I can try to stop clients from overwriting by declaring the members in the header they use as const, but that won't prevent memory corruption (buffer overflows, bad casts, etc.) from overwriting. I can insert canaries, but then I have to constantly pay the cost of checking them.

Instead of storing the reference count member directly, I could store a pointer to the actual count, which would live in a separately mapped writable page, while keeping the structs in pages mapped read-only. This will work: the OS will force my application to crash if it tries to write to the read-only struct data. But indirect storage can be undesirable when writing lock-free algorithms, because needing to follow another level of indirection can change whether something can be done atomically.
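To make the page-level behaviour described above concrete, here is a minimal sketch, assuming a POSIX system such as Linux or macOS. The function name `write_to_readonly_page_crashes` is hypothetical, and an anonymous mapping stands in for the real shared memory segment. A forked child attempts the stray write, so the parent can observe the fatal signal without dying itself.

```cpp
#include <cassert>
#include <cstddef>
#include <signal.h>
#include <sys/mman.h>
#include <sys/wait.h>
#include <unistd.h>

// Hypothetical helper: returns true if a write to a page mapped PROT_READ
// kills the writing process with a fault signal.
inline bool write_to_readonly_page_crashes() {
    const long page = sysconf(_SC_PAGESIZE);
    assert(page > 0);
    const std::size_t len = static_cast<std::size_t>(page);

    // Assumption: anonymous memory stands in for the shared struct region.
    void* p = mmap(nullptr, len, PROT_READ, MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (p == MAP_FAILED) return false;

    const pid_t child = fork();
    if (child < 0) {
        munmap(p, len);
        return false;
    }
    if (child == 0) {
        // Simulated buggy client: a stray write into read-only data.
        *static_cast<volatile char*>(p) = 1;
        _exit(0);  // Reached only if the write was not trapped.
    }

    int status = 0;
    waitpid(child, &status, 0);
    munmap(p, len);

    // Linux reports SIGSEGV here; some systems report SIGBUS instead.
    return WIFSIGNALED(status) &&
           (WTERMSIG(status) == SIGSEGV || WTERMSIG(status) == SIGBUS);
}
```

The protection applies to whole pages only, which is why the reference count has to live in a different page from the fields it accompanies.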

Is there any way to mark smaller areas of memory such that writing them will cause your app to blow up? Some platforms have hardware watchpoints, and maybe I could activate one of those with inline assembly, but I'd be limited to only 4 at a time on 32-bit x86 and each one could only cover part of the struct because they're limited to 4 bytes. It'd also make my program painful to debug ;)

Edit: I found this rather eye-popping paper, but unfortunately it requires using ECC memory and a modified Linux kernel.

墟烟 2024-09-01 22:39:33

I don't think it's possible to make just a few bits read-only like that at the OS level.

One thing that occurred to me just now is that you could put the reference counts in a different page, as you suggested. If the structs all have the same size and are stored in sequential memory locations, you could use pointer arithmetic to locate a reference count from the structure's pointer, rather than storing a pointer within the structure. This might be better than having a pointer for your use case.

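#include <cstddef>  // std::ptrdiff_t

long *refCountersBase;   // Start address of the ref counters page
MyStruct *structsBase;   // Start address of your structures page

// Get the address of the reference counter that belongs to myStruct.
// The counter for the n-th struct is the n-th element of the counters array.
long *getRefCounter(const MyStruct *myStruct)
{
    std::ptrdiff_t n = myStruct - structsBase;  // Index of the struct in its array
    return refCountersBase + n;
}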
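The parallel-array idea above could be combined with page protections roughly as follows. This is only a sketch under stated assumptions: `Record`, `ClientView`, `ref_count_for` and `make_demo_view` are hypothetical names, and anonymous mappings stand in for the two shared memory segments that a real client would map read-only and read-write respectively. It also assumes `std::atomic<long>` is lock-free on the target, which cross-process use would need.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <new>
#include <sys/mman.h>

// Hypothetical payload type; the real struct comes from the application.
struct Record {
    int id;
    char payload[60];
};

// Hypothetical client-side view of the two regions.
struct ClientView {
    const Record* records;          // Lives in pages mapped PROT_READ.
    std::atomic<long>* ref_counts;  // Lives in pages mapped read-write.
    std::size_t count;
};

// Locate the counter for a record by index, with no pointer stored in the record.
inline std::atomic<long>* ref_count_for(const ClientView& v, const Record* r) {
    const std::ptrdiff_t n = r - v.records;
    assert(n >= 0 && static_cast<std::size_t>(n) < v.count);
    return v.ref_counts + n;
}

// Demo setup. Assumption: anonymous memory replaces the shared segments.
inline ClientView make_demo_view(std::size_t count) {
    const std::size_t rec_len = count * sizeof(Record);
    const std::size_t ref_len = count * sizeof(std::atomic<long>);
    void* recs = mmap(nullptr, rec_len, PROT_READ | PROT_WRITE,
                      MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    void* refs = mmap(nullptr, ref_len, PROT_READ | PROT_WRITE,
                      MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (recs == MAP_FAILED || refs == MAP_FAILED) {
        return ClientView{nullptr, nullptr, 0};
    }

    // The writer's role: fill in the records and construct the counters.
    Record* w = static_cast<Record*>(recs);
    std::atomic<long>* c = static_cast<std::atomic<long>*>(refs);
    for (std::size_t i = 0; i < count; ++i) {
        w[i].id = static_cast<int>(i);
        new (&c[i]) std::atomic<long>(0);
    }

    // From here on, any write to a record faults; counters stay writable.
    if (mprotect(recs, rec_len, PROT_READ) != 0) {
        return ClientView{nullptr, nullptr, 0};
    }
    return ClientView{w, c, count};
}
```

A client would then call something like `ref_count_for(view, rec)->fetch_add(1)` to take a reference. The index computation is plain arithmetic on addresses the client already holds, so the record itself needs no extra pointer field.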
