Circular lock-free buffer

I'm in the process of designing a system which connects to one or more streams of data feeds, does some analysis on the data, and then triggers events based on the results. In a typical multi-threaded producer/consumer setup, I will have multiple producer threads putting data into a queue and multiple consumer threads reading the data, and the consumers are only interested in the latest data point plus the n points before it. The producer threads will have to block if a slow consumer cannot keep up, and of course consumer threads will block when there are no unprocessed updates. Using a typical concurrent queue with a reader/writer lock would work nicely, but the rate of data coming in could be huge, so I want to reduce my locking overhead, especially the writer locks for the producers. I think a circular lock-free buffer is what I need.

Now two questions:

  1. Is a circular lock-free buffer the answer?

  2. If so, before I roll my own, do you know of any public implementation that will fit my needs?

Any pointers on implementing a circular lock-free buffer are always welcome.

BTW, doing this in C++ on Linux.

Some additional info:

The response time is critical for my system. Ideally, the consumer threads want to see any update as soon as it comes in, because an extra 1-millisecond delay could make the system worthless, or worth a lot less.

The design idea I'm leaning toward is a semi-lock-free circular buffer where the producer thread puts data into the buffer as fast as it can; let's call the head of the buffer A. It does not block unless the buffer is full, i.e. when A meets the end-of-buffer pointer Z. Consumer threads will each hold two pointers into the circular buffer, P and Pn, where P is the thread's local buffer head and Pn is the nth item after P. Each consumer thread advances its P and Pn once it finishes processing its current P, and the end-of-buffer pointer Z is advanced to the slowest Pn. When P catches up to A, which means there are no more new updates to process, the consumer spins and busy-waits for A to advance again. If a consumer thread spins for too long, it can be put to sleep to wait on a condition variable, but I'm okay with consumers burning CPU cycles waiting for updates, because that does not increase my latency (I'll have more CPU cores than threads). Imagine a circular track where the producer runs in front of a bunch of consumers; the key is to tune the system so that the producer usually runs just a few steps ahead of the consumers, and most of these operations can be done with lock-free techniques. I understand that getting the details of the implementation right is not easy... okay, very hard, which is why I want to learn from others' mistakes before making a few of my own.
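
To make that concrete, here is a minimal sketch of the layout under simplifying assumptions: a single producer, a fixed number of consumers, array indices instead of raw pointers, and the Pn read-ahead bookkeeping omitted. All names are hypothetical.

#include <algorithm>
#include <atomic>
#include <cstddef>
#include <cstdint>

constexpr size_t kSize = 1024;        // ring capacity, a power of two
constexpr size_t kConsumers = 4;

struct SharedRing {
    int64_t slots[kSize];
    std::atomic<uint64_t> head;                 // A: next slot to be written
    std::atomic<uint64_t> cursors[kConsumers];  // one P per consumer; min(P) plays the role of Z

    SharedRing() : head(0) {
        for (auto& c : cursors) c.store(0);
    }

    // Producer: busy-wait while full, i.e. while A would lap the slowest P.
    void publish(int64_t v) {
        const uint64_t h = head.load(std::memory_order_relaxed);
        for (;;) {
            uint64_t slowest = cursors[0].load(std::memory_order_acquire);
            for (size_t i = 1; i < kConsumers; ++i)
                slowest = std::min(slowest, cursors[i].load(std::memory_order_acquire));
            if (h - slowest < kSize) break;     // room for one more slot
        }
        slots[h % kSize] = v;
        head.store(h + 1, std::memory_order_release);   // publish the slot
    }

    // Consumer `id`: busy-wait until A moves past our local P, then consume.
    int64_t consume(size_t id) {
        const uint64_t p = cursors[id].load(std::memory_order_relaxed);
        while (head.load(std::memory_order_acquire) == p) { /* spin for A */ }
        const int64_t v = slots[p % kSize];
        cursors[id].store(p + 1, std::memory_order_release);  // advance P
        return v;
    }
};

The release store on head pairs with the consumers' acquire loads, so a consumer that sees the new A also sees the data written into the slot.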

甜味拾荒者 2024-07-27 15:56:56

I've made a particular study of lock-free data structures over the last couple of years. I've read most of the papers in the field (there are only about forty or so, although only about ten or fifteen are of any real use :-)

AFAIK, a lock-free circular buffer has not been invented. The problem will be dealing with the complex condition where a reader overtakes a writer, or vice versa.

If you have not spent at least six months studying lock-free data structures, do not attempt to write one yourself. You will get it wrong and it may not be obvious to you that errors exist, until your code fails, after deployment, on new platforms.

I believe, however, there is a solution to your requirement.

You should pair a lock-free queue with a lock-free free-list.

The free-list will give you pre-allocation and so obviate the (fiscally expensive) requirement for a lock-free allocator; when the free-list is empty, you replicate the behaviour of a circular buffer by instantly dequeuing an element from the queue and using that instead.

(Of course, in a lock-based circular buffer, once the lock is obtained, obtaining an element is very quick - basically just a pointer dereference - but you won't get that in any lock-free algorithm; they often have to go well out of their way to do things; the overhead of failing a free-list pop followed by a dequeue is on a par with the amount of work any lock-free algorithm will need to be doing).

Michael and Scott developed a really good lock-free queue back in 1996. A link below will give you enough details to track down the PDF of their paper; Michael and Scott, FIFO

A lock-free free-list is the simplest lock-free algorithm and in fact I don't think I've seen an actual paper for it.
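
For illustration, here is a minimal sketch of a lock-free free-list as a Treiber stack, which I believe is the classic construction. Note that pop() as written is exposed to the ABA problem; a real implementation adds a version tag or hazard pointers.

#include <atomic>

// Sketch of a lock-free free-list (Treiber stack). pop() below ignores
// the ABA problem, which any production version must handle.
struct Node {
    Node* next;
    // ... payload ...
};

class FreeList {
    std::atomic<Node*> head{nullptr};
public:
    void push(Node* n) {
        Node* old = head.load(std::memory_order_relaxed);
        do {
            n->next = old;
        } while (!head.compare_exchange_weak(old, n,
                     std::memory_order_release, std::memory_order_relaxed));
    }

    Node* pop() {
        Node* old = head.load(std::memory_order_acquire);
        while (old && !head.compare_exchange_weak(old, old->next,
                          std::memory_order_acquire, std::memory_order_acquire))
            ;
        return old;  // nullptr means empty: fall back to dequeuing, as above
    }
};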

安稳善良 2024-07-27 15:56:56

The term of art for what you want is a lock-free queue. There's an excellent set of notes with links to code and papers by Ross Bencina. The guy whose work I trust the most is Maurice Herlihy (for Americans, he pronounces his first name like "Morris").

甚是思念 2024-07-27 15:56:56

The requirement that producers or consumers block if the buffer is empty or full suggests that you should use a normal locking data structure, with semaphores or condition variables to make the producers and consumers block until data is available. Lock-free code generally doesn't block on such conditions - it spins or abandons operations that can't be done instead of blocking using the OS. (If you can afford to wait until another thread produces or consumes data, then why is waiting on a lock for another thread to finish updating the data structure any worse?)

On (x86/x64) Linux, inter-thread synchronization using mutexes is reasonably cheap if there is no contention. Concentrate on minimizing the time that the producers and consumers need to hold their locks. Given that you've said you only care about the last N recorded data points, I think a circular buffer would do this reasonably well. However, I don't really understand how this fits in with the blocking requirement and the idea of consumers actually consuming (removing) the data they read. (Do you want consumers to only look at the last N data points without removing them? Do you want producers to not care whether consumers keep up, and just overwrite old data?)

Also, as Zan Lynx commented, you can aggregate/buffer up your data into bigger chunks when you've got lots of it coming in. You could buffer up a fixed number of points, or all the data received within a certain amount of time. This means that there will be fewer synchronization operations. It does introduce latency, though, but if you're not using real-time Linux, then you'll have to deal with that to an extent anyway.
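
For comparison, a minimal sketch of the conventional structure (hypothetical class): a bounded queue guarded by one mutex and two condition variables, with the critical sections kept as short as possible.

#include <condition_variable>
#include <cstddef>
#include <deque>
#include <mutex>

// Sketch: classic bounded producer/consumer queue. Producers block when
// it is full, consumers block when it is empty -- the blocking semantics
// the question asks for.
template <typename T>
class BoundedQueue {
    std::mutex m;
    std::condition_variable not_full, not_empty;
    std::deque<T> q;
    const size_t capacity;
public:
    explicit BoundedQueue(size_t cap) : capacity(cap) {}

    void push(T v) {
        std::unique_lock<std::mutex> lk(m);
        not_full.wait(lk, [&] { return q.size() < capacity; });
        q.push_back(std::move(v));
        lk.unlock();               // notify outside the lock
        not_empty.notify_one();
    }

    T pop() {
        std::unique_lock<std::mutex> lk(m);
        not_empty.wait(lk, [&] { return !q.empty(); });
        T v = std::move(q.front());
        q.pop_front();
        lk.unlock();
        not_full.notify_one();
        return v;
    }
};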

任性一次 2024-07-27 15:56:56

The implementation in the Boost library is worth considering. It's easy to use and fairly high performance. I wrote a test and ran it on a quad-core i7 laptop (8 threads) and got ~4M enqueue/dequeue operations a second. Another implementation not mentioned so far is the MPMC queue at http://moodycamel.com/blog/2014/detailed-design-of-a-lock-free-queue. I have done some simple testing with this implementation on the same laptop, with 32 producers and 32 consumers. It is, as advertised, faster than the Boost lock-free queue.

As most of the other answers state, lock-free programming is hard. Most implementations will have hard-to-detect corner cases that take a lot of testing and debugging to fix. These are typically fixed with careful placement of memory barriers in the code. You will also find proofs of correctness published in many of the academic articles. I prefer testing these implementations with a brute-force tool. Any lock-free algorithm you plan on using in production should be checked for correctness using a tool like TLA+: http://research.microsoft.com/en-us/um/people/lamport/tla/tla.html.
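
For reference, a minimal usage sketch of the Boost queue mentioned above; the compile-time capacity parameter shown here is one of the ways boost::lockfree::queue can be sized.

#include <boost/lockfree/queue.hpp>
#include <iostream>
#include <thread>

int main() {
    // MPMC lock-free queue with a fixed capacity of 1024 elements.
    boost::lockfree::queue<int, boost::lockfree::capacity<1024>> q;

    std::thread producer([&] {
        for (int i = 0; i < 100000; ++i)
            while (!q.push(i)) { }     // push fails while the queue is full
    });
    std::thread consumer([&] {
        long long sum = 0;
        for (int i = 0; i < 100000; ++i) {
            int v;
            while (!q.pop(v)) { }      // pop fails while the queue is empty
            sum += v;
        }
        std::cout << sum << '\n';
    });
    producer.join();
    consumer.join();
}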

我早已燃尽 2024-07-27 15:56:56

There is a pretty good series of articles about this on DDJ. As a sign of how difficult this stuff can be, it's a correction on an earlier article that got it wrong. Make sure you understand the mistakes before you roll your own )-;

⊕婉儿 2024-07-27 15:56:56

One useful technique to reduce contention is to hash the items into multiple queues and have each consumer dedicated to a "topic".

For the most recent N items your consumers are interested in, you don't want to lock the whole queue and iterate over it to find an item to override; just publish items in N-tuples, i.e. all N recent items. Bonus points for an implementation where the producer blocks on a full queue (when consumers can't keep up) with a timeout, updating its local tuple cache in the meantime; that way you don't put back-pressure on the data source.
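
A minimal sketch of the sharding idea, with hypothetical names; Queue stands for any default-constructible queue type, and the topic string is whatever identifies a stream.

#include <cstddef>
#include <functional>
#include <string>
#include <vector>

// Sketch: reduce contention by sharding items across per-topic queues,
// with each consumer thread dedicated to exactly one shard.
template <typename Queue>
class ShardedQueues {
    std::vector<Queue> shards;
public:
    explicit ShardedQueues(size_t n) : shards(n) {}

    // Route an item by hashing its topic key to a shard.
    Queue& shard_for(const std::string& topic) {
        return shards[std::hash<std::string>{}(topic) % shards.size()];
    }
};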

征﹌骨岁月お 2024-07-27 15:56:56

Sutter's queue is sub-optimal and he knows it. The Art of Multiprocessor Programming is a great reference, but don't trust the Java guys on memory models, period. Ross's links will get you no definite answer, because their libraries had these same problems, and so on.

Doing lock-free programming is asking for trouble, unless you want to spend a lot of time on something you are clearly over-engineering before solving the problem (judging by the description, it is the common madness of 'looking for perfection' in cache coherency). It takes years, and it leads to not solving the problem first and optimising later, a common disease.

梦行七里 2024-07-27 15:56:56

I am not an expert on hardware memory models and lock-free data structures, and I tend to avoid using them in my projects; I go with traditional locked data structures.

However, I've recently noticed this video:
Lockless SPSC queue based on ring buffer

It is based on an open-source, high-performance Java library called the LMAX Disruptor, used by a trading system: LMAX Disruptor

Based on the presentation above, you make the head and tail pointers atomic and atomically check for the condition where the head catches the tail from behind, or vice versa.

Below you can see a very basic C++11 implementation of it:

// USING SEQUENTIAL MEMORY
#include<thread>
#include<atomic>
#include <cinttypes>
using namespace std;

#define RING_BUFFER_SIZE 1024  // power of 2 for efficient %
class lockless_ring_buffer_spsc
{
    public :

        lockless_ring_buffer_spsc()
        {
            write.store(0);
            read.store(0);
        }

        bool try_push(int64_t val)
        {
            const auto current_tail = write.load();
            const auto next_tail = increment(current_tail);
            if (next_tail != read.load())
            {
                buffer[current_tail] = val;
                write.store(next_tail);
                return true;
            }

            return false;  
        }

        void push(int64_t val)
        {
            while( ! try_push(val) );
            // TODO: exponential backoff / sleep
        }

        bool try_pop(int64_t* pval)
        {
            auto currentHead = read.load();

            if (currentHead == write.load())
            {
                return false;
            }

            *pval = buffer[currentHead];
            read.store(increment(currentHead));

            return true;
        }

        int64_t pop()
        {
            int64_t ret;
            while( ! try_pop(&ret) );
            // TODO: exponential backoff / sleep
            return ret;
        }

    private :
        std::atomic<int64_t> write;
        std::atomic<int64_t> read;
        static const int64_t size = RING_BUFFER_SIZE;
        int64_t buffer[RING_BUFFER_SIZE];

        int64_t increment(int64_t n)  // int64_t parameter avoids narrowing
        {
            return (n + 1) % size;
        }
};

int main (int argc, char** argv)
{
    lockless_ring_buffer_spsc queue;

    std::thread write_thread( [&] () {
             for(int i = 0; i<1000000; i++)
             {
                    queue.push(i);
             }
         }  // End of lambda expression
                                                );
    std::thread read_thread( [&] () {
             for(int i = 0; i<1000000; i++)
             {
                    queue.pop();
             }
         }  // End of lambda expression
                                                );
    write_thread.join();
    read_thread.join();

     return 0;
}
水染的天色ゝ 2024-07-27 15:56:56

I would agree with this article and recommend against using lock-free data structures. A relatively recent paper on lock-free FIFO queues is this; search for further papers by the same author(s). There's also a PhD thesis from Chalmers regarding lock-free data structures (I lost the link). However, you did not say how large your elements are: lock-free data structures work efficiently only with word-sized items, so you'll have to dynamically allocate your elements if they're larger than a machine word (32 or 64 bits). If you dynamically allocate elements, you shift the (supposed, since you haven't profiled your program and are basically doing premature optimization) bottleneck to the memory allocator, so you need a lock-free memory allocator, e.g., Streamflow, and to integrate it with your application.

爱的故事 2024-07-27 15:56:56

This is an old thread, but since it hasn't been mentioned yet: there is a lock-free, circular, 1-producer -> 1-consumer FIFO available in the JUCE C++ framework.

https://www.juce.com/doc/classAbstractFifo#details
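
A usage sketch, as I read the JUCE documentation: AbstractFifo only manages the read/write indices, so you pair it with storage of your own and copy through the two (possibly wrapping) ranges it hands back.

#include <juce_core/juce_core.h>

// Sketch: a 1024-slot SPSC ring. AbstractFifo returns each transfer as up
// to two contiguous ranges because the region may wrap around the end.
juce::AbstractFifo fifo(1024);
float storage[1024];

void writeSamples(const float* src, int num) {
    int start1, size1, start2, size2;
    fifo.prepareToWrite(num, start1, size1, start2, size2);
    for (int i = 0; i < size1; ++i) storage[start1 + i] = src[i];
    for (int i = 0; i < size2; ++i) storage[start2 + i] = src[size1 + i];
    fifo.finishedWrite(size1 + size2);
}

void readSamples(float* dst, int num) {
    int start1, size1, start2, size2;
    fifo.prepareToRead(num, start1, size1, start2, size2);
    for (int i = 0; i < size1; ++i) dst[i] = storage[start1 + i];
    for (int i = 0; i < size2; ++i) dst[size1 + i] = storage[start2 + i];
    fifo.finishedRead(size1 + size2);
}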

感性不性感 2024-07-27 15:56:56

Although this is an old question, no one has mentioned DPDK's lockless ring buffer. It's a high-throughput ring buffer that supports multiple producers and multiple consumers. It also provides single-consumer and single-producer modes, and the ring buffer is wait-free in SPSC mode. It's written in C and supports multiple architectures.

In addition, it supports bulk and burst modes, where items can be enqueued/dequeued in batches. The design lets multiple consumers or multiple producers write to the queue at the same time by simply reserving space through moving an atomic pointer.
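
A minimal usage sketch of the rte_ring API, assuming a single-producer/single-consumer ring inside a DPDK application (rte_eal_init must have run first):

#include <rte_lcore.h>
#include <rte_ring.h>

// Sketch: a 1024-slot ring created in SP/SC mode (wait-free per the DPDK
// docs); both calls return 0 on success and a negative value otherwise.
static struct rte_ring* make_ring() {
    return rte_ring_create("events", 1024, rte_socket_id(),
                           RING_F_SP_ENQ | RING_F_SC_DEQ);
}

static void example(struct rte_ring* r, void* item) {
    while (rte_ring_enqueue(r, item) != 0) { /* ring full: back off */ }

    void* out = NULL;
    while (rte_ring_dequeue(r, &out) != 0) { /* ring empty: back off */ }
}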

心欲静而疯不止 2024-07-27 15:56:56

Some time ago, I found a nice solution to this problem. I believe it is the smallest found so far.

The repository has an example of how to use it to create N threads (readers and writers) and have them share a single seat.

I ran some benchmarks on the test example and got the following results (in millions of ops/sec):

[Figure: throughput by buffer size]

[Figure: throughput by number of threads]

Notice how the number of threads does not change the throughput.

I think this is the ultimate solution to this problem. It works, and it is incredibly fast and simple, even with hundreds of threads and a queue of a single position. It can be used as a pipeline between threads, allocating space inside the queue.

Can you break it?

最近可好 2024-07-27 15:56:56

Check out Disruptor (How to use it), which is a ring buffer that multiple threads can subscribe to.

遇见了你 2024-07-27 15:56:56

Here is how I would do it:

  • map the queue into an array
  • keep state with next-read and next-write indexes
  • keep an empty/full bit vector around

Insertion consists of using a CAS with an increment and roll-over on the next-write index. Once you have a slot, add your value and then set the empty/full bit that matches it (see the sketch after these notes).

Removal requires a check of the bit first to test for underflow, but other than that it is the same as the write, only using the read index and clearing the empty/full bit.

Be warned,

  1. I'm no expert in these things
  2. atomic ASM ops seemed very slow when I used them, so if you end up with more than a few of them, you might be faster using locks embedded inside the insert/remove functions. The theory is that a single atomic op to grab the lock, followed by (very) few non-atomic ASM ops, might be faster than the same thing done by several atomic ops. But making this work would require manual or automatic inlining, so it's all one short block of ASM.
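
A sketch of that scheme, under assumptions of mine: a fetch_add ticket stands in for the CAS-with-increment, and an array of atomic bools stands in for the empty/full bit vector. Overflow beyond what the flags catch is not handled.

#include <atomic>
#include <cstdint>

// Sketch: each insert/remove claims a slot index with fetch_add, then the
// per-slot "full" flag is used to wait for, publish, and consume the value.
constexpr uint64_t kCap = 1024;              // power of two
int64_t slots[kCap];
std::atomic<bool> full[kCap];                // the empty/full bit vector
std::atomic<uint64_t> next_write{0}, next_read{0};

void insert(int64_t v) {
    uint64_t i = next_write.fetch_add(1) & (kCap - 1);    // claim a slot
    while (full[i].load(std::memory_order_acquire)) { }   // wait until empty
    slots[i] = v;
    full[i].store(true, std::memory_order_release);       // set the bit
}

int64_t remove_one() {
    uint64_t i = next_read.fetch_add(1) & (kCap - 1);
    while (!full[i].load(std::memory_order_acquire)) { }  // underflow check
    int64_t v = slots[i];
    full[i].store(false, std::memory_order_release);      // clear the bit
    return v;
}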
追星践月 2024-07-27 15:56:56

Just for completeness: there's a well-tested lock-free circular buffer in OtlContainers, but it is written in Delphi (TOmniBaseBoundedQueue is the circular buffer and TOmniBaseBoundedStack is a bounded stack). There's also an unbounded queue in the same unit (TOmniBaseQueue). The unbounded queue is described in Dynamic lock-free queue – doing it right. The initial implementation of the bounded queue (circular buffer) was described in A lock-free queue, finally!, but the code has been updated since then.

眼睛会笑 2024-07-27 15:56:56

You may try lfqueue

It is simple to use, and it is a circular, lock-free design.

#include <assert.h>
#include <stdlib.h>
#include "lfqueue.h"   /* assumed header name, per the lfqueue repository */

int main(void) {
    int *ret;
    int *int_data;
    int i = 0;

    lfqueue_t results;
    lfqueue_init(&results);

    /** Wrap this scope in multithreaded testing **/
    int_data = (int*) malloc(sizeof(int));
    assert(int_data != NULL);
    *int_data = i++;
    /* Enqueue: retry until the element is accepted */
    while (lfqueue_enq(&results, int_data) != 1) ;

    /* Dequeue: spin until an element is available */
    while ((ret = lfqueue_deq(&results)) == NULL) ;

    // printf("%d\n", *(int*) ret);
    free(ret);
    /** End **/

    lfqueue_clear(&results);
    return 0;
}
我乃一代侩神 2024-07-27 15:56:56

There are situations where you don't need locking to prevent a race condition, especially when you have only one producer and one consumer.

Consider this paragraph from LDD3:

When carefully implemented, a circular buffer requires no locking in the absence of multiple producers or consumers. The producer is the only thread that is allowed to modify the write index and the array location it points to. As long as the writer stores a new value into the buffer before updating the write index, the reader will always see a consistent view. The reader, in turn, is the only thread that can access the read index and the value it points to. With a bit of care to ensure that the two pointers do not overrun each other, the producer and the consumer can access the buffer concurrently with no race conditions.
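
In C++11 terms, "stores a new value into the buffer before updating the write index" maps to a release store paired with an acquire load. A sketch of just that ordering (the fullness check against the read index is omitted for brevity):

#include <atomic>
#include <cstdint>

int64_t buffer[1024];
std::atomic<uint64_t> write_index(0);

// Producer: the release store guarantees the slot write becomes visible
// no later than the new index value.
void publish(int64_t v) {
    uint64_t w = write_index.load(std::memory_order_relaxed);
    buffer[w % 1024] = v;
    write_index.store(w + 1, std::memory_order_release);
}

// Consumer: the acquire load pairs with the release store, so seeing the
// new index implies seeing the slot contents. The read position is
// consumer-private in the single-consumer case.
bool try_read(uint64_t& r, int64_t& out) {
    if (r == write_index.load(std::memory_order_acquire)) return false;
    out = buffer[r % 1024];
    ++r;
    return true;
}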

掐死时间 2024-07-27 15:56:56

If you take as a prerequisite that the buffer will never become full, consider using this lock-free algorithm:

capacity must be a power of 2
buffer = new T[capacity] ~ on different cache line
mask = capacity - 1
write_index ~ on different cache line
read_index ~ on different cache line

enqueue:
    write_i = write_index.fetch_add(1) & mask
    buffer[write_i] = element ~ release store

dequeue:
    read_i = read_index.fetch_add(1) & mask
    element
    while ((element = buffer[read_i] ~ acquire load) == NULL) {
        spin loop
    }
    buffer[read_i] = NULL ~ relaxed store
    return element
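
A C++11 rendering of that pseudocode, as I read it; assumptions: the element type is a pointer so nullptr can serve as the "empty" sentinel, the never-full prerequisite holds, and the per-variable cache-line placement noted above is left to the reader (e.g. alignment/padding per member).

#include <atomic>
#include <cstddef>

// Sketch: MPMC ring that assumes the buffer never fills. fetch_add hands
// out slot tickets; a nullptr slot means "not yet produced".
template <typename T>              // T must be a pointer type
class never_full_ring {
    static const size_t capacity = 1024;    // must be a power of 2
    static const size_t mask = capacity - 1;
    std::atomic<T> buffer[capacity];
    std::atomic<size_t> write_index;
    std::atomic<size_t> read_index;
public:
    never_full_ring() : write_index(0), read_index(0) {
        for (size_t i = 0; i < capacity; ++i) buffer[i].store(nullptr);
    }

    void enqueue(T element) {
        size_t write_i = write_index.fetch_add(1) & mask;
        buffer[write_i].store(element, std::memory_order_release);
    }

    T dequeue() {
        size_t read_i = read_index.fetch_add(1) & mask;
        T element;
        while ((element = buffer[read_i].load(std::memory_order_acquire)) == nullptr) {
            // spin loop
        }
        buffer[read_i].store(nullptr, std::memory_order_relaxed);
        return element;
    }
};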