Implementing a semi-round-robin file that can be expanded and saved on demand
Ok, that title is going to be a little bit confusing. Let me try to explain it a little bit better. I am building a logging program. The program will have 3 main states:
1. Write to a round-robin buffer file, keeping only the last 10 minutes of data.
2. Write to a buffer file, ignoring the time (record all data).
3. Rename the entire buffer file, and start a new one with the past 10 minutes of data (and change state to 1).
Now, the use case is this. I have been experiencing some network bottlenecks in our network from time to time. So I want to build a system to record TCP traffic when it detects a bottleneck (detection via Nagios). However, by the time it detects the bottleneck, most of the useful data has already been transmitted.
So, what I'd like is to have a daemon that runs something like dumpcap all the time. In normal mode, it'll only keep the past 10 minutes of data (since there's no point in keeping a boatload of data if it's not needed). But when Nagios alerts, I will send a signal to the daemon to store everything. Then, when Nagios recovers, it will send another signal to stop storing and flush the buffer to a save file.
Now, the problem is that I can't see how to cleanly store a rotating 10 minutes of data. I could store a new file every 10 minutes and delete the old ones when in mode 1. But that seems a bit dirty to me (especially when it comes to figuring out when the alert happened in the file).
Ideally, the file that was saved should be such that the alert is always at the 10:00 mark in the file. While that is possible with new files every 10 minutes, it seems a bit dirty to "repair" the files to that point.
Any ideas? Should I just do a rotating file system and combine them into 1 at the end (doing quite a bit of post-processing)? Is there a way to implement the semi-round-robin file cleanly so that there is no need for any post-processing?
Thanks
Oh, and the language doesn't matter as much at this stage (I'm leaning towards Python, but have no objection to any other language. It's less of an issue than the overall design)...
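For what it's worth, here's a rough Python sketch of the signal-driven mode switching I have in mind. The SIGUSR1/SIGUSR2 signals, the in-memory deque and reading text lines from stdin are just placeholder assumptions standing in for the real dumpcap pipeline:

    import collections
    import signal
    import sys
    import time

    WINDOW = 10 * 60                  # seconds of history to keep in normal mode
    buffer = collections.deque()      # (timestamp, line) pairs
    store_everything = False

    def start_storing(signum, frame):
        global store_everything
        store_everything = True       # Nagios alerted: stop discarding old data

    def stop_storing(signum, frame):
        global store_everything
        store_everything = False
        # Nagios recovered: flush the whole incident to a save file and reset.
        with open("alert-%d.log" % int(time.time()), "w") as out:
            out.writelines(line for _, line in buffer)
        buffer.clear()

    signal.signal(signal.SIGUSR1, start_storing)
    signal.signal(signal.SIGUSR2, stop_storing)

    for line in sys.stdin:
        now = time.time()
        buffer.append((now, line))
        if not store_everything:
            # Round-robin mode: drop anything older than the window.
            while buffer and buffer[0][0] < now - WINDOW:
                buffer.popleft()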
3 Answers
The first idea that comes to mind is to store MINUTES+1 (in this case 11) one-minute files, throwing away the older ones. On request, you could copy/merge the 10 files that aren't currently being written into one "big log file", and append the content of every further file as it finishes.
Then again, this looks like a "there has to be a tool for something like that" task, and maybe someone will come up with a tool for it :)
One problem this does not solve is having exactly the last X minutes of data: the saved window will always start at the 0-second boundary of the oldest one-minute file.
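Something along these lines, as a rough Python sketch (the chunk-*.log naming, the plain-text records and the merge target file are assumptions, not part of any particular tool):

    import glob
    import os
    import shutil
    import time

    KEEP = 11                                  # MINUTES + 1 files on disk

    def current_chunk_path():
        return "chunk-%d.log" % (int(time.time()) // 60)   # one file per minute

    def write_record(record):
        with open(current_chunk_path(), "a") as f:
            f.write(record + "\n")
        prune_old_chunks()

    def prune_old_chunks():
        chunks = sorted(glob.glob("chunk-*.log"))
        for old in chunks[:-KEEP]:             # throw away the older ones
            os.remove(old)

    def merge_finished_chunks(target="big-log-file.log"):
        # Copy/merge every chunk except the one currently being written.
        chunks = sorted(glob.glob("chunk-*.log"))
        with open(target, "a") as out:
            for chunk in chunks[:-1]:
                with open(chunk) as f:
                    shutil.copyfileobj(f, out)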
It's not exactly what you're looking for, but I think MongoDB Capped Collections are something you might want to look at.
So log all your stuff to a capped collection, which you've fixed in size to store approximately 10 minutes worth of data. When Nagios sends a signal, switch to storing into a non-capped collection until the bottleneck passes, then switch back. MongoDB will handle the aging out of old data on a per-row basis automatically, rather than shifting out whole 10 minute files at a time.
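For example, a minimal pymongo sketch of that switch (the database and collection names and the 100 MB cap are placeholder assumptions; you'd size the capped collection to roughly 10 minutes of your traffic):

    import datetime
    from pymongo import MongoClient

    db = MongoClient()["traffic"]

    # Cap the rolling collection at roughly 10 minutes' worth of documents.
    if "rolling" not in db.list_collection_names():
        db.create_collection("rolling", capped=True, size=100 * 1024 * 1024)

    def log_packet(summary, alerting):
        doc = {"ts": datetime.datetime.utcnow(), "summary": summary}
        if alerting:
            db["incident"].insert_one(doc)   # uncapped: keep everything during the alert
        else:
            db["rolling"].insert_one(doc)    # capped: old rows age out automatically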
What is the benefit of keeping only exactly the last 10 minutes of logs? To implement this you'd need to constantly check for old logs, remove them from the file, and then rewrite the file. Such functionality is easier to achieve with some DB, e.g. SQLite.
Log timestamps give you the same and more. Just keep two log files as you described: if a log file already has 10 minutes of logs, rename it (overwriting the older one) and start logging to a new file.
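A small SQLite sketch of the timestamp-based trimming (the schema, file name and the 10-minute cutoff are assumptions):

    import sqlite3
    import time

    conn = sqlite3.connect("traffic.db")
    conn.execute("CREATE TABLE IF NOT EXISTS log (ts REAL, entry TEXT)")

    def add_entry(entry, keep_everything=False):
        conn.execute("INSERT INTO log VALUES (?, ?)", (time.time(), entry))
        if not keep_everything:
            # Normal mode: drop rows older than 10 minutes instead of rewriting a file.
            conn.execute("DELETE FROM log WHERE ts < ?", (time.time() - 600,))
        conn.commit()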