识别程序“之前”和“之后”管道中的程序来自相同的“工具集”。

发布于 2024-11-17 21:26:59 字数 299 浏览 3 评论 0原文

比如说，我正在编写一些工具集，其中每个工具都对相同的文本数据流进行操作，解析它，对其进行一些操作，并使用与原始输入中相同的语法返回文本流。工具可以是在管道中组合（与其他 UNIX 工具/脚本/其他内容一起）。因为文本输入处理（解析）非常昂贵，我想避免它，以防万一工具集中的两个或多个工具在管道中一个接一个地使用相反，二进制流（直接存储在内存结构中，没有无用的“额外”解析）。是吗可能知道（使用一些技巧、进程间通信或其他任何方式）管道中任何工具“之前”或“之后”的工具是工具集的一部分吗？我猜是 UNIX 环境。还没有准备好接受这种“信号”（AFAIK）。谢谢你的想法...

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

蹲墙角沉默 2024-11-24 21:27:00

另一种方法是让所有工具读取文本或二进制表示形式，可能由文件开头的幻数表示。命令行选项可以选择输出格式。
根据使用情况，最好将二进制设置为“默认”，并使用选项选择文本输出。

prog0 -binout <input.file | prog1 -binout | prog2 >output.file

对比：

prog0 <input.file | prog1 | prog2 -txtout >output.file

如果二进制幻数由非 ASCII 字节组成，则文本格式不需要幻数。

Another way would be to have all the tools read either textual or binary representations, perhaps indicated by a magic number at the beginning of the file. And a command-line option could select the output format.
Depending on the usage, it may be preferable to make binary the "default", and select text-output with an option.

prog0 -binout <input.file | prog1 -binout | prog2 >output.file

vs.

prog0 <input.file | prog1 | prog2 -txtout >output.file

You don't need a magic number for the text format if the binary magic number consists of non-ASCII bytes.

回复收藏 0 原文

清欢 2024-11-24 21:26:59

不，通过管道连接在一起的进程没有双向通信的方法。如果解析真的非常昂贵，以至于这是必要的（我猜它不是，但分析它），那么我可以想到两个选择：

有一个主程序，它可以选择告诉它哪些工具按顺序运行，然后让它运行“解析”工具，然后运行请求的工具（全部使用二进制 I/O），然后运行“输出”工具。公开使用解析/输出工具包装的各个工具也不会非常困难。
如果希望用户有足够的知识，让每个工具允许标志告诉他们期望二进制输入并提供二进制输出，以便用户可以像这样链接：
<前><代码>tool1 -o |工具2 -i -o |工具3 -i -o |工具4-i
其中 -o 表示提供二进制输出，-i 表示接受二进制输入。

No, processes that are piped together have no methods of two-way communication. If the parsing is really so expensive that this is necessary (I'd guess it isn't, but profile it), then you have a two options that I can think of:

Have a master program that takes options to tell it which tools to run, in which order, and then have it run a "parse" tool, followed by the requested tools (all using binary I/O), followed by an "output" tool. It wouldn't be terribly difficult to also expose the individual tools, wrapped with the parse/output tools.
If users are expected to be knowledgeable enough, have each tool allow flags to tell them to expect binary input and give binary output, so that users can chain like:
```
tool1 -o | tool2 -i -o | tool3 -i -o | tool4 -i
```
where -o means give binary output and -i means accept binary input.

回复收藏 0 原文

北渚 2024-11-24 21:26:59

您当然可以让工具链中的进程进行对话，但这需要一些工作。一种想法是让工具集中的每个进程使用 pgid（管道中每个进程的 pgid 相同）来确定共享内存名称，然后将其输入流的 pid 和 inode 写入共享内存。然后工具集中的每个进程都会知道管道中也在管道中的其他进程。如果 inode 匹配，它们就会知道它们的邻居是否在工具集中。

回复收藏 0 原文

~没有更多了~