Java、子进程和未读输出流:什么时候会死锁?
当使用 Runtime.exec() 在 java 中创建子进程时,我知道我必须填充输出流的输入/排出以防止阻塞子进程。
有趣的是,Process
的 javadoc 声明了更多一点:
...failure to promptly write the input stream or read the output stream of
the subprocess may cause the subprocess to block, and even deadlock.
我想知道在这种情况下子进程也可能死锁!
问题:
1、什么情况下会出现死锁?
2. 为什么会死锁?
3. 你能提供一个简短的示例程序来显示这个死锁吗?
4. 这个死锁是操作系统的错误吗?
When creating sub-processes in java using Runtime.exec()
, I am aware that I have to fill the input/drain the output streams to prevent blocking of the subprocess.
Interestingly, the javadoc of Process
states a little bit more:
...failure to promptly write the input stream or read the output stream of
the subprocess may cause the subprocess to block, and even deadlock.
I am wondering that in this situation the subprocess can also deadlock!
Questions:
1. Under which conditions does it deadlock?
2. Why does it deadlock?
3. Can you provide a short example program which shows this deadlock?
4. Is this deadlock a bug in the OS?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
当父进程在读取任何输出之前尝试向其子进程的输入流发送过多数据时,由于缓冲区大小有限,可能会发生死锁。
考虑以下代码:
对于
lines
的较小值,它会正常工作;在我们开始读取之前,tr 的所有输出都可以放入操作系统缓冲区中。然而,对于较大的值(>10000 应该足够了),操作系统缓冲区将被填满;在tr
命令中,对write
的调用将被阻塞,等待缓冲区被耗尽,然后 Java 代码正在写入的缓冲区将被填满(因为tr
被阻止,阻止它从输入中读取),进而阻止我们对out.write
的调用,导致死锁两个进程都在等待写入未主动读取的已满缓冲区。这种死锁并不是操作系统中的错误,因为进程间通信的缓冲区大小有限是经过深思熟虑的设计决策。替代方案(无限的缓冲区大小)有一些缺点:
另外,进程内缓冲区也可能导致死锁。假设,为了尝试解决上述死锁,我们将 Java 代码更改为交替写入一行,然后读取一行。然而,Linux 进程在不直接写入终端时,通常不会在每一行后刷新< /a>.因此,
tr
可能会读取一行,并将其写入其 libc 输出缓冲区,然后阻塞等待下一行被写入 - 而我们的 Java 代码将阻塞等待tr 输出一行。
Deadlock can occur due to limited buffer sizes when the parent tries to send too much data to the input stream of its child before reading any of the output.
Consider this code:
For small values for
lines
, it will work fine; all oftr
's output can fit in the OS buffer before we start reading it. However, for large values (>10000 ought to be sufficient), the OS buffer will fill up; within thetr
command, calls towrite
will block, waiting for the buffer to be drained, and in turn, the buffer that the Java code is writing to will fill up (becausetr
is be blocked, preventing it from reading from its input), in turn blocking our calls toout.write
, leading to a deadlock where both processes are waiting to write into full buffers that aren't being actively read from.This deadlock is not a bug in the OS, as limited buffer sizes for inter-process communication are a deliberate design decision. The alternative (unlimited buffer sizes) has some downsides:
As an aside, deadlock can also occur due to in-process buffers. Suppose that, to attempt to address the deadlock above, we changed our Java code to write one line, then read one line, alternatingly. However, it's common for Linux processes to not flush after every line when they're not writing directly to a terminal. So
tr
might read a line, and write it to its libc output buffer, and than block waiting for the next line to be written--and our Java code will block waiting fortr
to output a line.死锁总是涉及至少 2 个参与者。所以可能死锁的并不是子进程本身,而是父进程和子进程的组合。
举个例子:
子进程想要读取一行,处理它并打印结果。
父进程认为它可以首先从子进程读取输出,然后才向其提供输入。
结果:死锁,两个进程都在等待对方的输入。
因此,不,这不是操作系统中的错误,而是(父)进程中的逻辑错误。
Deadlock involves always at least 2 participants. So it's not the subprocess alone that may dead lock, but the combination of the parent process and the subprocess.
For an example:
Subprocess wants to read a line, process it and print the result.
Parent process thinks it can read output from the subprocess first, and only then provide input to it.
Result: deadlock, both processes are waiting for input from the other.
And, hence, no, it's not a bug in the OS but a logical error in the (parent) process.