Does gprof account for blocking time?

Posted on 2024-08-16 08:41:41

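I'm running gprof on an executable, but the executable spends a lot of its time waiting for child processes to finish. Does gprof's timing take that waiting time into account?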

烂柯人 2024-08-23 08:41:41

I haven't used gprof much, but to my knowledge neither the wait nor the child processes themselves are profiled.

See a simple example:

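#include <limits.h>
#include <stdlib.h>
#include <sys/types.h>  /* pid_t */
#include <sys/wait.h>   /* waitpid */
#include <unistd.h>     /* fork */

/* Burns CPU time; only ever called in the child process. */
void slow_function(void)
{
    unsigned int i;
    for (i = 0; i < UINT_MAX; i++);
}

/* Uses almost no CPU itself: it just blocks until the child exits. */
void quick_function(pid_t child)
{
    int status;
    waitpid(child, &status, 0);
    return;
}

int main(void)
{
    pid_t child;

    child = fork();
    if (child < 0)  /* fork failed */
        return 1;
    if (child == 0) /* child process */
    {
        slow_function();
        exit(0);
    }
    else            /* parent process */
        quick_function(child);

    return 0;
}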

The gprof output for this is (on my machine):

  %   cumulative   self              self     total
 time   seconds   seconds    calls  Ts/call  Ts/call  name
  0.00      0.00     0.00        1     0.00     0.00  quick_function

If you actually want to profile the child processes/threads, I'd suggest this as a starting point.

捂风挽笑 2024-08-23 08:41:41

There appears to be an option for recording forked processes; this IBM article discusses it a little.

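The same article suggests trying tprof, which is similar to gprof in usage but takes a different approach under the hood that can give a more accurate picture of multi-process/multi-threaded applications.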

冬天的雪花 2024-08-23 08:41:41

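gprof only counts actual CPU time in the process. What works better is something that samples the call stack, and samples on wall-clock time rather than CPU time. Of course, samples should not be taken while waiting for user input (or, if they are, they should be discarded). Some profilers can do all of this, such as RotateRight/Zoom, or you can use pstack or lsstack, but here's a simple way to do it.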