当前位置：文江博客话题详情

Linux performance profiling

在 Linux 上分析程序的速度

发布于 2024-07-07 11:09:58 字数 510 浏览 9 评论 0 原文

我有一个程序的几个变体，我想比较它们的性能。两者执行的任务基本相同。

一切都用 C 语言和内存来完成。另一个调用外部实用程序并执行文件 IO。

我如何可靠地比较它们？

1) 使用“time”获取“CPU 时间”有利于调用 system() 和执行 IO 的第二种变体。即使我将“系统”时间添加到“用户”时间，它仍然不会计入 wait() 上阻塞的时间。

2）我不能只给它们计时，因为它们在服务器上运行并且可以随时从CPU上移走。对 1000 次实验进行平均是一个软选项，因为我不知道如何利用我的服务器 - 它是集群上的虚拟机，有点复杂。

3）分析器没有帮助，因为它们会给我在代码中花费的时间，这再次有利于执行 system() 的版本

我需要将这些程序消耗的所有 CPU 时间相加，包括用户、内核、IO 和子进程递归地。

我预计这是一个常见问题，但似乎仍然没有找到解决方案。

（用 times() 解决 - 见下文。谢谢大家）

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

背叛残局 2024-07-14 11:09:58

如果我明白的话，在 bash 命令行上输入“time myapplication”并不是您想要的。

如果你想要准确性，你必须使用分析器......你有来源，是吗？

尝试类似 Oprofile 或 Valgrind，或者看看这个了解更多信息扩展列表。

如果你没有来源，老实说我不知道......

回复收藏 0 原文

残疾 2024-07-14 11:09:58

/usr/bin/time（不是 bash 中内置的“时间”）可以提供一些有趣的统计数据。

$ /usr/bin/time -v xeyes
    Command being timed: "xeyes"
    User time (seconds): 0.00
    System time (seconds): 0.01
    Percent of CPU this job got: 0%
    Elapsed (wall clock) time (h:mm:ss or m:ss): 0:04.57
    Average shared text size (kbytes): 0
    Average unshared data size (kbytes): 0
    Average stack size (kbytes): 0
    Average total size (kbytes): 0
    Maximum resident set size (kbytes): 0
    Average resident set size (kbytes): 0
    Major (requiring I/O) page faults: 9
    Minor (reclaiming a frame) page faults: 517
    Voluntary context switches: 243
    Involuntary context switches: 0
    Swaps: 0
    File system inputs: 1072
    File system outputs: 0
    Socket messages sent: 0
    Socket messages received: 0
    Signals delivered: 0
    Page size (bytes): 4096
    Exit status: 0

/usr/bin/time (not built-in "time" in bash) can give some interesting stats.

$ /usr/bin/time -v xeyes
    Command being timed: "xeyes"
    User time (seconds): 0.00
    System time (seconds): 0.01
    Percent of CPU this job got: 0%
    Elapsed (wall clock) time (h:mm:ss or m:ss): 0:04.57
    Average shared text size (kbytes): 0
    Average unshared data size (kbytes): 0
    Average stack size (kbytes): 0
    Average total size (kbytes): 0
    Maximum resident set size (kbytes): 0
    Average resident set size (kbytes): 0
    Major (requiring I/O) page faults: 9
    Minor (reclaiming a frame) page faults: 517
    Voluntary context switches: 243
    Involuntary context switches: 0
    Swaps: 0
    File system inputs: 1072
    File system outputs: 0
    Socket messages sent: 0
    Socket messages received: 0
    Signals delivered: 0
    Page size (bytes): 4096
    Exit status: 0

回复收藏 0 原文

早乙女 2024-07-14 11:09:58

运行它们一千次，测量实际花费的时间，然后对结果进行平均。这应该可以消除由于服务器上运行的其他应用程序而导致的任何差异。

回复收藏 0 原文

不奢求什么 2024-07-14 11:09:58

我好像终于找到了。

姓名
times - 获取进程时间

概要
#include

   clock_t times(struct tms *buf);

描述
times() 将当前进程时间存储在 buf 的 tms 结构体中
指着。 struct tms 的定义如下：

   struct tms {
          clock_t tms_utime;  /* user time */
          clock_t tms_stime;  /* system time */
          clock_t tms_cutime; /* user time of children */
          clock_t tms_cstime; /* system time of children */
   };

子进程的时间是所有等待子进程的递归和。

我想知道为什么它还没有成为标准 CLI 实用程序。或者也许我只是无知。

I seem to have found it at last.

NAME
times - get process times

SYNOPSIS
#include

   clock_t times(struct tms *buf);

DESCRIPTION
times() stores the current process times in the struct tms that buf
points to. The struct tms is as defined in :

   struct tms {
          clock_t tms_utime;  /* user time */
          clock_t tms_stime;  /* system time */
          clock_t tms_cutime; /* user time of children */
          clock_t tms_cstime; /* system time of children */
   };

The children's times are a recursive sum of all waited-for children.

I wonder why it hasn't been made a standard CLI utility yet. Or may be I'm just ignorant.

回复收藏 0 原文