What (OS X) dtrace probe fires when a page is faulted in from disk?

Posted 2024-09-11 20:59:25

I'm writing up a document about page faulting and am trying to get some concrete numbers to work with, so I wrote up a simple program that reads 12*1024*1024 bytes of data. Easy:

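#include <stdio.h>

int main(void)
{
  FILE *in = fopen("data.bin", "rb");
  int i;
  int total = 0;
  if (in == NULL) { perror("data.bin"); return 1; }
  /* Read the file one byte at a time: 12*1024*1024 fgetc() calls. */
  for (i = 0; i < 1024*1024*12; i++)
    total += fgetc(in);
  fclose(in);
  printf("%d\n", total);
  return 0;
}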

So yes, it goes through and reads the entire file. What I need is the dtrace probe that fires 1536 times during this run (12M of data / 8k per page). Even when I count all of the fbt:mach_kernel:vm_fault*: probes and all of the vminfo::: probes, I don't reach 500, so I know I'm not looking at the right probes.

Anyone know where I can find the dtrace probes that fire when a page is faulted in from disk?

UPDATE:

On the off chance that the issue was that there was some intelligent pre-fetching going on in the stdio functions, I tried the following:

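#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

int main(void)
{
  int in = open("data.bin", O_RDONLY | O_NONBLOCK);
  int i;
  int total = 0;
  char buf[128];
  if (in < 0) { perror("data.bin"); return 1; }
  /* One read() system call per byte, bypassing stdio buffering. */
  for (i = 0; i < 1024*1024*12; i++)
  {
    read(in, buf, 1);
    total += buf[0];
  }
  close(in);
  printf("%d\n", total);
  return 0;
}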

This version takes MUCH longer to run (42s real time, of which 10s was user time and the rest was system time; page faults, I'm guessing), but it still generates only one fifth as many faults as I would expect.

For the curious, the time increase is not due to loop overhead or casting (char to int). A version of the code that does only those operations takes 0.07 seconds.
