如何在 ubuntu 中分析 TLB 命中和 TLB 未命中

发布于 2025-01-05 10:48:31 字数 1276 浏览 1 评论 0原文

我编写了一个简单的 C++ 程序，使用 for 循环打印从 1 到 100 的数字。我想找到特定程序在运行时发生的 TLB 命中和未命中的数量。有没有可能获得这些数据？

我正在使用Ubuntu。我使用过 perf 工具。但它在不同的时期产生不同的结果。我很困惑我的代码的哪一部分导致了如此大量的 TLB 命中、TLB 未命中和缓存未命中。

当然，可能还有其他进程同时运行，例如 Ubuntu GUI。但是，这个结果是否也包含了那些过程呢？我使用的命令： perf stat -e dTLB-loads -e dTPerformance counter stats for './hellocc':

结果：第一次 -

       909,822 dTLB-loads                                                  
         2,023 dTLB-misses               #    0.22% of all dTLB cache hits 
         4,512 cache-misses                                                

   0.006821182 seconds time elapsed

LB-misses ./hellocc

结果：第二次 - Performance counter stats for './hellocc' ：

       907,810 dTLB-loads                                                  
         2,045 dTLB-misses               #    0.23% of all dTLB cache hits 
         4,533 cache-misses                                                

   0.006780635 seconds time elapsed

我的简单代码：

#include <iostream>    
using namespace std;    
int main
{    
    cout << "hello" << "\n";    
    for(int i=1; i <= 100; i = i + 1)    
        cout<< i << "\t" ;    
    return 0;    
}

原文

I have written a simple C++ program using for-loop to print the numbers from 1 to 100. I want to find the number of TLB hits and misses occurring for the particular program while running. Is there any possibility to get this data?

I am using Ubuntu. I have used perf tool. But it is producing different result in different times. I am very confused what part of my code is leading to such a huge number TLB hits, TLB misses and cache misses.

Ofcourse there might be other processes running simultaneously like Ubuntu GUI. But, does this result includes those process too?
command I used: perf stat -e dTLB-loads -e dTPerformance counter stats for './hellocc':

result: first time--

       909,822 dTLB-loads                                                  
         2,023 dTLB-misses               #    0.22% of all dTLB cache hits 
         4,512 cache-misses                                                

   0.006821182 seconds time elapsed

LB-misses ./hellocc

result: Second time-- Performance counter stats for './hellocc':

       907,810 dTLB-loads                                                  
         2,045 dTLB-misses               #    0.23% of all dTLB cache hits 
         4,533 cache-misses                                                

   0.006780635 seconds time elapsed

My simple code:

#include <iostream>    
using namespace std;    
int main
{    
    cout << "hello" << "\n";    
    for(int i=1; i <= 100; i = i + 1)    
        cout<< i << "\t" ;    
    return 0;    
}

分享到QQ

分享到微博