
Measuring the memory usage of a process on Linux

Posted on 2024-08-14 05:06:41

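I'm trying to measure the memory usage of a process (a Java program) on Linux, and I have two related questions:

1. I tried the script ps_mem.py (which sums values from /proc/$PID/smaps), and the peak total memory usage was about 135MB (private plus shared memory), of which less than 1MB was shared. Running Valgrind with the Massif tool, valgrind --tool=massif --trace-children=yes --stacks=yes java myProgram, reported about 10MB at peak memory usage. As I understand it, the heap is where a program's variables are stored; does that mean the difference between the two methods is the space taken up by the code itself (including the JVM)?
2. Does the same program use a different amount of memory on different machines if they have different amounts of RAM and/or different processors (ARM or x86)?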
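For reference, summing fields of /proc/$PID/smaps can be sketched in a few lines of awk. This is only a minimal sketch of the general idea, not ps_mem.py's actual logic; the function name smaps_totals is a hypothetical helper, and the function takes the smaps path as an argument so it can also be run on a saved copy.

```shell
# Sum selected per-mapping fields of an smaps file (all values are in kB).
# Usage: smaps_totals /proc/<pid>/smaps
# smaps_totals is a hypothetical helper name, not part of ps_mem.py.
smaps_totals() {
    awk '
        /^Rss:/           { rss  += $2 }   # resident pages of this mapping
        /^Pss:/           { pss  += $2 }   # proportional share of resident pages
        /^Private_Clean:/ { priv += $2 }
        /^Private_Dirty:/ { priv += $2 }
        /^Shared_Clean:/  { shr  += $2 }
        /^Shared_Dirty:/  { shr  += $2 }
        END {
            printf "rss=%d pss=%d private=%d shared=%d\n", rss, pss, priv, shr
        }
    ' "$1"
}
```

Calling smaps_totals /proc/$$/smaps prints one line of totals for the current shell. Massif, by contrast, tracks heap allocations (plus stacks with --stacks=yes), so the two tools are not measuring the same set of pages.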

可可 2024-08-21 05:06:42

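For #1: it depends.
- Many of the shared memory mappings in smaps are backed directly by libraries/binaries on disk. The footprint of these pages does matter, but less so, because the system can drop them at any time and reload them from disk when they are needed again.
- Anything dirty or private belongs only to the current process (or to the process tree, if your program forks without exec'ing). This matters more, because if the system needs to push these pages out of memory, it has to save them to swap.
- What Massif measures is probably related to the latter. However, the memory taken by the JVM itself (not counting your program) shows up in both.
For #2: yes. Java, or the libraries it uses, may tune its memory model based on the amount of RAM available. On a different architecture you are running entirely different binaries, which may be larger or smaller, or laid out differently, or use different JIT and memory management strategies.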
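One way to observe the RAM-dependent tuning on a HotSpot JVM is to print its final flag values with -XX:+PrintFlagsFinal and compare them across machines. The sketch below only extracts one flag from that output; jvm_flag_value is a hypothetical helper name, and the column layout it relies on (type, name, "=" or ":=", value) is an assumption about HotSpot's output format that other JVMs may not share.

```shell
# Print the value of one flag from `java -XX:+PrintFlagsFinal -version`
# output read on stdin. jvm_flag_value is a hypothetical helper name.
# Assumes HotSpot's column layout: type, name, "=" or ":=", value, ...
jvm_flag_value() {
    awk -v flag="$1" '$2 == flag { print $4 }'
}
```

For example, java -XX:+PrintFlagsFinal -version | jvm_flag_value MaxHeapSize prints the maximum heap size in bytes. When no explicit heap limit is given, this value is typically derived from the machine's physical memory, so it would be expected to differ between machines with different amounts of RAM.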

番薯 2024-08-21 05:06:42

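There is a similar question elsewhere, and I am giving the same answer here to let people know that the VM information Linux currently reports under /proc is not accurate.
Valgrind can show detailed information, but it slows the target application down significantly, and most of the time it changes the behavior of the application.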

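I assume that what everyone wants to know with regard to "memory usage" is the following.
On Linux, the amount of physical memory a single process may use can be roughly divided into the following categories.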

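M.a anonymous mapped memory
- .p private
  - .d dirty == malloc/mmapped heap and stack memory that has been allocated and written
  - .c clean == malloc/mmapped heap and stack memory that was allocated, written, then freed, but not yet reclaimed
- .s shared
  - .d dirty == there should be none
  - .c clean == there should be none
M.n named mapped memory
- .p private
  - .d dirty == privately mapped file memory that has been written
  - .c clean == privately mapped program/library text
- .s shared
  - .d dirty == shared file-mapped memory that has been written
  - .c clean == shared mapped library text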

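I would prefer to get the numbers as follows, to get real numbers with the least overhead.
You have to sum these up to break down what ps shows as RSS into more accurate numbers and avoid confusion.
/proc/(pid)/status tries to show these numbers, but it falls short.
So instead of trying to label each mapping correctly as [anon] or [stack], my wish is
that the Linux kernel developers will mainline proc entry code that sums and shows these M.a.p.d, M.a.p.c, M.n.p.d, ... numbers.
Embedded Linux people would be really happy, IMHO.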

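M.a.p.d (sum of Private_Dirty over mappings without a pathname, in kB):

 awk '/^[0-9a-f]/{if ($6=="") {anon=1}else{anon=0}} /Private_Dirty/{if(anon) {asum+=$2}else{nasum+=$2}} END{printf "sum=%d\n",asum}' /proc/<pid>/smaps

M.a.p.c (sum of Private_Clean over mappings without a pathname, in kB):

 awk '/^[0-9a-f]/{if ($6=="") {anon=1}else{anon=0}} /Private_Clean/{if(anon) {asum+=$2}else{nasum+=$2}} END{printf "sum=%d\n",asum}' /proc/<pid>/smaps

M.n.p.d: the same command as M.a.p.d, printing nasum instead of asum, and so on for the remaining categories.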
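The remaining categories can be collected in a single pass over smaps. The following is a sketch under the same assumption as the one-liners above, namely that a mapping whose header line has no sixth field is anonymous; smaps_breakdown is a hypothetical helper name. Note that under this test, mappings labeled [heap] or [stack] are counted as named, exactly as in the one-liners; changing the test to also accept a sixth field starting with "[" would count them as anonymous, at the cost of including special mappings such as [vdso].

```shell
# Break resident memory into the eight M.{a,n}.{p,s}.{d,c} buckets in one pass.
# Usage: smaps_breakdown /proc/<pid>/smaps   (values are in kB)
# smaps_breakdown is a hypothetical helper name.
smaps_breakdown() {
    awk '
        # Mapping header line: no pathname (empty 6th field) means anonymous.
        /^[0-9a-f]+-[0-9a-f]+ / { kind = ($6 == "") ? "a" : "n" }
        /^Private_Dirty:/ { sum[kind ".p.d"] += $2 }
        /^Private_Clean:/ { sum[kind ".p.c"] += $2 }
        /^Shared_Dirty:/  { sum[kind ".s.d"] += $2 }
        /^Shared_Clean:/  { sum[kind ".s.c"] += $2 }
        END {
            # Print the buckets in a fixed order; missing buckets print as 0.
            n = split("a.p.d a.p.c a.s.d a.s.c n.p.d n.p.c n.s.d n.s.c", keys, " ")
            for (i = 1; i <= n; i++)
                printf "M.%s=%d\n", keys[i], sum[keys[i]]
        }
    ' "$1"
}
```

Running smaps_breakdown /proc/$$/smaps prints eight lines, one per category, for the current shell. Reading smaps of another user's process generally requires matching privileges.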

下壹個目標 2024-08-21 05:06:42

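For #1, shared memory is memory that is (potentially) used by more than one process. This is basically the case when you run the same binary in several processes, or when different processes use the same shared library. The heap is where allocated memory lives (what you get when you use new in Java). Since Java has its own VM, it allocates a lot of memory at the process level that you never see in your Java code. So I think yes, most of those 135 MB come from the JVM code/data itself. However, the stack also takes memory (when you make function calls and have local variables).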

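For #2, if we take "memory" to mean RAM + swap space, a different amount of RAM will not affect how much memory is used. However, a different processor (especially 32-bit versus 64-bit) may use a different amount of memory. How the process was compiled can also change the amount of memory used, since you can tell the compiler to optimize for memory footprint over speed, or to disable some or all optimizations entirely.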
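To look at "RAM + swap" for a single process, the VmRSS and VmSwap fields of /proc/<pid>/status can be added together. This is a minimal sketch; rss_plus_swap is a hypothetical helper name, and it reports only what the kernel exposes in those two fields.

```shell
# Report resident (RAM) and swapped-out memory of a process from a status file.
# Usage: rss_plus_swap /proc/<pid>/status   (values are in kB)
# rss_plus_swap is a hypothetical helper name.
rss_plus_swap() {
    awk '
        /^VmRSS:/  { rss  = $2 }   # pages currently resident in RAM
        /^VmSwap:/ { swap = $2 }   # pages currently swapped out
        END { printf "rss=%d swap=%d total=%d\n", rss, swap, rss + swap }
    ' "$1"
}
```

Comparing the total on machines with different amounts of RAM shows whether the process really uses the same amount overall, with only the split between RAM and swap changing.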
