What is a good heuristic for inlining functions?

Posted 2024-08-19 05:27:33 · 107 words · 17 views · 0 comments

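Assuming you are optimizing purely for speed, what is a good heuristic for deciding whether to inline a function? Code size is obviously important, but are there other factors that compilers such as gcc or icc typically consider when deciding whether to inline a function call? Has there been any significant academic work in this area?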


世界如花海般美丽 2024-08-26 05:27:33

Wikipedia has a few paragraphs about this, with some links at the bottom:

In addition to memory size and cache issues, another consideration is register pressure. From the compiler's point of view "the added variables from the inlined procedure may consume additional registers, and in an area where register pressure is already high this may force spilling, which causes additional RAM accesses."
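To make the register-pressure concern concrete, here is a deliberately crude sketch. The function name `estimated_spills` and the counting model are assumptions for illustration only; real register allocators work on live ranges and interference graphs, and inlining can also remove the save/restore cost of a call, which this toy ignores.

```python
def estimated_spills(live_at_call_site: int, callee_locals: int, num_registers: int) -> int:
    """Toy estimate of how many values no longer fit in registers after inlining.

    ASSUMPTION (not from the source): every live value at the call site and
    every callee local needs its own register for the whole inlined region.
    """
    needed = live_at_call_site + callee_locals
    return max(0, needed - num_registers)


# With 16 registers, a call site that already keeps 14 values live
# cannot absorb 4 more callee locals without spilling 2 of them.
print(estimated_spills(live_at_call_site=14, callee_locals=4, num_registers=16))
# The same callee inlined where only 10 values are live fits without spilling.
print(estimated_spills(live_at_call_site=10, callee_locals=4, num_registers=16))
```

Under this model the same callee may be worth inlining at one call site and not at another, which is why the decision is usually made per call site rather than per function.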

Languages with JIT compilers and runtime class loading have other tradeoffs since the virtual methods aren't known statically, yet the JIT can collect runtime profiling information, such as method call frequency:

Design, Implementation, and Evaluation of Optimizations in a Just-in-Time Compiler (for Java) discusses method inlining for static methods and dynamically loaded classes, and the performance improvement it yields.
Practicing JUDO: Java Under Dynamic Optimizations states that its "inlining policy is based on the code size and profiling information. If the execution frequency of a method entry is below a certain threshold, the method is then not inlined because it is regarded as a cold method. To avoid code explosion, we do not inline a method with a bytecode size of more than 25 bytes. . . . To avoid inlining along a deep call chain, inlining stops when the accumulated inlined bytecode size along the call chain exceeds 40 bytes." Although the authors have runtime profiling information (method call frequency), they are still careful to avoid inlining large functions or chains of functions, in order to prevent code bloat.
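The policy quoted from the JUDO paper can be read as three checks applied in order. The sketch below is one possible reading, not the paper's implementation: the 25-byte and 40-byte limits come from the quote, while `HOT_ENTRY_THRESHOLD`, the `Callee` type, and the function name `should_inline` are hypothetical, and the quote does not say whether the 40-byte limit is tested before or after adding the candidate method.

```python
from dataclasses import dataclass

# Limits taken from the quoted excerpt.
MAX_CALLEE_BYTECODE = 25   # a single method larger than this is never inlined
MAX_CHAIN_BYTECODE = 40    # cap on bytecode inlined along one call chain

# ASSUMPTION: the excerpt only says "a certain threshold"; this value is invented.
HOT_ENTRY_THRESHOLD = 1000


@dataclass
class Callee:
    name: str
    bytecode_size: int   # size of the method body in bytes
    entry_count: int     # profiled number of times the method was entered


def should_inline(callee: Callee, accumulated_size: int) -> bool:
    """Return True if `callee` passes all three checks.

    `accumulated_size` is the bytecode already inlined along this call chain.
    """
    if callee.entry_count < HOT_ENTRY_THRESHOLD:
        return False  # cold method: profiling says it is rarely entered
    if callee.bytecode_size > MAX_CALLEE_BYTECODE:
        return False  # too large on its own: avoids code explosion
    # ASSUMPTION: the limit is checked against the size after adding this callee.
    if accumulated_size + callee.bytecode_size > MAX_CHAIN_BYTECODE:
        return False  # call chain is already carrying too much inlined code
    return True


print(should_inline(Callee("getter", 10, 5000), accumulated_size=0))
print(should_inline(Callee("rarely_called", 10, 5), accumulated_size=0))
print(should_inline(Callee("deep_in_chain", 20, 5000), accumulated_size=25))
```

Note that the size checks apply even to hot methods, which matches the observation above that profiling data does not replace the code-size limits.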

A search on Google Scholar reveals a number of papers, such as

A search on Google Books reveals quite a number of books with papers or chapters about function inlining in various contexts.

The Compiler Design Handbook: Optimizations and Machine Code Generation has a chapter on Statistical and Machine Learning Techniques in Compiler Design, which covers heuristics for setting various parameters and profiling the results. This chapter references the Vaswani et al. paper Microarchitecture Sensitive Empirical Models for Compiler Optimizations, in which the authors propose "the use of empirical modeling techniques for building microarchitecture sensitive models for compiler optimizations".
(Some other books talk about inlining from the programmer's point of view, such as C++ for Game Programmers, which covers the dangers of inlining functions too often and the differences between inlining and macros. Compilers often ignore the programmer's inline requests if they can determine that inlining would do more harm than good; this can be overridden with macros as a last resort.)
