当前位置：文江博客话题详情

有运行时代码修改的聪明案例吗？

发布于 2024-10-29 08:49:07 字数 147 浏览 14 评论 0原文

您能想到运行时代码修改（程序在运行时修改自己的代码）的任何合法（智能）用途吗？

现代操作系统似乎不赞成执行此操作的程序，因为病毒已使用此技术来避免检测。

我能想到的就是某种运行时优化，通过在运行时了解一些在编译时无法知道的东西来删除或添加一些代码。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

梦醒时光 2024-11-05 08:49:07

代码修改有很多有效的案例。在运行时生成代码可用于：

某些虚拟机使用JIT 编译来提高性能。
动态生成专用函数在计算机图形学中早已很常见。例如，请参见 Rob Pike、Bart Locanthi 和 John Reiser Blit 上位图图形的硬件软件权衡 (1984)或者 Chris Lattner 的帖子 (2006) 关于 Apple 在运行时使用 LLVM OpenGL 堆栈中的代码专业化。
在某些情况下，软件会采用一种称为“蹦床”的技术，该技术涉及在堆栈（或其他位置）上动态创建代码。例如 GCC 的嵌套函数和一些Unices的信号机制。

有时代码会在运行时转换为代码（这称为动态二进制翻译）：

模拟器，例如Apple 的 Rosetta 使用此技术来加速仿真。另一个例子是 Transmeta 的代码变形软件。
复杂的调试器和分析器，例如Valgrind或Pin 使用它在执行时检测您的代码。
在对 x86 指令集进行扩展之前，像 VMWare 这样的虚拟化软件无法直接在虚拟机内运行特权 x86 代码。相反，它必须即时将任何有问题的指令翻译为更合适的自定义代码。

代码修改可用于解决指令集的限制：

曾经有一段时间（我知道很久以前），计算机没有指令从子例程返回或间接寻址内存。自修改代码是实现子例程、指针和数组的唯一方法。

更多代码修改情况：

许多调试器替换指令来实现断点。
一些动态链接器在运行时修改代码。本文提供了有关 Windows DLL 运行时重定位的一些背景知识，Windows DLL 实际上是一种代码形式修改。

回复收藏 0 原文

意犹 2024-11-05 08:49:07

这已经在计算机图形学中完成，特别是用于优化目的的软件渲染器。在运行时，会检查许多参数的状态，并生成光栅化器代码的优化版本（可能消除大量条件），这允许更快地渲染图形基元（例如三角形）。

回复收藏 0 原文

秋心╮凉 2024-11-05 08:49:07

一个合理的原因是 asm 指令集缺少一些必要的指令，您可以自己构建这些指令。示例：在 x86 上，无法为寄存器中的变量创建中断（例如，使用 ax 中的中断号创建中断）。仅允许编码到操作码中的常量数字。通过自我修改代码，我们可以模拟这种行为。

回复收藏 0 原文

別甾虛僞 2024-11-05 08:49:07

一些编译器过去使用它进行静态变量初始化，避免了后续访问的条件成本。换句话说，他们通过在第一次执行时用空操作覆盖该代码来实现“仅执行该代码一次”。

回复收藏 0 原文

如若梦似彩虹 2024-11-05 08:49:07

有很多情况：

病毒通常使用自修改代码在执行之前“反混淆”其代码，但该技术也可用于阻止逆向工程、破解和不需要的黑客攻击
在某些情况下，运行时可能存在特定点（例如，在读取配置文件后立即）当知道 - 在进程的剩余生命周期中 - 将始终或永远不会采用特定分支时：而不是不必要地检查某些变量来确定分支的方式，分支指令本身可以相应修改
- 例如，我们可能会知道，只会处理一种可能的派生类型，这样虚拟调度就可以替换为特定的调用
- 检测到哪些硬件可用后，可以对匹配代码的使用进行硬编码
不必要的代码进行硬编码。用无操作指令替换或跳过它，或者将下一位代码直接移位到位（如果使用与位置无关的操作码则更容易）
为促进其自身调试而编写的代码可能会注入预期的陷阱/信号/中断指令由调试器在战略位置进行。
一些基于用户输入的谓词表达式可能会被库编译为本机代码内
联一些直到运行时才可见的简单操作（例如从动态加载的库）...
有条件地添加自我检测/分析步骤
裂缝可以实现为修改加载它们的代码的库（不完全是“自我”修改，但需要相同的技术和权限）。
...

某些操作系统的安全模型意味着自修改代码在没有 root/admin 权限的情况下无法运行，这使得它对于通用用途来说不切实际。

来自维基百科：

在具有严格 W^X 安全性的操作系统下运行的应用程序软件无法执行允许写入的页面中的指令，只有操作系统本身才允许将指令写入内存并随后执行这些指令。

在此类操作系统上，即使是像 Java VM 这样的程序也需要 root/admin 权限才能执行其 JIT 代码。（有关更多信息，请参阅http://en.wikipedia.org/wiki/W%5EX细节）

There are many cases:

Viruses commonly used self-modifying code to "deobfuscate" their code prior to execution, but that technique can also be useful in frustrating reverse engineering, cracking and unwanted hackery
In some cases, there can be a particular point during runtime (e.g. immediately after reading the config file) when it is known that - for the rest of the lifetime of the process - a particular branch will always or never be taken: rather than needlessly checking some variable to determine which way to branch, the branch instruction itself could be modified accordingly
- e.g. It may become known that only one of the possible derived types will be handled, such that virtual dispatch can be replaced with a specific call
- Having detected which hardware is available, use of a matching code may be hardcoded
Unnecessary code can be replaced with no-op instructions or a jump over it, or have the next bit of code shifted directly into place (easier if using position-independent opcodes)
Code written to facilitate its own debugging might inject a trap/signal/interrupt instruction expected by the debugger at a strategic location.
Some predicate expressions based on user input might be compiled into native code by a library
Inlining some simple operations that aren't visible until runtime (e.g. from dynamically loaded library)...
Conditionally adding self-instrumentation/profiling steps
Cracks may be implemented as libraries that modify the code that loads them (not "self" modifying exactly, but needs the same techniques and permissions).
...

Some OSs' security models mean self-modifying code can't run without root/admin privileges, making it impractical for general-purpose use.

From Wikipedia:

Application software running under an operating system with strict W^X security cannot execute instructions in pages it is allowed to write to—only the operating system itself is allowed to both write instructions to memory and later execute those instructions.

On such OSes, even programs like the Java VM need root/admin privileges to execute their JIT code. (See http://en.wikipedia.org/wiki/W%5EX for more details)

回复收藏 0 原文

温暖的光 2024-11-05 08:49:07

综合操作系统基本上根据 API 调用部分评估了您的程序，并用结果替换了操作系统代码。主要好处是大量的错误检查消失了（因为如果你的程序不会要求操作系统做一些愚蠢的事情，它就不需要检查）。

是的，这是运行时优化的一个例子。

回复收藏 0 原文

缱绻入梦 2024-11-05 08:49:07

许多年前，我花了一个早上尝试调试一些自修改代码，一条指令改变了下一条指令的目标地址，即我正在计算分支地址。它是用汇编语言编写的，当我一次执行一条指令时，它运行得很好。但是当我运行该程序时它失败了。最终，我意识到机器正在从内存中获取 2 条指令，并且（因为指令已放置在内存中）我正在修改的指令已经被获取，因此机器正在执行该指令的未修改（不正确）版本。当然，当我调试时，一次只执行一条指令。

我的观点是，自修改代码的测试/调试可能非常麻烦，并且通常对机器的行为（无论是硬件还是虚拟）有隐藏的假设。此外，系统永远无法在（现在的）多核机器上执行的各个线程/进程之间共享代码页。这抵消了虚拟内存等的许多好处。它还会使在硬件级别完成的分支优化失效。

（注意 - 我没有将 JIT 包含在自修改代码的类别中。JIT 正在从代码的一种表示形式转换为另一种表示形式，它不会修改代码）

总而言之，这只是一个坏主意 - 真的整洁，确实晦涩，但真的很糟糕。

当然，如果您只有 8080 和 ~512 字节的内存，您可能不得不采取这种做法。

回复收藏 0 原文

半枫 2024-11-05 08:49:07

从操作系统内核的角度来看，每个即时编译器和链接器运行时都会执行程序文本自我修改。突出的例子是 Google 的 V8 ECMA 脚本解释器。

回复收藏 0 原文

眼眸里的那抹悲凉 2024-11-05 08:49:07

自修改代码（实际上是“自生成”代码）的另一个原因是为了实现性能的即时编译机制。例如，读取代数表达式并根据一系列输入参数进行计算的程序可以在说明计算之前将表达式转换为机器代码。

回复收藏 0 原文

半枫 2024-11-05 08:49:07

你知道老栗子，硬件和软件之间没有逻辑区别……也可以说代码和数据之间没有逻辑区别。

什么是自修改代码？将值放入执行流中的代码，以便可以将其解释为命令而不是数据。当然，函数式语言中的理论观点认为实际上没有区别。我是说 on e 可以在命令式语言和编译器/解释器中以简单的方式做到这一点，而无需假定平等地位。

我指的是实际意义上的数据可以改变程序执行路径（在某种意义上这是非常明显的）。我正在考虑类似编译器的东西，它创建一个表（数据数组），在解析时遍历该表，从一个状态移动到另一个状态（并且还修改其他变量），就像程序如何从一个命令移动到另一个命令一样，修改过程中的变量。

因此，即使在编译器创建代码空间并引用完全独立的数据空间（堆）的常见情况下，人们仍然可以修改数据以显式更改执行路径。

回复收藏 0 原文

墟烟 2024-11-05 08:49:07

我已经实现了一个使用进化来创建最佳算法的程序。它使用自修改代码来修改DNA蓝图。

回复收藏 0 原文

遗心遗梦遗幸福 2024-11-05 08:49:07

一个用例是 EICAR 测试文件，它是一个合法的 DOS 可执行 COM 文件，用于测试防病毒程序。

X5O!P%@AP[4\PZX54(P^)7CC)7}$EICAR-STANDARD-ANTIVIRUS-TEST-FILE!$H+H*

它必须使用自代码修改，因为可执行文件必须仅包含 [21h-60h, 7Bh-7Dh] 范围内的可打印/可键入 ASCII 字符，这显着限制了可编码指令的数量。详细

信息解释

它也用于 floating> floating> floating> floating>在 DOS 中，

某些编译器会发出 CD xx，其中 xx 的范围从 0x34-0x3B 代替 x87 浮点指令。由于cd是int指令的操作代码，因此，如果没有X87协处理器，它将跳入中断34h-3bh并在软件中模仿该说明。否则，中断处理程序将用9B DX替换这两个字节，以便以后的执行将通过X87直接处理而无需仿真。

MS-DOS中X87浮点仿真的协议是什么？

另一种用法是在在在 在期间在期间优化代码Runtime

例如，在没有可变位移动的体系结构上（或者当它们非常慢）可以是模拟常量移位当移位计数提前已知时，通过在控制到达该指令之前以及加载高速缓存运行之前更改指令中包含移位计数的立即字段，

它也可以用于将函数调用更改为当不同（微）架构有多个版本时，最优化的版本。例如，您有用标量、SSE2、AVX、AVX-512...编写的相同函数，并且根据当前的 CPU，您将选择最好的一个。可以使用Code Dispatcher在启动时设置的功能指针轻松完成，但是您还有一个间接级别，这对CPU不利。一些编译器支持函数多取代自动编译到不同版本，然后在加载时，链接器将链接器将功能固定在所需的函数地址上那些。但是，如果您没有编译器和链接器支持，并且您也不希望间接呢？只需在启动时修改呼叫指令，而不是更改功能指针。现在呼叫都是静态的，可以通过CPU正确预测

One use case is the EICAR test file which is a legitimate DOS executable COM file for testing antivirus programs.

X5O!P%@AP[4\PZX54(P^)7CC)7}$EICAR-STANDARD-ANTIVIRUS-TEST-FILE!$H+H*

It has to use self code modification because the executable file must contain only printable/typeable ASCII characters in the range [21h-60h, 7Bh-7Dh] which limits the number of encodable instructions significantly

The details are explained here

It's also used for floating-point operation dispatching in DOS

Some compilers will emit CD xx with xx ranging from 0x34-0x3B in places of x87 floating-point instructions. Since CD is the opcode for int instruction, it'll jump into the interrupt 34h-3Bh and emulate that instruction in software if the x87 coprocessor is not available. Otherwise the interrupt handler will replace those 2 bytes with 9B Dx so that later executions will be handled directly by x87 without emulation.

What is the protocol for x87 floating point emulation in MS-DOS?

Another usage is to optimize code during runtime

For example on an architecture without variable bit shifts (or when they're very slow) then they can be emulated using only constant shifts when the shift count is known far in advance by changing the immediate field containing the shift count in the instruction before control reaches that instruction and before the cache is loaded for running

It can also be used to change function calls to the most optimized version when there are multiple versions for different (micro-)architectures. For example you have the same function written in scalar, SSE2, AVX, AVX-512... and depending on the current CPU you'll choose the best one. It can be done easily using function pointers which are set at startup by the code dispatcher, but then you have one more level of indirection which is bad for the CPU. Some compilers support function multiversioning which automatically compiles to different versions, then at load time the linker will fix the function addresses to the desired ones. But what if you don't have compiler and linker support, and you don't want the indirection either? Just modify the call instructions yourself at startup instead of changing the function pointers. Now the calls are all static and can be predicted correctly by the CPU

回复收藏 0 原文