预处理器和编译器之间的边界到底在哪里？

发布于 2024-12-01 02:39:28 字数 338 浏览 2 评论 0原文

根据各种来源（例如，SE Kevlin Henney 的广播节目，如果我没记错的话），“带有类的 C”是通过预处理器技术实现的（然后输出被馈送到 C 编译器），而 C++ 始终是通过编译器实现的（即只是正好早期吐出了C）。这似乎引起了一些混乱，所以我想知道：

预处理器和编译器之间的边界到底在哪里？什么时候将实现某种语言的软件称为“预处理器”，什么时候将其称为“编译器”？

顺便问一下，“编译语言”是一个既定术语吗？如果是这样，这到底意味着什么？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

吹梦到西洲 2024-12-08 02:39:29

编译器由多个进程（组件）组成。预处理器只是其中之一，也是相对最简单的一种。

从维基百科文章，编译器进程的划分：

除了最小的编译器之外，所有编译器都有两个以上的阶段。然而，
这些阶段通常被视为前端或
后端。这两端相交的点是开放的
辩论。
前端一般被认为是语法所在
进行语义处理，同时翻译成较低的语言
表示级别（比源代码）。
中间端通常是
旨在对源代码以外的表单执行优化
或机器代码。这种源代码/机器代码独立性是
旨在使通用优化能够在版本之间共享
支持不同语言和目标处理器的编译器。
后端从中间获取输出。它可能会执行更多
针对特定的分析、转换和优化
电脑。然后，它为特定处理器和操作系统生成代码。”

预处理只是前端工作的一小部分。

第一个通过在现有 C 编译器工具集前面附加附加进程而制成的 C++ 编译器，不是因为它设计得好，而是因为认为这种非原生 C++编译

器无法在商业领域生存。

如今，我 rel="nofollow">cfront 用于C++11 是不可能制作的。

回复收藏 0 原文

谁与争疯 2024-12-08 02:39:29

答案很简单。
预处理器将文本作为输入并以文本作为输出。例如，旧的 unix 命令 m4、cpp（C 预处理器）以及 roff、nroff 和 troff 等 unix 程序，它们用于（并且仍然）格式化手册页（unix 命令“man”）或格式化文本用于打印或排版。
预处理器非常简单，它们对它们处理的“文本语言”一无所知。换句话说，它们通常处理自然语言。 C 预处理器除了它的名称之外，例如仅识别#define、#include、#ifdef、#ifndef、#else 等，如果您使用#define MACRO，它会尝试在它找到的任何地方“扩展”该宏。但这不一定是 C 或 C++ 程序文本，它也可以是用意大利语或希腊语写的小说。
交叉编译成不同语言的编译器通常称为翻译器。因此，发出 C 代码的 C++ 旧 cfront“编译器”是一个 C++ 翻译器。
历史上一直使用预处理器和后来的翻译器，因为旧机器根本缺乏内存，无法在一个程序中完成所有操作，而是由专门的程序从一个磁盘到另一个磁盘完成。
典型的 C 程序可以从各种来源编译。构建过程将由 make 管理。如今，C 预处理器通常直接构建到 C/C++ 编译器中。典型的 make 运行会在 *.c 文件上调用 CPP 并将输出写入不同的目录，C 编译器 CC 会从那里将其直接编译为机器代码，或更常见的是会将汇编代码输出为文本。注意：C 编译器仅检查语法，它并不真正关心类型安全等。然后汇编器将采用该汇编器代码并输出一个 *.o 文件，稍后可以将其与其他 *.o 文件和 *.lib 链接文件转化为可执行程序。 OTOH，您可能有一个 make 规则，它不会调用 C 编译器，而是调用 lint 命令，即 C 语言分析器，它会查找典型的错误和错误（这些错误会被 C 编译器忽略）。
在维基百科（或使用 man 的机器终端）上查找有关 lint、nroff、troff、m4 等的信息非常有趣；D

The answer is pretty simple.
A preprocessor works on text as input and has text as output. Examples for that are the old unix commands m4, cpp (the C Pre Processor), and also unix programs like roff and nroff and troff which where used (and still are) to format man pages (unix command "man") or format text for printing or typesetting.
Preprocessors are very simple, they don't know anything about the "language of the text" they process. In other words they usually process natural languages. The C preprocessor besides its name, e.g. only recognizes #define, #include, #ifdef, #ifndef, #else etc. and if you use #define MACRO it tries to "expand" that macro everywhere it finds it. But that does not need to be C or C++ program text, it can as well be a novel written in italian or greek.
Compilers that cross compile into a different language are usually called translators. So the old cfront "compiler" for C++ which emitted C code was a C++ translator.
Preprocessors and later translators are historically used because old machines simply lacked memory to be able to do everything in one program, but instead it was done by specialized programs and from disk to disk.
A typical C program would be compiled from various sources. And the build process would be managed with make. In our days the C preprocessor is usually build directly into the C/C++ compiler. A typical make run would call the CPP on the *.c files and write the output to a different directory, from there either the C compiler CC would compile it straight to machine code or more commonly would output assembler code as text. Note: the c compiler only checks syntax, it does not really care about type safety etc. Then the assembler would take that assembler code and would output a *.o file wich later can be linked with other *.o files and *.lib files into an executable program. OTOH you likely had a make rule that would not call the C compiler but the lint command, the C language analyser, which is looking for typical mistakes and errors (which are ignored by the c compiler).
It is quite interesting to look up about lint, nroff, troff, m4 etc. on wikipedia (or your machines terminal using man) ;D

回复收藏 0 原文