当前位置：文江博客话题详情

Go 中没有符号表？

发布于 2024-08-11 12:23:43 字数 249 浏览 25 评论 0原文

Google 的新语言“Go”在其网站上这样说道：

该语言被设计为易于分析，无需符号表即可解析

我当然不是这些问题的专家，但我认为符号表是所有使用变量的语言和 Go 的编译器所共有的基本构造明确使用变量。我不明白什么？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

夜声 2024-08-18 12:23:43

解析意味着只是弄清楚程序结构：将模块分成语句/声明，将表达式分解为子表达式等。您最终会得到一个树结构，称为“解析树”或“抽象语法树”（谷草转氨酶）。

显然，C++需要符号表来进行解析。

本页讨论为什么 C++ 需要符号表进行解析的一些原因。

当然，解析只是编译的一部分，您将需要符号表来进行完整的编译。

然而，解析本身在编写分析工具时很有用（例如哪个模块导入哪个模块）。因此，简化解析过程意味着更容易编写代码分析工具。

回复收藏 0 原文

刘备忘录 2024-08-18 12:23:43

解释和编译绝对需要符号表或类似的东西。几乎所有语言都是如此。

在 C 和 C++ 中，甚至解析语言也需要符号表。

回复收藏 0 原文

美人骨 2024-08-18 12:23:43

@正义是对的。稍微扩展一下，在 C 中，唯一真正棘手的部分是区分类型和变量。特别是当您看到以下内容时：

T t;

您需要知道 T 是一种合法解析的类型。这是您必须在符号表中查找的内容。只要在解析继续时将类型添加到符号表中，就相对容易弄清楚。您不需要在编译器中做太多额外的工作：T 要么出现在表中，要么不出现。

在 C++ 中，事情要复杂得多。存在大量不明确或潜在不明确的结构。最明显的是这个：

B::C (c);

除了不清楚 B 是 class、typedef 还是 的事实之外命名空间，也不清楚 C 是否是一种类型，而 c 是该类型的对象，或者 C 是否是一个函数（或构造函数）将 c 作为参数（或者即使 C 是一个重载了 operator() 的对象）。您需要符号表来进行解析，尽管仍然可以足够快地继续解析，因为符号的类型位于符号表中。

当模板加入进来时，事情会变得更加糟糕。如果 C (c) 在模板中，您可能不知道在模板的实际定义中，C 是类型还是函数/对象。这是因为模板可以将 C 声明为类型或变量。这意味着您需要符号表，但您没有符号表，而且在模板实际声明之前您无法拥有符号表。更糟糕的是，仅仅拥有符号的类型还不够：您可能会遇到需要符号所代表的类型的完整信息的情况，包括大小、对齐方式和其他特定于机器的信息。

所有这些都有几个实际效果。我想说的最重要的两个是：

编译速度要快得多。我认为 Go 的编译速度比 C 更快，而 C++ 在涉及大量模板的情况下编译时间很慢。
您可以编写不依赖于完整编译器的解析器。这对于进行代码分析和重构非常有用。

@Justice is right. To expand on that a little, in C the only actual tricky part is telling types apart from variables. Specifically when you see this:

T t;

You need to know that T is a type for that to be a legal parse. That's something you have to look up in a symbol table. This is relatively simple to figure out as long as types are added to the symbol table as the parse continues. You don't need to do much extra work in the compiler: either T is present in the table or it isn't.

In C++ things are much, much more complicated. There are enormous numbers of ambiguous or potentially ambiguous constructs. The most obvious is this one:

B::C (c);

Aside from the fact that it's not clear if B is a class, a typedef, or a namespace, it's also not clear if C is a type and c an object of that type, or if C is a function (or constructor) taking c as an argument (or even if C is an object with operator() overloaded). You need the symbol table to carry on parsing, although it is still possible to continue quickly enough, as the type of the symbol is in the symbol table.

Things get much, much, much worse than that when templates come into the mix. If C (c) is in a template, you might not know in the actual definition of the template, if C is a type or a function/object. That's because the template can declare C to be either a type or a variable. What this means is that you need the symbol table, but you don't have one -- and you can't have one until the template is actually declared. Even worse, it's not necessarily sufficient to have just the type of the symbol: you can come up with situations which require the full information of the type the symbol represents, including size, alignment, and other machine-specific information.

All this has several practical effects. The two most significant I would say are:

Compilation is much faster. I assume Go is faster to compile than C, and C++ has famously slow compilation times for situations involving a lot of templates.
You can write parsers that don't depend on having a full compiler. This is very useful for doing code analysis and for refactoring.

回复收藏 0 原文