Walter Bright's use of the word "redundancy"... or "What does that mean?"

Posted on 2024-09-15 07:24:52

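So I was reading this interview with Walter Bright about the D language in Bitwise (http://www.bitwisemag.com/copy/programming/d/interview/d_programming_language.html), and I came across this really interesting quote about language parsing:

From a theoretical perspective, however, being able to generate good diagnostics requires that there be redundancy in the syntax. The redundancy is used to guess at what was intended, and the more redundancy there is, the more likely the guess is to be correct. It's like the English language: if we misspell a word now and then, or if a word is missing, the redundancy enables us to correctly guess the meaning. If there is no redundancy in a language, then any random sequence of characters is a valid program.

Now I'm trying to figure out what exactly he means by "redundancy".

I can barely wrap my head around the last part, where he mentions that it is possible to have a language in which "any random sequence of characters is a valid program". I was taught that there are three kinds of errors: syntax errors, run-time errors, and semantic errors. Are there languages in which the only possible errors are semantic ones? Is assembly like that? What about machine code?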

苏别ゝ 2024-09-22 07:24:52

I'll focus on why (I think) Walter Bright thinks redundancy is good. Let's take XML as an example. This snippet:

<foo>...</foo>

has redundancy; the closing tag becomes unnecessary if we use S-expressions instead:

(foo ...)

It's shorter, and the programmer doesn't have to type foo more often than necessary to make sense of that snippet. Less redundancy. But it has downsides, as an example from http://www.prescod.net/xml/sexprs.html shows:

(document author: "[email protected]"
    (para "This is a paragraph " (footnote "(better than the one under there)" ".")
    (para "Ha! I made you say \"underwear\"."))


<document author="[email protected]">
<para>This is a paragraph <footnote>(just a little one).</para>
<para>Ha! I made you say "underwear".</para>
</document>

In both, the closing delimiter for footnote (the end tag or the closing paren) is missing. The XML version is plainly invalid as soon as the parser sees </para>. The S-expression version only becomes invalid at the end of the document, and only if you don't have an unneeded closing paren somewhere else. So redundancy does help, in some cases, to understand what the writer meant (and to point out errors in his way of expressing that).
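To make the difference concrete, here is a minimal Python sketch of the two kinds of checker. It is an illustration only, not a real XML or Lisp parser: the function names `check_tags` and `check_parens` are made up for this example, attributes are skipped, and string literals are not handled, so the inputs below avoid parentheses inside quotes.

```python
import re

def check_tags(text):
    """Check named open/close tags; return (position, message) or None."""
    stack = []
    for m in re.finditer(r"<(/?)(\w+)[^>]*>", text):
        closing, name = m.group(1), m.group(2)
        if not closing:
            stack.append(name)          # remember which element is open
        elif not stack:
            return m.start(), f"unexpected </{name}>"
        elif stack[-1] != name:
            # The name in the closing tag is redundant information,
            # and that redundancy pinpoints the mistake right here.
            return m.start(), f"expected </{stack[-1]}> but found </{name}>"
        else:
            stack.pop()
    if stack:
        return len(text), f"missing </{stack[-1]}>"
    return None

def check_parens(text):
    """Check anonymous parentheses; return (position, message) or None."""
    depth = 0
    for i, ch in enumerate(text):
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            if depth < 0:
                return i, "unexpected )"
    if depth > 0:
        # All closers look alike, so the error only surfaces at the end.
        return len(text), f"{depth} unclosed ("
    return None

xml = "<doc><para>text <footnote>note</para><para>more</para></doc>"
sexpr = "(doc (para text (footnote note) (para more))"

print(check_tags(xml))
print(check_parens(sexpr))
print(check_parens(sexpr + ")"))  # a stray ")" elsewhere hides the error
```

On these inputs the tag checker reports the problem at the first </para>, the paren checker can only report it at the end of the input, and adding one stray closing paren makes the paren version pass with no complaint at all.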

明媚如初 2024-09-22 07:24:52

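Assembly language (most assembly languages, anyway) isn't like that at all: they have a fairly rigid syntax, and most random strings would be diagnosed as errors.

Machine code is quite a bit closer. Since there's no translation from "source" code to "object" code involved, all errors are semantic, not syntactic. Most processors do have various inputs they will reject (e.g., by raising a "bad opcode" trap/interrupt). You could argue that in some cases this is syntactic (e.g., an opcode that isn't recognized at all), while in others it is semantic (e.g., a set of operands that isn't allowed for that instruction).

For those old enough to remember it, TECO was famous (notorious?) for assigning some meaning to almost any possible input, so it was pretty much the same way. An interesting challenge was to figure out what would happen if you typed in (for example) your name.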
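As a rough illustration of an encoding with no redundancy, here is a toy accumulator machine in Python. It is entirely hypothetical (it models neither a real processor nor TECO): every possible byte decodes to some instruction, so no input can ever be rejected, and the only errors left are "it did not do what I meant".

```python
def run(program: bytes) -> int:
    """Interpret each byte as one instruction of a made-up 8-bit machine."""
    acc = 0
    for byte in program:
        op = byte >> 6        # top 2 bits select one of 4 operations
        arg = byte & 0x3F     # low 6 bits are the operand
        if op == 0:
            acc = (acc + arg) & 0xFF   # add
        elif op == 1:
            acc = (acc - arg) & 0xFF   # subtract
        elif op == 2:
            acc ^= arg                 # exclusive or
        else:
            acc = arg                  # load
    return acc

# Any byte string at all is a "valid program", including your name.
print(run(b"Walter"))
print(run(bytes([0x05, 0x43])))  # add 5, then subtract 3
```

Because all 256 byte values are used up by the instruction set, a typo in a program simply produces a different valid program, which is the situation the quote describes.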

淡淡的优雅 2024-09-22 07:24:52

nglsh nclds ll srts of xtr ltrs t mk it ezr t rd

深海蓝天 2024-09-22 07:24:52

Well, to use an example from C# (since I don't know D). If you have a class with an abstract method, the class itself must be marked abstract:

public abstract class MyClass
{
    public abstract void MyFunc(); // a method declaration needs a return type
}

Now, it would be trivial for the compiler to automatically mark MyClass as abstract (that is the way C++ handles it), but in C#, you must do it explicitly, so that your intentions are clear.

Similarly with virtual methods. In C++, if a method is declared virtual in a base class, it is automatically virtual in all derived classes. In C#, the overriding method must nevertheless be explicitly marked override, so there is no confusion about what you wanted.
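The same idea can be sketched by analogy in Python. This is not how C# implements it: the `override` decorator and the `Checked` base class below are hypothetical helpers written for this example. The marker is redundant when the code is correct, but it lets the class definition fail early when a method name is mistyped.

```python
def override(func):
    """Hypothetical marker: func is intended to replace a base-class method."""
    func.__declared_override__ = True
    return func

class Checked:
    """Base class that verifies each @override method overrides something."""
    def __init_subclass__(cls, **kwargs):
        super().__init_subclass__(**kwargs)
        for name, attr in vars(cls).items():
            if getattr(attr, "__declared_override__", False):
                # Look for the same name in every class after cls in the MRO.
                if not any(hasattr(base, name) for base in cls.__mro__[1:]):
                    raise TypeError(
                        f"{cls.__name__}.{name} is marked @override "
                        f"but no base class defines {name}"
                    )

class Shape(Checked):
    def area(self):
        return 0.0

class Square(Shape):
    def __init__(self, side):
        self.side = side

    @override
    def area(self):          # matches Shape.area, so the check passes
        return self.side ** 2

print(Square(3).area())
```

Without the marker, a misspelled method such as `aera` would quietly define a new method and leave `Shape.area` in effect; with it, the mismatch between the stated intent and the code is reported when the class is defined.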

彩虹直至黑白 2024-09-22 07:24:52

I think he was talking about syntactical structures in the language and how they can be interpreted. As an example, consider the humble "if" statement, rendered in several languages.

In bash (shell script), it looks like this:

if [ cond ]; then
  stmts;
elif [ other_cond ]; then
  other_stmts;
else
  other_other_stmts;
fi

In C (with single statements, no curly braces):

if (cond)
  stmt;
else if (other_cond)
  other_stmt;
else
  other_other_stmt;

You can see that in bash, there is a lot more syntactical structure to the if statement than there is in C. In fact, all control structures in bash have their own closing delimiters (e.g. if/then/fi, for/do/done, case/in/esac,...), whereas in C the curly brace is used everywhere. These unique delimiters disambiguate the meaning of the code, and thereby provide context from which the interpreter/compiler can diagnose error conditions and report them to the user.

There is, however, a tradeoff. Programmers generally prefer terse syntax (a la C, Lisp, etc.) to verbose syntax (a la Pascal, Ada, etc.). However, they also prefer descriptive error messages containing line/column numbers and suggested resolutions. These goals are of course at odds with each other--you can't have your cake and eat it too (at least, while keeping the internal implementation of the compiler/interpreter simple).
