ML 中的派生类型表达式

发布于 2024-09-10 06:59:43 字数 162 浏览 3 评论 0原文

所有，

我想在 ML 中导出以下函数的类型表达式：

fun f x y z = y (x z)

现在我知道输入相同的内容会生成类型表达式。但我希望手工得出这些值。

另外，请提及派生类型表达式时要遵循的一般步骤。

原文

All,

I want to derive the type expression for the function below in ML:

fun f x y z = y (x z)

Now I know typing the same would generate the type expression. But I wish to derive these values by hand.

Also, please mention the general steps to follow when deriving type expressions.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

你爱我像她 2024-09-17 06:59:43

我将尝试以尽可能机械的方式来完成此操作，就像大多数编译器中的实现一样。

让我们分解一下：

fun f x y z = y (x z)

这基本上是糖：

val f = fn x => fn y => fn z => y (x z)

让我们添加一些元语法类型变量（这些不是真正的 SML 类型，只是本示例中的占位符）：

val f : TX = fn (x : T2) => fn (y : T3) => fn (z : T4) => y (x z) : T5

好的，所以我们可以从此开始生成一个约束系统。 T5是f的最终返回类型。目前，我们将把整个函数的最终类型称为“TX”——一些新鲜的、未知的类型变量。

因此，在您给出的示例中将产生约束的是函数应用程序。它告诉我们表达式中事物的类型。事实上，这是我们唯一掌握的信息！

那么这些应用程序告诉我们什么？

忽略我们上面分配的类型变量，让我们只看一下函数体：

y (x z)

z 没有应用于任何东西，所以我们只需查找我们之前分配给它的类型变量（T4）并使用它作为其类型。

x 应用于 z，但我们还不知道它的返回类型，因此让我们为其生成一个新的类型变量，并使用我们之前分配给 x (T2) 的类型来创建约束：

T2 = T4 -> T7

y 应用于 ( xz），我们称之为 T7。再一次，我们还不知道 y 的返回类型，所以我们只给它一个新变量：

T3 = T7 -> T8

我们还知道 y 的返回类型是函数整个主体的返回类型，我们称之为“T5”较早，所以我们添加了约束：

T5 = T8

为了紧凑性，我将对此进行一些整理，并根据函数返回函数的事实为 TX 添加一个约束。这可以通过完全相同的方法导出，只是稍微复杂一些。如果您不相信我们最终会得到这个约束，希望您可以自己做这个练习：

TX = T2 -> T3 -> T4 -> T5

现在我们收集所有约束：

val f : TX = fn (x : T2) => fn (y : T3) => fn (z : T4) => y (x z) : T5
TX = T2 -> T3 -> T4 -> T5
T2 = T4 -> T7
T3 = T7 -> T8
T5 = T8

我们开始通过将左侧替换为右侧来求解这个方程组约束系统以及原始表达式中，从最后一个约束开始，一直到顶部。

val f : TX = fn (x : T2) => fn (y : T3) => fn (z : T4) => y (x z) : T8
TX = T2 -> T3 -> T4 -> T8
T2 = T4 -> T7
T3 = T7 -> T8

val f : TX = fn (x : T2) => fn (y : T7 -> T8) => fn (z : T4) => y (x z) : T8
TX = T2 -> (T7 -> T8) -> T4 -> T8
T2 = T4 -> T7

val f : TX = fn (x : T4 -> T7) => fn (y : T7 -> T8) => fn (z : T4) => y (x z) : T8
TX = (T4 -> T7) -> (T7 -> T8) -> T4 -> T8

val f : (T4 -> T7) -> (T7 -> T8) -> T4 -> T8 = fn (x : T4 -> T7) => fn (y : T7 -> T8) => fn (z : T4) => y (x z) : T8

好吧，目前看来这很糟糕。我们现在并不真正需要表达式的整个主体 - 它只是为了提供一些清晰的解释。基本上在符号表中我们会有这样的东西：

val f : (T4 -> T7) -> (T7 -> T8) -> T4 -> T8

最后一步是将剩下的所有类型变量概括为我们所知道和喜爱的更熟悉的多态类型。基本上这只是一次传递，将第一个未绑定类型变量替换为“a”，将第二个变量替换为“b”，依此类推。

val f : ('a -> 'b) -> ('b -> 'c) -> 'a -> 'c

我很确定您会发现您的 SML 编译器也会为该术语建议的类型。我是凭记忆手工完成的，所以如果我在某个地方搞砸了一些东西，我深表歉意：p

我发现很难找到这个推理和类型约束过程的良好解释。我用了两本书来学习它：Andrew Appel 的《ML 中的现代编译器实现》和 Pierce 的《类型和编程语言》。这两个人都不能独立地完全启发我，但在他们两个之间我找到了答案。

I'm going to try to do this in the most mechanical way possible, exactly as the implementation in most compilers would.

Let's break it down:

fun f x y z = y (x z)

This is basically sugar for:

val f = fn x => fn y => fn z => y (x z)

Let's add some meta-syntactic type variables (these are not real SML-types, just place holders for this example's sake):

val f : TX = fn (x : T2) => fn (y : T3) => fn (z : T4) => y (x z) : T5

OK, so we can start generating a system of constraints from this. T5 is the eventual return type of f. For the moment, we're going to just call the eventual type of this whole function "TX" - some fresh, unknown type variable.

So the thing that is going to be generating constraints in the example you've given is function application. It tells us about the types of things in the expression. In fact, it's the only information we have!

So what do the applications tell us?

Ignoring the type variables we assigned above, let's just look at the body of the function:

y (x z)

z is not applied to anything, so we're going to just look up what the type variable we assigned to it was earlier (T4) and use that as its type.

x is applied to z, but we don't know its return type yet, so let's generate a fresh type variable for that and use the type we assigned x (T2) earlier to create a constraint:

T2 = T4 -> T7

y is applied to the result of (x z), which we just called T7. Once again, we don't know the return type of y yet, so we'll just give it a fresh variable:

T3 = T7 -> T8

We also know that the return type of y is the return type for the whole body of the function, we we called "T5" earlier, so we add the constraint:

T5 = T8

For compactness, I'm going to kludge this a little and add a constraint for TX based on the fact that there are functions being returned by functions. This is derivable by exactly the same method, except it's a little more complex. Hopefully you can do this yourself as an exercise if you're not convinced that we would eventually end up with this constraint:

TX = T2 -> T3 -> T4 -> T5

Now we collect all the constraints:

val f : TX = fn (x : T2) => fn (y : T3) => fn (z : T4) => y (x z) : T5
TX = T2 -> T3 -> T4 -> T5
T2 = T4 -> T7
T3 = T7 -> T8
T5 = T8

We start to solve this system of equations by substituting left hand sides with right hand sides in the system of constraints, as well as in the original expression, starting from the last constraint and working our way to the top.

val f : TX = fn (x : T2) => fn (y : T3) => fn (z : T4) => y (x z) : T8
TX = T2 -> T3 -> T4 -> T8
T2 = T4 -> T7
T3 = T7 -> T8

val f : TX = fn (x : T2) => fn (y : T7 -> T8) => fn (z : T4) => y (x z) : T8
TX = T2 -> (T7 -> T8) -> T4 -> T8
T2 = T4 -> T7

val f : TX = fn (x : T4 -> T7) => fn (y : T7 -> T8) => fn (z : T4) => y (x z) : T8
TX = (T4 -> T7) -> (T7 -> T8) -> T4 -> T8

val f : (T4 -> T7) -> (T7 -> T8) -> T4 -> T8 = fn (x : T4 -> T7) => fn (y : T7 -> T8) => fn (z : T4) => y (x z) : T8

OK, so this looks horrible at the moment. We don't really need the whole body of the expression sitting around at the moment - it was just there to provide some clarity in the explanation. Basically in the symbol table we would have something like this:

val f : (T4 -> T7) -> (T7 -> T8) -> T4 -> T8

The last step is to generalise all the type variables that are left over into the more familiar polymorphic types that we know and love. Basically this is just a pass, replacing the first unbound type variable with 'a, the second with 'b and so on.

val f : ('a -> 'b) -> ('b -> 'c) -> 'a -> 'c

Which I'm pretty sure you'll find is the type that your SML compiler will suggest for that term too. I did this by hand and from memory, so apologies if I've botched something somewhere :p

I found it difficult to find a good explanation of this inference and type constraint process. I used two books to learn it - 'Modern Compiler Implementation in ML' by Andrew Appel, and 'Types and Programming Languages' by Pierce. Neither one was independently completely illuminating for me, but between the two of them I figured it out.

回复收藏 0 原文

你与清晨阳光 2024-09-17 06:59:43

要确定某物的类型，您需要查看使用它的每个地方。例如，如果您看到 val h = hd l，则您知道 l 是一个列表（因为 hd 将列表作为参数）并且您还知道 h 的类型是 l 的列表类型。因此，假设 h 的类型是 a，而 l 的类型是 a list（其中 a 是占位符）。现在，如果您看到 val h2 = h*2，您就知道 h 和 h2 是 int，因为2 是一个 int，您可以将一个 int 与另一个 int 相乘，两个 int 相乘的结果是一个 int。由于我们之前说过 h 的类型是 a 这意味着 a 是 int，因此 h 的类型是 a code>l 是 int list。