Handling Incremental Data Modeling Changes in Functional Programming

Posted on 2024-09-01 05:59:35

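As a developer, most of the problems I have to solve at work have to do with data modeling. For example, in the OOP web application world, I often have to change the data properties of an object to meet new requirements.

If I'm lucky, I don't even need to programmatically add new "behavior" code (functions, methods). Instead, I can declaratively add validation, and even UI options, by annotating the property (Java).

In functional programming, it seems that adding new data properties requires a lot of code changes because of pattern matching and data constructors (Haskell, ML).

How do I minimize this problem?

This seems to be a recognized problem, as Xavier Leroy states nicely on page 24 of "Objects and Classes vs. Modules". To summarize for those who don't have a PostScript viewer, it basically says that FP languages are better than OOP languages at adding new behavior over data objects, while OOP languages are better at adding new data objects/properties.

Are there any design patterns used in FP languages to help mitigate this problem?

I have read Philip Wadler's recommendation of using monads to help with this modularity problem, but I'm not sure I understand how.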

情深已缘浅 2024-09-08 05:59:36

As Darius Bacon noted, this is essentially the expression problem, a long-standing issue with no universally accepted solution. The lack of a best-of-both-worlds approach doesn't stop us from sometimes wanting to go one way or the other, though. Now, you asked for a "design pattern for functional languages", so let's take a shot at it. The example that follows is written in Haskell, but isn't necessarily idiomatic for Haskell (or any other language).

First, a quick review of the "expression problem". Consider the following algebraic data type:

data Expr a = Lit a | Sum (Expr a) (Expr a)

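exprEval :: Num a => Expr a -> a
exprEval (Lit x) = x
exprEval (Sum x y) = exprEval x + exprEval y

-- unwords already puts a single space between the pieces
exprShow :: Show a => Expr a -> String
exprShow (Lit x) = show x
exprShow (Sum x y) = unwords ["(", exprShow x, "+", exprShow y, ")"]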

This represents simple mathematical expressions, containing only literal values and addition. With the functions we have here, we can take an expression and evaluate it, or show it as a String. Now, suppose we want to add a new function, for example one that maps a function over all the literal values:

exprMap :: (a -> b) -> Expr a -> Expr b
exprMap f (Lit x) = Lit (f x)
exprMap f (Sum x y) = Sum (exprMap f x) (exprMap f y)

Easy! We can keep writing functions all day without breaking a sweat! Algebraic data types are awesome!

In fact, they're so awesome, we want to make our expression type more, errh, expressive. Let's extend it to support multiplication, we'll just... uhh... oh dear, that's going to be awkward, isn't it? We have to modify every function we just wrote. Despair!
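To make the awkwardness concrete, here is a minimal standalone sketch of what the extension costs. The `Prod` constructor name is an assumption chosen for illustration; the point is only that every existing function needs a new equation.

```haskell
-- Standalone sketch: the Expr type with a hypothetical Prod constructor added.
data Expr a
  = Lit a
  | Sum  (Expr a) (Expr a)
  | Prod (Expr a) (Expr a)   -- the new case

exprEval :: Num a => Expr a -> a
exprEval (Lit x)    = x
exprEval (Sum x y)  = exprEval x + exprEval y
exprEval (Prod x y) = exprEval x * exprEval y   -- added for Prod

exprShow :: Show a => Expr a -> String
exprShow (Lit x)    = show x
exprShow (Sum x y)  = unwords ["(", exprShow x, "+", exprShow y, ")"]
exprShow (Prod x y) = unwords ["(", exprShow x, "*", exprShow y, ")"]   -- added for Prod

exprMap :: (a -> b) -> Expr a -> Expr b
exprMap f (Lit x)    = Lit (f x)
exprMap f (Sum x y)  = Sum (exprMap f x) (exprMap f y)
exprMap f (Prod x y) = Prod (exprMap f x) (exprMap f y)   -- added for Prod
```

One small consolation: if a function is left without its `Prod` equation, compiling with GHC's `-Wall` (which enables `-Wincomplete-patterns`) should point out the missing case, so the compiler at least helps find what still needs editing.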

In fact, maybe extending the expressions themselves is more interesting than adding functions that use them. So, let's say we're willing to make the trade-off in the other direction. How might we do that?

Well, no sense doing things halfway. Let's up-end everything and invert the whole program. What does that mean? Well, this is functional programming, and what's more functional than higher-order functions? What we'll do is replace the data type representing expression values with one representing actions on the expression. Instead of choosing a constructor we'll need a record of all possible actions, something like this:

data Actions a = Actions {
    actEval :: a,
    actMap  :: (a -> a) -> Actions a }

So how do we create an expression without a data type? Well, our functions are data now, so I guess our data needs to be functions. We'll make "constructors" using regular functions, returning a record of actions:

mkLit :: a -> Actions a
mkLit x = Actions x (\f -> mkLit (f x))

mkSum :: Num a => Actions a -> Actions a -> Actions a
mkSum x y = Actions
    (actEval x + actEval y)
    (\f -> mkSum (actMap x f) (actMap y f))

Can we add multiplication more easily now? Sure can!

mkProd :: Num a => Actions a -> Actions a -> Actions a
mkProd x y = Actions
    (actEval x * actEval y)
    (\f -> mkProd (actMap x f) (actMap y f))

Oh, but wait--we forgot to add an actShow action earlier, let's add that in, we'll just... errh, well.
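For symmetry, here is a standalone sketch of what adding that action might cost. The field type `String` and the output format are assumptions for illustration; what matters is that the record gains a field and every "constructor" function has to be edited to fill it in.

```haskell
-- Standalone sketch: the Actions record with a hypothetical actShow field.
data Actions a = Actions
  { actEval :: a
  , actMap  :: (a -> a) -> Actions a
  , actShow :: String                  -- the new action
  }

-- Every "constructor" below had to change to supply the third field.
mkLit :: Show a => a -> Actions a
mkLit x = Actions x (\f -> mkLit (f x)) (show x)

mkSum :: Num a => Actions a -> Actions a -> Actions a
mkSum x y = Actions
  (actEval x + actEval y)
  (\f -> mkSum (actMap x f) (actMap y f))
  (unwords ["(", actShow x, "+", actShow y, ")"])

mkProd :: Num a => Actions a -> Actions a -> Actions a
mkProd x y = Actions
  (actEval x * actEval y)
  (\f -> mkProd (actMap x f) (actMap y f))
  (unwords ["(", actShow x, "*", actShow y, ")"])
```

So the trade-off really is mirrored: in this style a new kind of expression is one new function, while a new operation touches every existing one.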

At any rate, what does it look like to use the two different styles?

expr1plus1 = Sum (Lit 1) (Lit 1)
action1plus1 = mkSum (mkLit 1) (mkLit 1)
action1times1 = mkProd (mkLit 1) (mkLit 1)

Pretty much the same, when you're not extending them.
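Consuming the values looks similar too. The sketch below repeats the definitions so it stands alone; `evalBoth` and `mappedBoth` are hypothetical names added only to show the two call shapes side by side.

```haskell
-- Data style: a closed type plus free-standing functions.
data Expr a = Lit a | Sum (Expr a) (Expr a)

exprEval :: Num a => Expr a -> a
exprEval (Lit x)   = x
exprEval (Sum x y) = exprEval x + exprEval y

exprMap :: (a -> b) -> Expr a -> Expr b
exprMap f (Lit x)   = Lit (f x)
exprMap f (Sum x y) = Sum (exprMap f x) (exprMap f y)

-- Actions style: a record of operations plus constructor-like functions.
data Actions a = Actions
  { actEval :: a
  , actMap  :: (a -> a) -> Actions a
  }

mkLit :: a -> Actions a
mkLit x = Actions x (\f -> mkLit (f x))

mkSum :: Num a => Actions a -> Actions a -> Actions a
mkSum x y = Actions (actEval x + actEval y) (\f -> mkSum (actMap x f) (actMap y f))

mkProd :: Num a => Actions a -> Actions a -> Actions a
mkProd x y = Actions (actEval x * actEval y) (\f -> mkProd (actMap x f) (actMap y f))

expr1plus1 :: Expr Int
expr1plus1 = Sum (Lit 1) (Lit 1)

action1plus1, action1times1 :: Actions Int
action1plus1  = mkSum (mkLit 1) (mkLit 1)
action1times1 = mkProd (mkLit 1) (mkLit 1)

-- Evaluating: a function applied to the data vs. a field read from the record.
evalBoth :: (Int, Int, Int)
evalBoth = (exprEval expr1plus1, actEval action1plus1, actEval action1times1)

-- Mapping: note the flipped argument order, since actMap is a field selector.
mappedBoth :: (Int, Int)
mappedBoth = (exprEval (exprMap (* 10) expr1plus1), actEval (actMap action1plus1 (* 10)))
```

Here `evalBoth` is `(2, 2, 1)` and `mappedBoth` is `(20, 20)`. The only visible difference at the call site is that `exprMap` takes the function first, while `actMap` is selected from the value and then applied.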

As an interesting side note, consider that in the "actions" style, the actual values in the expression are completely hidden--the actEval field only promises to give us something of the correct type; how it provides it is its own business. Thanks to lazy evaluation, the contents of the field may even be an elaborate computation, performed only on demand. An Actions a value is completely opaque to external inspection, presenting only the defined actions to the outside world.
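A small standalone sketch of both points, under the assumption of two made-up names: `mkSumAll`, an n-ary sum backed by a list instead of two operands, and `neverForced`, a value whose original literal is never evaluated.

```haskell
data Actions a = Actions
  { actEval :: a
  , actMap  :: (a -> a) -> Actions a
  }

mkLit :: a -> Actions a
mkLit x = Actions x (\f -> mkLit (f x))

mkSum :: Num a => Actions a -> Actions a -> Actions a
mkSum x y = Actions
  (actEval x + actEval y)
  (\f -> mkSum (actMap x f) (actMap y f))

-- Hypothetical constructor with a different internal shape (a list of operands).
-- Callers only see the same two actions, so it mixes freely with mkSum.
mkSumAll :: Num a => [Actions a] -> Actions a
mkSumAll xs = Actions
  (sum (map actEval xs))
  (\f -> mkSumAll (map (\x -> actMap x f) xs))

-- Laziness: the error below is never raised, because const ignores its
-- second argument and the record fields are only computed on demand.
neverForced :: Actions Int
neverForced = actMap (mkLit (error "not needed")) (const 5)
```

With these definitions, `actEval (mkSum (mkSumAll [mkLit 1, mkLit 2]) (mkLit 3))` evaluates to `6`, and `actEval neverForced` evaluates to `5` without ever touching the `error` call. Nothing outside `mkSumAll` can tell that it stores a list.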

This programming style--replacing simple data with a bundle of "actions" while hiding the actual implementation details in a black box, using constructor-like functions to build new bits of data, being able to interchange very different "values" with the same set of "actions", and so on--is interesting. There's probably a name for it, but I can't quite seem to recall...
