Difference between ":" and "|" in R linear modeling

Posted 2025-01-06 02:11:10


When constructing a linear model in R, what is the difference between the following two statements:

lm(y ~ x | z)
lm(y ~ x : z)

The lm function documentation documents the : operator as follows:

A specification of the form first:second indicates the set of terms obtained by taking the interactions of all terms in first with all terms in second.

There's no mention of | syntax on that page. What is the difference?

嗼ふ静 2025-01-13 02:11:10

: is used for interactions. In your example lm(y ~ x : z), the formula means "y depends on an interaction effect between x and z."

Usually, you wouldn't include an interaction in a linear regression like this unless you also included the individual terms x and z. x * z is short for x + x:z + z.
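
To make the expansion concrete, here is a minimal sketch on simulated data. The data frame d and the model names are made up for illustration and are not part of the original question.

```r
# Simulated data; all names here are hypothetical, chosen for this sketch
set.seed(1)
d <- data.frame(x = rnorm(50), z = rnorm(50))
d$y <- 1 + 2 * d$x - d$z + 0.5 * d$x * d$z + rnorm(50)

fit_colon <- lm(y ~ x:z, data = d)         # intercept and interaction only
fit_star  <- lm(y ~ x * z, data = d)       # main effects plus interaction
fit_long  <- lm(y ~ x + z + x:z, data = d) # the same model, written out

names(coef(fit_colon))  # "(Intercept)" "x:z"
names(coef(fit_star))   # "(Intercept)" "x" "z" "x:z"
all.equal(coef(fit_star), coef(fit_long))  # TRUE
```

The x:z model has no main-effect coefficients at all, which is why it is rarely what you want on its own.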

AFAIK, | isn't used by lm at all. It certainly doesn't show up in any of the examples in demo("lm.glm", "stats"). It is used in the mixed effects models in the nlme package.
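
As a hedged aside: as far as I can tell, the base R formula parser gives | no special meaning, so lm does not reject it but evaluates x | z as the ordinary element-wise logical OR and uses the result as a single predictor. The sketch below checks that on made-up logical data (d, a, b and either are hypothetical names); treat it as an illustration of that assumption rather than documented lm behaviour.

```r
# Hypothetical data with two logical predictors
set.seed(42)
d <- data.frame(
  a = rep(c(TRUE, FALSE), 20),
  b = rep(c(TRUE, TRUE, FALSE, FALSE), 10)
)
d$y <- 1 + 2 * (d$a | d$b) + rnorm(40)

# Formula using | directly
fit_or <- lm(y ~ a | b, data = d)

# Same model with the logical OR computed beforehand
d$either <- d$a | d$b
fit_either <- lm(y ~ either, data = d)

# If | is treated as plain logical OR, the two fits agree
all.equal(unname(coef(fit_or)), unname(coef(fit_either)))
```

With numeric x and z this is rarely useful, since any nonzero number counts as TRUE.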

An example from ?intervals.lme:

library(nlme)  # provides lme() and the Orthodont data set
model <- lme(distance ~ age, Orthodont, random = ~ age | Subject)
ranef(model)   # one row of random effects per Subject

Here the | means "group by". That is, a different random effect for age is fitted for every subject. (Looking at ranef(model), you can see that each row corresponds to the random effects for a person (subject).)
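
A quick way to check the "group by" reading is to compare the shape of ranef(model) with the grouping factor. This is a small sketch that assumes the nlme package is installed (it ships with standard R distributions); n_subjects and re are names introduced here for illustration.

```r
library(nlme)

# Same model as above: random intercept and age slope within each Subject
model <- lme(distance ~ age, Orthodont, random = ~ age | Subject)

re <- ranef(model)
n_subjects <- nlevels(Orthodont$Subject)

nrow(re) == n_subjects  # one row per subject
colnames(re)            # one column per random-effect term
```

The left side of | names the effects that vary (here the intercept and age), and the right side names the factor they vary across.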