识别面板数据模型（stata）中观察很少的群体

发布于 2025-01-10 19:53:55 字数 169 浏览 1 评论 0原文

如何识别面板数据模型中观测值很少的组？

我使用 xtlogit 估计了几个随机效应模型。平均每组有 26 个观测值，但有些组只记录 1 个观测值。我想识别它们并将它们从模型中排除......有什么建议吗？我的面板数据是使用以下方式设置的：xtset Countrycode Year

需要登录才能够评论，你可以免费注册一个本站的账号。

你爱我像她 2025-01-17 19:53:55

假设您的足够大面板的神奇数字是 7，并且您适合第一个模型。

   bysort countrycode : egen n_used = total(e(sample))

然后为您提供可用和可使用的观测值的计数，之后您对后续模型的标准是 if n_used >= 7

您可以直接

bysort countrycode : gen n_available = _N

不管模型拟合如何。

差异有两个：

Let's suppose your magic number for a big enough panel is 7 and that you fit a first model.

   bysort countrycode : egen n_used = total(e(sample))

then gives you a count of how many observations were available and can be used, after which your criterion for a later model is if n_used >= 7

You could just go

bysort countrycode : gen n_available = _N

regardless of a model fit.

The differences are two-fold:

That last statement would disregard any missing values in the variables used in a model fit.
If you also used if and/or in to restrict model fit to particular subsets of observations, then e(sample) knows about that, but the last statement does not.

~没有更多了~