具有聚类标准误差的固定效应回归

Question

我目前正在研究 r 中的固定效应回归。

我有一个包含以下变量的数据集：公司 CEO 的总薪酬 (TOTAL_COMP)、公司代码 (GVKEY)、财政年度 (FISCAL_YEAR)、许多公司特征（如资产 (AT)）和公司所属行业 (SIC).

虽然数据是专有的，但它看起来像：

例如

TOTAL_COMP <- c(100, 200, 50, 150, 300, 200, 150, 75)

GVKEY <- c("103", "103", "103", "103", "104", "104", "104", "104")

AT <- c(1000, 1100, 1200, 1300, 2000, 2100, 2200, 2300)

FISCAL_YEAR <- c(2015, 2016, 2017, 2018, 2015, 2016, 2017, 2018)

SIC <- c(78, 78, 78, 78, 80, 80, 80, 80)

我使用了 plm function 和运行固定效应回归与行业 (SIC) 固定效应。

model <- plm(TOTAL_COMP ~ AT, index = "SIC", model = "within", data = COMBINED_DATA)

我随后想在公司 (GVKEY) 而不是行业 (SIC) 级别对标准误差进行聚类。但是，我不确定要使用哪个功能。我试过了：

coeftest(model, vcov = vcovHC(regression, type = "HC0", cluster = "group"))

但恐怕这会将标准误差集中在行业 (SIC) 而不是公司 (GVKEY) 级别。如何计算公司 (GVKEY) 级别的标准误差？

（使用的包：plm 和 lmtest）

Answer 1

您可以将 SIC 设置为虚拟变量并在公司级别对标准误差进行聚类：

# The fixed effects model
model <- lm(TOTAL_COMP ~ AT + factor(SIC), data = COMBINED_DATA)

# The fixed effects model with cluster settings
library(estimatr)
Clu_robust <- lm_robust(TOTAL_COMP ~ AT + factor(SIC), 
                        data = COMBINED_DATA,
                        clusters = GVKEY, 
                        se_type = 'stata')

具有聚类标准误差的固定效应回归

Fixed effect regression with clustered standard errors

r

plm