用于 coxph 损失函数的 r 中的 GBM

Question

我在 r 中使用 gbm 来预测生存（分布 = "coxph"）。

当gbm.predict(...., type = "response")时的预测值大约在[-0.001到0.5]之间。

如何在没有 0 到 1 ([0,1]) 范围内的风险的情况下解释新样本的风险。

Answer 1

如果您查看 bazehaz.gbm，您会看到 gbm.predict 给出了 lambda_0(t)。

The proportional hazard model assumes h(t|x)=lambda(t)*exp(f(x)).
gbm can estimate the f(x) component via partial likelihood.
After estimating f(x), basehaz.gbm can compute the a nonparametric estimate of lambda(t).

所以你可以这样走（我希望）：

model = gbm(Surv(durata, status2015) ~ .-fold, data= ...)
XB =predict.gbm(model, n.trees = ..., type = "response")

lambda0 = basehaz.gbm(t = data$time, delta = data$censoring,  t.eval =   sort(unique(data$time)), cumulative = FALSE, f.x = XB , smooth=T)
XBnew =predict.gbm(model, n.trees = ..., data=newData, type = "response")

hazard = h(t|x)= lambda0*exp(XBnew).

如果您正在寻找生存功能..我正在研究它。 :)

P.S.: XB 思想的估计有一个奇怪的行为.. 因为它随着树的数量而显着变化。 :(

用于 coxph 损失函数的 r 中的 GBM

GBM in r for coxph loss function

r

predict

gbm