在守护进程模式下运行 vowpal wabbit contextual bandit 失败

Question

我正在尝试以 --daemon 模式将我的数据流式传输到 vw。现在我只使用一个玩具示例。由于某些原因，我没有使用大众拥有的任何策略（epsilon、bag、first）。我总是得到一个错误。有什么问题吗？

我的数据：

9:0:0.3 | SmartPhone 1 16
7:0:0.3 | SmartPhone 1 21

我如何启动培训：

vw --cb 10 --epsilon 0.1 -d train.vw --save_resume -f model.vw

它给了我错误（对于 --bag 10，--first 2，cb_explore 10--bag 10 顺便说一句，同样的错误）：

PS C:\Users\Default.PC\Desktop\vw> vw -d train.vw --cb 10 --epsilon 0.1 -f model.vw
final_regressor = model.vw
Num weight bits = 18
learning rate = 0.5
initial_t = 0
power_t = 0.5
Error: unrecognised option '--epsilon'

但是，如果我不使用 --epsilon，代码就可以工作。

vw --cb 10 -d train.vw --save_resume -f model.vw

我可以用它来预测：

predict: | SmartPhone 1 19 - result is 3

Answer 1

不确定使用的是哪个 vw 版本，但带有 --epsilon 的命令似乎适用于 vw 8.9.2：

PS C:\Users\xxx\Desktop> vw --version
8.9.2 (git commit: 884420267)

PS C:\Users\xxx\Desktop> vw -d train.vw --cb 10 --epsilon 0.1 -f model.vw
final_regressor = model.vw
Num weight bits = 18
learning rate = 0.5
initial_t = 0
power_t = 0.5
using no cache
Reading datafile = train.vw
num sources = 1
Enabled reductions: gd, scorer, csoaa, cb
average  since         example        example  current  current  current
loss     last          counter         weight    label  predict features
0.000000 0.000000            1            1.0    known        1        4
0.000000 0.000000            2            2.0    known        1        4

finished run
number of examples = 2
weighted example sum = 2.000000
weighted label sum = 0.000000
average loss = 0.000000
total feature number = 8

Answer 2

我会尝试使用 --cb_explore 10 而不是 --cb 10，因为它允许使用探索算法。

可以在此处的教程中找到更多详细信息： https://vowpalwabbit.org/tutorials/contextual_bandits.html

--cb The contextual bandit module which allows you to optimize predictor based on already collected data, or contextual bandits without exploration.

--cb_explore The contextual bandit learning algorithm for when the maximum number of actions is known ahead of time and semantics of actions stays the same across examples.

在守护进程模式下 运行 vowpal wabbit contextual bandit 失败

Failed to run vowpal wabbit contextual bandit in daemon mode

vowpalwabbit

在守护进程模式下运行 vowpal wabbit contextual bandit 失败