如何在 python 脚本中更改 fasttext api 的参数

Question

我们在命令提示符中有 fasttext commands 到运行

我已经克隆了 github 存储库，例如，在我使用的命令中更改网络参数以进行监督学习就像

 ./fasttext supervised -input FT_Race_data.txt -output race_model  -lr 0.4 -epoch 30 -loss hs

我正在更改 lr 和 epoch 以及 loss。我可以训练并获取所需的输出。

为了在 python 脚本中编程，我安装了 fasttext 库并尝试了

classifier = fasttext.supervised('FT_Race_data.txt','race_model')

模型得到了训练，但结果并不好，在这种情况下，我没有定义任何参数。所以我试了一下

classifier = fasttext.supervised('FT_Race_data.txt','race_model', 0.4, 30, 'hs')

程序运行没有错误，但没有给出任何结果。所以我试了一下

classifier = fasttext.supervised(input = 'FT_Race_data.txt',output ='race_model', lr = 0.4,epoch= 30,loss = 'hs')

它给出了一个错误，即 fasttext 只接受两个参数。

如何像在命令提示符中那样更改 python 脚本中的参数以微调监督学习？

Answer 1

为了将来参考，形成讨论 here, it seems that the pip install fasttext doesn't install the full features available in the repo。

所以直到最新功能包含在 https://pypi.python.org/pypi/fasttext, for python bindings with features to train models and set parameters, follow the following installation procedure as outlined here 中。

git clone https://github.com/facebookresearch/fastText.git
cd fastText
pip install .

然后使用 train_supervised 一个函数，其中 returns 一个模型对象可以设置不同的参数，如下所示 example in that repo。

fastText.train_supervised(input, lr=0.1, dim=100, ws=5, epoch=5, minCount=1, minCountLabel=0, minn=0, maxn=0, neg=5, wordNgrams=1, loss='softmax', bucket=2000000, thread=12, lrUpdateRate=100, t=0.0001, label='__label__', verbose=2, pretrainedVectors='')

如何在 python 脚本中更改 fasttext api 的参数

how to change parameters of fasttext api in a python script

python

nlp

machine-learning

fasttext