在 train_test_split sklearn python 上设置种子

Question

有什么方法可以在 train_test_split 和 python sklearn 上设置种子。我已将参数 random_state 设置为整数，但我仍然无法重现结果。

提前致谢。

Answer 1

from sklearn.model_selection import train_test_split
x = [k for k in range(0, 10)]
y = [k for k in range(0, 10)]
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.4, random_state=11)
print (x_train)
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.4, random_state=11)
print (x_train)
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.4, random_state=11)
print (x_train)

每次拆分数据时，上面的代码都会为 x_train 产生相同的结果。随机性可能在您的数据框中，而不是 train_test_split。

Answer 2

只需在train_test_split中指定参数random_state=some_number_you_wan to use,，如random_state=42

在 train_test_split sklearn python 上设置种子

Setting seed on train_test_split sklearn python

python-3.x

scikit-learn

jupyter-notebook

train-test-split