Setting seed on train_test_split sklearn python
Is there any way to set a seed on train_test_split in Python sklearn? I have set the parameter random_state to an integer, but I still cannot reproduce the results.
Thanks in advance.
from sklearn.model_selection import train_test_split
x = list(range(10))
y = list(range(10))
# Split the same data three times with the same random_state.
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.4, random_state=11)
print(x_train)
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.4, random_state=11)
print(x_train)
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.4, random_state=11)
print(x_train)
The code above produces the same result for x_train every time the data is split. The randomness is probably in your data frame, not in train_test_split.
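To illustrate that last point, here is a minimal sketch, assuming the input rows arrive in a different order between runs (simulated here by simply reversing a list; the variable names are made up for the example). A fixed random_state selects the same positions each time, so if the data underneath is ordered differently, the values that land in x_train differ as well.

```python
from sklearn.model_selection import train_test_split

x = list(range(10))

# Same input order and same random_state: the split is identical.
a_train, a_test = train_test_split(x, test_size=0.4, random_state=11)
b_train, b_test = train_test_split(x, test_size=0.4, random_state=11)
same_input_matches = (a_train == b_train)

# Reversed input stands in for data whose row order changed upstream.
# The seed still picks the same positions, but other values sit there now.
x_reversed = x[::-1]
c_train, c_test = train_test_split(x_reversed, test_size=0.4, random_state=11)
reordered_matches = (a_train == c_train)

print(same_input_matches)
print(reordered_matches)
```

If this is what is happening, fixing the order of the data before splitting (for example by sorting on a stable key) is probably the thing to look at, rather than the seed.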
Just specify the parameter random_state=some_number_you_want_to_use in train_test_split, e.g. random_state=42.
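One related pitfall, shown as a rough sketch rather than a diagnosis of the question above: random_state also accepts a numpy RandomState instance, and reusing one instance across calls does not behave like passing an integer, because the generator's state advances with each call. The variable names below are only for illustration.

```python
import numpy as np
from sklearn.model_selection import train_test_split

x = list(range(100))

# An integer seed creates a fresh generator on every call,
# so repeated calls give the same split.
s1 = train_test_split(x, test_size=0.4, random_state=42)[0]
s2 = train_test_split(x, test_size=0.4, random_state=42)[0]
int_seed_repeats = (s1 == s2)

# A shared RandomState instance is consumed by each call,
# so the second call draws a different permutation.
rng = np.random.RandomState(42)
r1 = train_test_split(x, test_size=0.4, random_state=rng)[0]
r2 = train_test_split(x, test_size=0.4, random_state=rng)[0]
shared_rng_repeats = (r1 == r2)

print(int_seed_repeats)
print(shared_rng_repeats)
```

So for a split that should be identical on every call, passing a plain integer is the simpler choice.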