Scikit 学习交叉验证拆分

Scikit learn cross validation split

我目前正在使用 cross_validation.cross_val_predict 来获取 LogisticRegression 分类器所做的预测。我的问题是:训练集的数据百分比是多少,测试集的数据百分比是多少?是 80%-20% 的拆分吗?

我在 Whosebug 上查看了网站和其他问题,但没有找到问题的答案。

documentation for this function 中,它声明 cv 参数:

cv : cross-validation generator or int, optional, default: None A cross-validation generator to use. If int, determines the number of folds in StratifiedKFold if y is binary or multiclass and estimator is a classifier, or the number of folds in KFold otherwise. If None, it is equivalent to cv=3. This generator must include all elements in the test set exactly once. Otherwise, a ValueError is raised.