catboost eval_set在Scikit-Learn管道中无法使用

发布于 2025-02-08 19:25:41 字数 1185 浏览 1 评论 0原文

我正在尝试将x_valid数据集传递到evar_set从fit函数中的参数（从catboost库中）（这是 documentation ）但是我收到以下错误：

ValueError: Pipeline.fit does not accept the cat_features parameter. You can pass parameters to specific steps of your pipeline using the stepname__parameter format, e.g. `Pipeline.fit(X, y, logisticregression__sample_weight=sample_weight)`.

我正在运行的代码是

catboost_model = CatBoostClassifier(learning_rate=0.02, eval_metric='AUC')

pipeline = Pipeline([("classifer", catboost_model)])

cat_columns = ['frontend_client_type']

X_train, X_valid, y_train, y_valid = train_test_split(df[cat_columns], df['label'], test_size=0.2)

pipeline = pipeline.fit(
    X_train,
    y_train,
    cat_features=cat_columns,
    classifer__eval_set=[(X_valid, y_valid)],
)

我的合成数据框架是

df = pd.DataFrame({'frontend_client_type':['android', 'android', 'ios', 'web', 'android'],
                   'label':[True, True, False, False, True]})

原文

I am trying to pass X_valid dataset into the eval_set parameters in the fit function from CatBoost library (this is the link to the documentation) but I am getting the following error:

ValueError: Pipeline.fit does not accept the cat_features parameter. You can pass parameters to specific steps of your pipeline using the stepname__parameter format, e.g. `Pipeline.fit(X, y, logisticregression__sample_weight=sample_weight)`.

The code that I am running is

catboost_model = CatBoostClassifier(learning_rate=0.02, eval_metric='AUC')

pipeline = Pipeline([("classifer", catboost_model)])

cat_columns = ['frontend_client_type']

X_train, X_valid, y_train, y_valid = train_test_split(df[cat_columns], df['label'], test_size=0.2)

pipeline = pipeline.fit(
    X_train,
    y_train,
    cat_features=cat_columns,
    classifer__eval_set=[(X_valid, y_valid)],
)

My synthetic dataframe is

df = pd.DataFrame({'frontend_client_type':['android', 'android', 'ios', 'web', 'android'],
                   'label':[True, True, False, False, True]})

分享到QQ

分享到微博