Can anyone tell me why my pipeline is wrong?

Posted on 2025-01-24 23:28:56


I am trying to build a pipeline in order to perform GridSearchCV to find the best parameters. I already split the data into train and validation and have the following code:

cols = ['home_ownership', "purpose", "addr_state", "application_type", "term"]

column_transformer = make_pipeline(
    (OneHotEncoder(categories = cols)),
    (OrdinalEncoder(categories = X["grade"])),
    "passthrough")

imputer = SimpleImputer(strategy='median')
scaler = StandardScaler()
model = SGDClassifier(loss='log',random_state=42,n_jobs=-1,warm_start=True)

pipeline_sgdlogreg = make_pipeline(imputer, column_transformer, scaler, model)

When I perform GridSearchCV I get the following error:

"cannot use median strategy with non-numeric data (...)"

I do not understand why I am getting this error. None of the categorical variables have missing values.

I am performing the following: Imputation -> Encoding -> Scaling -> Modeling

Can anyone shed some light?
