逻辑回归can; t使用分类变量来训练我的模型

发布于 2025-01-23 03:45:21 字数 813 浏览 0 评论 0原文

我想使用此分类变量训练我的模型是我的目标变量，

SelectedColumns=['workOrganiz' , 'education', 'maritalSt','jobType','ageGroup','workHoursPeriod','sex','lifequality']

我尝试运行这样的逻辑回归，

dfML=df[SelectedColumns]
list_of_results=[]
#train and test set stratified
X=dfML.iloc[:,:-1]    #all features except last
y=dfML.iloc[:,-1]  #target last column

X_train,X_test,y_train,y_test=train_test_split(X,y,test_size=0.3,random_state=15,stratify=y)
clf=LogisticRegression()
lrm=clf.fit(X_train,y_train)
y_pred=lrm.predict(X_test)

但是我会收到以下错误，

ValueError: could not convert string to float: 'Private'

我在做什么错？使用假人使我的模型具有100％的精度和精度。

dfML=df[SelectedColumns]
dfML=pd.get_dummies(dfML)

如果我删除dfml = df [selectedcolumns]，

原文

I want to train my model using this categorical variables being lifequality my objective variable

SelectedColumns=['workOrganiz' , 'education', 'maritalSt','jobType','ageGroup','workHoursPeriod','sex','lifequality']

I try to run a logistic regression like this

dfML=df[SelectedColumns]
list_of_results=[]
#train and test set stratified
X=dfML.iloc[:,:-1]    #all features except last
y=dfML.iloc[:,-1]  #target last column

X_train,X_test,y_train,y_test=train_test_split(X,y,test_size=0.3,random_state=15,stratify=y)
clf=LogisticRegression()
lrm=clf.fit(X_train,y_train)
y_pred=lrm.predict(X_test)

but I get the following error

ValueError: could not convert string to float: 'Private'

What am I doing wrong?
Using dummies makes my model have a precision and accuracy of 100%

dfML=df[SelectedColumns]
dfML=pd.get_dummies(dfML)

If I remove the dfml=df[SelectedColumns] the 100% doesn't happen

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

晨与橙与城 2025-01-30 03:45:21

回归算法只能使用“数字”来计算分类预测。您可以进行工作，并且仍然使用分类变量作为预测指标。有不同的方式，但一种简单的方式称为“虚拟编码”。您可以使用功能get_dummies（）将分类伏击更改为多个0和1列。参见 https> https：https：// www。 geeksforgeeks.org/how-to-to-create-dummy-variables in-python-with-with-pandas/amp/

回复收藏 0 原文

~没有更多了~