如何修复:' valueerror:找到的输入变量,示例数量不一致'
为了使用线性回归预测房价,我无法使用model.fit()
训练模型,因为它给了我一个错误。
这是我的代码:
#importing dependencies
import pandas as pd
import numpy as np
from sklearn.linear_model import LinearRegression
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
#data loading
dataset = pd.read_csv('/content/dataset - train.csv')
#data visualization
plt.xlabel('Area')
plt.ylabel('Price')
plt.scatter(dataset['LotArea'], dataset['SalePrice'], color='red', marker='*')
#splitting data into features and target
X = dataset.drop(['SalePrice'], axis = 1)
Y = dataset['LotArea']
#data splitting into train and test data
X_train, X_test, Y_train, Y_train = train_test_split(X, Y, test_size=0.2, random_state=0)
#training the model
model = LinearRegression()
model.fit(X_train, Y_train)
我收到的错误:
ValueError Traceback (most recent call last)
<ipython-input-31-a42a894194a6> in <module>()
1 model = LinearRegression()
----> 2 model.fit(X_train, Y_train)
3 frames
/usr/local/lib/python3.7/dist-packages/sklearn/utils/validation.py in check_consistent_length(*arrays)
332 raise ValueError(
333 "Found input variables with inconsistent numbers of samples: %r"
--> 334 % [int(l) for l in lengths]
335 )
336
ValueError: Found input variables with inconsistent numbers of samples: [1168, 292]
请帮助我解决此问题。
For predicting house prices using linear regression, I am not able to train the model using model.fit()
as it gives me an error.
Here is my code:
#importing dependencies
import pandas as pd
import numpy as np
from sklearn.linear_model import LinearRegression
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
#data loading
dataset = pd.read_csv('/content/dataset - train.csv')
#data visualization
plt.xlabel('Area')
plt.ylabel('Price')
plt.scatter(dataset['LotArea'], dataset['SalePrice'], color='red', marker='*')
#splitting data into features and target
X = dataset.drop(['SalePrice'], axis = 1)
Y = dataset['LotArea']
#data splitting into train and test data
X_train, X_test, Y_train, Y_train = train_test_split(X, Y, test_size=0.2, random_state=0)
#training the model
model = LinearRegression()
model.fit(X_train, Y_train)
The error I get:
ValueError Traceback (most recent call last)
<ipython-input-31-a42a894194a6> in <module>()
1 model = LinearRegression()
----> 2 model.fit(X_train, Y_train)
3 frames
/usr/local/lib/python3.7/dist-packages/sklearn/utils/validation.py in check_consistent_length(*arrays)
332 raise ValueError(
333 "Found input variables with inconsistent numbers of samples: %r"
--> 334 % [int(l) for l in lengths]
335 )
336
ValueError: Found input variables with inconsistent numbers of samples: [1168, 292]
Please help me resolve this problem.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论