Why do my data values become negative after standardizing with DataPreparer when using Random Forest and SVM?
I am working on a predictive-modeling problem where I need to predict whether an online customer ends up purchasing a product, and since it is a classification problem I am using a Random Forest classifier and an SVM.
After creating the fitting splits for the training, testing, and validation sets, I dummify (one-hot encode), standardize, and normalize my data. However, after I standardize the sets, many of their values become negative. Why does this happen, and is there a way to change it?
The code I am using to scale my fitting sets is below:
data_preparer = DataPreparer(one_hot_encoder, standard_scaler)
data_preparer.prepare_data(fitting_splits.train_set).head()
data_preparer.prepare_data(fitting_splits.validation_set).head()
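For reference, here is a minimal sketch of what standard scaling does, assuming (since `DataPreparer` is a custom class not shown here) that it applies sklearn's StandardScaler internally. Standardization subtracts the column mean before dividing by the standard deviation, so any value below the mean necessarily comes out negative:

```python
# Sketch of z-score standardization, the formula StandardScaler uses:
#   z = (x - mean) / std
# Any value below the column mean maps to a negative z-score.

def standardize(values):
    mean = sum(values) / len(values)
    # Population standard deviation, matching StandardScaler's default
    std = (sum((v - mean) ** 2 for v in values) / len(values)) ** 0.5
    return [(v - mean) / std for v in values]

ages = [18, 25, 32, 47, 61]
print(standardize(ages))  # entries below the mean (36.6) are negative
```

Note that `standardize` here is an illustrative stand-in, not part of the `DataPreparer` API from the question.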
1 Answer
I think the documentation for sklearn.preprocessing.StandardScaler can help here. It defines the standard score of a sample x as:

    z = (x - u) / s

where u is the mean of the training samples and s is their standard deviation. Based on this equation, if x (the individual value currently being scaled) is less than the mean of the feature, then your scaled value will be negative.
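If you need the features to stay non-negative, one common alternative (not mentioned in the question, offered here as an option) is min-max scaling to the range [0, 1], which sklearn provides as sklearn.preprocessing.MinMaxScaler. A plain-Python sketch of the idea:

```python
# Min-max scaling maps each value into [0, 1]:
#   x' = (x - min) / (max - min)
# Unlike z-score standardization, the result is never negative.

def min_max_scale(values):
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

ages = [18, 25, 32, 47, 61]
print(min_max_scale(ages))  # smallest value maps to 0.0, largest to 1.0
```

That said, negative values are not a problem for either model here: tree-based models such as Random Forest are insensitive to feature scaling altogether, and SVMs care only that features are on comparable scales, not that they are positive.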