最先进的维度算法

发布于 2024-10-21 18:37:16 字数 675 浏览 12 评论 0原文

我们知道有一些算法可以减少数据集的维度，例如 PCA 和 Isomap，

目前最先进的算法是什么？降低数据集的维度。
你有一个例子吗，也许是在 MATLAB 上？

假设我们有一个包含 100,000 个属性的数据集，例如 Dorothea 数据集（由结构分子特征表示的化学化合物必须分为活性（与凝血酶结合）或非活性。这是 NIPS 2003 特征选择挑战赛的 5 个数据集之一。）

Data Set Characteristics:   Multivariate

Number of Instances:        1950

Area:                       Life

Attribute Characteristics:  Integer

Number of Attributes:       100000

Date Donated                2008-02-29

Associated Tasks:           Classification

Missing Values?             N/A

Number of Web Hits:         17103

原文

We know there are algorithms to reduce the dimension of data sets like PCA and Isomap

What is the state of the art in the
reducing dimensionality to data sets.
Do you have an example, maybe on MATLAB?

Lets say we have a data set with 100,000 attributes like Dorothea Data Set
(Chemical compounds represented by structural molecular features must be classified as active (binding to thrombin) or inactive. This is one of 5 datasets of the NIPS 2003 feature selection challenge.)

Data Set Characteristics:   Multivariate

Number of Instances:        1950

Area:                       Life

Attribute Characteristics:  Integer

Number of Attributes:       100000

Date Donated                2008-02-29

Associated Tasks:           Classification

Missing Values?             N/A

Number of Web Hits:         17103

分享到QQ

分享到微博