matlab中的模糊c均值tcp转储聚类
您好,我有一些数据是这样表示的:
0,tcp,http,SF,239,486,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,8,8,0.00,0.00,0.00,0.00,1.00,0.00,0.00,19,19,1.00,0.00,0.05,0.00,0.00,0.00,0.00,0.00,normal.
它来自 1999 年 kdd 杯,基于 darpa 集。
我的文本文件有像这样的行和行数据,在 matlab 中,您可以通过键入 findcluster 来使用通用聚类工具,但它只接受 .dat 文件。
我也不太确定它是否会接受这样的格式。我也不确定为什么转储文件中有这么多尾随零。
谁能帮助我如何利用文本文档并通过 matlab 中的 fcm 聚类方法运行它?确实需要代码帮助。
Hi I have some data thats represented like this:
0,tcp,http,SF,239,486,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,8,8,0.00,0.00,0.00,0.00,1.00,0.00,0.00,19,19,1.00,0.00,0.05,0.00,0.00,0.00,0.00,0.00,normal.
Its from the kdd cup 1999 which was based on the darpa set.
the text file I have has rows and rows of data like this, in matlab there is the generic clustering tool you can use by typing findcluster but it only accepts .dat files.
Im also not very sure if it will accept the format like this. Im also not sure why there is so many trailing zeros in the dump files.
Can anyone help how I can utilise the text document and run it thru a fcm clustering method in matlab? Code help is really needed.
FINDCLUSTER 只是两个聚类算法的 GUI 界面:FCM 和 SUBCLUST
您首先需要从文件中读取数据,查看 TEXTSCAN 函数。
然后你需要处理非数字属性;要么删除它们,要么以某种方式转换它们。据我所知,提到的两种算法仅支持数字数据。
访问KDD cup数据集的原始网站,了解每个属性。
FINDCLUSTER is simply a GUI interface for two clustering algorithms: FCM and SUBCLUST
You first need to read the data from file, look into the TEXTSCAN function for that.
Then you need to deal with non-numeric attributes; either remove them or convert them somehow. As far as I can tell, the two algorithms mentioned only support numeric data.
Visit the original website of the KDD cup dataset to find out the description of each attribute.