当前位置：文江博客话题详情

Python k-means cluster-computing cluster-analysis hyperparameters

寻找Kmeans聚类的优化数字

发布于 2025-02-05 20:14:47 字数 398 浏览 4 评论 0原文

我有一个具有长2D数组的文本文件。每个元素的数字在1到6之间。

我可以使用以下帖子中提供的指南聚集数据：

在此处输入链接描述，

但我想知道如何使群集选择群集数量的“ N_Clusters”的值，而无需我选择了这个价值。

我尝试了肘方法，但是到目前为止，我看到的示例使用图纸来选择最佳簇数。我的问题是：如何在没有视觉检查的情况下找到“簇数”的最佳值？

收藏 0

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

评论（1）

梅窗月明清似水 2025-02-12 20:14:47

我尝试了肘方法，但是我到目前为止看到的示例
绘图以选择最佳簇数。我的问题是：如何
为了找到没有视觉的“群集数”的最佳值
检查？

“肘方法”只是搜索退货减少的点。

虽然可以在视觉上设置超参数，但这只是检查最新改进是否显着的问题（即越过一些阈值）。

构建一个 k 值的表，计算 error 和百分比改进从上一步：

如果您的改善阈值为10％，则可以在k = 5停止。之后，这些改进正在减少（即倾向于过度合身，无法概括）。

在Python中，看起来像这样：

for k in range(min(errors), max(errors)):
    improvement = (errors[k] - errors[k+1]) / errors[k]
    if improvement < threshold:
        print('Diminishing returns after k =', k)
        break

输出：

Diminishing returns after k = 5

I tried the elbow method but the examples that I saw so far they use
drawing to choose the optimal number of clusters. My question is: how
to find the optimum value for "number of clusters" without a visual
check?

The "elbow method" is just a search for the point of diminishing returns.

While setting hyper-parameters can be done visually, it is just a matter of checking the whether the most recent improvement is significant (i.e. crossing some threshold).

Build a table of the k values, the computed error, and the percent improvement from the preceding step:

If your threshold for improvement is 10%, you can stop at k=5. After that, the improvements are diminishing (i.e. tending to overfit and failing to generalize).

In Python, it would look like this:

for k in range(min(errors), max(errors)):
    improvement = (errors[k] - errors[k+1]) / errors[k]
    if improvement < threshold:
        print('Diminishing returns after k =', k)
        break

That outputs:

Diminishing returns after k = 5

回复收藏 0 原文

~没有更多了~

关于作者

暂无简介

文章

评论

29 人气

关注发私信

相关话题

热门标签

操作系统程序设计 IT运维 Linux系统管理 JavaScript 服务器应用 solaris C/C++ PHP Shell BSD Vue.js aix Oracle Python HTML 系统管理 HTML5 CSS 前端

推荐作者

alipaysp_snBf0MSZIv

文章 0 评论 0

梦断已成空

文章 0 评论 0

瞎闹

文章 0 评论 0

凯凯我们等你回来

文章 0 评论 0

寄意

文章 0 评论 0

似梦非梦

文章 0 评论 0

友情链接

我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的隐私政策了解更多相关信息。单击 接受 或继续使用网站，即表示您同意使用 Cookies 和您的相关数据。

原文