取决于聚类成员资格的颜色图散点图

发布于 2025-01-16 16:48:22 字数 267 浏览 5 评论 0原文

我对数据集进行软聚类，我想创建一个很酷的图形，看起来与发布的图像相似。我想以图形形式显示两个（或更多集群）之间的数据点成员资格。不过我不太确定该怎么做。我已经使用标准为数据点分配颜色，但不确定如何创建如下所示的更动态的图形。任何帮助表示赞赏。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

请帮我爱他 2025-01-23 16:48:23

我认为标记正是您要寻找的东西：

x1 = y1 = 1
x2 = y2 = 2

dx = np.random.rand(10)
dy = np.random.rand(10)

x = np.array([x1 + dx, x2 + dx]).ravel()
y = np.array([y1 + dy, y2 + dy]).ravel()

threshold = 4
markers = np.array(["o" if xy > threshold else "h" for xy in x + y])


fig, ax = plt.subplots()
for marker in np.unique(markers):
    index = markers == marker 
    ax.scatter(x[index], y[index], marker=marker)

添加一些额外的代码来控制颜色和透明度（alpha）

import numpy as np
import matplotlib.pyplot as plt


x1 = y1 = 1
x2 = y2 = 2

dx = np.random.rand(10)
dy = np.random.rand(10)

x = np.array([x1 + dx, x2 + dx]).ravel()
y = np.array([y1 + dy, y2 + dy]).ravel()

threshold = 4
markers = np.array(["o" if xy > threshold else "h" for xy in x + y])

blue_color = "midnightblue" # predefined
pink_color = "orchid"  
colors = [blue_color if marker == "o" else pink_color for marker in markers]

alphas = np.array([abs(xy - threshold) for xy in x + y])
alphas = 1 - alphas/np.max(alphas) 


fig, ax = plt.subplots()
for i in range(len(x)):
    ax.scatter(x[i], y[i], marker=markers[i], color=colors[i], alpha=alphas[i])

I think markers are just the thing your looking for:

x1 = y1 = 1
x2 = y2 = 2

dx = np.random.rand(10)
dy = np.random.rand(10)

x = np.array([x1 + dx, x2 + dx]).ravel()
y = np.array([y1 + dy, y2 + dy]).ravel()

threshold = 4
markers = np.array(["o" if xy > threshold else "h" for xy in x + y])


fig, ax = plt.subplots()
for marker in np.unique(markers):
    index = markers == marker 
    ax.scatter(x[index], y[index], marker=marker)

Adding someaditional code to control color and transparency (alpha)

import numpy as np
import matplotlib.pyplot as plt


x1 = y1 = 1
x2 = y2 = 2

dx = np.random.rand(10)
dy = np.random.rand(10)

x = np.array([x1 + dx, x2 + dx]).ravel()
y = np.array([y1 + dy, y2 + dy]).ravel()

threshold = 4
markers = np.array(["o" if xy > threshold else "h" for xy in x + y])

blue_color = "midnightblue" # predefined
pink_color = "orchid"  
colors = [blue_color if marker == "o" else pink_color for marker in markers]

alphas = np.array([abs(xy - threshold) for xy in x + y])
alphas = 1 - alphas/np.max(alphas) 


fig, ax = plt.subplots()
for i in range(len(x)):
    ax.scatter(x[i], y[i], marker=markers[i], color=colors[i], alpha=alphas[i])

回复收藏 0 原文

萌化 2025-01-23 16:48:23

scikit-learn 中的 GaussianMixture 所做的事情与问题所要求的很接近。

具体来说，predict_proba(X) 返回一个数组，其中包含 X 中每个点属于该分量的概率。在下面的示例中，我们拟合了两个混合组件，因此最后两个图应该彼此相反：

from sklearn.mixture import GaussianMixture
from sklearn.datasets import make_moons
import matplotlib.pyplot as plt

X, _ = make_moons(noise=0.05)

mix = GaussianMixture(n_components=2).fit(X)
probs = mix.predict_proba(X)

fig, ax = plt.subplots(1, 3, sharey=True)
ax[0].scatter(X[:, 0], X[:, 1])
ax[1].scatter(X[:, 0], X[:, 1], c=probs[:, 0])
ax[2].scatter(X[:, 0], X[:, 1], c=probs[:, 1])
plt.show()

The GaussianMixture in scikit-learn does something close to what the question asks.

Specifically, predict_proba(X) returns an array with the probability of each point in X belonging to the component. In the example below we fit two mixture components, so the last two plots should be opposites of each other:

from sklearn.mixture import GaussianMixture
from sklearn.datasets import make_moons
import matplotlib.pyplot as plt

X, _ = make_moons(noise=0.05)

mix = GaussianMixture(n_components=2).fit(X)
probs = mix.predict_proba(X)

fig, ax = plt.subplots(1, 3, sharey=True)
ax[0].scatter(X[:, 0], X[:, 1])
ax[1].scatter(X[:, 0], X[:, 1], c=probs[:, 0])
ax[2].scatter(X[:, 0], X[:, 1], c=probs[:, 1])
plt.show()

回复收藏 0 原文

~没有更多了~