Two ways of computing a t-SNE plot with cosine similarity give different plots, although the methods seem to be the same

Posted on 2025-02-07 13:32:58


I have been looking at this for the past hour but cannot seem to find the problem.
I have a list of articles, and I want to see which articles are similar to each other.

I did this by computing the cosine similarities between the TF-IDF vectors of the articles and making a t-SNE plot of the result. I did this in two ways, but to my surprise the plots are very different from each other, and I cannot tell which one is correct.

In the examples, tfdoc is the TF-IDF matrix.

from sklearn.metrics.pairwise import cosine_similarity
from sklearn import manifold

X = cosine_similarity(tfdoc, tfdoc)
model = manifold.TSNE(random_state=1, metric="precomputed")
Y = model.fit_transform(X)

When plotted, this results in:

But when I use this code:

from sklearn.manifold import TSNE

tsne = TSNE(random_state=1, metric="cosine")

embs = tsne.fit_transform(tfdoc)

It results in:

[image: https://i.sstatic.net/kcwk8.png]

Does anyone know what exactly the difference is here?

Thanks in advance!!


棒棒糖 2025-02-14 13:32:58

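The first test uses cosine similarity, while the second uses cosine distance. In general, a larger cosine distance means a smaller cosine similarity (cosine distance is 1 minus cosine similarity). With metric="precomputed", TSNE interprets X as a matrix of distances, so passing similarities reverses which articles count as close to each other.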
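To make the difference concrete, here is a minimal sketch of how the two variants could be put on the same footing. It is not the asker's exact setup: tfdoc is replaced by hypothetical random data, and perplexity=10 and init="random" are my own choices (as far as I know, recent scikit-learn versions reject the default PCA initialization when metric="precomputed"). The key change is passing cosine distances, not similarities, as the precomputed matrix.

```python
import numpy as np
from sklearn.manifold import TSNE
from sklearn.metrics.pairwise import cosine_distances, cosine_similarity

# Hypothetical stand-in for the TF-IDF matrix from the question
# (40 "articles", 20 "terms"); replace with the real tfdoc.
rng = np.random.default_rng(0)
tfdoc = rng.random((40, 20))

sim = cosine_similarity(tfdoc)   # 1.0 on the diagonal: identical direction
dist = cosine_distances(tfdoc)   # 0.0 on the diagonal: 1 - similarity

# Variant 1: hand t-SNE a precomputed *distance* matrix.
# init="random" is an assumption here; PCA init needs the raw features.
tsne_pre = TSNE(random_state=1, metric="precomputed",
                init="random", perplexity=10)
emb_precomputed = tsne_pre.fit_transform(dist)

# Variant 2: let t-SNE compute cosine distances itself from the features.
tsne_cos = TSNE(random_state=1, metric="cosine",
                init="random", perplexity=10)
emb_cosine = tsne_cos.fit_transform(tfdoc)

print(emb_precomputed.shape, emb_cosine.shape)
```

With both variants working from cosine distances and the same initialization, the two plots should show comparable structure. I would not expect them to match point for point, since small numerical differences can still change the optimization.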
