当前位置：文江博客话题详情

performance tags database metadata normalization

用于多实体高性能标记的数据库

发布于 2024-11-15 15:04:47 字数 777 浏览 8 评论 0原文

我正在为社交应用程序设计一个数据库，并试图确定我的方法是否 1) 表现良好，2) 是否正确标准化？

我对标签查询性能和数据库设计的研究得出的结论是，具有全文索引搜索的单个标签表可产生最佳性能。

请参阅：http://tagging.pui.ch/post/37027746608/ tagsystems-performance-tests

我知道我可以（并且应该从纯粹规范化的角度来看）将标签放在一个单独的表中，每个标签都有一个键，但是随着数据库变大，性能会受到影响（根据链接的文章）。标签搜索是我的应用程序的关键组件，必须表现良好。

下面的结构说明了我设计的使用桥接元数据表的基本方法，我希望使用这个单个表来桥接更多的“对象表”，但我只提供了几个来说明这一点：

Users Table: UserID PK、用户名等

博客表：BlogID PK、UserID FK、BlogTxt 等

照片表：PhotoID PK、UserID FK、PhotoPath 等

元数据表：MetadataID PK、UserID FK、ObjectTable（帖子或博客）、ObjectID FK（PostID 或 BlogID）、标签（tag1、tag2、tag3）

除了上述问题之外，我也有兴趣知道是否有更好的替代方案。我是数据库设计的新手，所以请原谅我对正确执行此操作的方法的严重无知。非常感谢。

收藏 0

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

评论（1）

我做我的改变 2024-11-22 15:04:47

我对标签查询性能和数据库设计的研究得出的结论是，具有全文索引搜索的单个标签表可产生最佳性能。

这实际上是不正确的...

您可以获得的最佳性能是切换到具有数组类型和位图索引扫描的数据库引擎，在 int[] array 使用触发器列，并在其上添加适当的索引（gin、gist、rtree）。

这允许编写查询（下面的 Postgres 语法），例如：

create index on posts using gin (tags);

-- bitmap AND/OR index scan on posts
-- has 1 or 2 or 3 or any of 4, 5, 6 without 7 or 8
select *
from posts
where tags && array[1,2,3]
or tags && array[4,5,6] and not tags && array[7,8]

上面的内容将消除您可以想到的使用 MySQL 的任何潜在优化。

My research on tag query performance and db design concluded that a single tags table with full text index search yields the best performance.

This is actually incorrect...

The best performance you can get is to switch to a database engine that has an array type and bitmap index scans, maintain an aggregate of your tags in an int[] array column using triggers, and add an appropriate index (gin, gist, rtree) on it.

This allows to write queries (Postgres syntax below) such as:

create index on posts using gin (tags);

-- bitmap AND/OR index scan on posts
-- has 1 or 2 or 3 or any of 4, 5, 6 without 7 or 8
select *
from posts
where tags && array[1,2,3]
or tags && array[4,5,6] and not tags && array[7,8]

The above will blow away any potential optimization you can think of using MySQL.

回复收藏 0 原文

~没有更多了~

关于作者

暂无简介

0 文章

0 评论

21356 人气

关注发私信

相关话题

热门标签

操作系统程序设计 IT运维 Linux系统管理 JavaScript 服务器应用 solaris C/C++ PHP Shell BSD Vue.js aix Oracle Python HTML 系统管理 HTML5 CSS 前端

推荐作者

游缘惊梦

文章 0 评论 0

小兔几

文章 0 评论 0

Glik

文章 0 评论 0

生生漫

文章 0 评论 0

Luxian

文章 0 评论 0

Champion-Ming

文章 0 评论 0

友情链接

我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的隐私政策了解更多相关信息。单击 接受 或继续使用网站，即表示您同意使用 Cookies 和您的相关数据。

原文