索引视图：如何选择聚集索引？

发布于 2024-08-20 12:59:04 字数 234 浏览 13 评论 0原文

我将基于三个表（SQL Server 2005）创建一个索引视图，这些表之间具有内部和外部联接。我将针对此视图运行所有类型的查询。所以，我想知道选择要聚集的索引的最佳方法是什么。标准是什么，或者有什么工具可以帮助我。

（抱歉，如果我的问题很无聊，我在设计数据库方面没有很多经验）。

提前致谢！

编辑：我应该在这里澄清一下，我在视图中使用的表的使用非常频繁，我为维护索引而花费的任何开销都应该得到回报。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

墟烟 2024-08-27 12:59:04

由于它是一个索引，因此您必须选择一个列（或一组列），该列在所有情况下都保证为非空且唯一。这是最大、最严格的标准 - 任何可能为 NULL 或重复的内容从一开始就是不可能的。

根据您将在此索引视图上运行的查询类型，您可能还想查看是否有任何要对其运行范围查询的列（例如日期或其他列）。这可能会成为一个有趣的聚类键候选者。

但最重要的是：您的集群键必须在任何情况下都是唯一且非空的。根据我个人的经验，为了减少索引大小（从而增加每页的条目数），我会尝试使用尽可能小的键 - 单个 INT 最好，或者两个 INT 的组合 - 或者可能GUID - 但不要在集群键中使用 VARCHAR(500) 字段！

更新：致所有那些不断告诉我们聚集索引不必是唯一的发帖者 - 看看“索引女王”Kimberly Tripp 对此主题的看法：

让我们从我知道的关键事情开始
在聚类键中查找：
<前><代码>* 唯一
* 狭窄的
* 静止的
为什么独特？
聚类键应该是
唯一的，因为聚类键（当
存在一个）用作查找键
来自所有非聚集索引。拿
例如a后面的索引
书籍 - 如果您需要查找数据
索引条目指向 - 那
条目（索引条目）必须是唯一的
否则，哪个索引条目将是
您要找的是哪一位？所以，当
您创建聚集索引 - 它
必须是唯一的。但是，SQL Server
不需要你的聚类
键是在唯一列上创建的。你
可以在您想要的任何列上创建它
喜欢。在内部，如果聚类
key 不唯一，那么 SQL Server 将
通过添加 4 字节来“唯一化”它
数据的整数。因此，如果
聚集索引创建于
不独特的东西
只是有额外的开销
创建索引，浪费磁盘
空间、INSERT 的额外成本和
更新，在 SQL Server 2000 中，
clustereD 会产生额外成本
索引重建（这是因为
聚类键的糟糕选择是
现在更有可能）。

资料来源： http ://www.sqlskills.com/blogs/kimberly/post/Ever-increasing-clustering-key-the-Clustered-Index-Debateagain!.aspx

Since it's an index, you have to pick a column (or set of columns) which is guaranteed to be non-null and unique in all cases. That's the biggest and most stringent criteria - anything that might be NULL or duplicate is out of the question right from the get-go.

Depending on the type of queries you'll be running on this indexed view, you might also want to see if you have any columns (e.g. a DATE or something) that you'll be running range queries against. That might make an interesting candidate for a clustering key.

But the main thing is: your clustering key must be unique and non-null in any circumstance. And in my personal experience, to reduce index size (and thus increase the number of entries per page), I'd try to use as small a key as possible - a single INT is best, or a combination of two INTs - or possibly a GUID - but don't use VARCHAR(500) fields in your clustering key!

UPDATE: to all those poster who keep telling us clustered indexes don't need to be unique - check out what the "Queen of Indexing", Kimberly Tripp, has to say on the topic:

Let's start with the key things that I
look for in a clustering key:
* Unique
* Narrow
* Static
Why Unique?
A clustering key should be
unique because a clustering key (when
one exists) is used as the lookup key
from all non-clustered indexes. Take
for example an index in the back of a
book - if you need to find the data
that an index entry points to - that
entry (the index entry) must be unique
otherwise, which index entry would be
the one you're looking for? So, when
you create the clustered index - it
must be unique. But, SQL Server
doesn't require that your clustering
key is created on a unique column. You
can create it on any column(s) you'd
like. Internally, if the clustering
key is not unique then SQL Server will
“uniquify” it by adding a 4-byte
integer to the data. So if the
clustered index is created on
something which is not unique then not
only is there additional overhead at
index creation, there's wasted disk
space, additional costs on INSERTs and
UPDATEs, and in SQL Server 2000,
there's an added cost on a clustereD
index rebuild (which because of the
poor choice for the clustering key is
now more likely).

Source: http://www.sqlskills.com/blogs/kimberly/post/Ever-increasing-clustering-key-the-Clustered-Index-Debateagain!.aspx

回复收藏 0 原文

墨落画卷 2024-08-27 12:59:04

拇指法则：
选择您可能在查询中最常使用的列，如 WHERE、GROUP 等。这些列可能是非聚集索引的良好候选列。选择一列（或一组列），这可能会使您的行变得唯一，并且可能是聚集索引的良好候选者。

正如 marc 所提到的，聚集索引施加了唯一约束，因此它绝对需要您选择的列不应该有任何空值和重复项。

回复收藏 0 原文

安静被遗忘 2024-08-27 12:59:04

聚集索引不必是唯一的。其中的列甚至可以为空。例如，这将运行而不会出现错误：

create table  #test (col1 int identity, col2 int)
create clustered index ix_test on #test (col2)
insert into #test (col2) values (1)
insert into #test (col2) values (1) -- Duplicate in clustered index
insert into #test (col2) values (null)

聚集索引是磁盘上表结构的一部分。因此，聚集索引不使用额外的磁盘空间。

默认情况下，SQL Server 在主键上集群，这通常是一个不错的选择。如果您有大量表查找的密集查询，您可以更改它。更改聚集索引可以消除表查找。

A clustered index does not have to be unique. The columns in it can even be nullable. For example, this will run without an error:

create table  #test (col1 int identity, col2 int)
create clustered index ix_test on #test (col2)
insert into #test (col2) values (1)
insert into #test (col2) values (1) -- Duplicate in clustered index
insert into #test (col2) values (null)

A clustered index is part of the table structure on disk. As such, a clustered index uses no additional disk space.

By default, SQL Server clusters on the primary key, which is usually a good choice. You can change that if you have intensive queries with a lot of table lookups. Changing which index is clustered can eliminate table lookups.

回复收藏 0 原文

~没有更多了~