Is it reasonable to divide data into different tables based on a column value?

Posted 2024-09-28 14:22:51 · 776 words · 3 views · 0 comments


If I have a large table with a column which has a rather limited range of values (e.g. < 100), is it reasonable to divide this table into several tables with names tied to that column value?

E.g. a table with columns like:

table "TimeStamps": [Id] [DeviceId] [MessageCounter] [SomeData]

where [DeviceId] is the "limited range" column, would be separated into several different tables:

table "TimeStamps1": [Id] [MessageCounter] [SomeData]
table "TimeStamps2": [Id] [MessageCounter] [SomeData]
...
table "TimeStampsN": [Id] [MessageCounter] [SomeData]

The problem I am having with my original table is that finding the largest MessageCounter value for some DeviceId values takes a really long time to execute (see this post).

If the tables were separated, finding the maximum value of the column should be an O(1) operation.

[Edit]

Just stumbled upon this, thought I would update it. The problem that originally brought me here was performance issues when querying the original database. However, after adding additional db indexes and scheduling index reorganization jobs, I was able to get great performance with the normalized form. The SSMS Database Engine Tuning Advisor tool was of great help in identifying bottlenecks and suggesting the missing indexes.
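
For anyone landing here later, below is a rough sketch of the kind of index that can make a per-device maximum cheap without splitting the table. It is only an illustration: it uses Python's built-in sqlite3 module rather than SQL Server, and the index name IX_TimeStamps_Device_Counter, its column order, and the sample data are my assumptions, not the exact indexes the Tuning Advisor suggested.

```python
import sqlite3

# In-memory SQLite database standing in for the real server (assumption).
conn = sqlite3.connect(":memory:")

# Single normalized table, same columns as in the question.
conn.execute(
    "CREATE TABLE TimeStamps ("
    " Id INTEGER PRIMARY KEY,"
    " DeviceId INTEGER NOT NULL,"
    " MessageCounter INTEGER NOT NULL,"
    " SomeData TEXT)"
)

# Hypothetical composite index: rows are ordered by DeviceId first, then by
# MessageCounter, so the largest counter of a device can be found by an
# index lookup instead of a full table scan.
conn.execute(
    "CREATE INDEX IX_TimeStamps_Device_Counter"
    " ON TimeStamps (DeviceId, MessageCounter)"
)

# Made-up sample data: device d has counters 1 .. d * 100.
rows = [(d, c, "payload") for d in range(1, 6) for c in range(1, d * 100 + 1)]
conn.executemany(
    "INSERT INTO TimeStamps (DeviceId, MessageCounter, SomeData)"
    " VALUES (?, ?, ?)",
    rows,
)

MAX_QUERY = "SELECT MAX(MessageCounter) FROM TimeStamps WHERE DeviceId = ?"


def max_counter(device_id):
    """Return the largest MessageCounter for a device, or None if it has no rows."""
    return conn.execute(MAX_QUERY, (device_id,)).fetchone()[0]


# Ask SQLite how it would run the query; the last field of each plan row
# is a human-readable description of the step.
plan = conn.execute("EXPLAIN QUERY PLAN " + MAX_QUERY, (3,)).fetchall()
plan_text = " ".join(str(step[-1]) for step in plan)

print(max_counter(3))  # 300 with the sample data above
print(plan_text)       # should mention IX_TimeStamps_Device_Counter
```

With an index like this the lookup is a tree seek rather than a true O(1) operation, but it stays fast as the table grows, and the DeviceId column remains available for ordinary queries across all devices.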
