合并相同的表但保持单独的引用完整性

发布于 2024-08-04 11:20:38 字数 1213 浏览 13 评论 0原文

考虑一个具有诸如 (fk_dim1value, fk_dim2value, ..., value) 之类的事实表的维度模型，其中 fk_X 列是对应的普通维度表 dim1value 的外键（ id，value），dim2value（id，value），等。

这些事实和维度表是从不同的来源自动收集的，所以它们有很多......而且它们是多余的：所有维度值表在结构上是相同的，(id, value)，表示简单的文本值集合，没有进一步的语义（唯一的区别是在不同的事实表中引用它们的外键不同）。稍后可能会出现不太重要的维度类型，但不同类型维度的集合仍然很小。

因此，我想将维度表合并到一个表 dimvalue (fk_dim, dimvalue_id, value) 中，其中 fk_dim 引用表 dimension (dim_id, name) >，并且 dimvalue_id 仅在每个维度内是唯一的。然后，自然主键是复合的：(fk_dim, dimvalue_id)。

事实表外键列现在都引用同一个表，dimvalue (fk_dim, dimvalue_id, value) ...但是当然，每个列都与特定维度相关联，因此仍应限制为专门引用该维度的值（统一表dimvalue的水平分区）。

有没有（明智的）方法来做到这一点？

我的意思是类似“半复合”外键，即对复合 PK 的“切片”的单列引用，其他列具有固定值。 “完全复合”FK 将是 FOREIGN KEY (col1, col2) REFERENCES dimvalue (fk_dim, dimvalue_id) 但这里 fk_dim 是固定的，因此“home”一侧键只有一列，引用 dimvalue 主键的第二列；类似于FOREIGN KEY (fk_dim7value) REFERENCES dimvalue (fk_dim=7, dimvalue_id)。

这样的事情可能吗？或者我在最后一段中迷失了方向？我是否应该放弃整个 dimvalue 表的外键，然后添加检查约束以按维度进行限制？或者引用完整性是否要求我放弃更多并只接受所有单独的相同表？

（限制对写入性能的影响并不重要；读取性能是一个设计目标。）

原文

Consider a dimensional model with fact tables like (fk_dim1value, fk_dim2value, ..., value) where the fk_X columns are foreign keys into corresponding trivial dimension tables dim1value (id, value), dim2value (id, value), etc.

These fact-and-dimension tables are collected automatically from disparate sources, so there's a lot of them ... and they are redundant: all the dimension value tables are structurally identical, (id, value), representing simple collections of textual values with no further semantics (the only difference being the different foreign keys referencing them in the various fact tables). Less trivial dimension types will probably come up later, but the set of different types of dimensions will remain small.

So I want to merge the dimension tables into one table dimvalue (fk_dim, dimvalue_id, value) where fk_dim references a table dimension (dim_id, name), and dimvalue_id is unique only within each dimension. The natural primary key is then composite: (fk_dim, dimvalue_id).

The fact table foreign-key columns now all reference the same table, dimvalue (fk_dim, dimvalue_id, value) ... but of course each column is associated with a particular dimension and thus should still be limited to referencing the values of that dimension specifically (a horizontal partition of the unified table dimvalue).

Is there a (sensible) way to do this?

I mean something like a “half-composite” foreign key, i.e. a single-column reference to a “slice” of a composite PK, with a fixed value for the other column(s). A “fully-composite” FK would be FOREIGN KEY (col1, col2) REFERENCES dimvalue (fk_dim, dimvalue_id) but here fk_dim is fixed and so the “home” side of the key is just one column, referencing the second column of the dimvalue primary key; something like FOREIGN KEY (fk_dim7value) REFERENCES dimvalue (fk_dim=7, dimvalue_id).

Is something like that possible? Or am I losing my way in this last paragraph? Should I give up and just foreign-key to the whole dimvalue table and then add check constraints to limit by dimension? Or does referential integrity require me to give up even more and just accept all the separate identical tables?

(Impact of constraints on write performance is not important; read performance is a design goal.)

分享到QQ

分享到微博