接线表和标准化问题
我很难弄清楚以下设计模式是否可以接受。我对关系模型有以下要求(以及其他一些要求):
1)它必须能够表示应用程序(例如 AppA
、AppB
、AppC
),每个都有它自己的一组属性。
2)每个应用程序都可以通过不同的渠道进行通信,例如互联网
(电子邮件、Twitter、Facebook)、电话
(短信、彩信等),因此有很多-节目和频道之间的一对多关系。
3) 有一组预定义的标识符(地址、电话号码、登录帐户)可以由许多程序共享,因此程序和标识符之间也存在多对多关系。
4)相同的标识符可以发送多种类型的消息,程序也可以(同样是多对多),但我需要能够在每个应用程序的基础上限制通信类型标识符的使用。
基本上,我所做的就是创建四个表:Program
、Channel
、Ident
和 CommunicationType
来存储以下信息:我没有为 (Program, Channel)
、(Program, Identifier)
等创建联结表,这只会使设计变得复杂,而是创建了一个由这四个表的主键组成的单个表具有唯一的对(程序、通道、Ident、通信类型)
的约束。现在,该表的每条记录都链接到给定的通信。
当然,这以一种非常简单的方式解决了我的问题,但现在我问自己,如果它违反了规范化原则,这是否可以接受。有人可以给我意见吗?
I am having a hard time trying to figure out if the following design pattern is acceptable. I have the following requirements (and some other more) for a relational model:
1) It must be able to represent applications (such as AppA
, AppB
, AppC
), each one with it's own set of attributes.
2) Every applications can communicate through different channels like Internet
(E-Mail, Twitter, Facebook), Phone
(SMS, MMS, etc.) so that there's a many-to-many relationship between programs and channels.
3) There is a set of pre-defined identifiers (addresses, phone-numbers, login accounts) which can be shared by many programs, so that, again, there's a many-to-many relationship between programs and identifiers.
4) The same identifier can send several types of messages, and so can the programs (again, many-to-many), but I need to be able to restrict usage of identifiers for communications type on a per-application basis.
Basically, what I did was to create four tables, Program
, Channel
, Ident
and CommunicationType
to store information about each of these and, instead of creating junction tables for (Program, Channel)
, (Program, Identifier)
, and so on which would just complicate the design, I created a single table consisting of the primary keys of these four tables with a unique constraint on (Program, Channel, Ident, CommunicationType)
. Now, each record of this table is linked to a given communication.
Of course, this solves my problem in a pretty easy way, but now I am questioning myself whether this is acceptable at all of if it defeats the principles of normalization. Can anyone please give me an opinion?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
这是个好主意。
当你设计这样的表格时,你需要小心一件事。您的结构具有键 {Program, Channel, Ident, CommunicationType},允许 Program 和 Channel、Channel 和 Ident、Program 和 CommunicationType 等各种可能的组合。有时这是一个坏主意。
这就是它成为一个坏主意的原因。您似乎是在说,并非 Ident、Program 和 CommunicationsType 的每个组合都是有效的。
将有效组合存储在自己的表中。使用外键引用来维护数据完整性。
构建一个具有键 {Program, Ident, CommunicationsType} 的表。具有键 {Program, Channel, Ident, CommunicationType} 的表可以设置对其的外键引用。
构建尽可能多的表来实现您所知道的所有约束。更多表意味着数据完整性检查更简单。 (您可能需要比我提到的更多的表。不要假设它们需要有两列;它们可能需要更多。)
您是否需要一个键为 {Program, Channel} 的表还不清楚。但如果你这样做,那么你需要按照这些思路构建表格。 (空气代码。)
根据需要添加其他列。在某些情况下,您可能会发现需要重叠的外键。我认为你在这里不需要它们,但我可能是错的。
我不确定你所说的“如果它违反了正常化原则”是什么意思。具有四列主键的表不会违反任何正常形式仅仅因为这个原因,尽管可能由于其他原因。未能实现所有已知的约束通常是,嗯,次优设计,但这并不是因为它违反了任何正常形式。
That's a fine idea.
You need to be careful of one thing when you design tables like this. Your structure, which has the key {Program, Channel, Ident, CommunicationType}, allows every possible combination of Program and Channel, of Channel and Ident, of Program and CommunicationType, and so on. Sometimes that's a bad idea.
And that's what makes it a bad idea. You seem to be saying that not every combination of Ident, Program, and CommunicationsType is valid.
Store valid combinations in their own tables. Use foreign key references to maintain data integrity.
Build a table that has the key {Program, Ident, CommunicationsType}. The table that has the key {Program, Channel, Ident, CommunicationType} can set a foreign key reference to it.
Build as many tables as it takes to implement all the constraints you know of. More tables means data integrity checks are simpler. (You might need more tables than the ones I mentioned. Don't assume they need to have two columns; they might need more.)
It's not at all clear that you need a table keyed {Program, Channel}. But if you do, then you need to build tables something along these lines. (Air code.)
Add other columns as needed. In some cases, you'll probably find that you need overlapping foreign keys. I don't think you need them here, but I could be wrong.
I'm not sure what you mean by "if it defeats the principles of normalization". A table that has a four-column primary key doesn't violate any of the normal forms for that reason alone, although it might for other reasons. Failing to implement all the known constraints is generally, um, sub-optimal design, but not because it violates any of the normal forms.
我不会这样做。
我会在每对(或 n 元组)表之间创建一个联结表。这最终将允许更简单的查询,并且还允许您根据需要在每种情况下独立于其他情况以适当的方式约束行。
您可能还会发现这些连接处需要额外的归因,例如从一个软件到另一个软件,方向性、有效负载、使用的语言、正在访问的查询点等是什么。
i would not do this.
i would create one junction table between each pair (or n-tuple) of tables. This will allow simpler querying in the end, and will also allow you to constrain the rows in a proper way as needed in each case independently of the others.
you will also probably find that extra attribution is needed on these junctions, like from one software to another, what is the directionality, payload, language used, query point being accessed etc.
很抱歉为您提供了需要更多信息的答案。在这一点上,我的声誉不允许发表任何评论……
根据解释,我看不出所选择的设计有什么问题。
然而,要真正回答您的问题,了解您选择这种设计的原因会很有用。
毕竟,即使没有包含所有键和复合唯一索引的单个表,它也可以工作。以这种方式锁定所有组合是相当有限制的。
当您找到通信时,您仍然需要加入一个或多个其他表才能访问构成该通信的信息。
为什么要以这种方式存储每个唯一的通信路径?
Sorry for providing you an answer that is asking for more information. My reputation at this point allows no comments...
I can see nothing wrong with the chosen design based on the explanation.
However, to truly answer your question it would be usefull to understand why you chose this design.
After all it would also work without the single table with all the keys and compound unique index. It is rather restrictive to have all combinations locked down this way.
When you have found the communication you will still have to join with one or more of the other tables to access the information that makes up the communication.
Why do you want to have each unique communication path stored in this manner?