当前位置：文江博客话题详情

NULL 和 FK 关系意味着什么 - 数据库

发布于 2024-07-15 21:58:55 字数 371 浏览 1 评论 0原文

我在关系 SQL 数据库中创建 FK 关系时遇到了困难，经过工作中的简短讨论，我们意识到我们有可为空的列，这很可能是导致该问题的原因。我一直认为 NULL 意味着未分配、未指定、空白等，并且确实从未见过这样的问题。

与我交谈的其他开发人员认为，处理这种情况的唯一方法是，如果两个实体之间确实存在关系，那么您必须创建一个表来连接两个实体的数据......

对我来说，这似乎很直观至少可以说，对于包含来自另一个表的 ID 的列，如果该列不为空，则它必须具有来自另一个表的 ID，但如果它为 NULL，则可以继续。这本身似乎与某些人的说法和建议相矛盾。

处理两个表之间可能存在关系的情况的最佳实践或正确方法是什么，如果指定了一个值，那么它必须在另一个表中......

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

神妖 2024-07-22 21:58:55

这是完全可以接受的，这意味着，如果该列有任何值，那么它的值必须存在于另一个表中。（我看到其他答案另有说法，但我不敢苟同。）

想象一个车辆和引擎表，引擎尚未安装在车辆中（因此车辆 ID 为空）。或者是带有主管列和公司首席执行官的员工表。

更新：根据 Solberg 的请求，这里是具有外键关系的两个表的示例，显示外键字段值可以为空。

CREATE TABLE [dbo].[EngineTable](
    [EngineID] [int] IDENTITY(1,1) NOT NULL,
    [EngineCylinders] smallint NOT NULL,
 CONSTRAINT [EngineTbl_PK] PRIMARY KEY NONCLUSTERED 
(
    [EngineID] ASC
)WITH (IGNORE_DUP_KEY = OFF) ON [PRIMARY]
) ON [PRIMARY]

CREATE TABLE [dbo].[CarTable](
    [CarID] [int] IDENTITY(1,1) NOT NULL,
    [Model] [varchar](32) COLLATE SQL_Latin1_General_CP1_CI_AS NOT NULL,
    [EngineID] [int] NULL
 CONSTRAINT [PK_UnitList] PRIMARY KEY CLUSTERED 
(
    [CarID] ASC
)WITH (IGNORE_DUP_KEY = OFF) ON [PRIMARY]
) ON [PRIMARY]

ALTER TABLE [dbo].[CarTable]  WITH CHECK ADD CONSTRAINT [FK_Engine_Car] FOREIGN KEY([EngineID])
REFERENCES [dbo].[EngineTable] ([EngineID])


Insert Into EngineTable (EngineCylinders) Values (4);
Insert Into EngineTable (EngineCylinders) Values (6);
Insert Into EngineTable (EngineCylinders) Values (6);
Insert Into EngineTable (EngineCylinders) Values (8);

-- 现在一些测试：

Insert Into CarTable (Model, EngineID) Values ('G35x', 3);  -- References the third engine

Insert Into CarTable (Model, EngineID) Values ('Sienna', 13);  -- Invalid FK reference - throws an error

Insert Into CarTable (Model) Values ('M');  -- Leaves null in the engine id field & does NOT throw an error

It's perfectly acceptable, and it means that, if that column has any value, its value must exist in another table. (I see other answers asserting otherwise, but I beg to differ.)

Think a table of Vehicles and Engines, and the Engines aren't installed in a Vehicle yet (so VehicleID is null). Or an Employee table with a Supervisor column and the CEO of the company.

Update: Per Solberg's request, here is an example of two tables that have a foreign key relationship showing that the foreign key field value can be null.

CREATE TABLE [dbo].[EngineTable](
    [EngineID] [int] IDENTITY(1,1) NOT NULL,
    [EngineCylinders] smallint NOT NULL,
 CONSTRAINT [EngineTbl_PK] PRIMARY KEY NONCLUSTERED 
(
    [EngineID] ASC
)WITH (IGNORE_DUP_KEY = OFF) ON [PRIMARY]
) ON [PRIMARY]

CREATE TABLE [dbo].[CarTable](
    [CarID] [int] IDENTITY(1,1) NOT NULL,
    [Model] [varchar](32) COLLATE SQL_Latin1_General_CP1_CI_AS NOT NULL,
    [EngineID] [int] NULL
 CONSTRAINT [PK_UnitList] PRIMARY KEY CLUSTERED 
(
    [CarID] ASC
)WITH (IGNORE_DUP_KEY = OFF) ON [PRIMARY]
) ON [PRIMARY]

ALTER TABLE [dbo].[CarTable]  WITH CHECK ADD CONSTRAINT [FK_Engine_Car] FOREIGN KEY([EngineID])
REFERENCES [dbo].[EngineTable] ([EngineID])


Insert Into EngineTable (EngineCylinders) Values (4);
Insert Into EngineTable (EngineCylinders) Values (6);
Insert Into EngineTable (EngineCylinders) Values (6);
Insert Into EngineTable (EngineCylinders) Values (8);

-- Now some tests:

Insert Into CarTable (Model, EngineID) Values ('G35x', 3);  -- References the third engine

Insert Into CarTable (Model, EngineID) Values ('Sienna', 13);  -- Invalid FK reference - throws an error

Insert Into CarTable (Model) Values ('M');  -- Leaves null in the engine id field & does NOT throw an error

回复收藏 0 原文

↘人皮目录ツ 2024-07-22 21:58:55

我认为这场辩论是对象关系阻抗不匹配的另一个副产品。一些 DBA 类型会迂腐地说，基于对关系代数语义的更深入理解，FK 中永远不允许 null，但应用程序开发人员会认为这使他们的领域层更加优雅。

“尚未建立”关系的用例是有效的，但对于空 FK，一些人发现它通过引入更复杂的 SQL 功能（特别是 LEFT JOIN）增加了查询的复杂性。

我见过的一种常见的替代解决方案是在每个表中引入一个“空行”或“哨兵行”，其中 pk=0 或 pk=1（基于 RDBMS 支持的内容）。这允许您设计一个具有“尚未建立”关系的域层，但也避免引入 LEFT JOIN，因为您保证总会有一些东西可以加入。

当然，这种方法也需要勤奋，因为您基本上是在权衡 LEFT JOIN ，以便必须检查查询中哨兵行的存在，这样您就不会更新/删除它，等等。权衡是否合理是另一件事。我倾向于同意，仅仅为了避免更花哨的连接而重新发明 null 似乎有点愚蠢，但我也在一个应用程序开发人员无法赢得与 DBA 辩论的环境中工作。

编辑

我删除了一些“事实上”的措辞，并试图澄清“失败”连接的含义。 @wcoenen 的例子是我个人最常听到的避免 null FK 的原因。这并不是说它们像“破碎”那样失败了，而是失败了——有些人会认为——未能遵守最小意外原则。

另外，我把这个回复变成了一个维基，因为我基本上已经把它从原来的状态中删除了，并从其他帖子中借用了。

回复收藏 0 原文

残花月 2024-07-22 21:58:55

我强烈支持在 OLTP 系统中使用外键中的 NULL 来指示无父项的论点，但在决策支持系统中它很少能很好地工作。最合适的做法是使用特殊的“不适用”（或类似）值作为子记录（在事实表中）可以链接到的父项（在维度表中）。

原因是，向下钻取/交叉等的探索性质可能会导致用户在仅仅要求提供更多信息时不理解指标如何变化。例如，当财务数据集市包含产品销售和其他收入来源的组合时，深入到“产品类型”应该将非产品销售相关数据分类，而不是让这些数字从报告中删除，因为事实表和产品维度表之间没有连接。

回复收藏 0 原文

奶茶白久 2024-07-22 21:58:55

当外键是复合外键时，会出现在外键列中允许空值的问题。如果两列之一为空，这意味着什么？另一列是否必须与引用表中的任何内容匹配？通过简单的（单列）外键约束，您可以摆脱空值。

另一方面，如果两个表之间的关系是有条件的（两个实体都可以单独存在，但可能几乎巧合地相关），那么最好使用“连接表”（包含一个表）来对其进行建模。 FK 到引用表，另一个到引用表，并且具有自己的主键作为两个 FK 的组合。

作为连接表的示例，假设您的数据库包含俱乐部和人员表。有些人属于某些俱乐部。连接表将是club_members，并且将包含引用“people”表的人员的FK，并且将包含该人员所属俱乐部的另一个FK，并且人员和俱乐部的标识符的组合将是主键连接表。（连接表的另一个名称是“关联”或“关联”表。）

回复收藏 0 原文

天涯沦落人 2024-07-22 21:58:55

我倾向于传达该专栏含义的设计。就域而言，空值可能意味着任意数量的事物。在相关表中放置一个表示“不需要”或“未选择”的值至少可以传达目的，而无需询问开发人员或查阅文档。

回复收藏 0 原文

悲凉≈ 2024-07-22 21:58:55

假设您需要生成所有客户的报告。每个客户都有一个国家/地区的 FK，并且国家/地区数据需要包含在报告中。现在假设您允许 FK 为 null，并且执行以下查询：

SELECT * FROM customer, country WHERE customer.countryID = country.ID

任何国家/地区 FK 为 null 的客户都将从报告中默默忽略（您需要使用 LEFT JOIN 来修复它）。我发现这不直观且令人惊讶，因此我不喜欢 NULL FK，并在我的数据库模式中避免使用它们。相反，我使用哨兵值，例如特殊的“未知国家”。

Suppose you would need to generate a report of all customers. Each customer has a FK to a country and the country data needs to be included in the report. Now suppose you allow the FK to be null, and you do the following query:

SELECT * FROM customer, country WHERE customer.countryID = country.ID

Any customer where the country FK is null would be silently omitted from the report (you need to use LEFT JOIN instead to fix it). I find this unintuitive and surprising, so I don't like NULL FKs and avoid them in my database schemas. Instead I use sentinel values, e.g. a special "unkown country".

回复收藏 0 原文

时间海 2024-07-22 21:58:55

CREATE TABLE [tree]
{
    [id] int NOT NULL,
    [parent_id] int NULL
};

ALTER TABLE [tree] ADD CONSTRAINT [FK_tree_tree] FOREIGN KEY([parent_id])
REFERENCES [tree] ([id]);

这并没有什么问题！根节点将永远有一个 NULL 父节点，这不是“尚未建立”关系的情况。这里的连接也没有问题。

让根节点指向自身作为父节点以避免 NULL FK 或任何其他创造性的解决方法，意味着现实世界不再在数据库中准确建模。

没有人提到的一个潜在问题是包含大量 NULL 值的列上的索引性能。虽然这本身与外键问题无关，但它可能会使连接表现不佳。

我确实明白，如果您是一名 DBA，正在处理拥有数亿行的超大型数据库，您将不会需要 NULL 外键，因为它们根本无法执行。但事实是，大多数开发人员一生中永远不会使用如此大的数据库，而今天的数据库可以处理几十万行的这种情况。强调一个（糟糕的）比喻，我们大多数人都不会驾驶 F1 赛车，而我妻子的雅阁中的自动变速箱可以很好地完成它需要做的事情（或者至少，它曾经是这样，直到几周前它坏了） ...）。

CREATE TABLE [tree]
{
    [id] int NOT NULL,
    [parent_id] int NULL
};

ALTER TABLE [tree] ADD CONSTRAINT [FK_tree_tree] FOREIGN KEY([parent_id])
REFERENCES [tree] ([id]);

There is nothing wrong with this! The root node will eternally have a NULL parent, and this is not a case of a "not yet established" relationship. No problem with joins here, either.

Having the root node point to itself as the parent to avoid the NULL FK, or any other creative workaround, means that the real world is no longer accurately modeled in the database.

The one potential issue that nobody mentioned is with index performance on columns that contain lots of NULL values. This per se has nothing to do with the foreign key question, though, but it can make joins perform poorly.

I do understand that if you are a DBA working with ultra-large databases that have hundreds of millions of rows, you would not want NULL foreign keys, because they would simply not perform. The truth is, though, that most developers will never work with such large databases in their lifetime, and today's databases can handle such a situation just fine with a few hundred thousand rows. To stress a (poor) metaphor, most of us so not drive F1 race cars, and the automatic transmission in my wife's Accord does what it needs to do just fine (or at least, it used to, until it broke a few weeks ago ...).

回复收藏 0 原文