Multi-tenant or not multi-tenant?

Posted 2024-10-31 11:56:27

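I need to make a tough database design decision about multi-tenancy for a client's web-based CRM that I actively maintain, as its number of branches keeps growing.

Early on I decided to use a separate application and a separate database for each branch, because that was the easiest way to serve three distinct branches with different data and code requirements. I also wanted to avoid managing a tenant ID in every query, as I had to in a legacy classic ASP (cringe) application I built in 2007... the horror.

But now the branches' data requirements are converging, and as the business expands I need to be able to roll out new branches quickly and share global product SKUs.

Since the tables and views are the same for all branches, and better ORM tools for managing multi-tenant applications are now available, I'm wondering whether a single shared database for all branches would be better.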

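Considerations for a centralized database:

Global product SKUs
Simplified inventory requisitions
Easier to back up
Deploy once, instead of once per branch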

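Considerations for decentralized databases:

Easier to accommodate differing branch requirements with separate databases
Modular deployment (one broken branch doesn't break them all)
A shared database is harder to manage and develop against
I would have to redesign invoice numbering (sequences generated from a seed)
Fewer WHERE clauses everywhere
In a shared database, restoring one broken branch has many implications for the other branches

There are unlikely to be as many as 10 branches. There are currently 3.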

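Developers with real-world experience in this area: what would you do in my situation? Keep the applications and databases separate, or merge them into one giant system?

Edit: The Microsoft article on the pros and cons of multi-tenancy is great. I should point out that data isolation between branches is not a major concern.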

咋地 2024-11-07 11:56:27

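Bite the bullet and merge them. Add your tenant ID where needed and change your queries.

For customization, look into a plugin-type architecture that lets you deploy specific screens for specific clients.

We have a software product built exactly this way. Sometimes it is deployed at the client's site, and sometimes we host it. For all intents and purposes, dealing with a single code base that has client-specific extensions is an order of magnitude easier than dealing with multiple branches of the code.

For one, when we fix a problem, we fix it for everyone. Granted, if we break something, we break it for everyone, but that is what unit tests are for. And maintaining one set of unit tests against one code base is far easier than maintaining them across multiple branches.

We've been doing multi-tenancy for more than 10 years, and I haven't looked back once. Generally speaking, the queries aren't that different if you are already security-conscious about verifying that the person retrieving a record actually has the right to get it.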
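
As a rough illustration of the tenant-ID approach, here is a minimal Python sketch using the standard `sqlite3` module. The `invoices` table, the `tenant_id` column name, and the `TenantScope` wrapper are all assumptions made up for this example, not anything from the poster's CRM. The idea is only that the tenant filter can live in one place rather than in every caller.

```python
import sqlite3


def make_db() -> sqlite3.Connection:
    """Build an in-memory shared database with a tenant_id on every row."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE invoices ("
        " id INTEGER PRIMARY KEY,"
        " tenant_id INTEGER NOT NULL,"
        " total REAL NOT NULL)"
    )
    # Hypothetical sample data: two invoices for tenant 1, one for tenant 2.
    conn.executemany(
        "INSERT INTO invoices (tenant_id, total) VALUES (?, ?)",
        [(1, 100.0), (1, 250.0), (2, 75.0)],
    )
    return conn


class TenantScope:
    """Wraps a connection so every query is filtered by one tenant_id."""

    def __init__(self, conn: sqlite3.Connection, tenant_id: int):
        self.conn = conn
        self.tenant_id = tenant_id

    def invoices(self):
        # The tenant filter is written once here, not in every caller.
        rows = self.conn.execute(
            "SELECT id, total FROM invoices WHERE tenant_id = ? ORDER BY id",
            (self.tenant_id,),
        )
        return rows.fetchall()

    def add_invoice(self, total: float) -> int:
        # New rows are always stamped with the scope's own tenant_id.
        cur = self.conn.execute(
            "INSERT INTO invoices (tenant_id, total) VALUES (?, ?)",
            (self.tenant_id, total),
        )
        return cur.lastrowid
```

A caller would create one `TenantScope(conn, tenant_id)` per request after authenticating the user, so application code never writes the `WHERE tenant_id = ?` clause by hand. Most ORMs offer some equivalent of a global filter, but the details vary by tool.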

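I disagree with the issues Corbin raised. The versioning issues should already be handled by having a property-based security structure in place, so that you can turn features on or off through user or tenant configuration. Besides, I've found it rare that client A doesn't want the same new feature that client B asked for.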
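
One way to picture switching features on or off per user or tenant is a small lookup with layered overrides. This is only a sketch under assumptions: the `FeatureConfig` class, the feature name `bulk_export`, and the precedence order (user, then tenant, then default) are invented for illustration and are not the answerer's actual design.

```python
from dataclasses import dataclass, field


@dataclass
class FeatureConfig:
    """Hypothetical feature switches with tenant- and user-level overrides."""

    defaults: dict = field(default_factory=dict)
    tenant_overrides: dict = field(default_factory=dict)
    user_overrides: dict = field(default_factory=dict)

    def is_enabled(self, feature: str, tenant_id: str, user_id=None) -> bool:
        # Most specific setting wins: user, then tenant, then product default.
        if user_id is not None:
            user_flags = self.user_overrides.get((tenant_id, user_id), {})
            if feature in user_flags:
                return user_flags[feature]
        tenant_flags = self.tenant_overrides.get(tenant_id, {})
        if feature in tenant_flags:
            return tenant_flags[feature]
        # Unknown features are treated as off.
        return self.defaults.get(feature, False)


cfg = FeatureConfig(
    defaults={"bulk_export": False},
    tenant_overrides={"branch_a": {"bulk_export": True}},
    user_overrides={("branch_a", "temp_user"): {"bulk_export": False}},
)
```

With this shape, a feature that only some tenants asked for ships in the single code base but stays off by default, and a schema change it needs can be deployed for everyone while the feature itself remains hidden.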

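The second issue, about commingled data, isn't a problem either. Just look at salesforce.com or any other large site. They definitely use a multi-tenant architecture, and judging by the number of clients using them, it doesn't seem to be an issue. The most important thing here is being able to assure your clients that their data is secure.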

云柯 2024-11-07 11:56:27

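If you're talking about 10 branches, multi-tenancy looks like a high cost for little benefit.

You didn't mention some of the complexities of multi-tenancy:

Versioning gets hard. Customers X, Y, and Z may want a new feature while customers A, B, and C do not. It is difficult to accommodate everyone in a multi-tenant application, especially when the new feature requires database schema changes. It's not impossible, just harder.
Some customers are very uncomfortable with their data sitting in the same tables as other customers' data. Even though we know better, it feels like a security risk to them. Legal departments hate it. Also, if you ever dump raw data for a client, a shared database calls for extra care.

You can eliminate some of the pain points with better practices:

Automate deployment. That makes adding new clients or upgrading/downgrading existing ones much easier. Database maintenance (backups, index rebuilds) should be set up automatically as well.
Store shared data (SKUs, inventory) in a central database and have each application instance access it directly or through a service.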
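
The second suggestion could look roughly like the following Python sketch, where each branch keeps its own database and asks a central catalog about SKUs. Everything named here (`SkuCatalog`, `BranchApp`, the `skus` and `stock` tables, the sample SKU) is a hypothetical stand-in, and the in-process class merely plays the role that a real web service or shared database connection would play.

```python
import sqlite3


class SkuCatalog:
    """Stand-in for a central SKU service backed by its own database."""

    def __init__(self):
        self.conn = sqlite3.connect(":memory:")
        self.conn.execute(
            "CREATE TABLE skus (sku TEXT PRIMARY KEY, name TEXT NOT NULL)"
        )

    def add(self, sku: str, name: str) -> None:
        self.conn.execute("INSERT INTO skus (sku, name) VALUES (?, ?)", (sku, name))

    def name_of(self, sku: str):
        row = self.conn.execute(
            "SELECT name FROM skus WHERE sku = ?", (sku,)
        ).fetchone()
        return row[0] if row else None


class BranchApp:
    """One application instance with its own per-branch database."""

    def __init__(self, name: str, catalog: SkuCatalog):
        self.name = name
        self.catalog = catalog
        self.conn = sqlite3.connect(":memory:")  # separate database per branch
        self.conn.execute(
            "CREATE TABLE stock (sku TEXT PRIMARY KEY, qty INTEGER NOT NULL)"
        )

    def receive(self, sku: str, qty: int) -> None:
        # Only SKUs known to the central catalog may be stocked locally.
        if self.catalog.name_of(sku) is None:
            raise KeyError(f"unknown SKU: {sku}")
        cur = self.conn.execute(
            "UPDATE stock SET qty = qty + ? WHERE sku = ?", (qty, sku)
        )
        if cur.rowcount == 0:
            self.conn.execute(
                "INSERT INTO stock (sku, qty) VALUES (?, ?)", (sku, qty)
            )

    def stock_report(self):
        rows = self.conn.execute("SELECT sku, qty FROM stock ORDER BY sku")
        # Product names come from the central catalog, not the branch database.
        return [(sku, self.catalog.name_of(sku), qty) for sku, qty in rows]
```

The trade-off is that branch data stays isolated and independently restorable, while the catalog becomes a shared dependency that every branch needs to be reachable.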

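Don't get me wrong: one of the most interesting applications I've worked on was multi-tenant. It can bring huge benefits, but you're more likely to see them with thousands of customers than with ten.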

夏末染殇 2024-11-07 11:56:27

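Honestly, this is a business question. Either you can offer more customized functionality to smaller user groups in a multi-tenant setup, at the cost of more IT overhead. That is, you will need more people and hardware (in management speak: money), but you provide more flexibility.

Or you are in one giant Borg-like environment, where you can lower IT overhead (again, people and things, which to management means money), but your end users have to accept less flexibility in their software. Every bug is every user's problem, so big bugs get fixed quickly. However, new features also affect all users, so they arrive more slowly.

If you personally have the power to make this call and the business simply has to go along with what you say, or if you can nudge management one way or the other, I suggest asking yourself a series of questions about which situation you would prefer:

A) Do you want more people around to manage this and share the payroll/responsibility?
B) As far as you know, will there be a fourth user group soon?
C) How long do you want to stay at this company?

If you answered yes to the first two questions, you probably want multi-tenancy.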

〃安静 2024-11-07 11:56:27

I work in a situation where, for regulatory/legal reasons, we have to keep each client's data in a separate database. However, there is certain information that must be shared, mostly related to things like a lookup table for which client's URL corresponds to which database. Also, a client can choose to have multiple databases if they wish to separate their data in some logical way. So, for each of our products, we really have three types of databases:

ApplicationData, which has just a few tables that contain information about the clients themselves, like which MasterData database (see below) to use when reached by a certain URL and which features are available to that client. Each product has just one ApplicationData, no matter how many different clients are using that product.
MasterData, which contains client-specific information such as users, roles, and permissions (in our case, the tables that aspnet_regsql creates are here). Among the permissions specified here are which ClientData databases are available to a given user (see below). The schema for all MasterData databases (for the same product) is the same.
ClientData, which contains the data with which the user interacts. In one product, this is data that the client can search based on a large number of criteria, create reports about, etc. In another product, this contains the dynamic data that a client can upload so that other users can contact people to take surveys over the phone, etc. The schema for all ClientData databases for the same product is the same.

Now, one caveat: We actually use the same schema, and often the same actual database, for MasterData and ClientData. This is for historical reasons, as the ability to allow a client to have one authentication database (MasterData) corresponding to a number of ClientData databases is a relatively new feature that only applies to one of our products. Also, this structure simplifies deployment, since most clients only use one ClientData database. However, MasterData and ClientData have separate entity models under Entity Framework in our projects, and we have to ensure that there are no direct relationships between MasterData and ClientData such as foreign keys.

This setup works pretty well for us. One major advantage is that there is no problem with putting different ClientData databases on different servers. This helps greatly with load balancing, and it provides a natural way to partition data. We can essentially offer a client with a huge amount of data a dedicated database server if they are willing to pay for it.

One more thing that has really helped us in this situation is Red Gate's tooling, specifically Multi-Script, SQL Source Control, and Schema Compare. When we upgrade something and the schema changes, we have to deploy the changes to all the relevant databases. These tools have more than paid for themselves in time saved. Note that I have no affiliation with Red Gate other than as a satisfied user.
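
To make the "deploy the changes to all the relevant databases" step concrete, here is a small Python sketch of the general idea. It does not use or imitate Red Gate's tools; it is a hand-rolled stand-in that assumes SQLite and tracks each database's schema version with SQLite's `PRAGMA user_version`. The `MIGRATIONS` list and the `customers` table are invented for the example.

```python
import sqlite3

# Hypothetical ordered schema changes: (version, SQL statement).
MIGRATIONS = [
    (1, "CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT NOT NULL)"),
    (2, "ALTER TABLE customers ADD COLUMN email TEXT"),
]


def current_version(conn: sqlite3.Connection) -> int:
    """Read the schema version SQLite stores in the database header."""
    return conn.execute("PRAGMA user_version").fetchone()[0]


def migrate(conn: sqlite3.Connection) -> list:
    """Apply every migration newer than the database's recorded version."""
    applied = []
    for version, sql in MIGRATIONS:
        if version > current_version(conn):
            conn.execute(sql)
            # PRAGMA values cannot be bound as parameters; version is an int
            # taken from our own MIGRATIONS list, so formatting it is safe.
            conn.execute(f"PRAGMA user_version = {version}")
            applied.append(version)
    conn.commit()
    return applied


def migrate_all(databases: dict) -> dict:
    """Run the same migrations against every database; report what changed."""
    return {name: migrate(conn) for name, conn in databases.items()}
```

Because each database records its own version, running `migrate_all` twice is harmless, and a database that was upgraded earlier only receives the changes it is missing. A production setup would also want per-database error handling and backups before each run.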

Edit: (in response to comment)

ApplicationData is one database per product. The three web-based products we have use the same schema for ApplicationData, since they record basically the same types of information. However, there is no reason it would have to stay that way. The ApplicationData databases are all on the same server. One of the tables in ApplicationData points to the correct server and database name for the client's MasterData, so MasterData for a given client can reside on any server.

MasterData has server and database name information for each ClientData database, so again, the databases can reside on any server. In practice, for now, we only have two production database servers total for these products. The MasterData schema is similar per product, but I do not think they are exactly the same (I would have to check). Each client has its own MasterData. If a client purchases multiple products, there is a MasterData for each product for that client; the products interact in other ways (through web services, basically) if a client has purchased that feature (or requests custom development of such a feature). ClientData for a given product always has the same schema.
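
The two-step lookup described here (URL to MasterData, then MasterData to the ClientData databases a user may open) can be sketched in Python as below. The dictionaries stand in for the real lookup tables, and every URL, server name, database name, and user is made up for illustration; the actual table layouts are not given in this answer.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DbLocation:
    server: str
    database: str


# Stand-in for the ApplicationData lookup table (one per product):
# client URL -> location of that client's MasterData. All values are made up.
APPLICATION_DATA = {
    "acme.example.com": DbLocation("sql01", "Acme_Master"),
    "globex.example.com": DbLocation("sql02", "Globex_Master"),
}

# Stand-in for each client's MasterData: user -> ClientData databases
# that user is permitted to open. These can live on any server.
MASTER_DATA = {
    DbLocation("sql01", "Acme_Master"): {
        "alice": [
            DbLocation("sql01", "Acme_Client_Surveys"),
            DbLocation("sql02", "Acme_Client_Archive"),
        ],
        "bob": [DbLocation("sql01", "Acme_Client_Surveys")],
    },
    DbLocation("sql02", "Globex_Master"): {
        "carol": [DbLocation("sql02", "Globex_Client")],
    },
}


def resolve(url: str, user: str) -> list:
    """Return the ClientData locations a user may open when arriving via url."""
    master = APPLICATION_DATA.get(url)
    if master is None:
        raise LookupError(f"no client registered for {url}")
    permissions = MASTER_DATA[master]
    if user not in permissions:
        raise PermissionError(f"{user} is not a user of {master.database}")
    return permissions[user]
```

Because each hop returns a server and database name rather than assuming a fixed host, moving a large client to a dedicated server only means updating one row in the lookup data.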

So, in summary:

ApplicationData is per product and happens to have the same schema in each product.
MasterData is per client for a product.
There are one or more ClientData instances for a client within a product.

I did oversimplify slightly in that only one of our products supports multiple ClientData instances per client. For a second product, that will probably be implemented eventually. For a third product, it would make no sense at all as a feature and will likely just remain as is.

I hope that answers your question!
