当前位置：文江博客话题详情

适用于基于 Java（GWT、Spring、Hibernate）的 Web 应用程序的 SaaS/多租户方法

发布于 2024-10-27 12:34:56 字数 723 浏览 14 评论 0原文

我目前正在考虑将使用 Spring、GWT、Hibernate、Jackrabbit、Hibernate Search / Lucene（以及其他）的基于 Java 的单租户 Web 应用程序转换为成熟的 SaaS 风格应用程序。

我偶然发现了一篇文章，其中强调了以下 7 个“事项”，作为对单租户应用程序进行重要更改以使其成为 SaaS 应用程序：

应用程序必须支持多租户。
该应用程序必须具有一定程度的自助注册功能。
必须有适当的订阅/计费机制。
应用程序必须能够有效地扩展。
必须有适当的功能来监视、配置和管理应用程序和租户。
必须有一种机制来支持唯一的用户识别和身份验证。
必须有一种机制来支持每个租户的某种程度的定制。

我的问题是，是否有人使用与我列出的类似技术在 SaaS/多租户应用程序中实现上述 7 件事中的任何一个？在我走上我目前正在考虑的道路之前，我渴望获得尽可能多的关于最佳方法的意见。

首先，我非常确定我能够很好地掌握如何在模型级别处理多个租户。我正在考虑向所有表中添加一个租户 ID，然后使用 Hibernate 过滤器（以及用于 Hibernate 搜索的全文过滤器）根据登录用户的租户 ID 进行所有查询的过滤。

然而，我确实对性能也有一些担忧，尤其是当我们的租户数量增长得相当高时。

任何有关如何实施此类解决方案的建议将不胜感激（如果这个问题有点过于开放，我深表歉意）。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

寂寞笑我太脆弱 2024-11-03 12:34:56

我建议您构建应用程序以支持所有 4 种类型的租户隔离，即每个租户的单独数据库、每个租户的单独架构、每个租户的单独表以及具有租户 ID 的所有租户的共享表。这将使您能够随着数据库的增长灵活地水平分区，拥有多个数据库，每个数据库都有一组较小的租户，并且还能够为一些大型租户拥有单独的数据库。您的一些大租户也可能坚持要求他们的数据（数据库）应驻留在他们的场所，而应用程序可以在云外运行。

以下是您在构建应用程序时可能需要考虑的非功能性和基础设施级功能的详尽检查列表（其中一些您可能不会立即需要，但请考虑一下业务情况，如果您的竞争开始提供）

租户级定制a）UI主题和徽标b）表单和网格，c）数据模型扩展和自定义字段，d）通知模板，e）选择列表和主数据
租户级角色创建和管理和权限、字段级访问权限、数据范围策略
模块和功能的租户级访问控制设置，以便可以根据订阅包启用/禁用特定模块和功能。
计量和监控任务/事件/交易，并在超出购买配额后限制访问控制。如果您的业务模型发生变化，则能够在未来计量任何新实体。
将业务规则和工作流程从代码库中外部化，并将它们表示为元数据，以便您可以为每个租户组/租户自定义它们。
查询生成器用于创建了解租户以及特定租户添加的自定义字段的自定义报告。
租户封装和框架级连接字符串管理，使您的开发人员在编写查询时不必担心租户 ID。

所有这些都是基于我们构建可用于任何领域或应用程序的通用多租户框架的经验。不幸的是，您无法使用我们的框架，因为它基于 .NET。

但是，无论您使用何种技术堆栈，任何多租户 SaaS 产品（新的或迁移的）的工程需求都是相同的。

I would recommend that you architect your application to support all the 4 types of tenant isolation namely separate database for each tenant, separate schema for each tenant, separate table for each tenant and shared table for all tenants with a tenant ID. This will give you the flexibility to horizontally partition your database as you grow, having multiple databases each having a group of smaller tenants and also the ability to have a separate database for some large tenants. Some of your large tenants could also insist that their data (database) should reside in their premise, while the application can run off the cloud.

Here is an exaustive check list of non-functional and infrastructure level features that you may want to consider while architecting your application (some of them you may not need immediately, but think of a business situation of how you will handle such a need if your competition starts offering it)

tenant level customization of a) UI themes and logos b) forms and grids, c) data model extensions and custom fields, d) notification templates, e) pick up lists and master data
tenant level creation and administration of roles and privileges, field level access permissions, data scope policies
tenant level access control settings for modules and features, so that specific modules and features could be enabled / disabled depending on the subscription package.
Metering and monitoring of tasks / events / transactions and restriction of access control once the purchased quota is exceeded. The ability to meter any new entity in the future if and when your business model changes.
Externalising the business rules and workflows out of your code base and representing them as meta data, so that you can customize them for each tenant group / tenant.
Query builder for creating custom reports that is aware of the tenant as well as custom fields added by specific tenants.
Tenant encapsulation and framework level connection string management such that your developers do not have to worry about tenant IDs while writing queries.

All these are based on our experience in building a general purpose multi-tenant framework that can be used for any domain or application. Unfortunately, you cannot use our framework as it is based on .NET

But the engineering needs of any multi-tenant SaaS product (new or migrated) are the same irrespective of the technology stack that you use.

回复收藏 0 原文

尘曦 2024-11-03 12:34:56

您列出的所有技术对于单租户和多租户应用程序来说都是非常常见且合理的。我想说，支持 SaaS 的 7 个“事物”更多的是取决于您如何使用这些技术，而不是技术。听起来您已经有了一个可以运行的单租户应用程序。因此，可能没有太多理由偏离那里的技术选择，除非某些东西已经运行得不太好。不过，你的问题在其他方面是相当开放的，所以很难说得更具体。

不过，我确实有一些关于按租户 ID 拆分数据库（可能还有其他内容）的反馈。如果您知道您最终可能会有很多租户（例如数千或更多，特别是如果它们很小），那么您的建议可能是最好的。但是，如果您的租户数量较少（特别是规模较大时），您可能需要考虑每个租户一个数据库，以便每个租户都有自己的表空间。我的意思是单个数据库安装，其中包含相同模式的多个实例，每个租户一个。

这可以成为一个优势有几个原因。一是你提到的性能。向每个表添加租户 ID 会增加磁盘访问和查询时间的开销，并增加代码复杂性。数据库中的每个索引还需要包含租户 ID。如果您不小心，您会面临在租户之间混合数据的额外风险（尽管 Hibernate 过滤器将有助于减轻这种风险）。通过每个租户一个数据库，您可以将访问限制为仅正确的数据库。移植当前的应用程序可能也会容易得多，您基本上只需要尽早在某个地方拦截您的请求，以根据 URL 决定租户并指向正确的数据库。每个租户也可以轻松进行备份，如果您打算允许他们下载备份，则特别有用。

另一方面，也有理由不这样做。您将需要处理大量数据库模式，并且必须独立更新它们（如果您想避免因模式更改而导致所有租户停机，这实际上可能是一个优势，您可以逐步推出它们）。它允许您在特殊情况下将平台视为一次性升级的真正多租户 SaaS 部署，从而在生产中管理多个版本。最后，我听说几乎每个数据库供应商在一次安装中支持的模式实例数量都存在一个突破点（据推测有些可能会达到数十万个）。

当然，这实际上取决于您的用例。您提到了单一租户，这让我相信您现在没有太多租户，但您确实提到了要增加很多租户。我不确定你的意思是数百还是数百万，但无论哪种方式，我希望这对你的考虑有所帮助。祝你好运！

All of the technologies that you listed are quite common and reasonable for both single- and multi-tenant applications. I'd say supporting the 7 "things" for SaaS is much more of a function of how you use the technologies than which. It sounds like you already have a single-tenant application that works. So there's probably not much reason to deviate from the technology selections there unless something is just not working very well already. Your question is otherwise fairly open-ended though, so it's hard to be too much more specific there.

I do have some feedback on splitting the database (and perhaps other things) by tenant ID though. If you know you might eventually have a lot of tenants (say many thousands or more, particularly if they're small) then what you suggest is perhaps best. If however you'll have a smaller number of tenants (particularly if they're large) you might want to consider a database per tenant, so they each have their own table space. By that I mean a single database installation with multiple instances of the same schema inside of it, one per tenant.

There are a few reasons this can be an advantage. One is performance as you mentioned. Adding a tenant ID to every single table is overhead on disk access, query time and increases code complexity. Every index in the database will need to include the tenant ID as well. You run an additional risk of mixing data between tenants if you're not careful (although a Hibernate filter would help mitigate that). With a database per tenant you could restrict access to only the correct one. Porting your current application will probably be a lot easier too, you basically just need to intercept your request somewhere early to decide the tenant based on the URL and point to the right database. Backups are also easy to do per tenant, particularly useful if you ever intend on allowing them to download a backup.

On the other hand there are reasons not to do this. You'll have a lot of database schemas to deal with and they'll have to be updated independently (which can actually be an advantage if you want to avoid taking all tenants down for a schema change, you can roll them out incrementally). It lets you have special cases that could deviate from treating the platform as a true multi-tenant SaaS deployment that's upgraded all at once, resulting in management of multiple versions in production. Lastly I've heard there is a breaking point with just about every database vendor out there in the number of schema instances they'll support in one installation (supposedly some can go to hundreds of thousands though).

It really depends on your use case of course. You mentioned single-tenant which leads me to believe you don't have too many tenants right now, however you do mention growing to lots of tenants. I'm not sure if you mean hundreds or millions, yet either way I hope this helps some with your considerations. Best of luck!

回复收藏 0 原文