选择 Google Cloud Run 中的实例数量

发布于 2025-01-10 13:29:16 字数 2214 浏览 0 评论 0原文

我正在 Google Cloud Run 中运行连接到 Cloud SQL 数据库的 PHP-FPM 应用程序。我的应用程序在正常使用中运行良好，但我们有时会发布大量产品，并期望较高的流量负载。我们提前知道客户何时发布他们的产品，流量会增加，因此我们能够相应地规划我们的服务器容量。

我们预计大约有 1000 位客户，他们会疯狂地刷新页面。这将对我们的数据库产生很大的压力，因为许多端点会对数据库产生大量的查询。我一直在运行 Siege 来加载测试我的应用程序，因此我知道我们的数据库是服务器负载增加时的临界点。通常我们只在一个 Cloud Run 实例上运行。我还从早期的 Siege 测试运行中了解到，大量流量会导致 Cloud Run 动态启动新实例。对于我们的应用程序来说，这是一个非常耗时的过程，导致平均请求处理时间较长。因此，我们希望有超过 1 个容器实例可供发布。对于早期且较小的版本，我们已将最小容器数量增加到 20 个，但这个数字几乎是出乎意料的。因此，我正在寻找在 Cloud Run 中运行这些版本的最佳设置，但是我正在努力寻找关于我应该运行多少个 Cloud Run 实例（“最小服务器”和“最大服务器”）的任何好的答案。

这些是我的相关应用程序配置：

Cloud Run 配置：

8 GB RAM
4 CPUs
Maximum requests per container: 80

数据库配置：

15 GB RAM
4 CPUs
EDIT: Can handle 4000 simultaneous connections

相关 PHP-FPM 配置：

pm = dynamic
pm.max_children = 375 (formula: (8000MB - 500MB)/ 20MB. Memory of server minus some "extras", limited to 20 MB per child.)
pm.start_servers = 37 (formula: 10% of pm.max_children)
pm.min_spare_servers = 35 (Bit of a guess, must be less than pm.start_servers.
pm.max_spare_servers = 100 (Bit of a guess on my side)

因此，我的 Cloud Run 实例总共接受 80 个并行请求，而我的 PHP-FPM 允许 375 个子进程。

我在 Cloud Run 并发文档中看到“...当您设置了最大实例限制，在某些情况下，没有足够的实例来满足该流量负载，在这种情况下，传入请求可能会排队长达 10 秒”。对我们来说，不要丢弃任何请求很重要，因此运行的实例也不能太少。

我的问题：

是否有任何涉及进程数量、服务器硬件或任何上述规格的公式可以为我指明 Cloud Run 中理想的实例数量？
我是否应该限制 Cloud Run 可以启动的最大容器数量？我非常有信心 100 个 Cloud Run 容器同时向我的数据库发送垃圾邮件将导致灾难。
在我的应用程序之外实现单独的排队功能是否是一个好主意，这样它就不会过载？
有人可以向我指出一些其他资源，可以帮助我找到正确的方向吗？我能找到的唯一合适的东西是关于 Pluralsight 的课程，高性能 PHP< /a>，但它不包括在云中运行和扩展应用程序。

我知道这些问题的答案并不完全是黑白分明的，但我正在寻找一些指导方针，因为我认为 Cloud Run 文档并不是特别有启发性。我不知道以后应该用2个还是20个容器。

PS：这种升级仅在非常有限的时间内进行，因此我不关心运行许多容器的成本。我只是想要最好的性能和最小的灾难机会。

原文

I am running a PHP-FPM application inside Google Cloud Run connected to a Cloud SQL database. My application is running fine in normal use, but we sometimes release a large amount of products, and expect a high traffic load. We know in advance when our customers release their products and the traffic will increase, so we are able to plan our server capacity accordingly.

We are expecting about 1000 customers, who will refresh the page like crazy. That will generate a lot of pressure on our database, as many of the endpoints generate a huge amount of queries to the database.
I have been running Siege to load test my application, and I therefore know that our database is the critical point when the server load increases. Normally we run on only one Cloud Run instance. I also know from earlier test runs with Siege, that much traffic causes Cloud Run to spin up new instances on the fly. This was a very time consuming process for our application, resulting in a high average request processing time. For this reason, we want to have more than 1 container instance ready for the release.
For earlier, and smaller, releases, we have bumped the minimum number of containers to 20, but that number is taken pretty much out of the blue.
Therefore I am looking for the optimal setup to run in Cloud Run for these releases, but
I am struggling to find any good answers to how many Cloud Run instances I should run, both "min servers" and "max servers".

These are my relevant application configs:

Cloud Run configs:

8 GB RAM
4 CPUs
Maximum requests per container: 80

Database configs:

15 GB RAM
4 CPUs
EDIT: Can handle 4000 simultaneous connections

Relevant PHP-FPM configs:

pm = dynamic
pm.max_children = 375 (formula: (8000MB - 500MB)/ 20MB. Memory of server minus some "extras", limited to 20 MB per child.)
pm.start_servers = 37 (formula: 10% of pm.max_children)
pm.min_spare_servers = 35 (Bit of a guess, must be less than pm.start_servers.
pm.max_spare_servers = 100 (Bit of a guess on my side)

So my Cloud Run instance is accepting a total of 80 parallel requests, and my PHP-FPM allows 375 child processes.

I have seen in Cloud Run concurrency docs that "...when you set a maximum instances limit, in some scenarios there will be insufficient instances to meet that traffic load. In that case, incoming requests can be queued for up to 10 seconds".
It is important to us not dropping any requests, so I can not have too few instances running either.

My questions:

Is there any formula involving number of processes, server hardware or any of the abovementioned specs that could point me to an ideal number of instances in Cloud Run?
Should I limit the max number of containers that Cloud Run can spin up? I am pretty confident that 100 Cloud Run containers all spamming my database simultaneously will lead to disaster.
Could it be a good idea to implement a separate queuing functionality outside of my application, so that it is not overloaded?
Could anyone point me to some other resources that can get me in the right direction here? The only proper thing I have been able to find is a course on Pluralsight, High Performance PHP, but it does not cover running and scaling your application in the cloud.

I understand that the answers to these questions are not completely black and white, but I am looking for some guidelines, as I don't find the Cloud Run docs especially enlightening. I don't know whether I should use 2 or 20 containers in the future.

PS: This upscaling is for a very limited time only, so I don't care about the cost of running many containers. I simply want the best performance with the least chance of disaster.

分享到QQ

分享到微博