PostgreSQL persistent connections consume large amounts of memory

Posted 2024-07-29 06:11:49

I have a C++ application which is making use of PostgreSQL 8.3 on Windows. We use the libpq interface.

We have a multi-threaded app where each thread opens a connection and keeps using it without ever calling PQfinish.

We notice that postgres.exe's memory consumption goes up with every query (especially SELECT statements), as high as 1.3 GB. Eventually, postgres.exe crashes and forces our program to create a new connection.

Has anyone experienced this problem before?

EDIT: shared_buffers is currently set to 128MB in our conf file.

EDIT2: the workaround we have in place right now is to call PQfinish after every transaction. But this slows down our processing a bit, since establishing a connection every time is quite slow.
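
For illustration, the per-transaction workaround looks roughly like this; a minimal sketch using libpq's synchronous API, where the conninfo string and the query are placeholders:

```cpp
// Minimal sketch of the EDIT2 workaround: open a fresh connection, do the
// work, then PQfinish() so the server-side backend exits and releases its
// memory. conninfo and sql are placeholders.
#include <libpq-fe.h>
#include <cstdio>

bool run_once(const char *conninfo, const char *sql)
{
    PGconn *conn = PQconnectdb(conninfo);       // the slow part: full handshake
    if (PQstatus(conn) != CONNECTION_OK) {
        std::fprintf(stderr, "connect failed: %s", PQerrorMessage(conn));
        PQfinish(conn);
        return false;
    }

    PGresult *res = PQexec(conn, sql);
    bool ok = PQresultStatus(res) == PGRES_TUPLES_OK
           || PQresultStatus(res) == PGRES_COMMAND_OK;
    PQclear(res);

    PQfinish(conn);                             // backend exits here
    return ok;
}
```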

Comments (3)

不必了 2024-08-05 06:11:49

In PostgreSQL, each connection has a dedicated backend. This backend not only holds connection and session state, but is also an execution engine. Backends aren't particularly cheap to leave lying around, and they cost both memory and synchronization overhead even when idle.

There's an optimum number of actively working backends for any given Pg server on any given workload, where adding more working backends slows things down rather than speeding them up. You want to find that point and limit the number of backends to around that level. Unfortunately there's no magic recipe for this; it mostly involves benchmarking on your hardware and with your workload.

If you need more connections than that, you should use a proxy or pooling system that allows you to separate "connection state" from "execution engine". Two popular choices are PgBouncer and PgPool-II. You can maintain lightweight connections from your app to the proxy/pooler and let it schedule the workload to keep the database server working at its optimum load. If too many queries come in, some wait before being executed instead of competing for resources and slowing down all queries on the server.
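
From the application side this is usually a small change: point the libpq conninfo at the pooler instead of at PostgreSQL itself. A hedged sketch, assuming PgBouncer on its default port 6432; host, database name, and credentials are placeholders:

```cpp
// Hypothetical sketch: the only client-side change for PgBouncer is the
// connection target. The pooler multiplexes many lightweight client
// connections onto a small, fixed set of real backends.
#include <libpq-fe.h>

PGconn *connect_via_pooler()
{
    // Direct to PostgreSQL:  "host=dbhost port=5432 dbname=app user=app"
    // Through PgBouncer:     same parameters, pooler's address and port.
    return PQconnectdb("host=poolerhost port=6432 dbname=app user=app");
}
```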

See the PostgreSQL wiki.

Note that if your workload is read-mostly, and especially if it contains items that don't change often and for which you can determine a reliable cache-invalidation scheme, you can also use memcached or Redis to reduce your database workload. This requires application changes. PostgreSQL's LISTEN and NOTIFY will help you do sane cache invalidation.
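
As a rough sketch of what LISTEN/NOTIFY-driven invalidation can look like in libpq; the channel name cache_invalidation and the cache object are assumptions, and note that the NOTIFY payload read via n->extra requires a server newer than 8.3, so on 8.3 the key would have to travel some other way:

```cpp
// Hedged sketch of LISTEN/NOTIFY cache invalidation. Writers would run
// something like: NOTIFY cache_invalidation, 'stale-key' (payloads need
// PostgreSQL >= 9.0; on 8.3 encode the key differently).
#include <libpq-fe.h>
#include <cstdio>

void watch_invalidations(PGconn *conn /*, Cache &cache */)
{
    PQclear(PQexec(conn, "LISTEN cache_invalidation"));  // subscribe once

    for (;;) {
        PQconsumeInput(conn);                  // pull any pending server data
        while (PGnotify *n = PQnotifies(conn)) {
            std::printf("invalidate: channel=%s key=%s\n",
                        n->relname, n->extra);
            // cache.erase(n->extra);          // drop the stale entry
            PQfreemem(n);
        }
        // A real loop would block in select()/poll() on PQsocket(conn)
        // instead of spinning.
    }
}
```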

Many database engines have some separation of execution engine and connection state built into the core database engine's design. Sybase ASE certainly does, and I think Oracle does too, but I'm not too sure about the latter. Unfortunately, because of PostgreSQL's one-process-per-connection model, it's not easy to pass work around between backends, which makes it harder for PostgreSQL to do this natively, so most people use a proxy or pool.

I strongly recommend that you read PostgreSQL High Performance. I don't have any relationship/affiliation with Greg Smith or the publisher*, I just think it's great and will be very useful if you're concerned about your DB's performance.


* ... well, I didn't when I wrote this. I work for the same company now.

找个人就嫁了吧 2024-08-05 06:11:49

The memory usage is not necessarily a problem. PostgreSQL uses shared memory for some caching, and this memory does not count towards the process's memory usage until it's actually used. The more the process is used, the larger the portion of the shared buffers that will be active in its address space.

If you have a large value for shared_buffers, this will happen. If it is too large, the process can run out of address space and crash, yes.

笨死的猪 2024-08-05 06:11:49

The problem is probably that you don't close the transaction.
In PostgreSQL, even if you only run SELECTs with no DML, they still execute inside a transaction, which needs to be rolled back.
Adding a rollback at the end of each transaction will reduce your memory problem.
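
In libpq terms that advice looks something like this; a minimal sketch, assuming the application opens transactions explicitly, with a placeholder query:

```cpp
// Hypothetical sketch: end every transaction explicitly, even read-only
// ones, so the backend releases what it holds for the open transaction.
#include <libpq-fe.h>

void read_only_work(PGconn *conn)
{
    PQclear(PQexec(conn, "BEGIN"));

    PGresult *res = PQexec(conn, "SELECT id, name FROM items");  // placeholder
    // ... consume rows with PQntuples()/PQgetvalue() ...
    PQclear(res);

    PQclear(PQexec(conn, "ROLLBACK"));  // nothing to persist for a SELECT
}
```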
