Creating an index on a very large Postgres table is extremely slow and times out
I have a table with about 200 million rows and 800 columns in an AWS RDS cluster that I'd like to optimize read speed on. Unfortunately, the index creation process is so slow that my client connection times out. I've tried a number of things to address this, such as:
- Modifying the tcp_keepalive cluster and timeout settings (both this and the CONCURRENTLY syntax are sketched after this list)
- Creating another table and attempting to index that instead
- Adding the CONCURRENTLY index creation parameter in case locks were causing delays
- Messing with my local firewall settings so network connections don't get closed out
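A minimal sketch of what the keepalive tweak and the CONCURRENTLY variant look like, using hypothetical names big_table and some_column in place of the real ones:

```sql
-- Session-level keepalives so the server probes an idle client instead of
-- letting the connection silently drop (these can also go in the RDS parameter group).
SET tcp_keepalives_idle = 60;      -- seconds of idle time before the first probe
SET tcp_keepalives_interval = 60;  -- seconds between probes
SET tcp_keepalives_count = 10;     -- unanswered probes before the connection is dropped

-- CONCURRENTLY builds the index without blocking writes, at the cost of a slower
-- build; it must be run outside a transaction block.
CREATE INDEX CONCURRENTLY idx_big_table_some_column
    ON big_table (some_column);
```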
In all these cases, after submitting the index creation query, I get an error message after many hours saying: SSL SYSCALL error: Operation timed out
Checking the logs, I see messages like:
LOG: could not send data to client: Connection timed out
LOG: could not send data to client: Broken pipe
FATAL: connection to client lost
I've submitted these queries using Postico and the psql CLI in an attempt to rule out any weird client settings, too, but to no avail.
I'm a bit of a novice, so it's possible I've carried out the troubleshooting steps incorrectly. I've also read other related posts while troubleshooting but haven't really made any headway, and I would really appreciate any advice. Thanks in advance!
Welp, it turns out it was a cluster resource issue. I upgraded the RDS instance to a higher-RAM/higher-vCPU tier and the indices were created relatively quickly. The "Freeable Memory" metric had plummeted over the past several weeks, so that probably led to some I/O bottlenecks and the subsequent timeouts.
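For anyone hitting the same wall: if the cluster is on PostgreSQL 12 or newer, the index build can be watched from a second connection, and memory-bound builds can sometimes be helped by a larger session-level maintenance_work_mem (assuming the instance actually has RAM to spare). A rough sketch:

```sql
-- Watch a long-running index build from a second connection (PostgreSQL 12+).
SELECT phase, blocks_done, blocks_total, tuples_done, tuples_total
FROM pg_stat_progress_create_index;

-- Give the build more sort memory for this session only, if headroom exists.
SET maintenance_work_mem = '2GB';
```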