姜戈 + FastCGI - 随机引发OperationalError
我正在运行 Django 应用程序。 之前在Apache + mod_python下装过,一切OK。 切换到 Lighttpd + FastCGI。 现在我随机得到以下异常(它出现的地点和时间似乎都不可预测)。 由于它是随机的,并且只有在切换到 FastCGI 后才会出现,我认为它与某些设置有关。
谷歌搜索时发现了一些结果,但它们似乎与设置 maxrequests=1 有关。 但是,我使用默认值,即 0。
有什么想法可以在哪里寻找吗?
附言。 我正在使用 PostgreSQL。 可能也与此有关,因为在进行数据库查询时会出现异常。
File "/usr/lib/python2.6/site-packages/django/core/handlers/base.py", line 86, in get_response
response = callback(request, *callback_args, **callback_kwargs)
File "/usr/lib/python2.6/site-packages/django/contrib/admin/sites.py", line 140, in root
if not self.has_permission(request):
File "/usr/lib/python2.6/site-packages/django/contrib/admin/sites.py", line 99, in has_permission
return request.user.is_authenticated() and request.user.is_staff
File "/usr/lib/python2.6/site-packages/django/contrib/auth/middleware.py", line 5, in __get__
request._cached_user = get_user(request)
File "/usr/lib/python2.6/site-packages/django/contrib/auth/__init__.py", line 83, in get_user
user_id = request.session[SESSION_KEY]
File "/usr/lib/python2.6/site-packages/django/contrib/sessions/backends/base.py", line 46, in __getitem__
return self._session[key]
File "/usr/lib/python2.6/site-packages/django/contrib/sessions/backends/base.py", line 172, in _get_session
self._session_cache = self.load()
File "/usr/lib/python2.6/site-packages/django/contrib/sessions/backends/db.py", line 16, in load
expire_date__gt=datetime.datetime.now()
File "/usr/lib/python2.6/site-packages/django/db/models/manager.py", line 93, in get
return self.get_query_set().get(*args, **kwargs)
File "/usr/lib/python2.6/site-packages/django/db/models/query.py", line 304, in get
num = len(clone)
File "/usr/lib/python2.6/site-packages/django/db/models/query.py", line 160, in __len__
self._result_cache = list(self.iterator())
File "/usr/lib/python2.6/site-packages/django/db/models/query.py", line 275, in iterator
for row in self.query.results_iter():
File "/usr/lib/python2.6/site-packages/django/db/models/sql/query.py", line 206, in results_iter
for rows in self.execute_sql(MULTI):
File "/usr/lib/python2.6/site-packages/django/db/models/sql/query.py", line 1734, in execute_sql
cursor.execute(sql, params)
OperationalError: server closed the connection unexpectedly
This probably means the server terminated abnormally
before or while processing the request.
I'm running a Django application. Had it under Apache + mod_python before, and it was all OK. Switched to Lighttpd + FastCGI. Now I randomly get the following exception (neither the place nor the time where it appears seem to be predictable). Since it's random, and it appears only after switching to FastCGI, I assume it has something to do with some settings.
Found a few results when googleing, but they seem to be related to setting maxrequests=1. However, I use the default, which is 0.
Any ideas where to look for?
PS. I'm using PostgreSQL. Might be related to that as well, since the exception appears when making a database query.
File "/usr/lib/python2.6/site-packages/django/core/handlers/base.py", line 86, in get_response
response = callback(request, *callback_args, **callback_kwargs)
File "/usr/lib/python2.6/site-packages/django/contrib/admin/sites.py", line 140, in root
if not self.has_permission(request):
File "/usr/lib/python2.6/site-packages/django/contrib/admin/sites.py", line 99, in has_permission
return request.user.is_authenticated() and request.user.is_staff
File "/usr/lib/python2.6/site-packages/django/contrib/auth/middleware.py", line 5, in __get__
request._cached_user = get_user(request)
File "/usr/lib/python2.6/site-packages/django/contrib/auth/__init__.py", line 83, in get_user
user_id = request.session[SESSION_KEY]
File "/usr/lib/python2.6/site-packages/django/contrib/sessions/backends/base.py", line 46, in __getitem__
return self._session[key]
File "/usr/lib/python2.6/site-packages/django/contrib/sessions/backends/base.py", line 172, in _get_session
self._session_cache = self.load()
File "/usr/lib/python2.6/site-packages/django/contrib/sessions/backends/db.py", line 16, in load
expire_date__gt=datetime.datetime.now()
File "/usr/lib/python2.6/site-packages/django/db/models/manager.py", line 93, in get
return self.get_query_set().get(*args, **kwargs)
File "/usr/lib/python2.6/site-packages/django/db/models/query.py", line 304, in get
num = len(clone)
File "/usr/lib/python2.6/site-packages/django/db/models/query.py", line 160, in __len__
self._result_cache = list(self.iterator())
File "/usr/lib/python2.6/site-packages/django/db/models/query.py", line 275, in iterator
for row in self.query.results_iter():
File "/usr/lib/python2.6/site-packages/django/db/models/sql/query.py", line 206, in results_iter
for rows in self.execute_sql(MULTI):
File "/usr/lib/python2.6/site-packages/django/db/models/sql/query.py", line 1734, in execute_sql
cursor.execute(sql, params)
OperationalError: server closed the connection unexpectedly
This probably means the server terminated abnormally
before or while processing the request.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(13)
可能的解决方案:http://groups.google.com/group/django-用户/browse_thread/thread/2c7421cdb9b99e48
不理想的想法,连接两次到数据库充其量是一种解决方法。
可能的解决方案:使用连接池(pgpool、pgbouncer),这样您就可以拥有稳定的数据库连接池,并快速传递给 FCGI 守护进程。
问题是这会触发另一个错误,psycopg2 会引发 InterfaceError 因为它尝试断开两次连接(pgbouncer 已经处理了这个问题)。
现在,罪魁祸首是 Django 信号 request_finished 触发 connection.close(),并且即使已经断开连接,也会大声失败。 我认为这种行为是不需要的,就好像请求已经完成一样,我们不再关心数据库连接。 纠正这个问题的补丁应该很简单。
相关回溯:
此处的异常处理可以增加更多宽容:
/usr/local/lib/python2.6/dist-packages/Django-1.1.1-py2.6.egg/django/db/__init__.py< /strong>
或者它可以在 psycopg2 上处理得更好,所以如果我们想要做的只是断开连接并且它已经是:
/usr/local/lib/python2.6/dist-packages ,则不会抛出致命错误/Django-1.1.1-py2.6.egg/django/db/backends/__init__.py
除此之外,我缺乏想法。
Possible solution: http://groups.google.com/group/django-users/browse_thread/thread/2c7421cdb9b99e48
Not ideal thought, connecting twice to the DB is a workaround at best.
Possible solution: using connection pooling (pgpool, pgbouncer), so you have DB connections pooled and stable, and handed fast to your FCGI daemons.
The problem is that this triggers another bug, psycopg2 raising an InterfaceError because it's trying to disconnect twice (pgbouncer already handled this).
Now the culprit is Django signal request_finished triggering connection.close(), and failing loud even if it was already disconnected. I don't think this behavior is desired, as if the request already finished, we don't care about the DB connection anymore. A patch for correcting this should be simple.
The relevant traceback:
Exception handling here could add more leniency:
/usr/local/lib/python2.6/dist-packages/Django-1.1.1-py2.6.egg/django/db/__init__.py
Or it could be handled better on psycopg2, so to not throw fatal errors if all we're trying to do is disconnect and it already is:
/usr/local/lib/python2.6/dist-packages/Django-1.1.1-py2.6.egg/django/db/backends/__init__.py
Other than that, I'm short on ideas.
在交换机中,您是否更改了 PostgreSQL 客户端/服务器版本?
我在 php+mysql 上看到过类似的问题,罪魁祸首是客户端/服务器版本之间的不兼容(即使它们有相同的主要版本!)
In the switch, did you change PostgreSQL client/server versions?
I have seen similar problems with php+mysql, and the culprit was an incompatibility between the client/server versions (even though they had the same major version!)
闻起来像是可能存在线程问题。 Django 不保证线程安全,尽管文件内文档似乎表明 Django/FCGI 可以以这种方式运行。 尝试使用 prefork 运行,然后击败服务器。 如果问题消失了...
Smells like a possible threading problem. Django is not guaranteed thread-safe although the in-file docs seem to indicate that Django/FCGI can be run that way. Try running with prefork and then beat the crap out of the server. If the problem goes away ...
也许两种设置(Apache+mod_python 和 lighttpd + FastCGI)的 PYTHONPATH 和 PATH 环境变量是不同的。
Maybe the PYTHONPATH and PATH environment variable is different for both setups (Apache+mod_python and lighttpd + FastCGI).
最后我切换回 Apache + mod_python(除了这个之外,我在使用 fcgi 时还遇到了其他随机错误),现在一切都很好并且稳定。
这个问题仍然悬而未决。 如果以后有人遇到这个问题并解决了,可以在这里记录解决方案以供将来参考。 :)
In the end I switched back to Apache + mod_python (I was having other random errors with fcgi, besides this one) and everything is good and stable now.
The question still remains open. In case anybody has this problem in the future and solves it they can record the solution here for future reference. :)
我在使用 geodjango 模型时修复了类似的问题,该模型未对其功能之一使用默认 ORM。 当我添加一行来手动关闭连接时,错误消失了。
http://code.djangoproject.com/ticket/9437
我仍然随机看到错误(然而,当使用用户登录/会话进行操作时,约 50% 的请求。
I fixed a similar issue when using a geodjango model that was not using the default ORM for one of its functions. When I added a line to manually close the connection the error went away.
http://code.djangoproject.com/ticket/9437
I still see the error randomly (~50% of requests) when doing stuff with user login/sessions however.
我最近遇到了同样的问题(lighttpd、fastcgi 和 postgre)。 找了好几天的解决办法没有成功,最后只好改用mysql。 问题就消失了。
I went through the same problem recently (lighttpd, fastcgi & postgre). Searched for a solution for days without success, and as a last resort switched to mysql. The problem is gone.
为什么不将会话存储在缓存中?
另外
,您可以尝试将 postgres 与 pgbouncer 一起使用(postgres - prefork 服务器,不喜欢每次连接/断开连接),但首先检查您的 postgresql.log。
另一个版本 - 会话表中有很多记录,django-admin.py cleanup 可以提供帮助。
Why not storing session in cache?
Set
Also you can try use postgres with pgbouncer (postgres - prefork server and don't like many connects/disconnects per time), but firstly check your postgresql.log.
Another version - you have many records in session tables and django-admin.py cleanup can help.
问题可能主要出在进口方面。 至少那是发生在我身上的事。
在从网上找不到任何内容后,我编写了自己的解决方案。 请在这里查看我的博文: 简单的 Python 实用程序来检查所有导入你的项目
当然,这只会帮助你很快地找到原始问题的解决方案,而不是问题本身的实际解决方案。
The problem could be mainly with Imports. Atleast thats what happened to me.
I wrote my own solution after finding nothing from the web. Please check my blogpost here: Simple Python Utility to check all Imports in your project
Ofcourse this will only help you to get to the solution of the original issue pretty quickly and not the actual solution for your problem by itself.
从 method=prefork 更改为 method=threaded 解决了我的问题。
Change from method=prefork to method=threaded solved the problem for me.
即使我不使用 django 而是使用金字塔作为框架,我也会尝试给出答案。 我很长时间以来都遇到这个问题。 问题是,在测试中产生这个错误真的很困难......无论如何。 最后,我通过挖掘会话、范围会话、会话实例、引擎和连接等的全部内容来解决这个问题。我发现了这个:
http://docs.sqlalchemy.org/en/rel_0_7/core/pooling.html#disconnect-handling-pessimistic
这种方法只是添加了一个引擎连接池的监听器。 在侦听器中,向数据库查询静态选择。 如果失败,池会尝试在失败之前建立与数据库的新连接。 重要提示:这发生在任何其他内容被扔到数据库之前。 因此可以预先检查连接,以防止其余代码失败。
这不是一个干净的解决方案,因为它本身不能解决错误,但它的作用就像一个魅力。 希望这对某人有帮助。
I try to give an answer to this even if I'am not using django but pyramid as the framework. I was running into this problem since a long time. Problem was, that it was really difficult to produce this error for tests... Anyway. Finally I solved it by digging through the whole stuff of sessions, scoped sessions, instances of sessions, engines and connections etc. I found this:
http://docs.sqlalchemy.org/en/rel_0_7/core/pooling.html#disconnect-handling-pessimistic
This approach simply adds a listener to the connection pool of the engine. In the listener a static select is queried to the database. If it fails the pool try to establish a new connection to the database before it fails at all. Important: This happens before any other stuff is thrown to the database. So it is possible to pre check connection what prevents the rest of your code from failing.
This is not a clean solution since it don't solve the error itself but it works like a charm. Hope this helps someone.
适用的引用:
“2019 年有人吗?”
- 一半的 YouTube 评论,大约 2019 年
如果有人仍在处理此问题,请确保您的应用程序“急切分叉”,以便您的 Python DB 驱动程序(对我来说是 psycopg2)不会在进程之间共享资源。
我通过添加lazy-apps = true选项解决了uWSGI上的这个问题,这会导致直接分叉应用程序进程,而不是等待写入时复制。 我想其他 WSGI / FastCGI 主机也有类似的选项。
An applicable quote:
"2019 anyone?"
- half of YouTube comments, circa 2019
If anyone is still dealing with this, make sure your app is "eagerly forking" such that your Python DB driver (
psycopg2
for me) isn't sharing resources between processes.I solved this issue on uWSGI by adding the
lazy-apps = true
option, which causes is to fork app processes right out of the gate, rather than waiting for copy-on-write. I imagine other WSGI / FastCGI hosts have similar options.您是否考虑过降级到 Python 2.5.x(特别是 2.5.4)? 我认为 Django 在 Python 2.6 上不会被认为是成熟的,因为有一些向后不兼容的更改。 但是,我怀疑这能否解决您的问题。
另外,Django 1.0.2 修复了一些邪恶的小错误,因此请确保您正在运行它。 这很好地可以解决你的问题。
Have you considered downgrading to Python 2.5.x (2.5.4 specifically)? I don't think Django would be considered mature on Python 2.6 since there are some backwards incompatible changes. However, I doubt this will fix your problem.
Also, Django 1.0.2 fixed some nefarious little bugs so make sure you're running that. This very well could fix your problem.