集成 CherryPy Web 框架、SQLAlchemy 会话和 lighttpd 以提供高负载 Web 服务的最佳实践

发布于 2024-07-14 15:39:36 字数 2560 浏览 12 评论 0原文

我正在 lighttpd 后面开发一个 CherryPy FastCGI 服务器，并使用以下设置来启用在 CherryPy 控制器内使用 ORM SQLAlchemy 会话。但是，当我对大约 500 个循环的 14 个并发请求运行压力测试时，它开始出现类似 AttributeError: '_ThreadData' object has no attribute 'scoped_session_class' in open_dbsession() 的错误code> 或 AttributeError: 'Request' 对象在 close_dbsession() 中没有属性 'scoped_session_class' 一段时间后。总体错误率约为50%。

仅当我在lighttpd后面运行服务器时才会发生这种情况，而不是直接通过cherrypy.engine.start()运行时会发生这种情况。已确认 connect() 不会引发异常。

我还尝试将 scoped_session 的返回值分配给 GlobalSession （就像它此处），但随后它给出了诸如 UnboundExceptionError 和其他 SA 级错误之类的错误。（并发数：10，循环数：1000，错误率：16%。即使直接运行也会发生。）

有一些可能的原因，但我缺乏足够的知识来选择一个。
1. FastCGI环境下start_thread订阅是否不可靠？似乎是在 connect() 之前调用了 open_dbsession()
2. cherrypy.thread_data 是否因某种原因被清除？

服务器代码

import sqlalchemy as sa  
from sqlalchemy.orm import session_maker, scoped_session

engine = sa.create_engine(dburi, strategy="threadlocal")  
GlobalSession = session_maker(bind=engine, transactional=False)

def connect(thread_index):  
    cherrypy.thread_data.scoped_session_class = scoped_session(GlobalSession)

def open_dbsession():  
    cherrypy.request.scoped_session_class = cherrypy.thread_data.scoped_session_class

def close_dbsession():  
    cherrypy.request.scoped_session_class.remove()


cherrypy.tools.dbsession_open = cherrypy.Tool('on_start_resource', open_dbsession)  
cherrypy.tools.dbsession_close = cherrypy.Tool('on_end_resource', close_dbsession)  
cherrypy.engine.subscribe('start_thread', connect)

lighttpd fastcgi 配置

...
var.server_name = "test"
var.server_root = "/path/to/root"
var.svc_env = "test"
fastcgi.server = (
  "/" => (
    "cherry.fcgi" => (
      "bin-path" => server_root + "/fcgi_" + server_name + ".fcgi",
      "bin-environment" => (
        "SVC_ENV" => svc_env
      ),
      "bin-copy-environment" => ("PATH", "LC_CTYPE"),
      "socket" => "/tmp/cherry_" + server_name + "." + svc_env + ".sock",
      "check-local" => "disable",
      "disable-time"    => 1,
      "min-procs"       => 1,
      "max-procs"       => 4,
    ),
  ),
)

编辑

从原始源代码中恢复了代码示例中缺少的 thread_index 参数（感谢评论）
澄清错误不会立即发生
将条件范围缩小到 lighttpd

原文

I'm developing a CherryPy FastCGI server behind lighttpd with the following setup to enable using ORM SQLAlchemy sessions inside CherryPy controllers. However, when I run stress tests with 14 concurrent requests for about 500 loops, it starts to give errors like AttributeError: '_ThreadData' object has no attribute 'scoped_session_class' in open_dbsession() or AttributeError: 'Request' object has no attribute 'scoped_session_class' in close_dbsession() after a while. The error rate is around 50% in total.

This happens only when I run the server behind lighttpd, not when it's run directly through cherrypy.engine.start(). It's confirmed that connect() isn't raising exceptions.

I also tried assigning the return value of scoped_session to GlobalSession (like it does here), but then it gave out errors like UnboundExceptionError and other SA-level errors. (Concurrency: 10, loops: 1000, error rate: 16%. Occurs even when run directly.)

There are some possible causes but I lack sufficient knowledge to pick one.
1. Are start_thread subscriptions unreliable under FastCGI environment? It seems that open_dbsession() is called before connect()
2. Does cherrypy.thread_data get cleared for some reason?

server code

import sqlalchemy as sa  
from sqlalchemy.orm import session_maker, scoped_session

engine = sa.create_engine(dburi, strategy="threadlocal")  
GlobalSession = session_maker(bind=engine, transactional=False)

def connect(thread_index):  
    cherrypy.thread_data.scoped_session_class = scoped_session(GlobalSession)

def open_dbsession():  
    cherrypy.request.scoped_session_class = cherrypy.thread_data.scoped_session_class

def close_dbsession():  
    cherrypy.request.scoped_session_class.remove()


cherrypy.tools.dbsession_open = cherrypy.Tool('on_start_resource', open_dbsession)  
cherrypy.tools.dbsession_close = cherrypy.Tool('on_end_resource', close_dbsession)  
cherrypy.engine.subscribe('start_thread', connect)

lighttpd fastcgi config

...
var.server_name = "test"
var.server_root = "/path/to/root"
var.svc_env = "test"
fastcgi.server = (
  "/" => (
    "cherry.fcgi" => (
      "bin-path" => server_root + "/fcgi_" + server_name + ".fcgi",
      "bin-environment" => (
        "SVC_ENV" => svc_env
      ),
      "bin-copy-environment" => ("PATH", "LC_CTYPE"),
      "socket" => "/tmp/cherry_" + server_name + "." + svc_env + ".sock",
      "check-local" => "disable",
      "disable-time"    => 1,
      "min-procs"       => 1,
      "max-procs"       => 4,
    ),
  ),
)

edits

Restored the missing thread_index argument in the code example from the original source code (thanks to the comment)
Clarified that errors do not occur immediately
Narrowed down the conditions to lighttpd

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

谁把谁当真 2024-07-21 15:39:36

如果您查看 plugins.ThreadManager.acquire_thread，您将看到 self.bus.publish('start_thread', i) 行，其中 i 是所见线程的数组索引。订阅 start_thread 通道的任何侦听器都需要接受该 i 值作为位置参数。因此，重写您的连接函数以读取： def connect(i):

我猜这会以某种方式默默失败；我会看看是否可以找到它、测试并修复它。

回复收藏 0 原文

心意如水 2024-07-21 15:39:36

我还尝试将scoped_session的返回值分配给GlobalSession（就像这里所做的那样），但随后它给出了诸如UnboundExceptionError和其他SA级错误之类的错误。（并发数：10，循环数：1000，错误率：16%）

如果我没有显式实例化scoped_session类，则不会发生此错误。

GlobalSession = scoped_session(session_maker(bind=engine, transactional=False))

def connect(thread_index):  
    cherrypy.thread_data.scoped_session_class = GlobalSession

def open_dbsession():  
    cherrypy.request.scoped_session_class = cherrypy.thread_data.scoped_session_class

def close_dbsession():  
    cherrypy.request.scoped_session_class.remove()

I also tried assigning the return value of scoped_session to GlobalSession (like it does here), but then it gave out errors like UnboundExceptionError and other SA-level errors. (Concurrency: 10, loops: 1000, error rate: 16%)

This error did not occur if I didn't instantiate the scoped_session class explicitly.

i.e.

GlobalSession = scoped_session(session_maker(bind=engine, transactional=False))

def connect(thread_index):  
    cherrypy.thread_data.scoped_session_class = GlobalSession

def open_dbsession():  
    cherrypy.request.scoped_session_class = cherrypy.thread_data.scoped_session_class

def close_dbsession():  
    cherrypy.request.scoped_session_class.remove()

回复收藏 0 原文

~没有更多了~