如何使用有状态Python模块正确实现测试隔离？

发布于 2024-11-01 04:15:41 字数 1047 浏览 6 评论 0原文

我正在从事的项目是一个封装为 Python 包的业务逻辑软件。这个想法是，各种脚本或应用程序将导入它，初始化它，然后使用它。

它当前有一个顶级 init() 方法，可以执行初始化并设置各种内容，一个很好的例子是它设置 SQLAlchemy 具有数据库连接并存储 SA 会话以供以后访问。它存储在我的项目的子包中（即 myproj.model.Session，因此其他代码可以在导入模型后获得工作 SA 会话）。

长话短说，这使我的包裹成为一个有状态的包裹。我正在为项目编写单元测试，这种安全行为带来了一些问题：

测试应该被隔离，但我的包的内部状态打破了这种隔离
我无法测试主要的 init() 方法，因为它的行为取决于
未来测试的状态需要针对具有众所周知的模型状态的（尚未编写的）控制器部分运行（例如，预先填充的 sqlite 内存数据库）

我是否应该以某种方式重构我的包，因为当前的结构不是最佳（可能）实践（tm）？ :)

我应该就这样保留它并每次都设置/拆卸整个过程吗？如果我要实现完全隔离，这意味着在每次测试时都完全擦除并重新填充数据库，这不是矫枉过正吗？

这个问题实际上是关于整体代码&测试结构，但为了它的价值，我使用 nose-1.0我的测试。我知道隔离插件可能会帮助我，但我希望在测试套件中做奇怪的事情之前先得到正确的代码。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

梦醒灬来后我 2024-11-08 04:15:41

您有几个选择：

模拟数据库

有一些权衡需要注意。

您的测试将变得更加复杂，因为您必须进行连接的设置、拆卸和模拟。您可能还想验证发送的 SQL/命令。它还往往会产生一种奇怪的紧密耦合，这可能会导致您在架构或 SQL 更改时花费额外的时间来维护/更新测试。

这通常是最纯粹的测试隔离，因为它减少了测试的潜在巨大依赖性。它还往往会使测试更快，并减少在持续集成环境中自动化测试套件的开销。

在每次测试时重新创建数据库

需要注意的权衡。

这可能会使您的测试非常慢，具体取决于重新创建数据库实际花费的时间。如果开发数据库服务器是共享资源，则必须进行额外的初始投资以确保每个开发人员在服务器上拥有自己的数据库。服务器可能会受到影响，具体取决于测试运行的频率。在持续集成环境中运行测试套件会产生额外的开销，因为它至少需要，可能更多的数据库（取决于同时构建的分支数量）。

好处与实际运行将在生产中使用的相同代码路径和类似资源有关。这通常有助于尽早发现错误，这总是一件非常好的事情。

ORM DB 交换

如果您使用像 SQLAlchemy 这样的 ORM，您可以将底层数据库与可能更快的内存数据库进行交换。这可以让您减轻前面两个选项的一些负面影响。

它与生产中使用的数据库并不完全相同，但 ORM 应该有助于减轻掩盖错误的风险。通常，设置内存数据库的时间比文件支持的数据库要短得多。它还具有与当前测试运行隔离的好处，因此您不必担心共享资源管理或最终拆卸/清理。

You have a few options:

Mock the database

There are a few trade offs to be aware of.

Your tests will become more complex as you will have to do the setup, teardown and mocking of the connection. You may also want to do verification of the SQL/commands sent. It also tends to create an odd sort of tight coupling which may cause you to spend additonal time maintaining/updating tests when the schema or SQL changes.

This is usually the purest for of test isolation because it reduces a potentially large dependency from testing. It also tends to make tests faster and reduces the overhead to automating the test suite in say a continuous integration environment.

Recreate the DB with each Test

Trade offs to be aware of.

This can make your test very slow depending on how much time it actually takes to recreate your database. If the dev database server is a shared resource there will have to be additional initial investment in making sure each dev has their own db on the server. The server may become impacted depending on how often tests get runs. There is additional overhead to running your test suite in a continuous integration environment because it will need at least, possibly more dbs (depending on how many branches are being built simultaneously).

The benefit has to do with actually running through the same code paths and similar resources that will be used in production. This usually helps to reveal bugs earlier which is always a very good thing.

ORM DB swap

If your using an ORM like SQLAlchemy their is a possibility that you can swap the underlying database with a potentially faster in-memory database. This allows you to mitigate some of the negatives of both the previous options.

It's not quite the same database as will be used in production, but the ORM should help mitigate the risk that obscures a bug. Typically the time to setup an in-memory database is much shorter that one which is file-backed. It also has the benefit of being isolated to the current test run so you don't have to worry about shared resource management or final teardown/cleanup.

回复收藏 0 原文