当前位置：文江博客话题详情

多少测试就足够了？

发布于 2024-07-26 04:32:08 字数 226 浏览 6 评论 0原文

我最近花了大约 70% 的时间编写集成测试的功能。有一次，我在想“该死，所有这些艰苦的测试工作，我知道我这里没有错误，为什么我要这么努力？让我们浏览一下测试并完成它吧……”

五分钟后，测试失败了。详细检查表明，这是我们正在使用的第三方库中的一个重要的未知错误。

那么……对于要检验什么、要相信什么，你的界限在哪里呢？您是否测试了所有内容，或者测试了您预计会出现大多数错误的代码？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

著墨染雨君画夕 2024-08-02 04:32:09

好问题！

首先 - 听起来你广泛的集成测试得到了回报:)

从我个人的经验来看：

如果它是一个“绿色领域”新项目，
我喜欢执行严格的单元测试
并有一个彻底的（彻底的
可能）集成测试计划
设计的。
如果它是现有的软件
测试覆盖率很差，那么我
更喜欢设计一套集成
测试特定/已知的测试
功能。那我介绍一下
测试（单元/集成）如我
代码库取得进一步进展。

多少才够呢？棘手的问题——我认为还不够！

回复收藏 0 原文

下雨或天晴 2024-08-02 04:32:09

“凡事太多就够了。”

我不遵循严格的 TDD 实践。我尝试编写足够的单元测试来覆盖所有代码路径并练习我认为重要的任何边缘情况。基本上我会尝试预测可能会出现什么问题。我还尝试将我编写的测试代码量与我认为被测代码的脆弱性或重要性相匹配。

我在一个方面很严格：如果发现错误，我首先编写一个测试来执行该错误并失败，然后更改代码并验证测试是否通过。

回复收藏 0 原文

茶花眉 2024-08-02 04:32:09

Gerald Weinberg 的经典著作《计算机编程心理学》有很多关于测试的好故事。我特别喜欢的一个是第 4 章“编程作为一种社交活动”“Bill”要求一位同事检查他的代码，他们仅在 13 个语句中发现了 17 个错误。代码审查提供了额外的眼睛来帮助发现错误，您使用的眼睛越多，发现如此微妙的错误的机会就越大。就像莱纳斯所说，“只要有足够多的眼球，所有的错误都是浅薄的”，你的测试基本上是机器人的眼睛，它们会在白天或晚上的任何时间根据你的需要多次检查你的代码，并让你知道一切是否仍然正常。

多少测试就足够取决于您是从头开始开发还是维护现有系统。

从头开始时，您不希望花费所有时间编写测试并最终无法交付，因为您能够编码的 10% 的功能都经过了详尽的测试。需要确定一些优先级。一个例子是私有方法。由于私有方法必须由以某种形式（公共/包/受保护）可见的代码使用，因此可以认为私有方法被覆盖在更可见的方法的测试中。如果私有代码中有一些重要或模糊的行为或边缘情况，则需要在此处包含一些白盒测试。

测试应该帮助您确保 1) 了解需求，2) 通过编码实现可测试性，遵守良好的设计实践，3) 了解以前现有的代码何时停止工作。如果您无法描述某些功能的测试，我敢打赌您对该功能的理解还不够透彻，无法干净地编写代码。使用单元测试代码迫使您做一些事情，例如将数据库连接或实例工厂等重要的事情作为参数传递，而不是屈服于让类本身做太多事情并变成“上帝”对象的诱惑。让你的代码成为你的金丝雀意味着你可以自由地编写更多代码。当先前通过的测试失败时，这意味着以下两种情况之一：要么代码不再执行预期的操作，要么功能的要求已更改，并且只需更新测试即可满足新的要求。

在使用现有代码时，您应该能够证明所有已知场景都已涵盖，这样当下一个更改请求或错误修复出现时，您就可以自由地深入研究您认为合适的任何模块，而不必担心，”如果我破坏了某些东西怎么办”，这会导致花费更多的时间来测试甚至是小的修复，然后才实际更改代码。

因此，我们无法为您提供严格且快速的测试数量，但您应该争取一定程度的覆盖范围，以增强您对不断进行更改或添加功能的能力的信心，否则您可能已经达到了收益递减的地步。

Gerald Weinberg's classic book "The Psychology of Computer Programming" has lots of good stories about testing. One I especially like is in Chapter 4 "Programming as a Social Activity" "Bill" asks a co-worker to review his code and they find seventeen bugs in only thirteen statements. Code reviews provide additional eyes to help find bugs, the more eyes you use the better chance you have of finding ever-so-subtle bugs. Like Linus said, "Given enough eyeballs, all bugs are shallow" your tests are basically robotic eyes who will look over your code as many times as you want at any hour of day or night and let you know if everything is still kosher.

How many tests are enough does depend on whether you are developing from scratch or maintaining an existing system.

When starting from scratch, you don't want to spend all your time writing test and end up failing to deliver because the 10% of the features you were able to code are exhaustively tested. There will be some amount of prioritization to do. One example is private methods. Since private methods must be used by the code which is visible in some form (public/package/protected) private methods can be considered to be covered under the tests for the more-visible methods. This is where you need to include some white-box tests if there are some important or obscure behaviors or edge cases in the private code.

Tests should help you make sure you 1) understand the requirements, 2) adhere to good design practices by coding for testability, and 3) know when previously existing code stops working. If you can't describe a test for some feature, I would be willing to bet that you don't understand the feature well enough to code it cleanly. Using unit test code forces you to do things like pass in as arguments those important things like database connections or instance factories instead of giving in to the temptation of letting the class do way too much by itself and turning into a 'God' object. Letting your code be your canary means that you are free to write more code. When a previously passing test fails it means one of two things, either the code no longer does what was expected or that the requirements for the feature have changed and the test simply needs to be updated to fit the new requirements.

When working with existing code, you should be able to show that all the known scenarios are covered so that when the next change request or bug fix comes along, you will be free to dig into whatever module you see fit without the nagging worry, "what if I break something" which leads to spending more time testing even small fixes then it took to actually change the code.

So, we can't give you a hard and fast number of tests but you should shoot for a level of coverage which increases your confidence in your ability to keep making changes or adding features, otherwise you've probably reached the point of diminished returns.

回复收藏 0 原文