是否应该在每次构建时执行代码覆盖率？

发布于 2024-11-17 13:15:03 字数 822 浏览 10 评论 0原文

我是棕地应用程序开发的忠实粉丝。毫无疑问，这是一本很棒的书，我会向所有开发人员推荐它。我来这里是因为我明白了书中关于代码覆盖率的要点。在我的新商店，我们使用 Team City 进行自动化构建/持续集成，构建完成大约需要 40 分钟。《Brownfield》一书讨论了无摩擦开发以及我们希望如何减轻开发人员必须承受的常见负担。这是我在第 130 页读到的内容。

“代码覆盖率：两个进程的价格？正如您从清单 5.2 中的示例目标中看到的，您最终得到两个输出文件：一份包含测试结果，一份包含代码覆盖率结果。这是因为你实际上正在执行此任务期间的测试。

如果您正在运行，从技术上讲，您不需要在单独的任务中执行测试代码覆盖率任务。出于这个原因，许多团队将替代自动化的测试任务的代码覆盖率任务，本质上是在 CI 流程。 CI服务器将编译代码、测试代码并生成代码覆盖率每次签到的统计数据。

尽管这种方法在概念上没有任何问题，但请注意一些问题缺点。首先，生成代码覆盖率统计数据会产生开销。什么时候有很多测试，这种开销可能足以导致摩擦长时间运行的自动构建脚本的形式。请记住，主要构建脚本应该尽可能快地运行，以鼓励团队成员经常运行它。如果运行时间太长，您可能会发现开发人员正在寻找解决方法。

由于这些原因，我们建议单独执行代码覆盖率任务构建脚本的默认任务。它应该定期运行，也许作为构建文件中的单独计划任务每两周甚至每月执行一次，但我们不认为该指标有足够的好处来保证额外的开销它在每次签入时执行。“

这与我当前商店的做法相反，我们每次构建都执行 NCover。我想去找我的领导并要求我们不要这样做，但我能做的最好的就是告诉他”这就是《布朗菲尔德》书中所说的“。我认为这还不够好。所以我依靠你们向我提供有关这个主题的个人经验和建议。谢谢。

原文

I'm a huge fan of Brownfield Application Development. A great book no doubt and I'd recommend it to all devs out there. I'm here because I got to the point in the book about code coverage. At my new shop, we're using Team City for automated builds/continuous integration and it takes about 40 minutes for the build to complete. The Brownfield book talks all about frictionless development and how we want to ease the common burdens that developers have to endure. Here's what I read on page 130..

"Code coverage: Two processes for the price of one?
As you can see from the sample target in listing 5.2, you end up with two output files:
one with the test results and one with the code coverage results. This is because you
actually are executing your tests during this task.

You don’t technically need to execute your tests in a separate task if you’re running
the code coverage task. For this reason, many teams will substitute an automated
code coverage task for their testing task, essentially performing both actions in the
CI process. The CI server will compile the code, test it, and generate code coverage
stats on every check-in.

Although there’s nothing conceptually wrong with this approach, be aware of some
downsides. First, there’s overhead to generating code coverage statistics. When
there are a lot of tests, this overhead could be significant enough to cause friction in
the form of a longer-running automated build script. Remember that the main build
script should run as fast as possible to encourage team members to run it often. If
it takes too long to run, you may find developers looking for workarounds.

For these reasons, we recommend executing the code coverage task separately from
the build script’s default task. It should be run at regular intervals, perhaps as a separate scheduled task in your build file that executes biweekly or even monthly, but we
don’t feel there’s enough benefit to the metric to warrant the extra overhead of having
it execute on every check-in."

This is contrary to the practice at my current shop were we execute NCover per build. I want to go to my lead and request we not do this, but the best I can do is tell him "this is what the Brownfield book says". I don't think that's good enough. So I'm relying on you guys to fill me in with your personal experiences and advice on this topic. Thanks.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

断念 2024-11-24 13:15:03

持续集成/自动化构建系统中总是存在两个相互竞争的利益：

您希望构建尽可能快地运行
您希望构建在运行时提供尽可能多的反馈（例如，运行最多数量的测试、最多信息量）构建的稳定性和覆盖范围等）

您将始终需要在这些相互竞争的利益之间进行权衡并找到平衡。我通常尝试将构建时间控制在 10 分钟以内，并且如果需要超过 20 分钟才能提供有关构建稳定性的任何有意义的反馈，则会认为构建系统已损坏。但这并不需要是测试每种情况的完整构建；稍后可能会运行其他测试或在其他计算机上并行运行以进一步测试系统。

如果您发现构建时间为 40 分钟，我建议您尽快执行以下操作之一：

将构建/测试分发到多台计算机上，以便测试可以并行运行，并且您可以获得更快的
反馈在您的构建中花费了大量时间，但没有提供大量好处，并且仅将这些任务作为夜间构建的一部分执行，

如果可能的话，我会100％推荐第一个解决方案。然而，有时硬件无法立即可用，我们必须做出牺牲。

代码覆盖率是一个相对稳定的指标，因为代码覆盖率数字在一天内急剧恶化的情况相对较少。因此，如果代码覆盖率需要很长时间才能执行，那么它在每次构建时都发生并不重要。但您仍然应该尝试至少每晚获取一次代码覆盖率数字。夜间构建可以允许花费更长的时间，因为（大概）不会有人等待它们，但它们仍然会定期提供有关项目状态的反馈，并确保不会引入很多不可预见的问题。

也就是说，如果您能够让硬件进行更多分布式或并行构建/测试，您绝对应该走这条路 - 它将确保您的开发人员尽快知道他们是否破坏了某些内容或在系统中引入了问题。构建系统的快速反馈所带来的生产力的提高将很快收回硬件成本。

另外，如果您的构建机器不是持续工作（即有很多时间空闲），那么我建议将其设置为执行以下操作：

当代码发生更改时，进行构建和测试。忽略一些运行时间较长的任务，包括潜在的代码覆盖率。
一旦这个构建/测试周期完成（或并行），启动一个更长的构建，更彻底地测试事物，进行代码覆盖率等
这两个构建都应该提供有关系统运行状况的反馈

这样，您就可以获得快速反馈，而且只要构建机器有能力，就可以为每个构建进行更多扩展的测试。

There are always two competing interests in continuous integration / automated build systems:

You want the build to run as quickly as possible
You want the build to run with as much feedback as possible (e.g. the most number of tests run, the most amount of information available about the build's stability and coverage, etc)

You will always need to make tradeoffs and find a balance between these competing interests. I usually try to keep my build times under 10 minutes, and will consider build systems broken if it takes more than about 20 minutes to give any sort of meaningful feedback about the build's stability. But this doesn't need to be a complete build that tests every case; there may be additional tests that are run later or in parallel on other machines to further test the system.

If you are seeing build times of 40 minutes, I would recommend you do one of the following as soon as possible:

Distribute the build/testing onto multiple machines, so that tests can be run in parallel and you can get faster feedback
Find things that are taking a lot of time in your build but are not giving a great amount of benefit, and only do those tasks as part of a nightly build

I would 100% recommend the first solution if at all possible. However, sometimes the hardware isn't available right away and we have to make sacrifices.

Code coverage is a relatively stable metric, in that it is relatively rare that your code coverage numbers would get dramatically worse within a single day. So if the code coverage is taking a long time to perform, then it's not really critical that it occurs on every build. But you should still try to get code coverage numbers at least once a night. Nightly builds can be allowed to take a bit longer, since there (presumably) won't be anybody waiting on them, but they still provide regular feedback about your project's status and ensure there aren't lots of unforeseen problems being introduced.

That said, if you are able to get the hardware to do more distributed or parallel building/testing, you should definitely go that route - it will ensure that your developers know as soon as possible if they broke something or introduced a problem in the system. The cost of the hardware will quickly pay itself back in the increased productivity that occurs from the rapid feedback of the build system.

Also, if your build machine is not constantly working (i.e. there is a lot of time when it is idle), then I would recommend setting it up to do the following:

When there is a code change, do a build and test. Leave out some of the longer running tasks, including potentially code coverage.
Once this build/test cycle completes (or in parallel), kick off a longer build that tests things more thoroughly, does code coverage, etc
Both of these builds should give feedback about the health of the system

That way, you get the quick feedback, but also get the more extended tests for every build, so long as the build machine has the capacity for it.

回复收藏 0 原文