LOC 计数是否应该包括测试和评论？

发布于 2024-07-07 22:01:54 字数 423 浏览 12 评论 0原文

虽然 LOC（代码行数）是衡量代码复杂性的一种有问题的方法，但它是最流行的一种，并且如果非常仔细地使用，可以提供至少对代码库的相对复杂性的粗略估计（即，如果一个程序是 10KLOC另一个是 100KLOC，由能力大致相同的团队用相同的语言编写，第二个程序几乎肯定要复杂得多）。

在计算代码行数时，您是否更喜欢计算中的注释？测试怎么样？

我见过各种不同的方法。 cloc 和 sloccount 等工具允许包含或排除注释。其他人认为注释是代码及其复杂性的一部分。

单元测试也存在同样的困境，有时可能会达到被测试代码本身的大小，甚至超过它。

我见过各种各样的方法，从仅计算“可操作”非注释非空白行，到“已测试、注释的代码的 XXX 行”，这更像是在所有代码文件上运行“wc -l”项目”。

您的个人偏好是什么，为什么？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

夏末染殇 2024-07-14 22:01:54

一位智者曾经告诉我，在管理程序员时，“你衡量什么，你就得到什么”。

如果您在 LOC 输出中对它们进行令人惊讶的评分，您往往会得到很多代码行。

如果你根据他们解决的错误数量来评价他们，你会惊奇地发现很多错误都被修复了。

如果您根据添加的功能对它们进行评分，您会获得很多功能。

如果你根据圈复杂度对它们进行评分，你会得到极其简单的函数。

由于当今代码库的主要问题之一是它们增长的速度有多快以及一旦增长就很难改变，所以我倾向于完全避免使用 LOC 作为衡量标准，因为它会导致错误的基本行为。

也就是说，如果您必须使用它，请不要添加注释和测试，并且需要一致的编码风格。

但如果您确实想要测量“代码大小”，只需 tar.gz 代码库即可。与计算行数相比，它往往可以更好地粗略估计“内容”，而行数容易受到不同编程风格的影响。

回复收藏 0 原文

伤感在游骋 2024-07-14 22:01:54

测试和评论也必须保留。如果你打算使用 LOC 作为衡量标准（我只是假设我无法说服你放弃它），你应该给出所有三行（真实代码行、注释、测试）。

最重要的（希望也是显而易见的）事情是你要保持一致。不要报告一个项目仅包含实际代码行，而另一个项目则包含所有三行代码。查找或创建一个工具，可以为您自动执行此过程并生成报告。

Lines of Code:       75,000
Lines of Comments:   10,000
Lines of Tests:      15,000
                  ---------
Total:              100,000

这样您就可以确定它会

完成。
每次都以同样的方式完成。

Tests and comments have to be maintained too. If you're going to use LOC as a metric (and I'm just going to assume that I can't talk you out of it), you should give all three (lines of real code, comments, tests).

The most important (and hopefully obvious) thing is that you be consistent. Don't report one project with just the lines of real code and another with all three combined. Find or create a tool that will automate this process for you and generate a report.

Lines of Code:       75,000
Lines of Comments:   10,000
Lines of Tests:      15,000
                  ---------
Total:              100,000

This way you can be sure it will

Get done.
Get done the same way every time.

回复收藏 0 原文

眼泪淡了忧伤 2024-07-14 22:01:54

我个人认为 LOC 指标本身不如其他一些代码指标那么有用。

NDepend 将为您提供 LOC 指标，但也会为您提供许多其他指标，例如循环复杂度。这里没有列出所有这些，而是列表的链接。

还有一个免费的 CodeMetric 插件，适用于反射器

回复收藏 0 原文

南薇 2024-07-14 22:01:54

我不会直接回答你的问题，原因很简单：我讨厌代码行度量。无论您想要衡量什么，都很难做得比 LOC 更差；几乎任何您想到的其他指标都会更好。

特别是，您似乎想要测量代码的复杂性。总体而言，循环复杂度（也称为 McCabe 复杂度）是更好的衡量标准。

具有高循环复杂度的例程是您想要集中注意力的例程。这些例程难以测试、充满错误且难以维护。

有许多工具可以测量这种复杂性。在 Google 上快速搜索您最喜欢的语言，您会发现数十种可以完成此类复杂操作的工具。

回复收藏 0 原文

孤芳又自赏 2024-07-14 22:01:54

代码行的确切含义是：不计算任何注释或空行。为了使其与其他源代码具有可比性（无论其中的指标是否有帮助），您至少需要类似的编码风格：

for (int i = 0; i < list.count; i++)
{
    // do some stuff
}

for (int i = 0; i < list.count; i++){
    // do some stuff
}

第二个版本的功能完全相同，但少了一个 LOC。当你有很多嵌套循环时，这可以总结很多。这就是发明像功能点这样的指标的原因。

Lines of Code means exactly that: No comments or empty lines are counted. And in order for it to be comparable to other source code (no matter if the metric in itsle fis helpful or not), you need at least similar coding styles:

for (int i = 0; i < list.count; i++)
{
    // do some stuff
}

for (int i = 0; i < list.count; i++){
    // do some stuff
}

The second version does exactly the same, but has one LOC less. When you have a lot of nested loops, this can sum up quite a bit. Which is why metrics like function points were invented.

回复收藏 0 原文

左秋 2024-07-14 22:01:54

取决于您使用 LOC 的用途。

作为复杂性衡量标准 - 没有那么多。也许 100KLOC 主要是从一个简单的表生成的代码，而 10KLOC 就是 5KLOC 正则表达式。

然而，我看到每一行代码都与运行成本相关。只要程序存在，您就需要为每一行付费：维护时需要读取它，它可能包含需要修复的错误，它会增加编译时间、从源代码控制获取和备份时间，然后再进行更改或者删除它，您可能需要查明是否有人依赖它等。平均成本可能是每条线和每天几纳便士，但它是加起来的东西。

KLOC 可以作为项目需要多少基础设施的第一手指标。在这种情况下，我将包括注释和测试 - 尽管注释行的运行成本远低于第二个项目中的正则表达式之一。

[编辑] [对代码大小有类似看法的人]1

回复收藏 0 原文

星星的轨迹 2024-07-14 22:01:54

我们只使用代码行度量来做一件事 - 函数应该包含足够少的代码行，以便在不滚动屏幕的情况下阅读。大于此值的函数通常难以阅读，即使它们的循环复杂度非常低。对于他的使用，我们确实计算了空格和注释。

很高兴看到您在重构过程中删除了多少行代码 - 在这里您只想计算实际的代码行、无助于可读性的空格和无助于阅读的注释。没有用（不能自动化）。

最后是免责声明 - 明智地使用指标。指标的一个很好的用途是帮助回答“代码的哪一部分将从重构中受益最多”或“最新签入的代码审查有多紧急？”的问题。 - 圈复杂度为 50 的 1000 行函数是一个闪烁的霓虹灯，上面写着“现在重构我”。衡量标准的错误使用是“程序员 X 的生产力如何”或“我的软件有多复杂”。

回复收藏 0 原文

趴在窗边数星星i 2024-07-14 22:01:54

文章摘录：如何计算代码行数 (LOC)？相对于计算逻辑 .NET 程序的代码行数。

如何计算代码行数 (LOC)？

你算方法签名声明吗？你计算只包含括号的行数吗？当单个方法调用由于参数较多而写在几行时，您是否会计算几行？您计算“命名空间”和“使用命名空间”声明吗？你算接口和抽象方法声明吗？声明字段时是否计算字段赋值？你算空行吗？

根据每个开发人员的编码风格以及选择的语言（C#、VB.NET…），通过测量 LOC 可能会出现显着差异。

显然，通过解析源文件来测量 LOC 看起来是一个复杂的主题。多亏了精明的人，有一种简单的方法可以准确测量所谓的逻辑 LOC。与物理 LOC（通过解析源文件推断出的 LOC）相比，逻辑 LOC 有 2 个显着优势：

编码风格不会干扰逻辑 LOC。例如，LOC 不会更改，因为由于参数数量较多，方法调用会在多行上生成。
逻辑 LOC 独立于语言。从用不同语言编写的程序集中获得的值是可比较的并且可以求和。

在 .NET 世界中，可以根据 PDB 文件计算逻辑 LOC，调试器使用这些文件将 IL 代码与源代码链接起来。 NDepend 工具以这种方式计算方法的逻辑 LOC：它等于在 PDB 文件中为方法找到的序列点的数量。序列点用于标记 IL 代码中与原始源中的特定位置相对应的点。有关序列点的更多信息请参见此处。请注意，不考虑与 C# 大括号“{”和“}”相对应的序列点。

显然，类型的 LOC 是其方法 LOC 的总和，命名空间的 LOC 是其类型 LOC 的总和，程序集的 LOC 是其命名空间 LOC 的总和，应用程序的 LOC 是其程序集 LOC 的总和。以下是一些观察结果：

接口、抽象方法和枚举的 LOC 等于 0。在计算 LOC 时，仅考虑有效执行的具体代码。
命名空间、类型、字段和方法声明不被视为代码行，因为它们没有相应的序列点。
当 C# 或 VB.NET 编译器面对内联实例字段初始化时，它会为每个实例构造函数生成一个序列点（相同的注释适用于内联静态字段初始化和静态构造函数）。
从匿名方法计算的 LOC 不会干扰其外部声明方法的 LOC。
NbILInstructions 和 LOC（在 C# 和 VB.NET 中）之间的总体比率通常约为 7。

Excerpt from the article: How do you count your number of Lines Of Code (LOC) ? relative to the tool NDepend that counts the logical numbers of lines of code for .NET programs.

How do you count your number of Lines Of Code (LOC) ?

Do you count method signature declaration? Do you count lines with only bracket? Do you count several lines when a single method call is written on several lines because of a high number of parameters? Do you count ‘namespaces’ and ‘using namespace’ declaration? Do you count interface and abstract methods declaration? Do you count fields assignment when they are declared? Do you count blank line?

Depending on the coding style of each of developer and depending on the language choose (C#, VB.NET…) there can be significant difference by measuring the LOC.

Apparently measuring the LOC from parsing source files looks like a complex subject. Thanks to an astute there exists a simple way to measure exactly what is called the logical LOC. The logical LOC has 2 significant advantages over the physical LOC (the LOC that is inferred from parsing source files):

Coding style doesn’t interfere with logical LOC. For example the LOC won’t change because a method call is spawn on several lines because of a high number of arguments.
Logical LOC is independent from the language. Values obtained from assemblies written with different languages are comparable and can be summed.

In the .NET world, the logical LOC can be computed from the PDB files, the files that are used by the debugger to link the IL code with the source code. The tool NDepend computes the logical LOC for a method this way: it is equals to the number of sequence point found for a method in the PDB file. A sequence point is used to mark a spot in the IL code that corresponds to a specific location in the original source. More info about sequence points here. Notice that sequence points which correspond to C# braces‘{‘ and ‘}’ are not taken account.

Obviously, the LOC for a type is the sum of its methods’ LOC, the LOC for a namespace is the sum of its types’ LOC, the LOC for an assembly is the sum of its namespaces’ LOC and the LOC for an application is the sum of its assemblies LOC. Here are some observations:

Interfaces, abstract methods and enumerations have a LOC equals to 0. Only concrete code that is effectively executed is considered when computing LOC.
Namespaces, types, fields and methods declarations are not considered as line of code because they don’t have corresponding sequence points.
When the C# or VB.NET compiler faces an inline instance fields initialization, it generates a sequence point for each of the instance constructor (the same remark applies for inline static fields initialization and static constructor).
LOC computed from an anonymous method doesn’t interfere with the LOC of its outer declaring methods.
The overall ratio between NbILInstructions and LOC (in C# and VB.NET) is usually around 7.

回复收藏 0 原文

~没有更多了~