在C+&#x2B中使用下划线的规则是什么。标识符?

发布于 2025-02-11 07:55:43 字数 195 浏览 3 评论 0 原文

在C ++中常见的是将成员变量命名具有某种前缀的成员变量,以表示它们是成员变量而不是局部变量或参数的事实。如果您来自MFC背景,则可能会使用 M_FOO 。我还偶尔也看过 myfoo

C#(或可能只是.NET)似乎建议仅使用下划线,例如 _foo 。这是C ++标准允许的吗?

It's common in C++ to name member variables with some kind of prefix to denote the fact that they're member variables, rather than local variables or parameters. If you've come from an MFC background, you'll probably use m_foo. I've also seen myFoo occasionally.

C# (or possibly just .NET) seems to recommend using just an underscore, as in _foo. Is this allowed by the C++ standard?

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(6

夜雨飘雪 2025-02-18 07:55:43

规则(在C ++ 11中没有更改):

从2003 C ++标准开始:

17.4.3.1.2全局名称[lib.global.names]

某些名称和功能签名始终保留到实现:

  • 每个包含双重下划线的名称( __ )或以下划线开始,然后将大写字母(2.11)保留到实现中以供任何用途。
  • 以下划线开头的每个名称都保留到实现中,以用作全局名称空间中的名称。 165

165)此类名称也保留在命名空间 :: std (17.4.3.1)。

C ++语言基于C语言(1.1/2,C ++ 03),而C99是标准参考(1.2/1,C ++ 03),因此了解1999 C标准的限制很有用(尽管它们不直接适用于C ++):

7.1.3保留标识符

每个标头均声明或定义了其关联子中列出的所有标识符,并且
可选地声明或定义了其关联的未来库方向列出的标识符,该标识符始终保留用于任何用途或用作文件范围标识符。

  • 所有以下划线和大写字母或另一个开始的标识符
    下划线总是保留用于任何用途。
  • 所有以下划线开头的标识符始终保留作为标识符
    在普通名称空间和标签名称空间中都有文件范围。
  • 以下任何一个子条款中的每个宏名称(包括未来库
    如果包括其相关的任何标头,则保留说明供使用;
    除非另有明确说明(请参阅7.1.4)。
  • 所有具有外部链接的标识符在以下任何一个子条款中(包括
    未来图书馆的说明)始终保留作为外部的标识符
    链接。 154
  • 每个标识符都在以下任何一个子条款中列出的文件范围(包括
    未来图书馆的指示)保留作为宏名称和标识符。
    如果包括任何关联的标头,则在同名空间中以同一名称空间为单位。

没有其他标识符保留。如果程序声明或定义标识符
保留它的上下文(除了7.1.4允许以外)或定义保留的上下文
标识符是宏名称,行为是未定义的。

如果程序删除(使用 #undef )在第一个标识符中的任何宏定义
上面列出的组,行为是不确定的。

154)带有外部链接的预留标识符列表包括 errno Math_errhandling setjmp 和<代码> va_end 。

其他限制可能适用。例如,POSIX标准保留许多可能以正常代码显示的标识符:

  • 以资本 e 开头的名称遵循数字或大写字母:
    • 可用于其他错误代码名称。
  • 开头的名称为或 ,然后是小写字母:
    • 可用于其他字符测试和转换功能。
  • lc _ 开头的名称,然后是大写字母:
    • 可用于指定语言环境属性的其他宏。
  • 保留了所有现有数学功能的名称。
    • 分别在浮点和长时间参数上运行的相应函数。
  • sig 开头的名称,然后保留大写字母:
    • 有关其他信号名称。
  • sig _ 开始的名称,然后保留大写字母:
    • 用于其他信号动作。
  • 名称以开始 mem wcs ,然后保留一个小写字母:
    • 对于其他字符串和数组功能。
  • 名称以开始pri scn ,然后保留任何小写字母或 x
    • 对于其他格式指定宏
  • 对于以 _T 结尾的 名称,请保留:
    • 对于其他类型名称。

尽管现在出于您自己的目的使用这些名称可能不会引起问题,但它们确实增加了与该标准的未来版本发生冲突的可能性。


就我个人而言,我只是没有开始使用下划线标识符。我规则的新增加:不要在任何地方使用双重下划线,这很容易,因为我很少使用下划线。

在对本文进行研究之后,我不再使用 _T 结束标识符
因为这是由POSIX标准保留的。

任何标识符以 _T 结尾的规则令我感到惊讶。我认为这是一个Posix标准(尚不确定),以寻找澄清以及官方章节和经文。这是从 gnu libtool手册

cesarb提供了以下链接,指向 posix 2008 保留的符号在此处定义。这些限制比上面的限制更为细微。

The rules (which did not change in C++11):

  • Reserved in any scope, including for use as implementation macros:
    • identifiers beginning with an underscore followed immediately by an uppercase letter
    • identifiers containing adjacent underscores (or "double underscore")
  • Reserved in the global namespace:
    • identifiers beginning with an underscore
  • Also, everything in the std namespace is reserved. (You are allowed to add template specializations, though.)

From the 2003 C++ Standard:

17.4.3.1.2 Global names [lib.global.names]

Certain sets of names and function signatures are always reserved to the implementation:

  • Each name that contains a double underscore (__) or begins with an underscore followed by an uppercase letter (2.11) is reserved to the implementation for any use.
  • Each name that begins with an underscore is reserved to the implementation for use as a name in the global namespace.165

165) Such names are also reserved in namespace ::std (17.4.3.1).

The C++ language is based on the C language (1.1/2, C++03), and C99 is a normative reference (1.2/1, C++03), so it's useful to know the restrictions from the 1999 C Standard (although they do not apply to C++ directly):

7.1.3 Reserved identifiers

Each header declares or defines all identifiers listed in its associated subclause, and
optionally declares or defines identifiers listed in its associated future library directions subclause and identifiers which are always reserved either for any use or for use as file scope identifiers.

  • All identifiers that begin with an underscore and either an uppercase letter or another
    underscore are always reserved for any use.
  • All identifiers that begin with an underscore are always reserved for use as identifiers
    with file scope in both the ordinary and tag name spaces.
  • Each macro name in any of the following subclauses (including the future library
    directions) is reserved for use as specified if any of its associated headers is included;
    unless explicitly stated otherwise (see 7.1.4).
  • All identifiers with external linkage in any of the following subclauses (including the
    future library directions) are always reserved for use as identifiers with external
    linkage.154
  • Each identifier with file scope listed in any of the following subclauses (including the
    future library directions) is reserved for use as a macro name and as an identifier with
    file scope in the same name space if any of its associated headers is included.

No other identifiers are reserved. If the program declares or defines an identifier in a
context in which it is reserved (other than as allowed by 7.1.4), or defines a reserved
identifier as a macro name, the behavior is undefined.

If the program removes (with #undef) any macro definition of an identifier in the first
group listed above, the behavior is undefined.

154) The list of reserved identifiers with external linkage includes errno, math_errhandling, setjmp, and va_end.

Other restrictions might apply. For example, the POSIX standard reserves a lot of identifiers that are likely to show up in normal code:

  • Names beginning with a capital E followed a digit or uppercase letter:
    • may be used for additional error code names.
  • Names that begin with either is or to followed by a lowercase letter:
    • may be used for additional character testing and conversion functions.
  • Names that begin with LC_ followed by an uppercase letter:
    • may be used for additional macros specifying locale attributes.
  • Names of all existing mathematics functions suffixed with f or l are reserved:
    • for corresponding functions that operate on float and long double arguments, respectively.
  • Names that begin with SIG followed by an uppercase letter are reserved:
    • for additional signal names.
  • Names that begin with SIG_ followed by an uppercase letter are reserved:
    • for additional signal actions.
  • Names beginning with str, mem, or wcs followed by a lowercase letter are reserved:
    • for additional string and array functions.
  • Names beginning with PRI or SCN followed by any lowercase letter or X are reserved:
    • for additional format specifier macros
  • Names that end with _t are reserved:
    • for additional type names.

While using these names for your own purposes right now might not cause a problem, they do raise the possibility of conflict with future versions of that standard.


Personally I just don't start identifiers with underscores. New addition to my rule: Don't use double underscores anywhere, which is easy as I rarely use underscore.

After doing research on this article I no longer end my identifiers with _t
as this is reserved by the POSIX standard.

The rule about any identifier ending with _t surprised me a lot. I think that is a POSIX standard (not sure yet) looking for clarification and official chapter and verse. This is from the GNU libtool manual, listing reserved names.

CesarB provided the following link to the POSIX 2004 reserved symbols and notes 'that many other reserved prefixes and suffixes ... can be found there'. The
POSIX 2008 reserved symbols are defined here. The restrictions are somewhat more nuanced than those above.

╰ゝ天使的微笑 2025-02-18 07:55:43

避免名称相撞的规则都在C ++标准中(请参阅Stroustrup Book),并由C ++ Gurus(Sutter等)提到。

个人规则

是因为我不想处理案件,并且想要一个简单的规则,所以我设计了一个简单而正确的个人:


命名符号时,如果您:
,您将避免与编译器/OS/标准库发生碰撞:

  • 永远不会启动使用下划线的符号
  • 切勿在内部有两个连续下强调的符号命名。

当然,将您的代码放在唯一的名称空间中也有助于避免碰撞(但不会防止邪恶的宏)

一些示例

(我使用宏,因为它们是C/C ++符号的更高代码污染,但这可能是从变量名到class名称的任何内容)

#define _WRONG
#define __WRONG_AGAIN
#define RIGHT_
#define WRONG__WRONG
#define RIGHT_RIGHT
#define RIGHT_x_RIGHT

n3242.pdf 文件(我希望最终的标准文本相似):

17.6.3.3.2全局名称[global.names]

某些名称和功能签名始终保留到实现:

- 每个包含双重下划线_ _或以下划线开始的名称,然后将大写字母(2.12)保留给实现。

- 以下划线开头的每个名称都保留到实现中,以用作全局名称空间中的名称。

但是也:

17.6.3.3.5用户定义的文字后缀[usrlit.suffix]

不以下划线开头的文字后缀标识符保留用于将来的标准化。

最后一个子句令人困惑,除非您考虑一个以一个下划线开头的名称,然后再说一个小写字母,如果在全球名称空间中定义了 not> not>

The rules to avoid collision of names are both in the C++ standard (see Stroustrup book) and mentioned by C++ gurus (Sutter, etc.).

Personal rule

Because I did not want to deal with cases, and wanted a simple rule, I have designed a personal one that is both simple and correct:


When naming a symbol, you will avoid collision with compiler/OS/standard libraries if you:

  • never start a symbol with an underscore
  • never name a symbol with two consecutive underscores inside.

Of course, putting your code in an unique namespace helps to avoid collision, too (but won't protect against evil macros)

Some examples

(I use macros because they are the more code-polluting of C/C++ symbols, but it could be anything from variable name to class name)

#define _WRONG
#define __WRONG_AGAIN
#define RIGHT_
#define WRONG__WRONG
#define RIGHT_RIGHT
#define RIGHT_x_RIGHT

Extracts from C++0x draft

From the n3242.pdf file (I expect the final standard text to be similar):

17.6.3.3.2 Global names [global.names]

Certain sets of names and function signatures are always reserved to the implementation:

— Each name that contains a double underscore _ _ or begins with an underscore followed by an uppercase letter (2.12) is reserved to the implementation for any use.

— Each name that begins with an underscore is reserved to the implementation for use as a name in the global namespace.

But also:

17.6.3.3.5 User-defined literal suffixes [usrlit.suffix]

Literal suffix identifiers that do not start with an underscore are reserved for future standardization.

This last clause is confusing, unless you consider that a name starting with one underscore and followed by a lowercase letter would be Ok if not defined in the global namespace...

扛刀软妹 2025-02-18 07:55:43

来自 msdn

在标识符开始时使用两个顺序下划线字符(__),或一个单一的领导下划线,然后是大写字母,用于所有范围的C ++实现。您应该避免使用一个领先的下划线下划线,然后是一个带有文件范围的名称的小写字母,因为可能与当前或将来的保留标识符发生冲突。

这意味着您可以将单个下划线作为成员变量前缀,只要接下来是下案字母。

这显然是从C ++标准的第17.4.3.1.2节中获取的,但是我找不到完整标准在线的原始来源。

另请参见这个问题

From MSDN:

Use of two sequential underscore characters ( __ ) at the beginning of an identifier, or a single leading underscore followed by a capital letter, is reserved for C++ implementations in all scopes. You should avoid using one leading underscore followed by a lowercase letter for names with file scope because of possible conflicts with current or future reserved identifiers.

This means that you can use a single underscore as a member variable prefix, as long as it's followed by a lower-case letter.

This is apparently taken from section 17.4.3.1.2 of the C++ standard, but I can't find an original source for the full standard online.

See also this question.

多彩岁月 2025-02-18 07:55:43

至于问题的另一部分,通常将变量名称的 end score放在变量名称的上,以与内部任何内容不冲突。

我甚至在课堂和名称空间内都这样做,因为我只需要记住一个规则(与“在全局范围中的名称的末尾,以及其他地方的名称的开始”)。

As for the other part of the question, it's common to put the underscore at the end of the variable name to not clash with anything internal.

I do this even inside classes and namespaces because I then only have to remember one rule (compared to "at the end of the name in global scope, and the beginning of the name everywhere else").

罪歌 2025-02-18 07:55:43

是的,可以在标识符中的任何地方使用下划线。我相信规则是:第一个字符中的AZ,AZ,_中的任何一个,以及以下字符的 +0-9。

下划线前缀在C代码中很常见 - 单个下划线表示“私有”,并且通常保留编译器的双重下划线。

Yes, underscores may be used anywhere in an identifier. I believe the rules are: any of a-z, A-Z, _ in the first character and those +0-9 for the following characters.

Underscore prefixes are common in C code -- a single underscore means "private", and double underscores are usually reserved for use by the compiler.

脸赞 2025-02-18 07:55:43

首先,当前工作草案中的规则在 [lex.name] p3 << /a>:

此外,某些标识符以令牌预处理 - token-token 保留为C ++实现,否则不得使用;无需诊断。

  • 每个包含双重下划线的标识符 __ 或以下划线开头,然后将大写字母保留给实现。
  • 以下划线开头的每个标识符都保留在实现中,以用作全局名称空间中的名称。

此外,标准库保留在名称空间std 和一些僵尸名称中定义的所有名称;请参阅 [Reserved.names.general]

Posix呢?

正如公认的答案所指出的那样,实现的其他部分,例如POSIX标准,它限制了您可以使用的标识符。

标题部分中描述的每个标识符都保留在同一名称空间中的文件范围的标识符中,如果包括标题。

任何标头 [储备] 后缀 _t

v2_chap02.html“ rel =“ nofollow noreferrer”> posix 2008标准,2.2.2

在C ++中,几乎所有与POSIX相关的问题都可以通过命名空间来避免。
这也是为什么C ++标准可以添加诸如 std :: enable_if_t 之类的符号,而不会破坏POSIX的兼容性。

可视化

int x;      // OK
int x_;     // OK
int _x;     // RESERVED
int x__;    // RESERVED (OK in C)
int __x;    // RESERVED
int _X;     // RESERVED
int assert; // RESERVED (macro name)
int x_t;    // RESERVED (only by POSIX)

namespace {
int y;      // OK
int y_;     // OK
int _y;     // OK
int y__;    // RESERVED (OK in C, ignoring namespaces)
int __y;    // RESERVED
int _Y;     // RESERVED
int assert; // RESERVED (macro name)
int y_t;    // OK
}

y 的上述规则适用于命名和未命名的名称空间。
无论哪种方式,在以下名称空间中,全局名称空间的规则
不再应用(请参阅 [namespace.unamed] )。

y 的上述规则还适用于类,功能等中的标识符。除了全球范围外。

即使断言在此处不像功能风格的宏一样使用,但该名称已保留。这也是为什么到目前为止取得了一些成功。

推荐的练习

要安全,始终避免双重下降,并始终避免使用领先的下划线。
在某些情况下,后者还可以,但是很难记住这些规则,而且要安全比后悔要好。

_ 本身呢?

有些人使用 _ 表示未使用某些变量或功能参数。但是,您可以避免以下内容:

void foo(T _) { /* ... */ }
// replace with:
void foo(T) { /* ... */ }

std::scoped_lock _{mutex};
// replace with:
std::scoped_lock lock{mutex};

您也可以施放参数 p to void 喜欢(void)p ,如果这是关于沉默警告关于 p 未使用,您需要C兼容性。请参阅为什么为什么将未使用的返回值投入到void?

Firstly, the rules in current working draft are laid out in [lex.name] p3:

In addition, some identifiers appearing as a token or preprocessing-token are reserved for use by C++ implementations and shall not be used otherwise; no diagnostic is required.

  • Each identifier that contains a double underscore __ or begins with an underscore followed by an uppercase letter is reserved to the implementation for any use.
  • Each identifier that begins with an underscore is reserved to the implementation for use as a name in the global namespace.

Furthermore, the standard library reserves all names defined in namespace std and some zombie names; see [reserved.names.general].

What about POSIX?

As the accepted answer has pointed out, there may be other parts of the implementation, like the POSIX standard, which limit the identifiers you can use.

Each identifier with file scope described in the header section is reserved for use as an identifier with file scope in the same name space if the header is included.

ANY Header [reserves] Suffix _t

- POSIX 2008 Standard, 2.2.2

In C++, almost all problems associated with POSIX can be avoided through namespaces.
This is also why the C++ standard can add tons of symbols like std::enable_if_t without breaking POSIX compatibility.

Visualization

int x;      // OK
int x_;     // OK
int _x;     // RESERVED
int x__;    // RESERVED (OK in C)
int __x;    // RESERVED
int _X;     // RESERVED
int assert; // RESERVED (macro name)
int x_t;    // RESERVED (only by POSIX)

namespace {
int y;      // OK
int y_;     // OK
int _y;     // OK
int y__;    // RESERVED (OK in C, ignoring namespaces)
int __y;    // RESERVED
int _Y;     // RESERVED
int assert; // RESERVED (macro name)
int y_t;    // OK
}

The above rules for y apply to both named and unnamed namespaces.
Either way, in the following namespace, the rules of the global namespace
no longer apply (see [namespace.unnamed]).

The above rules for y also apply to identifiers in classes, functions, etc.; anything but global scope.

Even though assert isn't used like a function-style macro here, the name is reserved. This is also why proposal P2884 contemplates making it a keyword in C++26, with some success so far.

Recommended Practice

To be safe, always avoid double underscores, and always avoid nams with leading underscores.
The latter are okay in some cases, but it's difficult to memorize these rules, and it's better to be safe than sorry.

What about _ in itself?

Some people use _ to indicate that some variable or function parameter isn't used. However, you can avoid this with:

void foo(T _) { /* ... */ }
// replace with:
void foo(T) { /* ... */ }

std::scoped_lock _{mutex};
// replace with:
std::scoped_lock lock{mutex};

You can also cast a parameter p to void like (void)p, if this is about silencing warnings about p being unused, and you need C compatibility. See Why cast unused return values to void?.

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文