Why doesn't the C++ compiler eliminate the null check on a pointer returned by new?

Posted 2024-12-17 09:09:52, 723 words, 2 views, 0 comments

Recently I ran the following code on ideone.com (gcc-4.3.4)

#include <stddef.h>
#include <stdio.h>
#include <stdlib.h>
#include <new>

using namespace std;

void* operator new( size_t size ) throw(std::bad_alloc)
{
     void* ptr = malloc( 2 * 1024 * 1024 * 1024);
     printf( "%p\n", ptr );
     return ptr;
}

void operator delete( void* ptr )
{
    free( ptr );
}

int main()
{
    char* ptr = new char;
    if( ptr == 0 ) {
        printf( "unreachable\n" );
    }
    delete ptr;
}

and got this output:

(nil)
unreachable

even though new should never return a null pointer, so the caller can count on that, and the compiler could have eliminated the ptr == 0 check and treated the dependent code as unreachable.

Why would the compiler not eliminate that code? Is it just a missed optimization or is there some other reason for that?

执手闯天涯 2024-12-24 09:09:52

I think this is very simple and you have two fundamentally different things confused:

malloc() can return anything, in particular zero.
the global C++ allocation function void * operator new(size_t) throw(std::bad_alloc) is required by the standard to either return a pointer to the required amount of storage (+ suitably aligned), or otherwise exit through an exception.

If you want to replace the global allocation function, it is your responsibility to provide a replacement that abides by the rules of the standard. The simplest version looks like this:

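#include <cstdlib>  // std::malloc, NULL
#include <new>      // std::bad_alloc

// throw(std::bad_alloc) is the C++03 spelling; dynamic exception specifications
// were deprecated in C++11 and removed in C++17, so omit it there.
void * operator new(std::size_t n) throw(std::bad_alloc) {
  void * const p = std::malloc(n);
  if (p == NULL) throw std::bad_alloc();  // never hand a null pointer to the caller
  return p;
}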

Any serious implementation should actually contain a loop to call the registered new-handler until the allocation succeeds, and only throw once there are no more new-handlers.
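
The new-handler loop mentioned above could look roughly like the following sketch. The names allocate_or_throw and default_raw_alloc are hypothetical, and the raw allocator is passed in as a parameter only so the loop can be exercised with an allocator that fails on purpose. std::get_new_handler requires C++11.

```cpp
#include <cstddef>
#include <cstdlib>
#include <new>

// Hypothetical default allocator: a thin wrapper around std::malloc.
void* default_raw_alloc(std::size_t n) { return std::malloc(n); }

// Hypothetical helper: return storage for n bytes or throw std::bad_alloc.
// On failure it calls the currently installed new-handler and retries;
// it throws only when no handler is installed.
void* allocate_or_throw(std::size_t n,
                        void* (*raw_alloc)(std::size_t) = default_raw_alloc) {
    if (n == 0) n = 1;  // malloc(0) is allowed to return a null pointer
    for (;;) {
        if (void* p = raw_alloc(n)) return p;
        std::new_handler handler = std::get_new_handler();  // C++11
        if (!handler) throw std::bad_alloc();
        handler();  // may free memory, install another handler, or throw
    }
}
```

A replacement operator new(std::size_t) would then simply return allocate_or_throw(n), paired with an operator delete that calls std::free.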

The program that you wrote simply has undefined behavior: its replacement operator new can return a null pointer, which the standard forbids.

Digression: Why is this new defined that way? Consider the standard allocation sequence when you say T * p = ::new T();. It is equivalent to this:

void * addr = ::operator new(sizeof(T));  // allocation
T * p = ::new (addr) T();                 // construction

If the second line throws (i.e. construction fails), the memory is deallocated with the corresponding deallocation function. If the first call fails, though, then the execution must never reach the second line! The only way to achieve this is by exiting through an exception. (The no-throw versions of the allocation functions are only for manual use where the user code can inspect the result of the allocator before proceeding to construction.)
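
To make the sequence above observable, here is a small sketch with a hypothetical type Widget whose constructor always throws. It uses class-specific allocation functions that only count calls, so the global operator new is left untouched.

```cpp
#include <cstddef>
#include <cstdlib>
#include <new>
#include <stdexcept>

// Hypothetical type whose constructor always fails.
struct Widget {
    static int allocations;
    static int deallocations;

    Widget() { throw std::runtime_error("construction failed"); }

    // Class-specific allocation function: returns non-null or throws.
    static void* operator new(std::size_t n) {
        void* p = std::malloc(n);
        if (!p) throw std::bad_alloc();
        ++allocations;
        return p;
    }

    // Matching deallocation function, called when the constructor throws.
    static void operator delete(void* p) noexcept {
        if (p) ++deallocations;
        std::free(p);
    }
};

int Widget::allocations = 0;
int Widget::deallocations = 0;

// Returns true if `new Widget` exited through the constructor's exception.
bool try_make_widget() {
    try {
        Widget* w = new Widget;
        delete w;  // not reached: the constructor throws first
        return false;
    } catch (const std::runtime_error&) {
        return true;
    }
}
```

After one call to try_make_widget(), both counters are 1: the storage was obtained, construction failed, and the new-expression released the storage before the exception propagated.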

谁把谁当真 2024-12-24 09:09:52

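C++11 is clear on the issue:

void* operator new(std::size_t size); ... 3 Required behavior: Return a non-null pointer to suitably aligned storage (3.7.4), or else throw a bad_alloc exception. This requirement is binding on a replacement version of this function.

You have hit undefined behavior.

[edit]
Now, why does this hinder the optimization? Compiler vendors tend to spend their time on optimizations for code patterns that are commonly used. There is usually little benefit for them in making undefined behavior run faster. (Some UB may be well-defined on a particular compiler and still be optimized, but the example above most likely is not.)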

玩物 2024-12-24 09:09:52

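I think you are expecting far too much of the optimizer. By the time the optimizer gets to this code, it considers new char to be just another function call whose return value is stored on the stack. So it does not see the if condition as deserving special treatment.

This is probably triggered by the fact that you overrode operator new. It is beyond the pay grade of the optimizer to look in there, see that you called malloc, which can return NULL, and decide that this overridden version will never return NULL. malloc looks like just another function call. Who knows? You might be linking in your own version of that, too.

There are a few other examples in C++ of overloaded operators changing their behavior: operator &&, operator || and operator ,. Each of these has special behavior when it is not overloaded, but behaves like an ordinary function call when it is. For example, the built-in && does not evaluate its right-hand side at all if the left-hand side evaluates to false. If it is overloaded, however, both sides are evaluated before they are passed to operator &&; the short-circuit feature goes away completely. (This is done to support using operator overloading to define mini-languages in C++; for one example, see the Boost Spirit library.)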
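
The loss of short-circuiting can be seen in a small sketch. The type Flag and the helper functions below are hypothetical and exist only to record which operands were evaluated.

```cpp
#include <string>

// Hypothetical wrapper type with an overloaded operator&&.
struct Flag {
    bool value;
};

bool operator&&(const Flag& lhs, const Flag& rhs) {
    return lhs.value && rhs.value;
}

// Each helper appends its name to the log when it is evaluated.
bool builtin_operand(std::string& log, const char* name, bool v) {
    log += name;
    return v;
}

Flag overloaded_operand(std::string& log, const char* name, bool v) {
    log += name;
    return Flag{v};
}

// Built-in &&: the right operand is skipped because the left one is false.
std::string trace_builtin() {
    std::string log;
    (void)(builtin_operand(log, "L", false) && builtin_operand(log, "R", true));
    return log;
}

// Overloaded &&: both operands are evaluated before operator&& is called.
std::string trace_overloaded() {
    std::string log;
    (void)(overloaded_operand(log, "L", false) && overloaded_operand(log, "R", true));
    return log;
}
```

trace_builtin() records only the left operand, while trace_overloaded() records both. Before C++17 the order in which the two overloaded operands are evaluated is unspecified, so only the number of evaluations should be relied on.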

叹倦 2024-12-24 09:09:52

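Why should the compiler do that?

With an opaque implementation of new, it is impossible to know whether the implementation is correct. Yours is non-conforming, so you are lucky that the check was kept after all.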

红玫瑰 2024-12-24 09:09:52

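There is more than one operator new. Also, you did not declare yours as possibly throwing an exception, so the compiler should not infer that it never returns a null pointer.

I do not know the latest C++11 standard very well, but I guess that only the standard-defined operator new (the one that throws an exception), and not a user-defined one, is supposed to return a non-null pointer.

In the current GCC trunk, the file libstdc++-v3/libsupc++/new does not seem to contain any specific attribute telling GCC that null is never returned... even though I believe that getting a null pointer from the throwing new is undefined behavior.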
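
As a sketch of how the overloads differ: the non-throwing overload reports failure by returning a null pointer, so a null check after it is meaningful, whereas the throwing overload must either return non-null or throw. The helper names try_allocate and release are hypothetical.

```cpp
#include <cstddef>
#include <new>

// Hypothetical helper: request n bytes through the non-throwing overload.
// On failure it returns a null pointer instead of throwing std::bad_alloc.
void* try_allocate(std::size_t n) {
    return ::operator new(n, std::nothrow);
}

// Releases storage obtained from try_allocate; a null pointer is accepted.
void release(void* p) {
    ::operator delete(p);
}
```

A caller would write something like `if (void* p = try_allocate(64)) { /* use p */ release(p); }`, and the compiler has to keep that check.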

茶色山野 2024-12-24 09:09:52

Clang does the optimization you expected:

cccc@~/workspace/tmp$ clang++ --version
Apple clang version 13.1.6 (clang-1316.0.21.2.3)
Target: x86_64-apple-darwin21.4.0
Thread model: posix
InstalledDir: /Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin
cccc@~/workspace/tmp$ clang++ test.cc -std=c++11
test.cc:10:40: warning: overflow in expression; result is -2147483648 with type 'int' [-Winteger-overflow]
    void *ptr = malloc(2 * 1024 * 1024 * 1024);
                                       ^
test.cc:15:6: warning: function previously declared with an explicit exception specification redeclared with an implicit exception specification [-Wimplicit-exception-spec-mismatch]
void operator delete(void *ptr) { free(ptr); }
     ^
/Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/new:182:36: note: previous declaration is here
_LIBCPP_OVERRIDABLE_FUNC_VIS void  operator delete(void* __p) _NOEXCEPT;
                                   ^
2 warnings generated.
cccc@~/workspace/tmp$ ./a.out 
0x0
unreachable
cccc@~/workspace/tmp$ clang++ test.cc -std=c++11 -O3
test.cc:10:40: warning: overflow in expression; result is -2147483648 with type 'int' [-Winteger-overflow]
    void *ptr = malloc(2 * 1024 * 1024 * 1024);
                                       ^
test.cc:15:6: warning: function previously declared with an explicit exception specification redeclared with an implicit exception specification [-Wimplicit-exception-spec-mismatch]
void operator delete(void *ptr) { free(ptr); }
     ^
/Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/c++/v1/new:182:36: note: previous declaration is here
_LIBCPP_OVERRIDABLE_FUNC_VIS void  operator delete(void* __p) _NOEXCEPT;
                                   ^
2 warnings generated.
cccc@~/workspace/tmp$ ./a.out 
cccc@~/workspace/tmp$
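
A side note on the first warning in the transcript: 2 * 1024 * 1024 * 1024 is evaluated in int and overflows, so the question's malloc call does not actually ask for 2 GiB. One way to express the intended size, assuming std::size_t is at least 32 bits wide, is to do the arithmetic in std::size_t. The helper name two_gib is hypothetical.

```cpp
#include <cstddef>

// Hypothetical helper: 2 GiB as a std::size_t, computed without int overflow.
// Assumes std::size_t is at least 32 bits wide.
constexpr std::size_t two_gib() {
    return static_cast<std::size_t>(2) * 1024 * 1024 * 1024;
}
```

Casting the first factor is enough, because the remaining multiplications are then carried out in std::size_t.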
