如何使用 MSVC 内在函数来获得与此 GCC 代码等效的内容?
以下代码调用 GCC 中 clz/ctz 的内置函数,在其他系统上有 C 版本。 显然,如果系统有内置的 clz/ctz 指令(如 x86 和 ARM),C 版本有点次优。
#ifdef __GNUC__
#define clz(x) __builtin_clz(x)
#define ctz(x) __builtin_ctz(x)
#else
static uint32_t ALWAYS_INLINE popcnt( uint32_t x )
{
x -= ((x >> 1) & 0x55555555);
x = (((x >> 2) & 0x33333333) + (x & 0x33333333));
x = (((x >> 4) + x) & 0x0f0f0f0f);
x += (x >> 8);
x += (x >> 16);
return x & 0x0000003f;
}
static uint32_t ALWAYS_INLINE clz( uint32_t x )
{
x |= (x >> 1);
x |= (x >> 2);
x |= (x >> 4);
x |= (x >> 8);
x |= (x >> 16);
return 32 - popcnt(x);
}
static uint32_t ALWAYS_INLINE ctz( uint32_t x )
{
return popcnt((x & -x) - 1);
}
#endif
我需要调用哪些函数、需要包含哪些标头等才能在此处为 MSVC 添加正确的 ifdef? 我已经看过此页面,但我并不完全了解确定 #pragma 的用途(是否必需?)以及它对编译的 MSVC 版本要求有何限制。 作为一个不真正使用 MSVC 的人,我也不知道这些内在函数在其他体系结构上是否有 C 等效项,或者在 #defining 它们时是否也必须 #ifdef x86/x86_64。
The following code calls the builtin functions for clz/ctz in GCC and, on other systems, has C versions. Obviously, the C versions are a bit suboptimal if the system has a builtin clz/ctz instruction, like x86 and ARM.
#ifdef __GNUC__
#define clz(x) __builtin_clz(x)
#define ctz(x) __builtin_ctz(x)
#else
static uint32_t ALWAYS_INLINE popcnt( uint32_t x )
{
x -= ((x >> 1) & 0x55555555);
x = (((x >> 2) & 0x33333333) + (x & 0x33333333));
x = (((x >> 4) + x) & 0x0f0f0f0f);
x += (x >> 8);
x += (x >> 16);
return x & 0x0000003f;
}
static uint32_t ALWAYS_INLINE clz( uint32_t x )
{
x |= (x >> 1);
x |= (x >> 2);
x |= (x >> 4);
x |= (x >> 8);
x |= (x >> 16);
return 32 - popcnt(x);
}
static uint32_t ALWAYS_INLINE ctz( uint32_t x )
{
return popcnt((x & -x) - 1);
}
#endif
What functions do I need to call, which headers do I need to include, etc to add a proper ifdef for MSVC here? I've already looked at this page, but I'm not entirely sure what the #pragma is for (is it required?) and what restrictions it puts on MSVC version requirements for compilation. As someone who doesn't really use MSVC, I also don't know whether these intrinsics have C equivalents on other architectures, or whether I have to #ifdef x86/x86_64 as well when #defining them.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(6)
有两个内在函数“_BitScanForward”和“_BitScanReverse”,它们适合 MSVC 的相同目的。 包括 。 这些功能是:
有等效的 64 位版本“_BitScanForward64”和“_BitScanReverse64”。
请在此处阅读更多信息:
MSDN 上的 x86 内在函数
There are two intrinsics "_BitScanForward" and "_BitScanReverse", which suits the same purpose for MSVC. Include . The functions are:
There are equivalent 64bit versions "_BitScanForward64" and "_BitScanReverse64".
Read more here:
x86 Intrinsics on MSDN
从 sh0dan 代码中跳出来,应该像这样更正实现:
正如代码中注释的那样,如果 value 为 0,则 ctz 和 clz 都是未定义的。在我们的抽象中,我们将
__builtin_clz(value)
修复为(value?__builtin_clz(value):32)
但这是一个选择Bouncing from sh0dan code, the implementation should be corrected like this :
As commented in the code, both ctz and clz are undefined if value is 0. In our abstraction, we fixed
__builtin_clz(value)
as(value?__builtin_clz(value):32)
but it's a choiceMSVC 中 int __builtin_ctz (unsigned int x) 的等效函数是 unsigned int _tzcnt_u32 (unsigned int a),用于 32 位 整数并返回尾随零的计数。 对于64位使用unsigned __int64 _tzcnt_u64 (unsigned __int64 a) 1。
MSVC 中 int __builtin_clz (unsigned int x) 的等效函数是 unsigned int _lzcnt_u32 (unsigned int a),用于 32 位 整数并返回前导零的计数。 对于64位使用unsigned __int64 _lzcnt_u64 (unsigned __int64 a) 2
C++ 头文件:immintrin.h
The equivalent function for int __builtin_ctz (unsigned int x) in MSVC is unsigned int _tzcnt_u32 (unsigned int a) for 32 bit integer and returns count of trailing zeros. For 64 bit use unsigned __int64 _tzcnt_u64 (unsigned __int64 a) 1.
The equivalent function for int __builtin_clz (unsigned int x) in MSVC is unsigned int _lzcnt_u32 (unsigned int a) for 32 bit integer and returns count of leading zeros. For 64 bit use unsigned __int64 _lzcnt_u64 (unsigned __int64 a) 2
C++ Header: immintrin.h
我在韩国网站 https://torbjorn.tistory.com/317 找到它
在msvc编译器中,可以使用
__lzcnt(unsigned int)
来替代gcc编译器中的__builtin_clz(unsigned int)
。I find it in a korean website https://torbjorn.tistory.com/317
In msvc compiler, you can use
__lzcnt(unsigned int)
to replace__builtin_clz(unsigned int)
in gcc compiler.如果 MSVC 有一个用于此目的的编译器内在函数,它将位于此处:
Compiler Intrinsics on MSDN
否则,你必须使用 __asm 来编写它
If MSVC has a compiler intrinsic for this, it'll be here:
Compiler Intrinsics on MSDN
Otherwise, you'll have to write it using __asm
在 Linux 和 Windows (x86) 上测试:
Tested on linux and windows (x86) :