如何使用 MSVC 内在函数来获得与此 GCC 代码等效的内容？

发布于 2024-07-09 10:03:58 字数 1101 浏览 10 评论 0原文

以下代码调用 GCC 中 clz/ctz 的内置函数，在其他系统上有 C 版本。显然，如果系统有内置的 clz/ctz 指令（如 x86 和 ARM），C 版本有点次优。

#ifdef __GNUC__
#define clz(x) __builtin_clz(x)
#define ctz(x) __builtin_ctz(x)
#else
static uint32_t ALWAYS_INLINE popcnt( uint32_t x )
{
    x -= ((x >> 1) & 0x55555555);
    x = (((x >> 2) & 0x33333333) + (x & 0x33333333));
    x = (((x >> 4) + x) & 0x0f0f0f0f);
    x += (x >> 8);
    x += (x >> 16);
    return x & 0x0000003f;
}
static uint32_t ALWAYS_INLINE clz( uint32_t x )
{
    x |= (x >> 1);
    x |= (x >> 2);
    x |= (x >> 4);
    x |= (x >> 8);
    x |= (x >> 16);
    return 32 - popcnt(x);
}
static uint32_t ALWAYS_INLINE ctz( uint32_t x )
{
    return popcnt((x & -x) - 1);
}

#endif

我需要调用哪些函数、需要包含哪些标头等才能在此处为 MSVC 添加正确的 ifdef？我已经看过此页面，但我并不完全了解确定 #pragma 的用途（是否必需？）以及它对编译的 MSVC 版本要求有何限制。作为一个不真正使用 MSVC 的人，我也不知道这些内在函数在其他体系结构上是否有 C 等效项，或者在 #defining 它们时是否也必须 #ifdef x86/x86_64。

原文

The following code calls the builtin functions for clz/ctz in GCC and, on other systems, has C versions. Obviously, the C versions are a bit suboptimal if the system has a builtin clz/ctz instruction, like x86 and ARM.

#ifdef __GNUC__
#define clz(x) __builtin_clz(x)
#define ctz(x) __builtin_ctz(x)
#else
static uint32_t ALWAYS_INLINE popcnt( uint32_t x )
{
    x -= ((x >> 1) & 0x55555555);
    x = (((x >> 2) & 0x33333333) + (x & 0x33333333));
    x = (((x >> 4) + x) & 0x0f0f0f0f);
    x += (x >> 8);
    x += (x >> 16);
    return x & 0x0000003f;
}
static uint32_t ALWAYS_INLINE clz( uint32_t x )
{
    x |= (x >> 1);
    x |= (x >> 2);
    x |= (x >> 4);
    x |= (x >> 8);
    x |= (x >> 16);
    return 32 - popcnt(x);
}
static uint32_t ALWAYS_INLINE ctz( uint32_t x )
{
    return popcnt((x & -x) - 1);
}

#endif

What functions do I need to call, which headers do I need to include, etc to add a proper ifdef for MSVC here? I've already looked at this page, but I'm not entirely sure what the #pragma is for (is it required?) and what restrictions it puts on MSVC version requirements for compilation. As someone who doesn't really use MSVC, I also don't know whether these intrinsics have C equivalents on other architectures, or whether I have to #ifdef x86/x86_64 as well when #defining them.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

过期情话 2024-07-16 10:03:59

有两个内在函数“_BitScanForward”和“_BitScanReverse”，它们适合 MSVC 的相同目的。包括。这些功能是：

#ifdef _MSC_VER
#include <intrin.h>

static uint32_t __inline ctz( uint32_t x )
{
   int r = 0;
   _BitScanReverse(&r, x);
   return r;
}

static uint32_t __inline clz( uint32_t x )
{
   int r = 0;
   _BitScanForward(&r, x);
   return r;
}
#endif

有等效的 64 位版本“_BitScanForward64”和“_BitScanReverse64”。

请在此处阅读更多信息：

MSDN 上的 x86 内在函数

There are two intrinsics "_BitScanForward" and "_BitScanReverse", which suits the same purpose for MSVC. Include . The functions are:

#ifdef _MSC_VER
#include <intrin.h>

static uint32_t __inline ctz( uint32_t x )
{
   int r = 0;
   _BitScanReverse(&r, x);
   return r;
}

static uint32_t __inline clz( uint32_t x )
{
   int r = 0;
   _BitScanForward(&r, x);
   return r;
}
#endif

There are equivalent 64bit versions "_BitScanForward64" and "_BitScanReverse64".

Read more here:

x86 Intrinsics on MSDN

回复收藏 0 原文

鹊巢 2024-07-16 10:03:58

从 sh0dan 代码中跳出来，应该像这样更正实现：

#ifdef _MSC_VER
#include <intrin.h>

uint32_t __inline ctz( uint32_t value )
{
    DWORD trailing_zero = 0;

    if ( _BitScanForward( &trailing_zero, value ) )
    {
        return trailing_zero;
    }
    else
    {
        // This is undefined, I better choose 32 than 0
        return 32;
    }
}

uint32_t __inline clz( uint32_t value )
{
    DWORD leading_zero = 0;

    if ( _BitScanReverse( &leading_zero, value ) )
    {
       return 31 - leading_zero;
    }
    else
    {
         // Same remarks as above
         return 32;
    }
}
#endif

正如代码中注释的那样，如果 value 为 0，则 ctz 和 clz 都是未定义的。在我们的抽象中，我们将 __builtin_clz(value) 修复为 (value?__builtin_clz(value):32) 但这是一个选择

Bouncing from sh0dan code, the implementation should be corrected like this :

#ifdef _MSC_VER
#include <intrin.h>

uint32_t __inline ctz( uint32_t value )
{
    DWORD trailing_zero = 0;

    if ( _BitScanForward( &trailing_zero, value ) )
    {
        return trailing_zero;
    }
    else
    {
        // This is undefined, I better choose 32 than 0
        return 32;
    }
}

uint32_t __inline clz( uint32_t value )
{
    DWORD leading_zero = 0;

    if ( _BitScanReverse( &leading_zero, value ) )
    {
       return 31 - leading_zero;
    }
    else
    {
         // Same remarks as above
         return 32;
    }
}
#endif

As commented in the code, both ctz and clz are undefined if value is 0. In our abstraction, we fixed __builtin_clz(value) as (value?__builtin_clz(value):32) but it's a choice

回复收藏 0 原文

月下凄凉 2024-07-16 10:03:58

MSVC 中 int __builtin_ctz (unsigned int x) 的等效函数是 unsigned int _tzcnt_u32 (unsigned int a)，用于 32 位 整数并返回尾随零的计数。对于64位使用unsigned __int64 _tzcnt_u64 (unsigned __int64 a) 1。
MSVC 中 int __builtin_clz (unsigned int x) 的等效函数是 unsigned int _lzcnt_u32 (unsigned int a)，用于 32 位 整数并返回前导零的计数。对于64位使用unsigned __int64 _lzcnt_u64 (unsigned __int64 a) 2

C++ 头文件：immintrin.h

回复收藏 0 原文

山色无中 2024-07-16 10:03:58

我在韩国网站 https://torbjorn.tistory.com/317 找到它
在msvc编译器中，可以使用__lzcnt(unsigned int)来替代gcc编译器中的__builtin_clz(unsigned int)。

回复收藏 0 原文

梦初启 2024-07-16 10:03:58

如果 MSVC 有一个用于此目的的编译器内在函数，它将位于此处：

Compiler Intrinsics on MSDN

否则，你必须使用 __asm 来编写它

回复收藏 0 原文

心是晴朗的。 2024-07-16 10:03:58

在 Linux 和 Windows (x86) 上测试：

#ifdef WIN32
    #include <intrin.h>
    static uint32_t __inline __builtin_clz(uint32_t x) {
        unsigned long r = 0;
        _BitScanReverse(&r, x);
        return (31-r);
    }
#endif

uint32_t clz64(const uint64_t x)
{
    uint32_t u32 = (x >> 32);
    uint32_t result = u32 ? __builtin_clz(u32) : 32;
    if (result == 32) {
        u32 = x & 0xFFFFFFFFUL;
        result += (u32 ? __builtin_clz(u32) : 32);
    }
    return result;
}

Tested on linux and windows (x86) :

#ifdef WIN32
    #include <intrin.h>
    static uint32_t __inline __builtin_clz(uint32_t x) {
        unsigned long r = 0;
        _BitScanReverse(&r, x);
        return (31-r);
    }
#endif

uint32_t clz64(const uint64_t x)
{
    uint32_t u32 = (x >> 32);
    uint32_t result = u32 ? __builtin_clz(u32) : 32;
    if (result == 32) {
        u32 = x & 0xFFFFFFFFUL;
        result += (u32 ? __builtin_clz(u32) : 32);
    }
    return result;
}

回复收藏 0 原文

~没有更多了~