Illegal instruction when running basic AVX512 code

Posted 2025-01-17 14:44:13

I am trying to learn AVX instructions, and when I run some basic code I receive

Illegal instruction (core dumped)

The code is below; I am compiling it with

g++ -mavx512f 1.cpp

What exactly is the problem, and how can I overcome it?
Thank you!

#include <immintrin.h>
#include<iostream>
using namespace std;

void add(const float a[], const float b[], float res[], int n)
{
    int i = 0;

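    // Each __m512 holds 16 floats, so step by 16 and stop after the last full vector.
    for(; i + 16 <= n ; i+=16 )
    {
        __m512 x = _mm512_loadu_ps( &a[i] );
        __m512 y = _mm512_loadu_ps( &b[i] );

        __m512 z = _mm512_add_ps(x,y);
        // Unaligned store: _mm512_stream_ps requires a 64-byte-aligned address.
        _mm512_storeu_ps(&res[i],z);
    }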

    for(; i<n; i++) res[i] = a[i] + b[i];
}

int main()
{
    int n = 100000;
    float a[n], b[n], res[n];
    for(int i = 0;i < n; i++)
    {
        a[i] = i;
        b[i] = i+10;
    }
    add(a,b,res,n);
    for(int i=0;i<n;i++) cout<<res[i]<<" ";
    cout<<endl;
    return 0;
}

恏ㄋ傷疤忘ㄋ疼 2025-01-24 14:44:13

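Your CPU probably doesn't support AVX-512 at all.
Only these recent generations of CPUs support AVX-512:

Wikipedia has a nice table (including a breakdown of features such as AVX512VBMI or FP16).
Zen 4 (and later).
Server/workstation: Skylake-server ("Xeon Scalable Performance") and later.
Client: Ice Lake (e.g. 1035G4) and Rocket Lake desktop (plus one very limited release; see Footnote 1).

Not Alder Lake (12th gen): Intel regressed its AVX-512 support there and is actively stopping people from using the AVX-512 hardware that is present in the silicon and was originally usable with the E-cores disabled.

Intel client CPUs will eventually be able to take advantage of the AVX-512 hardware in their silicon again through AVX10, which lets them expose the 256-bit vector-width subset of AVX-512 with all the juicy features: masking, vpternlogd, better shuffles, 32 vector registers, broadcast memory-source operands, and so on. It was announced in 2023, with no word on when client CPUs will support it (that would require implementing it on the E-cores). I thought a microcode update could report AVX10.1/512 support on existing AVX-512 CPUs, since it doesn't add anything new in the EVEX prefix (such as FP rounding-mode overrides at vector widths other than 512-bit), but apparently Granite Rapids will be the first to support AVX10.1.

Xeon Phi compute cards, 2nd generation and later (Knight's Landing).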

Compiler options

Use clang or g++ -O3 -march=native to enable everything your CPU supports.

If you get compile errors (like undeclared function _mm512_loadu_ps), your CPU does not support AVX512 so g++ didn't enable it. (Or whatever other CPU feature you're trying to use.)
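One way to see what the compiler actually enabled is to test the predefined macro `__AVX512F__`, which GCC and clang define when AVX-512F code generation is turned on (by `-mavx512f` or a `-march=` setting that implies it). The sketch below is only an illustration; the helper name `compiled_with_avx512f` is made up here and is not part of any library.

```cpp
// Hypothetical helper: reports whether this translation unit was compiled
// with AVX-512F code generation enabled (-mavx512f, or a -march= implying it).
// It describes the compiler options only, not the CPU the program runs on.
constexpr bool compiled_with_avx512f()
{
#ifdef __AVX512F__
    return true;
#else
    return false;
#endif
}
```

Printing the result from `main` after building once with `-march=native` and once with `-mavx512f` shows the difference: on a CPU without AVX-512 the first build reports false, while the second reports true and may then crash with an illegal instruction as soon as an AVX-512 instruction executes.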

immintrin.h would still define the intrinsic, with __attribute__((always_inline, target("avx512f"))). So it's required to inline, but can only do so into functions that are themselves using __attribute__((target("avx512f"))) or a similar pragma, or command line options. That's why the error message is about inlining failed for an always_inline function (the intrinsic wrapper around the __builtin_ia32_...) into a function with incompatible target options.

Only use separate -mavx512f and -mtune= options if you want to make a binary for other CPUs, not just the machine you're compiling on. Related:

What exactly do the gcc compiler switches (-mavx -mavx2 -mavx512f) do?
The Effect of Architecture When Using SSE / AVX Intrinisics (MSVC and classic ICC are different: you can use intrinsics without telling the compiler it can use those ISA extensions, so even within one function their optimizer has to be careful with code-motion out of branches.)
error: inlining failed to call always_inline

Coding on insufficient hardware
Intel AVX intrinsics: any compatibility library out? (emulate at compile time instead of runtime).

MSVC and ICC do let you use intrinsics without telling the compiler the target supports them, so this method of checking your code against the CPU doesn't work with those compilers. They'll happily let you compile code that won't run on the current CPU. (Because MSVC assumes you're going to do runtime CPU detection and dispatching, instead of distributing source code for everyone to optimize for their own machine.)
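With GCC or clang, the runtime CPU detection and dispatching mentioned above can be combined with the target attribute described earlier. The following is a minimal sketch under those assumptions, not a definitive implementation: the names `add_scalar`, `add_avx512`, and `add_dispatch` are invented for illustration, and `__builtin_cpu_supports` is a GCC/clang builtin that MSVC does not provide.

```cpp
#include <cstddef>

#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
#include <immintrin.h>
#endif

// Scalar fallback that runs on any CPU.
static void add_scalar(const float* a, const float* b, float* res, std::size_t n)
{
    for (std::size_t i = 0; i < n; ++i) res[i] = a[i] + b[i];
}

#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
// The target attribute lets this one function use AVX-512F intrinsics even
// when the file is compiled without -mavx512f or -march=native.
static __attribute__((target("avx512f")))
void add_avx512(const float* a, const float* b, float* res, std::size_t n)
{
    std::size_t i = 0;
    for (; i + 16 <= n; i += 16) {            // 16 floats per 512-bit vector
        __m512 x = _mm512_loadu_ps(a + i);
        __m512 y = _mm512_loadu_ps(b + i);
        _mm512_storeu_ps(res + i, _mm512_add_ps(x, y));
    }
    for (; i < n; ++i) res[i] = a[i] + b[i];  // leftover elements
}
#endif

// Hypothetical dispatcher: calls the AVX-512 version only when the running
// CPU reports AVX-512F support, and the scalar version otherwise.
void add_dispatch(const float* a, const float* b, float* res, std::size_t n)
{
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
    if (__builtin_cpu_supports("avx512f")) {
        add_avx512(a, b, res, n);
        return;
    }
#endif
    add_scalar(a, b, res, n);
}
```

Built with plain `g++ -O2 1.cpp`, a program calling `add_dispatch(a, b, res, n)` should run on machines with or without AVX-512, because the AVX-512 instructions sit in a function that is only reached after the check succeeds.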

More about CPUs without AVX-512

Intel processor name/number meanings

Skylake-client does not have AVX-512, only Skylake-server.
Intel Alder Lake hybrid (big.LITTLE) CPUs won't have AVX-512, only AVX2 even on the big cores.
Low-power CPUs like Silvermont / Tremont don't even have AVX1, until Gracemont (Alder Lake E-cores).

Also note, there are multiple extensions to AVX-512, like AVX-512VPOPCNTDQ that introduces SIMD instructions to count set bits in each SIMD element. Check Wikipedia's CPUs with AVX-512 table to see which CPU has what. AVX-512F is the "foundation", and AVX-512VL allows using cool new instructions on 128 and 256-bit vectors.
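To check which of these extensions the machine you are running on reports, a recent GCC or clang can query them one by one with `__builtin_cpu_supports`. This is a rough sketch: the helper `avx512_features` is a made-up name, and the five features tested are an arbitrary, non-exhaustive selection.

```cpp
#include <string>

// Hypothetical helper: returns a space-separated list of the AVX-512 feature
// names (from a small, non-exhaustive selection) that the running CPU reports.
// On non-x86 targets or other compilers it returns an empty string.
std::string avx512_features()
{
    std::string out;
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
    // __builtin_cpu_supports needs a string literal, hence the macro.
#define APPEND_IF_SUPPORTED(name) \
    if (__builtin_cpu_supports(name)) { out += name; out += ' '; }
    APPEND_IF_SUPPORTED("avx512f")
    APPEND_IF_SUPPORTED("avx512vl")
    APPEND_IF_SUPPORTED("avx512bw")
    APPEND_IF_SUPPORTED("avx512dq")
    APPEND_IF_SUPPORTED("avx512vpopcntdq")
#undef APPEND_IF_SUPPORTED
#endif
    return out;
}
```

An empty result on an x86 machine means none of the listed extensions were reported, which matches the "Illegal instruction" symptom in the question.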

Footnote 1: Pentium/Celeron versions of older Intel CPUs don't even have AVX, just SSE4.2. (Also lacking BMI1/2 because they disabled decoding of VEX prefixes).
