Aarch64霓虹灯和SVE的软件优化指南
有ARM软件优化指南(例如, https://developer.arm.arm.arm.arm.com/documentation/documentation/documentation/swog309707070707070707070707070707070707070707070707070707070707070707070707070707070707070707070707070707070707070707070707070707070707 /最新对于Neoverse N1)。
本指南似乎不包含霓虹灯或SVE的延迟和吞吐量。是否有单独的霓虹灯或SVE指南(例如, insr(simd& fp scalar)
指令< /a>)?
指针将非常有帮助!
There is ARM software optimization guide (e.g., https://developer.arm.com/documentation/swog309707/latest for neoverse n1).
This guide doesn't seem to contain the latency and throughput for Neon or SVE. Is there a separate guide for NEON or SVE (e.g., the instruction latency and throughput for INSR (SIMD&FP scalar)
instruction)?
A pointer would be very helpful!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
霓虹灯说明的时间在该文档中,在Asimd下列出(这是该指令集的ARM更正式名称)。请参阅第3.15节。
SVE指令没有时间安排,因为据我了解,N1根本不支持该扩展名。但是,如果您查看指南的某些核心确实支持SVE,则会看到其中包括的时间。对于 neoverse n2 他们来自第3.26节。
The timings for Neon instructions are in that document, listed under ASIMD (which is Arm's more formal name for that instruction set). See Sections 3.15 onward.
There are no timings for SVE instructions because, as I understand it, the N1 simply doesn't support that extension. But if you look at the guide for some core that does support SVE, you'll see the timings included. For the Neoverse N2 they are from Section 3.26 onward.