当前位置：文江博客话题详情

高性能 C++多维数组

发布于 2024-09-27 22:30:41 字数 697 浏览 0 评论 0原文

我正在寻找有关 C++ 高性能多维数组库/类的建议。我真正需要的是：

动态分配数组的能力，其大小在运行时确定
访问和修改的能力单个数组值（快速）
能够使用简单的数组算术，例如array1 = array2 + 2 * array3< /p>
维护良好的库

我遇到过各种库，包括：

Blitz++，它看起来正是我需要的，但看起来不太好维护良好（最新的稳定版本是 5 年前）
Boost，它不支持数组运算，并且与 Blitz++ 相比似乎相当慢。
Jonn Bowman 的 array.h 没有文档。

还有人对上述选项有任何其他建议或意见吗？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

凉宸 2024-10-04 22:30:41

Eigen 维护得非常好（至少现在，有新版本每个月都会发布）并支持您需要的其他操作。

回复收藏 0 原文

何以笙箫默 2024-10-04 22:30:41

此处进行了一项广泛且相对较新的调查，包括基准。

我相信您可以通过将 Boost.UBlas 绑定到 LAPACK 或 Intel MKL 等底层数值库来加速 Boost.UBlas，但还没有这样做。

fwiw，最常出现的候选实现是 Boost.UBlas 和 MTL。根据我的经验，广泛采用更有可能促进持续的支持和发展。

回复收藏 0 原文

始终不够 2024-10-04 22:30:41

uBlas，Boost 的一部分。它提供完整的 BLAS 级别 1-3，因此提供大量数组算术函数。
Armadillo 似乎也是一个 C++ 线性代数库，据我所知，它可以选择使用 LAPACK/Atlas (这当然使其速度更快）。
GNU 科学库提供完整的 BLAS。我不知道它有多快，或者是否可以使用LAPACK/Atlas。
如果你不需要比你列出的内容更奇特的东西，你可以很容易地自己包装，例如 Atlas 的 BLAS。但如果没有必要，您可能不想重新发明轮子。

回复收藏 0 原文

丶情人眼里出诗心の 2024-10-04 22:30:41

Necomi 似乎提供了您想要的功能。

它支持任意多维数字，其维度可以在运行时固定，提供对单个元素的快速访问，同时还支持算术（除其他外）表达式。

回复收藏 0 原文

烟花肆意 2024-10-04 22:30:41

还有另一个无耻的自我推销，

https://github.com/dwwork/FortCpp/

我已经在 GitHub 上发布了我个人对此问题的解决方案。
无论如何，我都不是 C++ 专家，但我想我至少应该把它扔掉。

回复收藏 0 原文

不醒的梦 2024-10-04 22:30:41

也许你想尝试我的“Multi”库： https://gitlab.com/correaa/boost- multi

动态分配数组的能力，其大小在运行时确定

    multi::array<int, 2> A({m, n});

访问和修改单个数组值的能力（快速）

    A[i][j] += 42;

生成类似于 A.base() + i*stride_1 + j 的机器代码*stride_2。 https://gitlab.com/correaa/ boost-multi#whats-up-with-the-multiple-bracket-notation

能够使用简单的数组算术，例如 array1 = array2 + 2 * array3

嗯，不是那样的，我决定将数组算术与数据结构分开。

话虽如此，该库与 STL 算法非常兼容（如果您的步骤允许，它有一个使用 BLAS 的适配器）。

过度简化有关访问模式的一些问题，......

template<class T, class X, class Y>
auto axpy(T alpha, X const& x, Y&& y) -> Y&& {
    assert( extensions(x) == extensions(y) );
    std::transform(
        x.elements().begin(), x.elements().end(),
        y.elements().begin(), 
        y.elements().begin(),
        [&](auto const& ex, auto& ey) {return alpha*x + ey;}
    );
    return std::forward<Y>(y);
}
...

auto array1 = axpy(2, array2, +array3);  // array1 = 2*array2 + array3;  // unary + will make a modifiable copy before it starts.

一个维护良好的库，

我将尽我所能维护它，我也欢迎贡献者。

Maybe you would like to try my "Multi" library: https://gitlab.com/correaa/boost-multi

the ability to dynamically allocate arrays with a size determined at run-time

    multi::array<int, 2> A({m, n});

the ability to access and modify single array values (fast)

    A[i][j] += 42;

Generates machine code similar to A.base() + i*stride_1 + j*stride_2. https://gitlab.com/correaa/boost-multi#whats-up-with-the-multiple-bracket-notation

to be able to use simple array arithmetic such as array1 = array2 + 2 * array3

Well, not like that, I decided to keep array arithmetic separate from the data structure.

Having said that, the library is very compatible with STL algorithms (and it has an adaptor to use BLAS if your strides permit).

Oversimplifying some issues about access patterns, ...

template<class T, class X, class Y>
auto axpy(T alpha, X const& x, Y&& y) -> Y&& {
    assert( extensions(x) == extensions(y) );
    std::transform(
        x.elements().begin(), x.elements().end(),
        y.elements().begin(), 
        y.elements().begin(),
        [&](auto const& ex, auto& ey) {return alpha*x + ey;}
    );
    return std::forward<Y>(y);
}
...

auto array1 = axpy(2, array2, +array3);  // array1 = 2*array2 + array3;  // unary + will make a modifiable copy before it starts.

a well-maintained library

I will maintain it as long as I could, I also welcome contributors.

回复收藏 0 原文

回首观望 2024-10-04 22:30:41

也许存在诸如 BLAS、CBLAS 之类的库，但不记得在哪里。

http://www.netlib.org/blas/

回复收藏 0 原文

自在安然 2024-10-04 22:30:41

从性能角度来看，我尝试过 boost::MultiArray 和 Armadillo。两者都不是很快，因为与数组或向量相比，两者的访问时间都很慢，而且我能够在 x1(4:10) = x2(1:6) + x2(2:7) 等操作中击败这些包+ x2(3:8) 通过使用简单的手工编码循环（我确信在我的编译器优化的帮助下）。当您进行矩阵乘法时，这些软件包可能会通过 LAPACK 和 BLAS 提供一些好处，但您始终可以自己使用这些接口。

回复收藏 0 原文