C动态分配速度问题

发布于 2024-12-04 07:51:13 字数 743 浏览 1 评论 0原文

我使用这段代码动态创建一个二维数组：

char **FileTables;
int rows = 1000;
int i;

FileTables = (char**)malloc(rows * sizeof(char));
for (i = 0; i < rows; i++) {
    FileTables[i] = (char*)malloc(256 * sizeof(char));
}

问题是有 1000 行，而且可能还有更多，分配所有内存需要几秒钟。有没有更快/更好的方法来做到这一点？

编辑：除了明显更简单的代码之外，使用其中一种方法相对于另一种方法是否还有优势？

char **FileTables;
int rows = 1000;
int i;

FileTables = malloc(rows * sizeof(char*));
FileTables[0] = malloc(rows * 256 * sizeof(char));
for (i = 0; i < rows; i++) {
    FileTables[i] = FileTables[0] + i * 256;
}

而且..

char (*FileTables)[256];
int rows = 1000;

FileTables = malloc(rows * sizeof(*FileTables));

（是的，我修复了不必要的演员）

原文

I'm using this code to dynamically create a 2d array:

char **FileTables;
int rows = 1000;
int i;

FileTables = (char**)malloc(rows * sizeof(char));
for (i = 0; i < rows; i++) {
    FileTables[i] = (char*)malloc(256 * sizeof(char));
}

Problem is with 1000 rows, and there could be more, it takes a couple of seconds to allocate all the memory.
Is there any faster/better method to doing this?

EDIT:
Is there an advantage to using one of these methods over the other, besides the obvious simpler code?

char **FileTables;
int rows = 1000;
int i;

FileTables = malloc(rows * sizeof(char*));
FileTables[0] = malloc(rows * 256 * sizeof(char));
for (i = 0; i < rows; i++) {
    FileTables[i] = FileTables[0] + i * 256;
}

And..

char (*FileTables)[256];
int rows = 1000;

FileTables = malloc(rows * sizeof(*FileTables));

(And yes, I fixed the unnecessary casting)

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

蒗幽 2024-12-11 07:51:13

您只需两次分配和一些指针算术即可逃脱：

int rows = 1000;
int cols = 256;
char *data;
char **FileTables;
int i;

data = malloc(rows * cols);
FileTables = malloc(rows * sizeof(char*));
for (i = 0; i < rows; i++) {
    FileTables[i] = data + i * cols;
}

另请注意，我修复了 malloc(rows * sizeof(char)) 中的错误（sizeof(char)应该是 sizeof(char*)，因为您正在将一个指针数组分配给 char）。

You could get away with just two allocations and some pointer arithmetic:

int rows = 1000;
int cols = 256;
char *data;
char **FileTables;
int i;

data = malloc(rows * cols);
FileTables = malloc(rows * sizeof(char*));
for (i = 0; i < rows; i++) {
    FileTables[i] = data + i * cols;
}

Also note that I fixed a bug in malloc(rows * sizeof(char)) (the sizeof(char) should be sizeof(char*), since you're allocating an array of pointers to char).

回复收藏 0 原文

梓梦 2024-12-11 07:51:13

只要列数不变，或者如果您使用的是 C99，您就可以使用单个 malloc ，而不必自己执行丑陋的行/列寻址算术：

char (*FileTables)[256] = malloc(rows * sizeof *FileTables);

As long as the number of columns is constant, or if you're using C99, you can get away with a single malloc without having to do ugly row/column addressing arithmetic yourself:

char (*FileTables)[256] = malloc(rows * sizeof *FileTables);

回复收藏 0 原文

谈情不如逗狗 2024-12-11 07:51:13

如果数组的大小始终为 row × 256，那么您可以考虑使用一维数组 malloc(row * 256)，并按步幅访问它：

char get(unsigned i, unsigned j, char * array) { return array[j + 256 * i]; }
void set(char value, unsigned i, unsigned j, char * array) { array[j + 256 * i] = value; }

这可以避免多次分配并提供更好的内存局部性。最重要的是，您可以选择行或列顺序进行微观优化。

If the array is always of the size row × 256, then you might consider a one-dimensional array malloc(row * 256), and access it in strides:

char get(unsigned i, unsigned j, char * array) { return array[j + 256 * i]; }
void set(char value, unsigned i, unsigned j, char * array) { array[j + 256 * i] = value; }

This avoids multiple allocations and gives better memory locality. On top of that, you can pick row or column ordering to micro-optimize.

回复收藏 0 原文

乞讨 2024-12-11 07:51:13

char **FileTables; 
int rows = 1000; 
int i; 

FileTables = (char**)malloc(rows * sizeof(char *)); 
char *data = (char *)malloc(256 * 1000 * sizeof(char));
for (i = 0; i < rows; ++i) { 
    FileTables[i] = data;
    data += 256 * sizeof(char);
}

应该是一个更好的解决方案。

char **FileTables; 
int rows = 1000; 
int i; 

FileTables = (char**)malloc(rows * sizeof(char *)); 
char *data = (char *)malloc(256 * 1000 * sizeof(char));
for (i = 0; i < rows; ++i) { 
    FileTables[i] = data;
    data += 256 * sizeof(char);
}

Should be a better solution.

回复收藏 0 原文

谜泪 2024-12-11 07:51:13

我不相信你能达到接近秒的速度。在我的机器上将行数增加到 1000 万仍然不到一秒。

但是，如果您想最小化分配，则只需要一个。

FileTables = (char**) malloc(rows * (sizeof(char *) + 256*sizeof(char)));
FileTables[0] = (char *) &FileTables[rows];
for (i = 1; i < rows; i++) {
    FileTables[i] = FileTables[i-1] + 256 * sizeof (char);
}
free(FileTables);

更有效的方法是避免第二级间接。

typedef char chars[256];

int main(int argc, char** argv) {
    chars* FileTables;
    int rows = 100000000;
    int i;

    FileTables = (chars*) malloc(rows * sizeof (chars));
    free(FileTables);

    return (EXIT_SUCCESS);
}

这避免了指针查找，因为 C 可以计算其余部分。

I don't believe you will get anywhere near seconds. Increasing the rows to 10 million is still under a second on my machine.

However if you want to minimise allocations, you only need one.

FileTables = (char**) malloc(rows * (sizeof(char *) + 256*sizeof(char)));
FileTables[0] = (char *) &FileTables[rows];
for (i = 1; i < rows; i++) {
    FileTables[i] = FileTables[i-1] + 256 * sizeof (char);
}
free(FileTables);

A more efficient way to do this is to avoid the second level of indirection.

typedef char chars[256];

int main(int argc, char** argv) {
    chars* FileTables;
    int rows = 100000000;
    int i;

    FileTables = (chars*) malloc(rows * sizeof (chars));
    free(FileTables);

    return (EXIT_SUCCESS);
}

This avoid a pointer lookup as the C can calculate the rest.

回复收藏 0 原文

平定天下 2024-12-11 07:51:13

首先，你确定是内存分配的问题吗？分配 1000 个内存块通常不会花费几秒钟。

如果您有特殊需求，您可以研究替代的 malloc 实现（例如，如果您在线程中分配内存，则可以使用 google 的 tcmalloc）。

否则，malloc 真正“慢”的部分实际上是从操作系统获取内存（使用 sbrk() 或 mmap()），并且大多数 malloc 实现一次会抓取一大块，然后将其分成较小的部分返回，因此这里不是有 1000 个调用来分配 1k，可能有 60 个调用来分配 16k。在 strace 或类似的环境下运行程序可能会让您了解实际进行了多少次缓慢的系统调用。您可以自己实现类似的行为，通过一次调用来分配 256K 并将其细分为更小的块。您可以尝试分配一大块内存，然后立即 free() 释放它，并希望库 malloc 保留该内存并且不会返回操作系统获取更多内存。

回复收藏 0 原文