Interestingly enough, this might be a stack overflow problem

Posted 2024-07-18 17:25:25


The following procedure (explanation follows) works fine for really small lists, but when the list contains a larger number of items (half a million), the application enters a "not responding" state and takes about 2.5 minutes to finish (a very bad time).
I might add that the application eventually needs to process lists of at least 100 million items.

Here is the code for the problematic procedure:

    public void removeItems(List<long> L, SortedList<long, List<long>> _subLists)
    {
        foreach (KeyValuePair<long, List<long>> kvp in _subLists)
        {
            foreach (long duplicate in kvp.Value)
            {
                // Find where this series starts in L, then remove kvp.Key
                // consecutive items (the whole series) in one call.
                int j = L.IndexOf(duplicate);
                L.RemoveRange(j, (int)kvp.Key);
            }
        }
    }

L is a list of long values.
_subLists is a sorted list where each value is a list of values from L, each of which starts an arithmetic progression series with some common difference (not relevant here). The key associated with each value is the length of the series.

Example:

L = {1,2,3,5,6,7,18,20,21}
_subLists = {2, <20>}, {3, <1,5>}

The procedure simply removes the arithmetic progression series from L.
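
To make the example concrete, here is a small hypothetical driver that builds these inputs and calls the procedure; the variable names are mine, but the values are the ones from the example above.

    // Hypothetical driver for the example above.
    var L = new List<long> { 1, 2, 3, 5, 6, 7, 18, 20, 21 };
    var subLists = new SortedList<long, List<long>>
    {
        { 2, new List<long> { 20 } },    // one series of length 2, starting at 20
        { 3, new List<long> { 1, 5 } }   // two series of length 3, starting at 1 and 5
    };
    removeItems(L, subLists);            // removes {20,21}, {1,2,3}, {5,6,7}; L is now { 18 }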


Comments (3)

九公里浅绿 2024-07-25 17:25:25


The run time of this procedure in big-O notation would be O(n^2), which is fairly slow, and you can expect a slow run time if one of the lists has 100 million entries. There is no stack overflow problem here; it's simply slow to iterate through this much data. I don't really see a question here. Are you looking to make this faster? If so, the nested for loops are definitely the problem.

清旖 2024-07-25 17:25:25


Your problem is that you are removing a lot of items from L, which is a very costly operation. Every time an item is removed, memory is copied to move all the items above the deleted item down. The more items that are removed, and the more items there are to shuffle down, the longer it takes. Memory is a bottleneck to performance: RAM runs slower than the CPU, and if you're paging to disk then it's really slow.

How can you improve this?

The easiest option is to use a container for L that has better performance when removing items - a LinkedList, for example. A LinkedList does not need to move items around in memory when elements are removed, but it does require more memory to store the data (two pointers per value). If that is too much overhead, then perhaps use a LinkedList<List<long>> instead, where each List<long> holds a maximum number of values.
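
To make that concrete, here is a minimal sketch of the plain LinkedList<long> variant, keeping the same contiguous-series assumption as the original code; the method name and signature are placeholders, not part of the original post. Finding the start of each series is still a linear scan, but unlinking the nodes no longer shifts any memory.

    // Hypothetical sketch: same removal logic, but on a LinkedList<long>,
    // so deleting a run of nodes does not shuffle the rest of the data.
    public void RemoveItemsLinked(LinkedList<long> items, SortedList<long, List<long>> subLists)
    {
        foreach (KeyValuePair<long, List<long>> kvp in subLists)
        {
            foreach (long start in kvp.Value)
            {
                // Still an O(n) scan to locate the first node of the series.
                LinkedListNode<long> node = items.Find(start);
                if (node == null) continue;

                // Unlink kvp.Key consecutive nodes; each removal is O(1).
                for (long i = 0; i < kvp.Key && node != null; i++)
                {
                    LinkedListNode<long> next = node.Next;
                    items.Remove(node);
                    node = next;
                }
            }
        }
    }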

Alternatively, change the removal algorithm so that you iterate over the list L and create a new list containing the values not found in _subLists. You can also change the way _subLists stores its data to make finding items in ranges quicker.
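
As a rough illustration of that idea (again assuming, as the original code does, that each series occupies consecutive positions in L), one could index the series starts up front and then do a single pass that copies only the survivors; the names below are invented for the sketch. This costs one extra list in memory but avoids all the in-place shuffling.

    // Hypothetical sketch: one pass that copies the values to keep,
    // instead of repeatedly removing ranges from the middle of the list.
    public List<long> RemoveItemsByCopy(List<long> L, SortedList<long, List<long>> subLists)
    {
        // Map each series start value to its series length for O(1) lookups.
        var startToLength = new Dictionary<long, long>();
        foreach (KeyValuePair<long, List<long>> kvp in subLists)
            foreach (long start in kvp.Value)
                startToLength[start] = kvp.Key;

        var result = new List<long>(L.Count);
        for (int i = 0; i < L.Count; i++)
        {
            if (startToLength.TryGetValue(L[i], out long length))
                i += (int)length - 1;   // skip the whole series
            else
                result.Add(L[i]);
        }
        return result;
    }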

剧终人散尽 2024-07-25 17:25:25


If possible:

A) Convert L into a sorted linked list. O(n log n)

B) Convert the sublists into a sorted list of pairs, where the first item is the number in the sequence in L ("duplicate" in the posted code snippet) and the second item is the length of the sequence. O(n log n)

C) Perform a single pass through L, using the sublists to determine how many elements to remove at a given spot in L. Take advantage of the fact that both lists are sorted so that you never backtrack in either list. O(n). A sketch of this pass follows below.

You should be able to get O(n log n) complexity out of this if it is possible to use.
Of course, I'm not 100% sure about all of the details of the problem. For instance, can L have duplicates? If so, does the order of the sublists matter? You may be forced to ditch or modify such an algorithm depending on the answers to those questions. Also, this will obviously use more memory.
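
For illustration only, here is a minimal sketch of step C, assuming both L and the (start value, length) pairs are sorted ascending, the series do not overlap, and each series sits in consecutive positions in L; the method and parameter names are made up for the example.

    // Hypothetical sketch of the single pass: walk the sorted LinkedList<long>
    // once, consuming the sorted (startValue, length) pairs as they are met.
    public void RemoveSeriesSinglePass(LinkedList<long> items, List<(long Start, long Length)> pairs)
    {
        int p = 0;                                // index into the sorted pairs
        LinkedListNode<long> node = items.First;

        while (node != null && p < pairs.Count)
        {
            if (node.Value == pairs[p].Start)
            {
                // Remove Length consecutive nodes starting at this one.
                for (long i = 0; i < pairs[p].Length && node != null; i++)
                {
                    LinkedListNode<long> next = node.Next;
                    items.Remove(node);
                    node = next;
                }
                p++;                              // series handled, move to the next pair
            }
            else
            {
                node = node.Next;                 // keep this value, advance
            }
        }
    }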
