List.AddRange 实现不理想

发布于 2024-08-19 03:13:15 字数 2527 浏览 14 评论 0原文

对我的 C# 应用程序进行分析表明，List.AddRange 花费了大量时间。使用 Reflector 查看此方法中的代码表明它调用了 List.InsertRange ，其实现如下：

public void InsertRange(int index, IEnumerable<T> collection)
{
    if (collection == null)
    {
        ThrowHelper.ThrowArgumentNullException(ExceptionArgument.collection);
    }
    if (index > this._size)
    {
        ThrowHelper.ThrowArgumentOutOfRangeException(ExceptionArgument.index, ExceptionResource.ArgumentOutOfRange_Index);
    }
    ICollection<T> is2 = collection as ICollection<T>;
    if (is2 != null)
    {
        int count = is2.Count;
        if (count > 0)
        {
            this.EnsureCapacity(this._size + count);
            if (index < this._size)
            {
                Array.Copy(this._items, index, this._items, index + count, this._size - index);
            }
            if (this == is2)
            {
                Array.Copy(this._items, 0, this._items, index, index);
                Array.Copy(this._items, (int) (index + count), this._items, (int) (index * 2), (int) (this._size - index));
            }
            else
            {
                T[] array = new T[count];          // (*)
                is2.CopyTo(array, 0);              // (*)
                array.CopyTo(this._items, index);  // (*)
            }
            this._size += count;
        }
    }
    else
    {
        using (IEnumerator<T> enumerator = collection.GetEnumerator())
        {
            while (enumerator.MoveNext())
            {
                this.Insert(index++, enumerator.Current);
            }
        }
    }
    this._version++;
}

private T[] _items;

可以说接口的简单性（只有一个 InsertRange 重载）证明运行时类型检查和转换的性能开销是合理的。但是我用 (*) 指示的 3 行背后的原因可能是什么？我认为它可以重写为更快的替代方案：

is2.CopyTo(this._items, index);

您认为有什么理由不使用这种更简单且明显更快的替代方案吗？

编辑：

感谢您的回答。因此，共识认为这是针对以有缺陷/恶意方式实现 CopyTo 的输入集合的一种保护措施。对我来说，不断付出 1) 运行时类型检查 2) 临时数组的动态分配 3) 双倍复制操作的代价似乎有点过分了，而所有这些都可以通过定义 2 个或多个 InsertRange 重载来保存，一个像现在一样获取 IEnumerable，第二个获取 List，第三个获取 T[]。后两者的运行速度可以是当前情况的两倍。

编辑 2：

我确实实现了一个类 FastList，与 List 相同，但它还提供了接受 T[] 参数的 AddRange 的重载。此重载不需要动态类型验证和元素的双重复制。我确实通过将 4 字节数组添加到最初为空的列表 1000 次来根据 List.AddRange 来分析此 FastList.AddRange。我的实现比标准 List.AddRange 的速度快了 9 倍（九！）。在我们应用程序的重要使用场景之一中，List.AddRange 占用大约 5% 的运行时间，用提供更快 AddRange 的类替换 List 可以将应用程序运行时间提高 4%。

原文

Profiling my C# application indicated that significant time is spent in List<T>.AddRange. Using Reflector to look at the code in this method indicated that it calls List<T>.InsertRange which is implemented as such:

public void InsertRange(int index, IEnumerable<T> collection)
{
    if (collection == null)
    {
        ThrowHelper.ThrowArgumentNullException(ExceptionArgument.collection);
    }
    if (index > this._size)
    {
        ThrowHelper.ThrowArgumentOutOfRangeException(ExceptionArgument.index, ExceptionResource.ArgumentOutOfRange_Index);
    }
    ICollection<T> is2 = collection as ICollection<T>;
    if (is2 != null)
    {
        int count = is2.Count;
        if (count > 0)
        {
            this.EnsureCapacity(this._size + count);
            if (index < this._size)
            {
                Array.Copy(this._items, index, this._items, index + count, this._size - index);
            }
            if (this == is2)
            {
                Array.Copy(this._items, 0, this._items, index, index);
                Array.Copy(this._items, (int) (index + count), this._items, (int) (index * 2), (int) (this._size - index));
            }
            else
            {
                T[] array = new T[count];          // (*)
                is2.CopyTo(array, 0);              // (*)
                array.CopyTo(this._items, index);  // (*)
            }
            this._size += count;
        }
    }
    else
    {
        using (IEnumerator<T> enumerator = collection.GetEnumerator())
        {
            while (enumerator.MoveNext())
            {
                this.Insert(index++, enumerator.Current);
            }
        }
    }
    this._version++;
}

private T[] _items;

One can argue that the simplicity of the interface (only having one overload of InsertRange) justifies the performance overhead of runtime type cheching and casting.
But what could be the reason behind the 3 lines I have indicated with (*) ?
I think it could be rewritten to the faster alternative:

is2.CopyTo(this._items, index);

Do you see any reason for not using this simpler and apparently faster alternative?

Edit:

Thanks for the answers. So consensus opinion is that this is a protective measure against the input collection implementing the CopyTo in a defective/malicious manner. To me it seems like a overkill to constantly pay the price of 1) runtime type checking 2) dynamic allocation of the temporary array 3) double the copy operation, when all this could have been saved by defining 2 or a few more overloads of InsertRange, one getting IEnumerable as now, the second getting a List<T>, third getting T[]. The later two could have been implemented to run around twice as fast as in the current case.

Edit 2:

I did implement a class FastList, identical to List, except that it also provides an overload of AddRange which takes a T[] argument. This overload does not need the dynamic type verification, and double-copying of elements. I did profile this FastList.AddRange against List.AddRange by adding 4-byte arrays 1000 times to a list which was initially emtpy. My implementation beats the speed of standard List.AddRange with a factor of 9 (nine!). List.AddRange takes about 5% of runtime in one of the important usage scenarios of our application, replacing List with a class providing a faster AddRange could improve application runtime by 4%.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

小…楫夜泊 2024-08-26 03:13:15

它们阻止 ICollection 的实现访问插入边界之外的目标列表的索引。如果调用 CopyTo 的错误（或“操纵”）实现，则上述实现会导致 IndexOutOfBoundsException。

请记住，T[].CopyTo 实际上在内部实现为 memcpy，因此添加该行的性能开销很小。当您以如此低的成本为大量呼叫添加安全性时，您不妨这样做。

编辑：我觉得奇怪的部分是，对 ICollection.CopyTo 的调用（复制到临时数组）不会在调用<之后立即发生。代码>确保容量。如果它被移动到该位置，则遵循任何同步异常列表将保持不变。按原样，仅当插入发生在列表末尾时，该条件才成立。这里的推理是：

所有必要的分配都发生在更改列表元素之前。
对 Array.Copy 的调用不会失败，因为
- 内存已分配
- 边界已检查
- 源数组和目标数组的元素类型匹配
- 没有像 C++ 那样使用“复制构造函数”——它只是一个 memcpy
唯一可以引发异常的项是对 ICollection.CopyTo 的外部调用以及调整列表大小和分配临时数组所需的分配。如果所有这三个发生在移动元素进行插入之前，则更改列表的事务不能引发同步异常。
最后说明：这解决了严格的异常行为 - 上述基本原理没有添加线程安全性。

编辑2（对OP编辑的回应）：你对此进行过分析吗？您大胆声称 Microsoft 应该选择更复杂的 API，因此您应该确保当前方法速度慢的断言是正确的。我从来没有遇到过 InsertRange 的性能问题，而且我非常确定，与重新实现动态列表相比，重新设计算法可以更好地解决人们遇到的任何性能问题。为了避免您认为我以消极的方式严厉，请记住以下几点：

我不想无法忍受我的开发团队中喜欢重新发明方轮。
我绝对希望我的团队中的人关心潜在的性能问题，并就他们的代码可能产生的副作用提出问题。当这一点出现时，就会获胜——但只要人们提出问题，我就会促使他们将问题转化为可靠的答案。如果您可以向我展示应用程序通过最初看似糟糕的想法获得了显着的优势，那么有时事情就是这样的。

They are preventing the implementation of ICollection<T> from accessing indices of the destination list outside the bounds of insertion. The implementation above results in an IndexOutOfBoundsException if a faulty (or "manipulative") implementation of CopyTo is called.

Keep in mind that T[].CopyTo is quite literally internally implemented as memcpy, so the performance overhead of adding that line is minute. When you have such a low cost of adding safety to a tremendous number of calls, you might as well do so.

Edit: The part I find strange is the fact that the call to ICollection<T>.CopyTo (copying to the temporary array) does not occur immediately following the call to EnsureCapacity. If it were moved to that location, then following any synchronous exception the list would remain unchanged. As-is, that condition only holds if the insertion happens at the end of the list. The reasoning here is:

All necessary allocation happens before altering the list elements.
The calls to Array.Copy cannot fail because
- The memory is already allocated
- The bounds are already checked
- The element types of the source and destination arrays match
- There is no "copy constructor" used like in C++ - it's just a memcpy
The only items that can throw an exception are the external call to ICollection.CopyTo and the allocations required for resizing the list and allocating the temporary array. If all three of these occur before moving elements for the insertion, the transaction to change the list cannot throw a synchronous exception.
Final note: This address strictly exceptional behavior - the above rationale does not add thread-safety.

Edit 2 (response to the OP's edit): Have you profiled this? You are making some bold claims that Microsoft should have chosen a more complicated API, so you should make sure you're correct in the assertions that the current method is slow. I've never had a problem with the performance of InsertRange, and I'm quite sure that any performance problems someone does face with it will be better resolved with an algorithm redesign than by reimplementing the dynamic list. Just so you don't take me as being harsh in a negative way, keep the following in mind:

I ~~don't want~~ can't stand people on my dev team that like to reinvent the square wheel.
I definitely want people on my team that care about potential performance issues, and ask questions about the side effects their code may have. This point wins out when present - but as long as people are asking questions I will drive them to turn their questions into solid answers. If you can show me that an application gains a significant advantage through what initially appears to be a bad idea, then that's just the way things go sometimes.

回复收藏 0 原文