如何判断一个数组是否是 O(n) 的排列？

酒解孤独 2024-09-09 09:14:17

我有点怀疑是否有解决方案。您的问题似乎与几年前在数学文献中提出的问题非常接近，其中给出了的摘要这里（“重复检测问题”，S. Kamal Abdali，2003）使用循环检测——其想法如下：

如果存在重复，则存在一个数字j 1 和 N 之间，这样以下情况将导致无限循环：

x := j;
do
{
   x := a[x];
}
while (x != j);

因为排列由不同元素 s₀, s₁, 的一个或多个子集 S 组成。 .. s_k-1 其中 s_j = a[s_j-1] 对于 1 和 k-1 之间的所有 j，并且 s ₀ = a[s_k-1]，因此所有元素都包含在循环中 - 重复项之一不会成为此类子集的一部分。

例如，如果数组 = [2, 1, 4, 6, 8, 7, 9, 3, 8]

，则位置 5 处的粗体元素是重复的，因为所有其他元素形成循环： { 2-> 1, 4-> 6-> 7-> 9-> 8-> 3}。而数组 [2, 1, 4, 6, 5, 7, 9, 3, 8] 和 [2, 1, 4, 6, 3, 7, 9, 5, 8] 是有效的排列（循环 { 2 -> 1, 4 -> 7 -> 3, 5 } 和 { 1, 4 -> 9 -> 8 -> 5 -> 3 })。

阿卜达利研究了一种查找重复项的方法。基本上，如果您遇到以下算法（使用 Floyd 的循环查找算法），则可以使用所讨论的重复项的数量：

function is_duplicate(a, N, j)
{
     /* assume we've already scanned the array to make sure all elements
        are integers between 1 and N */
     x1 := j;
     x2 := j;
     do
     {             
         x1 := a[x1];
         x2 := a[x2];
         x2 := a[x2];
     } while (x1 != x2);

     /* stops when it finds a cycle; x2 has gone around it twice, 
        x1 has gone around it once.
        If j is part of that cycle, both will be equal to j. */
     return (x1 != j);
}

困难在于我不确定您所描述的问题是否与他论文中的问题相符，而且我也不确定他描述的方法是否在 O(N) 中运行或使用固定的空间量。一个潜在的反例是以下数组：

[3, 4, 5, 6, 7, 8, 9, 10, ... N-10, N-9, N-8, N-7, N-2, N- 5, N-5, N-3, N-5, N-1, N, 1, 2]

基本上是单位排列移位 2，元素为 [N-6, N-4, 和 N-2 ] 替换为 [N-2, N-5, N-5]。这有正确的总和（不是正确的乘积，但我拒绝将乘积作为可能的检测方法，因为使用任意精度算术计算 N！的空间要求是 O(N)，这违反了“固定内存空间”的精神要求），如果你试图找到周期，你会得到周期 { 3 -> 5-> 7-> 9-> ... N-7-> N-5→ N-1 } 和 { 4 → 6-> 8-> ... N-10-> N-8-> N-2→ N-> 2}。问题是最多可能有 N 个周期（身份排列有 N 个周期），每个周期需要 O(N) 才能找到重复项，并且您必须以某种方式跟踪哪些周期已被跟踪，哪些周期尚未被跟踪。我怀疑是否可以在固定的空间内做到这一点。但也许确实如此。

这是一个足够严重的问题，值得在 mathoverflow.net 上询问（尽管大多数时候 mathoverflow.net 在 stackoverflow 上被引用）对于太简单的问题）

编辑：我在 mathoverflow 上提问< /a>，那里有一些有趣的讨论。

I'm very slightly skeptical that there is a solution. Your problem seems to be very close to one posed several years ago in the mathematical literature, with a summary given here ("The Duplicate Detection Problem", S. Kamal Abdali, 2003) that uses cycle-detection -- the idea being the following:

If there is a duplicate, there exists a number j between 1 and N such that the following would lead to an infinite loop:

x := j;
do
{
   x := a[x];
}
while (x != j);

because a permutation consists of one or more subsets S of distinct elements s₀, s₁, ... s_k-1 where s_j = a[s_j-1] for all j between 1 and k-1, and s₀ = a[s_k-1], so all elements are involved in cycles -- one of the duplicates would not be part of such a subset.

e.g. if the array = [2, 1, 4, 6, 8, 7, 9, 3, 8]

then the element in bold at position 5 is a duplicate because all the other elements form cycles: { 2 -> 1, 4 -> 6 -> 7 -> 9 -> 8 -> 3}. Whereas the arrays [2, 1, 4, 6, 5, 7, 9, 3, 8] and [2, 1, 4, 6, 3, 7, 9, 5, 8] are valid permutations (with cycles { 2 -> 1, 4 -> 6 -> 7 -> 9 -> 8 -> 3, 5 } and { 2 -> 1, 4 -> 6 -> 7 -> 9 -> 8 -> 5 -> 3 } respectively).

Abdali goes into a way of finding duplicates. Basically the following algorithm (using Floyd's cycle-finding algorithm) works if you happen across one of the duplicates in question:

function is_duplicate(a, N, j)
{
     /* assume we've already scanned the array to make sure all elements
        are integers between 1 and N */
     x1 := j;
     x2 := j;
     do
     {             
         x1 := a[x1];
         x2 := a[x2];
         x2 := a[x2];
     } while (x1 != x2);

     /* stops when it finds a cycle; x2 has gone around it twice, 
        x1 has gone around it once.
        If j is part of that cycle, both will be equal to j. */
     return (x1 != j);
}

The difficulty is I'm not sure your problem as stated matches the one in his paper, and I'm also not sure if the method he describes runs in O(N) or uses a fixed amount of space. A potential counterexample is the following array:

[3, 4, 5, 6, 7, 8, 9, 10, ... N-10, N-9, N-8, N-7, N-2, N-5, N-5, N-3, N-5, N-1, N, 1, 2]

which is basically the identity permutation shifted by 2, with the elements [N-6, N-4, and N-2] replaced by [N-2, N-5, N-5]. This has the correct sum (not the correct product, but I reject taking the product as a possible detection method since the space requirements for computing N! with arbitrary precision arithmetic are O(N) which violates the spirit of the "fixed memory space" requirement), and if you try to find cycles, you will get cycles { 3 -> 5 -> 7 -> 9 -> ... N-7 -> N-5 -> N-1 } and { 4 -> 6 -> 8 -> ... N-10 -> N-8 -> N-2 -> N -> 2}. The problem is that there could be up to N cycles, (identity permutation has N cycles) each taking up to O(N) to find a duplicate, and you have to keep track somehow of which cycles have been traced and which have not. I'm skeptical that it is possible to do this in a fixed amount of space. But maybe it is.

This is a heavy enough problem that it's worth asking on mathoverflow.net (despite the fact that most of the time mathoverflow.net is cited on stackoverflow it's for problems which are too easy)

edit: I did ask on mathoverflow, there's some interesting discussion there.

如何判断一个数组是否是 O(n) 的排列？

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（16）

关于作者

相关话题

热门标签

推荐作者

琉璃梦幻

qq_4zWU6L

话少情深

西西弗的石头怪

彻夜缠绵

千寻…

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。