当前位置：文江博客话题详情

区间并集

发布于 2024-07-25 05:23:11 字数 94 浏览 19 评论 0原文

我有一个代表间隔的类。该类有两个可比较类型的属性“start”和“end”。现在我正在寻找一种有效的算法来获取一组此类间隔的并集。

提前致谢。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

最好是你 2024-08-01 05:23:12

使用扫描线算法。基本上，您对列表中的所有值进行排序（同时保留它是与每个项目一起的间隔的开始还是结束）。此操作的复杂度为 O(n log n)。然后，沿着已排序的项目进行一次循环并计算间隔 O(n)。

O(n log n) + O(n) = O(n log n)

回复收藏 0 原文

恋你朝朝暮暮 2024-08-01 05:23:12

事实证明，这个问题已经被解决了很多次——在不同程度的想象中，在命名法下：http://en.wikipedia.org/wiki/Interval_tree , http://en .wikipedia.org/wiki/Segment_tree ，以及“RangeTree”

（因为OP的问题涉及大量间隔，这些数据结构很重要）

就我自己选择的python库选择而言：

从测试中，我'我发现在功能齐全和 python 当前（非位腐烂）方面最能解决问题的是：SymPy 中的“Interval”和“Union”类，请参阅：http://sympystats.wordpress.com/2012/03/30/simplifying-sets/
另一个好东西寻找选择，性能更高但功能较少的选项（例如。不适用于浮点范围删除）： https://pypi.python.org/pypi/Banyan< /a>

最后：在 IntervalTree、SegmentTree、RangeTree 中的任何一个下搜索 SO 本身，你会找到答案/钩子更多

回复收藏 0 原文

雨的味道风的声音 2024-08-01 05:23:12

对所有点进行排序。然后遍历列表，增加“开始”点的计数器，并减少“结束”点的计数器。如果计数器达到 0，则它确实是并集中某个间隔的端点。

计数器永远不会变为负数，并且在列表末尾将达到 0。

回复收藏 0 原文

活泼老夫 2024-08-01 05:23:12

geocar 的算法在以下情况下失败：

s=[tp(0,1),tp(0,3)]

我不太确定，但我认为这是正确的方法：

class tp():
    def __repr__(self):
        return '(%.2f,%.2f)' % (self.start, self.end)
    def __init__(self,start,end): 
        self.start=start
        self.end=end
s=[tp(0,1),tp(0,3),tp(4,5)]
s.sort(key=lambda self: self.start)
print s
y=[ s[0] ]
for x in s[1:]:
    if y[-1].end < x.start:
        y.append(x)
    elif y[-1].end == x.start:
        y[-1].end = x.end
    if x.end > y[-1].end:
        y[-1].end = x.end
print y

我还实现了它的减法：

#subtraction
z=tp(1.5,5) #interval to be subtracted
s=[tp(0,1),tp(0,3), tp(3,4),tp(4,6)]

s.sort(key=lambda self: self.start)
print s
for x in s[:]:
    if z.end < x.start:
        break
    elif z.start < x.start and z.end > x.start and z.end < x.end:
        x.start=z.end
    elif z.start < x.start and z.end > x.end:
        s.remove(x)
    elif z.start > x.start and z.end < x.end:
        s.append(tp(x.start,z.start))
        s.append(tp(z.end,x.end))
        s.remove(x)
    elif z.start > x.start and z.start < x.end and z.end > x.end:
        x.end=z.start
    elif z.start > x.end:
        continue

print s

The algorithm by geocar fails when:

s=[tp(0,1),tp(0,3)]

I'm not very sure but I think this is the correct way:

class tp():
    def __repr__(self):
        return '(%.2f,%.2f)' % (self.start, self.end)
    def __init__(self,start,end): 
        self.start=start
        self.end=end
s=[tp(0,1),tp(0,3),tp(4,5)]
s.sort(key=lambda self: self.start)
print s
y=[ s[0] ]
for x in s[1:]:
    if y[-1].end < x.start:
        y.append(x)
    elif y[-1].end == x.start:
        y[-1].end = x.end
    if x.end > y[-1].end:
        y[-1].end = x.end
print y

I also implemented it for subtraction:

#subtraction
z=tp(1.5,5) #interval to be subtracted
s=[tp(0,1),tp(0,3), tp(3,4),tp(4,6)]

s.sort(key=lambda self: self.start)
print s
for x in s[:]:
    if z.end < x.start:
        break
    elif z.start < x.start and z.end > x.start and z.end < x.end:
        x.start=z.end
    elif z.start < x.start and z.end > x.end:
        s.remove(x)
    elif z.start > x.start and z.end < x.end:
        s.append(tp(x.start,z.start))
        s.append(tp(z.end,x.end))
        s.remove(x)
    elif z.start > x.start and z.start < x.end and z.end > x.end:
        x.end=z.start
    elif z.start > x.end:
        continue

print s

回复收藏 0 原文

人间不值得 2024-08-01 05:23:12

在 C++ 中求区间并集的总和

#include <iostream>
#include <algorithm>

struct interval
{
    int m_start;
    int m_end;
};

int main()
{
    interval arr[] = { { 9, 10 }, { 5, 9 }, { 3, 4 }, { 8, 11 } };

    std::sort(
        arr,
        arr + sizeof(arr) / sizeof(interval),
        [](const auto& i, const auto& j) { return i.m_start < j.m_start; });

    int total = 0;
    auto current = arr[0];
    for (const auto& i : arr)
    {
        if (i.m_start >= current.m_end)
        {
            total += current.m_end - current.m_start;
            current = i;
        }
        else if (i.m_end > current.m_end)
        {
            current.m_end = i.m_end;
        }
    }

    total += current.m_end - current.m_start;
    std::cout << total << std::endl;
}

To find the total of the union of intervals in c++

#include <iostream>
#include <algorithm>

struct interval
{
    int m_start;
    int m_end;
};

int main()
{
    interval arr[] = { { 9, 10 }, { 5, 9 }, { 3, 4 }, { 8, 11 } };

    std::sort(
        arr,
        arr + sizeof(arr) / sizeof(interval),
        [](const auto& i, const auto& j) { return i.m_start < j.m_start; });

    int total = 0;
    auto current = arr[0];
    for (const auto& i : arr)
    {
        if (i.m_start >= current.m_end)
        {
            total += current.m_end - current.m_start;
            current = i;
        }
        else if (i.m_end > current.m_end)
        {
            current.m_end = i.m_end;
        }
    }

    total += current.m_end - current.m_start;
    std::cout << total << std::endl;
}

回复收藏 0 原文

棒棒糖 2024-08-01 05:23:11

按其中一个术语（例如，开始）对它们进行排序，然后在浏览列表时检查与其（右侧）邻居的重叠。

class tp:
    def __repr__(self):
        return "(%d,%d)" % (self.start, self.end)

    def __init__(self, start, end):
        self.start = start
        self.end = end


s = [tp(5, 10), tp(7, 8), tp(0, 5)]
s.sort(key=lambda self: self.start)
y = [s[0]]
for x in s[1:]:
    if y[-1].end < x.start:
        y.append(x)
    elif y[-1].end == x.start:
        y[-1].end = x.end

Sort them by one of the terms (start, for example), then check for overlaps with its (right-hand) neighbor as you move through the list.

class tp:
    def __repr__(self):
        return "(%d,%d)" % (self.start, self.end)

    def __init__(self, start, end):
        self.start = start
        self.end = end


s = [tp(5, 10), tp(7, 8), tp(0, 5)]
s.sort(key=lambda self: self.start)
y = [s[0]]
for x in s[1:]:
    if y[-1].end < x.start:
        y.append(x)
    elif y[-1].end == x.start:
        y[-1].end = x.end

回复收藏 0 原文

~没有更多了~