Do I need to implement a B-Tree search for this?

Posted on 2024-09-28 19:00:24 · Word count: 789 · Views: 6 · Comments: 0

I have an array of integers, which could run into the hundreds of thousands (or more), sorted numerically ascending since that's how they were originally stacked.

I need to be able to query the array to get the index of its first occurrence of a number >= some input, as efficiently as possible. The only way I would know how to do this without even thinking about it would be to iterate through the array testing the condition until it returns true, at which point I'd stop iterating. However, this is the most expensive solution to this problem and I'm looking for the best algorithm to solve it.
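For reference, the linear scan described above might look like the following minimal sketch. The function name `firstIndexAtLeastLinear` and the choice to return -1 when no element qualifies are assumptions made for illustration; it uses the `>=` condition as stated in the paragraph.

```javascript
// Baseline linear scan over an ascending array.
// Returns the index of the first element >= target, or -1 if none exists.
// `firstIndexAtLeastLinear` is a hypothetical name, not an existing API.
function firstIndexAtLeastLinear(sorted, target) {
  for (let i = 0; i < sorted.length; i++) {
    if (sorted[i] >= target) {
      return i; // stop at the first element that satisfies the condition
    }
  }
  return -1; // every element is smaller than target
}

const sample = [1, 7, 23, 23, 23, 89, 1002, 1003];
console.log(firstIndexAtLeastLinear(sample, 100)); // 6
```

In the worst case this inspects every element, which is the cost the question is trying to avoid.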

I'm coding in Objective-C, but I'll give an example in JavaScript to broaden the audience of people who are able to respond.

// Sample set
var numbers = [1, 7, 23, 23, 23, 89, 1002, 1003];

// getIndexOfValueGreaterThan is the function I'm looking for (not yet written)
var indexAfter100 = getIndexOfValueGreaterThan(numbers, 100);
var indexAfter7 = getIndexOfValueGreaterThan(numbers, 7);

// Expected: indexAfter100 === 6
// Expected: indexAfter7 === 2

Putting this data into a DB in order to perform this search will only be a last-resort solution since I'm keen to see some sort of algorithm to tackle this quickly in memory.

I do have the ability to change the data structure, or to store an additional data structure as I'm building the array, since my program has already pushed each number one by one onto this stack, so I'd just modify the code that's adding them to the stack. Searching for the index as they're being added to the stack isn't possible since the search operation will be repeated frequently with different values after the fact.

Right now I'm thinking "B-Tree", but to be honest I have no idea how to implement one. Before I go off and start figuring that out, is there a simpler algorithm that fits this single use case better?
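For comparison, an array that is already sorted can be queried with a plain binary search, with no extra data structure. The following is a minimal sketch under that assumption, not a definitive implementation. The names `lowerBound` and `upperBound` are hypothetical. Two variants are shown because the text asks for `>=` while the sample's expected result for 7 (index 2) matches a strict `>`.

```javascript
// Binary search on an ascending array: O(log n) comparisons per query.
// Both helpers return sorted.length when no element qualifies.

// Index of the first element >= target.
function lowerBound(sorted, target) {
  let lo = 0;
  let hi = sorted.length; // search window is [lo, hi)
  while (lo < hi) {
    const mid = (lo + hi) >>> 1; // midpoint, rounded down
    if (sorted[mid] < target) {
      lo = mid + 1; // answer lies to the right of mid
    } else {
      hi = mid; // mid qualifies; keep looking for an earlier one
    }
  }
  return lo;
}

// Index of the first element > target (strictly greater).
function upperBound(sorted, target) {
  let lo = 0;
  let hi = sorted.length;
  while (lo < hi) {
    const mid = (lo + hi) >>> 1;
    if (sorted[mid] <= target) {
      lo = mid + 1;
    } else {
      hi = mid;
    }
  }
  return lo;
}

const numbers = [1, 7, 23, 23, 23, 89, 1002, 1003];
console.log(lowerBound(numbers, 100)); // 6
console.log(upperBound(numbers, 7));   // 2 (matches the sample above)
console.log(lowerBound(numbers, 7));   // 1 (the >= reading)
```

For a few hundred thousand elements this needs roughly 18 to 20 comparisons per lookup, and appending new values in ascending order keeps the array searchable without any rebuild.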
