Efficient algorithm for finding the closest point on a graph with no known equation

Posted 2024-11-25 20:14:15

I'm asking this question out of curiosity, since my quick and dirty implementation seems to be good enough. However, I'm curious what a better implementation would be.

I have a graph of real-world data. There are no duplicate X values and X increases at a constant rate across the graph, but the Y data comes from real-world output. I want to programmatically find the point on the graph nearest to an arbitrary given point P. I'm trying to find an efficient (i.e. fast) algorithm for doing this. I don't need the exact closest point; I can settle for a point that is 'nearly' the closest.

The obvious lazy solution is to iterate over every single point in the graph, calculate its distance to P, and take the minimum. However, this could theoretically be slow for large graphs; too slow for what I want.
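For reference, the lazy linear scan might look like the sketch below. The names `ys`, `x0`, and `dx` are assumptions made for illustration: the Y samples are taken to be in a list, with the X values implied by a start value and a constant positive step.

```python
import math

def nearest_brute_force(ys, x0, dx, p):
    """Return the index of the sample closest to p = (px, py).

    Assumes (hypothetically) that sample i sits at x = x0 + i * dx.
    """
    px, py = p
    best_i, best_d2 = 0, math.inf
    for i, y in enumerate(ys):
        x = x0 + i * dx
        # Squared distance is enough for comparison; no sqrt needed.
        d2 = (x - px) ** 2 + (y - py) ** 2
        if d2 < best_d2:
            best_i, best_d2 = i, d2
    return best_i
```

This visits every sample once, so the cost grows linearly with the size of the graph.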

Since I only need an approximate closest point, I imagine the ideal fastest approach would involve generating a best-fit line and using that line to calculate where the point should be in real time, but that sounds like a potential mathematical headache I'm not about to take on.

My solution is a hack which works only because I assume my point P isn't arbitrary: I assume that P will usually be close to my graph line, and when that happens I can cross out the distant X values from consideration. I calculate how close the point on the line that shares its X coordinate with P is, and use the distance between that point and P to calculate the largest and smallest X values that could possibly belong to closer points.
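A minimal sketch of this pruning idea, as I read it, follows. It is not my actual code; `ys`, `x0`, and `dx` are hypothetical names, and it assumes the samples sit at `x = x0 + i * dx` with `dx > 0`. The distance `r` from P to the sample at (roughly) the same X is an upper bound on the true minimum, so any closer sample must have an X within `r` of P's X.

```python
import math

def nearest_windowed(ys, x0, dx, p):
    """Return the index of the sample closest to p, scanning only a window.

    Assumes (hypothetically) sample i sits at x = x0 + i * dx, dx > 0.
    """
    px, py = p
    n = len(ys)
    # Index of the sample whose x is nearest to px, clamped into range.
    i0 = min(max(round((px - x0) / dx), 0), n - 1)
    # Distance to that sample bounds the true minimum from above.
    r = math.hypot(x0 + i0 * dx - px, ys[i0] - py)
    # A closer sample must satisfy |x - px| <= r, which limits the indices.
    lo = max(math.ceil((px - r - x0) / dx), 0)
    hi = min(math.floor((px + r - x0) / dx), n - 1)
    best_i, best_d2 = i0, r * r
    for i in range(lo, hi + 1):
        d2 = (x0 + i * dx - px) ** 2 + (ys[i] - py) ** 2
        if d2 < best_d2:
            best_i, best_d2 = i, d2
    return best_i
```

When P lies close to the curve, `r` is small and only a handful of samples are examined. When P is far from the curve, the window can grow to cover the whole graph, so the worst case is still a full scan.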

I can't help but feel there should be a faster algorithm than my solution (which is only useful because I assume that 99% of the time my point P will already be close to the line). I tried googling for better algorithms, but found so many that didn't quite fit that it was hard to find what I was looking for among all the clutter. So, does anyone here have a suggested algorithm that would be more efficient? Keep in mind I don't need a full algorithm, since what I have works for my needs; I'm just curious what the proper solution would have been.
