当前位置：文江博客话题详情

支持 google/bing 地图的数据结构

发布于 2024-08-18 22:57:54 字数 88 浏览 6 评论 0原文

我想知道像 google/bing 地图这样的应用程序中的数据结构是什么。为什么搜索路线时返回结果这么快？使用什么样的算法来确定这些信息？

谢谢

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

蝶舞 2024-08-25 22:57:54

您的问题有两个部分：

使用什么样的数据结构来存储地图信息。
使用什么样的算法从源“导航”到目的地。

对此，我想补充另一个问题：

Google/Bing 如何“流入”数据。例如，您可以无缝地从数英里放大到地面，同时保持坐标系不变。

我将尝试按顺序解决每个问题。请注意，我不为 Google 地图或 Bing 团队工作，因此很明显，此信息可能不完全准确。我的这一点基于从一门关于数据结构和算法的优秀计算机科学课程中获得的知识。

Ans 1) 该图存储在边加权有向图中。地图上的位置是“顶点”，从一个位置到另一位置（从一个顶点到另一个顶点）的路径是“边”。

很明显，由于可能有数百万个顶点和一个数量级的边，所以真正有趣的是这个边加权有向图的表示。

我想说这将由某种邻接列表来表示，我这么说的原因是因为，如果你想象一张地图，它本质上是一个稀疏图。从一个地点到另一个地点的方法只有几种。想想你的房子吧！有多少条路（在我们的例子中是边）通向它？邻接表适合表示稀疏图，邻接矩阵适合表示稠密图。

当然，即使我们能够在内存中有效地表示稀疏图，但考虑到顶点和边的数量巨大，也不可能将所有内容一次性存储在内存中。因此，我会想象下面有某种流媒体库。

打个比方，如果你曾经玩过《魔兽世界》/《Syrim》/《GTA》等开放世界游戏，你会发现在很大程度上，没有加载屏幕。但很明显，不可能一次性将所有内容都放入内存中。因此，使用四叉树和视锥体剔除算法的组合，这些游戏能够动态加载资源（地形、精灵、网格等）。

我会想象类似的东西，但是对于图表。我没有对这个特定方面进行太多思考，但是为了构建一个非常基本的系统，人们可以想象一个内存数据库，它们在运行时根据需要查询并添加/删除图中的顶点和边。这给我们带来了另一个有趣的观点。由于需要在运行时删除和添加顶点和边，因此邻接列表的经典实现不会削减它。

在经典的实现中，我们只是将一个 List（Java 中的 Vector）存储在数组的每个元素中：Adj[]。我可以想象，用链表代替 Adj[] 数组，用二叉搜索树代替 List[Edge]。二叉搜索树将有助于 O(log N) 插入和删除节点。这是非常理想的，因为在 List 实现中，加法的复杂度为 O(1)，删除的复杂度为 O(N)，而当您处理数百万条边时，这是令人望而却步的。

这里要注意的最后一点是，在您实际开始导航之前，“没有”图表。由于可能有数百万用户，因此为每个人维护一张巨大的图是没有意义的（仅由于内存空间需求，这是不可能的）。我想当您统计导航过程时，会为您创建一个图表。很明显，由于您从位置 A 开始并转到位置 B（以及之后可能的其他位置），因此专门为您创建的图不应占用大量内存（假设流架构已到位）。

Ans 2）这是一个非常有趣的问题。解决这个问题最基本的算法是 Dijkstra 路径查找算法。存在更快的变体，例如 A*。我认为 Dijkstra 足够快，如果它可以与上面讨论的流架构正常工作的话。 Dijkstra 使用与 V 成比例的空间和与 E lg V 成比例的时间，这是非常好的图形，特别是对于稀疏图。请记住，如果流式架构尚未确定，V 和 E 将爆炸，Dijkstra 的空间和运行时要求将使其令人望而却步。

Ans 1) 流媒体问题：不要将此问题与上面讨论的流媒体架构混淆。这基本上是在问如何实现无缝缩放。

实现此目的的一个很好的算法是四叉树算法（您可以将其推广到 n 树）。您可以将较粗糙的图像存储在树的较高位置，并在向下遍历树时存储更高分辨率的图像。这实际上就是 KML（Keyhole）通过其映射算法所做的事情。 Keyhole 是一家公司，多年前就与 NVIDIA 合作开发了第一个类似“Google Earth”的软件。

四叉树剔除的灵感来自现代 3D 游戏，它用于快速剔除场景中不在视锥体中的部分。

为了进一步澄清这一点，想象一下您正在从高处查看美国地图。在此级别，您基本上将地图分为 4 个部分，并使每个部分成为四叉树的子级。

现在，当您放大时，您会放大其中一个部分（很明显，您可以在中心右侧放大，这样您的缩放实际上会触及所有 4 个部分，但为了简单起见，假设您放大了其中一个部分）部分）。因此，当您放大某个部分时，您会遍历该部分的 4 个子部分。这 4 个子项包含其父项的更高分辨率数据。然后，您可以继续缩小，直到看到一组叶子，其中包含最高分辨率的数据。为了实现从一种分辨率到下一种分辨率的“无缝”跳跃，可以利用模糊和淡入淡出效果的组合。

作为这篇文章的后续内容，我将尝试添加我在此处提出的许多概念的链接。

There are two parts to your question:

What kind of data structure is used to store the map information.
What kind of algorithm is used to "navigate" from source to destination.

To this, I would add another question:

How is Google/Bing able to "stream in" the data. So for example, you are able to zoom in from miles up to the ground level seamlessly, all the while maintaining the coordinate system.

I will attempt to address each question in order. Do note, that I do not work for the Google Maps or the Bing team, so quite obviously, this information might not be completely accurate. I am basing this off of the knowledge gained from a good CS course about data structures and algorithms.

Ans 1) The map is stored in an Edge Weighted Directed Graph. Locations on the map are Vertices and the path from one location to another (from one vertex to another) are the Edges.

Quite obviously, since there can be millions of vertices and an order of magnitude more edges, the really interesting thing would be the representation of this Edge Weighted Digraph.

I would say that this would be represented by some kind of Adjacency List and the reason I say so is because, if you imagine a map, it is essentially a sparse graph. There are only a few ways to get from one location to another. Think about your house! How many roads (edges in our case) lead to it? Adjacency Lists are good for representing sparse graphs, and adjacency matrix is good for representing dense graphs.

Of course, even though we are able to efficiently represent sparse graphs in memory, given the sheer number of Vertices and Edges, it would be impossible to store everything in memory at once. Hence, I would imagine some kind of a streaming library underneath.

To create an analogy for this, if you have ever played an open-world game like World of Warcraft / Syrim / GTA, you will observe that to a large part, there is no loading screen. But quite obviously, it is impossible to fit everything into memory at once. Thus using a combination of quad-trees and frustum culling algorithms, these games are able to dynamically load resources (terrain, sprites, meshes etc).

I would imagine something similar, but for Graphs. I have not put a lot of thought into this particular aspect, but to cook up a very basic system, one can imagine an in memory database, which they query and add/remove vertices and edges from the graph at run-time as needed. This brings us to another interesting point. Since vertices and edges need to be removed and added at run-time, the classic implementation of Adjacency List will not cut it.

In a classic implementation, we simply store a List (a Vector in Java) in each element of an array: Adj[]. I would imagine, a linked list in place of the Adj[] array and a binary search tree in place of List[Edge]. The binary search tree would facilitate O(log N) insertion and removal of nodes. This is extremely desirable since in the List implementation, while addition is O(1), removal is O(N) and when you are dealing with millions of edges, this is prohibitive.

A final point to note here is that until you actually start the navigation, there is "no" graph. Since there can be million of users, it doesn't make sense to maintain one giant graph for everybody (this would be impossible due to memory space requirement alone). I would imagine that as you stat the navigation process, a graph is created for you. Quite obviously, since you start from location A and go to location B (and possibly other locations after that), the graph created just for you should not take up a very large amount of memory (provided the streaming architecture is in place).

Ans 2) This is a very interesting question. The most basic algorithm for solving this problem would be Dijkstra Path Finding algorithm. Faster variations such as A* exist. I would imagine Dijkstra to be fast enough, if it could work properly with the streaming architecture discussed above. Dijkstra uses space proportional to V and time proportional to E lg V, which are very good figures, especially for sparse graphs. Do keep in mind, if the streaming architecture has not been nailed down, V and E will explode and the space and run-time requirements of Dijkstra will make it prohibitive.

Ans 1) Streaming question: Do not confuse this question with the streaming architecture discussed above. This is basically asking how the seamless zoom is achieved.

A good algorithm for achieving this is the Quad Tree algorithm (you can generalize this to n-tree). You store coarser images higher up in the tree and higher resolution images as you traverse down the tree. This is actually what KML (Keyhole) did with its mapping algorithm. Keyhole was a company that partnered with NVIDIA many years back to produce one of the first "Google Earth" like softwares.

The inspiration for Quad Tree culling, comes from modern 3D games, where it is used to quickly cull away parts of the scene which is not in the view frustum.

To further clarify this, imagine that you are looking at the map of USA from really high up. At this level, you basically split the map into 4 sections and make each section a child of the Quad Tree.

Now, as you zoom in, you zoom in on one of the sections (quite obviously you can zoom right in the center, so that your zoom actually touches all 4 sections, but for simplicity's sake, lets say you zoom in on one of the sections). So when you zoom in to one section, you traverse the 4 children of that section. These 4 children contain higher resolution data of its parent. You can then continue to zoom down till you hit a set of leaves, which contain the highest resolution data. To make the jump from one resolution to the next "seamless" a combination of blur and fading effects can be utilized.

As a follow-up to this post, I will try to add links to many of the concepts I put in here.

回复收藏 0 原文