如何在 MySQL 中使用别名对城市进行建模
给定的位置(城市)可以有一个名称以及其他可以识别该位置的别名。我需要在数据库中对此进行建模。
可以使用城市或其任何别名执行搜索:
例如
城市:
- 名称:洛杉矶
- 别名:LA
指定搜索条件时,我可以指定 LA 或洛杉矶,它应该返回相同的结果(例如好莱坞) 。
我正在考虑使用一对多关系来做到这一点,其中 1 个城市可以有多个别名,并且许多别名可以映射到一个城市。
在进行搜索时,我使用城市和城市别名表的联接来查找正确的城市。
有更好的方法来处理这个问题吗?
编辑: (对于遇到相同问题/要求并碰巧来到此页面的任何人) 请参阅我的答案,因为我最终使用了它,但标记的答案可以帮助您识别独特的城市。
A given location (city), can have a name and also other aliases by which it can be known. I need to model this in a database.
Search can be executed using either city or any of its alias:
For e.g.
City:
- name: Los Angeles
- alias: LA
When specifying the search criterion, I can either specify LA or Los Angeles, and it should return the same result (e.g. Hollywood).
I was thinking of doing it using One-To-Many relation where 1 city can have many aliases, and many aliases can map to one city.
When doing a search, I use a join of city and cityAlias table to find the correct city.
Is there a better way to deal with this?
EDIT:
(For anyone who runs into the same problem/requirements and happen to comes to this page)
Please see my answer as well, since I ended up using that, but the marked answer helps you identify unique cities.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
我可以添加到您的解决方案中的唯一一件事是,您可以尝试首先在城市表中查找完全匹配的内容,如果没有,则加入别名。这样您就可以跳过一些非常昂贵的连接。
另一件需要注意的事情是,这个双表解决方案可能会遇到重复条目的问题。我不是在谈论不同城市的相同别名(这可以使用唯一列进行检查),而是与城市名称匹配的别名。这些“重复条目”的示例,后跟详细说明:
城市
别名
我知道这不应该发生......但你知道摩尔定律:)这些跨表重复可能会给您在查找表中带来麻烦(我想您正在使用它们作为查找来“猜测”当他/她写下“LA”时,城市实际上试图选择什么用户)。但如果用户写了“洛杉矶”,您就必须决定是优先考虑城市还是别名。我知道我提供的例子有点愚蠢,但作为非美国公民,我无法提供更好的例子。但是有很多城市,每个城市都有很多别名...我不会冒险:)
首先检查城市表将使该城市优先于其他城市的同等命名别名。或者,您可以在插入之前检查城市别名是否已作为城市名称存在。
这就是我能想到的:)
The only thing I can add to your solution is that you can try first looking for the exact match in the city tables and if there isn't any, then joining with the alias one. That way you might skip some joins that are quite expensive.
Another thing to notice is that this double table solution might have trouble with duplicated entries. I'm not talking about same aliases for different cities (this can be checked with a unique column), but aliases matching city names. Example of these "duplicate entries" followed by detailed explanation:
Cities
Aliases
I know this should not happen... but you know Moore's laws :) These cross-table-duplicates might give you trouble in a lookup table (I imagine you're using them as a look up to "guess" what City has actually tried to select the user when he/she wrote "LA"). But if the user wrote "Los Angeles", you'll have to decide whether to prioritize the City or the Alias. I know the example I provided is a bit silly but as a non-american citizen I can't provide better examples. But there are many cities out there with many aliases for each one... I wouldn't take a chance :)
Checking first the city table will give the city priority over an equaled named alias for other city. Or you can check whether an alias to a city is already present as a city name before inserting it.
That's all I can think of :)
一些注意事项:
DestinationAlias
表不需要代理键。(idDestination, alias)
(或相反)可以用作PRIMARY KEY
。要消除两个表中的(公共)名称重复以及可能出现的问题,您可以从
Destination
表中删除name
列并添加DestinationDefaultAlias
表,与DestinationAlias
具有1:1
关系(以及与Destination< 隐含的
1:1
关系) /代码>):当您想要查找默认名称时,请将
Destination
与DestinationDefaultAlias
连接起来。要搜索所有别名,请使用DestinationAlias
加入。Some notes:
The
DestinationAlias
table does not need a surrogate key. The(idDestination, alias)
(or the reverse) can serve as aPRIMARY KEY
.To eliminate the duplication of the (common) names in both tables and the problems that might occur, you can remove the
name
column from theDestination
table and add aDestinationDefaultAlias
table, having a1:1
relationship withDestinationAlias
(and an implied1:1
relationship withDestination
):When you want to find the default name, you join
Destination
withDestinationDefaultAlias
. To search all aliases, you join withDestinationAlias
.假设:
洛杉矶是纽约的别名
对于添加的每个城市,添加城市名称本身作为别名表的条目,因此我们只需搜索别名表。
城市表:
别名表:
用例 1:
搜索 LA:产量 (cityID) => [1, 3] =独特> [洛杉矶,La Aguardia]
用例 2:
搜索洛杉矶:产量 (cityID) => [1, 2] =独特> [洛杉矶、纽约]
用例3:
搜索纽约:yields (cityID) => [2,2]=独特> [纽约]
Assumption:
Los Angeles is an alias of New York
For every city added, add the city name itself as an entry to the alias table, so we only have to search on alias table.
City table:
Alias table:
Use case 1:
Search for LA: yields (cityID) => [1, 3] =unique> [Los Angeles, La Aguardia]
Use case 2:
Search for Los Angeles: yields (cityID) => [1, 2] =unique> [Los Angeles, New York]
Use case 3:
Search for New York: yields (cityID) => [2,2] =unique> [New York]