当前位置：文江博客话题详情

从街道地址中删除街道号码

发布于 2024-07-25 17:29:03 字数 250 浏览 11 评论 0原文

使用 Ruby (newb) 和正则表达式，我尝试从街道地址解析街道号码。我在简单的问题上没有遇到麻烦，但我需要一些帮助：

“6223 1/2 S Figueroa ST”==> 'S Figueroa ST'

感谢您的帮助！

更新：

'6223 1/2 2ND ST'==> “2ND ST”

来自@pesto “贝克街 221B 号”==> '贝克街'

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

阳光下慵懒的猫 2024-08-01 17:29:03

这将删除字符串前面的所有内容，直到遇到字母：

street_name = address.gsub(/^[^a-zA-Z]*/, '')

如果可能有“221B Baker Street”之类的内容，那么您必须使用更复杂的内容。这应该有效：

street_name = address.gsub(/^((\d[a-zA-Z])|[^a-zA-Z])*/, '')

This will strip anything at the front of the string until it hits a letter:

street_name = address.gsub(/^[^a-zA-Z]*/, '')

If it's possible to have something like "221B Baker Street", then you have to use something more complex. This should work:

street_name = address.gsub(/^((\d[a-zA-Z])|[^a-zA-Z])*/, '')

回复收藏 0 原文

雪花飘飘的天空 2024-08-01 17:29:03

组匹配：

.*\d\s(.*)

如果您还需要考虑公寓号码：

.*\d.*?\s(.*)

这将处理 123A 街道名称

，只要字符串中没有其他数字，就应该去除前面的数字（和空格）。只需捕获第一组 (.*)

Group matching:

.*\d\s(.*)

If you need to also take into account apartment numbers:

.*\d.*?\s(.*)

Which would take care of 123A Street Name

That should strip the numbers at the front (and the space) so long as there are no other numbers in the string. Just capture the first group (.*)

回复收藏 0 原文

帅哥哥的热头脑 2024-08-01 17:29:03

stackoverflow 上还有另外一组答案：
解析可用街道地址、城市、州、邮政编码string

我认为谷歌/雅虎解码器方法是最好的，但取决于你谈论的频率/地址数量 - 否则所选的答案可能是最好的

回复收藏 0 原文

坐在坟头思考人生 2024-08-01 17:29:03

街道名称也可以是数字吗？例如

1234 45TH ST

，

1234 45 ST

您可以处理上面的第一种情况，但第二种情况很困难。

我会按空格分割地址，跳过任何不包含字母的前导部分，然后加入其余部分。我不了解 Ruby，但这里有一个 Perl 示例，它也突出了我的方法的问题：

#!/usr/bin/perl

use strict;
use warnings;

my @addrs = (
    '6223 1/2 S FIGUEROA ST',
    '1234 45TH ST',
    '1234 45 ST',
);

for my $addr ( @addrs ) {
    my @parts = split / /, $addr;

    while ( @parts ) {
        my $part = shift @parts;
        if ( $part =~ /[A-Z]/ ) {
            print join(' ', $part, @parts), "\n";
            last;
        }
    }
}

C:\Temp> skip
S FIGUEROA ST
45TH ST
ST

Can street names be numbers as well? E.g.

1234 45TH ST

or even

1234 45 ST

You could deal with the first case above, but the second is difficult.

I would split the address on spaces, skip any leading components that do not contain a letter and then join the remainder. I do not know Ruby, but here is a Perl example which also highlights the problem with my approach:

#!/usr/bin/perl

use strict;
use warnings;

my @addrs = (
    '6223 1/2 S FIGUEROA ST',
    '1234 45TH ST',
    '1234 45 ST',
);

for my $addr ( @addrs ) {
    my @parts = split / /, $addr;

    while ( @parts ) {
        my $part = shift @parts;
        if ( $part =~ /[A-Z]/ ) {
            print join(' ', $part, @parts), "\n";
            last;
        }
    }
}

C:\Temp> skip
S FIGUEROA ST
45TH ST
ST

回复收藏 0 原文