当前位置：文江博客话题详情

Python regex syntax python-re

正则表达输出的问题

发布于 2025-02-09 04:10:48 字数 608 浏览 1 评论 0原文

我正在使用以下代码从文本文件中滤除电话号码。

import re

pattern = (r'\d*-\d*-\d*')
names = (r'\w*')

with open('records.txt', 'r') as f:
    for line in f:
        matches = re.findall(pattern, line)
        namesMatch = re.findall(names, line) 
        if matches:
            print(matches)
        elif namesMatch:
            print(namesMatch + ":")`

我将获得以下输出：

['John']
['222-333-4447']
['423-444-5678']
['123-455-1223']
['Paul']
['423-444-5778']

我想要类似的输出，但以字符串格式如下：

John:
222-333-4447
423-444-5678
123-455-1223
Paul:
423-444-5778

I am using the following code to filter out phone numbers from a text file.

import re

pattern = (r'\d*-\d*-\d*')
names = (r'\w*')

with open('records.txt', 'r') as f:
    for line in f:
        matches = re.findall(pattern, line)
        namesMatch = re.findall(names, line) 
        if matches:
            print(matches)
        elif namesMatch:
            print(namesMatch + ":")`

I am getting the following output:

['John']
['222-333-4447']
['423-444-5678']
['123-455-1223']
['Paul']
['423-444-5778']

I want a similar output but in string format like below:

John:
222-333-4447
423-444-5678
123-455-1223
Paul:
423-444-5778

收藏 0

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

评论（1）

逆光下的微笑 2025-02-16 04:10:48

您可以使用

import re
text = """John................. .............222-333-4447....... .. 423-444-5678 .... .................123-455-1223............... ........Paul... ......423-444-5778....................................... – 
Nithin"""
pattern=r"(\w+)((?:[\s:.]*\d+[-—–]\d+[-—–]\d+)+)"
print( [(x, " ".join(y.replace('.','').split())) for x,y in re.findall(pattern, text)] )

输出：

[('John', '222-333-4447 423-444-5678 123-455-1223'), ('Paul', '423-444-5778')]

请参阅 python demo 。

详细信息：

（\ w+） - 组1：一个或多个单词chars
（（？：[\ s：。]*\ d+[ - ] \ d+[ - - ] \ d+）+）+） - 第1组：以下模式序列发生一个或多个出现
- [\ s：。]* - 零或更多whitespaces，colons and dots
- \ d+ - 一个或多个数字
- [ - - - ] - 连字符或dash
- \ d+[ - - - ] \ d+ - 一个或多个数字，dash，一个或多个数字。

You can use

import re
text = """John................. .............222-333-4447....... .. 423-444-5678 .... .................123-455-1223............... ........Paul... ......423-444-5778....................................... – 
Nithin"""
pattern=r"(\w+)((?:[\s:.]*\d+[-—–]\d+[-—–]\d+)+)"
print( [(x, " ".join(y.replace('.','').split())) for x,y in re.findall(pattern, text)] )

Output:

[('John', '222-333-4447 423-444-5678 123-455-1223'), ('Paul', '423-444-5778')]

See the Python demo.

Details:

(\w+) - Group 1: one or more word chars
((?:[\s:.]*\d+[-—–]\d+[-—–]\d+)+) - Group 1: one or more occurrences of the following pattern sequence
- [\s:.]* - zero or more whitespaces, colons and dots
- \d+ - one or more digits
- [-—–] - a hyphen or dash
- \d+[-—–]\d+ - one or more digits, a dash, one or more digits.

回复收藏 0 原文

~没有更多了~

关于作者

暂无简介

文章

评论

27 人气

关注发私信

相关话题

热门标签

操作系统程序设计 IT运维 Linux系统管理 JavaScript 服务器应用 solaris C/C++ PHP Shell BSD Vue.js aix Oracle Python HTML 系统管理 HTML5 CSS 前端

推荐作者

alipaysp_giMRmwQ3mK

文章 0 评论 0

爱她像谁

文章 0 评论 0

清风疏影

文章 0 评论 0

mb_OO8gCSDD

文章 0 评论 0

佚名

文章 0 评论 0

汹涌人海

文章 0 评论 0

友情链接

我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的隐私政策了解更多相关信息。单击 接受 或继续使用网站，即表示您同意使用 Cookies 和您的相关数据。

原文