使用正则表达式匹配并替换
有一个字符串a的列表,这是与字符串B的另一个列表匹配的方法。我想使用正则表达式将字符串A替换为匹配字符串B列表。但是我没有得到正确的结果。
解决方案应为 a == [“ Yogesh”,“ Numita”,“ Hero”,“ Yogesh”]
。
import re
A = ["yogeshgovindan","TNumita","Herohonda","Yogeshkumar"]
B=["Yogesh","Numita","Hero"]
for i in A:
for j in B:
replaced=re.sub('i','j',i)
print(replaced)
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
这对我有用:
如果在列表中找到列表B中的元素。有必要比较较低的单词。添加了
[0]
以获取字符串而不是从理解列表中列表。this one works to me:
This returns element from list B if it is found at A list. It's necessary to compare lowercased words. The
[0]
is added for getting string instead of list from comprehension list.如果循环
B
,则不需要正则表达式;您可以简单地使用成员资格测试。正则表达式可能会导致更好的性能,因为会员资格测试将为
a
中的每个字符串扫描b
中的每个字符串,从而导致o(len(a) * len (b)性能)
。只要单个术语不包含任何metacharacter,并且可以在任何上下文中出现,则形成正则表达式的最简单方法就是将
b
的条目与交替操作:,为了安全起见,应首先逃脱条目
但是 对术语出现的上下文中的任何限制,都需要预先限制并将其附加到模式中。例如,如果该术语必须以完整的词显示:
可以将此正则表达式应用于
a
的每个项目以获取匹配的文本:由于REGEX中的术语是直字符串匹配,因此内容将为正确,但情况可能不正确。可以通过归一化步骤来纠正这一点,将匹配的文本通过
dict
:一个问题仍然存在:如果
a
的项目不匹配该怎么办?然后reterms.search
返回none
,它没有group
属性。如果none
-propagating属性访问添加到Python(例如),使用这种功能很容易解决:在没有此类功能的情况下,有多种方法,例如使用三元表达式和 Walrus分配。在下面的示例中,列表用作备用,以提供比赛的默认值:
If looping over
B
, you don't need a regular expression; you can simply use membership testing.A regex might result in better performance, as membership testing will scan each string in
A
for every string inB
, resulting inO(len(A) * len(B) performance)
.As long as the individual terms don't contain any metacharacters and can appear in any context, the simplest way to form the regex is to join the entries of
B
with the alternation operation:However, to be safe, the entries should first be escaped, in case any contains a metacharacter:
If there is any restrictions on the context the terms appear in, sub-patterns for the restrictions would need to be prepended and appended to the pattern. For example, if the terms must appear as full words:
This regex can be applied to each item of
A
to get the matching text:Since the terms in the regex are straight string matches, the content will be correct, but the case may not. This could be corrected by a normalization step, passing the matched text through a
dict
:One issue remains: what if an item of
A
doesn't match? ThenreTerms.search
returnsNone
, which doesn't have agroup
attribute. IfNone
-propagating attribute access is added to Python (such as suggested by PEP 505), this would be easily addressed by using such:In the absence of such a feature, there are various approaches, such as using a ternary expression and walrus assignment. In the sample below, a list is used as a stand-in to provide a default value for the match: