How to remove only meaningless words from a sentence using NLTK or spaCy in Python. Example: fvbfxcfbs, #$%$#$$ ## $, sdhsjkbnfsgfdf
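import nltk

# The word list must be available locally: run nltk.download('words') once.
words = set(nltk.corpus.words.words())
sent = 'my name is vallem balu fvbfxcfbs gvcx 2124326456 4554 @#@%$@!%^$#@%$ from Andhra Pradesh i was working software engineer'
# Keep only letters, digits and whitespace; this strips tokens such as @#@%$@!%^$#@%$.
data = ''.join(e for e in sent if e.isalnum() or e.isspace())
print(data)
# Keep a token if it is in the word list or is not purely alphabetic (e.g. numbers).
sent = " ".join(w for w in nltk.wordpunct_tokenize(data)
                if w.lower() in words or not w.isalpha())
print(sent)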
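Output:
my name is vallem balu fvbfxcfbs gvcx 2124326456 4554 from Andhra Pradesh i was working software engineer
my name is balu 2124326456 4554 from i was working engineer
How can I get the full sentence below instead?
my name is vallem balu 2124326456 4554 from Andhra Pradesh i was working software engineer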
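One possible direction, offered only as a rough sketch: the dictionary lookup drops names such as "vallem" and "Andhra Pradesh" because they are not in the NLTK word list, so a pronounceability heuristic can be tried in place of (or alongside) that lookup. The helper names `looks_like_word` and `drop_gibberish`, the `max_consonant_run` threshold, and the `known_words` parameter below are all hypothetical and not part of NLTK or spaCy. The assumption is that gibberish tokens either contain no vowels or contain long consonant runs.

```python
import re

VOWELS = set("aeiouy")  # assumption: 'y' counts as a vowel for this heuristic


def looks_like_word(token, max_consonant_run=4):
    """Heuristic check: alphabetic tokens need a vowel and no long consonant run."""
    if not token.isalpha():
        return True  # keep numbers such as 2124326456
    lower = token.lower()
    if not any(ch in VOWELS for ch in lower):
        return False  # e.g. fvbfxcfbs, gvcx
    # Splitting on vowel groups leaves the consonant runs between them.
    longest_run = max(len(run) for run in re.split(r"[aeiouy]+", lower))
    return longest_run <= max_consonant_run


def drop_gibberish(sentence, known_words=frozenset()):
    """Strip symbols, then keep tokens that are known or look pronounceable."""
    # Same symbol-stripping step as in the question's snippet.
    cleaned = "".join(ch for ch in sentence if ch.isalnum() or ch.isspace())
    kept = [t for t in cleaned.split()
            if t.lower() in known_words or looks_like_word(t)]
    return " ".join(kept)


sent = ('my name is vallem balu fvbfxcfbs gvcx 2124326456 4554 @#@%$@!%^$#@%$ '
        'from Andhra Pradesh i was working software engineer')
result = drop_gibberish(sent)
print(result)
```

On the sentence from the question this keeps "vallem", "Andhra", "Pradesh" and "software" while dropping "fvbfxcfbs" and "gvcx". It is only a heuristic: random strings that happen to contain vowels will pass, and real vowel-less tokens (abbreviations, for instance) will be dropped unless they are supplied through `known_words`, for example the lowercased NLTK word set from the snippet above.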