当前位置：文江博客话题详情

如何在 swi-prolog 中分割句子

发布于 2024-09-27 22:55:13 字数 223 浏览 17 评论 0原文

我正在尝试在 win xp 中使用 SWI-Prolog。我试图了解如何将 Prolog 中的句子分成单独的原子。

例如：假设我有一个这样的句子：

“这是一个字符串”
有什么方法可以将单个单词存储在变量中吗？

就像：

X = 这个
Y = 是
....
等等。

谁能解释一下这是如何工作的吗？

谢谢。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

最丧也最甜 2024-10-04 22:55:13

我会使用atomic_list_concat/3。请参阅

http://www.swi-prolog.org/pldoc/man? predicate=atomic_list_concat%2F3

通常它是为了插入一个分隔符，但由于 Prolog 统一的双向性，它也可以用来分割一个字符串给定分隔符：

atomic_list_concat(L,' ', 'This is a string').
L = ['This',is,a,string]

当然一旦分割完成后你可以使用列表 L 的元素。

I would use atomic_list_concat/3. See

http://www.swi-prolog.org/pldoc/man?predicate=atomic_list_concat%2F3

Normally it is meant to insert a separator but because of Prolog's bidirectionality of unification, it can also be used to split a string given the separator:

atomic_list_concat(L,' ', 'This is a string').
L = ['This',is,a,string]

Of course once the split is done you can play with the elements of the list L.

回复收藏 0 原文

知你几分 2024-10-04 22:55:13

我喜欢“pat fats”的答案，但你必须先将字符串转换为原子：

...,atom_codes(Atom, String),atomic_list_concat(L, ' ', Atom), ...

如果您需要直接使用字符串，我的“武器库”中有此代码：

%%  split input on Sep
%
%   minimal implementation
%
splitter(Sep, [Chunk|R]) -->
    string(Chunk),
    (   Sep -> !, splitter(Sep, R)
    ;   [], {R = []}
    ).

作为 DCG，必须以这种方式调用：

?- phrase(splitter(" ", L), "this is a string"), maplist(atom_codes, As, L).
L = [[116, 104, 105, 115], [105, 115], [97], [115, 116, 114, 105, 110|...]],
As = [this, is, a, string] .

编辑： 更多说明

我忘了解释它是如何工作的：@larsman 在这个其他答案中很好地解释了 DCG。我引用他的话

-->，实际上向其中添加了两个隐藏参数。第一个是要由语法规则解析的列表；第二个是解析后的“剩下的”。 c(F,X,[]) 在列表 X 上调用 c 以获得结果 F，期望留下 []，即解析器应该消耗整个列表 X。

这里我有 2 个参数，第一个是分隔符，第二个是正在构建的列表。内置字符串//1来自SWI-Prolog库(http/dcg_basics)。这是一个非常方便的构建块，几乎可以匹配回溯中的任何内容。这里它“吃掉”分隔符或字符串结尾之前的每个字符。完成后，我们可以递归......

I like the answer of 'pat fats', but you have to convert your string to atom before:

..., atom_codes(Atom, String), atomic_list_concat(L, ' ', Atom), ...

If you need to work directly with strings, I have this code in my 'arsenal':

%%  split input on Sep
%
%   minimal implementation
%
splitter(Sep, [Chunk|R]) -->
    string(Chunk),
    (   Sep -> !, splitter(Sep, R)
    ;   [], {R = []}
    ).

being a DCG, must be called in this way:

?- phrase(splitter(" ", L), "this is a string"), maplist(atom_codes, As, L).
L = [[116, 104, 105, 115], [105, 115], [97], [115, 116, 114, 105, 110|...]],
As = [this, is, a, string] .

edit: more explanation

I forgot to explain how that works: DCG are well explained by @larsman, in this other answer. I cite him

-->, which actually adds two hidden arguments to it. The first of these is a list to be parsed by the grammar rule; the second is "what's left" after the parse. c(F,X,[]) calls c on the list X to obtain a result F, expecting [] to be left, i.e. the parser should consume the entire list X.

Here I have 2 arguments, the first it's the separator, the second the list being built. The builtin string//1 come from SWI-Prolog library(http/dcg_basics). It's a very handy building block, that match literally anything on backtracking. Here it's 'eating' each char before the separator or the end-of-string. Having done that, we can recurse...

回复收藏 0 原文