Sphinx - 分隔符

发布于 2024-10-07 20:19:22 字数 466 浏览 11 评论 0原文

我想知道 Sphinx 引擎是否可以使用任何分隔符（例如普通 MySQL 中的逗号和句点）。我的问题来自于一种冲动，根本不使用它们，而是逃避它们，或者至少在使用全文搜索执行 MATCH 操作时它们不会发生冲突，因为默认情况下我在 MySQL 中处理它们时遇到问题，并且我不希望被迫用任何其他字符替换这些分隔符来提供一组好的结果。

抱歉，如果我说了一些愚蠢的话，但我没有使用 Sphinx 或其他补充（？）搜索引擎的经验。

举个例子，如果我

"Passat 2.0 TDI"

默认使用 MySQL 执行搜索，则会将这种情况下的句点识别为分隔符，并且由于“2”和“0”太短，默认情况下无法被视为单词，因此结果将是有点乱。

使用 Sphinx（或其他搜索引擎）是否容易处理？我愿意接受建议。

这是一个大型项目，可能有超过 500.000 条可能的记录（一点也不简单）。

干杯!

原文

I would like to know if the Sphinx engine works with any delimiters (like commas and periods in normal MySQL). My question comes from the urge, not to use them at all, but to escape them or at least thay they don't enter in conflict when performing MATCH operations with FULLTEXT searches, since I have problems dealing with them in MySQL by default and I would prefer not to be forced to replace those delimiters by any other characters to provide a good set of results.

Sorry if I'm saying something stupid, but I don't have experience with Sphinx or other complementary (?) search engines.

To give you an example, if I perform a search with

"Passat 2.0 TDI"

MySQL by default would identify the period in this case as a delimiter and since the "2" and "0" are too short to be considered words by default, the results would be a bit messed up.

Is it easy to handle with Sphinx (or other search engine)? I'm open to suggestions.

This is for a large project, with probably more than 500.000 possible records (not trivial at all).

Cheers!

分享到QQ

分享到微博